Add assignEcByOrthologs.pl: transitive EC assignment via ortholog groups #103

Open
jbrestel wants to merge 6 commits into master from orthomcl-ec

jbrestel commented Mar 4, 2026

Uses group co-membership and shared InterPro family profiles to assign EC numbers to unannotated proteins by majority vote within profile clusters.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
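
For reviewers, the majority-vote idea in the description can be sketched roughly as follows. This is an illustrative Python sketch, not the code in assignEcByOrthologs.pl. It assumes a "profile cluster" means proteins in the same ortholog group with an identical set of InterPro families, that only annotated members vote, and that a strict majority (more than 0.5) is required. The function name, tuple layout, and threshold are all hypothetical.

```python
from collections import Counter, defaultdict


def assign_ec_by_majority(proteins, threshold=0.5):
    """Return {protein_id: ec} for unannotated proteins (hypothetical helper).

    proteins: iterable of (protein_id, group_id, interpro_profile, ec_or_None).
    """
    # Assumption: a cluster is keyed by (ortholog group, exact InterPro profile).
    clusters = defaultdict(list)
    for pid, group, profile, ec in proteins:
        clusters[(group, frozenset(profile))].append((pid, ec))

    assigned = {}
    for members in clusters.values():
        # Only annotated members cast a vote.
        votes = Counter(ec for _, ec in members if ec is not None)
        if not votes:
            continue  # nothing to transfer in this cluster
        n_annotated = sum(votes.values())
        winner, count = votes.most_common(1)[0]
        # Assumption: the winning EC needs a strict majority of annotated members.
        if count / n_annotated <= threshold:
            continue
        for pid, ec in members:
            if ec is None:
                assigned[pid] = winner
    return assigned
```

With three annotated members voting 2:1 for one EC, the single unannotated member of that cluster would receive the winning EC, while a cluster with no annotated members is left untouched.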

jbrestel requested review from rdemko2332 and stubrown March 4, 2026 05:21

jbrestel and others added 5 commits March 4, 2026 00:48

When --verbose is on, print the reason each EC in a group was not
transitively assigned: either subsumed by a more specific EC during
hierarchy normalization, or below the majority-vote threshold.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
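
The two skip reasons named in the commit above could be derived along these lines. This is a hedged Python sketch only: it assumes that partial ECs use `-` for unspecified levels (for example `1.1.1.-`), that an EC is "subsumed" when another EC in the same group extends its specified prefix, and that the vote uses a strict 0.5 threshold. The function names and message wording are hypothetical, and the actual script may also fold subsumed counts into the more specific EC.

```python
def ec_prefix(ec):
    """Return the specified levels of an EC number, stopping at the first '-'."""
    parts = []
    for part in ec.split("."):
        if part == "-":
            break
        parts.append(part)
    return tuple(parts)


def skip_reasons(ec_counts, n_annotated, threshold=0.5):
    """Map each EC that would not be assigned to a human-readable reason."""
    prefixes = {ec: ec_prefix(ec) for ec in ec_counts}
    reasons = {}
    for ec, count in ec_counts.items():
        mine = prefixes[ec]
        # A more specific EC shares this EC's prefix and specifies more levels.
        more_specific = sorted(
            other for other, theirs in prefixes.items()
            if len(theirs) > len(mine) and theirs[:len(mine)] == mine
        )
        if more_specific:
            reasons[ec] = "subsumed by more specific EC " + more_specific[0]
        elif count / n_annotated <= threshold:
            reasons[ec] = f"below majority-vote threshold ({count}/{n_annotated})"
    return reasons
```

An EC that is neither subsumed nor below the threshold does not appear in the result, which matches the commit's intent of reporting only the ECs that were not assigned.
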
Add cluster_mean_length and length_vs_cluster_mean (fractional deviation
from cluster mean) columns, plus group-level counts: group_size,
n_annotated_in_group, n_supporting_in_group.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
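
As a quick illustration of the fractional deviation described in the commit above, here is a minimal Python sketch. It assumes the deviation is signed, computed as (length - cluster mean) / cluster mean, and that the cluster mean is taken over whatever lengths the caller passes in. Whether the script includes the protein itself in the mean, or reports an absolute value, is not stated in the commit message.

```python
from statistics import mean


def length_vs_cluster_mean(length, cluster_lengths):
    """Return (cluster_mean_length, fractional deviation of length from it)."""
    cluster_mean = mean(cluster_lengths)
    # Positive means longer than the cluster mean, negative means shorter.
    return cluster_mean, (length - cluster_mean) / cluster_mean
```

A 330-residue protein in a cluster whose mean length is 300 would get a deviation of 0.1 under this definition, and a 270-residue protein would get -0.1.
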
…format

- Read aa_sequence_id directly from file (no identifier lookup needed)
- Populate new cluster/group stat columns: num_supporting_cluster,
  num_protein_cluster, num_any_ec_cluster, num_supporting_group,
  num_protein_group, num_any_ec_group, length_mean
- Map confidence_score -> domain_score, length_vs_cluster_mean -> length_score
- Load all rows (both novel and already-annotated)
- Remove unused imports and aaSeqLocusTagMappingSql argument

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
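
The two column renames listed in the commit above amount to a small key mapping. A minimal Python sketch, assuming each input row is already parsed into a dict keyed by column name (the constant and function names are hypothetical, and any column not named in the commit is passed through unchanged):

```python
# Renames taken from the commit message; everything else is left as-is.
COLUMN_MAP = {
    "confidence_score": "domain_score",
    "length_vs_cluster_mean": "length_score",
}


def to_load_row(record):
    """Return a copy of record with the two score columns renamed."""
    return {COLUMN_MAP.get(key, key): value for key, value in record.items()}
```
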
Round domain_score and length_score to 4 decimal places, cast
length_mean to integer, and explicitly int() all count columns.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
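
The normalization in the last commit could look roughly like this. It is a Python sketch under assumptions: values may arrive as strings read from the file, the count columns are the six `num_*` columns listed in the earlier commit, and truncation toward zero is intended for `length_mean` (Python's `int()` on a float truncates toward zero). Python's `round()` may differ from a `sprintf`-style rounding on exact ties, so this is not a byte-for-byte equivalent of the script.

```python
COUNT_COLUMNS = (
    "num_supporting_cluster", "num_protein_cluster", "num_any_ec_cluster",
    "num_supporting_group", "num_protein_group", "num_any_ec_group",
)


def normalize_row(row):
    """Return a copy of row with scores rounded and counts cast to int."""
    out = dict(row)
    for col in ("domain_score", "length_score"):
        out[col] = round(float(out[col]), 4)
    # Go through float first so string inputs like "312.7" are accepted.
    out["length_mean"] = int(float(out["length_mean"]))
    for col in COUNT_COLUMNS:
        out[col] = int(float(out[col]))
    return out
```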