Conversation
|
|
I did some more tests and they seem to stay the same in a system, but change depending on CPU model, probably because of non-determinism in the FAISS index search or the ESM embedding computation, and not due to a container/environment difference. Likely causes (by the bot -I excluded one that didn't make sense for this case): |
| path("models/level0"), emit: level0 | ||
| path("models/level1"), emit: level1 | ||
| path("models/level2"), emit: level2 |
There was a problem hiding this comment.
I would put these all on one tuple, they are all related and can't be used in any other way (e.g. with .bam and .bai) - that way you don't have to do any .combine shenanigans in this case
There was a problem hiding this comment.
Also agreed. Coming up
New protein annotation software for CAZyme prediciton from amino acid sequences.
PR includes the setup module needed to download models from hugging face.
The bioconda recipe review process is slow, so for now, using a pip installation (env and containers through Seqera Containers).
This is the CPU version. Will probably create a separate GPU (dramatic speed increase) one after this gets merged.
PR checklist
topic: versions- See version_topicslabelnf-core modules test <MODULE> --profile dockernf-core modules test <MODULE> --profile singularitynf-core modules test <MODULE> --profile condanf-core subworkflows test <SUBWORKFLOW> --profile dockernf-core subworkflows test <SUBWORKFLOW> --profile singularitynf-core subworkflows test <SUBWORKFLOW> --profile conda