-
Notifications
You must be signed in to change notification settings - Fork 30
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feature names hardcoded in extract_annotations_from_gff.py #25
Labels
bug
Something isn't working
Comments
Hi @mohammedkhalfan, |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Description of the bug
I'm not sure if this is a bug or by design. There are parameters to override the default feature names in the host and pathogen annotations, but I'm not sure if they are being used in all parts of the pipeline. For instance, I get failures when running the step which calls extract_annotations_from_gff.py because it cannot find any features. When I look in the script I can see that 'gene_type', 'gene_id', 'gene_name', and 'transcript_name' are hardcoded, and thus the parameters which I specify to override these feature names are not used in this script. If I manually edit this file to use the corresponding feature names, it works.
Steps to reproduce
Use a reference annotation from Ensembl such as Saccharomyces_cerevisiae.R64-1-1.34.gff3 or Schizosaccharomyces_pombe.ASM294v2.51.gff3 which does not contain the default feature names.
Expected behaviour
I expect the feature names I specify when running the pipeline to override the default names, but the names in this script are hardcoded.
The text was updated successfully, but these errors were encountered: