
bamstats not working with Ensembl GTF (with "gene_biotype" attribute) #31

Open
usergsc opened this issue Dec 2, 2020 · 2 comments

Comments

@usergsc

usergsc commented Dec 2, 2020

Ensembl GTFs use the "gene_biotype" attribute instead of "gene_type". See, for example, the files under
ftp://ftp.ensembl.org/pub/release-102/gtf/homo_sapiens.

This affects the calculation of fraction_rrna in the bamstats program, if nothing else.

@emi80
Member

emi80 commented Dec 3, 2020

Hi @usergsc,

thanks for reporting this.

A possible fix would be to check a list of attributes when detecting rRNA genes in the annotation. The list would include both gene_type and gene_biotype, in that order, and the first one present in each entry would be used.
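
As a rough illustration of that idea (a Python sketch only, not the actual bamstats code), the lookup could try each attribute name in order and return the first one found. The function names, the regular expression, and the set of biotypes treated as rRNA are all assumptions made for this example.

```python
import re

# Attribute keys to try, in priority order: gene_type first, then gene_biotype.
BIOTYPE_KEYS = ("gene_type", "gene_biotype")

# Biotype values counted as rRNA here; this set is an assumption for the sketch.
RRNA_BIOTYPES = {"rRNA", "Mt_rRNA"}

# Matches `key "value";` pairs in the ninth (attribute) column of a GTF line.
# Unquoted values such as `level 2;` are skipped by this pattern.
_ATTR_RE = re.compile(r'(\S+)\s+"([^"]*)"\s*;')


def parse_attributes(column9):
    """Return quoted GTF attributes as a dict; the first value wins for repeated keys."""
    attrs = {}
    for key, value in _ATTR_RE.findall(column9):
        attrs.setdefault(key, value)
    return attrs


def gene_biotype(column9, keys=BIOTYPE_KEYS):
    """Return the value of the first attribute in `keys` that is present, else None."""
    attrs = parse_attributes(column9)
    for key in keys:
        if key in attrs:
            return attrs[key]
    return None


def is_rrna(column9):
    """True if the entry's biotype (under either attribute name) is in RRNA_BIOTYPES."""
    return gene_biotype(column9) in RRNA_BIOTYPES
```

With this approach an entry carrying only gene_biotype is still recognized, while an entry that has both attributes keeps the gene_type value.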

Best,
Emilio

@usergsc
Author

usergsc commented Dec 3, 2020

Hi Emilio,

It would be helpful if the expected GTF attributes were listed in the README file, so that users are aware of the requirements and can update their GTF files if necessary.
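
In the meantime, one way to update an Ensembl GTF might be to rename the attribute before running bamstats. The following is a minimal Python sketch under that assumption; the function name is hypothetical, and it only rewrites the attribute key in the ninth column, leaving comment lines and malformed lines untouched.

```python
import re

# Matches the attribute key `gene_biotype` when it is followed by a quoted value.
_BIOTYPE_KEY_RE = re.compile(r'\bgene_biotype(?=\s+")')


def rename_gene_biotype(line):
    """Rename gene_biotype to gene_type in the attribute column of one GTF line."""
    if line.startswith("#"):
        return line  # header/comment lines are passed through unchanged
    newline = "\n" if line.endswith("\n") else ""
    fields = line.rstrip("\n").split("\t")
    if len(fields) != 9:
        return line  # not a standard 9-column GTF record; leave as-is
    fields[8] = _BIOTYPE_KEY_RE.sub("gene_type", fields[8])
    return "\t".join(fields) + newline
```

Applied line by line to the input file, this would produce a GTF whose entries carry gene_type while everything else stays the same.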
