Releases: ncbi/egapx

v0.3.1-alpha

18 Nov 22:53
1057884

Bug fix release 0.3.1-alpha
Fixed issues:

  • Issues #44, #49: FTP access for ortho files ("Could not find path for ortho taxid")
  • Issue #47: incorrect temp directory for rnaseq_divide_by_strandedness
  • Issue #37: gnomon_training error

Release 0.3.0-alpha

05 Nov 22:17
f21ac06

New features integrated from RefSeq EGAP:

  • Ortholog analysis against a pre-defined reference species
  • Refinement of gene biotype (protein-coding, pseudogene, lncRNA) based on annotation and orthology properties
  • Assignment of gene symbols, names, and protein names based on orthology or comparison to SwissProt proteins
  • Better annotation of single-exon protein-coding genes based on well supported proteins
  • Automatic selection of organism symbol format, ortholog reference species, protein reference sets, maximum intron size, and some annotation-related parameters
  • Added target protein sets for plant clades and additional vertebrates
  • Integration of structural and functional annotation into final output, including: ASN.1, GFF, GTF, mRNA FASTA, CDS FASTA, protein FASTA

Execution improvements:

  • Added versioning for EGAPx (egapx.py runner, Docker/Singularity images)
  • Added check for user input files
  • Improved support for pre-download of reference files
  • Updated STAR to produce a CSI index instead of a BAI index, so that large sequences can be indexed
  • Increased time limit for chainer
  • Updated chunk size for miniprot tasks to 25k
  • Enabled skipping gnomon training when parameters from closely related taxa are available
  • Relocated Python requirements.txt to repo root
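
The switch from BAI to CSI indexing noted above matters because the BAI format cannot address positions beyond 2^29 bases (about 512 Mbp) on a single sequence, while CSI can. The following is a minimal sketch, not part of EGAPx, of how one might check whether an assembly has sequences past that limit. The function name and the input dictionary are hypothetical and used only for illustration.

```python
# Illustrative sketch only; not taken from the EGAPx code base.
# BAI uses a fixed binning scheme that covers coordinates up to 2**29 bases.
BAI_MAX_REF_LEN = 2 ** 29  # 536,870,912 bases


def needs_csi(seq_lengths):
    """Return True if any sequence is too long to be indexed with BAI.

    seq_lengths: hypothetical mapping of sequence name -> length in bases.
    """
    return any(length > BAI_MAX_REF_LEN for length in seq_lengths.values())


if __name__ == "__main__":
    # Made-up lengths for demonstration.
    example = {"chr1": 600_000_000, "chr2": 120_000_000}
    print(needs_csi(example))
```

Assemblies with very long chromosomes, as found in some plants and amphibians, are the typical case where this limit is reached.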

Future plans:

  • Workflow for GenBank submission. Contact us if you want to help with testing.
  • Long-read transcript evidence using minimap2
  • Short ncRNA prediction with tRNAscan and Rfam

Release v0.2-alpha

26 Jul 00:44
314cbbc
  • Updated resource allocation for different tasks
  • Added support for non-SRA reads
  • Added option for off-line mode
  • Bug fixes

Release v0.1.2-alpha

09 May 13:31
5a916d3
  • Added configs for Biowulf cluster and Biowulf local
  • Added a config for SLURM, which users will need to edit according to their cluster specifications
  • Bug fixes

EGAPx alpha release

01 Apr 14:48
203afce
Pre-release

This version of EGAPx is an alpha release with limited features and organism scope to collect initial feedback on execution. Outputs are not yet complete and not intended for production use.