Skip to content

Releases: vgteam/vg

vg 1.62.0 - Ranzano

16 Dec 21:38
230872c
Compare
Choose a tag to compare

Download for Linux

Don't forget to mark the static binary executable:

chmod +x vg

Docker Image: quay.io/vgteam/vg:v1.62.0

Buildable Source Tarball: vg-v1.62.0.tar.gz

Includes source for vg and all submodules. Use this instead of Github's "Source Code" downloads; those will not build as they do not include code for bundled dependencies that the vg build process needs.

This release includes:

  • GBWT construction from a GAM/GAF file now uses parallel construction jobs.
  • vg chunk and vg find now generate subpaths with subrange metadata when cutting up paths.
  • vg gbwt will accept subranges on fragment 0 and discard the fragment number.
  • vg map, vg mpmap, and vg giraffe can now annotate output with SAM-style flags from FASTQ comments with --comments-as-tags
  • vg surject now detects when multipath alignments obviously don't belong to the graph they are being surjected to.
  • Updated libbdsg to check if a distance index actually has distances and to improve memory use of distance indexing when not including distances
  • vg index now includes include flag --no-nested-distance to build a distance index with distances only on the top-level chain
  • Add --snarl-sample to vg stats -R. This adds BED-style reference coordinates to the front of each row in the snarl output table, using the input sample to select reference paths. If no selected path is found .'s are written. If multiple paths / intervals find (in case of cyclces), the first one found is printed.
  • vg deconstruct -n bug that bypassed some nested sites fixed.
  • When reading a .gff3 file with vg rna, validate exon ordering by base-pair position instead of number attribute. This allows reverse-strand exons to be numbered either by base-pair order or transcription order.
  • Have vg rna gracefully ignore features with bad chromosome names if they're not included in --feature-type and thus won't be parsed anyways

Updated Submodules

The gbwt, gbwtgraph, gcsa2, libbdsg, libvgio, sdsl-lite, and sublinear-Li-Stephens submodules have been updated.

vg 1.61.0 - Plodio

04 Nov 19:17
cb82ebb
Compare
Choose a tag to compare

Download for Linux

Don't forget to mark the static binary executable:

chmod +x vg

Docker Image: quay.io/vgteam/vg:v1.61.0

Buildable Source Tarball: vg-v1.61.0.tar.gz

Includes source for vg and all submodules. Use this instead of Github's "Source Code" downloads; those will not build as they do not include code for bundled dependencies that the vg build process needs.

This release includes:

  • Changed the protocol of the elfutils submodule from git:// to https://
  • Set snarl distance to 0 in path normalizer now that distance index is fixed.
  • GCSA indexing now checks for I/O errors on temporary kmer files.
  • make get-deps can no longer demand you set a timezone even when DEBIAN_FRONTEND=noninteractive in the calling shell.
  • vg chunk can now reference-index particular haplotype paths when requested.
  • vg paths -o added to apply a ReferencePathOverlayHelper before looking at the paths.
  • Mac CI now uses macOS 15 base image.
  • source_me.sh is no longer needed.
  • fix validate bug on protobuf graphs
  • vg surject --prune-low-cplx should no longer assess different contexts for different orientations of the same read, and should properly detect short tail anchors.
  • sublinear-Li-Stephens now builds with the same compiler as the rest of the vg build
  • vg rna can now output a haplotypes GBWT with node IDs matching the spliced graph

Updated Submodules

The libbdsg, sdsl-lite, and sublinear-Li-Stephens submodules have been updated.

The elfutils submodule URL has changed; make sure to git submodule sync.

vg 1.60.0 - Annicco

23 Sep 19:07
d6ea214
Compare
Choose a tag to compare

Download for Linux

Don't forget to mark the static binary executable:

chmod +x vg

Docker Image: quay.io/vgteam/vg:v1.60.0

Buildable Source Tarball: vg-v1.60.0.tar.gz

Includes source for vg and all submodules. Use this instead of Github's "Source Code" downloads; those will not build as they do not include code for bundled dependencies that the vg build process needs.

This release includes:

  • vg's self-reported version information should no longer be able to lag behind the actual Git revision checked out
  • vg stats --snarl-contents added to print a table of snarl / depth / parent / nested node ids
  • vg inject can now handle reads that softclip after the end of the target path
  • vg paths -n option added to normalize graphs using path information to "snap together" redundant paths through snarls. After running, no two path traversals through a snarl will ever produce the same sequence string without the traversals themselves being identical.
  • vg build system no longer patches kff-cpp-api CMakeLists.txt.

Updated Submodules

The kff-cpp-api, libbdsg and sublinear-Li-Stephens submodules have been updated.

vg 1.59.0 - Casatico

12 Aug 20:16
e6f0326
Compare
Choose a tag to compare

Download for Linux

Don't forget to mark the static binary executable:

chmod +x vg

Docker Image: quay.io/vgteam/vg:v1.59.0

Buildable Source Tarball: vg-v1.59.0.tar.gz

Includes source for vg and all submodules. Use this instead of Github's "Source Code" downloads; those will not build as they do not include code for bundled dependencies that the vg build process needs.

This release includes:

  • Experimental haploid scoring model for haplotype sampling.
  • Faster BAM output when using a GBZ format graph
  • vg inject no longer reports invalid alignments
  • vg giraffe --parameter-preset long-form option should now work again
  • vg surject now complains if extra command line arguemnts are provided
  • vg Mac builds no longer pass -headerpad without an argument to the linker
  • vg giraffe in non-chaining single-end mode will again prioritize minimizers the same way as in paired-end mode
  • README no longer interpretable as suggesting that users build with very old Clang
  • vg surject now uses an unlimited --max-anchors by default
  • vg clip faster on graphs with high path depth
  • vg surject has improved base-level alignment accuracy with long reads
  • Random walk band padding is now available outside MultipathAligner
  • Gapless extension queue order should no longer cause vg giraffe output to very across compilers
  • vg giraffe --show-work no longer claims to not be doing rescue when it is
  • vg annotate can now output alignments in GAF.
  • When reading more than 1000 BED or GFF records, vg will no longer dump the first records to standard output and forget about them.

Updated Submodules

The libbdsg and libhandlegraph submodules have been updated.

vg 1.58.0 - Cartari

01 Jul 21:13
a049c6b
Compare
Choose a tag to compare

Download for Linux

Don't forget to mark the static binary executable:

chmod +x vg

Docker Image: quay.io/vgteam/vg:v1.58.0

Buildable Source Tarball: vg-v1.58.0.tar.gz

Includes source for vg and all submodules. Use this instead of Github's "Source Code" downloads; those will not build as they do not include code for bundled dependencies that the vg build process needs.

This release includes:

  • vg deconstruct now does path-based (formerly -e) deconstruction by default. Old default behaviour of exhaustively processing (tiny) sites is deprecated.
  • if -a is not used, vg deconstruct will recursive on child snarls of snarls it failed to process (like vg call) - functionality that was, I think, dropped a while back.
  • Experimental option -L added to vg deconstruct in order to cluster similar allele traversals together. The value given is a (length-weighted) threshold for the jaccard coefficient between the oriented nodes of two traversals. So if -L 0.75 is given, then alleles that have >= 0.75 similarity based on their graph positions will be merged into one. Two new FORMAT fields are added to keep track of the difference, TS (jaccard distance) and TL (length difference). Clustering is done greedily starting with selected reference paths.
  • new (experimental) option -n added to vg deconstruct. Like -a, it genotypes nested sites, but unlike -a it does so top-down, setting various tags that keep track of the nesting relationship at the allele level (and also linking every site back to its position on the LV=0 reference chromosome). *-alleles (used in recent VCF versions to represent spanning alleles) are used. This option will not support nested insertions on GBZ/GBWT input -- so in practice it should be used on chromosome-level .vg files (I will look into relaxing this).
  • -R option added to vg deconstruct to toggle whether star-alleles are reported with -n.
  • README now explains how to get vg on your PATH
  • README now explains how to build on multiple threads
  • vg can now read GAM files generated by the long-read Giraffe prototype
  • vg filter now lets you require exact matches for name filters instead of prefix matches with --exact-name.
  • deconstruct/call can write giant VCF lines. This happens in, say, large svs with lots of samples that each get their own allele due to nested variation (hopefully deconstruct -L can mitigate this via merging). Giant AT fields for each allele don't help. bcf apparently has a 2 gig line limit, and there's a case of deconstruct seemingly truncating large records. vg deconstruct / call are now modified to drop (with a warning) any lines >2Gb to avoid these issues.
  • vg giraffe should no longer crash when mapping paired-end reads and reporting secondaries without a fragment length distribution
  • vg inject now supports GAF format with the new --output-format/-o option

Updated Submodules

  • libvgio

vg 1.57.0 - Franchini

20 May 18:50
8ece064
Compare
Choose a tag to compare

Download for Linux

Don't forget to mark the static binary executable:

chmod +x vg

Docker Image: quay.io/vgteam/vg:v1.57.0

Buildable Source Tarball: vg-v1.57.0.tar.gz

Includes source for vg and all submodules. Use this instead of Github's "Source Code" downloads; those will not build as they do not include code for bundled dependencies that the vg build process needs.

This release includes:

  • Added haplotype sampling example
  • Added subpath support to vg annotate
  • Duplicate words removed from documentation
  • WFAExtender alignments are now scored using the full length bonus
  • Extra debug messages removed from vg chunk
  • libdeflate build should now have more space to be renamed

Updated Submodules

None

vg 1.56.0 - Collalto

08 Apr 20:31
963d8d9
Compare
Choose a tag to compare

Download for Linux

Don't forget to mark the static binary executable:

chmod +x vg

Docker Image: quay.io/vgteam/vg:v1.56.0

Buildable Source Tarball: vg-v1.56.0.tar.gz

Includes source for vg and all submodules. Use this instead of Github's "Source Code" downloads; those will not build as they do not include code for bundled dependencies that the vg build process needs.

This release includes:

  • vg GFA output should now use P lines instead of W lines when nonzero phase blocks are present, to allow them to be round-tripped.
  • vg call -c and -C options changed to limit search based on all alleles and not just reference allele. This means these options work much better in practice to prevent vg call from being lost in giant snarls.
  • --progress option added to vg call
  • Paths with subranges on phase block 0 (or on haplotype paths without explicit phase blocks) can now be saved to GFA again.
  • vg now supports reading from indexed GAF files of reads in many places where an indexed GAM file can be used

Updated Submodules

  • htslib

vg 1.55.0 - Bernolda

26 Feb 22:30
67c20e9
Compare
Choose a tag to compare

Download for Linux

Don't forget to mark the static binary executable:

chmod +x vg

Docker Image: quay.io/vgteam/vg:v1.55.0

Buildable Source Tarball: vg-v1.55.0.tar.gz

Includes source for vg and all submodules. Use this instead of Github's "Source Code" downloads; those will not build as they do not include code for bundled dependencies that the vg build process needs.

This release includes:

  • vg gamcompare now keeps track of whether reads could have possibly been gotten correct, based on there being a truth position for them in the truth set, and excludes reads without a truth from plots and accuracy statistics.
  • vg giraffe no longer produces alignments with deletions on one end
  • Multithreaded path cover / local haplotypes GBWT construction.
  • CI should no longer fail due to truncated FTP download
  • GBZ paper citation link now points to the right place
  • vg surject will left align relative to the forward strand most of the time.

Updated Submodules

  • gbwtgraph

vg 1.54.0 - Parafada

18 Jan 11:27
40cc426
Compare
Choose a tag to compare

Download for Linux

Don't forget to mark the static binary executable:

chmod +x vg

Docker Image: quay.io/vgteam/vg:v1.54.0

Buildable Source Tarball: vg-v1.54.0.tar.gz

Includes source for vg and all submodules. Use this instead of Github's "Source Code" downloads; those will not build as they do not include code for bundled dependencies that the vg build process needs.

This release includes:

  • Integrated haplotype sampling in vg giraffe now does diploid sampling.
  • GBWTGraph algorithm for parsing GFA now handles P-line names of the form sample#contig correctly.

vg 1.53.0 - Valmontone

04 Dec 21:27
25b02cc
Compare
Choose a tag to compare

Download for Linux

Don't forget to mark the static binary executable:

chmod +x vg

Docker Image: quay.io/vgteam/vg:v1.53.0

Buildable Source Tarball: vg-v1.53.0.tar.gz

Includes source for vg and all submodules. Use this instead of Github's "Source Code" downloads; those will not build as they do not include code for bundled dependencies that the vg build process needs.

This release includes:

  • vg sim no longer crashes on graphs with 1-node cycles
  • vg autoindex can identify haplotypes specified as P-lines in a GFA
  • Set reference samples in GBWT or GBZ with vg gbwt option --set-reference.
  • vg rna no longer projects transcripts twice onto a reference given by RS tag in a GFA
  • vg rna assigns unique names to twice-projected transcripts on cyclic haplotypes
  • GBWT construction automatically increases buffer size if the paths are too long.
  • In vg haplotypes, the default number of candidates for diploid sampling is now 32.
  • vg giraffe now explains that --named-coordinates works for GAF output
  • libvgio now uses quoted includes internally
  • vg's README now prominently lists some recommended papers to cite when using parts of vg in your work
  • Updated dozeu submodule should no longer crash vg giraffe and vg surject.

Updated Submodules

  • dozeu
  • gbwt
  • libhandlegraph
  • libvgio