-
Notifications
You must be signed in to change notification settings - Fork 23
align_pair_seq
align_pair_seq creates a pairwise alignment of the first two sequences in the stream.
The algorithm used works by locating all Maximally Extended Matches (MEMs) between two sequences within a given search space and seeded by matches of a given kmer size. All MEMs located are scored according to the length and distance to the closest diagonal in the search space and the highest scoring MEM is picked and added to the alignment chain. Next the right and left search space is recursed into.
... | align_pair_seq [options]
[-? | --help] # Print full usage description.
[-I <file!> | --stream_in=<file!>] # Read input from stream file - Default=STDIN
[-O <file> | --stream_out=<file>] # Write output to stream file - Default=STDOUT
[-v | --verbose] # Verbose output.
Consider the following FASTA entries in the file test.fna
:
>test1
tgcatgctagctatagccgtttgtacgatggctagccagcag
>test2
tgctacctagctagcatacgtacgatgatgctaggtagctgg
To create a pairwise alignment of these do:
read_fasta -i test.fna | align_pair_seq | write_align -x
. . . .
test1 TGCaTg-CTAGCTAtaGCcgTttGTACGATG--GCTAGccAGCa-G
||| | ||||||| || | |||||||| ||||| ||| |
test2 TGC-TacCTAGCTA--GCa-TacGTACGATGatGCTAGgtAGCtgG
. . . .
Martin Asser Hansen - Copyright (C) - All rights reserved.
June 2009
GNU General Public License version 2
http://www.gnu.org/copyleft/gpl.html
align_pair_seq is part of the Biopieces framework.