Skip to content

Commit

Permalink
Update readme
Browse files Browse the repository at this point in the history
  • Loading branch information
mcmero committed May 8, 2024
1 parent a0b56a0 commit 76c4c5c
Showing 1 changed file with 9 additions and 3 deletions.
12 changes: 9 additions & 3 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,13 +1,18 @@
# Nanopore Overhang Pipeline
# Nanopore demux-count pipeline

A Nextflow pipeline for the preprocessing of reads using the overhang protocol run on Oxford Nanopore Technology. The pipeline locates the specified forward and reverse primer sequences and trims the read N bases upstream of the forward primer and N bases downstream of the reverse primer, where N is the barcode length.
A Nextflow pipeline for the processing of reads from Oxford Nanopore Technology. The pipeline locates the specified forward and reverse primer sequences and trims the read N bases upstream of the forward primer and N bases downstream of the reverse primer, where N is the barcode length.

If demultiplexing by barcodes is required, the pipeline can split the data into separate fastq files by sample. Sequences of interest can also be counted if a fasta file is provided of guide sequences.

The read structure is typically:

`[sequence][fwd_index][fwd_primer][sequence_of_interest][rev_primer][rev_index][sequence]`

The pipeline uses the following tools:

- [splitcode](https://github.com/pachterlab/splitcode) for demultiplexing
- [minimap2](https://github.com/lh3/minimap2) for alignment

## How to install (WEHI only)

The easiest way to run the pipeline is to use the [Seqera Platform](https://seqera.services.biocommons.org.au/) service provided to WEHI by Australian Biocommons. You can find more information about Seqera Platform (formerly Nextflow Tower) on WEHI's [Research Computing page](https://wehieduau.sharepoint.com/sites/rc2/SitePages/Nextflow-Tower.aspx). See the [Configuration](https://github.com/WEHIGenomicsRnD/nf-qc-pipe#tower-configuration) section for more info.
Expand Down Expand Up @@ -52,7 +57,8 @@ Here are the parameters you will need to set:
- `--barcode_length`: how many bases to trim to the left and right of the primer sequences. If your barcode includes spacers make sure to take that into account (i.e., non-informative bases between the index and primer). Set this to 0 if you do not have barcodes.
- `--index_template_file`: if demultiplexing, use this index file to specify or lookup indexes (see below for format).
- `--guides_fasta`: (optional) fasta file contains guide sequences to count.
- `--use_db`: boolean value, whether or not to look up indexes in the Genomics database.
- `--use_db`: boolean value, default: false. Whether or not to look up indexes in the Genomics database.
- `--lenient_counts`: boolean value, default: false. If true, reads do not have to span the whole guide sequence to be counted (they will be counted as a partial map).

### Index template file format

Expand Down

0 comments on commit 76c4c5c

Please sign in to comment.