dsl2 port #182

phue · 2021-01-19T20:14:32Z

This is some initial effort on porting nf-core/methylseq to dsl2.

My aim was to retain the functionality that was already there, I think that all features from v1.5 are working now.

A breaking change however is, that the pipeline now requires a samplesheet similar to what is already used in nf-core/nanoseq for example. It is supposed to have 4 columns:

sample	fastq_1	fastq_2	genome

The idea behind this change is to enable mapping of samples against different references (#181), something that is very useful for certain use cases.
Bonus: the samplesheet makes the single_end parameter obsolete

Would be great to get some opinions on this @nf-core/core

TODOs:

See nextflow-io/nextflow#1800 (comment)

This code works for ch_multiqc_custom_config - why not here?

* fastqc * picard/markduplicates * preseq/lcextrap * samtools/flagstat * samtools/index * samtools/stats * samtools/sort * trimgalore * multiqc

* bismark/genome_preparation * bismark/align * bismark/deduplicate * bismark/extract * bismark/report * bismark/summary TODO: write tests and add to nf-core/modules

* bwameth/align * bwameth/index TODO: write tests and add to nf-core/modules

* methyldackel/extract * methyldackel/mbias TODO: write tests and add to nf-core modules

TODO: write tests and add to nf-core/modules

this is inspired by the functionality in nf-core/nanoseq and nf-core/rnaseq The idea is to require a samplesheet to run the pipeline, which will allow for single/paired end auto-detection and mapping samples against different reference genomes. addresses nf-core#181

TODO: needs change in nf-core/test-datasets

baseDir is deprecated

drpatelh · 2021-01-19T23:48:37Z

Ah man! I will never get tired of seeing these initial DSL2 PRs appearing out of nowhere 😍 Looks great by just looking at the file changes!

The fact that nf-core/modules will get more and more padded out is a huge bonus too!

Nice work 🕺🏽

phue · 2021-01-20T14:32:33Z

@drpatelh Your nf-core/rnaseq port was a very helpful guideline to figure out how to do things! Thanks for that 👍

* methyldackel/extract * methyldackel/mbias TODO: write tests and add to nf-core modules

TODO: write tests and add to nf-core/modules

this is inspired by the functionality in nf-core/nanoseq and nf-core/rnaseq The idea is to require a samplesheet to run the pipeline, which will allow for single/paired end auto-detection and mapping samples against different reference genomes. addresses nf-core#181

TODO: needs change in nf-core/test-datasets

the pipeline now requires a samplesheet

remove conda and docker related actions

merging dsl2-template

phue · 2021-03-24T18:26:24Z

closing this because there is now a dsl2 branch here

ewels and others added 24 commits November 18, 2020 23:12

Try Channel.empty().collect()

a776add

Try again with various combinations of Channels and collect statements

9e273fe

Try making CI Nextflow installation logs quieter

c4e6462

See nextflow-io/nextflow#1800 (comment)

Brackets shouldn't make any difference

a5b0ea7

This code works for ch_multiqc_custom_config - why not here?

Fix logic for known_splices in bismark analysis

04fbaf8

Template update for nf-core/tools version 1.12

adfaff4

Template update for nf-core/tools version 1.12.1

4b90b02

check in required nf-core modules

0625da5

* fastqc * picard/markduplicates * preseq/lcextrap * samtools/flagstat * samtools/index * samtools/stats * samtools/sort * trimgalore * multiqc

add required bismark modules

a666c5d

* bismark/genome_preparation * bismark/align * bismark/deduplicate * bismark/extract * bismark/report * bismark/summary TODO: write tests and add to nf-core/modules

add required bwameth modules

b71b35a

* bwameth/align * bwameth/index TODO: write tests and add to nf-core/modules

add required methyldackel modules

78b492a

* methyldackel/extract * methyldackel/mbias TODO: write tests and add to nf-core modules

add required qualimap/bamqc module

470b74e

TODO: write tests and add to nf-core/modules

add required samtools/faidx module

8fa9177

TODO: write tests and add to nf-core/modules

add functions from nf-core/tools dsl2 template

aa84f29

add bismark subworkflow

7dc73ff

add bwameth subworkflow

811c806

wire up subworkflows

f31c396

add file placeholder

ed6e252

update test.config to use samplesheet

30bd4f3

TODO: needs change in nf-core/test-datasets

update base.config

2c7eda9

update igenomes.config

4152bae

baseDir is deprecated

sync some changes from dsl2 template

9dc0108

bump version to 2.0dev

e4abecd

This was referenced Jan 30, 2021

modules for bisulfite sequencing data nf-core/modules#129

Closed

Create samplesheet.csv phue/test-datasets#1

Merged

methylseq: add samplesheet.csv nf-core/test-datasets#214

Merged

Strip brackets

94b1a57

phue added 16 commits March 22, 2021 15:59

add required methyldackel modules

1f76e0d

* methyldackel/extract * methyldackel/mbias TODO: write tests and add to nf-core modules

add required qualimap/bamqc module

51e7518

TODO: write tests and add to nf-core/modules

add required samtools/faidx module

1a9d597

TODO: write tests and add to nf-core/modules

add functions from nf-core/tools dsl2 template

baf4faa

add bismark subworkflow

3994b73

add bwameth subworkflow

d6cb697

Merge dev

e544a1a

add file placeholder

70882f2

update test.config to use samplesheet

9ebf9ac

TODO: needs change in nf-core/test-datasets

update base.config

25b7f65

sync some changes from dsl2 template

2aa04a5

bump version to 2.0dev

bbf3d28

update test profile for dsl2

0eb6e9f

the pipeline now requires a samplesheet

remove files that become obsolete with dsl2

9be06d4

Merge branch 'dsl2' of https://github.com/phue/methylseq into dsl2

e83dddc

phue added the help wanted Extra attention is needed label Mar 22, 2021

phue added 9 commits March 22, 2021 17:18

update ci.yml

655d938

remove conda and docker related actions

update nextflow.config

f1df433

prepare for sync with nf-core/modules

a88c631

Template update for nf-core/tools version 1.14.dev0

99c3768

Merge branch 'TEMPLATE' into dsl2

8eb68bc

merging dsl2-template

sync with nf-core/modules

0243d13

cleanup some duplicated files

867b4ee

finish restructuring; create local modules where needed

7b3c04a

update nextflow_schema.json

34305ef

phue closed this Mar 24, 2021

phue mentioned this pull request Mar 25, 2021

dsl2 port #199

Merged

3 tasks

	multicore = ''
	if( task.cpus ){
	// Numbers based on recommendation by Felix for a typical mouse genome
	if( params.single_cell \|\| params.zymo \|\| params.non_directional ){
	cpu_per_multicore = 5
	mem_per_multicore = (18.GB).toBytes()
	} else {
	cpu_per_multicore = 3
	mem_per_multicore = (13.GB).toBytes()
	}
	// Check if the user has specified this and overwrite if so
	if(params.bismark_align_cpu_per_multicore) {
	cpu_per_multicore = (params.bismark_align_cpu_per_multicore as int)
	}
	if(params.bismark_align_mem_per_multicore) {
	mem_per_multicore = (params.bismark_align_mem_per_multicore as nextflow.util.MemoryUnit).toBytes()
	}
	// How many multicore splits can we afford with the cpus we have?
	ccore = ((task.cpus as int) / cpu_per_multicore) as int
	// Check that we have enough memory, assuming 13GB memory per instance (typical for mouse alignment)
	try {
	tmem = (task.memory as nextflow.util.MemoryUnit).toBytes()
	mcore = (tmem / mem_per_multicore) as int
	ccore = Math.min(ccore, mcore)
	} catch (all) {
	log.debug "Warning: Not able to define bismark align multicore based on available memory"
	}
	if( ccore > 1 ){
	multicore = "--multicore $ccore"
	}
	}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dsl2 port #182

dsl2 port #182

phue commented Jan 19, 2021 •

edited

Loading

drpatelh commented Jan 19, 2021

phue commented Jan 20, 2021

phue commented Mar 24, 2021

dsl2 port #182

dsl2 port #182

Conversation

phue commented Jan 19, 2021 • edited Loading

drpatelh commented Jan 19, 2021

phue commented Jan 20, 2021

phue commented Mar 24, 2021

phue commented Jan 19, 2021 •

edited

Loading