nf-core/ampliseq
Amplicon sequencing analysis workflow using DADA2 and QIIME2
2.3.0
). The latest
stable release is
2.14.0
.
Either a tab-seperated sample sheet, a fasta file, or a folder containing zipped FastQ files
string
Forward primer sequence
string
Reverse primer sequence
string
Path to metadata sheet, when missing most downstream analysis are skipped (barplots, PCoA plots, …).
string
Define where the pipeline should find input data and save output data.
If data is single-ended PacBio reads instead of Illumina
boolean
If data is single-ended IonTorrent reads instead of Illumina
boolean
If data is single-ended Illumina reads instead of paired-end
boolean
Part of ITS region to use for taxonomy assignment: “full”, “its1”, or “its2”
string
Cutoff for partial ITS sequences. Only full sequences by default.
integer
If samples were sequenced in multiple sequencing runs
boolean
If analysing ITS amplicons or any other region with large length variability with Illumina paired end reads
boolean
Not recommended: When paired end reads are not sufficiently overlapping for merging.
boolean
Mode of sample inference: “independent”, “pooled” or “pseudo”
string
Comma separated list of metadata column headers for statistics.
string
Formula for QIIME2 ADONIS metadata feature importance test for beta diversity distances
string
Naming of sequencing files
string
/*_R{1,2}_001.fastq.gz
If the functional potential of the bacterial community is predicted.
boolean
If data should be exported in SBDI (Swedish biodiversity infrastructure) Excel format.
boolean
Specifies the random seed.
integer
100
The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.
string
Email address for completion summary.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
Cutadapt will retain untrimmed reads, choose only if input reads are not expected to contain primer sequences.
boolean
Cutadapt will be run twice to ensure removal of potential double primers
boolean
DADA2 read truncation value for forward strand, set this to 0 for no truncation
integer
DADA2 read truncation value for reverse strand, set this to 0 for no truncation
integer
If —trunclenf and —trunclenr are not set, these values will be automatically determined using this median quality score
integer
25
Assures that values chosen with —trunc_qmin will retain a fraction of reads.
number
0.75
DADA2 read filtering option
integer
2
DADA2 read filtering option
integer
DADA2 read filtering option
integer
50
Enable SSU filtering. Comma separated list of kingdoms in Barrnap.
string
Minimum taxonomy agglomeration level for DADA2 classification
integer
2
Maximum taxonomy agglomeration level for DADA2 classification
integer
7
Minimum taxonomy agglomeration level for QIIME2 classification
integer
2
Maximum taxonomy agglomeration level for QIIME2 classification
integer
6
Name of supported database, and optionally also version number
string
If the expected amplified sequences are extracted from the DADA2 reference taxonomy database
boolean
Name of supported database, and optionally also version number
string
Path to QIIME2 trained classifier file (typically *-classifier.qza)
string
Ignore input files considered too small (<1KB) for individual samples and continue the pipeline without those samples.
boolean
Ignore files considered too small (<1KB) after trimming and continue the pipeline without those samples.
boolean
Comma separated list of unwanted taxa, to skip taxa filtering use “none”
string
mitochondria,chloroplast
Abundance filtering
integer
1
Prevalence filtering
integer
1
Skip FastQC
boolean
Skip primer trimming with cutadapt. This is not recommended! Use only in case primer sequences were removed before and the data does not contain any primer sequences.
boolean
Skip annotating SSU matches.
boolean
Skip all steps that are executed by QIIME2, including QIIME2 software download, taxonomy assignment by QIIME2, barplots, relative abundance tables, diversity analysis, differential abundance testing.
boolean
Skip taxonomic classification. Incompatible with --sbdiexport
boolean
Skip species level when using DADA2 for taxonomic classification. This reduces the required memory dramatically under certain conditions. Incompatible with --sbdiexport
boolean
Skip producing barplot
boolean
Skip producing any relative abundance tables
boolean
Skip alpha rarefaction
boolean
Skip alpha and beta diversity analysis
boolean
Skip differential abundance testing
boolean
Skip MultiQC reporting
boolean
Parameters used to describe centralised config profiles. These should not be edited.
Git commit id for Institutional configs.
string
master
Base directory for Institutional configs.
string
https://raw.githubusercontent.com/nf-core/configs/master
Institutional config name.
string
Institutional config description.
string
Institutional config contact information.
string
Institutional config URL link.
string
MultiQC report title. Printed as page header, used for filename if not otherwise specified.
string
Less common options for the pipeline, typically set in a config file.
Display help text.
boolean
Method used to save pipeline results to output directory.
string
Email address for completion summary, only when pipeline fails.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
Send plain-text email instead of HTML.
boolean
File size limit when attaching MultiQC reports to summary emails.
string
25.MB
^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$
Do not use coloured log outputs.
boolean
Custom config file to supply to MultiQC.
string
Directory to keep pipeline Nextflow logs and reports.
string
${params.outdir}/pipeline_info
Boolean whether to validate parameters against the schema at runtime
boolean
true
Show all params when using --help
boolean
Run this workflow with Conda. You can also use ‘-profile conda’ instead of providing this parameter.
boolean
Set the top limit for requested resources for any single job.
Maximum number of CPUs that can be requested for any single job.
integer
16
Maximum amount of memory that can be requested for any single job.
string
128.GB
^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$
Maximum amount of time that can be requested for any single job.
string
240.h
^(\d+\.?\s*(s|m|h|day)\s*)+$