nf-core/mag
Assembly and binning of metagenomes
Version history
Added
- #718 - Add support for independent long-read metagenomic assembly (requested by @ljmesi and many others, added by @muabnezor)
- #718 - Added metaMDBG and (meta)Flye as long read assemblers (added by @muabnezor)
- #718 - Added host removal for long reads using minimap2 as aligner (added by @muabnezor)
- #827 - Added nf-test CI testing for all test profiles (added by @jfy133)
- #829 - Add
--skip_shortread_qcand--skip_longread_qcparams for skipping certain default preprocessing steps (added by @erikrikarddaniel) - #846 - Improve documentation of
groupsamplesheet column (added by @vinisalazar) - #855 - Add basic nf-tests for test_longreadonly, test_longreadonly_alternatives, test_hybrid and test_assembly_input (added by @dialvarezs)
- #864 - Add
--gtdbtk_skip_aniscreento disable fast classification of genomes by ANI using skani in GTDB-Tk (by @jfy133 and @prototaxites).
Changed
- #718 - Refactored all assembly steps into subworkflows (added by @muabnezor)
- #799 - Add
--cat_classify_unbinned, to enable taxonomic classification of unbinned contigs using CAT (requested by @amizeranschi, added by @dialvarezs) - #799 - Upgraded to latest version of CAT_pack modules (requested by @maxibor, added by @dialvarezs)
- #811 - Update util modules, and remove aria2 module to replace with native Nextflow downloading of CheckM database (by @dialvarezs)
- #816 - Removed all leftover references to conda ‘defaults’ channel (by @jfy133)
- #823 - Updated to nf-core 3.3.1
TEMPLATE(by @jfy133 ) - #827 - Updated to nf-core 3.3.2
TEMPLATE(by @dialvarezs) - #841 - MultiQC config updated to support CheckM, CheckM2, and GTDB-Tk (by @harper357)
- #844 - Change loading mechanism of internal PhiX/Lambda databases to improve Dev UX when schema building (by @jfy133)
- #851 - Improve structure of local modules and subworkflows (by @dialvarezs)
- #853 - Update nf-core modules and subworkflows (by @dialvarezs)
- #856 - Update more nf-core modules (by @dialvarezs)
Fixed
- #843 - Fixed issue with large format Bowtie2 index files not being emitted from index module (reported by Nick Eckersley, fix by @jfy133)
- #847 - Allow the BBNorm process to use only 0.8 of the memory allocated to the task to stop if from oversubscribing memory (reported by and fix by @erikrikarddaniel)
- #850 - Fixed some modules of the GDTBTk subworkflow not being represented in version lists (fix by @jfy133)
- #852 - Fixed version reporting by ensure all modules are represented in final version.yml for MultiQC (by @jfy133)
- #854 - Update porechop/abi to a patched version to prevent duplicated read names (reported by @palec87, fix by @jfy133)
- #858 - Fix a single parameter validation failure reporting errors for all parameters by updated nf-schema to 2.5.1 (reported by @Pranjal-Bioinfo, fix by @nvnieuwk and @jfy133)
- #864 - Fix missing multi-threading of MetaEuk easypredict (reported by @OlivierCoen, fix by @prototaxites).
Dependencies
| Tool | Previous version | New version |
|---|---|---|
| bcftools | 1.17 | 1.21 |
| BUSCO | 5.8.3 | 6.0.0 |
| CAT | 5.2.3 | 6.0.1 |
| centrifuge | 1.0.4.1 | 1.0.4.2 |
| dastool | 1.1.6 | 1.1.7 |
| nanolyse | 1.41.6 | 1.44.1 |
| fastp | 0.23.4 | 0.24.0 |
| flye | 2.9.5 | |
| Freebayes | 1.3.6 | 1.3.10 |
| geNomad | 1.5.2 | 1.11.0 |
| GTDB-Tk | 2.4.0 | 2.5.2 |
| metabat2 | 2.15 | 2.17 |
| metamdbg | 1.0 | |
| minimap2 | 2.29 | |
| mmseqs2 | 14.7e284 | 17.b804f |
| samtools | 1.21 | |
| nf-core | 3.2.0 | 3.3.2 |
| pydamage | 0.7.0 | 1.0.0 |
| seqtk | 1.3 | 1.4 |
| porechop_abi | 0.5.0 | 0.5.0post1 |
| NanoPlot | 1.44.1 | 1.46.1 |
Deprecated
- #799 - Removed
--cat_official_taxonomyin favour of--cat_allow_unofficial_lineagesto control CAT’s use of unofficial lineages (added by @dialvarezs) - #825 - Removed
--centrifuge_db,--kraken2_db,--krona_dband--skip_kronaparameters following the removal of taxonomic profiling functionality. See nf-core/taxprofiler for replacement (added by @dialvarezs) - #851 - Remove
POOL_READ_*local modules in favor of nf-core cat/fastq (by @dialvarezs) - #855 - Remove test_adapterremoval, test_ancient_dna, test_bbnorm, test_busco_auto, test_host_rm, test_hybrid_host_rm, test_binrefinement, test_concoct and test_longread profiles (added by @dialvarezs)
- #864 - Remove
--gtdb_mashdue to dropping of support by GTDBTk itself (by @prototaxites and @jfy133)
Added
- #730 - Added
--busco_db_lineageto allow specifying a specific lineage for BUSCO database (added by @jfy133, @dialvarezs) - #730 - Added a new documentation section on database setup (by @jfy133, @dialvarezs)
- #784 - Added
--bin_min_sizeand--bin_max_sizeparameters to filter out bins based on size (requested by @maxibor, @alexhbnr, added by @jfy133, @prototaxites) - #793 - Document use of a SquashFS image with
--gtdb_db, useful for limited inode infrastructure (by @muniheart) - #805 - Add support for fastp’s
--trim_poly_goption (by @jfy133)
Changed
- #730 - Migrate from local BUSCO module to nf-core one, updating version (by @dialvarezs)
- #730 - Use BUSCO database from nf-core test datasets (by @dialvarezs)
- #788 - Tweak method of loading GTDB database in GTDBTK_CLASSIFYWF for more stability (reported by @alexhbnr, fix by @jfy133)
- #800 - Default branch is now set to
main(by @jfy133 and @mirpedrol) - #801 - Increase CheckM memory requests to match recommended requirements (by @jfy133)
Fixed
- #789 - Improve
--bowtie2_modedescription to clarify default settings (reported by @IceGreb, fix by @jfy133) - #798 - Fix overly strict database validation for
--metauk_dband improve documentation (reported by @ruqse, fix by @jfy133) - #804 - Fix broken memory specification for FASTQC (reported by @jmichaelegana, fix by @awgymer & @jfy133)
Dependencies
| Tool | Previous version | New version |
|---|---|---|
| BUSCO | 5.4.3 | 5.8.3 |
| csvtk | 0.31.0 | |
| nextflow | 24.04.2 | 25.04.2 |
Deprecated
- #730 - Remove
--busco_auto_lineage_prokdue to update and simplified usage of BUSCO (added by @jfy133, @dialvarezs)
Added
- #745 - Added pipeline parameter
spades_downstreaminputto use contigs instead of scaffolds (by @Pranjal-Bioinfo, @jfy133, @GallVp & @sateeshperi). - #745 - Added
trimmomaticas an additional pre-processing tool (by @Pranjal-Bioinfo, @jfy133, @GallVp & @sateeshperi). - #745 - Added parameters for
concoct/cut_up_fasta.pyincludingbin_concoct_chunksize,bin_concoct_overlapandbin_concoct_donotconcatlast(by @Pranjal-Bioinfo, @jfy133, @GallVp & @sateeshperi). - #777 - Improved input validation through additional JSON keywords and error messages (by @agusinac)
Changed
- #774 - Update CheckM2 to v1.1.0 and default database (by @dialvarezs).
Fixed
- #726 - Fix formatting errors to follow Nextflow best practice (by @dialvarezs).
- #769 - Fix megahit not emitting correct filenames due to suboptimal arguments ordering (reported and fix by @IceGreb)
- #771 - Fix misspecified checkm2 database parameter check (reported by @dpelegri and fix by @jfy133)
Dependencies
| Tool | Previous version | New version |
|---|---|---|
| CheckM2 | 1.0.2 | 1.1.0 |
| SPAdes | 4.0.0 | 4.1.0 |
Added
- #758 - Added new diagram in metro-map style (by @jfy133, @prototaxites, @d4straub)
Changed
- #731 - Updated to nf-core 3.1.2
TEMPLATE(by @jfy133) - #755 - Updated to nf-core 3.2.0
TEMPLATE(by @jfy133)
Fixed
- #748 - Fix broken phix reference channel when skipping phix removal (reported by @amizeranschi, fix by @muabnezor)
- #752 - Fix QUAST results not being displayed when skipping certain steps (reported by @amizeranschi, fix by @jfy133)
- #753 - Fix iGenomes reference support for host removal reference genome (reported by @Thomieh73, fix by @jfy133)
- #759 - Fixed parameters that allow both files or directories to not error with directories, and general file input validation improvements (reported by @mjfi2sb3, fix by @jfy133)
Added
- #692 - Added Nanoq as optional longread filtering tool (added by @muabnezor)
- #692 - Added chopper as optional longread filtering tool and/or phage lambda removal tool (added by @muabnezor)
- #707 - Make Bin QC a subworkflow (added by @dialvarezs)
- #707 - Added CheckM2 as an alternative bin completeness and QC tool (added by @dialvarezs)
- #708 - Added
--exclude_unbins_from_postbinningparameter to exclude unbinned contigs from post-binning processes, speeding up Prokka in some cases (added by @dialvarezs) - #732 - Added support for Prokka’s compliance mode with
--prokka_with_compliance --prokka_compliance_centre <xyz>(reported by @audy and @Thomieh73, added by @jfy133)
Changed
Fixed
- #707 - Fixed channel passed as GUNC input (added by @dialvarezs)
- #724 - Fix quoting in
utils_nfcore_mag_pipeline/main.nf(added by @dialvarezs) - #716 - Make short read processing a subworkflow (added by @muabnezor)
- #708 - Fixed channel passed as GUNC input (added by @dialvarezs)
- #729 - Fixed misspecified multi-FASTQ input for single-end data in MEGAHIT (reported by John Richards, fix by @jfy133)
Dependencies
| Tool | Previous version | New version |
|---|---|---|
| CheckM | 1.2.1 | 1.2.3 |
| CheckM2 | 1.0.2 | |
| chopper | 0.9.0 | |
| GUNC | 1.0.5 | 1.0.6 |
| nanoq | 0.10.0 |
Fixed
Added
- #674 - Added
--longread_adaptertrimming_toolWhere user can chose between porechop_abi (default) and porechop (added by @muabnezor)
Changed
- #674 - Changed to porechop-abi as default adapter trimming tool for long reads. User can still use porechop if preferred (added by @muabnezor)
- #666 - Update SPAdes to version 4.0.0, replace both METASPADES and MEGAHIT with official nf-core modules (requested by @elsherbini, fix by @jfy133)
- #666 - Update URLs to GTDB database downloads due to server move (reported by @Jokendo-collab, fix by @jfy133)
- #695 - Updated to nf-core 3.0.2
TEMPLATE(by @jfy133) - #695 - Switch more stable Zenodo link for CheckM data (by @jfy133)
Fixed
- #674 - Make longread preprocessing a subworkflow (added by @muabnezor)
- #674 - Add porechop and filtlong logs to multiqc (added by @muabnezor)
- #674 - Change local filtlong module to the official nf-core/filtlong module (added by @muabnezor)
- #690 - MaxBin2 now using the abundance information from different samples rather than an average (reported by @uel3 and fixed by @d4straub)
- #698 - Updated prodigal module to not pick up input symlinks for compression causing pigz errors (reported by @zackhenny, fix by @jfy133 )
Dependencies
| Tool | Previous version | New version |
|---|---|---|
| Porechop_ABI | 0.5.0 | |
| Filtlong | 0.2.0 | 0.2.1 |
| SPAdes | 3.15.3 | 4.0.0 |
[!CAUTION] This release contains a potentially ‘breaking change’ for some users. The
--gtdbtk_pplacer_scratchflag has been replaced with--gtdbtk_pplacer_useram. Check the parameter documentation for more details.
Added
- #665 - Add support for supplying pre-made bowtie host reference index (requested by @simone-pignotti, added by @jfy133)
- #670 - Added
--gtdbtk_pplacer_useramto run GTDBTk in memory mode rather than write to disk (requested by @harper357, fixed by @jfy133)
Changed
- #664 - Update GTDBTk to latest version, with updated column names, update GTDB to release 220 (by @dialvarezs)
- #676 - Added exit code 12 to valid SPAdes retry codes, due to OOM errors from spades-hammer (reported by @bawee, fix by @jfy133)
Fixed
- #667 - Fix pipeline crashing if only CONCOCT selected during binning (reported and fixed by @jfy133)
- #670 - Re-add missing GTDBTk parameters into GTDBTk module (reported by harper357, fixed by @jfy133)
- #672 - Fix GTDB-Tk per-sample TSV files not being published in output directory (reported by @jhayer, fix by @jfy133)
Dependencies
| Tool | Previous version | New version |
|---|---|---|
| GTDBTk | 2.3.2 | 2.4.0 |
Deprecated
Fixed
- #648 - Fix sample ID/assembly ID check failure when no IDs match (reported by @zackhenny, fix by @prototaxites)
- #646 - GTDB-Tk directory input now creates a value channel so it runs for all entries to the process and not just the first (reported by @amizeranschi, fix by @prototaxites).
- #639 - Fix pipeline failure when a sample produces only a single bin (fix by @d-callan)
- #651 - Replace base container for bash only modules to reduce number of containers in pipeline (reported and fixed by @harper357)
- #652 - Fix documentation typo in using user-defined assembly parameters (reported and fixed by @amizeranschi)
- #653 - Fix overwriting of per-bin ‘raw’ GUNC RUN output files (multi-bin summary tables not affected) (reported by @zackhenny and fixed by @jfy133)
Changed
- #633 - Changed BUSCO to use offline mode when the database is specified by the user (reported by @ChristophKnapp and many others, fix by @jfy133)
- #632 - Use default NanoLyse log of just removed reads rather than custom (by @jfy133)
Fixed
- #630 - Fix CONCOCT empty bins killing the pipeline, and allow for true multithreading again (removing OPENBLAS loop) (reported by @maxibor, fix by @maxibor and @jfy133)
Dependencies
| Tool | Previous version | New version |
|---|---|---|
| Porechop | 0.2.3_seqan2.1.1 | 0.2.4 |
| NanoPlot | 1.26.3 | 1.41.6 |
| NanoLyse | 1.1.0 | 1.2.0 |
[!CAUTION] This release contains a potentially ‘breaking change’ for some users. The pipeline no longer directly accepts FASTQ files via
--input. You must use a samplesheet and specify the FASTQs there.
Added
Changed
- #599 - Update to nf-core v2.13.1
TEMPLATE(by @jfy133) - #614 - Update to nf-core v2.14.1
TEMPLATE(by @jfy133)
Fixed
- #606 - Prevent pipeline crash when premade mashdb given to or no alignments found with GTDB-TK_CLASSIFYWF (reported by @cedwardson4, fix by @jfy133)
Deprecated
Changed
- #581 - Added explicit licence text to headers of all custom scripts (reported by @FriederikeHanssen and @maxibor, fix by @jfy133)
Fixed
Added
- #562 - Add CAT summary into the global bin_summary (by @maxibor)
- #565 - Add warning of empty GTDB-TK results if no contigs pass completeness filter (by @jfy133 and @maxibor)
Changed
- #563 Update to nf-core v2.12
TEMPLATE(by @CarsonJM) - #566 - More logical ordering of MultiQC sections (assembly and bin sections go together respectively) (fix by @jfy133)
Fixed
- #548 - Fixes to (reported by @maxibor, @PPpissar, @muniheart, @llborcard, fix by @maxibor)
- GTDBK-TK execution
- CAT/QUAST/DEPTH bin summary file name collisions
- BUSCO database parsing
- Correct CAT name files
- #558 - Fix bug in run merging when dealing with single end data (reported by @roberta-davidson, fix by @jfy133)
Fixed
- #489 - Fix file name collision clashes for CHECKM, CAT, GTDBTK, and QUAST (reported by @tillenglert and @maxibor, fix by @maxibor)
- #533 - Fix glob pattern for publishing MetaBAT2 bins in results (reported by @patriciatran, fix by @jfy133)
- #535 - Fix input validation pattern to again allow direct FASTQ input (reported by @lennijusten, @emnilsson, fix by @jfy133, @d4straub, @mahesh-panchal, @nvnieuwk)
Dependencies
| Tool | Previous version | New version |
|---|---|---|
| CAT | 4.6 | 5.2.3 |
Deprecated
Added
- #504 - New parameters
--busco_db,--kraken2_db, and--centrifuge_dbnow support directory input of a pre-uncompressed database archive directory (by @gregorysprenger).
Changed
- #511 - Update to nf-core 2.10
TEMPLATE(by @jfy133) - #504 -
--save_busco_referenceis now replaced by--save_busco_db(by @gregorysprenger).
Fixed
- #514 - Fix missing CONCOCT files in downstream output (reported by @maxibor, fix by @jfy133)
- #515 - Fix overwriting of GUNC output directories when running with domain classification (reported by @maxibor, fix by @jfy133)
- #516 - Fix edge-case bug where MEGAHIT re-uses previous work directory on resume and fails (reported by @husensofteng, fix by @prototaxites)
- #520 - Fix missing Tiara output files (fix by @jfy133)
- #522 - Fix ‘nulls’ in depth plot PNG files (fix by @jfy133)
Deprecated
- #504 -
--busco_reference,--busco_download_path,--save_busco_referenceparameters have been deprecated and replaced with new parameters (by @gregorysprenger).
Added
- #497 - Adds support for pointing at a local db for krona, using the parameter
--krona_db(by @willros). - #395 - Adds support for fast domain-level classification of bins using Tiara, to allow bins to be separated into eukaryotic and prokaryotic-specific processes.
- #422 - Adds support for normalization of read depth with BBNorm (added by @erikrikarddaniel and @fabianegli)
- #439 - Adds ability to enter the pipeline at the binning stage by providing a CSV of pre-computed assemblies (by @prototaxites)
- #459 - Adds ability to skip damage correction step in the ancient DNA workflow and just run pyDamage (by @jfy133)
- #364 - Adds geNomad nf-core modules for identifying viruses in assemblies (by @PhilPalmer and @CarsonJM)
- #481 - Adds MetaEuk for annotation of eukaryotic MAGs, and MMSeqs2 to enable downloading databases for MetaEuk (by @prototaxites)
- #437 -
--gtdb_dbalso now supports directory input of an pre-uncompressed GTDB archive directory (reported by @alneberg, fix by @jfy133) - #494 - Adds support for saving the BAM files from Bowtie2 mapping of input reads back to assembly (fix by @jfy133)
Changed
- #428 #467 - Update to nf-core 2.8, 2.9
TEMPLATE(by @jfy133) - #429 - Replaced hardcoded CheckM database auto-download URL to a parameter (reported by @erikrikarddaniel, fix by @jfy133)
- #441 - Deactivated CONCOCT in AWS ‘full test’ due to very long runtime (fix by @jfy133).
- #442 - Remove warning when BUSCO finds no genes in bins, as this can be expected in some datasets (reported by @Lumimar, fix by @jfy133).
- #444 - Moved BUSCO bash code to script (by @jfy133)
- #477 -
--gtdbparameter is split into--skip_gtdbtkand--gtdb_dbto allow finer control over GTDB database retrieval (fix by @jfy133) - #500 - Temporarily disabled downstream processing of both refined and raw bins due to bug (by @jfy133)
Fixed
- #496 - Fix help text for paramters
--bowtie2_mode,spades_optionsandmegahit_options(by @willros) - #400 - Fix duplicated Zenodo badge in README (by @jfy133)
- #406 - Fix CheckM database always downloading, regardless if CheckM is selected (by @jfy133)
- #419 - Fix bug with busco_clean parameter, where it is always activated (by @prototaxites)
- #426 - Fixed typo in help text for parameters
--host_genomeand--host_fasta(by @tillenglert) - #434 - Fix location of samplesheet for AWS full tests (reported by @Lfulcrum, fix by @jfy133)
- #438 - Fixed version inconsistency between conda and containers for GTDBTK_CLASSIFYWF (by @jfy133)
- #439 - Fix bug in assembly input (by @prototaxites)
- #447 - Remove
default: Nonefrom parameter schema (by @drpatelh) - #449 - Fix results file overwriting in Ancient DNA workflow (reported by @alexhbnr, fix by @jfy133)
- #470 - Fix binning preparation from running even when binning was requested to be skipped (reported by @prototaxites, fix by @jfy133)
- #480 - Improved
-resumereliability through better meta map preservation (reported by @prototaxites, fix by @jfy133) - #493 - Update
METABAT2nf-core module so that it reduced the number of unnecessary file moves, enabling virtual filesystems (fix by @adamrtalbot) - #500 - Fix MaxBin2 bins not being saved in results directly properly (reported by @Perugolate, fix by @jfy133)
Dependencies
| Tool | Previous version | New version |
|---|---|---|
| BCFtools | 1.16 | 1.17 |
| SAMtools | 1.16.1 | 1.17 |
| fastp | 0.23.2 | 0.23.4 |
| MultiQC | 1.14 | 1.15 |
Fixed
Fixed
- #458 - Correct the major issue in ancient DNA workflow of binning refinement being performed on uncorrected contigs instead of aDNA consensus recalled contigs (issue #449)
- #451 - Fix results file overwriting in Ancient DNA workflow (reported by @alexhbnr, fix by @jfy133, and integrated by @maxibor in #458 )
Added
- #350 - Adds support for CheckM as alternative bin completeness and QC tool (added by @jfy133 and @skrakau)
- #353 - Added the busco_clean parameter to optionally clean each BUSCO directory after a successful (by @prototaxites)
- #361 - Added the skip_clipping parameter to skip read preprocessing with fastp or adapterremoval. Running the pipeline with skip_clipping, keep_phix and without specifying a host genome or fasta file skips the FASTQC_TRIMMED process (by @prototaxites)
- #365 - Added CONCOCT as an additional (optional) binning tool (by @jfy133)
- #366 - Added CAT_SUMMARISE process and cat_official_taxonomy parameter (by @prototaxites)
- #372 - Allow CAT_DB to take an extracted database as well as a tar.gz file (by @prototaxites).
- #380 - Added support for saving processed reads (clipped, host removed etc.) to results directory (by @jfy133)
- #394 - Added GUNC for additional chimeric bin/contamination QC (added by @jfy133)
Changed
- #340,#368,#373 - Update to nf-core 2.7.2
TEMPLATE(by @jfy133, @d4straub, @skrakau) - #373 - Removed parameter
--enable_conda. Updated local modules to new conda syntax and updated nf-core modules (by @skrakau) - #385 - CAT also now runs on unbinned contigs as well as binned contigs (added by @jfy133)
- #399 - Removed undocumented BUSCO_PLOT process (previously generated
*.busco_figure.pngplots unsuitable for metagenomics) (by @skrakau).
Fixed
- #345 - Bowtie2 mode changed to global alignment for ancient DNA mode (
--very-sensitivemode) to prevent soft clipping at the end of reads when running in local mode. (by @maxibor) - #349 - Add a warning that pipeline will reset minimum contig size to 1500 specifically MetaBAT2 process, if a user supplies below this threshold. (by @jfy133)
- #352 - Escape the case in the BUSCO module that BUSCO can just detect a root lineage but is not able to find any marker genes (by @alexhbnr)
- #355 - Include error code 21 for retrying with higher memory for SPAdes and hybridSPAdes (by @mglubber)
Dependencies
| Tool | Previous version | New version |
|---|---|---|
| BUSCO | 5.1.0 | 5.4.3 |
| BCFtools | 1.14 | 1.16 |
| Freebayes | 1.3.5 | 1.3.6 |
| SAMtools | 1.15 | 1.16.1 |
- Fix too many symbolic links issue in local convert_depths module (reported by @ChristophKnapp and fixed by @apeltzer, @jfy133)
- Each sample now gets it’s own result directory for PyDamage analysis and filter (reported and fixed by @maxibor)
See full CHANGELOG for more information
- Restructure binning subworkflow in preparation for aDNA workflow and extended binning
- Add ancient DNA subworkflow
- Add MaxBin2 as second contig binning tool
- Add AdapterRemoval2 as an alternative read trimmer
- Add DAS Tool for bin refinement
- Activate pipeline-specific institutional nf-core/configs
- Add extra results folder
GenomeBinning/depths/contigsfor[assembler]-[sample/group]-depth.txt.gz, andGenomeBinning/depths/binsforbin_depths_summary.tsvand[assembler]-[binner]-[sample/group]-binDepths.heatmap.png - Updated some software: fastp 0.20.1 > 0.23.2, MultiQC 1.9 > 1.12
- Fix several bugs
See full CHANGELOG for more information
Contributors
- Add bin gene annotation with PROKKA
- Add prokaryotic gene finding with prodigal for each metagenome
- Add pipeline preprint information
- Updated some software: MultiQC 1.9 > 1.11, MEGAHIT 1.2.7 > 1.2.9, SPAdes 3.13.1 > 3.15.3
- Fix several bugs
See full CHANGELOG, for more information.
- Add bin abundance estimation based on median sequencing depths of corresponding contigs
- Add generation of heat maps with bin abundances across samples
- Output predicted genes for bins
- Fix handling of
BUSCOoutput when run in auto lineage selection mode
See full CHANGELOG, for more information.
- Switch to Nextflow
DSL2 - Changed
--inputfile format fromTSVtoCSVformat, requires header now - Add
BUSCOautomated lineage selection functionality - Add taxonomic bin classification with
GTDB-Tk - Add process for
CATdatabase creation as an alternative to using pre-built databases - Allow different folder structures for
Kraken2databases - Requires nextflow version
>= 21.04.0
See full CHANGELOG, for more information.
- Manifest file has to be handed over via
--inputparameter now - Changed format of manifest input file: requires a ‘.tsv’ suffix and additionally contains group ID
- Add
--coassemble_groupparameter to allow group-wise co-assembly - Add
--binning_map_modeparameter allowing different mapping strategies to compute co-abundances used for binning - TSV
--inputfile allows now also entries containing only short reads
See full CHANGELOG, for more information.
- Fixed processing of
--inputparameter
See full CHANGELOG, for more information.
- Add full-size test
- Add worfklow overview figure to
README - Fix
seaborntov0.10.1to avoidnanoploterror
See full CHANGELOG, for more information.
- Add host read removal with
Bowtie 2 - Add separate
MultiQCsection forFastQCafter preprocessing - Add
MetaBAT2RNG seed parameter--metabat_rng_seedand set the default to 1 which ensures reproducible binning results - Add parameters
--megahit_fix_cpu_1,--spades_fix_cpusand--spadeshybrid_fix_cpusto ensure reproducible results from assembly tools - Fixed channel joining for multiple samples causing
MetaBAT2error - Compress assembly files
- Fix BUSCO errors
See full CHANGELOG, for more information.