nf-core/differentialabundance: Contributing Guidelines
Hi there! Many thanks for taking an interest in improving nf-core/differentialabundance.
We try to manage the required tasks for nf-core/differentialabundance using GitHub issues, you probably came to this page when creating one. Please use the pre-filled template to save time.
However, don’t be put off by this template - other more general issues and suggestions are welcome! Contributions to the code are even more welcome ;)
If you need help using or modifying nf-core/differentialabundance then the best place to ask is on the nf-core Slack #differentialabundance channel (join our Slack here).
Contribution workflow
If you’d like to write some code for nf-core/differentialabundance, the standard workflow is as follows:
- Check that there isn’t already an issue about your idea in the nf-core/differentialabundance issues to avoid duplicating work. If there isn’t one already, please create one so that others know you’re working on this
- Fork the nf-core/differentialabundance repository to your GitHub account
- Make the necessary changes / additions within your forked repository following Pipeline conventions
- Use nf-core pipelines schema buildand add any new parameters to the pipeline JSON schema (requires nf-core tools >= 1.10).
- Submit a Pull Request against the devbranch and wait for the code to be reviewed and merged
If you’re not used to this workflow with git, you can start with some docs from GitHub or even their excellent git resources.
Tests
You have the option to test your changes locally by running the pipeline. For receiving warnings about process selectors and other debug information, it is recommended to use the debug profile. Execute all the tests with the following command:
nf-test test --profile debug,test,docker --verboseWhen you create a pull request with changes, GitHub Actions will run automatic tests. Typically, pull-requests are only fully reviewed when these tests are passing, though of course we can help out before then.
There are typically two types of tests that run:
Lint tests
nf-core has a set of guidelines which all pipelines must adhere to.
To enforce these and ensure that all pipelines stay in sync, we have developed a helper tool which runs checks on the pipeline code. This is in the nf-core/tools repository and once installed can be run locally with the nf-core pipelines lint <pipeline-directory> command.
If any failures or warnings are encountered, please follow the listed URL for more documentation.
Pipeline tests
Each nf-core pipeline should be set up with a minimal set of test-data.
GitHub Actions then runs the pipeline on this data to ensure that it exits successfully.
If there are any failures then the automated tests fail.
These tests are run both with the latest available version of Nextflow and also the minimum required version that is stated in the pipeline code.
Patch
:warning: Only in the unlikely and regretful event of a release happening with a bug.
- On your own fork, make a new branch patchbased onupstream/mainorupstream/master.
- Fix the bug, and bump version (X.Y.Z+1).
- Open a pull-request from patchtomain/masterwith the changes.
Getting help
For further information/help, please consult the nf-core/differentialabundance documentation and don’t hesitate to get in touch on the nf-core Slack #differentialabundance channel (join our Slack here).
Pipeline contribution conventions
To make the nf-core/differentialabundance code and processing logic more understandable for new contributors and to ensure quality, we semi-standardise the way the code and other contributions are written.
Adding a new step
If you wish to contribute a new step, please use the following coding standards:
- Define the corresponding input channel into your new process from the expected previous process channel.
- Write the process block (see below).
- Define the output channel if needed (see below).
- Add any new parameters to nextflow.configwith a default (see below).
- Add any new parameters to nextflow_schema.jsonwith help text (via thenf-core pipelines schema buildtool).
- Add sanity checks and validation for all relevant parameters.
- Perform local tests to validate that the new code works as expected.
- If applicable, add a new test command in .github/workflow/ci.yml.
- Update MultiQC config assets/multiqc_config.ymlso relevant suffixes, file name clean up and module plots are in the appropriate order. If applicable, add a MultiQC module.
- Add a description of the output files and if relevant any appropriate images from the MultiQC report to docs/output.md.
This pipeline allows the iteration of multiple configs through one run. This is achieved by using paramsheet, which is a compact file with multiple nextflow configs, being each row one config. Each config is parsed into the meta of the workflow channels. In this way, one can run the same modules multiple times with different configuration sets. In order to ensure the correct behaviour for this, and proper resume of the pipeline, additional steps are needed when adding a new module:
- Ensure to run prepareModuleInputbefore calling the new module. This function will parse the channel meta to only include the parameters relevant for the module.
- Ensure to run prepareModuleOutputafter calling the new module. This function will extend the meta to include the entire parameters set.
For more information of how to use these two functions, please check the pipeline utils subworkflow at subworkflows/local/utils_nfcore_differentialabundance.nf.
Depending on the case, you may need to modify the  getRelevantParams function called by prepareModuleInput.
Default values
Parameters should be initialised / defined with default values within the params scope in nextflow.config.
Once there, use nf-core pipelines schema build to add to nextflow_schema.json.
Default processes resource requirements
Sensible defaults for process resource requirements (CPUs / memory / time) for a process should be defined in conf/base.config. These should generally be specified generic with withLabel: selectors so they can be shared across multiple processes/steps of the pipeline. A nf-core standard set of labels that should be followed where possible can be seen in the nf-core pipeline template, which has the default process as a single core-process, and then different levels of multi-core configurations for increasingly large memory requirements defined with standardised labels.
The process resources can be passed on to the tool dynamically within the process with the ${task.cpus} and ${task.memory} variables in the script: block.
Naming schemes
Please use the following naming schemes, to make it easy to understand what is going where.
- initial process channel: ch_output_from_<process>
- intermediate and terminal channels: ch_<previousprocess>_for_<nextprocess>
Nextflow version bumping
If you are using a new feature from core Nextflow, you may bump the minimum required version of nextflow in the pipeline with: nf-core pipelines bump-version --nextflow . [min-nf-version]
Images and figures
For overview images and other documents we follow the nf-core style guidelines and examples.
GitHub Codespaces
This repo includes a devcontainer configuration which will create a GitHub Codespaces for Nextflow development! This is an online developer environment that runs in your browser, complete with VSCode and a terminal.
To get started:
- Open the repo in Codespaces
- Tools installed
- nf-core
- Nextflow
 
Devcontainer specs: