This is a limited proof of concept to search for research data, not a production system.

Search the MIT Libraries

Title: nf-core/viralrecon: nf-core/viralrecon v1.0.0 - Mercury Bat

Type Software Harshil Patel, Sarai Varona, Sara Monzón, Jose Espinosa-Carrasco, Michael L Heuer, Gisela Gabernet, MiguelJulia, Stephen Kelly, Katrin Sameith, Maxime Garcia, jcurado (2020): nf-core/viralrecon: nf-core/viralrecon v1.0.0 - Mercury Bat. Zenodo. Software. https://zenodo.org/record/3901629

Authors: Harshil Patel (The Francis Crick Institute) ; Sarai Varona ; Sara Monzón (BU-ISCIII) ; Jose Espinosa-Carrasco ; Michael L Heuer (UC Berkeley AMPLab/RISE Lab) ; Gisela Gabernet (@qbicsoftware) ; MiguelJulia ; Stephen Kelly ; Katrin Sameith (DRESDEN-concept Genome Center) ; Maxime Garcia (@SciLifeLab | Karolinska Institutet) ; jcurado (@Flomics) ;

Links

Summary

[1.0.0] - 2020-06-01

Initial release of nf-core/viralrecon, created with the nf-core template.

This pipeline is a re-implementation of the SARS_Cov2_consensus-nf and SARS_Cov2_assembly-nf pipelines initially developed by Sarai Varona and Sara Monzon from BU-ISCIII. Porting both of these pipelines to nf-core was an international collaboration between numerous contributors and developers, led by Harshil Patel from the The Bioinformatics & Biostatistics Group at The Francis Crick Institute, London. We appreciated the need to have a portable, reproducible and scalable pipeline for the analysis of COVID-19 sequencing samples and so the Avengers Assembled!

Pipeline summary

Download samples via SRA, ENA or GEO ids (ENA FTP, parallel-fastq-dump; if required) Merge re-sequenced FastQ files (cat; if required) Read QC (FastQC) Adapter trimming (fastp) Variant calling Read alignment (Bowtie 2) Sort and index alignments (SAMtools) Primer sequence removal (iVar; amplicon data only) Duplicate read marking (picard; removal optional) Alignment-level QC (picard, SAMtools) Choice of multiple variant calling and consensus sequence generation routes (VarScan 2, BCFTools, BEDTools || iVar variants and consensus || BCFTools, BEDTools) Variant annotation (SnpEff, SnpSift) Consensus assessment report (QUAST) De novo assembly Primer trimming (Cutadapt; amplicon data only) Removal of host reads (Kraken 2) Choice of multiple assembly tools (SPAdes || metaSPAdes || Unicycler || minia) Blast to reference genome (blastn) Contiguate assembly (ABACAS) Assembly report (PlasmidID) Assembly assessment report (QUAST) Call variants relative to reference (Minimap2, seqwish, vg, Bandage) Variant annotation (SnpEff, SnpSift) Present QC and visualisation for raw read, alignment, assembly and variant calling results (MultiQC)

More information

  • DOI: 10.5281/zenodo.3901629

Dates

  • Publication date: 2020
  • Issued: June 01, 2020

Rights

  • info:eu-repo/semantics/openAccess Open Access

Much of the data past this point we don't have good examples of yet. Please share in #rdi slack if you have good examples for anything that appears below. Thanks!

Format

electronic resource

Relateditems

DescriptionItem typeRelationshipUri
IsSupplementTohttps://github.com/nf-core/viralrecon/tree/1.0.0
Citeshttps://github.com/BU-ISCIII/SARS_Cov2_consensus-nf
Citeshttps://github.com/BU-ISCIII/SARS_Cov2_assembly-nf
IsVersionOfhttps://doi.org/10.5281/zenodo.3901628
IsPartOfhttps://zenodo.org/communities/covid-19
IsPartOfhttps://zenodo.org/communities/zenodo