| Literature DB >> 33815688 |
Laura Ciuffreda1, Héctor Rodríguez-Pérez1, Carlos Flores1,2,3,4.
Abstract
Since its introduction, nanopore sequencing has enhanced our ability to study complex microbial samples through the possibility to sequence long reads in real time using inexpensive and portable technologies. The use of long reads has allowed to address several previously unsolved issues in the field, such as the resolution of complex genomic structures, and facilitated the access to metagenome assembled genomes (MAGs). Furthermore, the low cost and portability of platforms together with the development of rapid protocols and analysis pipelines have featured nanopore technology as an attractive and ever-growing tool for real-time in-field sequencing for environmental microbial analysis. This review provides an up-to-date summary of the experimental protocols and bioinformatic tools for the study of microbial communities using nanopore sequencing, highlighting the most important and recent research in the field with a major focus on infectious diseases. An overview of the main approaches including targeted and shotgun approaches, metatranscriptomics, epigenomics, and epitranscriptomics is provided, together with an outlook to the major challenges and perspectives over the use of this technology for microbial studies.Entities:
Keywords: AMR, antimicrobial resistance; Bioinformatics; HMW, high molecular weight; LCA, lowest common ancestor; MAGs, Metagenome assembled genomes; Metagenomics; Metatranscriptomics; NGS, next-generation sequencing; Nanopore sequencing; ONT, Oxford Nanopore Technologies; PAIs, pathogenicity islands; SARS-CoV-2, severe acute respiratory syndrome coronavirus 2; Targeted sequencing; UMAP, Uniform Manifold Approximation and Projection
Year: 2021 PMID: 33815688 PMCID: PMC7985215 DOI: 10.1016/j.csbj.2021.02.020
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1General workflow for generation and analysis of data for targeted and shotgun metagenomics studies. Created with biorender.com. QC: quality control.
Summary of available ONT library preparation kits for sequencing of microbial communities.
| ONT library preparation strategy | Input ng recommendation | Preparation time | Multiplexing | Application |
|---|---|---|---|---|
| 16S Rapid Barcoding Kit | < 10 ng gDNA | 10 min + PCR | Up to 12 or 24 samples | Targeted 16S rRNA gene sequencing |
| Rapid Sequencing Kit | ≥ 400 ng HMW DNA | 10 min | Up to 12 samples | Metagenomics and epigenomics, amplification-free |
| Rapid PCR Sequencing Kit | ≤ 10 ng gDNA | 15 min + PCR | Up to 12 samples | Metagenomics, requires amplification |
| Ligation Sequencing Kit | ≥ 1000 ng dsDNA | 60 min | Up to 96 samples | Metagenomics and epigenomics, amplification-free, high-throughput |
| PCR Sequencing Kit | ≤ 100 ng gDNA | 60 min + PCR | Up to 12 samples | Metagenomics, requires amplification, high-throughput |
| Direct cDNA Sequencing Kit | 100 ng poly-A+ RNA | 270 min | Up to 24 samples | Metatrascriptomics, requires retrotranscription |
| PCR cDNA Sequencing Kit | 1 ng poly-A+ or 50 ng total RNA | 165 min | Up to 12 samples | Metatranscriptomics, requires retrotranscription and amplification |
| Direct RNA Sequencing Kit | 500 ng poly-A+ RNA | 105 min | None | Metatrascriptomics and epitranscriptomics, retrotranscription- and amplification-free |
HMW: high-molecular weight.
Main long-read bioinformatics tools for targeted and shotgun approaches.
| Type | Reference | Application | Brief description |
|---|---|---|---|
| BLAST, MEGABLAST | Targeted; Shotgun | Gold-standard alignment tools for classification of nucleotide and protein sequences. Feature web-based version and multiple implementations for specific purposes. | |
| minimap2 | Targeted; Shotgun | Versatile tool for fast read alignments against large reference databases. | |
| Kraken, Kraken2 | Targeted; Shotgun | Taxonomic classification tool implementing an accurate and fast k-mer matching. | |
| KrakenUniq | Shotgun | Classifier that combines Kraken classification tool with the assessment of the coverage of unique k-mers for better recall and precision. | |
| Bracken | Targeted; Shotgun | Relative abundance estimation tool for single-level abundance using Kraken read classification output. | |
| Metamaps | Shotgun | Read assignment and sample composition estimation for nanopore metagenomic datasets. | |
| Centrifuge | Targeted; Shotgun | Read classification based on the Burrows-Wheeler transform (BWT) and the Ferragina-Manzini (FM) index that performs fast classification relying on small pre-computed index databases. | |
| Mash | Targeted; Shotgun | Fast genome and metagenome distance estimation tool that computes distances between sequences using the MinHash algorithm. | |
| Canu | Shotgun | Assembly pipeline for long-reads that compute and process read overlaps for the generation of contigs and draft assemblies. | |
| miniasm | Shotgun | Fast OLC-based assembler for long reads that builds assembly graphs from all-vs-all read mappings. | |
| wtdbg2 | Shotgun | De-novo sequence assembler for uncorrected long-reads based on Fuzzy Bruijn graphs to compute contigs. | |
| OPERA-MS | Shotgun | Hybrid metagenomic assembler that first performs a short-read assembly and then maps short and long reads to resolve contiguity of contigs. | |
| MetaFlye | Shotgun | Metagenomic assembler from the Flye package featuring repeat graphs to compute high-quality metagenome assemblies. | |
| MetaSPAdes | Shotgun | Metagenomic assembly module from SPAdes assembler that features a hybrid assembly option. | |
| Nanopolish | Targeted; Shotgun | Signal-level analysis tool with modules that performs sequence polishing, base modification detection and variant calling. | |
| Medaka | Targeted; Shotgun | Neural network-based sequence correction and variant calling tool. | |
| MEGAN-LR | Shotgun | Long-read implementation of the MEGAN workflow. Features taxonomic and functional analysis. | |
| NanoCLUST | Targeted | Analysis pipeline for UMAP-based classification of amplicon-based full-length 16S rRNA nanopore reads. | |
| Reticulatus | Shotgun | Snakemake-based pipeline for assembly and polishing of long genomes from long nanopore reads. | |
| MUFFIN | Shotgun | Metagenomics workflow for hybrid assembly, differential coverage binning, transcriptomics and pathway analysis. | |
| NanoSPC | Shotgun | Metagenomic analysis pipeline that includes viral and bacterial pathogen identification, genome assembly and variant calling. | |
| BusyBee | Shotgun | Web-based metagenomic analysis pipeline for long-reads and contigs that features taxonomic and functional annotation of AMR elements along with a comprehensive visualization of results. | |
Summary of the most widely used reference databases for metataxonomics and metagenomics analysis.
| Database name | Reference | Description |
|---|---|---|
| GreenGenes | 16S rRNA database from Genbank sequences, manually curated and modified by the user community. | |
| SILVA | Small and large rRNA subunits database including 16S rRNA sequences from the European Nucleotide Archive. | |
| The Ribosomal Database Project (RDP) | 16S rRNA taxonomically annotated sequence collection from the INSDC database. | |
| RefSeqTargeted Loci Project | ( | BLAST specific marker gene databases for Bacteria (16S/23S) and Fungi (28S/18S) extracted and curated from GenBank sequences. |
| nt/nr | Default database for BLAST sequence searches including RefSeq RNA and GenBank sequences. | |
| RefSeq | Non-redundant and NCBI curated and annotated database based on Genbank sequences. | |
| GenBank | Main NCBI nucleotide database with the largest complete and draft microbial genomes sequence collection. | |
| Kyoto Encyclopedia of Genes and Genomes (KEGG) | Manually curated set of 18 databases for annotating cellular and organism-level functions from nucleotide sequences. | |
| Integrated reference catalog of the human gut microbiome (IGC) | Gut-specific annotated microbial genes from KEGG functional databases. | |
| Comprehensive Antibiotic Resistance Database (CARD) | Bioinformatic resources and database for the annotation of antimicrobial resistance genes (AMR) and mutations from genomic sequences. | |
| DeepARG-DB | Antibiotic resistance genes database generated by a deep-learning prediction algorithm trained with ARG from other sequence collections. | |
| MEGARes | Hand-curated database containing AMR genes optimized for use with high-throughput sequencing data. | |
Summary of main successful clinical applications of NS.
| Clinical application of NS | Approach used | Reference |
|---|---|---|
| Rapid pathogen identification in clinical samples | 16S rRNA targeted | |
| RNA sequencing | ||
| Rapid identification of pathogens and AMR genes | Shotgun metagenomics | |
| Surveillance of pathogens and AMR in hospital settings | Shotgun metagenomics | |
| Genomic surveillance for viral outbreaks | RNA sequencing |