Literature DB >> 32768390

STROBE-metagenomics: a STROBE extension statement to guide the reporting of metagenomics studies.

Tehmina Bharucha¹, Clarissa Oeser², Francois Balloux³, Julianne R Brown⁴, Ellen C Carbo⁵, Andre Charlett⁶, Charles Y Chiu⁷, Eric C J Claas⁵, Marcus C de Goffau⁸, Jutte J C de Vries⁵, Marc Eloit⁹, Susan Hopkins¹⁰, Jim F Huggett¹¹, Duncan MacCannell¹², Sofia Morfopoulou¹³, Avindra Nath¹⁴, Denise M O'Sullivan¹⁵, Lauren B Reoma¹⁴, Liam P Shaw¹⁶, Igor Sidorov⁵, Patricia J Simner¹⁷, Le Van Tan¹⁸, Emma C Thomson¹⁹, Lucy van Dorp³, Michael R Wilson²⁰, Judith Breuer²¹, Nigel Field².

Abstract

The term metagenomics refers to the use of sequencing methods to simultaneously identify genomic material from all organisms present in a sample, with the advantage of greater taxonomic resolution than culture or other methods. Applications include pathogen detection and discovery, species characterisation, antimicrobial resistance detection, virulence profiling, and study of the microbiome and microecological factors affecting health. However, metagenomics involves complex and multistep processes and there are important technical and methodological challenges that require careful consideration to support valid inference. We co-ordinated a multidisciplinary, international expert group to establish reporting guidelines that address specimen processing, nucleic acid extraction, sequencing platforms, bioinformatics considerations, quality assurance, limits of detection, power and sample size, confirmatory testing, causality criteria, cost, and ethical issues. The guidance recognises that metagenomics research requires pragmatism and caution in interpretation, and that this field is rapidly evolving.

Entities: Chemical Disease Gene Species

Mesh：

Year: 2020 PMID： 32768390 PMCID： PMC7406238 DOI： 10.1016/S1473-3099(20)30199-7

Source DB: PubMed Journal: Lancet Infect Dis ISSN： 1473-3099 Impact factor: 25.071

Background

The term metagenome was coined in 1998 to describe the collection of genomes from microbes present in environmental soil samples by using approaches previously used to study single genomes. The sequencing of genetic material from clinical samples has become common practice in research on clinical microorganisms. In this context, metagenomics refers to the application of sequencing methods that can identify coexistent genomic material from any organism present in patient samples (ie, microorganism and host nucleic acid), usually with the aim of pathogen identification for clinical diagnosis or research.2, 3, 4 Examples of practical applications include pathogen detection and discovery, species characterisation or subtyping, antimicrobial resistance detection, virulence profiling, and studies of the microbiome and microecological drivers of health and disease.5, 6, 7, 8, 9, 10, 11, 12 Metagenomics is also being introduced as a diagnostic tool for causal studies of clinical syndromes (such as encephalitis),13, 14 for exploring the microbiome,15, 16 and for tracking disease outbreaks.17, 18 A current example of the transformational effect of direct sequencing of clinical samples has been the application for rapid investigation and dissemination of information on severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), which causes COVID-19.11, 12 Metagenomics data are generated using high-throughput sequencing methods, also referred to as deep, next-generation, massively parallel, or shotgun sequencing. In this Review, for simplicity, we refer to all these approaches as sequencing. We also include capture probe enrichment-based sequencing methods that use nucleotide probes to increase sensitivity and targeted amplicon sequencing—eg, sequencing the 16S ribosomal ribonucleic acid (rRNA) gene to identify bacteria. Capture probe enrichment-based sequencing and targeted amplicon sequencing might not be considered true examples of metagenomics and are not the focus of our Review; however, some similar considerations about reporting of results apply. Metagenomic sequencing has advantages for pathogen identification over conventional methods, such as culture or targeted PCR, because many or most microbial species present within a sample can be detected simultaneously with high taxonomic resolution. Detailed characterisation of microbial communities and population dynamics also enables the study of ecological interactions. Furthermore, this method does not require culture techniques, and therefore can be used for microbial species that are difficult or time consuming to grow. This is particularly relevant for diagnostic applications, where routine culture is in decline.20, 21 The term metagenomics refers to the use of sequencing methods to simultaneously identify genomic material from all organisms present in a sample, with the advantage of greater taxonomic resolution than culture or other methods. Applications include pathogen detection and discovery, species characterisation, antimicrobial resistance detection, virulence profiling, and study of the microbiome and microecological factors affecting health. Metagenomics involves complex and multistep processes and there are important technical and methodological challenges that require careful consideration to support valid inference. We co-ordinated a multidisciplinary, international expert group to establish reporting guidelines that address specimen processing, nucleic acid extraction, sequencing platforms, bioinformatics considerations, quality assurance, limits of detection, power and sample size, confirmatory testing, causality criteria, cost, and ethical issues. The guidance recognises that metagenomics research requires pragmatism and caution in interpretation, and that this field is rapidly evolving. Reporting standards should support clarity, consistency, and robustness of research. However, appropriate study design for metagenomics research is not well defined and metagenomic technologies pose important technical challenges. These challenges include methodological artefacts introduced by wet laboratory methods and the effect that different computational approaches have on the analysis of multivariate and complex data. Furthermore, the ethical implications of sequencing are substantial and data privacy considerations are increasingly recognised. The multiple steps and different expertise required to generate and analyse metagenomic sequence data involves numerous decision points, which could introduce bias and affect downstream inference about the presence and abundance of microbial species in the sample. A metagenome result should therefore be interpreted as one of many possible representations of the true sample composition of a given microbiome. Understanding and reporting sources of bias and limitations to valid inference should improve protocol performance and enable metagenomic research to proceed with transparent recognition of the limitations. However, existing reporting statements for epidemiology studies, including STROBE (STrengthening the Reporting of OBservational studies in Epidemiology) and its infectious disease molecular epidemiology extension, STROME-ID (Strengthening the Reporting of Molecular Epidemiology for Infectious Diseases), do not fully address issues specific to metagenomics. For this reason, scientific journals, and their readers, might not be adequately equipped with a standardised set of guidelines to evaluate and critically appraise clinical and epidemiological studies applying metagenomics. We aimed to improve the clarity and consistency of metagenomics research reporting, ranging from clinical diagnostics to microbiome studies, with suggestions for optimal practice and recommendations for robust and accurate reporting.

Titles and abstracts

The term metagenomics should be included in the title or abstract, and the keywords of the study when these methods contribute substaintially to the results reported

Clear and concise language incorporating standardised terminology, with references if appropriate, enables the accurate indexing of published studies in recognised databases. This is crucial for easy information retrieval and knowledge dissemination. For example, a systematic literature review of studies applying metagenomics in encephalitis using medical subject headings and keyword searches for the terms sequencing or metagenomics in four databases (PubMed, Embase, Web of Science, and Cochrane) failed to identify two relevant studies that did not report the terms.25, 26 These studies were identified by experts in the field who were directly involved with the studies.

Describing methods and study design

Describe specimen collection, handling and storage processes, and nucleic acid extraction methods

Steps involved in sample collection, handling, and processing are frequently poorly reported in publications and yet they will have considerable effect on the results and reproducibility of a study and could introduce variability artefacts.27, 28, 29, 30 In particular, many studies use material banked and collected originally for other purposes. In this Review, we describe important potential sources of error and their contribution to bias. Nucleic acids, particularly RNA, are labile. Consequently, the collection methods, addition of nucleic acid stabilisers, and time to processing can affect the results obtained. To address these issues, reporting should include durations, volumes, temperatures, and methods used before, during, and after the storage of samples.32, 33 Extraction methods contribute to another major source of method-induced variation—eg, by being DNA or RNA specific, or tailored to specific organism types—so should be described. Other details of sample preparation methods should also be reported including filtration, centrifugation, DNA digestion, rRNA depletion, separation in RNA or DNA, and random amplification. Standardised protocols of sample preparation methods should also be followed, if available and appropriate, and documented clearly in the publication methods. Authors should also consider submitting to standardised protocol repositories to provide transparency in the study design and methodology.

Describe sequencing methods, including sequencing depth

Different metagenomic sequencing platforms might produce different types of reads—eg, single versus paired-end, and short (100–300 bp) versus long (>1000 bp). Sequencing platforms have different error rates, with the probability of a nucleic base being read incorrectly ranging from less than 0·01% for Illumina sequencers to 5–10% for Oxford Nanopore Technologies sequencers (current figures as of February, 2020). Additionally, sequencers often read a base incorrectly when processing samples with large homopolymer repeats, GC-rich, structurally repetitive, and other complex regions of the genome. Consequent false-positive and false-negative errors need consideration when reporting species composition. Sequencing depth refers to the number of times a particular nucleic base is represented within reads or the redundancy of coverage, and has implications for identification of low abundant transcripts and confidence in sequencing data. However, sequencing depth must be balanced according to the research question and the available resources. There are several factors that affect sequencing depth, including the sequencing platform and the sequence that is being read (eg, species diversity of the sample).37, 38, 39

Describe methods used for bioinformatics analysis

For the purposes of this statement, the term bioinformatics applies to all analysis steps involving raw sequencing data, including base calling, de-multiplexing, trimming and removal of reads (eg, reads of low quality, low complexity, adapters and indexes, or of human origin), read normalisation, alignment of sequence reads to reference databases, de-novo assembling of genomes, and taxonomic assignment of reads, assembled contigs, or both. There are multiple viable options for many of these tasks, with ongoing debate in the community about optimal methods, which can depend on the scientific question at hand. The field of metagenomics is developing rapidly and methods once considered best practice can be superseded following new analytical advances. There should be clear descriptions of the bioinformatics methods used, including, at a minimum, the software name, version, and the main commands run with values for the essential parameters or flags. It is also advisable to make data and programming code open access, whether as supplementary files or shared online—eg, via Github or Figshare. Where possible, a version-controlled container, package, or easily installable version of the complete analytical pipeline (including all dependencies and required databases) could be made available for download and review. The open source release of bioinformatics workflows should be encouraged wherever possible to improve transparency and reproducibility, and should include adequate validation datasets, meaningful documentation, and examples of expected outputs and reports (appendix pp 1–2).

Describe quality assurance methods, including internal and external quality controls

An important strength of metagenomics analyses is their ability to detect any genomic material present within one sample. However, detection applies equally to true sample material and to any contaminating nucleic acids present in a sample, which can be introduced at any stage from sample collection to processing. For example, contamination could come from the extraction kit, the so-called kitome, or at the point of specimen collection. Sampling is rarely done under completely sterile conditions, and tissues obtained from tissue banks are therefore often contaminated. Low biomass and low abundance sites (for example tumours, the brain, and fetal tissues such as the placenta) are particularly prone to the risk of misclassifying contaminants. To show attempts to ensure internal validity and reproducibility and identify potential contamination, internal controls for all extraction and sequencing processes should be reported as part of standard operating procedures.4, 27 Positive controls are usually spiked with DNA or RNA—eg, synthetic nucleic acid standards such as sequins—and negative controls are usually a blank (eg, water) sample or ideally a similar or identical matrix (tissue, body fluids, etc) that are expected to contain no microorganism genomic material based on patient factors and test results. For clinical metagenomics, formal laboratory implementation involves a system of external controls. Arranging this system of external controls is difficult; however, publicly and commercially available controls and mock community samples are now available and we recommend that their use should be reported.48, 49

Describe use of orthogonal methods to confirm pathogen identity, function, and viability

The conventional methods in microbiology for confirming the presence of a pathogen are culture or growth of the pathogen from a clinical sample and immunohistochemistry, the histological localisation of candidate species in tissue biopsies. However, traditional culture can be difficult when antibiotics have been administered before sampling or for pathogens that are slow growing, fastidious, present in low-concentration, or currently undescribed. Sequencing has high discriminative power and could have higher sensitivity than culture-based methods. For example, in a polymicrobial sample, growth can be affected by presence of other competing bacteria or by inadequate growth conditions. Metagenomics methods have consistently shown higher classification accuracy when comparing taxonomic profiles of synthetic polymicrobial samples obtained from extended quantitative culture with non-selective media. Confirmatory assays appropriate to the study setting, justification for the methods used, and a description of their limitations should be reported. For cases in which confirmatory assays are not possible (eg, because of high cost or low volume of samples) an explanation should be provided. Rigorous validation of the method used, particularly for pathogens and proficiency testing, especially in clinical laboratories should be described (appendix pp 2–3).

Describe the criteria used to assess the role of pathogens in disease aetiology

Confirming the presence of microbial DNA or RNA in association with disease is an important step in establishing a causal relationship between a microorganism and disease.51, 52 A major challenge for metagenomics research and diagnostics is distinguishing pathogens from commensals or contaminants.53, 54 Interpretation of microbiome investigations can be further complicated if a misbalance in variation and abundance of different bacteria—sometimes referred to as dysbiosis—is suspected to be the cause of the condition. It is also worth considering that the cause of some diseases might involve multiple sequential or interacting species, which can be collectively important.56, 57 Furthermore, sequencing investigations can identify novel organisms, for which the clinical significance will be unknown. These issues are particularly relevant in the investigation of the cause of CNS infections. Several criteria to establish causality have been proposed over the past century, including the incorporation of metagenomic technologies (appendix 7–9).58, 59

State the time from collection to results and cost consideration

The time from sample collection to processing (transport time), including cold-chain transportation and transit, can affect the compositional profile of microorganisms inferred from metagenomics. Overgrowth or degradation can occur during the period between collection and (cryo)storage with the result that the sequencing profile may not accurately reflect the composition of the sample at the time of collection. An extended duration of storage can result in a shift in the relative representation of bacterial taxa and substantial variability in metagenomics data. For example, faecal samples stored for longer than 3 months at −80°C experience selective loss of Bacteroides spp.6, 60, 61 If the sample is obtained post mortem, it is essential to report the time from death to sample acquisition given extravasation of gut bacteria into the bloodstream that can complicate interpretation of metagenomic data. For some applications, it might be relevant to report the overall turnaround time of the bioinformatic analyses—ie, including computational time for bioinformatics analysis. For example, Oxford Nanopore technology may be deployed in the field or at point of need, allowing sequencing to be done rapidly in near real-time; still, actionable results are also dependent on the time required for computational analysis.62, 62 The turnaround time of bioinformatic analyses is crucial in the context of clinical applications, when metagenomics is used to help to guide or tailor patient treatment. Variables such as sequencing run time and total computational analysis time (with system specifications—eg, number of cores and amount of memory used) should be stated clearly, as should the sequencing depth.

Setting

State whether sample collection was retrospective or prospective

As described in the STAndards for Reporting of Diagnostic accuracy (STARD) guidelines, clarity is needed regarding the sequence of events in diagnostic testing to ensure that sources of bias are addressed. The analyte can degrade if there is a long time in between sample collection and the metagenomics assay. Retrospective sampling might also lead to bias in the samples tested. For instance, when comparing studies of unidentified encephalitis, samples retrospectively selected for metagenomics might be those that are difficult to diagnose (eg, with a low titre) or taken at later timepoints in the course of infection, and therefore more likely to be non-infectious.

Participants

Consider factors influencing microbiota compositions when selecting participants

Most diagnostic and public health laboratories do not yet use metagenomic technologies routinely. As such, patients included in metagenomics studies are often from tertiary referral or specialist centres, which are unlikely to be representative of the wider population, as discussed in STROBE and STROME-ID.22, 23 This limitation can introduce challenges for appropriate selection of controls for case-control studies and for studies assessing the strength of disease associations. Species composition of human microbiomes are affected by various host factors, including age, sex, behaviour (eg, diet and lifestyle), and environment.67, 68 Exposure to pharmacological substances can also profoundly influence microbiome composition. For example, a single standard course of antibiotics has been shown to alter species composition of the gut and oral microbiomes for over a year.69, 70 Matching of cases and controls is particularly challenging for metagenomics studies given the broad range of microbes considered. Metagenomics studies should aim to minimise and statistically control for host confounders or, at a minimum, list those confounders that might affect interpretation of results.

Bias

Bias is a source of error that remains constant with replication affecting trueness; it is separate to random error, which affects the precision of an experiment. Together, these sources of error contribute to measurement uncertainty that, when conducting metagenomics sequencing, has many potential sources (figure 1 ). Replication, including replication of the whole process, provides a means to estimate random error, which can vary when using different sequencing strategies. Adherence to strictly described laboratory protocols can improve random error and reproducibility, but it cannot be used alone to remove bias.

Figure 1

Sources of uncertainty diagram highlighting potential contributing sources

For simplicity, this figure considers the sequencing of DNA from an environment and does not consider the process beyond the data output from the sequencer. The arrows pointing towards the central black arrow show the experimental process from left to right and the sources of variability that could contribute uncertainty. Conceptually it is clear how some of these factors contribute to systematic effects (bias). However, in addition these factors also contribute to the random error (variance) that will influence the precision of a potential finding. QC=quality control.

Sources of uncertainty diagram highlighting potential contributing sources For simplicity, this figure considers the sequencing of DNA from an environment and does not consider the process beyond the data output from the sequencer. The arrows pointing towards the central black arrow show the experimental process from left to right and the sources of variability that could contribute uncertainty. Conceptually it is clear how some of these factors contribute to systematic effects (bias). However, in addition these factors also contribute to the random error (variance) that will influence the precision of a potential finding. QC=quality control.

Address potential sources of bias (sampling, transport, storage, library preparation, and sequencing)

Bias can occur at each step of a diagnostic sequencing pipeline (panel 1 ) and is more difficult to evaluate than random error. For metagenomics studies, microbiological contamination of samples can introduce bias. Experimental bias that is caused at different stages of a metagenomics experiment is more challenging to control for than selection bias or contamination. The fact that the microbiome is composed of many different microorganisms means that a given protocol could lead to certain groups being over-represented in the processed samples. For example, enrichment protocols can introduce bias for pathogen detection. Capture probe-targeted sequencing will limit detection to targeted sequences, and 16S rRNA gene sequencing has limitations with regard to the level of taxonomic classification. This precise form of bias does not exist in untargeted metagenomics; however, other experimental bias can occur at different protocol stages, including during sampling, nucleic acid extraction, or post-extraction steps. Studies using 16S should consider that different primers amplify different bacterial families with varying degrees of success because of mismatches, resulting in potential bias in abundance and diversity metrics, which cannot be completely corrected bioinformatically. Specimen collection methods Collection without a cold chain, or nucleic acid stabilising agents, can cause nucleic acid degradation and potential false-negative results or overgrowth of selected organisms, which leads to misinterpretation of abundance. Multiple freeze-thaw cycles can also cause nucleic acid degradation. Nucleic acid extraction method The absence of a bead-beating step could make the detection of some bacteria difficult (ie, bacteria do not lyse properly so their DNA is not released and will not be sequenced). Small specimen volumes can reduce the ability to detect low-level organisms. Sequencing library preparation Poly-A tail enrichment of RNA will not include fragmented pathogen genomes; DNA sequencing alone will not detect RNA viruses. Targeting of sequences Capture probe-targeted sequencing will limit detection to targeted, known sequences. 16S targeted sequencing, as opposed to whole genome sequencing, will have limitations for the level of taxonomic classification. Sequencing methods High-level sample multiplexing can lead to insufficient read depth to detect organisms present at low levels. Computational contamination can occur between samples pooled on the same sequencing run due to a sample barcode for a sequence being misread and misassigned to another sample on the same run. This is termed barcode bleed-through; dual barcodes drop the rate of bleed through dramatically compared with single barcodes. Unique molecular identifiers are an even more powerful way to identify this phenomenon when compared with dual barcodes. Processing controls Negative controls allow some contaminating organisms to be identified. Internal positive controls, reference standards such as sequins, reduce bias introduced by experimental variability and can improve recognition of low-level organisms. Analysis methods A small curated database, or highly stringent criteria might not include novel or unexpected organisms, leading to false negative results. An uncurated database or lenient criteria might also identify organisms incorrectly. By reporting the potential sources of bias for a given study (figure 1) their potential influence can be considered with mitigation or compensation strategies or caveats made to improve interpretation. The complexity and multistep nature of microbiome measurement means that any metagenomics experiment should be considered and reported as a representative result, rather than assuming that it perfectly reflects the microbes present and their abundance. It is also why the term unbiased, which is often used when describing metagenomic experiments that do not use enrichment, should be used with caution (or not at all). The term untargeted metagenomics could be used instead (appendix pp 3–4).

Address potential bias introduced by bioinformatics analysis

Classification algorithms rely on alignment of sequencing reads and contigs obtained from overlapping reads against reference genomes. In the case of the alignment of assembled contigs, reads that cannot be built into contigs (unassigned reads) are discarded, which can lead to a potential loss of information. Classification of reads might be slow and a smaller database could be built with unique sequences representing certain taxa. However, this can lead to bias in the assignment of homologous sequences and should be clearly reported. Samples containing low abundance pathogens might produce false-negative results by not classifying sequencing reads as relevant or produce false-positive results if reads are non-specific. Subsequent alignment of sequence reads against a reference genome of the candidate pathogen(s) identified by the metagenomics analysis can provide necessary validation—wide and distributed coverage of the reference genome and high mapping identity is unlikely to result in a false positive. The level of coverage might be limited in samples with low pathogen load but still can be a true-positive result. Sufficient read depth is not always available for metagenomics data from clinical samples, which often contain a large proportion of reads derived from the host. Additionally, high read depth can generally be achieved only for microbes present at high-copy number. Authors should report where these considerations are relevant. Assessing the quality of reads before downstream classification is crucial for ensuring accuracy of taxonomic assignment. This quality control usually includes removal of adapters, background sequences (human, host, or known), low-complexity sequence reads, trimming of low-quality bases at the ends of reads, and removal of primer sequences. The total number of reads in each sample can be affected by factors including DNA extraction methods, sample handling, library preparation, differences in sequencing depth. As such, it is generally advisable to normalise read abundance between samples before any analysis and report where this is done. Sophisticated statistical modelling approaches can deal with variation in read numbers between samples without loss of data (eg, DESeq2).

Describe or address limitations of reference databases

The use of reference databases should be clearly described. It is crucial that the reference database, genomic data download date, and a description of the procedures behind the inclusion and indexing of reference sequences are clearly presented. Limitations of reference databases can interfere with correct assignment of sequences (figure 2 ). Curated reference databases might not include all the relevant microbial diversity. Conversely, non-curated databases can comprise incorrectly named, incomplete, low sequencing quality, or artefactual sequences. Studies have shown that sequences arising from sample contamination or incompleteness (eg, an incomplete region of a genome that contains an important mutation) are frequent features of reference databases, particularly when draft genomes are included. For example, over 1000 published microbial genome sequences have been identified as contaminated with phiX174, a bacteriophage used as a control in Illumina sequencing, and 2250 NCBI GenBank draft bacterial and archaeal genomes contain spurious human sequences. Additionally, false-negative results might be due to a focal species missing taxonomic representation in the databases, which have an inherent curatorial bias to known human associated pathogens (appendix pp 4–5).

Figure 2

The importance of reference database choice, design, and versioning in taxonomic profiling of clinical metagenomics samples

(A) Schematic representation of a typical clinical metagenomics sample with species assigned as coloured DNA and grey denoting DNA deriving from the host, contaminants, unidentified taxa, or taxa sequenced at low depth. The pie chart provides the full metagenomic composition with the bar providing the species composition excluding host DNA and contaminants. (B) Taxonomic profiling based on database 1. Species confidently assigned are highlighted by colours with unassigned species shown in grey. Using database 1, species A, B, and D are correctly assigned. Species that are misassigned are outlined with a circle. In this instance, sequences from species C are assigned to the closely related species C' because of the lack of a representative of species C in the reference database. Additionally, the reference database contains a partially contaminated sequence from species E, which is misassigned to contaminant sequences in the test clinical metagenomics sample. This affects the inference of species composition shown in the bar. (C) The addition of species F to database 2 allows assignment of a greater proportion of the species present in the original clinical metagenomics sample. Quality control and improvement of reference species E, now species E (QC), removes the spurious assignment of contaminant species. Species C is still misassigned to species C', its closest representative in the database. (D) Updating the reference database to include species C results in the correct assignment of sequences to species C rather than species C'. Species F is taxonomically reassigned to species X, leading to a change in the assigned species name despite no change in the data in the reference or query datasets. In all cases the pink sequences present in the original clinical metagenomics sample are not assigned as this species is not present in any of the three reference databases.

The importance of reference database choice, design, and versioning in taxonomic profiling of clinical metagenomics samples (A) Schematic representation of a typical clinical metagenomics sample with species assigned as coloured DNA and grey denoting DNA deriving from the host, contaminants, unidentified taxa, or taxa sequenced at low depth. The pie chart provides the full metagenomic composition with the bar providing the species composition excluding host DNA and contaminants. (B) Taxonomic profiling based on database 1. Species confidently assigned are highlighted by colours with unassigned species shown in grey. Using database 1, species A, B, and D are correctly assigned. Species that are misassigned are outlined with a circle. In this instance, sequences from species C are assigned to the closely related species C' because of the lack of a representative of species C in the reference database. Additionally, the reference database contains a partially contaminated sequence from species E, which is misassigned to contaminant sequences in the test clinical metagenomics sample. This affects the inference of species composition shown in the bar. (C) The addition of species F to database 2 allows assignment of a greater proportion of the species present in the original clinical metagenomics sample. Quality control and improvement of reference species E, now species E (QC), removes the spurious assignment of contaminant species. Species C is still misassigned to species C', its closest representative in the database. (D) Updating the reference database to include species C results in the correct assignment of sequences to species C rather than species C'. Species F is taxonomically reassigned to species X, leading to a change in the assigned species name despite no change in the data in the reference or query datasets. In all cases the pink sequences present in the original clinical metagenomics sample are not assigned as this species is not present in any of the three reference databases.

Study size

Describe clearly how power calculations were made

Whenever comparisons in metagenomic species composition between two or more groups are made, authors should report relevant parameters such as significance level, power threshold, sequencing depth, effect size, number of comparisons, methods used to correct for multiple comparisons, and details of the statistical methods used for power calculations. It should be clearly stated how an effect size was derived and a rationale for the clinical relevance of the specific effect size should be given. If no power calculation was made, an explanation should be given about why this was not considered feasible or useful (appendix pp 5–6).

Statistical methods

State the limit of detection, including analytical sensitivity and specificity

The limit of detection (LOD) refers to the minimum quantity of genomic material from an organism required for its detection and should be stated in metagenomics studies. Determination of the LOD for a metagenomics study is dependent on the sequencing technology, sequencing depth, read length, representation of genomes related to the taxa of interest in the reference database, and the complexity of the community and amount of host nucleic acid in the sample. Simple calculations give estimates for the LOD (eg, for 106 reads per sample, the LOD is one read per sample), which corresponds to a relative abundance of the order of magnitude of 10−6 (ie, ∼0·0001%). Formal calculations of LOD that are needed for clinical validation should be done using probit analysis. In practice, the LOD will be considerably higher than that derived from these calculations because a single read from a taxon is very likely to be due to contamination or misclassification. Rather than trusting such calculations, the use of positive (spiked) controls and negative controls in the sequencing run allows assessment of sensitivity and specificity. With a single infection, the number of on-target reads will be correlated with the signal in the sample but mixed infections and coinfections will influence sensitivity. Experimentally validating these for model organisms that represent the specific pathogens of interest (eg, a DNA virus, an RNA virus, Gram-negative and Gram-positive bacteria, etc) is recommended, particularly for diagnostic tests.

Discussion

Attempt or acknowledge the need for functional or phenotypic validation

Genotypic data do not always correlate with clinical phenotype; for example, mechanisms that involve inducible resistance, gene expression and regulation, or post-translational modifications. In studies investigating mixed microbial communities it may not always be possible to determine which taxon a particular gene belongs to.88, 89 This is also relevant in the establishment of causality. Efforts should be made to undertake phenotypic and functional validation to assess the inferred results. If this is not possible, or beyond the scope of the study, the limitations of inferring results solely from genotypic data should be acknowledged and discussed, including known caveats and restrictions on making key assumptions.

Consider the need for species or strain resolution

Different strains or lineages within a species can differ widely in their phenotypic characteristics. For example, sequencing with strain-level resolution enabled identification of specific strains of Escherichia coli associated with necrotising enterocolitis in preterm newborns and lineages of Salmonella enterica associated with varying clinical phenotypes. Therefore, profiling microbial communities with sub-species resolution can be useful, although de novo assembly of metagenomic reads remains a methodological challenge. The strain and species resolution capacity of the assay used should be clearly stated with consideration for how the resolution applies to the study in question. In particular, microbial community profiling using 16S rRNA gene sequencing cannot identify individual species within some genera and should never be used to identify to the strain level. As recommended in STROME-ID, a definition or reference to published definitions of a strain should be provided.

Other information

Report any ethical considerations with specific implications for metagenomics

Metagenomics produces a vast amount of host and pathogen data, which are untargeted and sometimes not of immediate interest. Molecular methods to deplete human genomic material exist; however, they remain imperfect. It might be sufficient to detail in a protocol that the host data will be removed, and not analysed, although this approach could lead to bias in microbial reads caused by the in silico host-depletion method—host genomes can contain viable viral genomes and non-viable genetic material derived from or shared with microorganisms. In these cases, the method used to identify and exclude host reads—eg, through mapping of all reads to the host reference genome—should be reported. including the choice of mapping algorithm and programme parameters. Even if data analysis is restricted to non-human reads, it could still unveil potentially sensitive information, such as a new diagnosis of HIV. It has also been shown that more than 80% of individuals can be identified from populations of hundreds using their gut microbiome profile. These issues pose real concerns, particularly with the increasing requirement for data to be made publicly available. For all these reasons, specific ethical implications relating to metagenomics data and corresponding approvals should be stated, and appropriate ethical approval should be obtained.

Conclusions

Metagenomics has already made a significant impact on pathogen detection and characterisation, and we probably still underestimate its full potential. Increasing use of metagenomics has been accompanied by recognition of complex issues at every stage in the pipeline—ie, sample collection, sequencing, and analysis. Standards for reporting are therefore needed to ensure clarity, consistency, and robustness of research. The guidance given in this paper constitutes a set of recommendations and we recognise that research studies need to be pragmatic and use available resources. Nonetheless, reporting known and potential limitations should minimise misrepresentation. It is inevitable that the field of metagenomics will continue to advance steadily and these guidelines will need to be updated.

Search strategy and selection criteria

In 2018, a STROBE-metagenomics working group was established, identified through notable researchers in the field, including a geographically diverse group of epidemiologists, statisticians, bioinformaticians, neurologists, virologists, microbiologists, and specialists in public health and infectious diseases. Participants met to agree the structure and content of the statement, and the proposal was registered with the Equator Network. Specific issues to be covered were identified (panel 2 ). A systematic approach was taken to gather evidence to support the recommendations, with literature searches performed in PubMed, searching references of articles, and supplemented by expert opinion. Literature searches were done in PubMed using medical subject headings terms and keywords “(?sequenc* OR metagenom* OR Illumina OR RNA-seq OR RNASeq OR (Roche 454) OR (Ion torrent) OR (Proton / PGM) OR MiSeq OR HiSeq OR NextSeq OR MinION OR Nanopore OR PacBio) AND (infectio* OR microorganism OR microorganisms OR pathogen OR pathogens OR bacteria* OR virus OR viral OR fungus OR fungi OR parasite OR parasites OR parasitic)”, searching references of articles, and supplemented by expert opinion from within the group. Articles were limited to those in English language published between January, 2000, and June, 2019. Areas that were adequately addressed in existing STROBE and STROME-ID statements were not covered. Iterative versions of the guidelines and manuscript were circulated to develop a consensus. The STROBE-metagenomics extension has been developed to complement the STROBE and STROME-ID statements, with the new recommendations organised alongside the existing table. The guidelines discussed therefore cover only the new proposals for reporting. Specimen collection, handling, preservation, and storage Nucleic acid extraction Sequencing instrumentation and processing, including library preparation Bioinformatic analysis method, including workflow, database composition, and parameterisation Quality assurance measures, including internal quality control, such as the use of adequate internal and external controls Limits of detection, including analytical sensitivity, and specificity for clinical testing Power and sample size calculations Use of orthogonal methods to confirm sequencing results Criteria to confirm the role of pathogen(s) in disease aetiology Turnaround time Cost Ethical considerations Specific issues related to applications, such as in the diagnosis of CNS infections, and investigation of antimicrobial resistance For more on protocol sharing see http://www.protocols.io/ This online publication has been corrected. The corrected version first appeared at thelancet.com/infection on October 23, 2020

90 in total

1. The STARD statement for reporting studies of diagnostic accuracy: explanation and elaboration.

Authors: Patrick M Bossuyt; Johannes B Reitsma; David E Bruns; Constantine A Gatsonis; Paul P Glasziou; Les M Irwig; David Moher; Drummond Rennie; Henrica C W de Vet; Jeroen G Lijmer
Journal: Clin Chem Date: 2003-01 Impact factor: 8.327

2. Messages from the third International Conference on Clinical Metagenomics (ICCMg3).

Authors: Etienne Ruppé; Jacques Schrenzel
Journal: Microbes Infect Date: 2019-03-02 Impact factor: 2.700

3. Clinical Metagenomic Sequencing for Diagnosis of Meningitis and Encephalitis.

Authors: Michael R Wilson; Hannah A Sample; Kelsey C Zorn; Shaun Arevalo; Guixia Yu; John Neuhaus; Scot Federman; Doug Stryke; Benjamin Briggs; Charles Langelier; Amy Berger; Vanja Douglas; S Andrew Josephson; Felicia C Chow; Brent D Fulton; Joseph L DeRisi; Jeffrey M Gelfand; Samia N Naccache; Jeffrey Bender; Jennifer Dien Bard; Jamie Murkey; Magrit Carlson; Paul M Vespa; Tara Vijayan; Paul R Allyn; Shelley Campeau; Romney M Humphries; Jeffrey D Klausner; Czarina D Ganzon; Fatemeh Memar; Nicolle A Ocampo; Lara L Zimmermann; Stuart H Cohen; Christopher R Polage; Roberta L DeBiasi; Barbara Haller; Ronald Dallas; Gabriela Maron; Randall Hayden; Kevin Messacar; Samuel R Dominguez; Steve Miller; Charles Y Chiu
Journal: N Engl J Med Date: 2019-06-13 Impact factor: 91.245

4. Metagenomic microbial community profiling using unique clade-specific marker genes.

Authors: Nicola Segata; Levi Waldron; Annalisa Ballarini; Vagheesh Narasimhan; Olivier Jousson; Curtis Huttenhower
Journal: Nat Methods Date: 2012-06-10 Impact factor: 28.547

5. The truth about metagenomics: quantifying and counteracting bias in 16S rRNA studies.

Authors: J Paul Brooks; David J Edwards; Michael D Harwich; Maria C Rivera; Jennifer M Fettweis; Myrna G Serrano; Robert A Reris; Nihar U Sheth; Bernice Huang; Philippe Girerd; Jerome F Strauss; Kimberly K Jefferson; Gregory A Buck
Journal: BMC Microbiol Date: 2015-03-21 Impact factor: 3.605

6. Long term storage of dry versus frozen RNA for next generation molecular studies.

Authors: Eric Seelenfreund; William A Robinson; Carol M Amato; Aik-Choon Tan; Jihye Kim; Steven E Robinson
Journal: PLoS One Date: 2014-11-07 Impact factor: 3.240

7. Clinical metagenomics of bone and joint infections: a proof of concept study.

Authors: Etienne Ruppé; Vladimir Lazarevic; Myriam Girard; William Mouton; Tristan Ferry; Frédéric Laurent; Jacques Schrenzel
Journal: Sci Rep Date: 2017-08-10 Impact factor: 4.379

8. Laboratory validation of a clinical metagenomic sequencing assay for pathogen detection in cerebrospinal fluid.

Authors: Steve Miller; Samia N Naccache; Erik Samayoa; Kevin Messacar; Shaun Arevalo; Scot Federman; Doug Stryke; Elizabeth Pham; Becky Fung; William J Bolosky; Danielle Ingebrigtsen; Walter Lorizio; Sandra M Paff; John A Leake; Rick Pesano; Roberta DeBiasi; Samuel Dominguez; Charles Y Chiu
Journal: Genome Res Date: 2019-04-16 Impact factor: 9.043

Review 9. The changing face of pathogen discovery and surveillance.

Authors: W Ian Lipkin
Journal: Nat Rev Microbiol Date: 2013-01-03 Impact factor: 60.633

10. Integrating host response and unbiased microbe detection for lower respiratory tract infection diagnosis in critically ill adults.

Authors: Charles Langelier; Katrina L Kalantar; Farzad Moazed; Michael R Wilson; Emily D Crawford; Thomas Deiss; Annika Belzer; Samaneh Bolourchi; Saharai Caldera; Monica Fung; Alejandra Jauregui; Katherine Malcolm; Amy Lyden; Lillian Khan; Kathryn Vessel; Jenai Quan; Matt Zinter; Charles Y Chiu; Eric D Chow; Jenny Wilson; Steve Miller; Michael A Matthay; Katherine S Pollard; Stephanie Christenson; Carolyn S Calfee; Joseph L DeRisi
Journal: Proc Natl Acad Sci U S A Date: 2018-11-27 Impact factor: 11.205

8 in total

Review 1. Probiotic supplementation for neonates with congenital gastrointestinal surgical conditions: guidelines for future research.

Authors: Shripada Rao; Meera Esvaran; Liwei Chen; Chooi Kok; Anthony D Keil; Ian Gollow; Karen Simmer; Bernd Wemheuer; Patricia Conway; Sanjay Patole
Journal: Pediatr Res Date: 2022-05-03 Impact factor: 3.756

Review 2. Metagenomics-enabled microbial surveillance.

Authors: Karrie K K Ko; Kern Rei Chng; Niranjan Nagarajan
Journal: Nat Microbiol Date: 2022-04-01 Impact factor: 17.745

Review 3. Reporting guidelines for human microbiome research: the STORMS checklist.

Authors: Curtis Huttenhower; Jennifer B Dowd; Heidi E Jones; Levi Waldron; Chloe Mirzayi; Audrey Renson; Fatima Zohra; Shaimaa Elsafoury; Ludwig Geistlinger; Lora J Kasselman; Kelly Eckenrode; Janneke van de Wijgert; Amy Loughman; Francine Z Marques; David A MacIntyre; Manimozhiyan Arumugam; Rimsha Azhar; Francesco Beghini; Kirk Bergstrom; Ami Bhatt; Jordan E Bisanz; Jonathan Braun; Hector Corrada Bravo; Gregory A Buck; Frederic Bushman; David Casero; Gerard Clarke; Maria Carmen Collado; Paul D Cotter; John F Cryan; Ryan T Demmer; Suzanne Devkota; Eran Elinav; Juan S Escobar; Jennifer Fettweis; Robert D Finn; Anthony A Fodor; Sofia Forslund; Andre Franke; Cesare Furlanello; Jack Gilbert; Elizabeth Grice; Benjamin Haibe-Kains; Scott Handley; Pamela Herd; Susan Holmes; Jonathan P Jacobs; Lisa Karstens; Rob Knight; Dan Knights; Omry Koren; Douglas S Kwon; Morgan Langille; Brianna Lindsay; Dermot McGovern; Alice C McHardy; Shannon McWeeney; Noel T Mueller; Luigi Nezi; Matthew Olm; Noah Palm; Edoardo Pasolli; Jeroen Raes; Matthew R Redinbo; Malte Rühlemann; R Balfour Sartor; Patrick D Schloss; Lynn Schriml; Eran Segal; Michelle Shardell; Thomas Sharpton; Ekaterina Smirnova; Harry Sokol; Justin L Sonnenburg; Sujatha Srinivasan; Louise B Thingholm; Peter J Turnbaugh; Vaibhav Upadhyay; Ramona L Walls; Paul Wilmes; Takuji Yamada; Georg Zeller; Mingyu Zhang; Ni Zhao; Liping Zhao; Wenjun Bao; Aedin Culhane; Viswanath Devanarayan; Joaquin Dopazo; Xiaohui Fan; Matthias Fischer; Wendell Jones; Rebecca Kusko; Christopher E Mason; Tim R Mercer; Susanna-Assunta Sansone; Andreas Scherer; Leming Shi; Shraddha Thakkar; Weida Tong; Russ Wolfinger; Christopher Hunter; Nicola Segata
Journal: Nat Med Date: 2021-11-17 Impact factor: 87.241

4. Metatranscriptomics Analysis Reveals Diverse Viral RNA in Cutaneous Papillomatous Lesions of Cattle.

Authors: Adriana O Fernandes; Gerlane S Barros; Marcus Va Batista
Journal: Evol Bioinform Online Date: 2022-03-14 Impact factor: 2.031

5. Metagenomic Sequencing as a Pathogen-Agnostic Clinical Diagnostic Tool for Infectious Diseases: a Systematic Review and Meta-analysis of Diagnostic Test Accuracy Studies.

Authors: Kumeren N Govender; Teresa L Street; Nicholas D Sanderson; David W Eyre
Journal: J Clin Microbiol Date: 2021-08-18 Impact factor: 5.948