Literature DB >> 31387904

Reply to Sun et al., "Identifying Composition Novelty in Microbiome Studies: Improvement of Prediction Accuracy".

Xiaoquan Su1,2,3, Gongchao Jing4,2,3, Daniel McDonald5, Honglei Wang4,2,3, Zengbin Wang4,2,3, Antonio Gonzalez5, Zheng Sun4,2,3, Shi Huang4,2,3, Jose Navas6, Rob Knight7,6,8,9, Jian Xu1,2,3.   

Abstract

Entities:  

Keywords:  bioinformatics; community similarity; data mining; database search; microbial ecology; microbiome; microbiome novelty; novelty; search

Mesh:

Substances:

Year:  2019        PMID: 31387904      PMCID: PMC6686038          DOI: 10.1128/mBio.01234-19

Source DB:  PubMed          Journal:  mBio            Impact factor:   7.867


× No keyword cloud information.

REPLY

To quantitatively measure the beta diversities between microbiomes, Microbiome Search Engine (MSE) (1) calculates phylogeny similarity using operational taxonomy unit (OTU) profiles; for both query and database samples, all 16S rRNA gene sequences are mapped to the Greengenes database (version 13-8) (2) for reference-based OTU picking with a 97% cutoff. Thus, in MSE, the comparison between query and database samples is approximately at the species level (3), although the actual taxonomic resolution varies according to taxon, due to differences in the evolutionary rates of the 16S rRNAs. Moreover, in MSE, both the relative abundance (with 16S rRNA gene copy number normalization [4]) and the phylogenetic structures of OTUs are utilized for similarity calculation (as in UniFrac [5, 6]), yet the speed is optimized by nonrecursive computing to enable real-time responses (7). By comparing the query sample (i.e., dust from university dormitories) provided by Sun et al. (8) and the MSE top-hit samples, which are from mosquito tissues, we found that although abundant sequences of the two (query and the top-hit) samples are distributed among different OTUs (species) within the Pseudomonas genus, they are still very close in the common OTU-based phylogenetic tree (extracted from the Greengenes tree) (Fig. 1a), resulting in a high similarity of 0.916. To test whether this match is significant, we ranked this value in pairwise similarity calculation among all microbiomes (n = 177,022) in MSE [in total, (n · n – 1)/2 = 15,668,305,731 times). The resulting P value of the permutation test is 0.0009, suggesting a highly significant match. This might have revealed potential interaction or transmission between mosquitos and dust, as these mosquitos were collected from residential properties and buildings (samples for generating 16S rRNA amplicon libraries were prepared by grinding one insect or a pool of individual insects [9]) (Table 1), or it might have highlighted communities that are distinct yet still dominated by microbes that are similar to one another when the overall picture of the bacterial tree is considered.
FIG 1

Comparison between the query microbiome (dorm dust) and the top hits reported by MSE-based searches. (a) Distribution of OTUs in the common phylogeny tree between the query and the top hit from the full MSE reference database. Those abundant OTUs from the Pseudomonas genus are marked in the red box, and the shared subbranches of the query and the hits are indicated in blue. (b) The similarities between the query sample and each of the top 10 hits against the building reference samples are significantly lower than those between the query and each of the 10 hits against the entire database, as suggested by both t test (b) and PCoA (c). PC1 and PC2, principal components 1 and 2, respectively.

TABLE 1

Details for the top 10 hits for the query microbiome, dorm dust

MSE database ID of top 10 hitHabitatSimilaritySampling locationSampling date (yr/mo/day)Reference
IDs from entire MSE database
    S_10815.C1OtvW34TOR2012Mosquito tissue0.91586Toronto, Canada2012/8/219
    S_10815.NOjW34MSL2012Mosquito tissue0.91350Toronto, Canada2012/7/249
    S_10815.3A081OjW32LAM2012Mosquito tissue0.91291Toronto, Canada2012/8/79
    S_10815.Can2CxW32MSL2012Mosquito tissue0.91283Toronto, Canada2012/8/229
    S_10815.O3AvW34TOR2012Mosquito tissue0.91260Toronto, Canada2012/6/129
    S_10815.Y12A2AnpW31PEE2012Mosquito tissue0.91183Toronto, Canada2012/8/19
    S_10815.C1AvW30TOR2012Mosquito tissue0.91134Toronto, Canada2012/7/249
    S_10815.Can10AvW32MSL2012Mosquito tissue0.91097Toronto, Canada2012/8/159
    S_10815.M1AvW32WEC2012Mosquito tissue0.91095Toronto, Canada2012/7/319
    S_10815.B4AvW25TOR2013Mosquito tissue0.91088Toronto, Canada2013/6/189
IDs from “Building” subset of reference microbiomes in MSE database
    S_10172.815Room surface dust0.90388Chicago, IL, USA2017/5/2410
    S_10172.828Nurse station surface dust0.90063Chicago, IL, USA2017/5/2410
    S_1772.H23CbKitchen cutting board0.89745Raleigh-Durham, NC, USA2013/5/2211
    S_10172.286Cold tap water0.89666Chicago, IL, USA2017/5/2410
    S_10172.830Nurse station surface dust0.89300Chicago, IL, USA2017/5/2410
    S_SRR5574403Kitchen dust0.89109Oakland, CA, USA2017/5/1712
    S_10423.34E7LN0ZRJUQBCarpet dust0.88931Toronto, Canada2004/7/1413
    S_10172.10456Cold tap water0.88743Chicago, IL, USA2017/5/2410
    S_10172.8331Glove0.88592Chicago, IL, USA2017/5/2410
    S_10172.291Room surface dust0.88534Chicago, IL, USA2017/5/2410
Comparison between the query microbiome (dorm dust) and the top hits reported by MSE-based searches. (a) Distribution of OTUs in the common phylogeny tree between the query and the top hit from the full MSE reference database. Those abundant OTUs from the Pseudomonas genus are marked in the red box, and the shared subbranches of the query and the hits are indicated in blue. (b) The similarities between the query sample and each of the top 10 hits against the building reference samples are significantly lower than those between the query and each of the 10 hits against the entire database, as suggested by both t test (b) and PCoA (c). PC1 and PC2, principal components 1 and 2, respectively. Details for the top 10 hits for the query microbiome, dorm dust To test whether microbiomes from similar environments are more similar to each other than those from distinct environments, we next searched the query sample (which is dust collected inside a building) against all “building” samples in the reference database of MSE (a subset that includes 11,248 samples that were labeled as “building” from 35 studies). The similarities between the query and each of the top 10 hits (10–13) (Table 1) against the building reference samples are significantly lower than those between the query and each of the top 10 hits against the entire database (Fig. 1b) (t test P value = 2.75E–08). Findings from principal-component analysis (PCoA) support this conclusion, because the query sample is closer to the mosquito samples (i.e., to hits from the entire database) than to the building sample hits (i.e., hits from the building database) (Fig. 1c). These results suggest that microbiomes from similar environments can indeed be more different from each other than from certain samples from other environments that would intuitively be considered distinct. In our current MSE implementation (1), the microbiome novelty score (MNS) is calculated based on the top hits against the whole reference database in MSE, rather than against only a subset of the reference microbiomes or those from a specific environment. We are grateful to Sun et al.’s suggestion of allowing the choice of reference databases when using MSE. In the upcoming release of MSE (http://mse.ac.cn), we plan to allow the selection of a specific environment or ecosystem as the reference database to search against, although we caution strongly that such restricted searches may lead to incorrect interpretation of results when the databases are not comprehensive. Recently, amplicon sequence variant (ASV)-based approaches have been developed to improve the resolution of classifying 16S rRNA genes (14–16), but they require a unified sequencing platform and identical gene amplicon regions among the data sets. At present, the majority of historical microbiome samples were produced via a variety of platforms and amplicon regions; e.g., the V1-V3 and V3-V5 regions of 16S rRNA gene were sequenced via Roche 454 in the Human Microbiome Project (17), while the V4 region was sequenced via Illumina HiSeq and MiSeq in the Earth Microbiome Project (18). This reality limits the prospect of adopting the ASV scheme in MSE for searching against the current 16S rRNA-based microbiome data space. On the other hand, with the rapid accumulation of shotgun metagenomic data sets, we expect MSE to accommodate such data sets and eventually allow microbiome searches at the strain level, as Sun et al. have suggested.
  18 in total

1.  Bacterial colonization and succession in a newly opened hospital.

Authors:  Simon Lax; Naseer Sangwan; Daniel Smith; Peter Larsen; Kim M Handley; Miles Richardson; Kristina Guyton; Monika Krezalek; Benjamin D Shogan; Jennifer Defazio; Irma Flemming; Baddr Shakhsheer; Stephen Weber; Emily Landon; Sylvia Garcia-Houchins; Jeffrey Siegel; John Alverdy; Rob Knight; Brent Stephens; Jack A Gilbert
Journal:  Sci Transl Med       Date:  2017-05-24       Impact factor: 17.956

2.  Structure, function and diversity of the healthy human microbiome.

Authors: 
Journal:  Nature       Date:  2012-06-13       Impact factor: 49.962

3.  Parallel-META 3: Comprehensive taxonomical and functional analysis platform for efficient comparison of microbial communities.

Authors:  Gongchao Jing; Zheng Sun; Honglei Wang; Yanhai Gong; Shi Huang; Kang Ning; Jian Xu; Xiaoquan Su
Journal:  Sci Rep       Date:  2017-01-12       Impact factor: 4.379

4.  Exact sequence variants should replace operational taxonomic units in marker-gene data analysis.

Authors:  Benjamin J Callahan; Paul J McMurdie; Susan P Holmes
Journal:  ISME J       Date:  2017-07-21       Impact factor: 10.302

5.  Deblur Rapidly Resolves Single-Nucleotide Community Sequence Patterns.

Authors:  Amnon Amir; Daniel McDonald; Jose A Navas-Molina; Evguenia Kopylova; James T Morton; Zhenjiang Zech Xu; Eric P Kightley; Luke R Thompson; Embriette R Hyde; Antonio Gonzalez; Rob Knight
Journal:  mSystems       Date:  2017-03-07       Impact factor: 6.496

6.  A communal catalogue reveals Earth's multiscale microbial diversity.

Authors:  Luke R Thompson; Jon G Sanders; Daniel McDonald; Amnon Amir; Joshua Ladau; Kenneth J Locey; Robert J Prill; Anupriya Tripathi; Sean M Gibbons; Gail Ackermann; Jose A Navas-Molina; Stefan Janssen; Evguenia Kopylova; Yoshiki Vázquez-Baeza; Antonio González; James T Morton; Siavash Mirarab; Zhenjiang Zech Xu; Lingjing Jiang; Mohamed F Haroon; Jad Kanbar; Qiyun Zhu; Se Jin Song; Tomasz Kosciolek; Nicholas A Bokulich; Joshua Lefler; Colin J Brislawn; Gregory Humphrey; Sarah M Owens; Jarrad Hampton-Marcell; Donna Berg-Lyons; Valerie McKenzie; Noah Fierer; Jed A Fuhrman; Aaron Clauset; Rick L Stevens; Ashley Shade; Katherine S Pollard; Kelly D Goodwin; Janet K Jansson; Jack A Gilbert; Rob Knight
Journal:  Nature       Date:  2017-11-01       Impact factor: 49.962

7.  Identifying and Predicting Novelty in Microbiome Studies.

Authors:  Xiaoquan Su; Gongchao Jing; Daniel McDonald; Honglei Wang; Zengbin Wang; Antonio Gonzalez; Zheng Sun; Shi Huang; Jose Navas; Rob Knight; Jian Xu
Journal:  MBio       Date:  2018-11-13       Impact factor: 7.867

8.  Performance of Microbiome Sequence Inference Methods in Environments with Varying Biomass.

Authors:  Vincent Caruso; Xubo Song; Mark Asquith; Lisa Karstens
Journal:  mSystems       Date:  2019-02-19       Impact factor: 6.496

9.  Fast UniFrac: facilitating high-throughput phylogenetic analyses of microbial communities including analysis of pyrosequencing and PhyloChip data.

Authors:  Micah Hamady; Catherine Lozupone; Rob Knight
Journal:  ISME J       Date:  2009-08-27       Impact factor: 10.302

10.  Home life: factors structuring the bacterial diversity found within and between homes.

Authors:  Robert R Dunn; Noah Fierer; Jessica B Henley; Jonathan W Leff; Holly L Menninger
Journal:  PLoS One       Date:  2013-05-22       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.