| Literature DB >> 24475242 |
Ching-chia Yang1, Wataru Iwasaki1.
Abstract
MetaMetaDB (http://mmdb.aori.u-tokyo.ac.jp/) is a database and analytic system for investigating microbial habitability, i.e., how a prokaryotic group can inhabit different environments. The interaction between prokaryotes and the environment is a key issue in microbiology because distinct prokaryotic communities maintain distinct ecosystems. Because 16S ribosomal RNA (rRNA) sequences play pivotal roles in identifying prokaryotic species, a system that comprehensively links diverse environments to 16S rRNA sequences of the inhabitant prokaryotes is necessary for the systematic understanding of the microbial habitability. However, existing databases are biased to culturable prokaryotes and exhibit limitations in the comprehensiveness of the data because most prokaryotes are unculturable. Recently, metagenomic and 16S rRNA amplicon sequencing approaches have generated abundant 16S rRNA sequence data that encompass unculturable prokaryotes across diverse environments; however, these data are usually buried in large databases and are difficult to access. In this study, we developed MetaMetaDB (Meta-Metagenomic DataBase), which comprehensively and compactly covers 16S rRNA sequences retrieved from public datasets. Using MetaMetaDB, users can quickly generate hypotheses regarding the types of environments a prokaryotic group may be adapted to. We anticipate that MetaMetaDB will improve our understanding of the diversity and evolution of prokaryotes.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24475242 PMCID: PMC3903645 DOI: 10.1371/journal.pone.0087126
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The number of representative 16S rRNA sequences in each environmental category.
| Environment | Number of files | 16S rRNA sequences | representative 16S rRNA sequences | Average (bp) | Min (bp) | Max(bp) |
| ant fungus garden | 127 | 68,118 | 11,556 | 393.07 | 200 | 576 |
| ant | 9 | 10,915 | 3,436 | 377.50 | 200 | 583 |
| beach sand | 4 | 2,038 | 1,234 | 357.52 | 200 | 538 |
| beetle | 19 | 11,895 | 1,708 | 397.53 | 200 | 561 |
| biofilm | 29 | 24,317 | 3,723 | 449.99 | 204 | 806 |
| bioreactor | 39 | 181,862 | 33,591 | 436.26 | 200 | 620 |
| bioreactor sludge | 226 | 286,831 | 50,666 | 242.91 | 200 | 541 |
| bovine gut | 16 | 319,384 | 59,564 | 362.20 | 200 | 655 |
| chicken gut | 5 | 3,579 | 1,496 | 265.18 | 200 | 406 |
| compost | 24 | 14,914 | 2,195 | 381.07 | 200 | 599 |
| coral | 14 | 589 | 192 | 289.62 | 200 | 467 |
| fish | 26 | 87,742 | 49,706 | 420.24 | 200 | 861 |
| food fermentation | 1 | 90,024 | 51,908 | 380.78 | 200 | 673 |
| food | 15 | 79,971 | 37,250 | 399.89 | 200 | 915 |
| freshwater | 34 | 102,460 | 35,880 | 351.29 | 200 | 797 |
| freshwater sediment | 2 | 9,368 | 7,128 | 404.95 | 200 | 803 |
| groundwater | 1 | 4,524 | 1,296 | 365.04 | 200 | 550 |
| gut | 90 | 299,036 | 43,670 | 288.22 | 200 | 621 |
| honeybee | 2 | 34,323 | 23,479 | 377.10 | 200 | 820 |
| hot springs | 13 | 10,312 | 2,646 | 375.05 | 200 | 572 |
| human gut | 247 | 216,566 | 64,787 | 252.06 | 200 | 592 |
| human | 1,272 | 311,651 | 53,834 | 365.09 | 200 | 842 |
| human oral | 257 | 242,633 | 11,574 | 287.97 | 200 | 547 |
| human skin | 173 | 55,863 | 22,114 | 362.43 | 200 | 866 |
| hydrocarbon | 506 | 413,879 | 46,431 | 413.31 | 200 | 704 |
| hydrothermal vent | 1 | 739 | 235 | 251.52 | 200 | 296 |
| hypersaline lake | 5 | 6,957 | 2,474 | 399.30 | 200 | 548 |
| ice | 3 | 456 | 421 | 249.59 | 200 | 312 |
| marine | 823 | 79,871 | 27,447 | 294.65 | 200 | 582 |
| marine sediment | 80 | 21,155 | 9,393 | 342.97 | 200 | 680 |
| microbial mat | 4 | 4,561 | 1,930 | 247.02 | 201 | 313 |
| mine drainage | 4 | 1,217 | 674 | 357.20 | 200 | 553 |
| mosquito | 24 | 46,888 | 4,324 | 375.05 | 200 | 563 |
| mouse gut | 20 | 37,536 | 11,021 | 247.95 | 200 | 512 |
| oil production facility | 17 | 4,893 | 1,691 | 232.90 | 200 | 523 |
| phyllosphere | 35 | 2,108 | 1,199 | 263.95 | 201 | 583 |
| rhizosphere | 13 | 67,371 | 17,514 | 374.79 | 200 | 565 |
| root | 2 | 7,260 | 1,590 | 377.01 | 200 | 512 |
| saltern | 15 | 2,309 | 651 | 393.71 | 200 | 570 |
| sediment | 89 | 137,175 | 37,739 | 387.37 | 200 | 772 |
| soil | 257 | 1,775,170 | 490,776 | 341.19 | 200 | 806 |
| stromatolite | 7 | 17,441 | 2,633 | 246.28 | 202 | 266 |
| termite gut | 1 | - | - | - | - | - |
| termite | 8 | 12636 | 1557 | 388.31 | 200 | 732 |
| wasp | 7 | 8602 | 616 | 369.08 | 200 | 565 |
| wastewater | 40 | 20373 | 6264 | 373.78 | 200 | 526 |
|
|
|
|
|
|
|
|
All the reads from the termite gut were shorter than 200 bp, and the average length was not calculated.
“Average” represents the average length, whereas “Min” and “Max” represent the lengths of the shortest and the longest sequences.
Figure 1Flowchart for generating the MetaMetaDB content.
A large amount of data was effectively condensed into a small number of representative 16S rRNA sequences. The asterisks indicate the processes done by NGS QC Toolkit. The numbers in the parentheses represent the numbers of sequences. The tool used in each step is described in the parentheses of that step.
Figure 2Screen image of a sample results page.
MetaMetaDB produces one figure and one statistical description for each query sequence and provides a link to download all of the figures and text files. The y-axis represents the MHIs, and the columns show the MHIs calculated by the hits above the different identities. The MHIs and the number of hit(s) used for the calculation are listed under the figure.
Habitats and MHIs inferred by MetaMetaDB.
| Domain | Species | Habitat | Number of 16S rRNA | MHI |
| Archaea |
| Wastewater, Sludge | 2 | (97%, 14 hits) wastewater: 59.12%; compost: 17.71%; rhizosphere: 11.91%; human skin: 11.26% |
| Bacteria |
| Soil | 10 | (97%, 164 hits) hydrocarbon: 23.68%; wastewater: 19.89%; root: 10.44%; compost: 8.94%; ant fungus garden: 7.33%; rhizosphere: 6.68%; food: 5.49%; sediment: 5.47%; fish: 5.04%; gut: 4.72%; soil: 2.33% |
| Bacteria |
| Human intestinal microflora, Host | 7 | (97%, 1,032 hits) hydrothermal vent: 33.66%; termite: 14.31%; freshwater: 8.16%; other: 7.52%; ant fungus garden: 5.17%; marine: 4.63%; microbial mat: 4.50%; gut: 4.42%; beetle: 4.12%; human gut: 3.48%; phyllosphere: 3.10%; ant: 2.31%; wastewater: 1.79%; hydrocarbon: 1.47%; sediment: 1.37% |
| Bacteria |
| Wastewater, Intestinal microflora, Host, Fresh water | 8 | (97%, 36 hits) freshwater: 88.15%; marine: 11.85% |
| Bacteria |
| Human stomach, Host | 2 | (97%, 3 hits) human gut: 63.82%; gut: 36.18% |
The habitats were retrieved from the Genomes OnLine Database (GOLD) [40].
Only MHIs that were calculated by hits above highest identity threshold are listed.