Chih-Min Chiu1, Feng-Mao Lin1, Tzu-Hao Chang2, Wei-Chih Huang1, Chao Liang1, Ting Yang1, Wei-Yun Wu1, Tzu-Ling Yang1, Shun-Long Weng1,3,4,5, Hsien-Da Huang1,6.
Abstract
BACKGROUND: The human body hosts a vast array of bacteria in the oral cavity, skin, gastrointestinal tract and vagina. Some bacteria are harmful to the host while others are beneficial. Many methods exist to identify bacteria, but most apply only to specific, cultivable bacteria and are tedious. Using high-throughput sequencing technology, this work derives bacterial 16S rRNA sequences and analyzes probiotic and pathogen species.
Year: 2014 PMID: 24418497 PMCID: PMC3901789 DOI: 10.1186/2043-9113-4-1
Source DB: PubMed Journal: J Clin Bioinforma ISSN: 2043-9113
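Results of quality filtering and taxonomy assignment
| Sample | Raw reads | Quality-filtered reads | % of raw | Taxonomy-assigned reads | % of filtered | Probiotic reads | % of assigned | Pathogen reads | % of assigned |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |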
| B011 | 125420 | 117451 | 93.65% | 90952 | 77.44% | 60 | 0.07% | 3509 | 3.86% |
| B012 | 132240 | 120134 | 90.85% | 94679 | 78.81% | 3457 | 3.65% | 20109 | 21.24% |
| B013 | 151876 | 142585 | 93.88% | 99025 | 69.45% | 3452 | 3.49% | 21341 | 21.55% |
| B014 | 134619 | 126784 | 94.18% | 95377 | 75.23% | 611 | 0.64% | 6665 | 6.99% |
| B016 | 135457 | 126507 | 93.39% | 89407 | 70.67% | 49 | 0.05% | 20870 | 23.34% |
| B017 | 141682 | 131968 | 93.14% | 89465 | 67.79% | 1064 | 1.19% | 8944 | 10.00% |
| B018 | 111228 | 102382 | 92.05% | 56981 | 55.66% | 910 | 1.60% | 11630 | 20.41% |
| B019 | 128532 | 120719 | 93.92% | 76877 | 63.68% | 305 | 0.40% | 2775 | 3.61% |
| B020 | 128441 | 121957 | 94.95% | 89618 | 73.48% | 123 | 0.14% | 3673 | 4.10% |
| B031 | 140941 | 132311 | 93.88% | 97962 | 74.04% | 2129 | 2.17% | 5194 | 5.30% |
| B033 | 142462 | 134554 | 94.45% | 80548 | 59.86% | 229 | 0.28% | 2725 | 3.38% |
| B034 | 148854 | 140059 | 94.09% | 106050 | 75.72% | 9857 | 9.29% | 15436 | 14.56% |
| Total | 1621752 | 1517411 | 93.54% | 1066941 | 70.31% | 22246 | 2.09% | 122871 | 11.52% |
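The percentage columns in the table above can be reproduced from the read counts. The short sketch below does this for sample B011; it assumes that each percentage is taken relative to the preceding processing stage (filtered over raw, assigned over filtered, probiotic and pathogen reads over assigned), which is inferred from the numbers rather than stated in the text. The helper name `pct` is illustrative.

```python
# Read counts for sample B011, copied from the table above.
raw_reads = 125420
filtered_reads = 117451
assigned_reads = 90952
probiotic_reads = 60
pathogen_reads = 3509


def pct(part, whole):
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100.0 * part / whole, 2)


# Assumed denominators: each stage is compared with the stage before it,
# and both bacterial groups are compared with the taxonomy-assigned reads.
percentages = {
    "filtered_of_raw": pct(filtered_reads, raw_reads),
    "assigned_of_filtered": pct(assigned_reads, filtered_reads),
    "probiotic_of_assigned": pct(probiotic_reads, assigned_reads),
    "pathogen_of_assigned": pct(pathogen_reads, assigned_reads),
}

for name, value in percentages.items():
    print(f"{name}: {value:.2f}%")
```

Under these assumptions the four values agree with the percentages listed for B011.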
Figure 1. Relative abundance of probiotics and pathogenic bacteria in the human gut across all samples. (A) The percentage of probiotics identified in each sample. (B) The proportion of pathogenic bacteria identified in each sample in the case study.
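The quantities (matched sequenced reads) of probiotics identified in the samples in the case study
| B011 | B012 | B013 | B014 | B016 | B017 | B018 | B019 | B020 | B031 | B033 | B034 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |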
| 0 | 81 | 6 | 1 | 2 | 0 | 3 | 0 | 1 | 0 | 9 | 1 | ||
| 4 | 3 | 1520 | 81 | 1 | 372 | 185 | 177 | 1 | 5 | 0 | 375 | ||
| 0 | 101 | 37 | 3 | 1 | 16 | 32 | 1 | 1 | 0 | 0 | 50 | ||
| 0 | 3 | 3 | 0 | 0 | 84 | 2 | 0 | 0 | 0 | 0 | 21 | ||
| 0 | 1092 | 465 | 96 | 6 | 102 | 212 | 13 | 2 | 9 | 18 | 79 | ||
| 3 | 1859 | 1092 | 198 | 27 | 256 | 439 | 34 | 5 | 15 | 55 | 238 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 10 | ||
| 0 | 10 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | ||
| 0 | 0 | 0 | 0 | 1 | 0 | 0 | 4 | 0 | 1 | 0 | 28 | ||
| 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 77 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 7 | ||
| 0 | 1 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | ||
| 1 | 0 | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | ||
| 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 2 | ||
| 2 | 1 | 2 | 8 | 1 | 5 | 3 | 1 | 3 | 11 | 2 | 6753 | ||
| 2 | 0 | 0 | 6 | 1 | 0 | 2 | 0 | 0 | 16 | 1 | 10 | ||
| 48 | 305 | 324 | 213 | 8 | 229 | 31 | 75 | 110 | 2071 | 144 | 2204 | ||
Species with zero reads in all samples are not shown.
*The leading three probiotics are Lactococcus salivarius, Streptococcus thermophilus and Bifidobacterium longum.
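As a consistency check, the per-sample column sums of the probiotic read table can be compared with the probiotic read totals in the quality-filtering table. The sketch below assumes that the twelve columns follow the sample order B011 to B034 and that the seventh column of the quality-filtering table holds the per-sample probiotic totals; both are inferences from the data, and the variable names are illustrative.

```python
# Probiotic read counts: one row per species, one column per sample,
# copied from the table above.
probiotic_counts = [
    [0, 81, 6, 1, 2, 0, 3, 0, 1, 0, 9, 1],
    [4, 3, 1520, 81, 1, 372, 185, 177, 1, 5, 0, 375],
    [0, 101, 37, 3, 1, 16, 32, 1, 1, 0, 0, 50],
    [0, 3, 3, 0, 0, 84, 2, 0, 0, 0, 0, 21],
    [0, 1092, 465, 96, 6, 102, 212, 13, 2, 9, 18, 79],
    [3, 1859, 1092, 198, 27, 256, 439, 34, 5, 15, 55, 238],
    [0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 10],
    [0, 10, 1, 1, 0, 0, 0, 0, 0, 1, 0, 0],
    [0, 0, 0, 0, 1, 0, 0, 4, 0, 1, 0, 28],
    [0, 0, 0, 1, 1, 0, 0, 0, 0, 0, 0, 77],
    [0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 7],
    [0, 1, 2, 0, 0, 0, 0, 0, 0, 0, 0, 1],
    [1, 0, 0, 2, 0, 0, 0, 0, 0, 0, 0, 0],
    [0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 1],
    [0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 2],
    [2, 1, 2, 8, 1, 5, 3, 1, 3, 11, 2, 6753],
    [2, 0, 0, 6, 1, 0, 2, 0, 0, 16, 1, 10],
    [48, 305, 324, 213, 8, 229, 31, 75, 110, 2071, 144, 2204],
]

# Assumed column order, taken from the quality-filtering table.
samples = ["B011", "B012", "B013", "B014", "B016", "B017",
           "B018", "B019", "B020", "B031", "B033", "B034"]

# Probiotic read totals per sample from the quality-filtering table.
reported_totals = [60, 3457, 3452, 611, 49, 1064,
                   910, 305, 123, 2129, 229, 9857]

# Sum each column (sample) across all species rows.
column_sums = [sum(column) for column in zip(*probiotic_counts)]

# Collect any sample whose summed reads differ from the reported total.
mismatches = {
    sample: (summed, reported)
    for sample, summed, reported in zip(samples, column_sums, reported_totals)
    if summed != reported
}

print(dict(zip(samples, column_sums)))
print("mismatches:", mismatches)
```

An empty `mismatches` dictionary indicates that the two tables agree under the assumed column order.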
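The quantities (matched sequenced reads) of pathogens identified in the samples in the case study
| B011 | B012 | B013 | B014 | B016 | B017 | B018 | B019 | B020 | B031 | B033 | B034 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |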
| 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| 0 | 0 | 0 | 11 | 0 | 0 | 0 | 0 | 0 | 11 | 1 | 40 | ||
| 0 | 38 | 5048 | 4 | 5 | 2 | 1 | 361 | 153 | 1211 | 115 | 59 | ||
| 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | ||
| 0 | 1 | 2 | 0 | 0 | 0 | 0 | 3 | 10 | 24 | 12 | 93 | ||
| 0 | 1 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | ||
| 57 | 13 | 1 | 4 | 8 | 4 | 0 | 20 | 5 | 19 | 6 | 38 | ||
| 41 | 8 | 2 | 6 | 5 | 2 | 1 | 22 | 3 | 13 | 1 | 32 | ||
| 1744 | 8560 | 7900 | 3637 | 10651 | 4404 | 5691 | 1424 | 1733 | 210 | 165 | 4483 | ||
| 2 | 1771 | 2 | 1055 | 8 | 49 | 1 | 171 | 15 | 1802 | 2322 | 4502 | ||
| 0 | 2 | 0 | 3 | 1 | 0 | 1 | 1 | 1 | 1 | 1 | 1 | ||
| 1 | 6 | 6 | 4 | 2 | 2 | 3 | 1 | 3 | 0 | 0 | 3 | ||
| 1570 | 9291 | 7978 | 1849 | 9864 | 4209 | 5726 | 622 | 1658 | 303 | 44 | 4495 | ||
| 41 | 243 | 239 | 32 | 308 | 122 | 192 | 8 | 41 | 1 | 1 | 98 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | ||
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | ||
| 0 | 69 | 3 | 0 | 5 | 0 | 1 | 0 | 0 | 3 | 6 | 5 | ||
| 46 | 26 | 9 | 16 | 1 | 36 | 3 | 46 | 25 | 272 | 5 | 154 | ||
| 7 | 76 | 149 | 14 | 10 | 112 | 9 | 94 | 23 | 417 | 45 | 1428 | ||
| 0 | 3 | 0 | 0 | 0 | 1 | 1 | 0 | 1 | 0 | 1 | 0 | ||
| 0 | 0 | 1 | 29 | 2 | 0 | 0 | 1 | 0 | 906 | 0 | 5 | ||
Species with zero reads in all samples are not shown.
*The leading three pathogens are Escherichia coli, Salmonella enterica and Haemophilus influenzae.
The result of disease risk evaluations of 12 samples
| 2.67E-01 | 2.67E-01 | 2.67E-01 | 1.00E+00 | 1.00E+00 | 2.67E-01 | 1.00E+00 | 1.00E+00 | 2.67E-01 | 2.67E-01 | 1.00E+00 | ||
| 1.34E-01 | 1.34E-01 | 1.00E+00 | 1.34E-01 | 1.00E+00 | 1.34E-01 | 1.34E-01 | 1.00E+00 | 1.00E+00 | 1.34E-01 | 1.00E+00 | ||
| 3.33E-01 | 7.06E-01 | 1.00E+00 | 3.33E-01 | 1.00E+00 | 3.33E-01 | 3.33E-01 | 1.00E+00 | 1.10E-01 | 1.10E-01 | 7.06E-01 | 3.33E-01 | |
| 9.30E-02 | 4.15E-01 | 1.00E+00 | 4.15E-01 | 1.00E+00 | 9.30E-02 | 4.15E-01 | 1.00E+00 | 9.30E-02 | 1.00E+00 | 1.00E+00 | 1.00E+00 | |
| 4.88E-01 | 2.59E-01 | 9.35E-01 | 7.47E-01 | 7.47E-01 | 7.47E-01 | 2.59E-01 | 4.88E-01 | 4.88E-01 | 7.47E-01 |||
| 1.83E-01 | 1.00E+00 | 1.00E+00 | 1.00E+00 | 1.00E+00 | 1.83E-01 | 1.83E-01 | 1.00E+00 | 1.00E+00 | 1.00E+00 | 1.00E+00 | 1.83E-01 | |
| 1.89E-01 | 1.00E+00 | 1.00E+00 | 1.00E+00 | 1.00E+00 | 1.89E-01 | 1.89E-01 | 1.89E-01 | 1.00E+00 | 1.89E-01 | 1.00E+00 | 1.89E-01 |
Bold values mark the two samples that reached significance (P-value less than 0.05) in three diseases when compared with the 98-sample control group using the evaluation model.
Figure 2. System flow of the bioinformatics analysis in the proposed platform. The platform comprises the NGS analysis pipeline, construction of the probiotics and pathogens database, bacterial disease risk model evaluation, and application of the individualized bacterial sequencing profile.
Disease-related biomarkers of seven diseases
| - | 2.86E-03 | 1.52E-01 | 35 | 35 | 20039451 | ||
| - | 1.41E-03 | 4.61E-02 | 14 | 12 | 22315951 | ||
| - | 6.10E-05 | 9.45E-03 | 14 | 12 | 22315951 | ||
| - | 5.39E-05 | 1.73E-02 | 14 | 12 | 22315951 | ||
| + | 1.00E-02 | 4.26E-01 | 14 | 12 | 22315951 | ||
| + | 1.16E-05 | 4.98E-03 | 8 | 15 | 20014457 | ||
| - | 2.46E-03 | 5.36E-01 | 23 | 13 | 20876719 | ||
| - | 5.39E-05 | 1.73E-02 | 33 | 30 | 19498350 | ||
| - | 3.11E-03 | 6.74E-02 | 3 | 3 | 19164560 | ||
| - | 1.43E-05 | 1.78E-02 | 3 | 3 | 19164560 | ||
| - | 1.43E-05 | 1.78E-02 | 3 | 3 | 19164560 | ||
| + | 7.70E-04 | 2.15E-02 | 15 | 13 | 19849869 | ||
| + | 6.10E-05 | 9.45E-03 | 20 | 20 | 19774074 | ||
| + | 3.26E-05 | 4.72E-03 | 3 | 3 | 19164560 | ||
| + | 1.35E-04 | 6.64E-03 | 3 | 3 | 19164560 | ||
| - | 7.63E-04 | 5.44E-02 | 13 | 22 | 21073731 | ||
| - | 1.55E-03 | 4.21E-02 | 13 | 22 | 21073731 | ||
| - | 2.22E-05 | 1.68E-03 | 13 | 22 | 21073731 | ||
| - | 7.70E-04 | 2.15E-02 | 13 | 27 | 19235886 | ||
| - | 9.18E-02 | 4.50E-01 | 13 | 27 | 19235886 | ||
| - | 2.48E-03 | 6.03E-02 | 31 | 30 | 21253779 | ||
| - | 9.65E-06 | 1.05E-03 | 13 | 27 | 19235886 | ||
| - | 5.39E-05 | 1.73E-02 | 13 | 27 | 19235886 | ||
| - | 2.04E-04 | 1.81E-02 | 13 | 22 | 21073731 | ||
| + | 2.86E-03 | 1.52E-01 | 9 | 9 | 16954244 | ||
| - | 6.10E-05 | 9.45E-03 | 68 | 256 | 17604093 | ||
| - | 8.09E-05 | 1.84E-02 | 7 | 27 | 20626364 | ||
| + | 6.56E-02 | 6.37E-01 | 68 | 256 | 17604093 | ||
| + | 0.00E+00 | 1.06E-04 | 15 | 15 | 21963389 | ||
| - | 7.63E-04 | 5.44E-02 | 46 | 56 | 21850056 | ||
| - | 1.41E-03 | 4.61E-02 | 46 | 56 | 21850056 | ||
| - | 3.32E-05 | 2.64E-02 | 50 | 38 | 7574628 | ||
| - | 1.36E-03 | 7.92E-02 | 46 | 56 | 21850056 | ||
| - | 1.91E-05 | 2.89E-03 | 21 | 23 | 20740058 | ||
| - | 2.39E-05 | 2.09E-03 | 50 | 38 | 7574628 | ||
| - | 4.07E-04 | 2.60E-02 | 46 | 56 | 21850056 | ||
| - | 9.39E-04 | 4.85E-02 | 46 | 56 | 21850056 | ||
| + | 3.05E-03 | 1.85E-01 | 46 | 56 | 21850056 | ||
| + | 1.51E-03 | 8.84E-02 | 46 | 56 | 21850056 | ||
| + | 7.22E-06 | 1.92E-02 | 46 | 56 | 21850056 | ||
| + | 0.00E+00 | 1.59E-05 | 46 | 56 | 21850056 | ||
| + | 7.70E-04 | 2.15E-02 | 50 | 38 | 7574628 | ||
| + | 0.00E+00 | 4.95E-04 | 50 | 38 | 7574628 | ||
| + | 1.12E-04 | 6.83E-03 | 46 | 56 | 21850056 | ||
| + | 0.00E+00 | 6.77E-05 | 50 | 38 | 7574628 | ||
| + | 0.00E+00 | 1.19E-04 | 46 | 56 | 21850056 | ||
| + | 0.00E+00 | 2.60E-05 | 50 | 38 | 7574628 | ||
| + | 0.00E+00 | 1.10E-04 | 50 | 38 | 7574628 | ||
| + | 1.04E-05 | 2.64E-03 | 50 | 38 | 7574628 | ||
| + | 7.09E-05 | 2.05E-02 | 50 | 38 | 7574628 | ||
| + | 8.13E-05 | 1.34E-02 | 50 | 38 | 7574628 | ||
| + | 4.87E-05 | 2.94E-02 | 50 | 38 | 7574628 | ||
| + | 1.00E-02 | 4.26E-01 | 10 | 10 | 21647227 | ||
| + | 1.35E-04 | 6.64E-03 | 50 | 38 | 7574628 | ||
| + | 5.67E-05 | 6.08E-03 | 21 | 23 | 20740058 | ||
| + | 1.56E-05 | 3.60E-03 | 50 | 38 | 7574628 | ||
| + | 1.66E-03 | 6.79E-02 | 21 | 23 | 20740058 | ||
| - | 7.63E-04 | 5.44E-02 | 11 | 22 | 21073731 | ||
| - | 1.55E-03 | 4.21E-02 | 11 | 22 | 21073731 | ||
| - | 2.22E-05 | 1.68E-03 | 11 | 22 | 21073731 | ||
| - | 7.70E-04 | 2.15E-02 | 23 | 23 | 22339879 | ||
| - | 2.87E-01 | 7.95E-01 | 62 | 46 | 21820992 | ||
| - | 5.39E-05 | 1.73E-02 | 62 | 46 | 21820992 | ||
| - | 2.04E-04 | 1.81E-02 | 11 | 22 | 21073731 | ||
| - | 1.66E-03 | 6.79E-02 | 62 | 46 | 21820992 | ||
| + | 2.86E-03 | 1.52E-01 | 14 | 18 | 22356587 | ||
| + | 1.02E-05 | 1.69E-03 | 22 | 22 | 21741921 | ||
| + | 3.32E-05 | 2.64E-02 | 23 | 23 | 22339879 | ||
| + | 1.75E-02 | 4.69E-01 | 22 | 22 | 21741921 | ||
| + | 1.22E-03 | 4.08E-02 | 62 | 46 | 21820992 | ||
| + | 0.00E+00 | 1.19E-04 | 23 | 23 | 22339879 | ||
| + | 1.12E-05 | 7.82E-03 | 26 | 26 | 19903265 | ||
| + | 6.10E-05 | 9.45E-03 | 23 | 23 | 22339879 | ||
| + | 5.67E-05 | 6.08E-03 | 62 | 46 | 21820992 | ||
| - | 6.10E-05 | 9.45E-03 | 12 | 12 | 19714856 | ||
| - | 5.39E-05 | 1.73E-02 | 67 | 20 | 101 | ||
| + | 7.22E-06 | 1.92E-02 | 22 | 22 | 17893165 | ||
| + | 7.70E-04 | 2.15E-02 | 22 | 22 | 17893165 |
The associations between bacteria and diseases were collected mainly from case–control studies in which bacterial quantities were obtained from deep sequencing data. The proportions of 78 bacteria in the control group were applied as risk markers (constipation: 6, obesity: 9, IBS: 17, UC: 10, CC: 28, AD: 4, AR: 4) to predict the risk of seven diseases in this study.
Figure 3. An example of evaluating obesity risk with the bacterial disease risk evaluation model. The model uses the lower and upper proportion bounds of 9 markers from 98 control samples to define the risk markers of two samples (B034 and B031), followed by a binomial test.
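The evaluation described in the Figure 3 caption (proportion bounds from controls, then a binomial test) can be sketched roughly as follows. This is not the authors' implementation: the marker names, bounds, sample proportions, the rule for counting a marker as a risk marker, and the null probability of 0.05 are all hypothetical choices made for illustration.

```python
from math import comb


def binomial_upper_tail(k, n, p):
    """P(X >= k) for X ~ Binomial(n, p): a one-sided binomial test p-value."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def count_risk_markers(sample, markers):
    """Count markers whose proportion leaves the control range in the risk direction.

    Assumed rule: a "+" marker counts when the sample proportion exceeds the
    upper control bound; a "-" marker counts when it falls below the lower bound.
    """
    hits = 0
    for name, (direction, lower, upper) in markers.items():
        value = sample.get(name, 0.0)
        if direction == "+" and value > upper:
            hits += 1
        elif direction == "-" and value < lower:
            hits += 1
    return hits


# Hypothetical markers: (direction, lower bound, upper bound) of the
# proportion observed in control samples. Not values from the paper.
markers = {
    "taxon_a": ("+", 0.001, 0.020),
    "taxon_b": ("-", 0.010, 0.150),
    "taxon_c": ("+", 0.000, 0.005),
    "taxon_d": ("-", 0.002, 0.050),
}

# Hypothetical bacterial proportions for one sample.
sample = {"taxon_a": 0.031, "taxon_b": 0.004, "taxon_c": 0.001, "taxon_d": 0.010}

# Assumed chance that a single marker leaves its control range by chance alone.
null_probability = 0.05

hits = count_risk_markers(sample, markers)
p_value = binomial_upper_tail(hits, len(markers), null_probability)

print(f"risk markers: {hits} of {len(markers)}, p-value: {p_value:.6f}")
```

In this toy input, two of the four markers fall outside their control range, and the one-sided test asks how likely at least that many would be if each marker left its range independently with the assumed null probability.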