| Literature DB >> 25208077 |
Nele Ilmberger1, Simon Güllert1, Joana Dannenberg1, Ulrich Rabausch1, Jeremy Torres1, Bernd Wemheuer2, Malik Alawi3, Anja Poehlein2, Jennifer Chow1, Dimitrij Turaev4, Thomas Rattei4, Christel Schmeisser1, Jesper Salomon5, Peter B Olsen5, Rolf Daniel2, Adam Grundhoff6, Martin S Borchert5, Wolfgang R Streit1.
Abstract
A phylogenetic and metagenomic study of elephant feces samples (derived from a three-weeks-old and a six-years-old Asian elephant) was conducted in order to describe the microbiota inhabiting this large land-living animal. The microbial diversity was examined via 16S rRNA gene analysis. We generated more than 44,000 GS-FLX+454 reads for each animal. For the baby elephant, 380 operational taxonomic units (OTUs) were identified at 97% sequence identity level; in the six-years-old animal, close to 3,000 OTUs were identified, suggesting high microbial diversity in the older animal. In both animals most OTUs belonged to Bacteroidetes and Firmicutes. Additionally, for the baby elephant a high number of Proteobacteria was detected. A metagenomic sequencing approach using Illumina technology resulted in the generation of 1.1 Gbp assembled DNA in contigs with a maximum size of 0.6 Mbp. A KEGG pathway analysis suggested high metabolic diversity regarding the use of polymers and aromatic and non-aromatic compounds. In line with the high phylogenetic diversity, a surprising and not previously described biodiversity of glycoside hydrolase (GH) genes was found. Enzymes of 84 GH families were detected. Polysaccharide utilization loci (PULs), which are found in Bacteroidetes, were highly abundant in the dataset; some of these comprised cellulase genes. Furthermore the highest coverage for GH5 and GH9 family enzymes was detected for Bacteroidetes, suggesting that bacteria of this phylum are mainly responsible for the degradation of cellulose in the Asian elephant. Altogether, this study delivers insight into the biomass conversion by one of the largest plant-fed and land-living animals.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25208077 PMCID: PMC4160196 DOI: 10.1371/journal.pone.0106707
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Overall numbers of sequences and contigs generated for the two elephant feces samples.
| Parameter | Six-years-old elephant | Three-weeks-old elephant |
|
| ||
| No. of GS-FLX+454 sequences | 56,124 | 44,508 |
|
| ||
| No. of Illumina reads | 415,784,264 | 200,946,703 |
| No. of bases assembled | 929,519,943 | 138,494,152 |
| No. of assembled sequences | 260,535 | 50,765 |
| Coverage | 42 | 73 |
| Mean contig size (bp) | 3,566 | 5,801 |
| N50 (bp) | 5,751 | 22,052 |
| Largest contig (bp) | 344,979 | 597,113 |
| Gene count | 1,068,385 | 173,134 |
| GC mean % | 43,96 | 47,71 |
| Protein coding genes | 1,052,245 | 171,134 |
| with COG | 629,485 | 111,920 |
| with Pfam | 818,373 | 136,417 |
| with KEGG | 210,308 | 42,640 |
Figure 1Rarefaction curves calculated for the feces sample of the three-weeks-old and the six-years-old Asian elephant at 3% genetic distance of 16S rRNA genes.
The curve for the six-years-old elephant comprises 8,014 archaeal sequences which were clustered to 54 OTUs. The sequences were denoised employing Acacia. Chimeric sequences were removed using UCHIME in reference mode with the most recent SILVA SSU database as reference dataset (SSURef 115 NR).
Figure 2Relative abundances of different phyla and classes in the two elephant feces samples.
A: Relative abundance of phyla in the feces of the three-weeks-old and the six-years-old Asian elephant based on 16S rRNA gene sequences. For the three-weeks-old elephant no Archaea were observed. B: Phylogenetic comparison on class level between both elephants. Heat map colors indicate the abundances of the respective 16S rRNA genes.
Figure 3Phylogenetic analysis of the elephant feces in comparison with other fecal and intestinal metagenome data sets.
Data indicate the phylogenetic relation based on gene similarities in the metagenome sequences. The percent of sequences assigned to each phylum according to IMG/M ER is shown based on the total number of obtained sequences of each data set. Sequence data for the metagenomes were extracted from the IMG/M ER web page of the US Department of Energy Joint Genome Institute and the respective bioprojects (IMG Genome IDs: Six-years-old Elephant (this study): 3300001598; three-weeks-old Elephant (this study): 3300001919; Green Cockroach: 2228664000; Termite: 3300001544; Dog: 2019105001; Reindeer: 2088090000; Neotropical Beetle: 3300000114; Asian Long-Horned Beetle: 2084038013; Bovine Rumen: 2061766007; Northwest Shipworm: 2189573029; Human stool: 7000000038).
Selected and recent metagenome studies published from insect or mammalian fecal and gut samples.
| Microbiome | OTUs detected | Assemble DNA (Mbp) | Sequencing strategy | No. of GH families identified | Total no. of CAZymes | No. of GH per Mbp assembled DNA | f)Total no. of cellulolytic enzymes | Reference |
| Iberian lynx | n.d. | 18.3 | 454 GS FLX | 42 | 372 | 20.3 | 39 |
|
| Tammar wallaby | 236 | 85.2 | 454 GS FLX | 53 | 557 | 6.5 | 175 |
|
| Reindeera | 1,182 | 26 | 454 GS FLX | 30 | d)5,000 | e)9.9 | d)400 |
|
| Giant panda | 85 | 37 | Illumina | 44 | 440 | 11.9 | 124 |
|
| Bovine | 1,000 | b)1,930 | Illumina | 60 | 27,755 | 14.4 | 5,670 |
|
| Bovine | 161–259 | c)104 | 454 GS20 | 35 | 3,828 | c)36.8 | 1,017 |
|
| Termite | 216 | 71 | 454 GS20+ Sanger | 45 | 703 | 9.9 | 259 |
|
| Elephant gut (six-years-old) | 2,656 | 929.5 | Illumina | 82 | 11,038 | 11.9 | 2,074 | This study |
| Elephant gut(three-weeks-old) | 380 | 138.5 | Illumina | 70 | 1,873 | 13.5 | 242 | This study |
n.d., not determined; a), in 2 samples; b), 286 Gb resulted in approx. 2 Gb of assembled DNAs >1 kb scaffolds; c),unassembled; d), in unassembled reads; e), per 503 Mbp unassembled DNA; f), total gene count with respect to the GH1, GH3, GH 5, GH6, GH8, GH9, GH44, GH45, GH48, GH51, GH74, GH94 family enzymes. Total gene counts, size of assembled DNAs, OTUs and numbers of carbohydrate active enzymes were extracted from the indicated references.
Gene count, relative gene count and relative coverage of genes for families of carbohydrate-active enzymes discovered in the sequences of the elephant feces samples according to CAZy.
| Baby elephant | Six-years-old elephant | |||||
|
| Gene count | Relative count | Relative coverage | Gene count | Relative count | Relative coverage |
|
| 33 | 0.01761 | 0.00979 | 103 | 0.00933 | 0.00667 |
| 2 | 220 | 0.11745 | 0.09455 | 917 | 0.08311 | 0.082006 |
|
| 148 | 0.07901 | 0.08355 | 804 | 0.07287 | 0.07784 |
| 4 | 16 | 0.00854 | 0.00430 | 57 | 0.00517 | 0.00405 |
|
| 19 | 0.0101 | 0.01064 | 517 | 0.04686 | 0.04808 |
|
| 4 | 0.00213 | 0.00134 | 85 | 0.00770 | 0.01198 |
|
| 9 | 0.00481 | 0.00406 | 119 | 0.01078 | 0.01199 |
| 10 | 6 | 0.00320 | 0.00836 | 258 | 0.02338 | 0.02915 |
| 11 | – | – | – | 20 | 0.00181 | 0.00142 |
| 13 | 124 | 0.06620 | 0.07574 | 845 | 0.07658 | 0.07423 |
| 15 | 1 | 0.00053 | 0.00019 | 1 | 0.00009 | 0.00008 |
| 16 | 22 | 0.01174 | 0.01399 | 198 | 0.01794 | 0.01552 |
| 17 | – | – | – | 5 | 0.00045 | 0.00022 |
| 18 | 51 | 0.02723 | 0.03378 | 86 | 0.00779 | 0.00824 |
| 19 | 1 | 0.00053 | 0.00150 | 3 | 0.00027 | 0.00011 |
| 20 | 108 | 0.05766 | 0.05088 | 266 | 0.02411 | 0.01881 |
| 23 | 69 | 0.03684 | 0.06012 | 294 | 0.02664 | 0.03324 |
| 24 | 12 | 0.00641 | 0.00685 | 26 | 0.00236 | 0.00191 |
| 25 | 30 | 0.01602 | 0.03030 | 229 | 0.02075 | 0.02255 |
| 26 | 8 | 0.00427 | 0.00341 | 103 | 0.00933 | 0.00931 |
| 27 | 12 | 0.00641 | 0.00591 | 158 | 0.01432 | 0.01366 |
| 28 | 26 | 0.01388 | 0.01502 | 242 | 0.02193 | 0.02112 |
| 29 | 79 | 0.04218 | 0.04032 | 376 | 0.03408 | 0.02950 |
|
| 17 | 0.00908 | 0.00727 | 95 | 0.00861 | 0.00975 |
| 31 | 35 | 0.01869 | 0.01000 | 318 | 0.02882 | 0.02962 |
| 32 | 21 | 0.01121 | 0.00879 | 119 | 0.01078 | 0.01409 |
| 33 | 31 | 0.01655 | 0.01639 | 149 | 0.01350 | 0.01003 |
| 35 | 36 | 0.01922 | 0.01745 | 123 | 0.01115 | 0.01155 |
| 36 | 32 | 0.01708 | 0.01637 | 310 | 0.02809 | 0.02396 |
| 37 | 3 | 0.00160 | 0.00112 | 17 | 0.00154 | 0.00144 |
| 38 | 12 | 0.00641 | 0.00653 | 81 | 0.00734 | 0.00645 |
| 39 | 3 | 0.00160 | 0.00374 | 89 | 0.00807 | 0.00554 |
| 42 | 9 | 0.00481 | 0.00262 | 37 | 0.00335 | 0.00247 |
| 43 | 98 | 0.05232 | 0.03978 | 894 | 0.08102 | 0.09836 |
|
| – | – | – | 7 | 0.00063 | 0.00116 |
|
| – | – | – | 7 | 0.00063 | 0.00084 |
| 46 | – | – | – | 1 | 0.00009 | 0.00004 |
| 50 | 2 | 0.00107 | 0.00343 | 26 | 0.00236 | 0.00184 |
|
| 24 | 0.01281 | 0.00653 | 239 | 0.02166 | 0.02139 |
| 53 | 6 | 0.00320 | 0.00632 | 88 | 0.00798 | 0.00976 |
| 54 | – | – | – | 13 | 0.00118 | 0.00136 |
| 55 | 2 | 0.00107 | 0.00030 | 8 | 0.00073 | 0.00052 |
| 57 | 11 | 0.00587 | 0.00893 | 115 | 0.01042 | 0.00941 |
| 63 | 7 | 0.00374 | 0.00434 | 57 | 0.00517 | 0.00318 |
| 64 | – | – | – | 2 | 0.00018 | 0.00008 |
| 65 | 7 | 0.00374 | 0.00574 | 39 | 0.00353 | 0.00285 |
| 66 | 3 | 0.00160 | 0.00068 | 10 | 0.00091 | 0.00069 |
| 70 | – | – | – | 1 | 0.00009 | 0.00002 |
| 73 | 36 | 0.01922 | 0.01851 | 202 | 0.01831 | 0.01721 |
|
| – | – | – | 11 | 0.00100 | 0.00084 |
| 76 | 12 | 0.00641 | 0.00510 | 9 | 0.00082 | 0.00081 |
| 77 | 28 | 0.01495 | 0.01611 | 193 | 0.01749 | 0.01614 |
| 78 | 41 | 0.02189 | 0.03159 | 413 | 0.03743 | 0.02897 |
| 79 | 3 | 0.00160 | 0.00264 | – | – | – |
| 81 | – | – | – | 3 | 0.00027 | 0.00009 |
| 84 | 9 | 0.00481 | 0.00250 | 19 | 0.00172 | 0.00165 |
| 88 | 33 | 0.01762 | 0.01190 | 54 | 0.00489 | 0.00342 |
| 92 | 79 | 0.042178 | 0.04649 | 213 | 0.01930 | 0.01900 |
| 93 | – | – | – | 2 | 0.00018 | 0.00009 |
|
| 5 | 0.00267 | 0.00140 | 89 | 0.00807 | 0.00906 |
| 95 | 47 | 0.02509 | 0.01727 | 164 | 0.01486 | 0.01587 |
| 97 | 44 | 0.02349 | 0.02573 | 197 | 0.01785 | 0.02091 |
| 98 | 1 | 0.00053 | 0.00021 | 23 | 0.00208 | 0.00330 |
| 99 | – | – | – | 2 | 0.00018 | 0.00012 |
| 102 | 3 | 0.00160 | 0.00084 | 4 | 0.00036 | 0.00010 |
| 103 | 3 | 0.00160 | 0.01026 | 2 | 0.00018 | 0.00005 |
| 104 | 4 | 0.00214 | 0.00193 | – | – | – |
| 105 | 25 | 0.01335 | 0.00548 | 126 | 0.01142 | 0.01231 |
| 106 | 13 | 0.00694 | 0.01614 | 97 | 0.00879 | 0.00832 |
| 108 | 6 | 0.00320 | 0.00165 | 14 | 0.00127 | 0.00081 |
| 109 | 22 | 0.01175 | 0.01299 | 125 | 0.01133 | 0.00876 |
| 110 | 15 | 0.00801 | 0.00812 | 30 | 0.00272 | 0.00237 |
| 113 | – | – | – | 11 | 0.00100 | 0.00036 |
| 115 | 7 | 0.00374 | 0.00223 | 74 | 0.00671 | 0.00891 |
| 116 | 8 | 0.00427 | 0.00147 | 26 | 0.00236 | 0.00139 |
| 117 | 4 | 0.00214 | 0.00079 | 9 | 0.00082 | 0.00055 |
| 120 | 6 | 0.00320 | 0.00197 | 45 | 0.00408 | 0.00468 |
| 123 | 10 | 0.00534 | 0.00288 | 46 | 0.00417 | 0.00276 |
| 125 | 17 | 0.00908 | 0.01145 | 35 | 0.00317 | 0.00313 |
| 126 | – | – | – | 2 | 0.00018 | 0.00004 |
| 127 | 11 | 0.00587 | 0.00802 | 88 | 0.00798 | 0.00639 |
| 128 | 1 | 0.00053 | 0.00298 | 15 | 0.00136 | 0.00121 |
| 129 | 1 | 0.00053 | 0.00024 | 7 | 0.00063 | 0.00034 |
| 130 | 22 | 0.01175 | 0.01020 | 127 | 0.01151 | 0.01197 |
GH families according to the CAZy database http://www.cazy.org; Searches for glycoside hydrolases were performed with pfam HMMs, named in accordance with the CAZy nomenclature scheme. - no count observed, GH families not listed were not detected in any of the samples; the relative counts indicate the relative number in comparison to all GHs in the respective sample, and the relative coverage indicates the respective coverage in relation to the coverage of all GHs in the respective sample; GH families with cellulases are in bold.
Figure 4Overall coverage of selected cellulolytic GH family genes in the feces samples in relation to their phylogenetic affiliation.
A) three-weeks-old elephant, B) six-years-old elephant.
Figure 5Physical map of selected putative polysaccharide utilization loci (PULs).
A)–E) show regions from the sequence data derived from the microbiome of the six-years-old elephant. The different contigs are available from www.jgi.doe.gov (DOE Joint Genome Institute) under the IMG Project Id: 50566 with the scaffold id numbers: EMG_10007792, EMG_10000304, EMG_10002947, EMG_10003848 and EMG_10000174. F–J) indicate PULs from the literature or databases. F) from Tammar wallaby foregut [9], G) from Svalbard reindeer rumen [8], [21], H) Bacteroides thetaiotaomicron [3], I) B. faecis CAG:32 (GenBank: FR891562.1), J) Prevotella sp. Sc00026 (GenBank JX424618.1). *indicates IPT/TIG domain containing hypothetical proteins which have also been annotated as SusE and SusF proteins.