| Literature DB >> 26697361 |
Mushafau Adewale Akinsanya1, Joo Kheng Goh2, Siew Ping Lim2, Adeline Su Yien Ting2.
Abstract
Next generation sequencing (NGS) enables rapid analysis of the composition and diversity of microbial communities in several habitats. We applied the high throughput techniques of NGS to the metagenomics study of endophytic bacteria in Aloe vera plant, by assessing its PCR amplicon of 16S rDNA sequences (V3-V4 regions) with the Illumina metagenomics technique used to generate a total of 5,199,102 reads from the samples. The analyses revealed Proteobacteria, Firmicutes, Actinobacteria and Bacteriodetes as the predominant genera. The roots have the largest composition with 23% not present in other tissues. The stems have more of the genus-Pseudomonas and the unclassified Pseudomonadaceae. The α-diversity analysis indicated the richness and inverse Simpson diversity index of the bacterial endophyte communities for the leaf, root and stem tissues to be 2.221, 6.603 and 1.491 respectively. In a similar study on culturable endophytic bacteria in the same A. vera plants (unpublished work), the dominance of Pseudomonas and Bacillus genera was similar, with equal proportion of four species each in root, stem and leaf tissues. It is evident that NGS technology captured effectively the metagenomics of microbiota in plant tissues and this can improve our understanding of the microbial-plant host interactions.Entities:
Keywords: Aloe vera; Bacterial endophytes; Illumina; Metagenomics; NGS; α-Diversity
Year: 2015 PMID: 26697361 PMCID: PMC4664749 DOI: 10.1016/j.gdata.2015.09.004
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Raw reads statistics and sequence quality assessment of 16S rRNA sequence from A. vera tissues.
| Sample reference | Sample label | Sequence type | Sequence format | Read type | Read size (bp) | Total number of reads | Total sequence length (nt) | GC% |
|---|---|---|---|---|---|---|---|---|
| Root | 22 | Illumina MiSeq | Fastq | Paired-end | 35–151 | 2,528,030 | 361,652,861 | 52.01 |
| Stem | 23 | 1,298,892 | 191,468,046 | 50.01 | ||||
| Leaf | 24 | 1,372,180 | 200,256,446 | 50.86 |
Raw data from MiSeq sequencing reporter (MSR).
Sequence processed details: merged sequence.
| Sample reference | Before merge process | After merge process |
|---|---|---|
| Root | 2,528,030 (361,652,861) | 1,264,015 (220,836,340) |
| Stem | 1,298,892 (191,468,046) | 649,446 (124,733,765) |
| Leaf | 1,372,180 (200,256,446) | 686,090 (127,684,140) |
| Total | 2,599,551 (473,254,245) |
Sequence input (forward and reverse sequences), quality encoding (Illumine 1.8 +) and Alignment method (needleman).
Sequence processed details.
| Sequence details | Number of sequence | Percentage |
|---|---|---|
| Merge sequence | 2,599,551 | |
| Removed redundancy sequence | 152,919 | 100 |
| Contaminant removal | ||
| Chimeric | 1779 | 1.16 |
| Chloroplast | 34,816 | 22.77 |
| Mitochondria | 487 | 0.32 |
| Eukaryote | 42 | 0.03 |
| Unknown | 3 | 0 |
| Cleaned sequence | 115,792 | 75.72 |
Fig. 1Bacterial taxonomic composition histogram. The average composition of bacteria communities obtained from surface sterilized tissues of A. vera using culture-independent method (MiSeq Illumina platform) was analysed and compared. The nomenclatures of the phylotypes are based on the SILVA rRNA database (http://www.mothur.org/wiki/Silva_reference_files).
Diversity and richness of the communities in plant tissue samples.
| Group | Method | Number of sequences | Coverage % | Observed OTUs | Inverse Simpson diversity index |
|---|---|---|---|---|---|
| Leaf | Average | 430,023 | 99.992 | 175.0 | 2.221 |
| Root | Average | 430,023 | 99.994 | 211.0 | 6.603 |
| Stem | Average | 430,023 | 99.991 | 147.9 | 1.491 |
| Leaf | STDEV | 0 | 0 | 0 | 0.000071 |
| Root | STDEV | 0 | 0 | 0 | 0 |
| Stem | STDEV | 0 | 0 | 0.063119 | 0.000024 |
ANOVA statistical analysis showed that there are significant differences of the observed OTUs and inverse Simpson diversity index between the tissues. Coverage also reflected over 99% sampling of the communities in the tissues.
Fig. 2Venn diagram describing the OTU distribution across tissue samples.