| Literature DB >> 28330225 |
Sudipta Sankar Bora1, Jyotshna Keot1, Saurav Das1, Kishore Sarma1, Madhumita Barooah2.
Abstract
This is the first report on the microbial diversity of xaj-pitha, a rice wine fermentation starter culture through a metagenomics approach involving Illumine-based whole genome shotgun (WGS) sequencing method. Metagenomic DNA was extracted from rice wine starter culture concocted by Ahom community of Assam and analyzed using a MiSeq® System. A total of 2,78,231 contigs, with an average read length of 640.13 bp, were obtained. Data obtained from the use of several taxonomic profiling tools were compared with previously reported microbial diversity studies through the culture-dependent and culture-independent method. The microbial community revealed the existence of amylase producers, such as Rhizopus delemar, Mucor circinelloides, and Aspergillus sp. Ethanol producers viz., Meyerozyma guilliermondii, Wickerhamomyces ciferrii, Saccharomyces cerevisiae, Candida glabrata, Debaryomyces hansenii, Ogataea parapolymorpha, and Dekkera bruxellensis, were found associated with the starter culture along with a diverse range of opportunistic contaminants. The bacterial microflora was dominated by lactic acid bacteria (LAB). The most frequent occurring LAB was Lactobacillus plantarum, Lactobacillus brevis, Leuconostoc lactis, Weissella cibaria, Lactococcus lactis, Weissella para mesenteroides, Leuconostoc pseudomesenteroides, etc. Our study provided a comprehensive picture of microbial diversity associated with rice wine fermentation starter and indicated the superiority of metagenomic sequencing over previously used techniques.Entities:
Keywords: Metagenomics; Rice wine and microflora; Starter culture; Taxonomy
Year: 2016 PMID: 28330225 PMCID: PMC4947050 DOI: 10.1007/s13205-016-0471-1
Source DB: PubMed Journal: 3 Biotech ISSN: 2190-5738 Impact factor: 2.406
Biochemical composition of starter cultures collected from Titabar sub-division, Jorhat district, Assam, India
| Sample code | Weight in gm/shape | Moisture | Crude fat % | Crude protein % | Crude fiber % | Ash % | Starch % | Total soluble sugar % | Reducing sugar % | Non-reducing sugar % |
|---|---|---|---|---|---|---|---|---|---|---|
| ABT-S4J3 | 11.598/round | 13.61 | 0.76 | 7.48 | 1.863 | 1.13 | 74.58 | 1.211 | 0.297 | 0.914 |
| ABT-S5J3 | 12.024/oval | 14.46 | 0.96 | 8.74 | 2.006 | 1.27 | 72.39 | 1.267 | 0.276 | 0.991 |
| ABT-S6J3 | 12.560/oval | 14.34 | 0.87 | 8.19 | 1.762 | 1.14 | 73.37 | 1.013 | 0.345 | 0.668 |
| ABT-S7J3 | 14.462/round | 13.89 | 0.86 | 7.36 | 1.634 | 0.83 | 75.29 | 1.027 | 0.297 | 0.730 |
| ABT-S8J3 | 12.220/oval | 13.21 | 0.76 | 6.38 | 2.224 | 0.61 | 76.78 | 1.164 | 0.067 | 1.097 |
Fig. 1Analysis strategy performed to analyze microbial diversity prevalent in the starter culture sample. DNA from an efficient starter sample was used for whole genome shotgun (WGS) sequencing
Fig. 2Flow chart depicting functional category hit distribution. 5302 sequences failed quality control. Of those, de-replication identified 1240 sequences (0.4 % of total) as artificial duplicate reads (ADRs). These include protein databases, protein databases with functional hierarchy information, and ribosomal RNA databases. The bars representing annotated reads are colored by e value range
Sequence statistics for the starter sample covering raw and high-quality sequence data used in the downstream analysis
| Analysis statistics | Values (in number) |
|---|---|
| Number of contig | 278,231 |
| Total contig length | 178,534,809 |
| Maximum contig length | 213,370 |
| Minimum contig length | 275 |
| Base-pair count | 178,540,333 |
| Mean sequence length | 641 ± 1650 |
| Mean GC percent | 42 ± 10 |
| Artificial Duplicate Reads (ADRs) | 1240 |
| Post QC base-pair count | 143,119,277 |
| Post QC sequence count | 272,929 |
| Mean sequence length | 524 ± 397 |
| Predicted protein features | 305,993 |
| Predicted rRNA features | 35,780 |
| Identified protein features | 114,064 |
| Identified rRNA features | 476 |
| Identified functional categories | 61,377 |
Fig. 3MG-RAST analyses of starter sample WGS metagenomics sequences. a Genus-level assignment of the sequences; b Functional annotation of predicted protein sequences
Fig. 4Phylum abundance ordered from the most abundant to least abundant. Only the top 50 most abundant are shown. The Y-axis plots the abundances of annotations in each phylum on a log scale. The rank abundance curve is a tool for visually representing taxonomic richness and evenness
Fig. 5Phylogenetic diversity was computed using the LCA algorithm based on a BLASTX comparison of all the contigs against the NCBI-NR database. Each circle represents a taxon in the NCBI taxonomy and is labeled by its name and the number of contigs that are assigned either directly to the taxon, or indirectly via one of its subtaxa. The size of the circle is scaled logarithmically to represent the number of contigs assigned directly to the taxon