| Literature DB >> 30667380 |
Hui-Zeng Sun1, Mingyuan Xue2, Le Luo Guan1, Jianxin Liu2.
Abstract
With the help of the bacteria in the rumen, ruminants can effectively convert human inedible plant fiber to edible food (meat and milk). However, the understanding of rumen bacteriome in dairy cows is still limited, especially in a large population under the same diet, breed, and milking period. Here we described the sequencing data of 16S rRNA gene of rumen bacteriome from 334 mid-lactation Holstein dairy cows generated using the Illumina HiSeq 2500 (PE250) platform. A total of 24,030,828 raw reads with an average of 71,946 ± 13,450 sequences per sample were obtained. The top ten genera with highest relative abundance accounted for 60.65% of total bacterial sequences. We observed 4,460 overall operational taxonomic units (1,827 ± 94 per sample) based on a 97% nucleotide sequence identity between reads. Totally 6,082 amplicon sequence variants (672 ± 131 per sample) were identified in 334 samples. The shareable datasets can be re-used by researchers to assess other rumen bacterial-related biological functions in dairy cows towards the improvement of animal production and health.Entities:
Mesh:
Substances:
Year: 2019 PMID: 30667380 PMCID: PMC6343516 DOI: 10.1038/sdata.2018.301
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1The reads output of sequencing data.
Q20 and Q30 refer to the percentage of bases with the quality score greater than 20 (sequencing error rate less than 1%) and 30 (sequencing error rate less than 0.1%) in the effective tag, respectively.
Figure 2The species accumulation boxplot and phylogenetic relationships.
(a) The species accumulation boxplot. The x-axis represents the number of samples, and the y-axis represents the number of identified OTUs. (b) The taxonomy tree generated from all the samples from kingdom to species levels. Only the top 10 most abundant genera related results were displayed. The average percentage of each taxa based on the total bacterial sequencing reads at different levels from 334 samples was labeled. The piechart with different colors within the circle indicates different samples. (c) OTU cluster tree under the top 10 most abundant genera. OTU: operational taxonomic unit.
Figure 3The quality assessment of sequencing data.
(a) The length distribution of merged reads. (b) The quality score distribution of sequencing data. (c) The error rate distribution of sequencing reads.