| Literature DB >> 32945998 |
Jiakuo Yan1, Xiaoyang Wu1, Jun Chen2, Yao Chen2, Honghai Zhang3.
Abstract
Sable (Martes zibellina), a member of family Mustelidae, order Carnivora, is primarily distributed in the cold northern zone of Eurasia. The purpose of this study was to explore the intestinal flora of the sable by metagenomic library-based techniques. Libraries were sequenced on an Illumina HiSeq 4000 instrument. The effective sequencing data of each sample was above 6000 M, and the ratio of clean reads to raw reads was over 98%. The total ORF length was approximately 603,031, equivalent to 347.36 Mbp. We investigated gene functions with the KEGG database and identified 7140 KEGG ortholog (KO) groups comprising 129,788 genes across all of the samples. We selected a subset of genes with the highest abundances to construct cluster heat maps. From the results of the KEGG metabolic pathway annotations, we acquired information on gene functions, as represented by the categories of metabolism, environmental information processing, genetic information processing, cellular processes and organismal systems. We then investigated gene function with the CAZy database and identified functional carbohydrate hydrolases corresponding to genes in the intestinal microorganisms of sable. This finding is consistent with the fact that the sable is adapted to cold environments and requires a large amount of energy to maintain its metabolic activity. We also investigated gene functions with the eggNOG database; the main functions of genes included gene duplication, recombination and repair, transport and metabolism of amino acids, and transport and metabolism of carbohydrates. In this study, we attempted to identify the complex structure of the microbial population of sable based on metagenomic sequencing methods, which use whole metagenomic data, and to map the obtained sequences to known genes or pathways in existing databases, such as CAZy, KEGG, and eggNOG. We then explored the genetic composition and functional diversity of the microbial community based on the mapped functional categories.Entities:
Keywords: Functional database; Gene function annotation; Gut microbiota; Metagenomics; Sable (Martes zibellina)
Year: 2020 PMID: 32945998 PMCID: PMC7501400 DOI: 10.1186/s13568-020-01103-6
Source DB: PubMed Journal: AMB Express ISSN: 2191-0855 Impact factor: 3.298
Table of the information in samples
| Species | Sample | Sex | Time |
|---|---|---|---|
| Sable | MZF.1 | Female | 2017.11 |
| Sable | MZF.2 | Female | 2017.11 |
| Sable | MZF.3 | Female | 2017.11 |
| Sable | MZF.4 | Female | 2017.11 |
| Sable | MZF.5 | Female | 2017.11 |
| Sable | MZM.1 | Male | 2017.11 |
| Sable | MZM.2 | Male | 2017.11 |
| Sable | MZM.3 | Male | 2017.11 |
| Sable | MZM.4 | Male | 2017.11 |
| Sable | MZS.1 | Male | 2017.11 |
| Sable | MZB.1 | Male | 2017.11 |
| Sable | MZB.2 | Male | 2017.11 |
The detection report of DNA
| Sample name | Concentration (ng/µl) | Volume (µl) | Total (ng) | Library type | Test results |
|---|---|---|---|---|---|
| MZF.1 | 11.4 | 51 | 581 | Meta | A |
| MZF.2 | 13 | 51 | 663 | Meta | A |
| MZF.3 | 7.6 | 51 | 388 | Meta | A |
| MZF.4 | 10 | 51 | 510 | Meta | A |
| MZF.5 | 0.8 | 51 | 41 | Meta | A |
| MZM.1 | 37 | 51 | 1887 | Meta | A |
| MZM.2 | 5.7 | 51 | 291 | Meta | A |
| MZM.3 | 16 | 51 | 816 | Meta | A |
| MZM.4 | 2.54 | 51 | 130 | Meta | A |
| MZS.1 | 7.2 | 51 | 367 | Meta | A |
| MZB.1 | 4.7 | 51 | 240 | Meta | A |
| MZB.2 | 21 | 51 | 1071 | Meta | A |
The statistical information of sample data
| Sample | Raw data | Clean data | Clean_Q20 | Clean_Q30 | Clean_GC (%) | Effective (%) |
|---|---|---|---|---|---|---|
| MZF.1 | 6423.43 | 6348.62 | 97.36 | 95.16 | 52.27 | 98.835 |
| MZF.2 | 6400.51 | 371.69 | 97.28 | 95.07 | 49.81 | 99.55 |
| MZF.3 | 6337.84 | 6290.67 | 97.46 | 95.33 | 51.35 | 99.256 |
| MZF.4 | 6792.60 | 6769.70 | 97.12 | 94.95 | 41.59 | 99.663 |
| MZF.5 | 6390.60 | 6341.12 | 96.89 | 95.33 | 45.32 | 99.226 |
| MZM.1 | 6272.72 | 6260.54 | 97.11 | 94.76 | 47.25 | 99.806 |
| MZM.2 | 7024.25 | 7010.36 | 96.72 | 94.53 | 38.97 | 99.802 |
| MZM.3 | 6252.62 | 6230.76 | 97.25 | 95.01 | 50.56 | 99.65 |
| MZM.4 | 6612.68 | 6556.58 | 95.95 | 94.00 | 41.27 | 99.152 |
| MZS.1 | 6438.21 | 6360.45 | 96.71 | 94.22 | 42.41 | 98.792 |
| MZB.1 | 6266.57 | 6244.48 | 96.58 | 94.82 | 47.00 | 99.648 |
| MZB.2 | 6809.10 | 6693.22 | 96.75 | 91.59 | 47.48 | 98.298 |
The statistical information of sample assembled results
| Sample | Total len. (bp) | Num. | Average len. (bp) | N50 Len. (bp) | Max len. (bp) |
|---|---|---|---|---|---|
| MZF.1 | 36,356,589 | 34,785 | 1045.18 | 1087 | 73,452 |
| MZF.2 | 31,081,373 | 22,094 | 1406.78 | 1711 | 305,593 |
| MZF.3 | 18,869,947 | 14,803 | 1274.74 | 1492 | 39,052 |
| MZF.4 | 62,710,239 | 46,632 | 1344.79 | 1670 | 91,860 |
| MZF.5 | 32,307,271 | 23,551 | 1,371.80 | 1703 | 52,978 |
| MZM.1 | 45,524,309 | 37,738 | 1206.33 | 1328 | 226,903 |
| MZM.2 | 79,892,828 | 65,995 | 1210.59 | 1373 | 185,211 |
| MZM.3 | 14,205,478 | 11,729 | 1211.14 | 1341 | 41,989 |
| MZM.4 | 33,869,693 | 26,595 | 1273.54 | 1511 | 86,091 |
| MZS.1 | 33,880,650 | 26,234 | 1291.48 | 1419 | 243,912 |
| MZB.1 | 30,495,188 | 17,768 | 1716.30 | 2789 | 400,796 |
| MZB.2 | 2,101,244 | 2820 | 745.12 | 692 | 15,984 |
| NOVO_MIX | 205,647 | 273 | 753.29 | 700 | 5562 |
The statistical information of predicted gene
| Sample | ORFs NO | Integrity: none | Integrity: all | Total length | Average length |
|---|---|---|---|---|---|
| MZB.1 | 39,513 | 3,004 (7.6%) | 19,623 (49.66%) | 25.86 | 654.52 |
| MZB.2 | 1766 | 103 (5.83%) | 718 (40.66%) | 0.5 | 284.09 |
| MZF.1 | 57,458 | 6,949 (12.09%) | 16,882 (29.38%) | 31.42 | 546.78 |
| MZF.2 | 43,961 | 3,734 (8.49%) | 18,541 (42.18%) | 26.64 | 606 |
| MZF.3 | 27,858 | 2,972 (10.67%) | 10,265 (36.85%) | 16.63 | 597.07 |
| MZF.4 | 91,228 | 8,049 (8.82%) | 37,356 (40.95%) | 54.37 | 595.93 |
| MZF.5 | 47,236 | 3,832 (8.11%) | 20,673 (43.77%) | 27.95 | 591.7 |
| MZM.1 | 67,006 | 7,037 (10.5%) | 24,051 (35.89%) | 39.32 | 586.82 |
| MZM.2 | 105,623 | 8,052 (7.62%) | 46,010 (43.56%) | 54.35 | 514.59 |
| MZM.3 | 21,266 | 2,107 (9.91%) | 8,287 (38.97%) | 12.07 | 567.54 |
| MZM.4 | 50,935 | 4,540 (8.91%) | 20,947 (41.12%) | 29.4 | 577.3 |
| MZS.1 | 49,063 | 4,797 (9.78%) | 20,141 (41.05%) | 28.82 | 587.47 |
| NOVO_MIX | 118 | 4 (3.39%) | 57 (48.31%) | 0.03 | 234.56 |
Fig. 1Cluster heat map of relative abundance at genus level
Fig. 2Relative abundance of gut bacterial at the phylum taxonomical level
Fig. 3Principal Component Analysis
Fig. 4Relative abundance of pathways
Fig. 5KEGG pathway annotation
Fig. 6Relative abundance of carbohydrates
Fig. 7Number of matched genes of carbohydrates
Fig. 8Relative abundance of function class
Fig. 9Number of matched genes of function class
Fig. 10ARO distribution and abundance cluster heat map. The right vertical axis is the ARO name, and the left vertical axis is the ARO cluster tree
Fig. 11The overview circle graph of resistance gene