Literature DB >> 33205903

Analysis of the vaginal microbiome of giant pandas using metagenomics sequencing.

Lan Zhang¹, Caiwu Li², Yaru Zhai¹, Lan Feng¹, Keke Bai¹, Zhizhong Zhang², Yan Huang², Ti Li², Desheng Li², Hao Li¹, Pengfei Cui¹, Danyu Chen¹, Hongning Wang¹, Xin Yang¹.

Abstract

In this study, a total of 14 vaginal samples (GPV1-14) from giant pandas were analyzed. These vaginal samples were divided into two groups as per the region and age of giant pandas. All the vaginal samples were analyzed using metagenomic sequencing. As per the outcomes of metagenomic analysis, Proteobacteria (39.04%), Firmicutes (5.27%), Actinobacteria (2.94%), and Basidiomycota (2.77%) were found to be the dominant phyla in the microbiome of the vaginal samples. At the genus level, Pseudomonas (21.90%) was found to be the most dominant genus, followed by Streptococcus (3.47%), Psychrobacter (1.89%), and Proteus (1.38%). Metastats analysis of the microbial species in the vaginal samples of giant pandas from Wolong Nature Reserve, Dujiangyan and Ningbo Youngor Zoo, and Ya'an Bifengxia Nature Reserve was found to be significantly different (p < 0.05). Age groups, that is, AGE1 (5-10 years old) and AGE2 (11-16 years old), also demonstrated significantly different inter-group microbial species (p < 0.05). For the first time, Chlamydia and Neisseria gonorrhoeae were detected in giant pandas' reproductive tract. GPV3 vaginal sample (2.63%) showed highest Chlamydia content followed by GPV14 (0.91%), and GPV7 (0.62%). GPV5 vaginal sample (7.17%) showed the highest Neisseria gonorrhoeae content, followed by GPV14 (7.02%), and GPV8 (6.50%). Furthermore, we employed eggNOG, CAZy, KEGG, and NCBI databases to investigate the functional significance of giant panda's vaginal microbial community. The outcomes indicated that giant panda's vaginal microbes were involved in biological processes. The data from this study will help in improving the reproductive health of giant pandas.

Entities: Chemical Disease Gene Species

Keywords: giant panda; metagenomic sequencing; vaginal microbiome

Year: 2020 PMID： 33205903 PMCID： PMC7755806 DOI： 10.1002/mbo3.1131

Source DB: PubMed Journal: Microbiologyopen ISSN： 2045-8827 Impact factor: 3.139

Carbohydrate‐Active enzymes Database evolutionary genealogy of genes: Non‐supervised Orthologous Groups giant panda vagina kyoto encyclopedia of genes and genomes

INTRODUCTION

The giant panda is a national treasure of China (Peng et al., (2001)). Giant pandas are endangered species with low fertility rates and on the brink of extinction. Their natural habitat occurs in small patches, which is spread across an area of 12,000 km2 Hu, (1997); Peng et al., 2007). Increasing the reproductive rate of the giant panda is a crucial part of current conservation strategies (Hu, 1997). Animals harbor diverse microorganisms in large numbers. Thus, the microbial investigation of the reproductive organs becomes crucial (Yan et al., 2019). As demonstrated in previous studies, microbiota affects the reproductive system either directly or indirectly (Bradford & Ravel, 2017; Ma et al., 2017; Moraes et al., 2014; Pal, 2015; Stumpf et al., 2013). Imbalanced vaginal microbiota and vaginal infection have been directly correlated to infertility as per the previous studies (Li et al., 2019). For instance, fungal infection of the reproductive system, such as human vulvovaginitis (Hedayati et al., 2015), cervicitis (Dascanio et al., 2010), and cow's clinical vaginitis (Al‐Fatllawy, 2014), severely affects fertility. On par with the pigs (De Puysseleyr et al., 2014; Schautteet & Vanrompay, 2011), cattle (Knudtson & Kirkbride, 1992), and horse's (Cafarchia et al., 2013) vaginal microbiome investigations, the vaginal microbiome of the giant panda was also investigated. These studies provided a comprehensive description of bacterial community composition in the vagina, uterus, and fungal community composition in the vagina of healthy giant pandas along with the diversity and abundance of these microbes (Chen et al., 2018; Yang et al., 2016, 2017). The majority of the vaginal microbiome studies carried out so far have adapted either the conventional method of microbial isolation, culture, and identification or high‐throughput sequencing. However, these methods could not elucidate the giant panda's complete vaginal microbiome. A majority of vaginal microorganisms have strict growth requirements, which makes their in vitro culture a challenging task. Vaginal microbes interact and restrict each other's growth, and thus protect reproductive efficiency in giant pandas. To date, limited studies have investigated the genitourinary tract of giant pandas. Due to its high efficiency and accuracy, in the current study, the metagenomic sequencing method was employed to investigate the giant panda's vaginal microbiome and functional characterization of the associated genes. Compared with the 16S amplicon sequencing method, metagenomic sequencing is widely used to examine the microbiome, gene function, gene activity, and cooperative relationship between the microorganisms and the correlation between microorganisms and the environment. In this study, a comprehensive analysis of the reproductive tract microbiome from giant pandas of different ages residing in different locations was conducted using metagenomic sequencing (Doonan et al., 2016). In the giant pandas’ vaginal samples, we examined microbial interaction and the effect of age and region on this interaction. Furthermore, in this study, the metagenomic sequencing method was employed to examine the vaginal microbiome of giant pandas for the first time. The data from this study can be utilized to improve the reproductive health of giant pandas in the future.

MATERIALS AND METHODS

Samples

A total of 14 vaginal samples from giant pandas (GPV 1‐14) were collected for which 14 giant pandas were sampled from four different geographical locations in China, that is, Wolong Nature Reserve, Ya'an Bifengxia Nature Reserve, China Giant Panda Conservation Research Center Dujiangyan Base, and Ningbo Youngor Zoo. Long sterile swabs were used to collect the vaginal secretion of 5‐23 years old giant panda in estrous during the period of June‐August, 2018. Subsequently, each swab was immediately dipped in 2 ml phosphate‐buffered saline (PBS) in sterile 5 ml screw‐cap tubes (Yang et al., 2017). These samples were transported to the Animal Disease Prevention and Food Safety Key Laboratory of Sichuan Province and stored in freezers at −80°C until the DNA extraction procedure (Yang et al., 2016, 2017). The 14 vaginal samples taken from these giant pandas were divided into two groups based on age and region (Yang et al., 2016, 2017). In the age group, 14 vaginal samples were categorized as AGE1, AGE2, AGE3. AGE1 contained seven vaginal samples from 5‐10 years old giant pandas. These samples were marked as GPV1, GPV2, GPV3, GPV4, GPV5, GPV6, and GPV7. AGE2 contained five vaginal samples from 11 to 16 years old giant pandas, which were marked as GPV8, GPV9, GPV10, GPV11, and GPV12. AGE3 contained two vaginal samples from 17‐23 years old giant pandas, which were marked as GPV13 and GPV14. In the region group, 14 vaginal samples were categorized as WL, YA, DN. WL contained five vaginal samples from Wolong region giant pandas, and these samples were marked as GPV1, GPV2, GPV8, GPV13, and GPV14. YA contained six vaginal samples from the Ya'an Bifengxia region's giant pandas, and these samples were marked as GPV4, GPV5, GPV9, GPV10, GPV11, and GPV12. DN contained three giant panda's vaginal samples from Dujiangyan Base and Ningbo Youngor Zoo and marked as GPV3, GPV6, and GPV7 (Table 1). As per the protection centers, none of the sampled giant pandas had any reproductive problems.

TABLE 1

The sample group information.

Basis of grouping	Group name	Sample serial number
5–10 years old	AGE1	GPV1‐GPV7
11–16 years old	AGE2	GPV8‐GPV12
17–23 years old	AGE3	GPV13‐GPV14
Wolong Nature Reserve	WL	GPV1, GPV2, GPV8, GPV13, GPV14
Dujiangyan Base and Ningbo Youngor Zoo	DN	GPV3, GPV6, GPV7
Ya'an Bifengxia Nature Reserve	YA	GPV4,GPV5,GPV9,GPV10,GPV11,GPV12

The sample group information.

DNA extraction and metagenomic sequencing

The whole‐genome DNA was extracted from the vaginal samples using Universal Genomic DNA Kit (Kang for the century Biotechnology Co., Ltd.), as per the manufacturer's instruction. The microbial cells were lysed with 180 µL of the enzymatic buffer. DNA purity and concentration were determined using 1% agarose gel electrophoresis before being sent to the Novogene Bioinformatics Technology Co. Ltd. The metagenomic sequencing of these DNA samples was performed by the quay Co. Ltd (Chen & Pachter, 2005; Yang et al., 2016, 2017) on the Illumina HiSeq sequencing platform and dual‐end sequencing (paired‐end).

Analysis of samples

The metagenomic sequencing data analysis was primarily focused on gene prediction and abundance analysis, specimen annotation, dimensionality analysis of species abundance, LEfSe analysis of differential species between groups, cluster analysis of species abundance, and Metastats analysis of different species between groups (Avershina et al., 2013; Oh et al., 2014; Rivas et al., 2013; Segata et al., 2011; White et al., 2009).

Metagenomic sequencing

Metagenomic analysis was used to unravel the microbial composition and microbial interaction between the 14 vaginal samples. Besides, the metagenomic sequencing data were analyzed to identify the metabolic pathways and gene function at the molecular level. The metagenomic sequencing was performed using the following steps:

Library construction

1 μg of DNA was taken per sample as input material. Sequencing libraries were generated using the NEBNext® Ultra™ DNA Library Prep Kit for Illumina (NEB), as per the manufacturer's recommendations, and index codes were added to attribute sequences to each sample. Briefly, DNA samples were fragmented by sonication to obtain DNA fragments of 350 bp size, and these DNA fragments were end‐polished, A‐tailed, and ligated with full‐length adaptor for Illumina sequencing and PCR amplified. The PCR products were purified (AMPure XP system), and DNA libraries were analyzed for size distribution using the Agilent 2100 Bioanalyzer system and quantified using real‐time PCR.

Pretreatment of sequencing results

To obtain clean data for subsequent analysis, the raw data from the Illumina HiSeq sequencing platform were preprocessed using Readfq version 8 (https://github.com/cjfields/readfq) as follows: (a) reads with low‐quality bases (default quality threshold value ≤38) above 40 bp length were removed; (b) reads containing a specific percentage of N base were removed (default length of 10 bp); (c) reads overlapping with the adapter (default length of 15 bp) were also removed.

Metagenome assembly

The clean metagenomic data obtained from the non‐complex environment of the giant panda's vaginal contents were assembled and analyzed (Luo et al., 2012) using SOAPdenovo software version 2.04 (http://soap.genomics.org.cn/soapdenovo.html; Brum et al., 2015; Feng et al., 2015; Qin et al., 2014; Scher et al., 2013). The parameters used for this analysis were as follows: ‐d 1, ‐M 3, ‐R, ‐u, ‐F, ‐K 55. The assembled scaftigs were interrupted from the N connection, and the scaftigs without N were removed (Mende et al., 2012; Nielsen et al., 2014; Qin et al., 2014) (http://bowtiebio.sourceforge.net/bowtie2/index.shtml).

Gene prediction and abundance analysis

The scaftigs (≥500 bp) from each sample were assembled, and the ORFs were predicted using MetaGeneMark version 2.10 (http://topaz.gatech.edu/GeneMark/) software and reads shorter than 100 nt () were filtered from the final result using default parameters. For the predicted ORF, CD‐HIT (Fu et al., 2012; Li & Godzik, 2006) software version 4.5.8 (http://www.bioinformatics.org/cd-hit) was used to reduce redundancy and obtain the unique initial gene catalog (the genes here refers to the nucleotide sequences coded by unique and continuous genes), using the following parameters: ‐c 0.95, ‐G 0, ‐aS 0.9, ‐g 1, ‐d 0 (Sunagawa et al.,; Zeller et al., 2014). The clean data of each sample were mapped to the initial gene catalog using Bowtie version2.2.4 (http://bowtiebio.sourceforge.net/bowtie2/index.shtml) to obtain the number of reads mapped to the genes identified in each sample, with the following parameter setting: ‐‐ end‐to‐end, ‐‐sensitive, ‐I 200, ‐X 400. The genes with ≤2 numbers of reads were filtered (Qin et al., 2012, 2014) for each sample to obtain the gene catalog (unigenes), which was used for subsequent analysis. The basic information: statistic, core‐pan gene analysis, correlation analysis of samples, and Venn figure analysis of genes were based on the abundance of each gene in each sample in the gene catalog (Cotillard et al., 2013; Le Chatelier et al., 2013; Villar et al., 2013).

Taxonomy prediction

The DIAMOND (Buchfink et al., 2015) software version 0.9.9 (https://github.com/bbuchfink/diamond/) was used to blast the unigenes to the sequences of bacteria, fungi, archaea, and viruses fetched from the NR database version 2018‐01‐02 (https://www.ncbi.nlm.nih.gov/) of NCBI with the parameter setting: blastp ‐e 1e‐5. Since each sequence may have multiple aligned results, for the final aligned results of each sequence, the cutoff was set to e‐value ≤the smallest e‐value *10 to take the LCA algorithm, which was applied to the systematic classification of MEGAN (Huson et al., 2011) software to ensure the species annotation information of sequences (Oh et al., 2014). To obtain the table containing the gene number and abundance of each sample at each taxonomical level, taxonomical hierarchy (kingdom, phylum, class, order, family, genus, and species) for each sample was deduced based on the LCA annotation result and the gene abundance table. The species abundance in one sample equals the sum of the gene abundance annotated for the species. The gene number of a species in a sample equals the number of genes whose abundance is nonzero. For Krona analysis, the exhibition of generation situation of relative abundance, the exhibition of abundance cluster heat map, PCA (Avershina et al., 2013; R ade4 package version 2.15.3), and NMDS (Rivas et al., 2013; R vegan package version 2.15.3) decrease‐dimension analysis were based on the abundance table of each taxonomical hierarchy. The difference between groups was tested using ANOSIM (R vegan package version 2.15.3). Different species between groups were searched using Metastats and LEfSe analyses. P‐value was determined for each taxon using the permutation test between groups in the Metastats analysis. Furthermore, Benjamini and Hochberg False Discovery Rate were used to correct the p‐values and acquire the q‐values (White et al., 2009). LEfSe analysis was conducted using the LEfSe software (the default LDA score was 3; Segata et al., 2011). Lastly, random forest (RandoForest; R pROC and randomForest packages (Breiman, 2001) version 2.15.3) software was used to construct a random forest model. Significant species were screened using MeanDecreaseAccuracy and MeanDecreaseGin, and each model was cross‐validated (default 10 times), and the ROC curves were plotted.

Common functional database annotation

The DIAMOND software version 0.9.9 was employed to blast the unigenes to the functional database with the parameter setting of blastp, ‐e 1e‐5 (Feng et al., 2015; Qin et al., 2014). The functional databases used in this analysis were as follows: KEGG (Kanehisa et al., 2006, 2014) version 2018‐01‐01 (http://www.kegg.jp/kegg/), eggNOG (Powell et al.,) version 4.5 (http://eggnogdb.embl.de/#/app/home), and CAZy (Cantarel et al., 2009) version 201801 (http://www.cazy.org/). For each sequence's blast result, the best blast hit was used for subsequent analysis (Bäckhed et al., 2015; Feng et al., 2015; Qin et al., 2014). For the statistic of the relative abundance of different functional hierarchy, the relative abundance of each functional hierarchy equals the sum of relative abundance annotated to that functional level. As per the outcome of the functional annotation and gene abundance table, the gene number table of each sample in each taxonomic hierarchy was obtained. The gene number of a function in a sample equals the gene number that was annotated to this function, and the abundance is nonzero. Based on the abundance table of each taxonomy hierarchy, the counting of annotated gene numbers, the exhibition of the general relative abundance situation, the exhibition of abundance cluster heat map, and the decrease‐dimension analysis of PCA and NMDS were conducted. Similarly, ANOSIM test of the difference between groups (inside) based on the functional abundance, comparative analysis of metabolic pathways, the Metastats and LEfSe analyses of the functional difference between groups were also performed.

RESULTS

Sequencing data

The sequencing data of giant panda's vaginal samples in this study were acquired through the Illumina HiSeq sequencing platform (Odintsova et al., 2017). 94,662.28 Mbp of raw reads was obtained from the metagenomic sequencing, and the average sequencing data volume was 6,761.59 Mbp. The total and average data after quality control were 94,455.81 Mbp and 6,746.84 Mbp, respectively. The effective data rate of quality control was 99.78% (Table A1: https://doi.org/10.5281/zenodo.4067664). A total of 537,000,891 bp scaffolds were assembled with an average length of 1,727.76 bp, the maximum length of 898,339 bp, N50 of 4,454.20 bp, and N90 of 697,60 bp. The scaffolds were interrupted from N to generate scaftigs. It yielded a total of 477,921,089 bp scaftigs. The average length of scaftigs was 1,610 bp. The N50 was 3,830 bp, and the N90 was 701 bp (Table A2: https://doi.org/10.5281/zenodo.4067664).

Gene prediction and abundance analysis

A total of 74,805 genes with an average length of 566.80 bp and 54.96% GC content, accounting for 18.14% of all non‐redundant genes, were identified. The three vaginal samples, GPV1, GPV5, and GPV8, did not show any unique genes. However, GPV12 showed the highest number of unique genes. Out of the three regional groups of giant pandas, the highest number of unique genes appeared in the vaginal samples from the YA group. Out of the three age groups of giant pandas, the highest number of unique genes appeared in the vaginal samples from the AGE2 group.

Species annotation

Specimen annotation overview

A total of 412,312 predicted genes were obtained after the original de‐redundancy of the 14 vaginal samples. 293,671 (71.23%) ORFs were annotated using the NR database. A total of 33 phyla, 78 classes, 184 orders, 383 families, and 1000 genera were identified from the all giant pandas’ vaginal samples. The top 10 species with the highest relative abundance in each sample (group) were selected from the relative abundance table of different classification levels. The remaining species were designated as “Others.” Figure 1 demonstrates the relative abundance of the top 10 microorganisms at the phylum level. Proteobacteria, Firmicutes, Actinobacteria, and Basidiomycota, which accounted for 39.04%, 5.77%, 2.94%, and 2.77%, respectively, were identified as the dominant phyla (Figure 1). However, as demonstrated in Figure 1, different groups showed different dominant phylum. Phyla with more than 1% relative abundance were defined as the dominant phyla.

FIGURE 1

Clustering tree based on Bray–Curtis distance. (a) Clustering tree based on Bray–Curtis distance in the regional groups. (b) Clustering tree based on Bray–Curtis distance in the age groups.

Clustering tree based on Bray–Curtis distance. (a) Clustering tree based on Bray–Curtis distance in the regional groups. (b) Clustering tree based on Bray–Curtis distance in the age groups. Proteobacteria and Firmicutes were identified as the dominant phyla in the WL group, whereas Proteobacteria, Basidiomycetes, Firmicutes, Actinomycetes, and Bacteroides were identified as the dominant phyla in the YA group. The dominant phyla in the DN group were Proteobacteria, Firmicutes, Actinomycetes, Bacteroides, and Chlamydiae. In the AGE1 group, Proteobacteria, Firmicutes, Actinomycetes, and Bacteroidetes were identified as the dominant phylum. However, the AGE2 group showed Proteobacteria, Basidiomycota, Firmicutes, and Actinomycetes as the dominant phyla, and AGE3 showed Proteobacteria and Firmicutes as dominant phyla. Basidiomycota phylum was identified as a dominant phylum only in the YA and AGE2 groups, but it was not identified in any of the other groups. Interestingly, the relative abundance of Chlamydia was higher in the DN group only. Out of the total 1,000 genera identified in the giant panda's vaginal samples, the top 10 genera with the highest relative abundance are depicted in Figure 2. In all the fourteen samples, Pseudomonas was found to be the most abundant genus (21.90%), followed by Streptococcus (3.47%), Psychrobacter (1.89%), and Proteus (1.38%; Figure 2a). Dominant genera did not differ between the region and age group. However, out of the top ten dominant microorganisms with relatively highest abundance, Neisseria was found to be one of the most dominant genera (Figure 2b, c).

FIGURE 2

The relative abundance of features based on metagenome sequencing reads on the genus level. (a) Genus level relative abundance in all the samples. (b) Genus level relative abundance in regional groups. (c) Genus level relative abundance in age groups. The top 35 genera were selected from the relative abundance table of different classification levels, and a heat map was constructed based on the abundance of each sample (Figure 3). The three groups, WL, DN, and YA, showed significant differences at the genus level (Figure 3a). The Mycoplasma in the Tenericutes phylum was mainly from the DN group. The abundant genus identified in the YA group belonged to phyla Proteobacteria, Basidiomycetes, and Actinomycetes. However, the proportion of Fusobacterium in phylum Fusobacteria was identical in the YA and DN groups. The abundant genera identified in the WL group were primarily from phylum Firmicutes. In the AGE group, we observed significant differences at the genus level between each group (Figure 3b). The Mycoplasma in the Tenericutes phylum was primarily contributed by the AGE1 group, whereas the abundant genera in the AGE3 group belonged to the Proteobacteria phylum. Six of the thirteen abundant genera in the AGE2 group belonged to the Basidiomycota phylum.

FIGURE 3

Clustering heat map based on the relative abundance of features based on metagenome sequencing at the genus level. (a) Genus level relative abundance clustering heat map in regional groups. (b) Genus level relative abundance clustering heat map in age groups. The outcomes of the analysis demonstrated that the average relative abundance of Chlamydia, which is the most common cause of vaginal infection, was less than 0.44% in giant panda's vaginal samples. The highest content of Chlamydia was found in GPV3 (2.63%), followed by GPV14 (0.90%) and GPV7 (0.62%).

Dimensionality analysis of species abundance

The WL and DN groups showed identical microbial composition within the group and also between the groups irrespective of the PCA or NMDS analysis (Figure 4a, b). The YA group showed relatively more diverse microbial composition within the group. The microbial composition of the AGE group is depicted in Figure 4c and 4d. As per the PCA or NMDS analysis, the majority of the samples in AGE1 and AGE3 groups showed a highly similar microbial composition. However, each sample in the AGE2 group showed significantly different microbial composition.

FIGURE 4

PCA and NMDS analysis of microorganisms composition at the phylum level. (a) PCA analysis of microorganisms composition in regional groups. (b) NMDS analysis of microorganisms composition in regional groups. (c) PCA analysis of microorganisms composition in age groups. (d) NMDS analysis of microorganisms composition in age groups.

Cluster analysis of species abundance and Metastats analysis of differential species between groups

The three regional groups (WL, DN, and YA; Figure 1a) and three age groups (AGE1, AGE2, and AGE3; Figure 1b) were clustered separately. The giant pandas sampled in this study were all fed at the Giant Panda Research and Protection Center in an identical environment. Region and age significantly affect the components of the microbiota in the reproductive tract of the giant panda. Hypothesis testing of species abundance data between the groups was performed using Metastats analysis. The species with significant differences (p < 0.05, q < 0.05) were screened as per the p‐ and q‐values. The phyla with significant differences between the three groups (WL, YA, and DN) were as follows: Proteobacteria, Cyanobacteria, Chloroflexi, Euryarchaeota, Planctomycetes, Acidobacteria, and Deinococcus‐Thermus (Table A3: https://doi.org/10.5281/zenodo.4067664). Moreover, the phyla, which differed significantly between the AGE1 and AGE2 groups, were as follows: Proteobacteria, Firmicutes, Bacteroidetes, Cyanobacteria, Planctomycetes, Acidobacteria, and Deinococcus‐Thermus (Table A4: https://doi.org/10.5281/zenodo.4067664).

LEfSe analysis of different species between groups

LDA score (LDA Score>4) was used to indicate the significant differences between the species in different groups, as depicted in Figure A1. The LEfSe test identified the differences in microbial communities between the three groups (WL, YA, and DN). Microbiota from all the samples was classified at phylum, class, order, family, genus, and species levels, and a total of 43 differences were detected at each of these levels. Out of these 43 differences, 37 were contributed by the YA group, 5 by the DN group, and 1 by the WL group. The majority of the microbes enriched in the YA group belonged to Saccharomycetes, Tricholomataceae, Agaricomycetes, Microbotryaceae, Microbotryales, Cryptococcaceae, Naemateliaceae, and Tremellaceae. Additionally, the majority of the microbes enriched in the DN group belonged to Actinomycetaceae, Neisseria subflava, Ornithobacterium, and Helicobacter macacae. Furthermore, the primary microbial population enriched in the WL group was of Moraxella osloensis.

FIGURE A1

LDA value distribution map and evolutionary branch diagram of different microorganisms composition in regional and age groups. (a) LDA value distribution map of different microorganisms composition in regional groups. (b) LDA value distribution map of different microorganisms composition in age groups.

A total of 87 differences were detected between the AGE2 and AGE3 groups (Figure A1b). Out of these 87 differences, 82 differences were contributed by the AGE2 group, and 5 differences by the AGE3 group. The majority of the microbial populations enriched in the AGE1 group belonged to Epsilonproteobacteria, Bacteroides, and Corynebacterium, and in the AGE2 group were Burkholderiales, Burkholderiaceae, Gammaproteobacteria, Pseudomonadales, Pseudomonadaceae, Saccharommycetes, Metschnikowiaceae, Agaricomycetes, Polyporales, Tricholomataceae, Microbotryaceae, Cryptococcaceae, Cuniculitremaceae, Naemateliaceae, Tremellales, Tremellaceae, Tremellomycetes, Trichosporonaceae.

Functional annotation and analysis

412,312 genes identified from all the 14 samples were analyzed for functional characterization. Out of the 412,312 genes, 240,050 (58.22%) genes were analyzed using eggNOG database, 9,978 (2.42%) in CAZy database, 259,596 (62.96%) in KEGG database, of which 143,757 (34.87%) genes were queried against 7,791 KEGG ortholog group (KO) in the KEGG database. As per the functional analysis of genes using eggNOG, except for the unknown functions category, 24 functional categories were enriched, and a majority of the genes were enriched in functional categories, such as amino acid transport and metabolism, carbohydrate transport and metabolism, transcription, and replication, recombination, and repair (Figure 5a). Few genes were enriched in the nuclear structure and extracellular structure (each ≤0.01%) categories. The functional characterization of the metagenomic sequencing data revealed that amino acid transport and metabolism, carbohydrate transport and metabolism were the dominant functions of the genes identified from the giant pandas’ vaginal samples. These functions are also involved in microbial growth and development (Feng et al., 2018).

FIGURE 5

Annotated gene number statistics graphs and cluster trees based on Bray–Curtis distance for each database. (a) EggNOG's Unigenes annotation number statistical graph (level 1). (b) CAZy Unigenes annotation number statistical graph (level 1). (c)The number of functional annotations of genes at KEGG (level 1). Besides, we examined the changes associated with the altered carbohydrate‐active enzyme genes abundance. In the CAZy analysis, the abundant enzymes in descending order of abundance were glycoside hydrolases (GHs), glycosyltransferases (GTs), carbohydrate‐binding modules (CBMs), carbohydrate esterases (CEs), auxiliary activities (AAs), and polysaccharide lyases (PLs; Figure 5b). At level two of the CAZy family, 252 carbohydrate‐active enzymes were identified using metagenomic analysis (Table A5: https://doi.org/10.5281/zenodo.4067664). GT2, which constitutes cellulose synthase (EC 2.4.1.12); chitin synthase (EC 2.4.1.16); dolichyl‐phosphate β‐D‐mannosyltransferase (EC 2.4.1.83), and so on, was identified as the most abundant carbohydrate‐active enzyme in all the 14 vaginal samples. Thus, we speculated that glycoside hydrolase is involved in the metabolic function of the giant panda's vagina. Apart from eggNOG and CAZy functional analysis, we also performed a KEGG functional analysis of the 14 samples. Overall, the enriched functional profiles were divided into seven categories, and the most highly enriched category was metabolism, followed by environmental information processing, genetic information processing, cellular processes, human diseases, and organismal systems (Figure 5c, Table A6: https://doi.org/10.5281/zenodo.4067664). In the level two of KEGG analysis, carbohydrate metabolism was the most highly enriched category followed by membrane transport, amino acid metabolism, metabolism of cofactors and vitamins, signal transduction, energy metabolism, cellular community‐prokaryotes, nucleotide metabolism, translation, and lipid metabolism (Table A7: https://doi.org/10.5281/zenodo.4067664). A total of 392 functional groups were identified through the “KEGG Orthology” analysis (Table A8: https://doi.org/10.5281/zenodo.4067664). The signaling pathway enrichment analysis of genes from vaginal samples indicated that these pathways were involved in metabolism, cellular processes, environmental, and genetic information processing. The 392 pathways were divided into 612 functional modules, and out of these modules, M00178 (ribosome, bacteria) and M00179 (ribosome, archaea) modules, part of the map03010 pathways, were most highly enriched (Table A9: https://doi.org/10.5281/zenodo.4067664). It indicated that the dominant microbes present in the vagina of giant pandas were mainly bacteria and archaea. As per the KEGG database enrichment analysis, the ABC transporters, two‐component system, purine metabolism, quorum sensing, ribosome, pyrimidine metabolism, bacterial secretion system, and phosphotransferase system (PTS) were the most highly enriched functional categories (Table A8: https://doi.org/10.5281/zenodo.4067664). Figure A2 depicts the top 10 enriched signaling pathways identified through the KEGG pathway analysis. ABC transporters (PATH: ko02010), involved in membrane transport, were enriched in most of the vaginal samples.

FIGURE A2

Relative abundances of the top 10 KEGG pathways in the region and age groups. (a) Relative abundances of the top 10 KEGG pathways in the region group (level 3). (b) Relative abundances of the top 10 KEGG pathways in the age group (level 3).

AGE and YA groups were identified as the most functionally active groups. Although the relative abundances of the pathways identified in all the vaginal samples did not differ significantly, the relative abundance of the phosphotransferase system (PTS) in AGE2 was significantly lower than AGE1 and AGE3 groups. Besides, the abundance of Pseudomonas aeruginosa, a biofilm‐forming strain in the YA group, was significantly lower than WL and DN groups. These outcomes indicated that differences in age and region impact the function of microbes present in the giant panda's vagina.

DISCUSSION

In the current study, we investigated the vaginal microbiome and associated functional profiles in the vaginal samples collected from the giant pandas from different geographical locations in China and different age groups using metagenomic sequencing. Currently, metagenomic sequencing technology is being widely employed in sequencing studies. Although the metagenomic sequencing method cannot distinguish DNA from dead and living cells, it can substantially eliminate the limitations posed by conventional methods, such as microbial isolation and culture. Apart from being convenient and highly precise technology, metagenomic sequencing can expand the utilization space of microbial resources. As implicated in the previous studies, apart from region and age, other factors also affect the microbiota of the vagina of giant panda significantly. In humans and other animals, feeding conditions severely impact the microbial makeup. Feeding style acts as a crucial driving factor for the colonization of intestinal microbes and influences the structure and function of intestinal microbiota in newborns (Azad et al., 2013) as a majority of the microbes in mammals are contributed through the lactation process in infants. All the giant pandas sampled in this study were fed by the Panda Research and Protection Center in an identical feeding environment. Besides, all the giant panda vaginal samples were collected and processed by the trained staff using a standard protocol. Thus, we decided that the samples can be analyzed to study the effect of region and age. Microorganisms exist in different environmental matrices, such as water, air, and soil, where they are involved in a myriad of ecological functions (Abia et al., 2019). However, age and region of the host organism may also affect the microbial composition in a different environment. In 2017, we employed 16S rRNA high‐throughput sequencing to analyze the reproductive tract microbiota of giant panda, which revealed Proteobacteria (59.2%), Firmicutes (34.4%), Actinobacteria (5.2%) as the most abundant phyla and Pseudomonas (8.0%), Streptococcus (6.3%) as most abundant genera in these samples (Yang et al., 2017). Interestingly, in our previous study on fungal microbiome using HiSeq sequencing, we observed similar dominant microbiota in the giant panda's vagina, as observed in the current study (Chen et al., 2018). Although the proportion of microorganisms in the giant panda's vagina varied as per the analyses in this study, the outcome was in line with the previous study. For instance, Pseudomonas was the most abundant genus in vaginal samples from both the studies. In this study, we determined the effect of age and region on vaginal samples using metagenomic sequencing. The species annotation of metagenomic data from vaginal samples in the age group (AGE1, AGE2, AGE3) showed that the AGE2 group had higher microbial diversity than the AGE1 and AGE3 group. A similar analysis of the vaginal samples in the region group (YA, WL, DN) showed that the YA group had higher microbial diversity as compared to the samples in the WL and DN groups (Tables A10 and A11: https://doi.org/10.5281/zenodo.4067664). The outcomes of PCA, heat map, and species clustering abundance analysis further validated these findings. Thus, we found that age and region affect the microbiome of the giant panda's vagina. Chlamydia, a strict intracellular bacterial parasite (Sachse et al., 2015), causes uterine infection, premature birth, or miscarriage in humans (Moragianni et al., 2019), and in pigs, it causes respiratory diseases, diarrhea, conjunctivitis, and reproductive problems (Li et al., 2011). In females, Chlamydia infection is associated with sexual dysfunction and infertility (Pfennig, 2019). In the current study, Chlamydia was detected in all 14 samples but in different proportions. It is noteworthy that the DN group showed Chlamydia as the dominant bacteria. A majority of the vaginal samples in the DN group were from younger giant pandas. Out of all the 14 vaginal samples, GPV14 showed the highest Chlamydia content (9%). Neisseria gonorrhoeae is a major cause of infertility in female (Costa‐Lourenço et al., 2017). In the current study, Neisseria was detected in all the 14 vaginal samples. Krona was employed to examine the result of species annotations (Ondov et al., 2011). The outcome of this analysis showed that except for GPV1, GPV2, GPV4, and GPV12 samples, Neisseria gonorrhoeae was identified in all the other vaginal samples. GPV11 showed the highest abundance of (4%) of Neisseria gonorrhoeae. Humans are the only natural host of Neisseria gonorrhoeae known so far (Chan et al., 2016). In general, an abnormal microbiome may be one of the crucial causes of pregnancy failure in animals (García‐Velasco et al., 2017). We speculated that Chlamydia and Neisseria gonorrhoeae infections could have an adverse effect on the reproductive ability of pandas as well. The metagenomic sequencing technology has revolutionized the research and extended the understanding of microbial communities in various environmental matrices, such as water (Abia et al., 2018), soil (Su et al., 2014), and even air (Be et al., 2015). Also, functional screening can allow us to deduce the role of these microbial communities. Besides, it can unravel the novel genes that were not reported in previous sequencing analysis datasets (Abia et al., 2019; Al‐Hebshi et al., 2018; Bao et al., 2020; Shobo et al., 2020). In the current study, functional databases of eggNOG, CAZy, and KEGG were employed to analyze the metagenomic sequencing data, and it was found that the giant panda's vaginal microbiome was actively involved in metabolic functions. As per the eggNOG database annotation of the genes, a majority of the genes belonged to the amino acid transport and metabolism category. As per the CAZy analysis, 252 different carbohydrate‐active enzymes were identified from the metagenomic sequencing data of the vaginal samples. The eggNOG and CAZy analysis revealed that in the AGE group, the AGE2 showed significantly higher gene numbers than AGE1 and AGE3 groups. Similarly, in the region group of vaginal samples, the number of genes annotated in the YA group was significantly higher than the WL and the DN groups. A high number of annotated genes in the AGE2 and YA groups indicate that age and region affect the giant panda's vaginal microbial function. Meanwhile, metabolic pathway analysis using the KEGG dataset demonstrated enrichment of the membrane transport and nucleotide metabolic pathways. The ABC transporters (PATH: ko02010), a crucial pathway in microbes (Smart & Fleming, 1996), facilitates the lipid mobilization across the lipid bilayer. Also, it plays a crucial role in the construction and maintenance of the lipid bilayer. The enrichment of the ko02010 pathway may be due to membrane transport function associated with the vagina, such as the secretion of fluids. ABC transporters were one of the largest family of membrane transporters (Veen & Konings, 1998). Recent studies have shown that the ABC transporter is detected in all organisms, and it is involved in the active transport of various substrates across the lipid bilayer (Sasser et al., 2012). The loss‐of‐function mutation of the ABC transporter culminates in genetic disorders (Paumi et al., 2009). In the current study, as per the KEGG analysis, ABC transporters may also affect the reproduction rate of giant pandas. The cluster analysis of age and regional group vaginal samples presented an apparent difference between these two groups. (Figure 6, Figure A3).

FIGURE 6

FIGURE A3

PCA analysis diagram based on functional abundance. PCA results show that the abscissa represents the first principal component and the percentage represents the contribution of the first principal component to the sample difference. The vertical axis represents the second principal component, and the percentage represents the contribution of the second principal component to the sample difference. Each dot in the diagram represents a sample, and the samples in the same group are shown in the same color.

Cluster the trees according to the Bray‐Curtis distance of the KEGG database. (a) Clustering tree based on Bray–Curtis distance in the regional groups. (b) Clustering tree based on Bray–Curtis distance in the age groups. In the current metagenomics analysis study of the giant pandas’ vaginal samples from different age groups and regions, we identified a total of 33 phyla and 1000 genera. Taxonomic profiling showed that the microbiome present in the vagina of giant pandas is affected by age and region. The functional pathways of vaginal microbes were correlated to the composition of the microbial community. A majority of these functions were induced by age and region. Therefore, compared to the region, age had a clear impact on the species and functional abundance of the microbiome in giant pandas’ reproductive tract.

CONCLUSIONS

In this study, we analyzed the vaginal microbiome of giant pandas using metagenomic sequencing. As per the outcomes of our analyses, Proteobacteria, Basidiomyta, Firmicutes, and Actinobacteria were identified as the dominant phyla in vaginal samples from giant pandas. Besides, we found that the geographical location and age affect the giant panda's vaginal microbiome and associated functions involving signaling pathways, proteins, and CAZy. Overall, Chlamydia and Neisseria gonorrhoeae in the reproductive tract of giant pandas may adversely affect the reproductive tract of giant pandas.

CONFLICTS OF INTERESTS

None declared.

AUTHOR CONTRIBUTION

Lan Zhang: Data curation (equal); Formal analysis (equal); Writing‐original draft (lead). Caiwu Li: Funding acquisition (equal); Investigation (equal). Yaru Zhai: Formal analysis (equal). Lan Feng: Data curation (equal). Keke Bai: Formal analysis (equal). Zhizhong Zhang: Funding acquisition (equal); Investigation (equal). Yan Huang: Funding acquisition (equal); Investigation (equal). Ti Li: Funding acquisition (equal); Investigation (equal). Desheng Li: Funding acquisition (equal); Investigation (equal). Hao Li: Data curation (equal); Formal analysis (equal). Pengfei Cui: Data curation (equal); Formal analysis (equal). Danyu Chen: Data curation (equal). Hong‐Ning Wang: Supervision (equal). Xin Yang: Writing‐original draft (equal); Writing‐review & editing (equal).

ETHICS STATEMENT

The animal study was reviewed and approved by the Sichuan University Animal Ethics Committee.

68 in total

1. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

Authors: Weizhong Li; Adam Godzik
Journal: Bioinformatics Date: 2006-05-26 Impact factor: 6.937

2. Integrative analysis of environmental sequences using MEGAN4.

Authors: Daniel H Huson; Suparna Mitra; Hans-Joachim Ruscheweyh; Nico Weber; Stephan C Schuster
Journal: Genome Res Date: 2011-06-20 Impact factor: 9.043

3. Ocean plankton. Structure and function of the global ocean microbiome.

Authors: Shinichi Sunagawa; Luis Pedro Coelho; Samuel Chaffron; Jens Roat Kultima; Karine Labadie; Guillem Salazar; Bardya Djahanschiri; Georg Zeller; Daniel R Mende; Adriana Alberti; Francisco M Cornejo-Castillo; Paul I Costea; Corinne Cruaud; Francesco d'Ovidio; Stefan Engelen; Isabel Ferrera; Josep M Gasol; Lionel Guidi; Falk Hildebrand; Florian Kokoszka; Cyrille Lepoivre; Gipsi Lima-Mendez; Julie Poulain; Bonnie T Poulos; Marta Royo-Llonch; Hugo Sarmento; Sara Vieira-Silva; Céline Dimier; Marc Picheral; Sarah Searson; Stefanie Kandels-Lewis; Chris Bowler; Colomban de Vargas; Gabriel Gorsky; Nigel Grimsley; Pascal Hingamp; Daniele Iudicone; Olivier Jaillon; Fabrice Not; Hiroyuki Ogata; Stephane Pesant; Sabrina Speich; Lars Stemmann; Matthew B Sullivan; Jean Weissenbach; Patrick Wincker; Eric Karsenti; Jeroen Raes; Silvia G Acinas; Peer Bork
Journal: Science Date: 2015-05-22 Impact factor: 47.728

4. Gut microbiome development along the colorectal adenoma-carcinoma sequence.

Authors: Qiang Feng; Suisha Liang; Huijue Jia; Andreas Stadlmayr; Longqing Tang; Zhou Lan; Dongya Zhang; Huihua Xia; Xiaoying Xu; Zhuye Jie; Lili Su; Xiaoping Li; Xin Li; Junhua Li; Liang Xiao; Ursula Huber-Schönauer; David Niederseer; Xun Xu; Jumana Yousuf Al-Aama; Huanming Yang; Jian Wang; Karsten Kristiansen; Manimozhiyan Arumugam; Herbert Tilg; Christian Datz; Jun Wang
Journal: Nat Commun Date: 2015-03-11 Impact factor: 14.919

5. A metagenome-wide association study of gut microbiota in type 2 diabetes.

Authors: Junjie Qin; Yingrui Li; Zhiming Cai; Shenghui Li; Jianfeng Zhu; Fan Zhang; Suisha Liang; Wenwei Zhang; Yuanlin Guan; Dongqian Shen; Yangqing Peng; Dongya Zhang; Zhuye Jie; Wenxian Wu; Youwen Qin; Wenbin Xue; Junhua Li; Lingchuan Han; Donghui Lu; Peixian Wu; Yali Dai; Xiaojuan Sun; Zesong Li; Aifa Tang; Shilong Zhong; Xiaoping Li; Weineng Chen; Ran Xu; Mingbang Wang; Qiang Feng; Meihua Gong; Jing Yu; Yanyan Zhang; Ming Zhang; Torben Hansen; Gaston Sanchez; Jeroen Raes; Gwen Falony; Shujiro Okuda; Mathieu Almeida; Emmanuelle LeChatelier; Pierre Renault; Nicolas Pons; Jean-Michel Batto; Zhaoxi Zhang; Hua Chen; Ruifu Yang; Weimou Zheng; Songgang Li; Huanming Yang; Jian Wang; S Dusko Ehrlich; Rasmus Nielsen; Oluf Pedersen; Karsten Kristiansen; Jun Wang
Journal: Nature Date: 2012-09-26 Impact factor: 49.962

6. Gut metagenome in European women with normal, impaired and diabetic glucose control.

Authors: Fredrik H Karlsson; Valentina Tremaroli; Intawat Nookaew; Göran Bergström; Carl Johan Behre; Björn Fagerberg; Jens Nielsen; Fredrik Bäckhed
Journal: Nature Date: 2013-05-29 Impact factor: 49.962

7. Richness of human gut microbiome correlates with metabolic markers.

Authors: Emmanuelle Le Chatelier; Trine Nielsen; Junjie Qin; Edi Prifti; Falk Hildebrand; Gwen Falony; Mathieu Almeida; Manimozhiyan Arumugam; Jean-Michel Batto; Sean Kennedy; Pierre Leonard; Junhua Li; Kristoffer Burgdorf; Niels Grarup; Torben Jørgensen; Ivan Brandslund; Henrik Bjørn Nielsen; Agnieszka S Juncker; Marcelo Bertalan; Florence Levenez; Nicolas Pons; Simon Rasmussen; Shinichi Sunagawa; Julien Tap; Sebastian Tims; Erwin G Zoetendal; Søren Brunak; Karine Clément; Joël Doré; Michiel Kleerebezem; Karsten Kristiansen; Pierre Renault; Thomas Sicheritz-Ponten; Willem M de Vos; Jean-Daniel Zucker; Jeroen Raes; Torben Hansen; Peer Bork; Jun Wang; S Dusko Ehrlich; Oluf Pedersen
Journal: Nature Date: 2013-08-29 Impact factor: 49.962

Analysis of the vaginal microbiome of giant pandas using metagenomics sequencing.

INTRODUCTION

MATERIALS AND METHODS

Samples

DNA extraction and metagenomic sequencing

Analysis of samples

Metagenomic sequencing

Library construction

Pretreatment of sequencing results

Metagenome assembly

Gene prediction and abundance analysis

Taxonomy prediction

Common functional database annotation

RESULTS

Sequencing data

Gene prediction and abundance analysis

Species annotation

Specimen annotation overview

Dimensionality analysis of species abundance

Cluster analysis of species abundance and Metastats analysis of differential species between groups

LEfSe analysis of different species between groups

Functional annotation and analysis

DISCUSSION

CONCLUSIONS

CONFLICTS OF INTERESTS

AUTHOR CONTRIBUTION

ETHICS STATEMENT

1. Cd-hit: a fast program for clustering and comparing large sets of protein or nucleotide sequences.

2. Integrative analysis of environmental sequences using MEGAN4.

3. Ocean plankton. Structure and function of the global ocean microbiome.

4. Gut microbiome development along the colorectal adenoma-carcinoma sequence.

5. A metagenome-wide association study of gut microbiota in type 2 diabetes.

6. Gut metagenome in European women with normal, impaired and diabetic glucose control.

7. Richness of human gut microbiome correlates with metabolic markers.

8. Isolation of different species of Candida in patients with vulvovaginal candidiasis from sari, iran.

9. Biogeography and individuality shape function in the human skin metagenome.

10. CD-HIT: accelerated for clustering the next-generation sequencing data.

Review 1. Update on the Neisseria Macrophage Infectivity Potentiator-Like PPIase Protein.

2. Heat Stress Altered the Vaginal Microbiome and Metabolome in Rabbits.

Review 3. Interactions between reproductive biology and microbiomes in wild animal species.