Literature DB >> 32939371

Metagenome datasets from women with polycystic ovary syndrome from Irkutsk, Eastern Siberia, Russia.

Natalia Belkova1, Elizaveta Klimenko1, Naliia Vilson1, Alexsandra Sambialova1, Daria Markova1, Ilia Igumnov1, Larisa Suturina1.   

Abstract

For the metagenomic characterization of potential taxonomic and functional diversity of microorganisms associated with polycystic ovary syndrome (PCOS) in women, we surveyed five women with PCOS and collected samples of feces, saliva, and serum. After quality processing, we have obtained from 915,594 to 3,880,379 reads; these 16,693 sequences had ribosomal RNA genes, 2,091,990 sequences contained predicted proteins with known functions, and 3,750,261 sequences had predicted proteins with unknown functions. Host DNA accounted for ca. 0.03% and less in datasets of fecal samples, from 1.41 to 24.94% in saliva samples; the remaining sequences were attributed to archaeal, bacterial, or viral DNA. In serum, from 38.18 to 75.77% were characterized as fragments of the human genome, but the remaining sequences were unidentified. Among microbes, a total of one archaeal and eight bacterial phyla were revealed. Viral DNA was detected in several fecal and one saliva sample and was classified as C2likevirus, Flavivirus, and Streptococcus bacteriophage. The metagenome sequence data were deposited at NCBI SRA as BioProject No. PRJNA625611.
© 2020 Elsevier Inc.

Entities:  

Keywords:  Feces; Gut; Human microbiome; Metagenomics; PCOS; Polycystic ovary syndrome; Saliva; Serum; Women

Year:  2020        PMID: 32939371      PMCID: PMC7479322          DOI: 10.1016/j.dib.2020.106137

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table

Value of the Data

These are the first shotgun metagenome datasets on feces, saliva, and serum sampled from women with polycystic ovary syndrome (PCOS) from the Eastern Siberia of Russia. The data provides valuable information about the diversity and functional potential of archaeal, bacterial, and dsDNA viral communities in the gut and saliva microbiome of women with PCOS. These data are useful for the definition of human microbiome peculiarities associated with PCOS. Raw sequence data can be used for various additional bioinformatics processing.

Data Description

Polycystic ovary syndrome (PCOS) is the most frequent endocrine and metabolic disorder in premenopausal women [1]. Direct interactions of the gut microbiome (GM) and PCOS have been shown in animal model studies [2]. In contrast, the peculiarities of GM associated with PCOS have been studied for female adolescents and women of reproductive years [2,3]. In recent years, pronounced dysbiosis of gut microbial communities has been shown in metabolic diseases (obesity and insulin resistance) associated with PCOS [4,5]. Although the search for correlations between the GM and metabolic diseases has become a hot point for recent research, the geography of the studies on PCOS is very narrow and included mainly Chinese studies. A few studies have been carried out in the USA, Austria, and Spain [2], [3], [4], [5], [6], [7], [8]. The dataset contains raw sequencing data obtained by the shotgun sequencing of feces, saliva, and serum from five women from Irkutsk, Eastern Siberia, Russia. The data files (reads in FASTQ format) were deposited to NCBI SRA as BioProject No. PRJNA625611. Raw data contained 38,261,075 reads totaling 577,737,325 bp with an average length of 151 bps, 127,895 reads failed to pass the QC pipeline. While of the sequences that passed QC, 16,693 sequences had ribosomal RNA genes, 2,091,990 sequences contained predicted proteins with known functions, and 3,750,261 sequences had predicted proteins with unknown functions. Fecal samples contained from 1,275,709 to 3,044,387 sequences, saliva, from 1,701,648 to 3,880,379 sequences, whereas serum contained from 915,594 to 2,765,266 sequences. Host DNA accounted for ca. 0.03% and less in datasets of fecal samples, from 1.41 to 24.94% in saliva samples. The remaining sequences were annotated to archaeal, bacterial, or viral DNA (Table 1). In serum from 38.18 to 75.77% of sequences were characterized as fragments of the human genome, but the remaining sequences were unidentified.
Table 1.

Annotation of datasets from the feces, saliva, and serum from women with PCOS represented by Metaphlan2.

Sample IDHuman sequences (%)Sequences annotated to microorganisms (%)
AchaeaEubacteriaVirus
Feces
48-f1*0.003099.9970
49-f10.010.1999.790
50-f10.01099.920.06
51-f10.030.2399.660.07
48-f2**0.01099.990
49-f20.010.0499.940
50-f20.01099.680.30
52-f200.2799.720
Saliva
48-sl3.21096.790
49-sl4.46095.540
50-sl1.41098.590
51-sl24.94075.060
52-sl1.95094.953.09
Serum
48-sr51.91NDNDND
49-sr75.77NDNDND
50-sr67.48NDNDND
51-sr38.18NDNDND

Comments: * fecal samples with a marker –f1 were isolated with HostZERO Microbial DNA Kit (Zymo Research, USA), ** fecal samples with a marker –f2 were isolated with Quick-DNA Fecal/Soil Microbe Miniprep Kit (Zymo Research, USA); ND, not detected.

Annotation of datasets from the feces, saliva, and serum from women with PCOS represented by Metaphlan2. Comments: * fecal samples with a marker –f1 were isolated with HostZERO Microbial DNA Kit (Zymo Research, USA), ** fecal samples with a marker –f2 were isolated with Quick-DNA Fecal/Soil Microbe Miniprep Kit (Zymo Research, USA); ND, not detected. Metaphlan2 allowed the revealing of a total of one archaeal and eight bacterial phyla in the gut and saliva of women with PCOS. Information about the structure of microbial communities in gut and saliva metagenome is presented in Fig. 1. Abundant genera in the gut metagenome were Faecalibacterium, Bacteroides, Dialister, Prevotella, Alistipes, and Subdoligranulum, representing from 8 to 20% of total sequences. At the same time, Prevotella, Veillonella, Neisseria, Haemophilus, and Porphyromonas were dominant in the saliva metagenome and varied from 5 to 31%. Viral DNA was detected in several fecal and one saliva sample and was classified as C2likevirus, Flavivirus, and Streptococcus bacteriophage.
Fig. 1

The structure of microbial communities in gut (A) and saliva (B) metagenome.

The structure of microbial communities in gut (A) and saliva (B) metagenome. These are the first datasets on the diversity of archaeal, bacterial, and dsDNA viral communities in the gut and saliva metagenome of women with PCOS from Eastern Siberia of Russia, based on Illumina sequencing technology. Datasets were deposited to the SRA NCBI database as BioProject No. PRJNA625611.

Experimental design, materials, and methods

Study cohort

Feces, saliva, and serum were sampled from women referred to the outpatient department (“Center of Innovative Medicine”) of the Scientific Center for Family Health and Human Reproduction Problems (Irkutsk). The datasets comprised participants who met the inclusion criteria and did not meet the exclusion criteria. Inclusion criteria were as follows: premenopausal women with a voluntary signed informed consent and the presence of any two or three PCOS criteria (hyperandrogenism, oligo-anovulation, and polycystic ovarian morphology by pelvic ultrasound) [9]. The exclusion criteria included: hyperprolactinemia, hypothyroidism, none-classic adrenal hyperplasia (NCAH), tumors, obesity, diabetes, glucose intolerance, intake of oral contraceptives, and other hormonal medicines, as well as antimicrobial preparations and insulin sensitizers. General characteristics of the patients included in the datasets are shown in Table 2. The data collection was approved by the Ethics Committee of SCFHHRP (protocol N 6 from 06.09.2019).
Table 2

The clinical characteristics of studied participants.

Clinic parameterPatient 1Patient 2Patient 3Patient 4Patient 5
Age, years2924232420
BMI, kg/m219.1322.820.021.821.97
SBP/DBP, mm Hg120/80128/79114/70127/79127/72
Oligo-anovulation (+/–)+++++
Hyperandrogenemia (+/–)+(TT ↑; FAI ↑; DHEAS ↑)+(FT ↑)+(TT ↑; DHEAS ↑)+(TT ↑)+(TT ↑; FT ↑; DHEAS ↑)
Hirsutism (+/–)+(FG score = 5)(FG score = 2)+(FG score = 6)+(FG score = 7)+(FG score = 5)
PCOM (+/–)+(AFC > 25;OV = 21; 19.7 mm3)+(AFC = 15;OV=8.7; 10.2 mm3)+(AFC > 20;OV=24; 22 mm3)+(AFC = 28; 32OV=19.7; 22 mm3)(AFC = 10; 10OV=8.58; 9.94 mm3)

Comments: BMI, body mass index; PCOM, polycystic ovarian morphology; TT, total testosterone; FT, free testosterone; FAI, free androgen index; DHEAS, dehydroepiandrosterone sulfate; FG, Ferriman–Gallwey score; AFC, antral follicle count; OV, ovarian volume.

The clinical characteristics of studied participants. Comments: BMI, body mass index; PCOM, polycystic ovarian morphology; TT, total testosterone; FT, free testosterone; FAI, free androgen index; DHEAS, dehydroepiandrosterone sulfate; FG, Ferriman–Gallwey score; AFC, antral follicle count; OV, ovarian volume.

Sample collection and treatment

Fecal sampling and total DNA isolation were conducted according to the requirements of International Human Microbiome Standards, IHMS. Patients were instructed in the proper feces and saliva sample collection. Patients were provided with refrigerants for cold transportation of biomaterial to the laboratory within 2 h after collection. For fecal samples, the instruction was done to prevent contamination with urine and toilet water. Saliva samples were collected in the morning after an overnight fast. Patients were instructed not to brush their teeth and to drink only water before saliva sampling. Saliva was collected in the mouth for several minutes and then voided into sterile plastic tubes. This process was repeated until the desired volume of 1–2 ml was reached. Saliva samples were immediately cooled on ice and transported to the laboratory. In the laboratory, all samples were aliquoted, one of which was used directly for DNA extraction; the others were frozen and stored at ‒80 °C. Venous blood was collected in the morning after an overnight fast from the ulnar vein into a tube with SiO2. The sample was evaluated for hemolysis. A sample of the patients without hemolysis was defecated at room temperature for 30 min until a clot formed completely. Then, centrifugation was performed at 3000 rpm for 15 min. The resulting serum was poured into a sterile tube and stored in a refrigerator; DNA was isolated within 2 h after sampling. Total DNA from fecal and saliva samples was extracted using HostZERO Microbial DNA Kit (Zymo Research, USA), which supplies a step to allow the elimination of host DNA. Additionally, fecal samples were treated with the Quick-DNA Fecal/Soil Microbe Miniprep Kit (Zymo Research, USA). The differences in the host and bacterial DNA content between the fecal samples treated with two commercial kits presented in Table 1 and Fig. 2. Total DNA from serum was isolated using AmpliSens® DNA-sorb-B (InterLabService, Russia). DNA quality was tested by 0.8% agarose gel electrophoresis, while the concentration was determined using a Qubit 4 Fluorometer (Thermo Scientific, USA).
Fig. 2

Heat map of bacterial communities structure of gut samples. DNA was isolated with two commercial kits.

Heat map of bacterial communities structure of gut samples. DNA was isolated with two commercial kits.

Library preparation and sequencing

Paired-end libraries were prepared using a Nextera XT DNA Library Preparation Kit, and shotgun sequencing was done using a NextSeq 500/550 High Output Kit (300 Cycles) with the Illumina NextSeq 550 platform according to the manufacturer instruction.

Analysis of metagenome datasets

Following quality control using FastQC v0.11.8 [10], sequences were trimmed (Trimmomatic [11]) and merged. The host sequence was detected and filtered using BWA [12] and SAMtools [13]. Analysis and taxonomy annotation of output data was performed through Metaphlan2 [14] (Table 1) and Krona [15] (Fig. 1).

Declaration of Competing Interest

The authors declare that they have no known competing for financial interests or personal relationships which have, or could be perceived to have, influenced the work reported in this article.
SubjectMicrobiology
Specific subject areaMetagenomics
Type of dataTableFigureMetagenome sequences
How data were acquiredShotgun DNA sequencing using Illumina NextSeq 550 platform.
Data formatRaw dataAnalyzed
Parameters for data collectionFeces, saliva, and serum sampled from five untreated women with PCOS, identified at the outpatient department of the Scientific Center for Family Health and Human Reproduction Problems (SCFHHRP) between September and November 2019.PCOS was verified by the presence of at least two criteria from hyperandrogenism, oligo-anovulation, and polycystic ovarian morphology. The data collection was approved by the Ethics Committee of the SCFHHRP.
Description of data collectionTotal DNA from fecal and saliva samples was extracted using HostZERO Microbial DNA Kit (Zymo Research, USA). Additionally, fecal samples were treated with the Quick-DNA Fecal/Soil Microbe Miniprep Kit (Zymo Research, USA). Total DNA from serum was isolated using AmpliSens® DNA-sorb-B (InterLabService, Russia). Paired-end libraries were prepared using a Nextera XT DNA Library Preparation Kit, and shotgun sequencing was done using NextSeq 500/550 High Output Kit (300 Cycles) with the Illumina NextSeq 550 platform.
Data source locationInstitution: Scientific Center for Family Health and Human Reproduction ProblemsCity: IrkutskCountry: Russia
Data accessibilityRaw data were deposited to NCBIRepository name: SRAData identification number: BioProject PRJNA625611, BioSamples from SAMN14603886 to SAMN14603890Direct URL to data: https://www.ncbi.nlm.nih.gov/bioproject/PRJNA625611
  14 in total

Review 1.  Criteria, prevalence, and phenotypes of polycystic ovary syndrome.

Authors:  Daria Lizneva; Larisa Suturina; Walidah Walker; Soumia Brakta; Larisa Gavrilova-Jordan; Ricardo Azziz
Journal:  Fertil Steril       Date:  2016-05-24       Impact factor: 7.329

2.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

3.  Gut Microbial Diversity in Women With Polycystic Ovary Syndrome Correlates With Hyperandrogenism.

Authors:  Pedro J Torres; Martyna Siakowska; Beata Banaszewska; Leszek Pawelczyk; Antoni J Duleba; Scott T Kelley; Varykina G Thackray
Journal:  J Clin Endocrinol Metab       Date:  2018-04-01       Impact factor: 5.958

4.  Structural and functional profiles of the gut microbial community in polycystic ovary syndrome with insulin resistance (IR-PCOS): a pilot study.

Authors:  Bo Zeng; Zhiwen Lai; Lijin Sun; Zhongbao Zhang; Jianhua Yang; Zaixin Li; Jie Lin; Zhi Zhang
Journal:  Res Microbiol       Date:  2018-10-04       Impact factor: 3.992

5.  Gut Microbiota and the Polycystic Ovary Syndrome: Influence of Sex, Sex Hormones, and Obesity.

Authors:  María Insenser; Mora Murri; Rosa Del Campo; M Ángeles Martínez-García; Elena Fernández-Durán; Héctor F Escobar-Morreale
Journal:  J Clin Endocrinol Metab       Date:  2018-07-01       Impact factor: 5.958

6.  Interactive metagenomic visualization in a Web browser.

Authors:  Brian D Ondov; Nicholas H Bergman; Adam M Phillippy
Journal:  BMC Bioinformatics       Date:  2011-09-30       Impact factor: 3.307

7.  Alterations in Gut Microbiome Composition and Barrier Function Are Associated with Reproductive and Metabolic Defects in Women with Polycystic Ovary Syndrome (PCOS): A Pilot Study.

Authors:  Lisa Lindheim; Mina Bashir; Julia Münzker; Christian Trummer; Verena Zachhuber; Bettina Leber; Angela Horvath; Thomas R Pieber; Gregor Gorkiewicz; Vanessa Stadlbauer; Barbara Obermayer-Pietsch
Journal:  PLoS One       Date:  2017-01-03       Impact factor: 3.240

8.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

9.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

10.  Characteristic gut microbiota and predicted metabolic functions in women with PCOS.

Authors:  Ling Zhou; Zhexin Ni; Wen Cheng; Jin Yu; Shuai Sun; Dongxia Zhai; Chaoqin Yu; Zailong Cai
Journal:  Endocr Connect       Date:  2020-01       Impact factor: 3.335

View more
  1 in total

1.  Metagenomic-based characterization of the gut virome in patients with polycystic ovary syndrome.

Authors:  Liansha Huang; Xiaoling Wu; Shumin Guo; Ying Lv; Peng Zhou; Guangrong Huang; Zuzhen Duan; Wen Sun
Journal:  Front Microbiol       Date:  2022-08-16       Impact factor: 6.064

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.