| Literature DB >> 25211071 |
.
Abstract
Much has been learned about the diversity and distribution of human-associated microbial communities, but we still know little about the biology of the microbiome, how it interacts with the host, and how the host responds to its resident microbiota. The Integrative Human Microbiome Project (iHMP, http://hmp2.org), the second phase of the NIH Human Microbiome Project, will study these interactions by analyzing microbiome and host activities in longitudinal studies of disease-specific cohorts and by creating integrated data sets of microbiome and host functional properties. These data sets will serve as experimental test beds to evaluate new models, methods, and analyses on the interactions of host and microbiome. Here we describe the three models of microbiome-associated human conditions, on the dynamics of preterm birth, inflammatory bowel disease, and type 2 diabetes, and their underlying hypotheses, as well as the multi-omic data types to be collected, integrated, and distributed through public repositories as a community resource.Entities:
Mesh:
Year: 2014 PMID: 25211071 PMCID: PMC5109542 DOI: 10.1016/j.chom.2014.08.014
Source DB: PubMed Journal: Cell Host Microbe ISSN: 1931-3128 Impact factor: 31.316
Summary of Biospecimens, Primary Data, and Derived Properties to Be Collected for Each iHMP Cohort Study with Repositories for Primary Data
| Source of Property | Property Derived from Primary Data | Primary Data from Biospecimen | Biospecimen from Preterm Birth Cohort | Biospecimen from IBD Cohort | Biospecimen from Prediabetic Cohort | Repository for Primary Data |
|---|---|---|---|---|---|---|
| Microbiome | microbial community composition | 16S rRNA gene survey | cervical | stool | anterior nares, stool | SRA |
| microbial community composition | whole metagenome shotgun sequences | vaginal | stool | anterior nares, stool | SRA | |
| predictions of microbial genes, metabolic pathways | whole metagenome shotgun sequences | vaginal | stool | anterior nares, stool | SRA | |
| RNA transcript profiles | whole metatranscriptome shotgun sequences | vaginal | stool | anterior nares, stool | SRA | |
| microbiome metaproteome profiles | LC-MS/MS peptide profiles | – | stool | stool | EBI PRIDE and/or Peptide Atlas | |
| viral community composition | whole virome shotgun sequences | – | stool | anterior nares, stool | SRA | |
| bacterial cultures | bacterial isolates | cervical | – | – | ATCC/BEI | |
| bacterial whole genome sequences | bacterial isolates | cervical | – | – | SRA | |
| bacterial whole genome sequences | bacterial single-cell sequences | – | stool | – | SRA | |
| single-cell bacterial RNA transcript profiles | single-cell bacterial transcript sequences | – | stool | – | SRA | |
| Host | subject exome/whole genome | subject genome sequences | blood (future | blood | blood | dbGaP/SRA |
| RNA transcript profiles | whole transcriptome sequences | vaginal | colon biopsy | PBMCs | dbGaP/SRA and GEO | |
| subject protein profiles | LC-MS/MS peptide profiles | – | stool | PBMCs, serum (future) | EBI PRIDE and/or Peptide Atlas | |
| systemic inflammation levels | cytokine profiles | vaginal | blood | plasma | Study DB | |
| intestinal epithelial cell cultures | intestinal epithelial cell isolates | – | colon biopsy | – | – | |
| subject DNA methylation profiles | reduced representation bisulfite sequencing (RRBS) profiles | – | blood | PBMC (future) | SRA | |
| intestinal inflammation levels | fecal calprotectin protein concentrations | – | stool | – | Study DB | |
| serum antibody composition and levels | serology | – | blood | – | Study DB | |
| subject contaminating genome sequences | human sequence from unfiltered total microbial community sequences | vaginal | stool | stool | dbGaP/SRA | |
| subject profiles for cohort | subject phenotypes, clinical metadata, medical panels | collected on each subject in the study | collected on each subject in the study | collected on each subject in the study | dbGaP | |
| Global (host and microbiome) | protein-protein interaction network between host and microbiome | yeast two-hybrid binary protein complexes | vaginal | – | – | EBI IntAct |
| pathway-level crosstalk between host and gut microbiome | intestinal epithelial cell profiling response to bacterial metabolites | – | colon biopsy | – | dbGaP/SRA and GEO | |
| global metabolite profiles | untargeted and targeted LC-MS metabolomic profiles | – | stool | urine, plasma | Metabolomics Workbench | |
| global lipid profiles | untargeted and targeted LC-MS metabolomic profiles | vaginal | stool | urine, serum | Metabolomics Workbench |
From mothers only
Samples collected for possible future analysis
Glossary of Terms Used to Describe Primary Data in the Paper, in the Figures, and in Table 1.
| Primary Data Type | Definition |
|---|---|
| 16S rRNA gene survey | Sequence-based analysis of 16S ribosomal RNA gene in total DNA extracts; the data are used to develop microbial community compositional profiles. |
| Whole metagenome shotgun sequences | Sequence-based analysis of all genes in total DNA extracts; the data are used to develop microbial community compositional, functional, and genomic profiles. |
| Whole metatranscriptome shotgun sequences | Sequence-based analysis of RNA transcripts in a microbial community by converting RNA in total RNA extracts to complementary DNA and sequencing the cDNA. |
| Whole virome shotgun sequences | Sequence-based analysis of all virus genes and genomes used to develop viral community profiles. These sequence data are derived from two approaches, either by first isolating viral-sized particles from a sample and then sequencing this fraction, or by sequencing the DNA (or cDNA from RNA) extract and computationally determining DNA and RNA viral sequences in the metagenome. |
| Bacterial isolates | Isolation and cultivation of a specific bacterium in a mixed community through the use of selective media or enrichment techniques to preferentially grow one bacterium of interest. |
| Single-cell genome sequences | Physical separation or physical enrichment of a single microbial cell from a mixed population of cells. Sequence-based analysis of the single cell’s genome is conducted using a DNA random priming method to first increase the total DNA concentration in the cell and subsequently sequence this amplified DNA. |
| Single-cell RNA sequences | Sequence-based analysis of RNA transcripts in a cell by converting RNA in total RNA extracts to complementary DNA and sequencing the cDNA. |
| LC-MS/MS peptide profiles | Analysis of individual peptides in a mixture of proteins by a combination of liquid chromatography and mass spectroscopy after enzymatic digestion of the protein mixture. To derive the microbiome metaproteome, data from the companion metagenome is used along with KEGG and other protein databases to verify the microbial proteins in the total protein mixture. To derive the host proteins, a similar approach is used based on search against the human genome. |
| Human subject whole genome sequences | Analysis of the sequence of the host genome using high-throughput DNA sequencing. |
| Whole exome sequences | Analysis of the protein coding regions of the host genome, which is generally done through sequence analysis. |
| Whole transcriptome sequences | Sequence-based analysis of RNA transcripts in host tissue using polyA tail separation of eukaryotic transcripts from a mixture of transcripts, converting the RNA to complementary DNA and sequencing the cDNA. |
| Cytokines | Circulating immune system glycoproteins in blood, plasma, or other bodily fluids are measured through an ELISA assay; data are used as a marker for systemic inflammation. |
| DNA methylation profiles | Measurement of methylated regions of the host genome. This method uses a combination of restriction enzymes and a reduced representation bisulfite sequencing (RRBS) method to enrich for regions of the genome with cytosine-guanine pairs; these regions are then screened for methylated bases. |
| Fecal calprotectin proteins | Fecal calprotectin is a protein found in stool, the concentration of which is measured through a standardized immunoassay method and is used to evaluate levels of intestinal inflammation. |
| Serology | Measurement of antibodies for specific pathogens or protein markers conducted with serum through an ELISA assay. |
| Human subject contaminating genome sequences | Sequence from total nucleotide extracts from most microbially focused sample types yields a combination of host and microbial sequence data. A computational filtering step, using human genome reference sequence, will separate microbiome from human sequences. |
| Interactomes | Analysis of protein-protein interactions between members of the microbiota or between the microbiota and the host. |
| Intestinal epithelial cell profiles | Functional or sequence-based readouts of phenotypes for cell lines derived from primary epithelial cells from individual hosts. |
| Metabolomes | Measure of metabolomic profiles using untargeted and targeted LC-MS methods. |
| Lipidomes | Measure of lipid profiles using 2D UPLC-ESI-MS/MS (ultra performance liquid chromatography combined with electrospray ionization tandem mass spectroscopy) with an initial focus on AA (arachidonic acid) and eicosanoids. |
Figure 1Integrative Multi-Omic Analysis of the Vaginal and Related Microbiomes in Pregnancy: Sample Collection, Assays, and Data Generation Workflow
Samples from pregnant women and neonates will be collected at clinics associated with Virginia Commonwealth University and the Global Alliance to Prevent Prematurity and Stillbirth (GAPPS). Health questionnaires will be administered and samples collected from multiple body sites over multiple visits throughout pregnancy, at delivery, at discharge, and at follow-up visits. Neonates will be sampled at delivery, discharge, and follow-up visits. A multi-omic approach will probe properties of the host and microbial communities to generate an integrative, longitudinal, and comprehensive data set of 16S rRNA gene surveys, mass spectrometry-based lipidomic profiles, and cytokine assays. A subset of samples will be subjected to whole metagenome and metatranscriptome sequencin for cultivation and isolation of bacterial strains for genome sequencing, and for generation of interactome maps.
Figure 2Characterizing the Gut Microbial Ecosystem for Diagnosis and Therapy in Inflammatory Bowel Disease: Sample Collection, Assay, and Data Generation Workflow
Samples from Crohn’s disease patients, ulcerative colitis patients, and non-IBD controls are collected at Massachusetts General Hospital (adult new onset), Emory University (pediatric new onset), Cincinnati Children’s Hospital (pediatric new onset), and Cedars-Sinai Medical Center (adult established). From each patient, three different types of samples are collected: longitudinal stool samples, periodic biopsies, and regularly scheduled blood samples. Biopsies are collected as clinically indicated, blood during clinical visits, and stool samples are self-collected by participants at home and shipped directly to a centralized handling and aliquotting pipeline. Multi-omic data generation (primarily, but not entirely, nucleotide sequence- and mass spectroscopy-based) provides microbial, host, and mixed profiles including 16S rRNA gene surveys, whole metagenome and metatranscriptome shotgun sequences, metabolite and protein profiles, single-cell assays, whole virome shotgun sequences, and serological profiles. Each sample is further accompanied by clinical (bloods/biopsies) or self-reported environmental and dietary (stools) metadata.
Figure 3Microbiome and Host Changes during Respiratory and Other Stress Conditions in Individuals at Risk for Type 2 Diabetes: Sample Collection, Assay, and Data Generation Workflow
All samples are collected at the Stanford Clinical & Translational Research Unit. From each patient in every visit, blood sample and microbiome sample (including nasal swabs and stool and urine samples) are collected. Multi-omic data generation (primarily, but not entirely, nucleotide sequence- and mass spectroscopy-based) will provide profiles of microbial phylogenetic composition, metagenomes, metatranscriptomes, and metaproteomes; host protein profiles, cytokines, and autoantibodies; and global metabolome profiles. Each sample is further accompanied by clinical (blood) or self-reported stress level, environmental, and dietary (stool and urine) metadata.