| Literature DB >> 29378630 |
Raivo Kolde1, Eric A Franzosa2,3, Gholamali Rahnavard2,3, Andrew Brantley Hall3, Hera Vlamakis3, Christine Stevens3, Mark J Daly3,4, Ramnik J Xavier5,6,7, Curtis Huttenhower8,9.
Abstract
BACKGROUND: Despite the increasing recognition that microbial communities within the human body are linked to health, we have an incomplete understanding of the environmental and molecular interactions that shape the composition of these communities. Although host genetic factors play a role in these interactions, these factors have remained relatively unexplored given the requirement for large population-based cohorts in which both genotyping and microbiome characterization have been performed.Entities:
Keywords: Association studies; Human Microbiome Project; Human genome sequence; Microbiome and human genetics; Microbiome metagenome sequence
Mesh:
Year: 2018 PMID: 29378630 PMCID: PMC5789541 DOI: 10.1186/s13073-018-0515-8
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Fig. 1Overview of the Human Microbiome Project host genome and metagenome coverage. Sequencing depth for each host genome (left) and number of reads for all available samples with whole metagenome sequencing
Fig. 2Distribution of genetic variants and comparison with other cohorts. a Discovered variants categorized by frequency and overlap with other cohorts. AC allele count, MAF minor allele frequency. b Distribution of the number of coding mutations by frequency and estimated impact
Fig. 3Correlation between high-level genetic variation and microbiome composition. a The first two components of the genetic principal component analysis are shown, based on common single nucleotide variants, overlaid by self-reported donor ethnicity. AA African-American. b Shown is how much variance in microbiome data on average can be explained by the genetic principal components, when compared to permutation on the same data. Values shown are Z-scores based on permutations, which were also used to calculate empirical p values. c Distribution of genetic principal component R2 values for different species and pathways in stool. Y-axis shows the variance explained, and the X-axis shows permutation-based empirical p values for each of those numbers. Only the names of species with false discovery rate (FDR) < 0.05 and pathways’ FDR < 0.01 are shown. The histogram below displays the distribution of empirical p values, and the Y-axis shows the number of species in a bin. Green bars under the pathway histogram show how the pathways that are associated with fermentation are ranked by R2
Fig. 4Kinship and microbiome similarity and replication of known associations. a Bray-Curtis similarity between the 12 pairs of close relatives (third degree or closer) identified from genetic data compared to similarities between other pairs. The p values correspond to results of t tests between similarity scores for relatives, against all other pairs. b Association between FUT2 secretor variant and B. longum. c Association between genetic variant rs4988235 near the LCT gene and B. longum. In both b and c we display log10 transformed relative abundance