| Literature DB >> 28714965 |
Nathan D Price1,2, Andrew T Magis2, John C Earls2, Gustavo Glusman1, Roie Levy1, Christopher Lausted1, Daniel T McDonald1, Ulrike Kusebauch1, Christopher L Moss1, Yong Zhou1, Shizhen Qin1, Robert L Moritz1, Kristin Brogaard2, Gilbert S Omenn1,3, Jennifer C Lovejoy1,2, Leroy Hood1,4.
Abstract
Personal data for 108 individuals were collected during a 9-month period, including whole genome sequences; clinical tests, metabolomes, proteomes, and microbiomes at three time points; and daily activity tracking. Using all of these data, we generated a correlation network that revealed communities of related analytes associated with physiology and disease. Connectivity within analyte communities enabled the identification of known and candidate biomarkers (e.g., gamma-glutamyltyrosine was densely interconnected with clinical analytes for cardiometabolic disease). We calculated polygenic scores from genome-wide association studies (GWAS) for 127 traits and diseases, and used these to discover molecular correlates of polygenic risk (e.g., genetic risk for inflammatory bowel disease was negatively correlated with plasma cystine). Finally, behavioral coaching informed by personal data helped participants to improve clinical biomarkers. Our results show that measurement of personal data clouds over time can improve our understanding of health and disease, including early transitions to disease states.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28714965 PMCID: PMC5568837 DOI: 10.1038/nbt.3870
Source DB: PubMed Journal: Nat Biotechnol ISSN: 1087-0156 Impact factor: 54.908
Figure 1Types of longitudinal data collected
(A) Timeline of important events in the P100. (B) Schematic of the data collected every three months throughout the study.
Figure 2Top 100 correlations per pair of data types
Subset of top statistically-significant Spearman inter-omic cross-sectional correlations between all datasets collected in our cohort. Each line represents one correlation that was significant after adjustment for multiple hypothesis testing using the method of Benjamini and Hochberg[10] at p<0.05. The mean of all three time points was used to compute the correlations between analytes. Up to 100 correlations per pair of data types are shown in this figure. See Supplementary Figure 1 for the complete inter-omic cross-sectional network.
Figure 3Cardiometabolic community
All vertices and edges of the cardiometabolic community, with lines indicating significant (p<0.05) correlations. Associations with FGF21 (red lines) and gamma-glutamyltyrosine (purple lines) are highlighted.
Figure 4Cholesterol, serotonin, α-diversity, IBD, and bladder cancer communities
(A) Cholesterol community (B) Serotonin community (C) α-diversity community (D) The polygenic score for inflammatory bowel disease is negatively correlated with cystine (E) The polygenic score for bladder cancer is positively correlated with 5-acetylamino-6-formylamino-3-methyluracil (AFMU).
Figure 5Polygenic scores correlate with blood analytes
Spearman correlations between polygenic scores (x-axis) and analyte measurements (y-axis) from our correlation network. The number of measurements used for each pairwise comparison, correlation coefficients, and adjusted p-values are indicated on each figure. Values have been age and/or sex adjusted as described in Online Methods. The line shown is a y~x regression line, and the shaded regions are 95% confidence intervals for the slope of the line.
Longitudinal analysis of clinical changes by round
Generalized estimating equations (GEE) were used to estimate average changes in clinical labs over time. The Δ per round is an estimate of the average change in the population for that analyte by round adjusted for age, sex, and self-reported ancestry. ‘Out-of-range at baseline’ indicates the estimates using only those participants who were out-of-range for that analyte at the beginning of the study. Green rows indicate statistically-significant improvement, while red rows indicate statistically-significant worsening. N/A values are present where no participants were out-of-range at baseline. For example, the mean improvement in vitamin D for the 95 participants that began the study out-of-range was +7.2 ng/mL per round. Several analytes are measured by both Quest and Genova; with the exception of LDL particle number, the direction of effect for significantly changed analytes was concordant across the two labs. An independence working correlation structure was used in the GEE. See Supplementary Table 10 for the complete results.
| Clinical Laboratory Test | Out-of-range at Baseline Participants | |||
|---|---|---|---|---|
| Quadrant | Name | N | Δ per round | P-value |
| Nutrition | Vitamin D | 95 | +7.2 ng/mL/round | 7.1E-25 |
| Nutrition | Mercury | 81 | −0.002 mcg/g/round | 8.9E-09 |
| Diabetes | HbA1c | 52 | −0.085 %/round | 9.2E-06 |
| Cardiovascular | LDL particle number (Quest) | 30 | +130 nmol/L/round | 9.3E-05 |
| Nutrition | Methylmalonic acid (Genova) | 3 | −0.49 mmol/mol creat/round | 2.1E-04 |
| Cardiovascular | LDL pattern (A or B) | 28 | −0.16 /round | 4.8E-04 |
| Inflammation | Interleukin-8 | 10 | −6.1 pg/mL/round | 5.9E-04 |
| Cardiovascular | Total cholesterol (Quest) | 48 | −6.4 mg/dL/round | 7.2E-04 |
| Cardiovascular | LDL cholesterol | 57 | −4.8 mg/dL/round | 8.8E-04 |
| Cardiovascular | LDL particle number (Genova) | 70 | −69 nmol/L/round | 1.2E-03 |
| Cardiovascular | Small LDL particle number (Genova) | 73 | −56 nmol/L/round | 3.5E-03 |
| Diabetes | Fasting glucose (Quest) | 45 | −1.9 mg/dL/round | 8.2E-03 |
| Cardiovascular | Total cholesterol (Genova) | 43 | −5.4 mg/dL/round | 1.2E-02 |
| Diabetes | Insulin | 16 | −2.3 IU/mL/round | 1.5E-02 |
| Inflammation | TNF-alpha | 4 | −6.6 pg/mL/round | 1.8E-02 |
| Diabetes | HOMA-IR | 19 | −0.56 /round | 2.0E-02 |
| Cardiovascular | HDL cholesterol | 5 | +4.5 mg/dL/round | 2.2E-02 |
| Nutrition | Methylmalonic acid (Quest) | 7 | −42 nmol/L/round | 5.2E-02 |
| Cardiovascular | Triglycerides (Genova) | 14 | −18 mg/dL/round | 1.4E-01 |
| Diabetes | Fasting glucose (Genova) | 47 | −0.98 mg/dL/round | 1.5E-01 |
| Nutrition | Arachidonic Acid | 35 | +0.24 wt%/round | 1.9E-01 |
| Inflammation | hs-CRP | 51 | −0.47 mcg/mL/round | 2.1E-01 |
| Cardiovascular | Triglycerides (Quest) | 17 | −14 mg/dL/round | 2.4E-01 |
| Nutrition | Glutathione | 6 | +11 micromol/L/round | 2.5E-01 |
| Nutrition | Zinc | 4 | −0.82 mcg/g/round | 3.0E-01 |
| Nutrition | Ferritin | 10 | −14 ng/mL/round | 3.1E-01 |
| Inflammation | Interleukin-6 | 4 | −1.1 pg/mL/round | 3.8E-01 |
| Cardiovascular | HDL large particle number | 8 | +210 nmol/L/round | 4.9E-01 |
| Nutrition | Copper | 10 | +0.006 mcg/g/round | 6.0E-01 |
| Nutrition | Selenium | 6 | +0.035 mcg/g/round | 6.2E-01 |
| Cardiovascular | Medium LDL particle number | 20 | +2.8 nmol/L/round | 8.5E-01 |
| Cardiovascular | Small LDL particle number (Quest) | 14 | −2.3 nmol/L/round | 8.8E-01 |
| Nutrition | Manganese | 0 | N/A | N/A |
| Nutrition | EPA | 0 | N/A | N/A |
| Nutrition | DHA | 0 | N/A | N/A |