| Literature DB >> 34099068 |
Liron Ganel1,2, Lei Chen1,2, Ryan Christ1, Jagadish Vangipurapu3, Erica Young1,4, Indraniel Das1, Krishna Kanchi1, David Larson1,5, Allison Regier1,2, Haley Abel1,2,5, Chul Joo Kang1, Alexandra Scott1,2, Aki Havulinna6,7, Charleston W K Chiang8,9, Susan Service10, Nelson Freimer10, Aarno Palotie6,11,12, Samuli Ripatti6,12,13, Johanna Kuusisto3,14, Michael Boehnke15, Markku Laakso3,14, Adam Locke1,2, Nathan O Stitziel16,17,18, Ira M Hall19,20,21.
Abstract
BACKGROUND: Mitochondrial genome copy number (MT-CN) varies among humans and across tissues and is highly heritable, but its causes and consequences are not well understood. When measured by bulk DNA sequencing in blood, MT-CN may reflect a combination of the number of mitochondria per cell and cell-type composition. Here, we studied MT-CN variation in blood-derived DNA from 19184 Finnish individuals using a combination of genome (N = 4163) and exome sequencing (N = 19034) data as well as imputed genotypes (N = 17718).Entities:
Keywords: Genome-wide association studies; Human genetics; Human genome sequencing; Mendelian randomization; Metabolic syndrome; Mitochondrial content
Mesh:
Substances:
Year: 2021 PMID: 34099068 PMCID: PMC8185936 DOI: 10.1186/s40246-021-00335-2
Source DB: PubMed Journal: Hum Genomics ISSN: 1473-9542 Impact factor: 6.481
Fig. 1Cardiometabolic trait associations with MT-CN in WGS data. A Phenome-wide association study of normalized MT-CN against 137 cardiometabolic traits in the 4163 sample data set. Traits are grouped into 17 categories, represented by the color of each bar. The top three most significant traits are, in order, fasting serum insulin, C-reactive protein, and fat mass. Exact P values and effect estimates, as calculated by EMMAX, are listed in Supplementary Table S1 (Additional File ). B Association tests between normalized MT-CN and both fat mass and fasting serum insulin using WGS data (N = 4163). Results are shown for the EMMAX test and a permutation test in which mitochondrial haplogroups were adjusted for
Associations of normalized MT-CN with disposition index and Matsuda ISI in METSIM. Testing was done by linear regression using disposition index and Matsuda ISI, respectively, as the dependent variable. P* columns represent the P value from linear regression with additional adjustment for fat mass. Follow-up measurements were taken at a later time point.
| Baseline | Follow-up | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Number | Beta | SE | P | P* | Number | Beta | SE | P | P* | |
| | 2975 | 0.094 | 0.004 | 3.0 x 10-7 | 0.0004 | 2492 | 0.062 | 0.004 | 0.002 | 0.068 |
| | 2842 | 0.091 | 0.003 | 1.3 x 10-6 | 0.0007 | 2452 | 0.067 | 0.004 | 0.0009 | 0.041 |
| | 2453 | 0.069 | 0.003 | 0.0007 | 0.023 | 2449 | 0.067 | 0.004 | 0.0009 | 0.042 |
| | 2975 | 0.192 | 0.005 | 4.3 x 10-26 | 7.3 x 10-17 | 2492 | 0.157 | 0.006 | 3.7 x 10-15 | 8.7 x 10-10 |
| | 2842 | 0.191 | 0.005 | 1.0 x 10-24 | 2.4 x 10-16 | 2452 | 0.161 | 0.006 | 1.3 x 10-15 | 3.0 x 10-10 |
| | 2453 | 0.173 | 0.005 | 7.2 x 10-18 | 5.3 x 10-12 | 2449 | 0.16 | 0.006 | 1.7 x 10-15 | 3.8 x 10-10 |
GREML heritability estimates in each cohort separately and in joint analysis. All analyses in this table were limited to sample sets with available imputed genotype data, yielding slightly lower sample sizes than in other tables.
| WGS-measured MT-CN | WES-measured MT-CN | ||||||
|---|---|---|---|---|---|---|---|
| Number | h2 | SE | Number | h2 | SE | ||
| 4149 | 0.17 | 0.06 | 3916 | 0.11 | 0.06 | ||
| 3916 | 0.16 | 0.06 | 17718 | 0.09 | 0.02 | ||
| 3065 | 0.31 | 0.07 | 2974 | 0.20 | 0.08 | ||
| 2974 | 0.27 | 0.08 | 9791 | 0.11 | 0.03 | ||
| 1084 | 0.20 | 0.22 | 942 | 0.24 | 0.27 | ||
| 942 | 0.35 | 0.27 | 7927 | 0.08 | 0.03 | ||
GREML and GREML-LDMS heritability estimates for normalized MT-CN and low-density lipoprotein (LDL) in METSIM. GREML-LDMS heritability estimates are calculated using PCs 1–10 as fixed-effect covariates. Analyses of imputed array data exclude samples with WGS data.
| Trait | Genotype source (phenotype source) | Number | GREML | GREML-LDMS | ||
|---|---|---|---|---|---|---|
| h | SE | h | SE | |||
| 3065 | 0.31 | 0.07 | 0.31 | 0.09 | ||
| 6789 | 0.11 | 0.04 | 0.14 | 0.05 | ||
| 3062 | 0.34 | 0.08 | 0.38 | 0.10 | ||
| 6787 | 0.25 | 0.04 | 0.32 | 0.05 | ||
Fig. 2Single-marker genetic associations with MT-CN in imputed array data. A Manhattan plot for a genome-wide association test of normalized, WES-measured MT-CN using imputed array genotype data from METSIM (N = 9791). Two loci markers reached the genome-wide significance of 5 × 10-8, identified by lead markers rs2288464 and rs9389268. B Quantile–quantile (QQ) plot for the association test shown in A. This plot is separated by minor allele frequency bin, as indicated by the colors and shapes of the points. C Boxplot showing the distributions of normalized WES-measured MT-CN in METSIM separated by the number of rs9389268 alternate alleles as detected by imputed array genotyping (N = 9791). The EMMAX P value for this variant was 1.62 × 10-8 in imputed data
Single marker association results for rs2288464 and rs9389268. Analyses of imputed FINRISK array data were performed with covariates for FINRISK genotyping batch. Bolded results are significant at the appropriate threshold for the given test (see Methods).
| FINRISK | METSIM | Joint analysis | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Number | MAF | P | Beta | Number | MAF | P | Beta | Number | MAF | P | Beta | ||
| 7927 | 0.148 | 0.613 | 0.0113 | 9791 | 0.165 | 0.119 | 17718 | 0.158 | 9.77 × 10-7 | 0.075 | |||
| 9221 | 0.150 | 0.376 | 0.0186 | 9813 | 0.166 | 0.118 | 19034 | 0.158 | 9.34 × 10-7 | 0.0734 | |||
| 1084 | 0.142 | 0.383 | 0.0532 | 3065 | 0.161 | 0.113 | 0.0561 | 4149 | 0.156 | 0.0655 | 0.0562 | ||
| 7927 | 0.354 | 0.189 | 0.0216 | 9791 | 0.347 | 0.0973 | 17718 | 0.35 | 0.0634 | ||||
| 1084 | 0.351 | 0.788 | 0.0121 | 3065 | 0.347 | 0.150 | 4149 | 0.348 | 7.87 × 10-7 | 0.115 | |||
Fig. 3Gene-based associations with MT-CN in WES data. A Manhattan plot and quantile–quantile (QQ) plot for a gene-based rare variant association test of normalized, WES-measured MT-CN using WES data from both METSIM and FINRISK (N = 19034). The red line represents a Bonferroni significance level of 2.164 × 10-6, as 23105 genes were included in this test. TMBIM1 is the only gene to reach significance at this level. B QQ plot for the test shown in A. This plot is separated by minor allele frequency bin, as indicated by the colors and shapes of the points. C Boxplot showing the distributions of normalized WES-measured MT-CN, separated by the number of WES-detected alternate alleles in TMBIM1 with MAF < 0.01 and CADD score > 20 (N = 19034)
Gene-based rare variant association results for . TMBIM1 was the only genome-wide significant gene in the WES rare-variant association tests of METSIM and the whole dataset. Bolded results are significant at the appropriate threshold for the given test (see Methods).
| FINRISK | METSIM | Joint analysis | |||||||
|---|---|---|---|---|---|---|---|---|---|
| Number | Fraction with rare allele | P | Number | Fraction with rare allele | P | Number | Fraction with rare allele | P | |
| 9221 | 0.016 | 1.57 × 10-3 | 9813 | 0.013 | 19034 | 0.014 | |||
| 1084 | 0.028 | 0.489 | 3065 | 0.014 | 0.01 | 4149 | 0.013 | 0.01 | |
Fig. 4Mendelian randomization approach and results. A Formulation of the Mendelian randomization causality test. G represents genotypes, Z is a genetic instrument value constructed from G, X represents ln(MT-CN), Y represents ln(Insulin), and U represents any confounders of the association between X and Y. The arrow from X to Y is dashed to indicate that although an association is known, the relationship is not known to be causal. In this formulation, a significant association between Z and Y would provide evidence that X is casual for Y. B Strategy for choosing variables to adjust for when building Z in order to enforce MR assumptions. A represents those columns of covariate matrix W that are associated with Y (represented by the solid line between A and Y), and B represents those columns of matrix W that are associated with X conditional on A (represented by the solid line between B and X). Dashed lines represent possible, but unproven associations. The penalized regression of X on G used to build Z is adjusted for A and B (with no penalty) in an attempt to prevent any associations between Z and either A or B (represented by the blue X’s). While an association between B and Y is unlikely (represented by the dashed line between B and Y) because B is not contained in A, B is still adjusted for in the penalized regression to be as conservative as possible. C Strategy for choosing covariates to adjust for in the causality test of Y against Z in another attempt to reduce the impact of any remaining associations between Z and assumption-violating variables. Covariate sets I, II, III, and IV are defined by the presence of known first-order associations (represented by black lines) with Z and Y (see Methods). Yellow lines represent relationships where a first-order association is not known, but a higher-order association is possible. Covariate sets II, III, and IV (colored blue) are adjusted for in the causality test because there is at least one first-order association linking them to Z and Y, so they risk violating MR assumptions 2 or 3. D Results of Mendelian randomization test for causality of MT-CN on fasting serum insulin
Association results between blood cell count traits and MT-CN polygenic risk score in 357656 UK Biobank samples. β refers to the regression coefficient of MT-CN in a linear regression of cell type onto MT-CN PRS and other covariates (see Methods). Bolded results are significant below a nominal α = 0.05.
| Cell type | All samples | No | ||||
|---|---|---|---|---|---|---|
| β | SE | P | β | SE | P | |
| − 0.00856 | 0.00170 | - | - | - | ||
| − 0.00119 | 0.00170 | 0.482 | - | - | - | |
| − 0.00250 | 0.00170 | 0.142 | - | - | - | |
| − 0.00954 | 0.00169 | − 0.00948 | 0.00169 | |||
| 0.00548 | 0.00170 | 0.00548 | 0.00170 | |||
Association results between metabolic syndrome traits and MT-CN polygenic risk score in 357656 UK Biobank samples. β refers to the regression coefficient of MT-CN in a linear regression of cell type onto MT-CN PRS and other covariates (see Methods). Bolded results are significant below a nominal α = 0.05. The weight phenotype tested was that which was measured at the time of impedance measurement.
| Trait | Without platelet and neutrophil adjustment | With platelet and neutrophil adjustment | ||||
|---|---|---|---|---|---|---|
| β | SE | P | β | SE | P | |
| − 0.1681 | 0.0826 | − 0.1467 | 0.0844 | 0.0823 | ||
| − 0.0372 | 0.0175 | − 0.0233 | 0.0175 | 0.1836 | ||
| − 0.0452 | 0.0176 | − 0.0318 | 0.0177 | 0.0716 | ||
| − 0.0015 | 0.0178 | 0.9305 | 0.0206 | 0.0171 | 0.2291 | |
| 0.0573 | 0.0187 | 0.0416 | 0.0187 | |||
| − 0.0500 | 0.0176 | − 0.0330 | 0.0175 | 0.0603 | ||
| − 0.0359 | 0.0176 | − 0.0218 | 0.0178 | 0.2189 | ||