| Literature DB >> 20834067 |
Anatoliy I Yashin1, Deqing Wu, Konstantin G Arbeev, Svetlana V Ukraintseva.
Abstract
The results of genome-wide association studies of complex traits, such as life span or age at onset of chronic disease, suggest that such traits are typically affected by a large number of small-effect alleles. Individually such alleles have little predictive values, therefore they were usually excluded from further analyses. The results of our study strongly suggest that the alleles with small individual effects on longevity may jointly influence life span so that the resulting influence can be both substantial and significant. We show that this joint influence can be described by a relatively simple "genetic dose - phenotypic response" relationship.Entities:
Mesh:
Year: 2010 PMID: 20834067 PMCID: PMC2984609 DOI: 10.18632/aging.100191
Source DB: PubMed Journal: Aging (Albany NY) ISSN: 1945-4589 Impact factor: 5.682
Figure 1.The “genetic dose - phenotypic response” relationship between the numbers of selected 169 longevity alleles contained in individuals' genome and mean life span obtained in the analyses of 550K SNP data on participants of the original FHS cohort. Regression analyses were performed using SAS PROC REG (© SAS Institute, Inc.) with correction for heteroscedasticity.
Figure 2.The “genetic dose - phenotypic response” relationship between the numbers of selected 39 longevity alleles contained in individuals' genome and mean life span obtained in the analyses of 550K SNP data on participants of the original FHS cohort. Regression analyses were performed using SAS PROC REG (© SAS Institute, Inc.) with correction for heteroscedasticity.
Figure 3.The absence of dependence between the numbers of randomly selected 39 genetic variants contained in individuals' genome and life span. These genetic variants were randomly selected from the same pool of SNPs excluding longevity alleles. Regression analyses were performed using SAS PROC REG (© SAS Institute, Inc.) with correction for heteroscedasticity.
Summary characteristics of the 39 SNPs revealed in the study and gene/protein functions for closest genes (known or suggested).
| SNP rs# | Chr # | Position | Ancestral allele | Type | Distance to gene | Closest gene | Gene full name | Gene/protein function |
|---|---|---|---|---|---|---|---|---|
| rs2031577 | 10 | 4050003 | G | INTERGENIC | -17129 | RP11-433J20.2 | H. sapiens chr 10 clone RP11-433J20 | |
| rs6489785 | 12 | 121363724 | C | INTERGENIC | -52622 | HNF1A (TCF1) | HNF1 homeobox A | liver transcription factor |
| rs3847687 | 12 | 131525053 | T | INTRONIC | 0 | GPR133 | G protein-coupled receptor 133 | transmembranic signal transduser; activates G proteins within cell |
| rs4891159 | 18 | 74101941 | G | INTRONIC | 0 | ZNF516 | zinc finger protein 516 | the part of transcription factors |
| rs10445407 | 17 | 79261809 | A | INTRONIC | 0 | SLC38A10 | solute carrier family 38, member 10 | amino acid transporter |
| rs4745062 | 9 | 73784264 | C | INTRONIC | 0 | TRPM3 | transient receptor potential channel | mediates calcium entry potentiated by calcium store depletion |
| rs2024714 | 20 | 60212494 | C | INTRONIC | 0 | CDH4 | R-cadherin (retinal) | calcium-dependent cell-cell adhesion |
| rs7315621 | 12 | 132085196 | G | INTERGENIC | -60412 | AC117500.2 | ||
| rs16975963 | 19 | 38325536 | G | NON CODING GENE | 0 | AC016582.2 | ||
| rs4732038 | 7 | 134250322 | C | INTRONIC | 0 | AKR1B15 | aldo-keto reductase family 1, member B15 | superfamily of reductases that reduce aldehydes and ketones to alcohols |
| rs2516739 | 16 | 2097158 | N/A | INTRONIC | 0 | NTHL1 | nth endonuclease III-like 1 | base excision repair; DNA N-glycosylase of the endonuclease III family |
| rs7874142 | 9 | 137704782 | A | INTRONIC | 0 | COL5A1 | collagen, type V, alpha 1 | regulates the assembly of heterotypic fibers in tissues |
| rs4468878 | 20 | 59928237 | C | INTRONIC | 0 | AL365229.1 | near CDH4 | possibly cell-cell adhesion |
| rs13008689 | 2 | 8530256 | G | INTERGENIC | -153466 | AC011747.3 | ||
| rs2273 | 4 | 76889388 | C | INTRONIC | 0 | SDAD1 | SDA1 domain containing protein | preferentially expressed in fetal tissues |
| rs2882281 | 13 | 90622455 | C | INTERGENIC | -21630 | RP11-388D4.1 | locus tag for a pseudogene | |
| rs2282032 | 14 | 90758891 | G | INTRONIC | 0 | C14orf102 | chromosome 14 open reading frame 102 | |
| rs9876781 | 3 | 48487338 | A | NON CODING GENE | 0 | RP11-24C3.2 | ||
| rs6568433 | 6 | 106829537 | C | INTERGENIC | -39044 | AL109920.3 | ||
| rs9517320 | 13 | 99126303 | A | INTRONIC | 0 | STK24 | serine/threonine kinase 24 | participates in the mitogen-activated protein kinase (MAPK) cascade |
| rs4148546 | 13 | 95680285 | G | INTRONIC | 0 | ABCC4 | ATP-binding cassette, sub-family C (CFTR/MRP) | ATP-binding cassette (ABC) transporter |
| rs9592783 | 13 | 71883214 | G | INTERGENIC | -128884 | DACH1 | dachshund homolog 1 (Drosophila) | a chromatin-associated protein that regulates gene expression and cell fate; highly conserved |
| rs739401 | 11 | 3036324 | T | INTRONIC | 0 | CARS | cysteinyl-tRNA synthetase | catalyzes the aminoacylation of a tRNA; |
| rs10256972 | 7 | 1039003 | C | INTRONIC | 0 | C7orf50 | chromosome 7 open reading frame 50 | |
| rs3212335 | 15 | 27012141 | C | INTRONIC | 0 | GABRB3 | GABA A receptor, beta | ionic channel family that serves as the receptor for GABA; may be associated with memory |
| rs6915183 | 6 | 166706169 | G | INTERGENIC | -12999 | PRR18 | proline rich 18 | |
| rs4721135 | 7 | 1912222 | G | INTRONIC | 0 | MAD1L1 | MAD1 mitotic arrest deficient-like 1 | component of the mitotic spindle-assembly checkpoint |
| rs3106598 | 13 | 61678912 | G | INTERGENIC | -304909 | PCDH20 | protocadherin 20 | transmembrane receptor, a role in specific cell-cell connections in the brain |
| rs1356888 | 2 | 50516018 | C | INTRONIC | 0 | NRXN1 | cell adhesion in nervous system | |
| rs9616906 | 22 | 51104680 | G | UPSTREAM | -3552 | AC000050.2 | ||
| rs13053175 | 22 | 37613309 | T | UPSTREAM | -7992 | RAC2 | ras-related C3 botulinum toxin substrate 2 | GTPase of the RAS superfamily regulating cell growth, cytoskelet, and the protein kinases activation |
| rs5766691 | 22 | 47532396 | G | INTRONIC | 0 | TBC1D22A | TBC1 domain family | |
| rs13118159 | 4 | 1365127 | N/A | INTRONIC | 0 | RP11-1244E8.1 | ||
| rs7168365 | 15 | 53805825 | C | DOWNSTREAM | -113 | WDR72 | WD repeat domain 72 | |
| rs7493138 | 14 | 29021928 | C | INTERGENIC | -213122 | FOXG1 | forkhead box G1 | transcription factors |
| rs432203 | 2 | 70764688 | A | INTRONIC | 0 | TGFA | transforming growth factor, alpha | competes with EGF for binding to the EGF receptor |
| rs6813479 | 4 | 137660383 | A | INTERGENIC | -57494 | RP11-138I17.1 | ||
| rs1327533 | 9 | 113131163 | T | INTRONIC | 0 | SVEP1 | EGF and pentraxin domain containing 1 | |
| rs2826891 | 21 | 22910116 | T | INTRONIC | 0 | NCAM2 | neural cell adhesion molecule 2 | brain protein, superfamily of the immunoglobulin |
*Enrichment with genes related to cell-cell adhesion can be noticed. Since cell-cell adhesion proteins play crucial role in cell sensitivity to contact inhibition and because insensitivity to contact inhibition is critical for cancer development, especially for manifestation of invasion and metastasis, we speculate that this enrichment might potentially be linked to a higher resistance to cancer among long-living individuals.
| # | N1 | N2 | N1SNP | N2SNP | α1 | α1* | α2 | α2* |
|---|---|---|---|---|---|---|---|---|
| 1 | 661 | 512 | 52 | 8 | 0.30 | 0.26 | 0.14 | 0.16 |
| 2 | 689 | 484 | 40 | 9 | 0.42 | 0.35 | 0.21 | 0.23 |
| 3 | 627 | 546 | 18 | 43 | 0.87 | 0.80 | 0.28 | 0.22 |
| 4 | 677 | 496 | 20 | 25 | 0.67 | 0.63 | 0.56 | 0.49 |
| 5 | 680 | 493 | 34 | 16 | 0.47 | 0.41 | 0.33 | 0.31 |
| 6 | 630 | 543 | 32 | 22 | 0.48 | 0.39 | 0.46 | 0.48 |
| 7 | 631 | 542 | 43 | 15 | 0.40 | 0.31 | 0.22 | 0.27 |
| 8 | 658 | 515 | 14 | 39 | 0.86 | 0.99 | 0.33 | 0.25 |
| 9 | 647 | 526 | 31 | 18 | 0.48 | 0.38 | 0.38 | 0.42 |
| 10 | 672 | 501 | 37 | 10 | 0.44 | 0.37 | 0.24 | 0.27 |
The results of 10 experiments in which genetic variants individually affecting life span (longevity SNPs) were selected twice using data on two populations representing genetically independent genotyped individuals in the original Framingham Heart Study (FHS) cohort for whom life span data are available. The longevity SNPs selected from data on the first population were used for evaluating linear “genetic dose” - “life span response” relationship on the same population, as well as on the second population of individuals. In turn, longevity SNPs selected from data on the second population were used for evaluating linear “genetic dose” - “life span response? relationship on the same population, as well as on the first population of individuals. Column “#” shows experiment's number. Columns N1 and N2 show the number of individuals in the first and in the second (genetically independent) populations. Columns N1SNP and N2SNP show the number of longevity SNPs selected using data on the first (original) and on the second (rest) populations respectively. Column α1 shows the estimate of the slope of the regression line describing dependence of life span on the number of longevity SNPs contained in the genomes of individuals from the first population. Column α1* shows the estimate of the slope of the regression line describing dependence between life span and the number of selected longevity SNPs contained in genomes of individuals from the second (independent) population. The estimates α1 and α1* use SNPs selected in the analyses of connection between SNPs and life span in the first (original) population. Column α2 shows the estimate of the slope of the regression line describing dependence of life span on the number of longevity SNPs contained in the genomes of individuals from the second population. Column α2* shows the estimate of the slope of the regression line describing dependence between life span and the number of selected longevity SNPs contained in genomes of individuals from the first population. The estimates α2 and α2* use SNPs selected in the analyses of connection between genes and life span in the second (rest) population. All four estimates are highly significant (p<1×10-10).