Literature DB >> 25333069

Disease variants in genomes of 44 centenarians.

Yun Freudenberg-Hua1, Jan Freudenberg2, Vladimir Vacic3, Avinash Abhyankar3, Anne-Katrin Emde3, Danny Ben-Avraham4, Nir Barzilai4, Dayna Oschwald3, Erika Christen5, Jeremy Koppel1, Blaine Greenwald6, Robert B Darnell7, Soren Germer3, Gil Atzmon4, Peter Davies5.   

Abstract

To identify previously reported disease mutations that are compatible with extraordinary longevity, we screened the coding regions of the genomes of 44 Ashkenazi Jewish centenarians. Individual genome sequences were generated with 30× coverage on the Illumina HiSeq 2000 and single-nucleotide variants were called with the genome analysis toolkit (GATK). We identified 130 coding variants that were annotated as "pathogenic" or "likely pathogenic" based on the ClinVar database and that are infrequent in the general population. These variants were previously reported to cause a wide range of degenerative, neoplastic, and cardiac diseases with autosomal dominant, autosomal recessive, and X-linked inheritance. Several of these variants are located in genes that harbor actionable incidental findings, according to the recommendations of the American College of Medical Genetics. In addition, we found risk variants for late-onset neurodegenerative diseases, such as the APOE ε4 allele that was even present in a homozygous state in one centenarian who did not develop Alzheimer's disease. Our data demonstrate that the incidental finding of certain reported disease variants in an individual genome may not preclude an extraordinarily long life. When the observed variants are encountered in the context of clinical sequencing, it is thus important to exercise caution in justifying clinical decisions.

Entities:  

Keywords:  Aging; Ashkenazi; centenarian; disease gene; incidental finding; whole genome sequencing

Year:  2014        PMID: 25333069      PMCID: PMC4190879          DOI: 10.1002/mgg3.86

Source DB:  PubMed          Journal:  Mol Genet Genomic Med        ISSN: 2324-9269            Impact factor:   2.183


Introduction

Human genetic studies have linked many variants to human diseases or nondisease phenotypes. How to handle the incidental finding of a disease variant is a topic of current discussion (Green et al. 2013a; Klitzman et al. 2013). Incidental findings often occur, when genome sequencing data are screened for disease-causing variants that are recorded in databases such as Online Mendelian Inheritance in Man (OMIM) (Hamosh et al. 2005) or, more recently, ClinVar (Landrum et al. 2014) and the Genome Wide Association Studies (GWAS) catalog (Hindorff et al. 2009). However, the penetrance of these recorded variants spans a broad spectrum, ranging from complete penetrance for a set of monogenic mutations to the very small effect sizes of many GWAS hits. While it is widely known that most GWAS hits have only limited clinical prognostic relevance, the penetrance and prognostic value of many previously reported monogenic mutations is less clear. Recently, a significant percentage of such putative “disease mutations” were claimed to be of questionable pathogenicity (Cassa et al. 2013; Dorschner et al. 2013; Flannick et al. 2013; Kenna et al. 2013; Riggs et al. 2013). Accordingly, it is an important aim of current human genetic research to systematically assess the clinical significance of genetic variants (Duzkale et al. 2013). As one way to identify those mutations that may require reinterpretation, we use a sample of individuals with exceptional longevity to identify reported disease mutations for which the pathogenicity status should be preferentially reevaluated. If a mutation for a dominant late-onset disease is encountered in the genome of a younger individual, a reduced penetrance is difficult to assign without longitudinal follow-up. In addition to the uncertain significance of many reported mutations for dominant diseases, the traditional assumption that heterozygous carriers for recessive disease alleles are unaffected (Nussbaum et al. 2007) may not hold true. For instance, Gaucher's disease alleles in the GBA gene have been found to strongly increase the risk of developing Parkinson's disease (Sidransky et al. 2009) as well as dementia with Lewy bodies (Nalls et al. 2013). Similarly, heterozygous carriers for autosomal recessive disease alleles for Alport syndromes have increased risk of renal failure (Temme et al. 2012). Therefore, different lines of evidence need to be integrated to assign a clinical significance score to each variant. For mutations previously linked to dominant diseases that negatively affect life expectancy, their observation in centenarian genomes may be viewed as evidence for unclear pathogenicity or at least incomplete penetrance. To a certain extent, reduced penetrance could result from a variety of protective genetic, epigenetic, environmental, and random factors (Cooper et al. 2013) or other buffering mechanisms (Bergman et al. 2007). In the present study, we search the coding regions of 44 Ashkenazi Jewish (AJ) centenarians for variants that were previously reported as causal mutations for medically relevant phenotypes. To this end, we use the recently established and publicly accessible ClinVar database (Riggs et al. 2013) which not only provides highly structured data access but also includes community-based data curation (Landrum et al. 2014). We chose ClinVar because of the quality of its content. Our study is meant to be part of the community effort to further improve its quality, which is eased by its free availability. Such databases are gaining in importance, because the increased utilization of next-generation sequencing technology magnifies the challenge to interpret genetic testing results (Lyon and Wang 2012; Lohn et al. 2013; Manolio et al. 2013; Rehm et al. 2013). For example, in August 2013 the ClinVar database recorded 14,746 variants classified as pathogenic and 1672 as probably pathogenic. We focus on the description of those variants that have been previously reported as Mendelian disease mutations that would increase the risk of mortality, such as degenerative diseases, neoplasm, and cardiac diseases. The observation of previously reported disease variants in centenarian genomes may aid clinical geneticists who are confronted with the challenge to evaluate their pathogenicity.

Materials and Methods

Study population

DNA samples of 44 AJ centenarians were collected as part of a longevity study at the Albert Einstein College of Medicine that was described elsewhere (Barzilai et al. 2003). The sample includes eight male (95–103 years old) and 36 female (95–106 years old) subjects. The mean and median age of the subjects was 99.6 and 100 years (standard deviation σ = 3.1). Informed written consent was obtained in accordance with the policy of the Committee on Clinical Investigations of the Albert Einstein College of Medicine. Genomic DNA extracted from blood cells using standard procedure was sent for sequencing. Mini-mental status examinations (MMSE) (Folstein et al. 1975) and selected health status information such as hearing, vision, and whether a subject had a history of cancer or myocardial infarction (MI) are available for the majority of the centenarians. The level of education was not available. Given that median MMSE scores are found to be a function of age and level of education (Crum et al. 1993; Bravo and Hebert 1997) and can be as low as 20 for people 85 years and older with <4 years of education (Crum et al. 1993), we use the reference score of ≥24 (one standard deviation of the mean MMSE score for people 85 and older; Bravo and Hebert 1997) to exclude the presence of dementia in our sample. To our knowledge, no reference MMSE scores specifically for centenarians are available.

Genome sequencing and variant calling

Whole genome sequencing was performed on Illumina HiSeq 2000 platform (Illumina, San Diego, CA). Paired-end 2 × 100 reads were aligned to the GRCh37 human reference using the Burrows-Wheeler Aligner (BWA) (Li and Durbin 2009) and processed using the best-practices pipeline that includes marking of duplicate reads by the use of Picard tools (http://picard.sourceforge.net), and realignment around DNA insertions and deletions (indels) and base recalibration via Genome Analysis Toolkit (GATK) Lite ver. 2.3-9 (McKenna et al. 2010). Single-nucleotide variants (SNVs) were jointly called on the 44 unrelated AJ centenarians together with HapMap trio NA12877, NA12878, and NA12882 using the UnifiedGenotyper module of GATK Lite. The HapMap trio was sequenced at Illumina and released as part of the PlatinumGenomes project (http://www.illumina.com/platinumgenomes). We used variant quality score recalibration and Mendelian inconsistencies on the HapMap trio to determine the optimal variant quality score (VQSLOD) cutoff. Joint calling for transcribed gene regions of 44 AJ centenarians identified a total of 6.3 million genetic variants. A total of 87,899 coding SNVs were found (100% call rate) in coding regions defined by the longest transcript according to the UCSC hg19 database (Meyer et al. 2012). For variants with a VQSLOD score >2, the concordance rate for SNVs between Illumina 2.5M chip and GATK variant calls was above 99.7% for all subjects. Due to the comparatively lower performance of the UnifiedGenotyper in calling short insertions and deletions, in the present study we decided not to analyze insertion/deletion variants.

Database annotation of variants

Using custom scripts and the SNP & Variation Suite software (Version 7.7.8; Golden Helix, Inc., Bozeman, MT, http://www.goldenhelix.com), we annotated coding and splice variants based on the knownGene and the refGene tracks in the UCSC hg19 database (Meyer et al. 2012). To find variants of potential clinical relevance, variants were evaluated for their possible pathogenicity based on the August 2013 version of the ClinVar database (http://www.ncbi.nlm.nih.gov/clinvar/) (Riggs et al. 2013). This database provides free highly structured public access to clinically relevant sequence variants and it also provides evidence-based interpretation of clinical significance for each variant (Landrum et al. 2014). All ClinVar accession numbers for the identified variants are linked to the respective NCBI (The National Center for Biotechnology Information) database for reference sequences (RefSeq) as well as OMIM accession numbers. Population allele frequencies of variants were retrieved from the NHLBI Exome Sequencing Project (ESP) database (Exome Variant Server, NHLBI GO ESP, Seattle, WA) (http://evs.gs.washington.edu/EVS). To our knowledge, a similar large-scale population allele frequency data specific for the AJ population are currently not available.

Quality control filtering of putative disease variants

A total of 225 autosomal and seven X-chromosomal coding SNVs were found to be annotated as “pathogenic” or “likely pathogenic” in the ClinVar database. Eighteen of these variants had a VQSLOD score <4, most often due to poor mapping quality indicating genomic segmental duplications. However, genomic regions that are difficult to map are also known to produce false-negative variant calls (Lee and Schatz 2012). Therefore, we performed manual evaluation of the 18 variants. These include two mutations in the gene GBA that are known to cause Gaucher disease as well as being associated with late-onset Parkinson's disease (most frequently referred to in the literature as L444P [uc001fjh.2:c.T1449C:p.L483P] and N370S [uc001fjh.2:c.A1227G:p.N409S]). Both mutations are known to be relatively frequent in the AJ population (Scott et al. 2010). Using BLAT (Kent 2002), N409S (rs76763715) was uniquely mapped to chromosome 1:155205634 and L483P (rs421016) was uniquely mapped to chr1:155205043. Thus, both variants are likely to be true SNVs in our sample and were therefore retained. The remaining 16 variants with low VQSLOD scores were removed.

Results

In the coding regions of 44 centenarian genomes, we found 210 autosomal and six X-chromosomal SNVs that passed quality control. Among these, 207 variants were classified as pathogenic and nine as likely pathogenic based on the ClinVar database. Excluding the two APOE risk variants, 86 of these SNVs were found to be common defined as minor allele frequency (MAF) of 5% or higher in either the European or African American populations in the ESP database (Table S1). Due to their high allele frequency in the general population, these 86 variants are considered unlikely to be strongly pathogenic with high penetrance and therefore not further discussed. According to the OMIM database, of the remaining 130 variants (Tables 5 and S2), 39 were reported to cause autosomal dominant diseases, 72 to cause autosomal recessive diseases, 6 were associated with X-chromosomal inheritance, and 13 with other modes of inheritance such as digenic, imprinting, complex, or unclear mode of inheritance.
Table 5

Putative disease mutations reported to cause X-chromosomal diseases

CHRPOSIDREFALTALT_ACALT_AC (male)GeneEffectCLNDBNCLNACCOMIM
X31496398rs1800279TC41DMDuc004dda.1:c.A8763G:p.H2921RBecker muscular dystrophyRCV000012020300377
X31496426rs1800278TC11DMDuc004dda.1:c.A8735G:p.N2912DDuchenne muscular dystrophyRCV000012019300377
X31496431rs41305353TA11DMDuc004dda.1:c.A8730T:p.E2910VDuchenne muscular dystrophyRCV000012018300377
X84563194rs75398746CT20POF1Buc004eer.2:c.G987A:p.R329QPremature ovarian failure 2bRCV000011541300603
X105278361rs1804495CA111SERPINA7uc004eme.1:c.G910T:p.L304FThyroxine-binding globulin variant pRCV000010442314200

CHR, chromosome based on GRCh37; POS, position on chromosome; ID, dbSNP ID; REF, reference sequence; ALT, alternative sequence; ALT_AC, nonreference allele counts; Effect, position of nucleotide change on UCSC transcript and amino acid change; CLNDBN and CLNACC, selected disease condition and accession ID from ClinVar; OMIM, OMIM accession numbers for the genes.

Putative disease variants reported to cause degenerative diseases CHR, chromosome; POS, position on chromosome based on GRCh37; ID, dbSNP ID; REF, reference sequence; ALT, alternative sequence; AC, counts of subjects with homozygous reference alleles/heterozygous/homozygous nonreference alleles; Effect, position of nucleotide change on UCSC transcript and amino acid change; CLNDBN and CLNACC, selected disease condition and accession ID from ClinVar; OMIM, OMIM accession numbers for the genes. Putative disease variants reported to cause neoplastic diseases CHR, chromosome; POS, position on chromosome based on GRCh37; ID, dbSNP ID; REF, reference sequence; ALT, alternative sequence; AC, counts of subjects with homozygous reference alleles/heterozygous/homozygous nonreference alleles; Effect, position of nucleotide change on UCSC transcript and amino acid change; CLNDBN and CLNACC, selected disease condition and accession ID from ClinVar; OMIM, OMIM accession numbers for the genes. Putative disease variants reported to cause autosomal dominant cardiac diseases CHR, chromosome; POS, position on chromosome based on GRCh37; ID, dbSNP ID; REF, reference sequence; ALT, alternative sequence; AC, counts of subjects with homozygous reference alleles/heterozygous/homozygous nonreference alleles; Effect, position of nucleotide change on UCSC transcript and amino acid change; CLNDBN and CLNACC, selected disease condition and accession ID from ClinVar; OMIM, OMIM accession numbers for the genes. Putative disease variants reported to cause other autosomal dominant diseases CHR, chromosome; POS, position on chromosome based on GRCh37; ID, dbSNP ID; REF, reference sequence; ALT, alternative sequence; AC, counts of subjects with homozygous reference alleles/heterozygous/homozygous nonreference alleles; Effect, position of nucleotide change on UCSC transcript and amino acid change; CLNDBN and CLNACC, selected disease condition and accession ID from ClinVar; OMIM, OMIM accession numbers for the genes. Putative disease mutations reported to cause X-chromosomal diseases CHR, chromosome based on GRCh37; POS, position on chromosome; ID, dbSNP ID; REF, reference sequence; ALT, alternative sequence; ALT_AC, nonreference allele counts; Effect, position of nucleotide change on UCSC transcript and amino acid change; CLNDBN and CLNACC, selected disease condition and accession ID from ClinVar; OMIM, OMIM accession numbers for the genes.

Variants reported as causal for degenerative diseases of advanced age

Variants associated with age-related degenerative diseases were found in the genes APOE, GBA, UBQLN2, SEMA4A, RP1, MYO1A, CYP1B1, OPTN, VSX1, and WDR36 (Table 1).
Table 1

Putative disease variants reported to cause degenerative diseases

CHRPOSIDREFALTACGeneEffectCLNDBNCLNACCOMIM
1155205043rs421016AG43/1/0GBAuc001fjh.2:c.T1449C:p.L483PGaucher's disease; late-onset Parkinson's disease; susceptibility to dementia with Lewy bodyRCV000004511; RCV000004512606463
1155205634rs76763715TC43/1/0GBAuc001fjh.2:c.A1227G:p.N409SGaucher's disease; late-onset Parkinson's disease; susceptibility to dementia with Lewy bodyRCV000004515; RCV000004516606463
1156146640rs41265017GA34/10/0SEMA4Auc001fnm.2:c.G2139A:p.R713QRetinitis pigmentosa 35RCV000003528607292
238298394rs79204362CT41/3/0CYP1B1uc002rqo.2:c.G1104Ap.R368HGlaucoma early-onset digenicRCV000008178601771
5110441839rs35703638GA43/1/0WDR36uc003kpd.2:c.G1346A:p.A449TGlaucoma 1, open angleRCV000001649609669
5110454719rs34595252AG43/1/0WDR36uc003kpd.2:c.A1974G:p.D658GGlaucoma 1, open angleRCV000001647609669
855537560rs77775126CT43/1/0RP1uc003xsd.1 :c.C1119T:p.T373lRetinitis pigmentosa 1RCV000006334603937
1013178766rs75654767GA43/1/0OPTNuc001ilx1:c.G1635A:p.R545QGlaucoma 1, open angleRCV000007515602432
1257431402rs33962952CT32/10/2MY01Auc001smw.3:c.G1986Ap.G662EDeafness, autosomal dominant 48RCV000008627601478
1257437119rs55679042CT43/1/0MY01Auc001 smw.3:c.G917A:p.V306MDeafness, autosomal dominant 48RCV000008625601478
1945411941rs429358TC39/4/1APOEuc002pab.2:c.T389C:p.C130RHyperlipoproteinemia, type 3, autosomal dominant; Alzheimer's disease associated with APOE4 variantRCV000019438; RCV000019448107741
1945412079rs7412CT33/11/0APOEuc002pab.2:c.C527T:p.R176CFamilial type 3 hyperlipoproteinemia, associated with APOE2RCV000019428107741
2025060096rs74315433CT43/1/0VSX1uc002wuf.2:c.G480Ap.G160DKeratoconus 1RCV000024251605020
X56591879rs369947678cT43/1/0UBQLN2uc004dus.2:c.C1574T:p.P525SAmyotrophic lateral sclerosis 15 with or without frontotemporal dementiaRCV000022846300264

CHR, chromosome; POS, position on chromosome based on GRCh37; ID, dbSNP ID; REF, reference sequence; ALT, alternative sequence; AC, counts of subjects with homozygous reference alleles/heterozygous/homozygous nonreference alleles; Effect, position of nucleotide change on UCSC transcript and amino acid change; CLNDBN and CLNACC, selected disease condition and accession ID from ClinVar; OMIM, OMIM accession numbers for the genes.

Given the important role of APOE variants in late-onset Alzheimer's disease, we first looked at the APOE ε4 allele as defined by the ancestral alleles rs429358-C and rs7412-C (130Arg and 176Arg) and APOE ε2 allele as defined by rs429358-T and rs7412-T (130Cys and 176Cys). In our centenarian sample, the allele frequencies for APOE ε4, ε3, and ε2 were 6.8%, 80.7%, and 12.5%, respectively. One of the two ε2/ε4 and one of the two ε3/ε4 heterozygous carriers were found to have advanced dementia of unknown etiology and the other two subjects were cognitively intact. Notably, the centenarian homozygous for the ε4 allele had an MMSE score of 25 at age 97. Neither this centenarian nor any other subject in our study carried the protective variant A673T in the APP gene (Jonsson et al. 2012). We also observed one carrier for each of the known GBA mutations L483P (also known as L444P) (rs421016) and N409S (N370S) (rs76763715), which is in line with the reported frequency of 3% for either mutation in AJ controls (Sidransky et al. 2009). On the X chromosome we identified a female centenarian with a previously described mutation in UBQLN2 P525S (Deng et al. 2011), which was found to cause an X-linked dominant type of familial amyotrophic lateral sclerosis (ALS) and ALS/dementia with an estimated penetrance of approximately 90% (Table 1). This subject had normal MMSE (score = 28) at age 102 without any neurological symptoms. Furthermore, disease alleles were observed for other autosomal dominantly inherited degenerative diseases of sensory function, including retinitis pigmentosa (SEMA4A, RP1), deafness (MYO1A), glaucoma (CYP1B1, OPTN, WDR36), and keratoconus (VSX1). Of note, none of the 10 centenarians carrying the SEMA4A R713Q had vision impairment but the single subject carrying the RP1 T373I was blind. No hearing impairment was noticed in one of two subjects homozygous and four of ten heterozygous carriers of the MYO1A G662E variant that was linked to an autosomal dominant form of deafness.

Variants for neoplastic diseases

We found variants in the five genes APC, BRCA1, RET, RNASEL, and STK11 that were linked to autosomal dominant forms of cancer or neoplasm as well as four complex risk variants in ELAC2, MSR1, AIP, and SDHB (Table 2). The clinical relevance of these variants has been discussed in the literature and their presence in the genomes of centenarians indicates that these variants are compatible with exceptional longevity.
Table 2

Putative disease variants reported to cause neoplastic diseases

CHRPOSIDREFALTACGeneEffectCLNDBNCLNACCOMIM
117354297rs33927012AG43/1/0SDHBuc001 bae.2:c.T488C:p.S163PCowden-like syndromeRCV000013633185470
1182555149rs74315364CA43/1/0RNASELuc001gpj.1:c.G794T:p.E265*Prostate cancer, hereditaryRCV000013878180435
5112175240rs1801166GC43/1/0APCuc003kpy.3:c.G3950C:p.E1317QAdenomatous polyposis coliRCV000000872611731
816012594rs41341748GA43/1/0MSR1uc003wwz.2:c.C878T:p.R293*Malignant tumor of prostateRCV000015431153622
1043613908rs77724903AT42/2/0RETuc001jal.2:c.A2373T:p.Y791 FFamilial medullary thyroid carcinomaRCV000014962164761
1167258382rs104894190GA41/3/0MPuc001olv.2:c.G912A:p.R304QPituitary-dependent hypercortisolismRCV000005171605555
1712899902rs5030739CT38/6/0ELAC2uc002gnz.3:c.G1622A:p.A541 TProstate cancer, hereditaryRCV000005359605367
1741226488rs1800744CA42/2/0BRCA1uc002ict.2:c.G4599T:p.S1533IFamilial cancer of breastRCV000048588113705
191223125rs59912467CG43/1/0STK11uc002lrl.1:c.C1063G:p.F355LPeutz–Jeghers syndromeRCV000007887602216

CHR, chromosome; POS, position on chromosome based on GRCh37; ID, dbSNP ID; REF, reference sequence; ALT, alternative sequence; AC, counts of subjects with homozygous reference alleles/heterozygous/homozygous nonreference alleles; Effect, position of nucleotide change on UCSC transcript and amino acid change; CLNDBN and CLNACC, selected disease condition and accession ID from ClinVar; OMIM, OMIM accession numbers for the genes.

The tumor suppressor gene STK11 variant F355L (rs59912467) linked to Peutz–Jeghers syndrome was seen in one centenarian. It was reported to affect both the AMPK pathway and cell polarity, thus contributing to the development of malignancies (Forcet et al. 2005). Notably, the same subject carried two additional variants linked to thyroid carcinoma (RET Y791F, rs77724903) and Cowden-like syndrome (SDHB S163P, rs33927012), but was free of any type of neoplasia at age 97. The Y791F variant in the RET protooncogene was further present in another centenarian with a history of cancer of unknown origin. This variant was first described in two German families with familial medullary thyroid carcinoma (Berndt et al. 1998). Six centenarians carried the A541T variant in the gene ELAC2 (rs5030739) that was reported to be associated with prostate cancer (Rebbeck et al. 2000; Tavtigian et al. 2001; Camp and Tavtigian 2002). One of the two male carriers for A541T had no cancer at age 103 years and the other had an unknown type of cancer.

Variants for autosomal dominant forms of cardiac disease

We found variants in ABCC9, ACTN2, ANK2, CACNA1C, JPH2, KCNE2, MYL2, and TMEM43 that were linked to autosomal dominant phenotypes affecting cardiac function (Table 3).
Table 3

Putative disease variants reported to cause autosomal dominant cardiac diseases

CHRPOSIDREFALTACGeneEffectCLNDBNCLNACCOMIM
1236918491rs193922635CT43/1/0ACTN2uc001hyf.2:c.C2148T:p.T716MCardiomyopathyRCV000029298102573
314180731rs113449357CT43/1/0TMEM43uc003byk.2:c.C935T:p.R312WCardiomyopathyRCV000030555612048
4114294537rs45454496GA42/2/0ANK2uc003ibe.3:c.G11792A:p.E3931KCardiac arrhythmia, ankyrin B-relatedRCV000019677106410
122659186rs121912775GA43/1/0CACNA1Cuc001qkl.2:c.G1469A:p.G490RBrugada syndrome 3RCV000019201114205
1222017410rs61688134CT42/2/0ABCC9uc001 rfh.2:c.G2201 A:p.V734lMyocardial infarction 1RCV000029274601439
12111356964rs104894363CT43/1/0MYL2uc001try.3:c.G38Ap.A13TFamilial hypertrophic cardiomyopathy 10RCV000015108160781
2042744802rs140740776CT42/2/0JPH2uc002xli.1:c.G1514A:p.G505SFamilial hypertrophic cardiomyopathy 17RCV000023411605267
2135742938rs74315447TC43/1/0KCNE2uc002ytt.1 :c.T162C:p.M54TLong QT syndrome 6RCV000006425603796

CHR, chromosome; POS, position on chromosome based on GRCh37; ID, dbSNP ID; REF, reference sequence; ALT, alternative sequence; AC, counts of subjects with homozygous reference alleles/heterozygous/homozygous nonreference alleles; Effect, position of nucleotide change on UCSC transcript and amino acid change; CLNDBN and CLNACC, selected disease condition and accession ID from ClinVar; OMIM, OMIM accession numbers for the genes.

Three of the variants in these genes are annotated as causes of cardiac arrhythmia with increased risk of sudden cardiac death. Two centenarians were heterozygous for the E3931K variant in ANK2 (rs45454496) variant, also known as E1813K, which was reported as a loss-of-function mutation in the ankyrin-B regulatory domain (Mohler et al. 2003, 2004). One centenarian carried the G490R variant in CACNA1C (rs121912775), a loss-of-function change that was linked to Brugada syndrome 3 (Antzelevitch et al. 2007). Another carried the M54T (rs74315447) in KCNE2, which was reported to alter the transmembrane domain of MiRP1 that reduces potassium currents leading to long QT syndrome and ventricular fibrillation (Abbott et al. 1999; Splawski et al. 2000). Five variants linked to cardiomyopathy were found in our centenarian sample. The MYL2 A13T (rs104894363) variant located in the regulatory light chain of myosin was reported to be causal for a subtype of familial hypertrophic cardiomyopathy with onset of clinical symptoms around middle age (Poetter et al. 1996; Andersen et al. 2001). The gene ABCC9 encodes the regulatory SUR2A subunit of the cardiac ATP-sensitive potassium (KATP) channel. Two centenarians carried the ABCC9 V734I variant (rs61688134), which was reported to increase the risk of MI by 6.40-fold. (Minoretti et al. 2006). The G505S (rs140740776) variant in JPH2 was reported to be associated with hypertrophic cardiomyopathy in a relatively small sample of Japanese patients (Matsushita et al. 2007). One centenarian carried the TMEM43 R312W variant linked to autosomal dominant arrhythmogenic right ventricular cardiomyopathy/dysplasia (Haywood et al. 2013). Interestingly, one centenarian was found to be heterozygous for both JPH2 G505S and ABCC9 V734I and another was heterozygous for both JPH2 G505S and ANK2 E3931K.

Variants for other autosomal dominant and X-chromosomal diseases

We found 18 rare ClinVar variants for other autosomal dominant diseases (Table 4) and six mutations for X-chromosomal diseases (Table 5). These include variants that increase the risk of metabolic disorders, including hypercholesterolemia (LDLR, V827I, rs137853964), maturity-onset diabetes of the young (MODY) (BLK, A71T, rs55758736), and obesity (MC4R, A175T, rs121913563). One male and one female centenarian carried the V1108M variant in TNXB (rs121912575) linked to the dominant form of Ehlers–Danlos syndrome type 3 (Zweers et al. 2005). The remaining autosomal mutations are annotated to cause a wide range of dominantly inherited phenotypes including metabolic, genitourinary, and skin conditions as well as pediatric conditions, including developmental syndromes, for example, the tetralogy of Fallot, a severe malformation that would reduce life span. Three centenarians presented a variant in TNFRSF1A (R121Q, rs4149584) in the heterozygous state that was linked to TNF-receptor associated periodic fever. This was the only variant for dominantly inherited immune disease in this dataset.
Table 4

Putative disease variants reported to cause other autosomal dominant diseases

CHRPOSIDREFALTACGeneEffectCLNDBNCLNACCOMIM
1152285861rs61816761GA42/2/0FLGuc001 ezu.1:c.C1502T:p.R501*Ichthyosis vulgarisRCV000017712135940
2167141109rs41268673GT36/7/1SCN9Auc010fpl.2:c.C1829A:p.P610TPrimary erythromelalgiaRCV000020511603415
2219755011rs121908120TA41/3/0WNT10Auc002yjd.1:c.T683Ap.F228lOdontoonychodermal dysplasiaRCV000004717606268
5172662014rs28936670GA43/1/0NKX2-5uc003mcm.1:c.C74T:p.R25CTetralogy of FallotRCV000009572600584
5172662026rs104893904CG43/1/0NKX2-5uc003mcm.1:c.G62C:p.E21QTetralogy of FallotRCV000009574600584
632052313rs121912575CT42/2/0TNXBuc003nzl.2:c.G3323Ap.V1108MEhlers–Danlos syndrome type 3RCV000009083600985
811405576rs55758736GA42/2/0BLKuc003wty.2:c.G212Ap.A71TMaturity-onset diabetes of the young type 11RCV000013112191305
818080001rs4987076GA39/5/0NAT1uc003wys.2:c.G632A:p.V211lNAT1*7 ALLELERCV000019386108345
855372085NATA42/2/0SOX17uc003xsb.3:c.T776Ap.Y259NVesicoureteral reflux 3RCV000001140610928
8106431420rs121908601AG42/2/0ZFPM2uc003ymd.2:c.A90G:p.E30GTetralogy of FallotRCV000006502603693
126442643rs4149584CT41/3/0TNFRSF1Auc001qnu.2:c.G363A:p.R121QTNF receptor-associated periodic fever syndrome (TRAPS)RCV000013134191190
1332351535rs121918303AC38/6/0RXFP2uc001 utt.2:c.A665C:p.T222PCryptorchidism, unilateral or bilateralRCV000004376606655
1454418579NATC43/1/0BMP4uc010aoh.2:c.A363G:p.H121RMicrophthalmia syndromic 6RCV000022458112262
1574704267rs56001514GA43/1/0SEMA7Auc002axv.2:c.C1382T:p.R461CBlood group John Milton Hagen systemRCV000029235607961
161129586rs121917877CT42/2/0SSTR5uc002ckq.2:c.C719T:p.R240WResistance to somatostatin analogRCV000013734182455
1772745313rs35910969CG43/1/0SLC9A3R1uc002jlo.2:c.C329G:p.L110VNephrolithiasis/osteoporosis, hypophosphatemic 2RCV000005588604990
1858039060rs121913563CT43/1/0MC4Ruc002lie.1:c.G524A:p.A175TObesityRCV000015406155541
1911240278rs137853964GA43/1/0LDLRuc002mqk.3:c.G2480A:p.V827lFamilial hypercholesterolemiaRCV000030135606945

CHR, chromosome; POS, position on chromosome based on GRCh37; ID, dbSNP ID; REF, reference sequence; ALT, alternative sequence; AC, counts of subjects with homozygous reference alleles/heterozygous/homozygous nonreference alleles; Effect, position of nucleotide change on UCSC transcript and amino acid change; CLNDBN and CLNACC, selected disease condition and accession ID from ClinVar; OMIM, OMIM accession numbers for the genes.

Aside from the above mentioned mutation in UBQLN2, the observed X-chromosomal variants include three variants in the DMD gene that have been discussed as cause of Duchenne and Becker muscular dystrophy. One frequent missense variant rs1800279 (H2921R) in the DMD gene was observed in one male centenarian and another male centenarian carried both rare variants rs1800278 (N2912D) and rs41305353 (E2910V). The carriers did not have any documented muscle diseases.

Variants for recessive diseases and complex diseases

Among the 72 variants for recessive traits, we found three variants that were observed in the homozygous state in at least one centenarian (Table S2). These are variants in ADA, ALG6, and HPS5. The ALG6 variant Y131H (rs35383149) has been annotated to cause congenital disorder of glycosylation, type Ic, a childhood onset metabolic disorder accompanied by severe neurological symptoms (Miller et al. 2011). We found no literature evidence supporting the pathogenicity of the ADA variant D8N (rs73598374) or the HPS5 variant T1098I (rs61884288). Three subjects were heterozygous carriers for Factor V Leiden (F5, Q534R, rs6025) (Bertina et al. 1994). Heterozygous carriers have elevated risks of deep venous thrombosis, pulmonary embolism (Juul et al. 2004), and stroke (Casas et al. 2004). Note that the GRCh37 genome assembly presents the risk allele as the reference allele and no individuals homozygous for the reference allele were seen in our sample. One carrier had a history of MI and another had a history of both MI and stroke, but the third carrier had no known thrombotic diseases by the age of 106 years. Finally, we also identified homozygous individuals for putative recessive disease alleles with relevance for longevity regardless of their population allele frequency. Of note, we found three missense variants in the ACADS gene R107C (rs61732144), R171W (rs1800556), and G209S (rs1799958) (Tables S1 and S2) that were initially reported (Corydon et al. 2001; Pedersen et al. 2008) in 10 patients with ethylmalonic aciduria and short-chain acyl-CoA dehydrogenase (SCAD) deficiency, a mitochondrial fatty acid oxidation disorder causing neuromuscular phenotypes with hypotonia and developmental delay as the prominent features of the disease. Homozygosity for R107C as well as compound heterozygosity for R107C and G209S have been described as disease causing in AJ patients with SCAD with reduced penetrance (Tein et al. 2008) depending on other genetic modifiers or environmental stressors. Variants R171W and G209S have both high allele frequencies (5.5% and 23.5%, respectively) in the general population (U.S. and the Netherlands) and they are considered to confer susceptibility for clinical disease (van Maldegem et al. 2006). We found five AJ centenarians who are homozygous for the variant G209S (MAF = 36.4%) and they do not carry any other missense variants in the ACADS gene. The carrier frequency for the R107C variant in our centenarians (3/44) is identical to the reported carrier frequency in AJ population (Tein et al. 2008).

Discussion

In this study we observed many previously reported Mendelian mutations that are sufficiently benign to allow the individual carriers to achieve exceptional longevity. The presence of these specific variants in the genomes of centenarians can be helpful for clinical geneticists who are challenged with the evaluation of their putative pathogenicity as incidental findings. More generally, our findings support the notion that for many putative disease variants it is not straightforward to decide whether they should be regarded and acted upon as incidental findings, when they are observed in healthy individuals (Kingsmore 2013). Several genes harboring putatively pathogenic variants in our centenarians are on the list of 57 genes with reportable findings according to the ACMG recommendations (Green et al. 2013b). These genes comprise 4 of the 24 genes for cancer (BRCA1, APC, RET, STK11), 1 of the 20 cardiac disease genes (TMEM43), 1 of the remaining 13 genes (LDLR). This demonstrates the challenge to identify actionable mutations even in well-established disease genes. The variants found in four of the cancer genes APC, BRCA1, RET, and SDHB in our AJ centenarian sample had also been labeled as “almost certainly benign” in another sequencing study (ClinSeq) of 572 middle aged participants (17% of which are of AJ ancestry) (Johnston et al. 2012). Surprisingly, we also observed many rare variants linked to dominantly inherited forms of diseases that would clearly affect life expectancy. These disease traits include cardiac diseases such as cardiomyopathy and arrhythmia that increase the risk of sudden cardiac death as well as metabolic diseases such as diabetes, hypercholesterolemia, and obesity. One explanation for the presence of disease alleles in centenarians might be that these variants have incomplete penetrance as a result of complex interplay between modifying genetic and environmental factors (Bergman et al. 2007; Cooper et al. 2013). However, for some variants the published evidence may be viewed as too weak to uphold their classification as pathogenic. For example, the three missense variants in the DMD gene were all found in at least one male centenarian. Given the markedly reduced life expectancy of patients affected by Duchenne muscular dystrophy (Kieny et al. 2013), it is unlikely that these variants are pathogenic. In other cases, the resulting phenotypes of mutations could have little impact on longevity, which may primarily apply to diseases such as deafness, glaucoma, retinitis pigmentosa, premature ovarian failure, or ichthyosis. In other instances the possible influence on life expectancy is not clear, such as the variant V1108M in the TNXB gene, which was reported to cause Ehlers–Danlos syndrome type 3 (Zweers et al. 2005) and shown to affect the protein function (Zhuang et al. 2010). We note that many presumable disease mutations are present in centenarians with fairly high frequency. We did not discuss these variants in detail because they are common in the ESP data. With the exception of the APOE ε4 allele, these common variants are from the beginning more likely to constitute polymorphisms than any clinically significant mutations. Despite its high population frequency, the APOE ε4 allele was suggested to follow semi-dominant inheritance (Genin et al. 2011). APOE is also the only reported gene reaching GWAS significance threshold for longevity (Beekman et al. 2013). In our centenarian sample, the APOE ε4 allele frequency is very similar to the previously reported allele frequencies in 325 French Caucasian centenarians (Schachter et al. 1994), namely 5.2% for APOE ε4 and 12.8% for APOE ε2. In the ESP dataset, the respective allele frequencies for ε4 and ε2 are 11.7% and 5.6% in European Americans and 18.9% and 8.7% in African Americans. This is congruent with the previous finding that ε4 was associated with reduced and ε2 with increased life span (Deelen et al. 2011). However, earlier studies did not observe any homozygote centenarians for APOE ε4. As would be expected, the nonaffected APOE ε4 homozygote had rare coding variants in APOE modifier candidate genes, but testing their significance needs to be left to subsequent studies. The future identification of such protective genetic factors in centenarians may increase the clinical utility of the APOE ε4 allele. Due to the relatively small sample size and lack of a control data set from a matched population, we cannot search for novel variants with impact on longevity here. In future, personalized genomic sequencing is likely to generate many incidental findings. Rigorous investigations for cosegregation in family members, large ancestry-matched population screening, and functional studies will be required to identify those variants that are sufficiently penetrant to be considered clinically relevant. Genetic counseling provided by trained clinicians will be necessary to communicate the potential impact of genetic variants to avoid unnecessary and far reaching burden. This is especially true in considering genetic screening in the young, where the demarcation of putatively actionable mutations needs to be approached with considerable caution, allowing appropriate balance to be conveyed in risk/benefit discussions with parents. Our study provides a list of variants that are currently flagged as pathogenic, but are compatible with exceptional longevity. Thereby, our study contributes to the genetic community's efforts to refine the set of variants with strong clinical significance and mark previously reported disease mutations with unclear clinical significance.
  65 in total

1.  BLAT--the BLAST-like alignment tool.

Authors:  W James Kent
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

2.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits.

Authors:  Lucia A Hindorff; Praveen Sethupathy; Heather A Junkins; Erin M Ramos; Jayashri P Mehta; Francis S Collins; Teri A Manolio
Journal:  Proc Natl Acad Sci U S A       Date:  2009-05-27       Impact factor: 11.205

Review 3.  Incidental findings from clinical genome-wide sequencing: a review.

Authors:  Z Lohn; S Adam; P H Birch; J M Friedman
Journal:  J Genet Couns       Date:  2013-05-26       Impact factor: 2.537

4.  A mutation in APP protects against Alzheimer's disease and age-related cognitive decline.

Authors:  Thorlakur Jonsson; Jasvinder K Atwal; Stacy Steinberg; Jon Snaedal; Palmi V Jonsson; Sigurbjorn Bjornsson; Hreinn Stefansson; Patrick Sulem; Daniel Gudbjartsson; Janice Maloney; Kwame Hoyte; Amy Gustafson; Yichin Liu; Yanmei Lu; Tushar Bhangale; Robert R Graham; Johanna Huttenlocher; Gyda Bjornsdottir; Ole A Andreassen; Erik G Jönsson; Aarno Palotie; Timothy W Behrens; Olafur T Magnusson; Augustine Kong; Unnur Thorsteinsdottir; Ryan J Watts; Kari Stefansson
Journal:  Nature       Date:  2012-08-02       Impact factor: 49.962

5.  Actionable, pathogenic incidental findings in 1,000 participants' exomes.

Authors:  Michael O Dorschner; Laura M Amendola; Emily H Turner; Peggy D Robertson; Brian H Shirts; Carlos J Gallego; Robin L Bennett; Kelly L Jones; Mari J Tokita; James T Bennett; Jerry H Kim; Elisabeth A Rosenthal; Daniel S Kim; Holly K Tabor; Michael J Bamshad; Arno G Motulsky; C Ronald Scott; Colin C Pritchard; Tom Walsh; Wylie Burke; Wendy H Raskind; Peter Byers; Fuki M Hisama; Deborah A Nickerson; Gail P Jarvik
Journal:  Am J Hum Genet       Date:  2013-09-19       Impact factor: 11.025

6.  Recurrent missense mutations in TMEM43 (ARVD5) due to founder effects cause arrhythmogenic cardiomyopathies in the UK and Canada.

Authors:  Annika F M Haywood; Nancy D Merner; Kathy A Hodgkinson; Jim Houston; Petros Syrris; Valerie Booth; Sean Connors; Antonios Pantazis; Giovanni Quarta; Perry Elliott; William McKenna; Terry-Lynn Young
Journal:  Eur Heart J       Date:  2012-11-15       Impact factor: 29.983

7.  Mutations in either the essential or regulatory light chains of myosin are associated with a rare myopathy in human heart and skeletal muscle.

Authors:  K Poetter; H Jiang; S Hassanzadeh; S R Master; A Chang; M C Dalakas; I Rayment; J R Sellers; L Fananapazir; N D Epstein
Journal:  Nat Genet       Date:  1996-05       Impact factor: 38.330

8.  Incidental swimming with millstones.

Authors:  Stephen F Kingsmore
Journal:  Sci Transl Med       Date:  2013-07-17       Impact factor: 17.956

Review 9.  Identifying disease mutations in genomic medicine settings: current challenges and how to accelerate progress.

Authors:  Gholson J Lyon; Kai Wang
Journal:  Genome Med       Date:  2012-07-26       Impact factor: 11.117

10.  ClinVar: public archive of relationships among sequence variation and human phenotype.

Authors:  Melissa J Landrum; Jennifer M Lee; George R Riley; Wonhee Jang; Wendy S Rubinstein; Deanna M Church; Donna R Maglott
Journal:  Nucleic Acids Res       Date:  2013-11-14       Impact factor: 16.971

View more
  35 in total

1.  Inverse effect of the APOE epsilon4 allele in late- and early-onset Alzheimer's disease.

Authors:  Vincenzo De Luca; Maria Donata Orfei; Sara Gaudenzi; Carlo Caltagirone; Gianfranco Spalletta
Journal:  Eur Arch Psychiatry Clin Neurosci       Date:  2015-12-29       Impact factor: 5.270

2.  Four Genome-Wide Association Studies Identify New Extreme Longevity Variants.

Authors:  Paola Sebastiani; Anastasia Gurinovich; Harold Bae; Stacy Andersen; Alberto Malovini; Gil Atzmon; Francesco Villa; Aldi T Kraja; Danny Ben-Avraham; Nir Barzilai; Annibale Puca; Thomas T Perls
Journal:  J Gerontol A Biol Sci Med Sci       Date:  2017-10-12       Impact factor: 6.053

3.  ABO blood group polymorphism has an impact on prostate, kidney and bladder cancer in association with longevity.

Authors:  Donatas Stakišaitis; Milda Juknevičienė; Albertas Ulys; Dalia Žaliūnienė; Daiva Stanislovaitienė; Ramunė Šepetienė; Anželika Slavinska; Kęstutis Sužiedėlis; Vita Lesauskaitė
Journal:  Oncol Lett       Date:  2018-05-21       Impact factor: 2.967

4.  Genetic mutations you want.

Authors:  Sarah C P Williams
Journal:  Proc Natl Acad Sci U S A       Date:  2016-03-08       Impact factor: 11.205

5.  Evaluation of the Genetic Variance of Alzheimer's Disease Explained by the Disease-Associated Chromosomal Regions.

Authors:  Alireza Nazarian; Alexander M Kulminski
Journal:  J Alzheimers Dis       Date:  2019       Impact factor: 4.472

Review 6.  In Silico Functional Annotation of Genomic Variation.

Authors:  Mariusz Butkiewicz; William S Bush
Journal:  Curr Protoc Hum Genet       Date:  2016-01-01

7.  The Alzheimer's Disease Exposome.

Authors:  Caleb E Finch; Alexander M Kulminski
Journal:  Alzheimers Dement       Date:  2019-09-10       Impact factor: 21.566

8.  Positive attitude toward life, emotional expression, self-rated health, and depressive symptoms among centenarians and near-centenarians.

Authors:  Kaori Kato; Richard Zweig; Clyde B Schechter; Nir Barzilai; Gil Atzmon
Journal:  Aging Ment Health       Date:  2015-06-26       Impact factor: 3.658

9.  Folding and Misfolding of Human Membrane Proteins in Health and Disease: From Single Molecules to Cellular Proteostasis.

Authors:  Justin T Marinko; Hui Huang; Wesley D Penn; John A Capra; Jonathan P Schlebach; Charles R Sanders
Journal:  Chem Rev       Date:  2019-01-04       Impact factor: 60.622

10.  Differential burden of rare protein truncating variants in Alzheimer's disease patients compared to centenarians.

Authors:  Yun Freudenberg-Hua; Wentian Li; Avinash Abhyankar; Vladimir Vacic; Vanessa Cortes; Danny Ben-Avraham; Jeremy Koppel; Blaine Greenwald; Soren Germer; Robert B Darnell; Nir Barzilai; Jan Freudenberg; Gil Atzmon; Peter Davies
Journal:  Hum Mol Genet       Date:  2016-06-03       Impact factor: 6.150

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.