Literature DB >> 29288195

Sequencing of Linkage Region on Chromosome 12p11 Identifies PKP2 as a Candidate Gene for Left Ventricular Mass in Dominican Families.

Nicole D Dueker1, Shengru Guo1, Ashley Beecham1, Liyong Wang1,2, Susan H Blanton1,2, Marco R Di Tullio3, Tatjana Rundek4,5, Ralph L Sacco6,4,5.   

Abstract

Increased left ventricular mass (LVM) is an intermediate phenotype for cardiovascular disease (CVD) and a predictor of stroke. Using families from the Dominican Republic, we have previously shown LVM to be heritable and found evidence for linkage to chromosome 12p11. Our current study aimed to further characterize the QTL by sequencing the 1 LOD unit down region in 10 families from the Dominican Republic with evidence for linkage to LVM. Within this region, we tested 5477 common variants [CVs; minor allele frequency (MAF) ≥5%] using the Quantitative Transmission-Disequilibrium Test (QTDT). Gene-based analyses were performed to test rare variants (RVs; MAF < 5%) in 181 genes using the family-based sequence kernel association test. A sample of 618 unrelated Dominicans from the Northern Manhattan Study (NOMAS) and 12 Dominican families with Exome Array data were used for replication analyses. The most strongly associated CV with evidence for replication was rs1046116 (Discovery families P = 9.0 × 10-4; NOMAS P = 0.03; replication families P = 0.46), a missense variant in PKP2 In nonsynonymous RV analyses, PKP2 was one of the most strongly associated genes (P = 0.05) with suggestive evidence for replication in NOMAS (P = 0.05). PKP2 encodes the plakophilin 2 protein and is a desmosomal gene implicated in arrythmogenic right ventricular cardiomyopathy and recently in arrhythmogenic left ventricular cardiomyopathy, which makes PKP2 an excellent candidate gene for LVM. In conclusion, sequencing of our previously reported QTL identified common and rare variants within PKP2 to be associated with LVM. Future studies are necessary to elucidate the role these variants play in influencing LVM.
Copyright © 2018 N. D. Dueker et al.

Entities:  

Keywords:  DNA sequencing; Hispanics; left ventricle; plakophilin; rare variants

Mesh:

Substances:

Year:  2018        PMID: 29288195      PMCID: PMC5919734          DOI: 10.1534/g3.117.300358

Source DB:  PubMed          Journal:  G3 (Bethesda)        ISSN: 2160-1836            Impact factor:   3.154


Cardiovascular disease (CVD) is a significant public health burden, affecting more than one in three American adults ≥20 yr old and resulting in an estimated $316.6 billion in medical costs in 2012 (Mozaffarian 2016). Increased left ventricular mass (LVM), an intermediate phenotype for CVD, is predictive of stroke and CVD events (Levy ; Devereux ; Rodriguez ). Traditional factors, including elevated blood pressure, body mass index, and weight, are known to influence LVM (Savage ; Garner ). Heritability studies suggest that LVM may be genetically controlled, with reported heritability estimates ranging from 0.24 to 0.70 in various populations (Post ; Swan ; Juo ; Sharma ; Assimes ; de Simone ; Wang ). In addition, genome-wide association studies, primarily performed in samples of European ancestry, have identified significant associations between LVM and common variants within several loci (Vasan ; Fox ). A large meta-analysis of individuals of European ancestry found variants within 14q12 and 2p21 to be associated with decreased LVM and variants within 15q14 to be associated with increased LVM (Vasan ). In a meta-analysis of African American cohorts, variants within 8q11 were associated with increased LVM (Fox ). Though these loci were significantly associated with LVM in the discovery cohorts of each study, none achieved replication in their respective validation cohorts. Additional variants have been implicated in LVM through other genome-wide association studies, although none achieved statistical significance (Vasan ; Arnett ; Harper ). In a previous genome-wide linkage study, we identified a novel region on chromosome 12p11 linked to LVM using extended families from the Dominican Republic (MLOD = 3.11), particularly in families with higher waist circumference (MLOD = 4.45) (Wang ). We further characterized this region by investigating common variants in a population-based cohort of Caribbean Hispanics and identified several candidates, including SOX5 (Della-Morte ). For this study we aimed to identify additional common and rare variants contributing to the 12p11 linkage signal by sequencing the 1 LOD down region on 12p11 in 10 extended Dominican families with strong evidence for linkage to this region (LOD ≥ 0.1). We validated the observed associations using both genome-wide and exome-wide genotype data from unrelated Dominicans in the Northern Manhattan Study (NOMAS) as well as exome-wide genotype data from 12 additional extended Dominican families with suggestive evidence for linkage to the 12p11 region.

Materials and Methods

Study samples

Individuals from both NOMAS, a population-based cohort, and the Family Study of Stroke Risk and Carotid Atherosclerosis, a family study consisting of select probands from NOMAS and their family members (Sacco ), were used for the current study. Details of NOMAS and the Family Study have been published previously (Sacco , 2007). Briefly, a total of 3298 stroke-free population-based participants were enrolled in NOMAS from 1993 to 2001. The family study enrolled a subset of Caribbean Hispanic probands from NOMAS with a high risk of cardiovascular disease (Sacco ) who could provide a family history, obtain family members’ permission for research staff to contact them, and have at least three first-degree relatives able to participate. All probands were identified in northern Manhattan and family members were enrolled in New York at Columbia University and in the Dominican Republic (DR) at the Clinicas Corazones Unidos in Santo Domingo. All participants provided written informed consent and the study was approved by the Institutional Review Boards of Columbia University, University of Miami, the National Bioethics Committee, and the Independent Ethics Committee of Instituto Oncologico Regional del Cibao in the DR (20070478 and 20072012). Twenty-two families with a family-specific LOD score >0.1 at the chromosome 12p11 QTL for LVM were selected for our current study; 10 families were sequenced as part of our discovery analyses (n = 180 individuals) (Table 1) and 12 families were genotyped on the Exome Array for our replication analyses (n = 143 individuals) (Supplemental Material, Table S1). Details of the family selection have been published previously (Dueker ). Additional replication analyses were performed using a sample of 618 unrelated Dominicans from NOMAS (Sacco ) who were not enrolled in the family study.
Table 1

Characteristics of Dominican families included in analyses for chr12p11 analyses

Family IDIndividuals per FamilyFamily-Specific LOD ScoreLV Mass Residual (g)Age (yr) µ ± SDBMI (kg/m2) µ ± SDWaist Circumference (inch) µ ± SD% Female% Diabetes% Smoker% Hypertension
MeanSDMinMax
5275240.94−0.102.71−4.776.1644.0 ± 15.733.331.2 ± 6.338.0 ± 6.212.533.333.3
3719120.920.432.64−3.045.3649.3 ± 16.233.332.3 ± 7.837.8 ± 5.48.333.341.7
5103350.441.312.34−3.306.8942.2 ± 17.837.126.9 ± 5.336.3 ± 5.89.637.142.9
2235150.37−0.463.40−5.625.5449.3 ± 16.960.029.1 ± 3.439.0 ± 5.620.060.066.7
4641130.30−0.442.44−4.374.0442.9 ± 18.053.831.3 ± 8.537.3 ± 6.153.853.823.1
2783150.21−0.573.00−3.976.6640.7 ± 15.426.729.5 ± 6.036.9 ± 5.213.326.753.3
6081270.20−0.551.45−3.132.4746.3 ± 16.914.827.7 ± 5.436.0 ± 5.33.714.840.7
3561170.16−0.063.18−4.996.7047.1 ± 13.841.235.3 ± 7.542.2 ± 6.929.441.252.9
1917100.150.202.84−4.064.0145.8 ± 16.880.033.8 ± 7.241.0 ± 6.020.080.060.0
803120.14−1.162.10−4.253.1154.7 ± 24.241.730.3 ± 5.238.8 ± 5.116.741.741.7

Echocardiographic evaluation and risk factor measurements

As detailed previously (Wang ; Della-Morte ), standard two-dimensional echocardiography, including color-Doppler flow study was performed according to the guidelines of the American Society of Echocardiography (Sahn ). High quality parasternal long axis views of the left ventricle were obtained, from which left ventricular end-diastolic diameter (LVDD), left ventricular end-systolic diameter (LVSD), interventricular septum (IVS), and posterior wall thickness (PWT) were derived (Di Tullio ). LVM was calculated according to the modified American Society of Echocardiography formula: LVM = 0.8 [1.04 (LVDD + IVS + PWT)3 − (LVDD)3] + 0.6 (Devereux ). Vascular risk factors, including body mass index (BMI), systolic blood pressure, and smoking status, were collected during a standardized interview (Elkind ). Smoking status was defined as never vs. ever, systolic blood pressure was defined as the average of two separate systolic blood pressure measurements taken after rest, and presence of diabetes was defined as fasting blood glucose level ≥126 mg/dl or self-reported history of diabetes. All data collection procedures were standardized and identical across NOMAS and the Family Study.

Discovery sample sequencing and quality control

In the family sample, genomic DNA was isolated from whole blood. Targeted sequencing of the exons in 181 genes within the 1 LOD unit down region on 12p11 (chr12:24–51 Mb), as well as sequencing beyond the exons for candidate genes identified previously (ARID2, BICD1, BIN2, c12orf68, FAR2, RACGAP1, SLC38A1, SOX5) (Della-Morte ), was performed using a customized Agilent SureSelect Enrichment kit. A detailed description of sequencing methods has been previously published (Wang ). Briefly, DNA libraries were sequenced on an Illumina HiSeq2000 and the raw sequencing reads were aligned to the human reference sequence hg19 with the Burrows-Wheeler Aligner (Li and Durbin 2010). Variant calling was performed using the Genome Analysis ToolKit and potential functions of variants were annotated using ANNOVAR v. 2016Feb01 (Wang ) and SeattleSequation 138. Quality control was conducted at both variant and sample levels, as described previously (Wang ; Dueker ). Within each individual sample, variants with a depth <4 or Phred-Like score <100 were set as missing. Variants with VQSLOD <−4 and variants with call rate <75% were removed from further analysis. Individuals with low concordance (<95%) between the sequencing data and available genotype data were removed (n = 3). Additionally, individuals missing LVM measures (n = 9) and/or covariate values (n = 1) were removed. For the remaining family study samples, pedigree structure was confirmed using the Graphical Relationship Representation software v. 1.2.1.41. Mendelian error checking was performed, and Mendelian errors were set to missing for all the variants called using PLATO v. 0.84 (Grady ).

Replication sample genotyping and quality control

Exome Array:

NOMAS participants and our 12 replication families were genotyped using the Illumina HumanExome-24v1_B Beadchip, at the Hussman Institute for Human Genomics in the Center for Genome Technology (Miami, FL). Our Exome Array included custom exonic variants selected on the basis of sequencing data obtained in the discovery family data set. Details of the variant selection have been described previously (Wang ). A total of 4128 single nucleotide variants (SNVs) within our 12p11 region were available for quality control analyses. Of the 659 NOMAS participants and 150 replication family members genotyped on the Exome Array, 99.8% had a genotype call rate >98%. A subset of NOMAS participants genotyped on the Exome Array also had Affymetrix 6.0 whole-genome genotype data available and all participants had high concordance with this additional data set (≥96%). We removed seven NOMAS individuals and five replication family members due to unexpected duplication or relatedness, gender discrepancy, and low call rate (<98%). For our current study, individuals missing LVM measures and/or covariate values were removed (n = 38 NOMAS participants; n = 2 replication family members), leaving us with a final sample of 618 Dominican NOMAS participants and 143 replication family members. At the variant level, we removed SNVs with call rate <95% (n = 3 in NOMAS; n = 7 in the replication families) and monomorphic SNVs, leaving us with 1419 exonic rare single nucleotide variants (RVs) in our region for Exome Array analysis in NOMAS and 842 exonic RVs in the replication families. Mendelian error checking was performed in the replication families and Mendelian errors were set to missing for all the variants called using PLATO v. 0.84 (Grady ).

Affymetrix 6.0 genotyping chip:

In addition to Exome Array data, many of our NOMAS participants also had Affymetrix 6.0 whole-genome genotyping data available. These data were used for our common single nucleotide variant (CV) analyses since only 10.5% of CVs identified in the discovery families were available on the Exome Array (n = 574). Details of our genotyping and QC have been reported previously (Della-Morte ). These data were imputed using the 1000 Genomes phase 1, version 3 reference panel with IMPUTE2 v.2.2.2 (Howie ). Variants with INFO ≤0.4 were removed from analyses.

Statistical methods

Family-based discovery analyses:

As in previous analyses, LVM was natural log transformed and multiplied by 10 to ensure it was normally distributed and properly scaled for analyses in SOLAR 6.6.2 (Wang ; Della-Morte ). Common and rare SNVs were defined based on frequencies from Dominican NOMAS participants, as described previously (Wang ). SNVs were classified as common if they had MAF ≥5% and rare if they had MAF <5% or could not be imputed efficiently (INFO ≤0.4) in NOMAS Dominicans. Analyses in the families were performed using the sequencing data. Single-variant analyses were performed for CVs using the Quantitative Transmission-Disequilibrium Test (QTDT), implemented in SOLAR. Adjustment was made for sex, BMI, systolic blood pressure, and smoking status. Covariates were identified using a polygenic screen implemented in SOLAR, with covariates having P < 0.1 included in analyses. CVs with P < 9.13 × 10−6 were considered significant based on a Bonferroni correction of 5477 tests. To evaluate the contribution of our most strongly associated CV to our linkage results, linkage analyses were performed in our combined sample of sequenced and replication families in SOLAR following our previously detailed protocol (Wang ). Briefly, linkage analyses were run with and without CV genotype as a covariate and a likelihood ratio test was performed to determine if the LOD score significantly decreased after conditioning on the CV. Gene-based analyses were performed for RVs using the Family SNP-set (Sequence) Kernel Association Test (Fam-SKAT) v. 1.8 (Chen ), adjusting for the same covariates included in our single-variant analyses. We employed two different gene-based analyses based on annotation from ANNOVAR v. 2016Feb01 (Wang ) and SeattleSequation 138: exonic RVs (UTR3, UTR5, synonymous, missense, nonsense, or splice-site variants) and a subset of nonsynonymous RVs only (missense, nonsense, or splice-site variants). These analyses were restricted to genes with ≥2 polymorphic variants and a P < 1.83 × 10−4 was considered significant based on a Bonferroni correction of 273 tests (181 exonic RV genes and 92 nonsynonymous RV genes). Using SAS v. 9.3, we computed the residual LVM value after adjusting for the associated risk factors to better visualize the distribution of RVs in relation to LVM.

Replication analyses:

LVM was natural log transformed and multiplied by 10 to ensure a normal distribution. We additionally removed outliers falling 3 SD above or below the mean LVM value. Single-variant CV analyses were performed in the NOMAS sample using linear regression, implemented in PLINK v. 1.7. Variants were coded additively. Replication was defined as CVs with P < 0.05 and having the same direction of effect. Affy 6.0 genotyping data were used for these analyses. Gene-based RV association analyses using Exome Array data were performed using SKAT-O v. 1.1.2 in the NOMAS sample and Fam-SKAT v. 1.8 in the replication family sample. These analyses were performed using the same two filtering algorithms employed in the discovery analyses; all exonic RVs and then a subset of nonsynonymous RVs only. Analyses were restricted to genes with ≥2 polymorphic variants. Replication was defined as genes with P < 0.05. All association analyses were adjusted for the same covariates included in our discovery analyses. NOMAS analyses were additionally adjusted for factors that were associated with LVM (P < 0.1) in this sample: diabetes, time between LVM measurement and baseline patient assessment, waist hip ratio, and the first principal component obtained via principal components analyses implemented in Eigenstrat to account for population substructure. Details of our Eigenstrat analysis and resulting principal components can be found in Figure S1 and File S1.

Data availability

The data that support the findings of this study are currently being uploaded to dbGaP.

Results

Participant and SNV characteristics

A total of 180 individuals in 10 Dominican families were included in our discovery analyses. Characteristics of these 10 families are summarized in Table 1. Family size ranged from 10 to 35 individuals and family-specific LOD scores for the 12p11 region ranged from 0.14 to 0.94. LVM residual values ranged from −5.62 to 6.89. Mean age was between 40 and 54 yr and mean BMI was similar across the 10 families. Variability was seen with respect to percent of participants with diabetes and hypertension in each family (Table 1). Within the 10 families, sequencing identified 5473 CVs and 10,167 RVs. Among RVs, 24.6% (n = 2503) were novel and 22.8% (n = 2318) were classified as exonic. A total of 20.2% of exonic RVs (n = 468 variants) were either missense, nonsense, or splice-site variants (Table 2). In the NOMAS sample, 96.5% of the CVs and 44.4% of the exonic RVs identified through sequencing were available for replication analyses. An additional 572 exonic RVs on the Exome Array were polymorphic in the NOMAS sample but were monomorphic in the discovery family sample and were included in gene-based replication analyses. In the replication families, 29.6% of the exonic RVs identified through sequencing were available for analysis in addition to 153 exonic RVs that were on the Exome Array and polymorphic in the replication families but monomorphic in the discovery family sample.
Table 2

Polymorphic variants identified by resequencing of chr12p11 region in 10 Dominican families

MAFa
Type of VariantbTotalNovel<5%≥5%
Missense60885372151
Nonsense7151
Synonymous59867316215
Splice site6141
UTR 3′ or UTR 5′2042374995673
ncRNA exonic127326629
Intronic11,380183956273914
ncRNA intronic2514311692
Upstream or downstream3156115896
ncRNA Other8053
Intergenic29800298
Total15,640250376645473

MAF based on NOMAS DR frequencies.

Function based on ANNOVAR(2016Feb01) annotation and dbSNP138/GVS 138 annotation.

MAF based on NOMAS DR frequencies. Function based on ANNOVAR(2016Feb01) annotation and dbSNP138/GVS 138 annotation.

CV association results

Results from the CV analyses are shown in Figure 1 and our most strongly associated variants with suggestive evidence for replication are listed in Table 3. While no CV reached peak-wide significance (P < 9.2 × 10−6), rs1046116 achieved a P-value of 9.0 × 10−4 and showed evidence for replication in NOMAS (P = 0.03). This variant was associated with decreased LVM and is a missense variant located within PKP2. To investigate this association further, we tested rs1046116 for association with LVM in our replication families and found no evidence for replication (P = 0.46). To evaluate the contribution of rs1046116 to our linkage signal, we performed linkage analyses in our combined discovery and replication family sample and observed a change in LOD score from 9.09 to 8.30 (P < 0.0001) with rs1046116 adjustment, indicating that this CV accounted for some of the linkage signal at the 12p11 QTL. Eight additional variants showed suggestive evidence for association in both the families and NOMAS (P < 0.05). The minor allele of these variants, except for rs11168985 (located in CPNE8), was associated with increased LVM.
Figure 1

Peak-wide common variant association results for the chr12p11 region. The multipoint LOD score over the region is shown as a solid line. Red circles represent Quantitative Transmission-Disequilibrium P-values in the Dominican families and black circles represent linear regression P-values in NOMAS. The blue dot above the multipoint LOD score line indicates the position of PKP2. The dashed line indicates suggestive association at P < 0.001.

Table 3

Common variants with P < 0.05 in the discovery families and evidence for replication in NOMAS

Discovery FamiliesNOMAS
GeneSNPBPEffect Allele/Reference AlleleEAFDirection of EffectaP-valuebMAFDirection of EffectaP-valuebFunction
PKP2rs104611633,021,934G/A0.159.04 × 10−40.200.03Missense
ERGIC2rs103560729,509,513C/A0.39+4.72 × 10−30.42+0.05Intronic
OR10AD1rs1116845948,596,241G/A0.26+0.010.24+0.02Missense
SLC38A4rs219116247,197,648A/G0.27+0.010.34+0.03Intronic
VDRrs73123648,238,757G/A0.29+0.020.34+0.03Synonymous
ANO6rs7408182745,833,755A/G0.08+0.030.09+1.99 × 10−33′ UTR
TSPAN11rs3598943931,145,916T/A0.21+0.030.13+0.013′ UTR
CPNE8rs1116898539,045,983A/C0.250.040.220.01Downstream
RPAP3rs731179048,061,435A/G0.04+0.040.07+0.02Intronic

+ indicates effect allele associated with increased LVM; − indicates effect allele associated with decreased LVM.

QTDT P-value for families, linear regression P-value for NOMAS. Bold typeface indicates p-value <0.05.

Peak-wide common variant association results for the chr12p11 region. The multipoint LOD score over the region is shown as a solid line. Red circles represent Quantitative Transmission-Disequilibrium P-values in the Dominican families and black circles represent linear regression P-values in NOMAS. The blue dot above the multipoint LOD score line indicates the position of PKP2. The dashed line indicates suggestive association at P < 0.001. + indicates effect allele associated with increased LVM; − indicates effect allele associated with decreased LVM. QTDT P-value for families, linear regression P-value for NOMAS. Bold typeface indicates p-value <0.05.

RV association results

Gene-based RV analyses were performed using two filtering algorithms; first, analyzing all exonic RVs (n = 181 genes) and second, restricting analyses to nonsynonymous RVs (n = 92 genes). A total of 54 genes showed suggestive evidence for association in the discovery families under at least one filtering algorithm (P < 0.05). Results for these genes are shown in Table S2. Among these top genes, we observed at least suggestive evidence for replication of five genes in NOMAS and three genes in the replication families (P < 0.10). Summary results for these genes are shown in Table 4. In exonic RV analyses, NELL2 was our most strongly associated gene (P = 2.2 × 10−4), although this finding did not replicate in NOMAS (P = 0.53) or the replication families (P = 0.18) (Table S2). While no gene achieved replication in both replication samples, our most strongly associated gene with evidence for replication in NOMAS was ALG10B (P = 0.02 in the discovery families; P = 0.04 in NOMAS; P = 0.86 in the replication families). When analyses were restricted to nonsynonymous RVs, this association became slightly attenuated in the discovery families (P = 0.06), and was even stronger in NOMAS (P = 0.03). GXYLT1 was our most strongly associated gene with evidence for replication in the replication families (P = 0.04 in the discovery families; P = 0.004 in the replication families; P = 0.36 in NOMAS).
Table 4

Gene-based association results for rare variant analysis in the chr12p11 region, for genes with P < 0.05 in the Dominican families and P < 0.10 in the replication families or NOMAS

Exonic RVsNonsynonymous RVs
Discovery FamiliesReplication FamiliesNOMASDiscovery FamiliesReplication FamiliesNOMAS
GeneMB Start#SNVsPval#SNVsPval#SNVsPval#SNVsPval#SNVsPval#SNVsPval
NELL244.90242.2 × 10−450.1880.5370.00620.0251.00
PKP232.94164.68 × 10−360.98140.1360.0560.98110.05
ALG10B38.71420.0290.86220.0470.0620.8170.03
FAM186A50.72260.02150.16210.04180.06120.20180.12
ZNF64148.73170.0380.1130.3650.15
PPFIBP127.67230.0570.33122.54 × 10−360.1250.3999.53 × 10−3
SLC2A1340.14190.0670.3990.1620.0230.5330.04
GXYLT142.48240.0424.27 × 10−390.3630.85
SLC38A447.16210.0260.07120.8930.2950.88
In nonsynonymous RV analyses, IFLTD1 showed the strongest evidence for association in the discovery families (P = 0.003). However, since no variants met our inclusion criteria in NOMAS or the replication families, replication was not possible. NELL2 was our second most strongly associated gene (P = 0.006), and replicated in the replication families (P = 0.02) but not NOMAS (P = 1.00) (Table S2). Two genes, PKP2 and SLC2A13, were moderately associated in the discovery families and showed at least suggestive evidence for replication in NOMAS (P < 0.10), although they did not replicate in the replication families. Details of the nonsynonymous RVs in PKP2, SLC2A13, and NELL2 are shown in Table 5. Within the discovery families, six nonsynonymous variants in PKP2 were observed, all of which were missense variants. These variants were carried by individuals within five families: F2783, F3561, F3719, F5103, and F5275. In families F2783 and F3719, carriers of a rare PKP2 allele had higher average LVM residual compared to noncarriers. In the remaining families, carriers of a rare PKP2 allele had lower average LVM residual compared to noncarriers (Figure 2). When examining each missense variant individually, we observed that carriers of the rare allele of rs143004808, rs146882581, rs151264959, and rs62001016 had higher average LVM residual compared to noncarriers. In contrast, carriers of the rare allele of rs200069860 and rs75909145 had lower average LVM residual compared to noncarriers (Figure 3).
Table 5

Annotations for nonsynonymous RVs in PKP2, SLC2A13, and NELL2

MAF
GeneVariantPositionaMinor/Major AlleleDiscovery Families with VariantDiscovery FamiliesReplication FamiliesNOMASFunctionCADD ScoreGERP ScoreAmino AcidsPolyPhenb
PKP2rs2000698603,3030,850A/TF35610.008Missense18.045.37GLY/CYSProbably-damaging
PKP2rs15126495932,949,047T/CF27830.0030.0077.6 × 10−4Missense17.765.06ASP/ASNProbably-damaging
PKP2rs14688258132,994,073A/GF51030.0030.0070.008Missense3.332.01THR/METBenign
PKP2rs6200101633,031,023A/GF5103, F52750.020.010.01Missense1.723.84ALA/VALBenign
PKP2rs7590914533,049,457A/CF3561, F52750.03Missense14.023.17SER/ILEBenign
PKP2rs14300480833,049,590T/CF37190.0080.005Missense33.004.07ASP/ASNProbably-damaging
PKP2rs11259285532,949,140C/T0.0030.002Missense12.395.06THR/ALABenign
PKP2rs14085201932,974,348C/T7.6 × 10−4Missense12.982.55ASN/SERBenign
PKP2rs13915946432,996,248T/C7.6 × 10−4Splice site6.181.10Unknown
PKP2rs20180391833,030,840A/G7.6 × 10−4Missense11.255.37ALA/VALBenign
PKP2rs14954239833,031,888T/C7.6 × 10−4Missense15.640.71ARG/HISBenign
PKP212_3297542132,975,421A/G0.01Nonsense39.004.16ARG/StopUnknown
PKP212_3302196833,021,968A/G0.005Nonsense38.002.05ARG/StopUnknown
PKP2rs14610224132,977,026T/C0.007Missense23.605.32VAL/ILEProbably-damaging
PKP2rs13973432832,949,101T/G0.007Missense17.805.06ARG/SERBenign
SLC2A13rs13951886340,499,594T/CF37190.0050.0200.01Missense19.664.00SER/ASNPossibly-damaging
SLC2A13rs18634112740,499,132A/GF51030.0080.0100.01Missense7.551.54ALA/VALBenign
SLC2A13rs14602055140,265,659G/A0.020.002Missense12.444.57VAL/ALABenign
NELL2rs36771274244,902,736C/TF51030.01Missense11.995.25GLN/ARGPossibly-damaging
NELL212_4505935645,059,356T/CF60810.003Missense9.884.38ARG/HISProbably-damaging
NELL2rs14473038545,105,152T/CF37190.010.002Missense6.041.40SER/ASNBenign
NELL2rs1757483945,108,480C/TF5103, F52750.03Missense9.574.62ASN/ASPBenign
NELL2rs20165298245,171,085T/CF60810.003Missense32.005.62ASP/ASNProbably-damaging
NELL2rs37252234145,269,034C/TF60810.006Missense16.525.14ASN/SERPossibly-damaging
NELL2rs265897345,269,640T/CF2235, F60810.0080.050.02Missense13.583.08VAL/ILEBenign
NELL2rs13845472945,059,310C/G0.0030.0008Missense18.804.39ILE/METPossibly-damaging
NELL212_4492637244,926,372A/G0.0008Missense27.505.72SER/LEUProbably-damaging
NELL2rs14693671744,915,791T/G0.002Missense17.794.71ARG/SERProbably-damaging

Position based on hg19.

PPH HumanDiv.

Figure 2

Box and whisker plots showing the distribution of LVM residual by PKP2 nonsynonymous rare variant carrier status, stratified by family. LVM residuals were calculated by adjusting for sex, body mass index (BMI), systolic blood pressure, and smoking status. Carriers of a PKP2 nonsynonymous rare variant are shown in gray and noncarriers are shown in white.

Figure 3

Box and whisker plots showing the distribution of LVM residual within the 10 sequenced families by PKP2 nonsynonymous rare variant carrier status, stratified by variant. LVM residuals were calculated by adjusting for sex, body mass index (BMI), systolic blood pressure, and smoking status. Carriers of a rare allele of a PKP2 variant are shown in gray and noncarriers are shown in white.

Position based on hg19. PPH HumanDiv. Box and whisker plots showing the distribution of LVM residual by PKP2 nonsynonymous rare variant carrier status, stratified by family. LVM residuals were calculated by adjusting for sex, body mass index (BMI), systolic blood pressure, and smoking status. Carriers of a PKP2 nonsynonymous rare variant are shown in gray and noncarriers are shown in white. Box and whisker plots showing the distribution of LVM residual within the 10 sequenced families by PKP2 nonsynonymous rare variant carrier status, stratified by variant. LVM residuals were calculated by adjusting for sex, body mass index (BMI), systolic blood pressure, and smoking status. Carriers of a rare allele of a PKP2 variant are shown in gray and noncarriers are shown in white. Since our most strongly associated CV was a missense variant in PKP2, we performed an additional gene-based analysis on all nonsynonymous variants in PKP2 (common and rare) which resulted in a slightly stronger association in the discovery families (P = 0.04, seven variants) and no association in NOMAS (P = 0.41, 12 variants) or the replication families (P = 0.98, seven variants).

Candidate variant and gene association results

We additionally tested whether candidate CVs identified in our previous study of LVM (Della-Morte ) (n = 33 CVs in 14 genes), a study which primarily investigated CV associations with LVM in all Caribbean Hispanics, were associated in our current study which focused specifically on Dominicans. Association results for these CVs are summarized in Table S3. Eleven of the previously reported CVs, located in SLC38A1, BICD1, RACGAP1, ARID2, FAR2, and c12orf68, were found in our discovery families. These CVs included three intronic variants in SLC38A1 which showed suggestive evidence for association in the discovery families and NOMAS, although the direction of effect differed (rs11183394, rs6582621, rs7133522). One other CV in SLC38A1, rs7956629, was moderately associated in the discovery families (P = 0.02), but did not replicate in NOMAS (P = 0.29). No other CVs were associated with LVM in the families. We also performed gene-based RV analyses on 14 genes containing the previously reported candidate CVs. Results are summarized in Table S4. Exonic RV analyses identified PDZRN4 (P = 0.03) and ARID2 (P = 0.04) to be moderately associated in the discovery families, although neither replicated in NOMAS or the replication families (P > 0.05). One additional gene, c12orf68, was moderately associated in the discovery families (P = 0.03); however, replication was not possible as no variants met our inclusion criteria in either replication sample. In nonsynonymous RV analyses, SLC2A13 was the most strongly associated gene and was a top gene in our peak-wide nonsynonymous RV analyses.

Discussion

Building upon our previous studies which identified a region on 12p11 to be linked with LVM in Dominicans, we performed targeted resequencing in 10 extended families with evidence for linkage to refine the region and identify LVM susceptibility genes. Through these resequencing efforts, we found suggestive evidence for both common and rare variants within 12p11 to influence LVM in Dominican families. Common variant analyses revealed rs1046116, a missense variant in PKP2, to be the most strongly associated variant with evidence for replication in our population-based sample of Dominicans. When investigating this association further by testing rs1046116 for association in our 12 additional families with suggestive evidence for linkage to the region, we observed no association; however, these families generally showed weaker evidence for linkage to this region than the 10 families sequenced for our discovery analyses. Further, although this association could not be replicated in our replication families, linkage analyses in our combined family sample revealed that rs1046116 significantly contributed to the original linkage signal as evidenced by a decrease in LOD score from 9.09 to 8.30 in these families. This variant was associated with decreased LVM, suggesting that the G allele of rs1046116 may be protective against the development of left ventricular hypertrophy. Interestingly, we also found suggestive evidence that RVs within PKP2 influence LVM. In nonsynonymous RV analyses, PKP2 was one of the most strongly associated genes with suggestive evidence for replication, thus suggesting that both common and rare nonsynonymous variants, particularly missense variants, may influence LVM. Indeed, when a gene-based analysis was performed on all nonsynonymous variants, common and rare, the association between PKP2 and LVM became slightly stronger in the families, although the association diminished in NOMAS. To explore the associations underlying PKP2 further, we performed an in-depth characterization of the individual rare nonsynonymous variants within this gene. Three missense variants were predicted to be probably-damaging by PolyPhen and showed strong evidence of being conserved (rs200069860, rs151264959, rs143004808). Of particular interest was rs143004808, which was observed in F3719 and had a Combined Annotation Dependent Depletion (CADD) score of 33, indicating that this variant is in the top 0.1% of variants with respect to deleteriousness. Two nonsense variants (located at base pair positions 32,975,421 and 33,021,968) also had similar CADD scores and were observed in the NOMAS sample. Missense variants in PKP2, including several of the ones identified in our study, have been previously implicated in cardiac phenotypes (Cerrone ; Peters 2014; van der Zwaag ; Saguner ), making PKP2 an excellent candidate gene for LVM. PKP2 encodes plakophilin 2, a member of one of three major protein families found in desmosomes (Getsios ). Desmosomes are protein structures in cell membranes that maintain adhesion between neighboring cells and serve as anchoring sites for intermediate filaments. They are found in tissues that experience mechanical stress, including the myocardium (Getsios ; Desai ). Mutations in PKP2 are known to play a role in arrythmogenic cardiomyopathies (AC), most notably arrythmogenic right ventricular dysplasia/cardiomyopathy (ARD/C) (Gerull ; Fernandez-Rosado ; Groeneweg ; Trenkwalder ). Several studies have shown that PKP2 haploinsufficiency contributes to pathogenesis in AC (Joshi-Mukherjee ; Hall ; Kirchner ; Rasmussen ). In characterizing the clinical course of ARD/C, one study found that 9% of ARD/C patients with a PKP2 mutation had left ventricular dysfunction and patients carrying more than one mutation were three times more likely to have left ventricular dysfunction compared to patients with one mutation (Bhonsale ). Studies have also provided further evidence suggesting that PKP2 mutations may impact left ventricular cardiomyopathies (Horimoto ; Hamid ; Sen-Chowdhry ; Saguner ), including a recent case report that identified a pathogenic PKP2 deletion in two siblings with left ventricular noncompaction cardiomyopathy (Ramond ). Together, these studies support the biologic plausibility for a role of PKP2 in LVM. In addition to PKP2, our analyses revealed nonsynonymous variants within SLC2A13 to be suggestively associated in both our discovery families and the NOMAS sample. Included in our nonsynonymous analyses were two missense variants observed in both the families and NOMAS (rs139518863 and rs186341127), as well as one missense variant observed in NOMAS only (rs146020551). Rs139518863 has strong evidence of being evolutionarily conserved (Genomic Evolutionary Rate Profiling (GERP) score = 4), is predicted to be possibly-damaging according to PolyPhen, and has a CADD score of 19.66 indicating that this variant is in the top 10–1% of all variants with respect to deleteriousness. Similar characteristics were observed for rs146020551. Interestingly, our previous association study found CVs in SLC2A13 to be associated with LVM in Caribbean Hispanics (Della-Morte ). These findings, in combination with results from our current study, suggest that variants in SLC2A13 may be involved in LVM in Dominicans and, more broadly, Caribbean Hispanics. Additional candidate genes identified through our resequencing study were GXYLT1 and NELL2. GXYLT1 was identified in our exonic RV analyses and encodes an enzyme that adds xylose to O-glucose residues bound to epidermal growth factor repeats of Notch proteins (Sethi ). Notch signaling plays a role in cardiac development and mutations within Notch signaling genes have been associated with cardiac structural abnormalities including left ventricular outflow tract abnormalities, making GXYLT1 an excellent candidate gene for LVM (Nemir and Pedrazzini 2008; Padang ; Penton ). NELL2 was one of our most strongly associated genes in RV analyses and encodes the neural epidermal growth factor-like 2 protein which is largely expressed in brain but is also expressed in hematopoietic cells (Luce and Burrows 1999). Its role in LVM is unknown. When evaluating the results from our study, there are several limitations which should be noted. First, only a subset of families was sequenced; however, these were the families with the strongest evidence for linkage to the 12p11 region. Second, sequencing was performed primarily on exons, therefore missing noncoding regions affecting LVM. However, for candidate genes previously implicated in LVM, sequencing beyond the exons was performed to allow for identification of noncoding variants. Third, the RV replication analyses were performed using Exome Array data, thereby limiting our analyses to only those variants included on the Exome Array. However, variants on the Exome Array were selected to be functional (primarily missense variants) and had to be observed at least three times in at least two people (http://genome.sph.umich.edu/wiki/Exome_Chip_Design), and we added custom content within the 12p11 region to our Exome Array. Fourth, our study was limited to individuals from the Dominican Republic and may not be generalizable to other Hispanic or non-Hispanic populations. Fifth, due to our sample sizes, we had limited power to detect RVs in our study. In conclusion, our current targeted resequencing study, in combination with our previous studies, shows evidence of a role for both common and rare variants within the 12p11 region in LVM pathogenesis, particularly missense variants within PKP2. Functional studies are needed to elucidate the mechanism underlying the association of the implicated genes with LVM.

Supplementary Material

Supplemental material is available online at www.g3journal.org/lookup/suppl/doi:10.1534/g3.117.300358/-/DC1. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  56 in total

1.  Homocysteine and the risk of ischemic stroke in a triethnic cohort: the NOrthern MAnhattan Study.

Authors:  Ralph L Sacco; Kishlay Anand; Hye-Seung Lee; Bernadette Boden-Albala; Sally Stabler; Robert Allen; Myunghee C Paik
Journal:  Stroke       Date:  2004-09-02       Impact factor: 7.914

2.  Heritability of left ventricular mass in a large cohort of twins.

Authors:  Pankaj Sharma; Rita P S Middelberg; Toby Andrew; Michael R Johnson; Howard Christley; Morris J Brown
Journal:  J Hypertens       Date:  2006-02       Impact factor: 4.844

3.  A genetic variants database for arrhythmogenic right ventricular dysplasia/cardiomyopathy.

Authors:  Paul A van der Zwaag; Jan D H Jongbloed; Maarten P van den Berg; Jasper J van der Smagt; Roselie Jongbloed; Hennie Bikker; Robert M W Hofstra; J Peter van Tintelen
Journal:  Hum Mutat       Date:  2009-09       Impact factor: 4.878

4.  Clinical Presentation, Long-Term Follow-Up, and Outcomes of 1001 Arrhythmogenic Right Ventricular Dysplasia/Cardiomyopathy Patients and Family Members.

Authors:  Judith A Groeneweg; Aditya Bhonsale; Cynthia A James; Anneline S te Riele; Dennis Dooijes; Crystal Tichnell; Brittney Murray; Ans C P Wiesfeld; Abhishek C Sawant; Bina Kassamali; Douwe E Atsma; Paul G Volders; Natasja M de Groot; Karin de Boer; Stefan L Zimmerman; Ihab R Kamel; Jeroen F van der Heijden; Stuart D Russell; Maarten Jan Cramer; Ryan J Tedford; Pieter A Doevendans; Toon A van Veen; Harikrishna Tandri; Arthur A Wilde; Daniel P Judge; J Peter van Tintelen; Richard N Hauer; Hugh Calkins
Journal:  Circ Cardiovasc Genet       Date:  2015-03-27

5.  Molecular insights into arrhythmogenic right ventricular cardiomyopathy caused by plakophilin-2 missense mutations.

Authors:  Florian Kirchner; Anja Schuetz; Leif-Hendrik Boldt; Kristina Martens; Gunnar Dittmar; Wilhelm Haverkamp; Ludwig Thierfelder; Udo Heinemann; Brenda Gerull
Journal:  Circ Cardiovasc Genet       Date:  2012-07-09

6.  Impact of genotype on clinical course in arrhythmogenic right ventricular dysplasia/cardiomyopathy-associated mutation carriers.

Authors:  Aditya Bhonsale; Judith A Groeneweg; Cynthia A James; Dennis Dooijes; Crystal Tichnell; Jan D H Jongbloed; Brittney Murray; Anneline S J M te Riele; Maarten P van den Berg; Hennie Bikker; Douwe E Atsma; Natasja M de Groot; Arjan C Houweling; Jeroen F van der Heijden; Stuart D Russell; Pieter A Doevendans; Toon A van Veen; Harikrishna Tandri; Arthur A Wilde; Daniel P Judge; J Peter van Tintelen; Hugh Calkins; Richard N Hauer
Journal:  Eur Heart J       Date:  2015-01-23       Impact factor: 29.983

7.  Left ventricular mass and geometry and the risk of ischemic stroke.

Authors:  Marco R Di Tullio; Donna R Zwas; Ralph L Sacco; Robert R Sciacca; Shunichi Homma
Journal:  Stroke       Date:  2003-09-04       Impact factor: 7.914

8.  Missense mutations in plakophilin-2 cause sodium current deficit and associate with a Brugada syndrome phenotype.

Authors:  Marina Cerrone; Xianming Lin; Mingliang Zhang; Esperanza Agullo-Pascual; Anna Pfenniger; Halina Chkourko Gusky; Valeria Novelli; Changsung Kim; Tiara Tirasawadichai; Daniel P Judge; Eli Rothenberg; Huei-Sheng Vincent Chen; Carlo Napolitano; Silvia G Priori; Mario Delmar
Journal:  Circulation       Date:  2013-12-18       Impact factor: 29.690

9.  Moderate alcohol consumption reduces risk of ischemic stroke: the Northern Manhattan Study.

Authors:  Mitchell S V Elkind; Robert Sciacca; Bernadette Boden-Albala; Tanja Rundek; Myunghee C Paik; Ralph L Sacco
Journal:  Stroke       Date:  2005-11-23       Impact factor: 7.914

10.  Genome-wide association of echocardiographic dimensions, brachial artery endothelial function and treadmill exercise responses in the Framingham Heart Study.

Authors:  Ramachandran S Vasan; Martin G Larson; Jayashri Aragam; Thomas J Wang; Gary F Mitchell; Sekar Kathiresan; Christopher Newton-Cheh; Joseph A Vita; Michelle J Keyes; Christopher J O'Donnell; Daniel Levy; Emelia J Benjamin
Journal:  BMC Med Genet       Date:  2007-09-19       Impact factor: 2.103

View more
  2 in total

Review 1.  Genetic approaches toward understanding the individual variation in cardiac structure, function and responses to exercise training.

Authors:  Minsun Kim; Seung Kyum Kim
Journal:  Korean J Physiol Pharmacol       Date:  2021-01-01       Impact factor: 2.016

2.  Rare variants in previously identified linkage regions associated with carotid plaque in Dominican Republic families.

Authors:  Nicole D Dueker; Ashley Beecham; Liyong Wang; Chuanhui Dong; Ralph L Sacco; Susan H Blanton; Tatjana Rundek
Journal:  PLoS One       Date:  2022-01-12       Impact factor: 3.240

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.