Literature DB >> 19252134

Genome-wide association scan for diabetic nephropathy susceptibility genes in type 1 diabetes.

Marcus G Pezzolesi1, G David Poznik, Josyf C Mychaleckyj, Andrew D Paterson, Michelle T Barati, Jon B Klein, Daniel P K Ng, Grzegorz Placha, Luis H Canani, Jacek Bochenski, Daryl Waggott, Michael L Merchant, Bozena Krolewski, Lucia Mirea, Krzysztof Wanic, Pisut Katavetin, Masahiko Kure, Pawel Wolkow, Jonathon S Dunn, Adam Smiles, William H Walker, Andrew P Boright, Shelley B Bull, Alessandro Doria, John J Rogus, Stephen S Rich, James H Warram, Andrzej S Krolewski.   

Abstract

OBJECTIVE: Despite extensive evidence for genetic susceptibility to diabetic nephropathy, the identification of susceptibility genes and their variants has had limited success. To search for genes that contribute to diabetic nephropathy, a genome-wide association scan was implemented on the Genetics of Kidneys in Diabetes collection. RESEARCH DESIGN AND METHODS: We genotyped approximately 360,000 single nucleotide polymorphisms (SNPs) in 820 case subjects (284 with proteinuria and 536 with end-stage renal disease) and 885 control subjects with type 1 diabetes. Confirmation of implicated SNPs was sought in 1,304 participants of the Diabetes Control and Complications Trial (DCCT)/Epidemiology of Diabetes Interventions and Complications (EDIC) study, a long-term, prospective investigation of the development of diabetes-associated complications.
RESULTS: A total of 13 SNPs located in four genomic loci were associated with diabetic nephropathy with P < 1 x 10(-5). The strongest association was at the FRMD3 (4.1 protein ezrin, radixin, moesin [FERM] domain containing 3) locus (odds ratio [OR] = 1.45, P = 5.0 x 10(-7)). A strong association was also identified at the CARS (cysteinyl-tRNA synthetase) locus (OR = 1.36, P = 3.1 x 10(-6)). Associations between both loci and time to onset of diabetic nephropathy were supported in the DCCT/EDIC study (hazard ratio [HR] = 1.33, P = 0.02, and HR = 1.32, P = 0.01, respectively). We demonstratedexpression of both FRMD3 and CARS in human kidney.
CONCLUSIONS: We identified genetic associations for susceptibility to diabetic nephropathy at two novel candidate loci near the FRMD3 and CARS genes. Their identification implicates previously unsuspected pathways in the pathogenesis of this important late complication of type 1 diabetes.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19252134      PMCID: PMC2682673          DOI: 10.2337/db08-1514

Source DB:  PubMed          Journal:  Diabetes        ISSN: 0012-1797            Impact factor:   9.461


Diabetic nephropathy is the leading contributor to end-stage renal disease (ESRD) in the U.S. (1). Clinically, diabetic nephropathy is manifest as a progressive disease process that advances through characteristic stages. It begins with microalbuminuria (leakage of small amounts of albumin into the urine) and progresses to overt proteinuria. In a large proportion of these patients, renal function declines and continues to deteriorate until ESRD is reached, and replacement therapy is indicated (2–4). Overall, ESRD develops in ∼20% of all patients with type 1 diabetes (5,6). Despite evidence that genetic susceptibility plays a role in the development of diabetic nephropathy in type 1 diabetes (7–9), success in identifying the responsible genetic variants has been limited (10,11). This has been attributable, in part, to the small size of the DNA collections available to individual research groups and the narrow focus of the searches on candidate genes. Another challenge that has received little attention in previous studies is the possibility that successive stages of diabetic nephropathy are influenced by different genetic factors (12,13). To conduct a statistically robust study that provides genome-wide coverage for detection of common variants that may have relatively small, but pathogenically significant, effect on risk of diabetic nephropathy in type 1 diabetes, the Genetics of Kidneys in Diabetes (GoKinD) collection was established (14). A genome-wide scan of this collection was supported by the Genetic Association Information Network (GAIN) initiative (15). This report presents 1) results of this genome-wide association scan in the GoKinD collection, 2) replication of the significant associations in this scan with time to onset of diabetes-associated complications (severe nephropathy) in theDiabetes Control and Complications Trial (DCCT)/Epidemiology of Diabetes Interventions and Complications (EDIC) study, and 3) characterization of expression of the identified candidate diabetic nephropathy genes in normal human cell lines.

RESEARCH DESIGN AND METHODS

Subjects for the GoKinD collection were recruited through two centers with different methods of ascertainment and recruitment (14). The George Washington University (GWU) Biostatistics Center coordinated the recruitment of volunteers (through mass media advertisement) living throughout the U.S. (excluding New England) and Canada to 1 of 27 clinical centers located across the U.S. and Canada. The Section of Genetics and Epidemiology at the Joslin Diabetes Center (JDC) recruited and examined patients of the Joslin Clinic from New England who were already enrolled in the Joslin Kidney Study on the Genetics of Diabetic Nephropathy, a clinic-based cohort study in which case subjects with diabetic nephropathy and a random sampling of eligible control subjects were identified and recruited (16). A detailed description of the GoKinD collection has been published (14). Briefly, subjects enrolled in GoKinD had type 1 diabetes diagnosed before age 31, began insulin treatment within 1 year of their diagnosis, and were between 18 and 59 years of age at the time of enrollment. Participation in the DCCT/EDIC study was an exclusion criterion so that the two study populations would be independent. Case subjects with diabetic nephropathy had either persistent proteinuria, defined by a urinary albumin-to-creatinine ratio ≥300 μg/mg in two of the last three measurements taken at least 1 month apart, or ESRD (dialysis or renal transplant). Control subjects had type 1 diabetes for at least 15 years and normoalbuminuria, defined by an albumin-to-creatinine ratio <20 μg/mg in two of the last three measurements taken at least 1 month apart (if a third measurement was required, a value <40 μg/mg was necessary for inclusion), without ever having been treated with ACE inhibitors or angiotensin receptor blockers, and they were not being treated with antihypertensive medication at the time of recruitment into the study. For additional information regarding the definition of case and control subjects used in this analysis, refer to the report by Mueller et al. (14). In total, 1,879 subjects (935 case and 944 control subjects) were recruited into GoKinD. The GWU panel included 437 case subjects with diabetic nephropathy (58 with proteinuria and 379 with ESRD) and 446 control subjects; the JDC panel included 498 case subjects with diabetic nephropathy (268 with proteinuria and 230 with ESRD) and 498 control subjects. Further details are also provided in the supplementary information, which is available in an online appendix at http://diabetes.diabetesjournals.org/cgi/content/full/db08-1514/DC1. Confirmation of our findings in the GoKinD collection was sought in genome-wide association data from the DCCT/EDIC study, a long-term, prospective investigation of the development of diabetes-associated complications (17,18). Of the original DCCT cohort recruited between 1983 and 1989, 1,375 subjects (95%) were retained in the EDIC follow-up study. Participants in EDIC underwent baseline examinations between 1994 and 1995 and have since participated in annual follow-up examinations to assess the development or progression of complications. As of EDIC year 12 (2005), this cohort had 16–22 years of follow-up, and 132 cases of severe nephropathy (proteinuria or ESRD) had been documented in 1,304 Caucasian DCCT/EDIC participants. This phenotype is the closest to the phenotype used in the GoKinD collection. Detailed clinical characteristics of the DCCT/EDIC study have been published (13,17,18). Additional details are also provided in the supplementary information.

Genotyping.

The GoKinD collection was genotyped on an Affymetrix 5.0 500K single nucleotide polymorphism (SNP) array by the GAIN genotyping laboratory at the Eli and Edythe L. Broad Institute (Cambridge, MA). A description of study genotyping is available in the supplementary information. Additionally, two SNPs, rs39075 and rs1888747, were genotyped in the GoKinD collection using TaqMan (Applied Biosystems, Foster City, CA) technology by the genetics core of the Diabetes and Endocrinology Research Center at the JDC in accordance with the manufacturer's protocols. DNA samples used for the genotyping of these SNPs in the GoKinD collection were obtained through the National Institute of Diabetes and Digestive and Kidney Diseases central repository (www.niddkrepository.org).

SNP quality control.

After internal quality control, the GAIN genotyping laboratory released genotypes for 467,144 SNPs. Several quality control metrics, including filters for minor allele frequency < 0.01, rejection of Hardy-Weinberg assumptions (P≤ 10−5), and differential rates of missing data (by case/control status) were applied to these data. After reconciliation of SNPs eliminated by these analyses, the resulting data contained 359,193 autosomal SNPs. More details are available in the supplementary information and supplementary Table 1.

Sample quality control and SNP imputation.

The application of quality control criteria reduced the GoKinD population from 1,879 to 1,705 individuals of European ancestry. Samples from the two GoKinD panels that constitute this sample are 379 GWU case subjects (49 with proteinuria and 330 with ESRD), 413 GWU control subjects, 441 JDC case subjects (235 with proteinuria and 206 with ESRD), and 472 JDC control subjects. The measured genotypes for these individuals were augmented by imputation of ungenotyped SNPs across two mega–base pair regions flanking each of the lead SNPs in the GoKinD collection (where linkage disequilibrium, as measured by r2, decayed to <0.20 for all lead SNPs). A total of 8,245 imputed HapMap SNPs across these four loci were included in our association analysis. Further details of sample quality control and imputation procedures are available in the supplementary information and supplementary Table 2.

Statistical analysis.

GoKinD samples were collected under two separate ascertainment protocols (JDC and GWU panels) so that tests for latent residual stratification were performed. Cochran-Armitage tests of trend for the JDC versus GWU control subjects, and JDC versus GWU case subjects, revealed an overdispersion in the test statistics for both control and case subjects compared with the complete null. The median genomic control parameters were estimated at λ = 1.13 for control subjects and λ = 1.097 for case subjects. Permutation analysis within the control subjects and within the case subjects resulted in a stratification significance of P < 10−3 for both case and control subjects. Therefore, the primary association analysis used in the study was a stratified test of association combining case-control tests of allele frequencies in JDC and GWU strata. Combined P values and odds ratios (ORs) were calculated using a Cochran-Mantel-Haenszel procedure. Homogeneity across strata was assessed using the Breslow-Day statistic. All genome-wide statistical association analyses were performed using PLINK and R (19). Further details of quality control procedures, software, statistical analysis and adjustments, and cluster plots are available in the supplementary information. Data from our analysis of the GoKinD collection are available for specific SNPs and/or genes upon request.

Genotyping and analysis of the DCCT/EDIC data.

Genotypes of the DCCT/EDIC study participants were generated with the Illumina Human1M Beadchip. Briefly, quality control measures resulted in 840,354 SNPs suitable for statistical analysis. Population substructure was assessed to ensure that all included samples were of European ancestry. Multivariate Cox proportional hazard analyses were performed on data from 1,304 Caucasian subjects using time to onset of severe nephropathy, defined by an albumin excretion rate (AER) >300 mg/24 h on at least two consecutive examinations or dialysis/renal transplant with prior persistent microalbuminuria (two consecutive AERs >30 mg/24 h) as the outcome phenotype (n = 132). Among those with severe nephropathy, 116 subjects developed only proteinuria (AER >300 mg/24 h), whereas 16 progressed to ESRD. The DCCT cohort (primary prevention versus secondary intervention), treatment (intensive versus conventional), and interaction between cohort and treatment were used as covariates in the analysis of the effect of an independent additive SNP genetic factor. This model was examined for all associated loci in GoKinD and subsequently tested for both statistical significance and the same direction of effect for associated alleles (20–22).

Gene expression.

The expression of candidate genes was examined in four primary human cell lines derived from cells that have been implicated in the pathogenesis of kidney complications (endothelial cells from the iliac artery, adult dermal fibroblasts, mesangial cells, and epithelial cells from proximal tubules) by quantitative real-time PCR. Sources of these cells, cell culture conditions, and protocols used in these experiments are available in the supplementary information.

RESULTS

Genome-wide association scans for genes associated with diabetic nephropathy in type 1 diabetes.

The application of metrics for SNP and sample quality resulted in the analysis of 359,193 autosomal SNPs and 1,705 GoKinD samples of European ancestry (885 control subjects and 820 case subjects) (see research design and methods and the supplementary information). Clinical characteristics of the JDC and GWU panels are summarized in Table 1. Because different ascertainment protocols were used by the JDC and GWU, the resulting data were found to exhibit significant stratification. As a result, the primary association analyses were conducted using a stratified test of association.
TABLE 1

Baseline clinical characteristics of the GoKinD collection

JDC GoKinD control subjectsJDC GoKinD case subjectsGWU GoKinD control subjectsGWU GoKinD case subjects
n472441413379
Men/women220/252233/208143/270190/189
Age at diagnosis of type 1 diabetes (years)12.9 ± 7.511.5 ± 6.712.9 ± 7.212.1 ± 6.6
Duration of type 1 diabetes (years)*26.4 ± 8.027.0 ± 7.824.2 ± 7.325.9 ± 7.6
Age at examination (years)39.4 ± 6.542.0 ± 7.137.1 ± 8.844.4 ± 6.5
Laser treatment (%)22841085
A1C(%)7.5 ± 1.78.5 ± 1.67.4 ± 1.17.8 ± 1.4
ACR in control subjects (μg/mg)7.0 ± 3.76.3 ± 3.6
Case subjects with proteinuria/ESRD235/20649/330
ACR in subjects with proteinuria (μg/mg)1,571 ± 1,5341,282 ± 1,096
ESRD duration (years)7.5 ± 5.47.2 ± 5.6
Kidney transplant (%)8695

Clinical characteristics are the mean values ± SD for all Caucasian patients (n = 1,705) included in the current analysis after the application of quality control metrics (see supplemental information regarding sample quality control analysis and population substructure and ancestry analysis).

*The duration of type 1 diabetes in control subjects and in subjects with proteinuria is based on the duration at examination. Among ESRD case subjects, this is based on the duration of type 1 diabetes at the onset of ESRD. All other clinical characteristics are based on measurements performed at examination.

†Mean A1C values do not include data from case subjects that have undergone pancreas transplantation (11% of JDC case subjects and 58% of GWU case subjects);

‡percentages are of ESRD group.

Baseline clinical characteristics of the GoKinD collection Clinical characteristics are the mean values ± SD for all Caucasian patients (n = 1,705) included in the current analysis after the application of quality control metrics (see supplemental information regarding sample quality control analysis and population substructure and ancestry analysis). *The duration of type 1 diabetes in control subjects and in subjects with proteinuria is based on the duration at examination. Among ESRD case subjects, this is based on the duration of type 1 diabetes at the onset of ESRD. All other clinical characteristics are based on measurements performed at examination. †Mean A1C values do not include data from case subjects that have undergone pancreas transplantation (11% of JDC case subjects and 58% of GWU case subjects); ‡percentages are of ESRD group. Although no SNP achieved genome-wide significance (0.05/359,193 = 1.4 × 10−7), the primary association analysis identified 11 SNPs representing four distinct chromosomal regions with P < 1 × 10−5 (Fig. 1 and Table 2), which were considered for replication. The strongest association with diabetic nephropathy occurred on chromosome 9q with rs10868025 (OR = 1.45, P = 5.0 × 10−7). This SNP is located near the 5′ end of the 4.1 protein ezrin, radixin, moesin (FERM) domain–containing 3 (FRMD3) gene.
FIG. 1.

Summary of genome-wide association scan results in the GoKinD collection. The –log10 P values calculated using the Cochran-Mantel-Haenszel method (adjusting for sex and GoKinD subcollection [JDC/GWU]) across the entire genome are shown for the combined GoKinD collection. The horizontal dashed line corresponds to a –log10 P value = 5.0 (P = 1 × 10−5). SNPs shown in green (n = 11) exceed this threshold (because of the resolution of this image, some of the SNPs located on chromosome 13 [n = 7] appear indistinguishable).

TABLE 2

Summary of SNPs associated with diabetic nephropathy in the GoKinD collection

Locus
Risk allele frequencies and P values for control and case subjects by panel
P values and ORs for combined analysis
GWU GoKinD
JDC GoKinD
SNPChromosomePosition (Mb)Nearest gene(s)Risk allele (non–risk allele)Control subjectsCase subjectsPControl subjectsCase subjectsPPOR (95% CI)
n413379472441
rs390597p29.2CPVL/CHN2A(G)0.610.698.8 × 10−40.600.671.7 × 10−35.0 × 10−61.39 (1.20–1.61)
rs390757p29.2CPVL/CHN2G(A)0.570.662.0 × 10−40.570.648.2 × 10−46.5 × 10−71.43 (1.23–1.64)
rs18887479q85.3FRMD3G(C)0.680.733.6 × 10−30.660.744.4 × 10−56.3 × 10−71.45 (1.25–1.67)
rs108680259q85.4FRMD3A(G)0.590.661.9 × 10−30.560.667.2 × 10−55.0 × 10−71.45 (1.25–1.67)
rs73940111p3.0CARSC(T)0.460.544.7 × 10−40.490.553.6 × 10−36.4 × 10−61.36 (1.19–1.56)
rs45104111p3.0CARSA(G)0.460.546.9 × 10−40.480.561.3 × 10−33.1 × 10−61.36 (1.19–1.56)
rs104146613q109.0No geneG(A)0.390.473.6 × 10−30.430.512.7 × 10−43.2 × 10−61.38 (1.20–1.58)
rs1411766/rs1741285813q109.1No geneA(G)/G(A)0.310.398.5 × 10−40.320.406.4 × 10−41.8 × 10−61.41 (1.23–1.63)
rs6492208/rs2391777§13q109.1No geneT(C)/G(A)0.550.628.7 × 10−30.560.651.9 × 10−46.1 × 10−61.37 (1.20–1.59)
rs798984813q109.1No geneA(G)0.490.562.0 × 10−30.500.571.1 × 10−37.0 × 10−61.37 (1.19–1.56)
rs952144513q109.1No geneA(C)0.470.542.1 × 10−30.470.554.2 × 10−42.9 × 10−61.38 (1.20–1.58)

The most strongly associated SNPs from the combined analysis of the GWU and JDC GoKinD panels are presented along with the risk allele frequencies and P values (calculated using the Cochran-Mantel-Haenszel method, adjusting for sex, between case and control subjects within each collection) for each separate collection. Combined P values and ORs were calculated using the Cochran-Mantel-Haenszel method. Chromosomal locations, SNP positions, and gene annotations are in reference to NCBI Build 36.1. A summary of the genotype frequencies for the most strongly associated SNPs in the GoKinD collection are presented in supplementary Table 3.

†rs39075 and rs1888747 were identified through imputation and genotyped using Taqman assays in the GoKinD collection;

‡rs1411766 and rs17412858 were both genotyped on the Affymetrix array and are in complete linkage disequilibrium (r2 = 1.0);

§rs6492208 and rs2391777 were both genotyped on the Affymetrix array and are in complete linkage disequilibrium (r2 = 1.0).

Summary of genome-wide association scan results in the GoKinD collection. The –log10 P values calculated using the Cochran-Mantel-Haenszel method (adjusting for sex and GoKinD subcollection [JDC/GWU]) across the entire genome are shown for the combined GoKinD collection. The horizontal dashed line corresponds to a –log10 P value = 5.0 (P = 1 × 10−5). SNPs shown in green (n = 11) exceed this threshold (because of the resolution of this image, some of the SNPs located on chromosome 13 [n = 7] appear indistinguishable). Summary of SNPs associated with diabetic nephropathy in the GoKinD collection The most strongly associated SNPs from the combined analysis of the GWU and JDC GoKinD panels are presented along with the risk allele frequencies and P values (calculated using the Cochran-Mantel-Haenszel method, adjusting for sex, between case and control subjects within each collection) for each separate collection. Combined P values and ORs were calculated using the Cochran-Mantel-Haenszel method. Chromosomal locations, SNP positions, and gene annotations are in reference to NCBI Build 36.1. A summary of the genotype frequencies for the most strongly associated SNPs in the GoKinD collection are presented in supplementary Table 3. rs39075 and rs1888747 were identified through imputation and genotyped using Taqman assays in the GoKinD collection; rs1411766 and rs17412858 were both genotyped on the Affymetrix array and are in complete linkage disequilibrium (r2 = 1.0); §rs6492208 and rs2391777 were both genotyped on the Affymetrix array and are in complete linkage disequilibrium (r2 = 1.0). Three additional genomic regions located on chromosomes 7p, 11p, and 13q were also associated with diabetic nephropathy. The rs39059 SNP (OR = 1.39, P = 5.0 × 10−6) localizes to the first intron of CHN2 (β-chimerin) isoform 2 and upstream of an alternatively spliced CPVL (serine carboxypeptidase vitellogenic-like) transcript on chromosome 7p. The rs451041 SNP (OR = 1.36, P = 3.1 × 10−6) is located on chromosome 11p in an intronic region of the CARS (cysteinyl-tRNA synthetase) gene. And, finally, the region bounded by rs1411766/rs1742858 (OR = 1.41, P = 1.8 × 10−6) is located in a 42 kb intergenic region on chromosome 13q. Analyses of the imputed SNPs in our lead loci identified 11 additional SNPs that were highly correlated withthe original associations (P < 1 × 10−5). Of these, two were more strongly associated with diabetic nephropathy than our lead genotyped SNPs. Imputed SNP rs1888747 (chromosome 9q), which is in partial linkage disequilibrium (r2 = 0.81) with rs10868025, was more strongly associated with diabetic nephropathy than the original SNP (P = 4.7 × 10−7) (Fig. 2B). Similarly, two imputed SNPs in the 7p region (rs39075 and rs39076) were also more strongly associated than the original SNP in that region (rs39059) (Fig. 2A). Both imputed SNPs were genotyped in the GoKinD samples, and the associations with the imputed data were confirmed (rs39075, P = 6.5 × 10−7; and rs1888747, P = 6.3 × 10−7) (Table 2).
FIG. 2.

Summary of genome-wide association results for the chromosome 7p, 9q, 11p, and 13q loci. A: Genome-wide association scan and imputed data for the chromosome 7p locus. rs39059 (solid red triangle) is located at position IVS1 + 21350 relative to exon 1 of CHN2 isoform 2 and is in tight linkage disequilibrium with rs39075 (r2 = 0.96), located at position IVS1 + 42572. rs39059 and rs39075 reside −69,318 and −90,540 kb, respectively, upstream of CPVL isoforms 1 and 2. A third alternate transcript (isoform 3) is predicted for CPVL and contains an exon that extends to intron 1 of CHN2. rs39059 and rs39075 are located at positions −20579 and −41801, respectively, relative to this transcript. ▲, SNPs genotyped on the Affymetrix array (n = 163); △, imputed SNPs (n = 694). *Imputed SNP rs39075 was genotyped in the GoKinD samples to confirm the imputation. B: Genome-wide association scan and imputed data for the chromosome 9q locus. A total of 100 genotyped SNPs from the Affymetrix array data and 450 imputed SNPs are shown. rs10868025 (solid red triangle) is located at position −10829 relative to FRMD3's transcription start site. rs10868025 is in complete linkage disequilibrium (r2 = 1.0) and only 253 bp from imputed SNP rs13289150 (△ superimposed on rs10868025). rs1888747, located at position −2204, is in partial linkage disequilibrium (r2 = 0.81) with rs10868025. *Imputed SNP rs1888747 was genotyped in the GoKinD samples to confirm the imputation. C: Genome-wide association scan and imputed data for the chromosome 11p locus. A total of 33 genotyped SNPs from the Affymetrix array data and 190 imputed SNPs are shown. rs739401 and rs451041 (solid red triangles) are in strong linkage disequilibrium (r2 = 0.97). rs739401 is located in intron 16 (isoforms a and c)/17 (isoforms b and d) of the CARS gene (position IVS16 + 687/IVS17 + 687). rs451041 is located in intron 4 (isoforms a and c)/5 (isoforms b and d), position IVS4 − 203/IVS5 − 203). D: Genome-wide association scan and imputed data for the chromosome 13q locus. A total of 68 genotyped SNPs from the Affymetrix array data and 268 imputed SNPs are shown. Seven lead SNPs (rs1041466, rs1411766, rs17412858, rs6492208, rs2391777, rs7989848, and rs9521445) from this region are indicated in red. rs1411766 and rs17412858 are in complete linkage disequilibrium (r2 = 1.0). Similarly, rs6492208 and rs2391777 are in complete linkage disequilibrium (r2 = 1.0). rs7989848 and rs9521445 are in strong linkage disequilibrium (r2 = 0.87), whereas only modest linkage disequilibrium exists between all other SNP pairs (r2 = 0.30–0.65). The two nearest genes are MYO16 and IRS2, located ∼384 kb centromeric and 120 kb telomeric of this region, respectively.

Summary of genome-wide association results for the chromosome 7p, 9q, 11p, and 13q loci. A: Genome-wide association scan and imputed data for the chromosome 7p locus. rs39059 (solid red triangle) is located at position IVS1 + 21350 relative to exon 1 of CHN2 isoform 2 and is in tight linkage disequilibrium with rs39075 (r2 = 0.96), located at position IVS1 + 42572. rs39059 and rs39075 reside −69,318 and −90,540 kb, respectively, upstream of CPVL isoforms 1 and 2. A third alternate transcript (isoform 3) is predicted for CPVL and contains an exon that extends to intron 1 of CHN2. rs39059 and rs39075 are located at positions −20579 and −41801, respectively, relative to this transcript. ▲, SNPs genotyped on the Affymetrix array (n = 163); △, imputed SNPs (n = 694). *Imputed SNP rs39075 was genotyped in the GoKinD samples to confirm the imputation. B: Genome-wide association scan and imputed data for the chromosome 9q locus. A total of 100 genotyped SNPs from the Affymetrix array data and 450 imputed SNPs are shown. rs10868025 (solid red triangle) is located at position −10829 relative to FRMD3's transcription start site. rs10868025 is in complete linkage disequilibrium (r2 = 1.0) and only 253 bp from imputed SNP rs13289150 (△ superimposed on rs10868025). rs1888747, located at position −2204, is in partial linkage disequilibrium (r2 = 0.81) with rs10868025. *Imputed SNP rs1888747 was genotyped in the GoKinD samples to confirm the imputation. C: Genome-wide association scan and imputed data for the chromosome 11p locus. A total of 33 genotyped SNPs from the Affymetrix array data and 190 imputed SNPs are shown. rs739401 and rs451041 (solid red triangles) are in strong linkage disequilibrium (r2 = 0.97). rs739401 is located in intron 16 (isoforms a and c)/17 (isoforms b and d) of the CARS gene (position IVS16 + 687/IVS17 + 687). rs451041 is located in intron 4 (isoforms a and c)/5 (isoforms b and d), position IVS4 − 203/IVS5 − 203). D: Genome-wide association scan and imputed data for the chromosome 13q locus. A total of 68 genotyped SNPs from the Affymetrix array data and 268 imputed SNPs are shown. Seven lead SNPs (rs1041466, rs1411766, rs17412858, rs6492208, rs2391777, rs7989848, and rs9521445) from this region are indicated in red. rs1411766 and rs17412858 are in complete linkage disequilibrium (r2 = 1.0). Similarly, rs6492208 and rs2391777 are in complete linkage disequilibrium (r2 = 1.0). rs7989848 and rs9521445 are in strong linkage disequilibrium (r2 = 0.87), whereas only modest linkage disequilibrium exists between all other SNP pairs (r2 = 0.30–0.65). The two nearest genes are MYO16 and IRS2, located ∼384 kb centromeric and 120 kb telomeric of this region, respectively. If the etiology of diabetic nephropathy involves the interaction of a locus with the cumulative effect of hyperglycemia, the association of the locus with diabetic nephropathy can vary according to diabetes duration at diabetic nephropathy onset, such that it is strongest in early-appearing case subjects and diminishes in later ones, even reversing in direction in very late-appearing case subjects (23). We examined the SNPs in Table 2 according to diabetes duration by stratifying case and control subjects across tertiles of diabetes duration (at the onset of ESRD or at enrollment into GoKinD for proteinuria patients and control subjects). The strength of the associations was consistent across these strata (data not shown). Additionally, if a locus influences mortality risk, the high mortality experienced by patients with ESRD would alter its association with diabetic nephropathy according to the duration of survival with ESRD and may mask the effect of a diabetic nephropathy risk allele or produce a false association. For this reason, we also analyzed the lead SNPs in Table 2 according to duration of ESRD. For each of these SNPs, the ORs were consistent across tertiles of ESRD duration (supplementary Table 4), a pattern consistent with the absence of survival bias. However, the current study is underpowered to formally exclude the presence of such effects.

Confirmation of associated type 1 diabetic nephropathy SNPs in the DCCT/EDIC study.

Data from a genome-wide association scan of the DCCT/EDIC study were used to assess whether genome regions identified in the GoKinD collection were associated with advanced diabetic nephropathy in an independent collection. Among the 11 SNPs identified in GoKinD, eight were included on the Illumina array used in the DCCT/EDIC study (Table 3). The three SNPs not included on this platform, rs39059, rs739401, and rs9521445, were in strong linkage disequilibrium (r2 ≥ 0.87) with rs39075, rs451041, and rs7989848, respectively. Analysis of time to onset of severe nephropathy confirmed the significant associations with diabetic nephropathy in GoKinD for rs1888746 (FRMD3, P = 0.02), rs13289150 (FRMD3, P = 0.05), and rs451041 (CARS, P = 0.01).
TABLE 3

HRs for the development of severe nephropathy during 16–22 years of follow-up in the DCCT/EDIC study for SNPs associated with diabetic nephropathy in GoKinD

Locus
P values and HRs in DCCT/EDIC collection severe nephropathy
SNPChromosomePosition (Mb)Nearest gene(s)Risk allele*Frequency of risk alleleP (one-sided)HR
rs390757p29.2CPVL/CHN2G0.60NS0.85
rs18887469q85.3FRMD3C0.700.021.33
rs13289150§9q85.4FRMD3A0.620.051.23
rs45104111p3.0CARSA0.510.011.32
rs104146613q109.0No geneG0.470.111.22
rs141176613q109.1No geneA0.360.111.17
rs649220813q109.1No geneT0.61NS0.90
rs798984813q109.1No geneA0.53NS0.93

Data are from multivariate Cox proportional hazard analysis of time to onset of severe nephropathy. As of 2005, the number of severe nephropathy cases was 132 (vs. 1,172 censored). Chromosomal locations, SNP positions, and gene annotations are in reference to NCBI Build 36.1.

*The risk alleles that are presented are in reference to those identified in the GoKinD collection;

†one-sided P values (consistent with the current “best practices” for replication in GWA scans) (20–22) are used to test for the same direction of effect as in the GoKinD collection;

‡rs1888746 was genotyped on an Illumina array in DCCT/EDIC and is in complete linkage disequilibrium (r2 = 1.0) with rs1888747 (genotyped using a Taqman assay in GoKinD);

§rs13289150 was genotyped on an Illumina array in DCCT/EDIC and is in complete linkage disequilibrium (r2 = 1.0) with rs10868025 (genotyped on an Affymetrix array in GoKinD).

HRs for the development of severe nephropathy during 16–22 years of follow-up in the DCCT/EDIC study for SNPs associated with diabetic nephropathy in GoKinD Data are from multivariate Cox proportional hazard analysis of time to onset of severe nephropathy. As of 2005, the number of severe nephropathy cases was 132 (vs. 1,172 censored). Chromosomal locations, SNP positions, and gene annotations are in reference to NCBI Build 36.1. *The risk alleles that are presented are in reference to those identified in the GoKinD collection; †one-sided P values (consistent with the current “best practices” for replication in GWA scans) (20–22) are used to test for the same direction of effect as in the GoKinD collection; rs1888746 was genotyped on an Illumina array in DCCT/EDIC and is in complete linkage disequilibrium (r2 = 1.0) with rs1888747 (genotyped using a Taqman assay in GoKinD); §rs13289150 was genotyped on an Illumina array in DCCT/EDIC and is in complete linkage disequilibrium (r2 = 1.0) with rs10868025 (genotyped on an Affymetrix array in GoKinD).

Analysis of candidate diabetic nephropathy geneexpression.

Previous studies, as well as publicly available gene expression data (www.ncbi.nlm.nih.gov/geo), have shown that genes closest to the lead SNPs identified in GoKinD are expressed in a variety of human tissues, including kidney (24–26). To further test whether these candidate genes may be involved in the development of diabetic nephropathy, we examined their expression in cell lines relevant to this disease. The expression of CHN2, CPVL, FRMD3, and CARS was examined in four primary human cell lines: iliac artery endothelial cells, adult dermal fibroblasts, mesangial cells, and renal proximal tubule cells. Our data show that CARS expression was high in all four of the cell lines that we examined (Table 4). FRMD3 expression was also detected in each cell type, with its highest expression being observed in renal proximal tubule cells. Of the two candidate diabetic nephropathy genes located in chromosome 7p region, neither was detected in mesangial cells, whereas CPVL expression was greatest in proximal tubule cells.
TABLE 4

Relative gene expression of novel candidate diabetic nephropathy susceptibility genes in primary human cell lines

Cell lineCPVL Ct valuesCHN2 Ct valuesFRMD3 Ct valuesCARS Ct values
Endothelial cells, iliac17.3 ± 0.712.5 ± 0.112.5 ± 0.38.4 ± 0.4
Fibroblasts15.7 ± 0.1n.e.11.9 ± 0.76.5 ± 0.4
Mesangial cellsn.e.n.e.12.9 ± 0.57.2 ± 0.1
Proximal tubules8.6 ± 0.113.1 ± 0.110.2 ± 0.28.0 ± 0.1

The relative gene expression of CPVL, CHN2, FRMD3, and CARS was determined in four primary human cell lines using real-time PCR. Relative gene expression of each gene was calculated in reference to a normalization control (β-actin) and is presented as the mean ΔCt (Ctvalue from each gene minus Ct value from the normalization control) and SD from three separate experiments. Mean ΔCt = 0 equals high relative gene expression (i.e., expression similar to β-actin). n.e., not expressed (mean ΔCt >40.0).

Relative gene expression of novel candidate diabetic nephropathy susceptibility genes in primary human cell lines The relative gene expression of CPVL, CHN2, FRMD3, and CARS was determined in four primary human cell lines using real-time PCR. Relative gene expression of each gene was calculated in reference to a normalization control (β-actin) and is presented as the mean ΔCt (Ctvalue from each gene minus Ct value from the normalization control) and SD from three separate experiments. Mean ΔCt = 0 equals high relative gene expression (i.e., expression similar to β-actin). n.e., not expressed (mean ΔCt >40.0).

DISCUSSION

In this report, we describe the results of a genome-wide association scan in the GoKinD collection to identify loci associated with risk of diabetic nephropathy in type 1 diabetes. The most significant associations were identified with variants located within four distinct chromosomal regions. Although the biology underlying these associations remains to be elucidated, they implicate CHN2/CPVL, FRMD3, CARS, and an intergenic region on chromosome 13q as novel genes/genetic regions involved in the pathogenesis of diabetic nephropathy. None of these loci overlap with previously reported associations between candidate genes and the development of any stage of diabetic nephropathy (10,11). Importantly, replication in a Cox proportional hazard analysis of the associations at the FRMD3 and CARS loci with time to the onset of severe nephropathy in the DCCT/EDIC study bolsters the significance of these two findings; that two studies having such different designs (one a case-control study and the other a prospective cohort study) yielded similar ORs strengthens confidence in this conclusion. FRMD3 encodes the 4.1O protein, a structural protein with unknown function and a member of the 4.1 family of proteins (26). Members of the 4.1 protein family have well-characterized roles as cytoskeletal proteins, maintaining both cellular shape and form, in a variety of cell types, including mouse nephron (27,28). Although membership of the 4.1O protein in this family has recently been questioned, it does contain a FERM domain, which is a module that is integral in maintaining cell integrity through its interactions with transmembrane proteins and actin filaments (29,30). FRMD3 is detectable in adult ovaries as well as in fetal skeletal muscle, brain, and thymus (26). Our data extend the expression profile of FRMD3 to specifically include mesangial and proximal tubular cells. Interestingly, among 18 genes that contain FERM domains, including several members of the 4.1 protein family, we identified nominally significant associations with diabetic nephropathy for SNPs located in eight of these genes (supplementary Table 5), including FARP2 (FERM, RhoGEF and pleckstrin domain protein 2; P = 3.0 × 10−4) and EPB41L2 (erythrocyte membrane protein band 4.1-like 2; P = 2.3 × 10−4). Although these findings require further study, including replication in additional collections, it is interesting to speculate that these data may point to the involvement of new, previously unsuspected pathways in the pathogenesis of diabetic nephropathy. The CARS gene encodes cysteinyl-tRNA synthetase, one of several aminoacyl-tRNA synthetases (ARSs) that have been identified in humans (31,32). ARSs are important regulators of intracellular amino acid concentrations and protein biosynthesis in both the cytoplasm and mitochondria (a process facilitated by specialized mitochondria-specific and bifunctional ARSs). In the initial steps of protein translation, the function of these enzymes is to attach amino acids to their cognate tRNA molecules. To date, both autosomal dominant and recessive mutations in ARS-encoding genes have been identified only in neurodegenerative disease, including missense changes in glycyl-tRNA synthetase (GARS) and both missense mutations and in-frame deletions in tyrosyl-tRNA synthetase (YARS) in Charcot-Marie-Tooth disease (32). CARS has been implicated in cystinosis, an autosomal recessive renal tubule disorder caused by the accumulation of free cystine in cellular lysosomes (33,34). A recent study identified defects in lysosomal cystine transport as the primary cause of the disease (35). However, ESRD is prominent in this disorder, and such an outcome may be due to vulnerability of specific renal cells to damage by excess cystine. Interesting, in this light, is the observation that of all the associated SNPs, only those in the CARS locus were associated primarily with ESRD (supplementary Table 4). CARS is expressed in mesangial and proximal tubule cells. Further work is needed to characterize the role of CARS in the pathway that is involved in the development of ESRD in diabetes. Similar to the set of genes containing FERM domains, analysis of 21 ARS genes identified nominally significant associations with diabetic nephropathy for SNPs located in four members of this class of genes (supplementary Table 6), with the most significant association (P = 9.1 × 10−3) occurring at the TARS (threonyl-tRNA synthetase) locus. Two additional loci were strongly associated with diabetic nephropathy in both panels of the GoKinD collection. Of the two genes located on chromosome 7p, CPVL, a carboxypeptidase that is highly expressed in the kidney and, more specifically, in proximal tubules, is a particularly interesting candidate gene. Other carboxypeptidases, such as ACE and bradykinin, are important regulators of renal hemodynamics and have previously been implicated in the pathogenesis of diabetic nephropathy (36,37). The last diabetic nephropathy–associated locus involves multiple SNPs within a 33 kb haplotype block on chromosome 13q. Previously, genomic deletions of this locus have been linked to congenital renal abnormalities (38). The two genes closest to the associated SNPs, MYO16 (myosin heavy-chain Myr 8) and IRS2 (insulin receptor substrate 2), are located ∼384 kb centromeric and 120 kb telomeric of this region, respectively. Although there is little linkage disequilibrium between the variants within this block and those in the vicinity of either MYO16 or IRS2, the multiple signals identified in this region give credence to the association detected in our analysis. Additional experiments are needed to characterize the nature of these associations further. The findings presented in our study contribute to understanding the genetic susceptibility of diabetic nephropathy in type 1 diabetes. As has been reported for other complex genetic disorders, no single major gene that contributes to an increased risk of disease emerged (20,39). However, given the incomplete coverage of the genome by the genotyping platform and the suboptimal study design (prevalent rather than incident cases of ESRD), detection of any existing major gene effect was not guaranteed. For example, because most of the case subjects with ESRD had survived many years on dialysis or with a kidney transplant, a disease allele that not only increased susceptibility to diabetic nephropathy but also increased mortality in patients with ESRD could go undetected. Appreciably, the SNPs that we identified in the GoKinD collection were mortality neutral (supplementary Table 4). The optimal study design for detecting all disease loci, regardless of their effect on mortality, would be a large cohort of incident ESRD case subjects. Such a data set is currently unavailable. There are other limitations to this study as well. The GoKinD collection is heavily weighted with case subjects with ESRD; thus, the small number of case subjects with proteinuria limited our ability to detect variants primarily associated with the risk of proteinuria. Second, because of the limited power of the DCCT/EDIC study and the need to contain inflation of the α-error in seeking replication for multiple SNPs in this dataset, our replication efforts refrained from considering SNPs less significant than P = 1 × 10−5. It is certainly possible that additional variants among those not meeting this threshold may truly be associated with diabetic nephropathy; however, given these limitations, these variants remain to be identified. Similarly, despite replication in the DCCT/EDIC cohort, we acknowledge that positive associations at both the FRMD3 and CARS loci require additional study to be certain of these findings. Third, although the locations of the variants confirmed in this study implicate both FRMD3 and CARS as novel genes involved in the pathogenesis of diabetic nephropathy, the underlying mechanisms of disease of these associations need to be elucidated. And, finally, although confirmation in DCCT/EDIC has been achieved for variations near FRMD3 and CARS, additional cohorts, particularly non-Caucasian, would be useful to further characterize the pathogenic role of these, and other, candidate genes identified in the GoKinD collection.
  38 in total

Review 1.  Cystinosis.

Authors:  William A Gahl; Jess G Thoene; Jerry A Schneider
Journal:  N Engl J Med       Date:  2002-07-11       Impact factor: 91.245

2.  Cloning and characterization of CPVL, a novel serine carboxypeptidase, from human macrophages.

Authors:  J A Mahoney; B Ntolosi; R P DaSilva; S Gordon; A J McKnight
Journal:  Genomics       Date:  2001-03-15       Impact factor: 5.736

Review 3.  The genetics of the protein 4.1 family: organizers of the membrane and cytoskeleton.

Authors:  K B Hoover; P J Bryant
Journal:  Curr Opin Cell Biol       Date:  2000-04       Impact factor: 8.382

Review 4.  Genetic studies of late diabetic complications: the overlooked importance of diabetes duration before complication onset.

Authors:  John J Rogus; James H Warram; Andrzej S Krolewski
Journal:  Diabetes       Date:  2002-06       Impact factor: 9.461

5.  Molecular cloning and characterization of the protein 4.1O gene, a novel member of the protein 4.1 family with focal expression in ovary.

Authors:  Xiaohua Ni; Chaoneng Ji; Gentao Cao; Haipeng Cheng; Lingchen Guo; Shaohua Gu; Kang Ying; Robert C Zhao; Yumin Mao
Journal:  J Hum Genet       Date:  2003       Impact factor: 3.172

6.  Incidence of ESRD and survival after renal replacement therapy in patients with type 1 diabetes: a report from the Allegheny County Registry.

Authors:  Rimei Nishimura; Janice S Dorman; Zsolt Bosnyak; Naoko Tajima; Dorothy J Becker; Trevor J Orchard
Journal:  Am J Kidney Dis       Date:  2003-07       Impact factor: 8.860

7.  Regression of microalbuminuria in type 1 diabetes.

Authors:  Bruce A Perkins; Linda H Ficociello; Kristen H Silva; Dianne M Finkelstein; James H Warram; Andrzej S Krolewski
Journal:  N Engl J Med       Date:  2003-06-05       Impact factor: 91.245

8.  Distinct distribution of specific members of protein 4.1 gene family in the mouse nephron.

Authors:  Mohamed Ramez; Marcel Blot-Chabaud; Françoise Cluzeaud; Sumita Chanan; Michael Patterson; Loren D Walensky; Shirin Marfatia; Anthony J Baines; Joel A Chasis; John G Conboy; Narla Mohandas; Philippe Gascard
Journal:  Kidney Int       Date:  2003-04       Impact factor: 10.612

9.  Genome-wide association analysis identifies 20 loci that influence adult height.

Authors:  Michael N Weedon; Hana Lango; Cecilia M Lindgren; Chris Wallace; David M Evans; Massimo Mangino; Rachel M Freathy; John R B Perry; Suzanne Stevens; Alistair S Hall; Nilesh J Samani; Beverly Shields; Inga Prokopenko; Martin Farrall; Anna Dominiczak; Toby Johnson; Sven Bergmann; Jacques S Beckmann; Peter Vollenweider; Dawn M Waterworth; Vincent Mooser; Colin N A Palmer; Andrew D Morris; Willem H Ouwehand; Jing Hua Zhao; Shengxu Li; Ruth J F Loos; Inês Barroso; Panagiotis Deloukas; Manjinder S Sandhu; Eleanor Wheeler; Nicole Soranzo; Michael Inouye; Nicholas J Wareham; Mark Caulfield; Patricia B Munroe; Andrew T Hattersley; Mark I McCarthy; Timothy M Frayling
Journal:  Nat Genet       Date:  2008-04-06       Impact factor: 38.330

10.  Exclusion of polymorphisms in carnosinase genes (CNDP1 and CNDP2) as a cause of diabetic nephropathy in type 1 diabetes: results of large case-control and follow-up studies.

Authors:  Krzysztof Wanic; Grzegorz Placha; Jonathon Dunn; Adam Smiles; James H Warram; Andrzej S Krolewski
Journal:  Diabetes       Date:  2008-09       Impact factor: 9.461

View more
  134 in total

Review 1.  Progress in Defining the Genetic Basis of Diabetic Complications.

Authors:  Emma Dahlström; Niina Sandholm
Journal:  Curr Diab Rep       Date:  2017-09       Impact factor: 4.810

Review 2.  Genome-wide association studies of chronic kidney disease: what have we learned?

Authors:  Conall M O'Seaghdha; Caroline S Fox
Journal:  Nat Rev Nephrol       Date:  2011-12-06       Impact factor: 28.314

3.  Genetic variation in the matrix metalloproteinase genes and diabetic nephropathy in type 1 diabetes.

Authors:  Masahiko Kure; Marcus G Pezzolesi; G David Poznik; Pisut Katavetin; Jan Skupien; Jonathon S Dunn; Josyf C Mychaleckyj; James H Warram; Andrzej S Krolewski
Journal:  Mol Genet Metab       Date:  2011-01-14       Impact factor: 4.797

4.  Polygenic effects of common single-nucleotide polymorphisms on life span: when association meets causality.

Authors:  Anatoliy I Yashin; Deqing Wu; Konstantin G Arbeev; Svetlana V Ukraintseva
Journal:  Rejuvenation Res       Date:  2012-04-25       Impact factor: 4.663

Review 5.  Does familial clustering of risk factors for long-term diabetic complications leave any place for genes that act independently?

Authors:  Andrew D Paterson; Shelley B Bull
Journal:  J Cardiovasc Transl Res       Date:  2012-06-23       Impact factor: 4.132

6.  A susceptibility gene for kidney disease in an obese mouse model of type II diabetes maps to chromosome 8.

Authors:  Streamson Chua; Yifu Li; Shun Mei Liu; Ruijie Liu; Ka Tak Chan; Jeremiah Martino; Zongyu Zheng; Katalin Susztak; Vivette D D'Agati; Ali G Gharavi
Journal:  Kidney Int       Date:  2010-06-02       Impact factor: 10.612

Review 7.  Linking variants from genome-wide association analysis to function via transcriptional network analysis.

Authors:  Benjamin Keller; Sebastian Martini; John Sedor; Matthias Kretzler
Journal:  Semin Nephrol       Date:  2010-03       Impact factor: 5.299

Review 8.  Glomerular diseases: genetic causes and future therapeutics.

Authors:  Chih-Kang Chiang; Reiko Inagi
Journal:  Nat Rev Nephrol       Date:  2010-07-20       Impact factor: 28.314

9.  A Genome-Wide Association Study to Identify Single-Nucleotide Polymorphisms for Acute Kidney Injury.

Authors:  Bixiao Zhao; Qiongshi Lu; Yuwei Cheng; Justin M Belcher; Edward D Siew; David E Leaf; Simon C Body; Amanda A Fox; Sushrut S Waikar; Charles D Collard; Heather Thiessen-Philbrook; T Alp Ikizler; Lorraine B Ware; Charles L Edelstein; Amit X Garg; Murim Choi; Jennifer A Schaub; Hongyu Zhao; Richard P Lifton; Chirag R Parikh
Journal:  Am J Respir Crit Care Med       Date:  2017-02-15       Impact factor: 21.405

Review 10.  Early detection of diabetic kidney disease: Present limitations and future perspectives.

Authors:  Chih-Hung Lin; Yi-Cheng Chang; Lee-Ming Chuang
Journal:  World J Diabetes       Date:  2016-07-25
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.