
Genetic analysis of hsCRP in American Indians: The Strong Heart Family Study.

Lyle G Best1, Poojitha Balakrishnan2, Shelley A Cole3, Karin Haack3, Jonathan M Kocarnik4, Nathan Pankratz5, Matthew Z Anderson6,7, Nora Franceschini8, Barbara V Howard9,10, Elisa T Lee11, Kari E North12, Jason G Umans9,10, Joseph M Yracheta1,13, Ana Navas-Acien13, V Saroja Voruganti14.   

Abstract

BACKGROUND: Increased serum levels of C-reactive protein (CRP), an important component of the innate immune response, are associated with increased risk of cardiovascular disease (CVD). Multiple single nucleotide polymorphisms (SNPs) associated with CRP levels have been identified, and Mendelian randomization studies have shown a positive association between SNPs increasing CRP expression and risk of colon cancer (but thus far not CVD). The effects of individual genetic variants often interact with the genetic background of a population, and hence we sought to resolve the genetic determinants of serum CRP in a number of American Indian populations.
METHODS: The Strong Heart Family Study (SHFS) has serum CRP measurements from 2428 tribal members, recruited as large families from three regions of the United States. Microsatellite markers and MetaboChip-defined SNP genotypes were incorporated into variance components decomposition-based linkage and association analyses.
RESULTS: CRP levels exhibited significant heritability (h² = 0.33 ± 0.05, p < 1.3 × 10⁻²⁰). A locus on chromosome (chr) 6, near marker D6S281 (approximately at 169.6 Mb, GRCh38/hg38), showed suggestive linkage (LOD = 1.9) to CRP levels. No individual SNPs were associated with CRP levels after Bonferroni adjustment for multiple testing (threshold p < 7.77 × 10⁻⁷); however, we found nominal associations, many of which replicate previous findings at the CRP, HNF1A and 7 other loci. In addition, we report association of 46 SNPs located at 7 novel loci on chromosomes 2, 5, 6 (2 loci), 9, 10 and 17, with an average of 15.3 Kb between SNPs and all with p-values less than 7.2 × 10⁻⁴.
CONCLUSION: In agreement with evidence from other populations, these data show that serum CRP levels are under considerable genetic influence and include loci, such as those near CRP and other genes, that replicate results from other ethnic groups. These findings also suggest possible novel loci on chr 6 and other chromosomes that warrant further investigation.

Year:  2019        PMID: 31622379      PMCID: PMC6797125          DOI: 10.1371/journal.pone.0223574

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Immune and inflammatory factors have longstanding roles in microbial infection [1,2] and auto-immune disorders [3]; it is becoming increasingly clear that they also influence the pathogenesis and complications of metabolic conditions [4], cancer [5] and other chronic diseases [6]. C-reactive protein (CRP) is a prominent component of the innate immune system involved in non-self recognition and destruction [7] and has been employed as a non-specific measure of inflammatory status in epidemiologic and clinical studies of numerous disorders [8-10]. For example, elevated serum CRP is prospectively associated with a number of cancer types, including colon [11], breast [12] and lung [13]. In addition, two studies using Mendelian randomization approaches to assess the influence of inherited increases in CRP level on the risk for colorectal cancer supported a causal relationship between increased CRP and cancer [14,15]. Evidence for a genetic influence on the relationship between immune factors, metabolic syndrome and cardiovascular risk factors is provided by the association between alleles that increase CRP levels and increased risk of obesity [16,17]. The interaction between genetic influences on basal CRP levels and a number of environmental factors has been investigated using heritability estimation [18], candidate gene [19], genome-wide linkage [18], genome-wide association (GWAS) [20,21] and other genetic approaches [22]. Some of the more compelling results from GWAS are summarized in Table 1. In general, excluding the CRP gene itself, 7 genomic regions have been associated with CRP in these studies. Within 5 Mb of the CRP gene, variants of IL6R were shown to be independently associated with CRP levels [19]. Another chromosome (chr) 1 locus encompassing the LEPR/JAK1 genes (which play a key role in immune response pathways [23]) has several SNPs associated with serum CRP [21,24]. On chr 2, SNPs at GCKR have been implicated in regulation of CRP expression [24]. SNPs near the EPHA7 and IL6 genes, on chr 6 and 7 respectively, show association with CRP levels as well [25,26]. Variation in the HNF1A region of chr 12 has been repeatedly reported to be correlated with CRP expression [27,28]. Variants in three genes, TOMM40, APOE and APOC1, in a 30 Kb span of chr 19 are prominently related to serum CRP [19,24,25].
Table 1

GWAS Catalog [115] and selected SNP associations with serum CRP from the literature.

SNP | Chr | Coordinates | Risk Freq | β or Odds Ratio | P-value | Risk Allele | Upstream | Intragenic | Downstream | ethn* | Ref**
rs1805096165,636,5740.37-0.113.6 X 10−8GNoLEPRNoEUR[72]
0.46-0.115.4 X 10−5AA[72]
rs1892534165,640,2610.39-0.085.8 X 10−8TNoLEPRNoEA[24]
0.46-0.084.4 X 10−3AA[24]
rs4420065165,695,7780.390.093.5 X 10−62CLEPRNoPDE4BEA[21]
rs41292671154,453,7880.40-0.082.1 X 10−48TNoIL6RNoEA[21]
0.40-0.085.2 X 10−21EA[19]
0.13-0.125.7 X 10−7AA[19]
rs22281451154,454,4940.40-0.127.8 X 10−11CNoIL6RNoEUR[72]
0.14-0.092.6 X 10−2AA[72]
rs120936991159,678,1980.29NA***6.0 X 10−6NAOR10J6PNoCRPP1EUR[112]
rs104943261159,679,9100.1780.41994.0 X 10−73TOR10J6PNoCRPP1AA/HIS[20]
rs7266401159,685,728NA0.442.0 X 10−13NAOR10J6PNoCRPP1AA[116]
rs25929021159,685,9360.38NA1.0 X 10−9AOR10J6PNoCRPP1EA/AA[94]
rs127556061159,700,546NANA4 X 10−120COR10J6PNoCRPP1EUR[117]
rs8765371159,705,1430.430.291.4 X 10−9CNoCRPP1NoFIL[25]
rs168425591159,706,3810.890.1064.0 X 10−21TCRPP1NoCRPAS[118]
rs27945201159,709,0260.660.162 X 10−186CCRPP1NoCRPEUR[21]
0.34NA3.0 X 10−8EA[119]
0.600.191.8 X 10−15EA[29]
NA-0.204.7 X 10−26EA[77]
rs12051159,712,4430.33-0.171.0 X 10−31TNoCRPNoEA[24]
-0.1991.65 X 10−26EA[77]
0.20-0.278.1 X 10−15AA[24]
0.35-0.225.37 X 10−09HIS[24]
0.46-0.268.5 X 10−09FIL[25]
rs18009471159,713,6480.06-0.303.1 X 10−25GNoCRPNoEUR[24]
0.01-0.611.3 X 10−6AA[24]
0.02-0.366.7 X 10−3HIS[24]
0.06-0.274.8 X 10−12EUR[72]
0.01-0.581.5 X 10−5AA[72]
rs778324411159,714,0240.002-0.751.4 X 10−4ANoCRPNoEUR[72]
0.005-2.066.6 X 10−4AA[72]
rs14179381159,714,3960.300.145.6 X 10−7ANoCRPNoEUR[24]
0.110.201.2 X 10−2AA[24]
0.360.142.7 X 10−4HIS[24]
rs30912441159,714,8750.380.173.5 X 10−91GCRPNoRPL27P2EA[19]
0.550.245.1 X 10−45AA[19]
0.080.265.2 X 10−7FIL[25]
NA0.206.0 X 10−28EA[77]
rs30930591159,715,3460.120.1614.0 X 10−21GCRPNoRPL27P2JPT[30]
rs30930581159,715,5250.0010.321.4 X 10−1TCRPNoRPL27P2EUR[24]
0.170.481.4 X 10−40AA[24]
0.010.671.0 X 10−3HIS[24]
rs13416651159,721,7690.96-0.192.0 X 10−20ACRPNoRPL27P2EA[29]
rs28086341159,722,7830.1560.1533.0 X 10−10TCRPNoRPL27P2EA/AA[21]
rs75530071159,728,759NAOR 20.78.0 X 10−44NACRPNoRPL27P2EUR/AS[28]
0.3440.1291.0 X 10−9GHIS[20]
0.2280.2721.0 X 10−37TAA[20]
0.3690.1642.0 X 10−16CAS[120]
0.3270.2027.0 X 10−12AEUR[121]
rs112652601159,730,249NANA7.0 X 10−6NACRPNoRPL27P2EUR/EA[108]
rs7561273224,024,6440.350.226.0 X 10−6ANoMFSD2BNoMIC[124]
rs1260326227,508,0730.410.074.6 X 10−40GNoGCKRNoEA[21]
0.420.102.4 X 10−17EA[19]
0.150.062.1 X 10−2AA[19]
0.031.0 X 10−3AS[26]
rs780094227,518,3700.400.101.5 X 10−16CNoGCKRNoEA[24]
0.190.032.3 X 10−1AA[24]
0.340.072.6 X 10−2HIS[24]
rs14411692213,168,8060.53-0.032.3 X 10−11GLINCO1953NoIKZF2EUR[64]
rs9602462223,072,8410.0130.221.0 X 10−9TKCNE4NoLOC105373905AS[122]
rs15148953170,987,9040.71-0.032.7 X 10−9ASLC2A2NoEIF5A2EUR[86]
rs16871289421,509,7600.0170.039.0 X 10−6ANoKCNIP4NoHIS[123]
rs68460714101,481,0580.0160.2241.0 X 10−11GFLJ20021NoLOC105377346AS[122]
rs283610573,952,6870.4560.037.0 X 10−6GARHGEF28NoCTD-2292M14.1[123]
rs4653845125,907,327NANA1.0 X 10−6NARP11-756H20.1NoRP11-114J13.1EUR[117]
rs176582295172,764,0490.050.065.5 X 10−9CAC022217.2NoDUSP1EUR[86]
rs1408282693,142,5340.100.372.9 X 10−6ACOPS5P1NoEPHA7FIL[25]
rs6904416698,542,6130.0190.1839.0 X 10−10CRP11-436D23.1NoPOU3F2AS[122]
rs122026416115,993,4710.39-0.023.0 X 10−10NoFRKNoEUR[86]
rs93855326130,050,0820.33-0.031.9 X 10−11NoL3MBTLNoEUR[86]
rs69077286131,907,6290.1860.043.0 X 10−6CENPP1NoCTGFHIS[123]
rs2097677722,693,220NA0.052.6 X 10−9AAC002480.2NoIL6AS[26]
rs1880241722,719,8500.48-0.038.4 X 10−14GNoIL6EUR[86]
rs2710804736,044,9190.370.021.3 X 10−8ClncRNANoEEPD1EUR[86]
rs6956675763,117,3920.1350.036.0 X 10−6ASAPCD2P4NoSEPT14P1HIS[123]
rs102552997111,887,5040.0130.2417.0 X 10−11GNoDOCK4NoAS[122]
rs10125337994,681,7560.0040.034.0 X 10−6GFBP1NoLOC107987101HIS[123]
rs6434349133,266,9430.370.021.0 X 10−9ANoABONoEUR[86]
rs70762471018,470,7000.37NA6.0 X 10−6NANoCACNB2NoEUR[112]
rs1106658712113,541,8510.160.265.0 X 10−6GLHX5-AS1NoLOC105369990MIC[124]
rs103930212120,798,4550.360.215.0 X 10−6TNoSPPL3NoMIC[124]
rs265000012120,951,159-0.127.1 X 10−11ANoHNF1A-AS1NoEA[77]
0.35-0.122.6 X 10−23EA[24]
0.12-0.095.2 X 10−3AA[24]
0.36-0.119.5 X 10−4HIS[24]
rs730561812120,965,1290.520.2671.0 X 10−8TNoHNF1A-AS1NoFIL[25]
rs795324912120,965,921-0.137.0 X 10−13GNoHNF1A-AS1NoEA[77]
rs116928912120,978,8190.46-0.129.0 X 10−11GNoHNF1ANoEUR[72]
0.34-0.062.2 X 10−2AA[72]
rs116928812120,978,8470.34-0.119.5 X 10−9CNoHNF1ANoEUR[72]
0.12-0.084.0 X 10−2AA[72]
rs118391012120,983,0040.310.11****6 X 10−76ANoHNF1ANoEA[27]
14%*****1.2 X 10−17EA[28]
0.33-0.152 X 10−124EA[21]
rs239379112120,986,1530.4780.0493.0 X 10−9CNoHNF1ANoAS[122]
rs731040912120,987,0580.40-0.181.6 X 10−10A/GNoHNF1ANoEA[24]
0.32-0.157.9 X 10−3AA[24]
0.41-0.131.1 X 10−3HIS[24]
0.53-0.072.7 X 10−8JPT[30]
0.67-0.221.6 X 10−6FIL[25]
rs225982012120,997,5390.31-0.121.8 X 10−9TNoHNF1ANoEUR[72]
0.12-0.079.6 X 10−2AA[72]
rs246419612120,997,6240.32-0.129.3 X 10−9ANoHNF1ANoEUR[72]
0.12-0.078.1 X 10−2AA[72]
rs116931012121,001,6300.380.132.0 X 10−8ANoHNF1ANoEUR[108]
rs25269321472,614,3600.0120.2756.0 X 10−13GRP3-514A23.2NoDPF3AS[122]
rs22392221472,545,1770.360.049.9 X 10−20GNoRGS6NoEUR[86]
rs1126352991494,371,8050.02-0.112.1 X 10−10TSERPINA1NoSERPINA2EUR[86]
rs1788101716,194,1160.560.022.9 X 10−8TNoNCOR1NoEUR[86]
rs8920731929,421,3870.0440.038.0 X 10−6ANoCTC-525D6.1NoHIS[123]
rs20756501944,892,3620.15-0.124.2 X 10−8GNoTOMM40NoEA[65]
0.14-0.221.8 X 10−38EA[24]
-0.216.8 X 10−16EA[77]
0.13-0.182.2 X 10−47EA[19]
0.14-0.026.5 X 10−1AA[24]
rs1575811944,892,4570.21-0.162.4 X 10−12CNoTOMM40NoEUR[72]
0.47-0.095.1 X 10−4AA[72]
rs115565051944,892,8870.14-0.182.9 X 10−11TNoTOMM40NoEUR[72]
0.12-0.026.3 X 10−1AA[72]
rs1128492591944,894,0500.03-0.281.3 X 10−6CNoTOMM40NoEUR[72]
0.04-0.371.1 X 10−7AA[72]
rs7694491944,906,7450.386.8 X 10−3GNoAPOENoFIL[125]
rs7694501944,907,1870.031.0 X 10−4NANoAPOENoEA[19]
0.370.081.6 X 10−6AA[19]
rs4293581944,908,6840.11-0.317.0 X 10−8CNoAPOENoEUR[72]
0.19-0.241.5 X 10−6AA[72]
rs44206381944,919,6890.18-0.241.0 X 10−56GAPOC1NoAPOC4EA[24]
0.20-0.032.7 X 10−1AA[24]
0.10-0.185.6 X 10−4HIS[24]
0.20-0.249 X 10−139EA[21]
0.21-0.281.6 X 10−6MIC[124]
rs21593241945,192,4800.440.192.0 X 10−6TNoAC005779.2NoMIC[124]
rs23150082063,712,6040.31-0.025.4 X 10−10TNoZGPATNoEUR[86]
rs23156562063,786,9840.3950.034.0 X 10−6GNoZBTB46NoHIS[123]

* ethnicity, EA: European American, AA: African American, HIS: Hispanic, FIL: Filipino, AS: Asian, JPT: Japanese, MIC: Micronesia, EUR: European.

** endnote reference.

*** NA: not available.

**** per allele effect in z score units -0.11 (lnCRP).

***** % change in ln CRP per minor allele.

Although genetic variants common across populations associate with CRP levels, there also appear to be variants in multiple loci across the genome with differential strength of effects [24,29,30], or that are found primarily in certain populations, such as African Americans [19], and in some cases are very restricted in prevalence (Aboriginal Canadians [31]). There have been very few studies focused on indigenous populations of North and South America [31-33], and none employing linkage or GWAS analysis. These have shown similar heritability of CRP (29 to 46%) and differing population prevalences of variants affecting CRP levels. Cardiovascular disease (CVD) [34], diabetes mellitus (DM) [35] and other conditions [36] with a significant inflammatory component account for a disproportionately large fraction of mortality and morbidity in American Indian (AI) communities. A better understanding of the genetic contributions to this important component of the innate immune system may shed light on some of these health disparities. Unfortunately, genetic research among indigenous peoples has become more challenging after the inappropriate activities of some investigations have been revealed [37,38]. The resulting lack of trust in investigators, exhibited among AI and other populations, can have many important societal impacts, including the possibility of worsening already adverse health disparities [39,40]. The aim of this study is to identify genetic loci influencing basal CRP levels using genome-wide linkage and extensive SNP genotyping among participants in a large and well-characterized cohort of American Indians, the Strong Heart Family Study (SHFS).

Methods

Population

The Strong Heart Study (SHS) is a population-based cohort study of CVD and associated risk factors among American Indians in three centers in Arizona, Oklahoma and North/South Dakota. The participating communities, study design, survey methods and laboratory techniques have been described previously [41,42]. The SHS was extended in 1998 and subsequent phases as the Strong Heart Family Study, recruiting participants 16 years and older, without regard to disease status, from multi-generational families, including index members of the SHS cohort. All participants have given written, informed consent. In addition, approval for this study was obtained from relevant tribal communities and institutional review boards, including Great Plains Indian Health Service (IHS) Institutional Review Board (IRB), Oglala Sioux Research Review Board, Oklahoma IHS IRB, University of Oklahoma IRB, Phoenix Area IHS IRB, MedStar Health Research Institute IRB, University of North Carolina IRB, Columbia University IRB, and University of Texas Health IRB. The collection of phenotypic data for the SHFS was conducted between 2001 and 2003 according to methods described previously [41]. "Ever" smoking was defined as having smoked at least 100 cigarettes during the lifetime and "current" smoking as present, regular use of smoked tobacco. "Current" and "ever" alcohol intake were defined as having had at least 12 alcoholic beverages in the last year or in past years, respectively.

Biomarker, serum CRP

CRP was measured using an immunoturbidometric method (Vitros Chemistry Products, number 6801739, Ortho Clinical Diagnostics, Rochester, NY), on a Vitros 5,1 platform (Ortho Clinical Diagnostics, Rochester, NY). This method has shown good comparability to results from the previous Dade-Behring immunonephelometric method [43].

Genome-wide linkage analysis, quality control

The procedures for genotyping microsatellite markers in the SHFS have been described previously [44]. In brief, DNA was amplified with primers specific for short tandem repeat markers using the ABI PRISM Linkage Mapping Set-MD10 Version 2.5 (Applied Biosystems, Foster City, CA). PCR products were loaded into an ABI PRISM 377 DNA sequencer for laser-based automated genotyping. Analyses and assignment of the marker alleles were done using computerized algorithms (Applied Biosystems). deCODE Genetics provided sex-averaged chromosomal maps (in units of Haldane centimorgans) for this analysis [45]. Pedigrees were screened with the Pedigree Relationship Statistical Tests (PREST) [46] and SimWalk2 [47] programs to check for Mendelian inconsistencies and possible double recombinants. This screening resulted in less than 1% of all genotypes being excluded. Multipoint identity-by-descent (IBD) matrices for genome-wide linkage analyses were calculated using the linkage analysis package LOKI [48].

Genome-wide association analysis, quality control

To study potential effects of environmental exposures on incident diabetes [49], a subset of the SHFS [42] without prevalent diabetes has been genotyped utilizing the Illumina Human Cardio-Metabo BeadChip array (MetaboChip, Illumina, San Diego, CA), a custom Illumina panel incorporating 196,725 SNPs previously identified as significant GWAS signals for metabolic and CVD traits [50]. Blood samples collected from individuals who were free of DM at the baseline visit were used for this study and genotyped at the Texas Biomedical Research Institute, San Antonio, TX. All genomic positions listed are derived from NCBI GRCh38/hg38. Non-autosomal (n = 250) and monomorphic markers (n = 158) were removed prior to genotyping quality control. Mendelian inconsistencies were excluded using Preswalk, a PEDSYS compatible version of Simwalk2 [47]. SNPs with a marker call rate < 98% or no data (n = 33,604) and individual samples with a call rate < 95% (n = 3) were excluded. Allele frequency and Hardy-Weinberg equilibrium (HWE) values were estimated using Sequential Oligogenic Linkage Analysis Routines (SOLAR) [51]. Markers failing HWE analysis at p < 10⁻⁵ (n = 1,519) and those with minor allele frequencies (MAF) less than 1% (n = 40,219) were also excluded. Since there have been reports of duplicate sequences surrounding certain SNPs (most easily recognized when the duplicate is on a sex chromosome) [52], we conducted an additional screen for significant differences in genotype distribution between genders among the 69 SNPs with association p-values < 4 × 10⁻⁴ that passed the typical screens described above. Within this group there were 3 SNPs that showed significant differences (p < 0.05) in genotype and allelic distribution between genders and were thus excluded. Details from two examples are presented in the S1 Table. Pairwise correlations (r²) between markers were calculated to estimate linkage disequilibrium (LD). The original annotation file for the Cardio-Metabo BeadChip, “MetaboChip_Gene_Annotation”, is accessible through the Illumina website. A PEDSYS [53] compatible version of Merlin [54] was used for pedigree-guided imputation of array marker data using the UCSC Genome Browser hg18 assembly [55]. The lack of comparable data sources for AI populations necessitated the use of primarily European data from the UCSC assembly. The final data set includes 120,972 autosomal markers with information available for MetaboChip analysis of 1,892 AI participants.
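The marker counts quoted in this section can be tallied as a quick consistency check. The sketch below is only an illustration: it assumes the exclusion counts are disjoint, that the three SNPs with gender-specific genotype distributions are part of the tally, and that the steps were applied in the order listed (the step labels and function name are hypothetical).

```python
# Marker counts quoted in the text; the step labels are illustrative.
STARTING_MARKERS = 196_725

EXCLUSIONS = [
    ("non-autosomal", 250),
    ("monomorphic", 158),
    ("call rate < 98% or no data", 33_604),
    ("HWE p < 1e-5", 1_519),
    ("MAF < 1%", 40_219),
    ("genotype distribution differs by gender", 3),
]


def tally_markers(start, exclusions):
    """Return (step, markers_left) pairs after each exclusion is applied in order."""
    remaining = start
    trail = []
    for label, removed in exclusions:
        remaining -= removed
        trail.append((label, remaining))
    return trail


trail = tally_markers(STARTING_MARKERS, EXCLUSIONS)
for label, remaining in trail:
    print(f"{label:45s} {remaining:>8,d}")

final_count = trail[-1][1]
```

Under these assumptions the running total ends at the 120,972 autosomal markers reported for the final data set.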

Statistical analysis

Genome-wide linkage analysis

We used stepwise linear regression in center-stratified samples to screen covariates (SAS, version 8.0). Quantitative genetic analysis was conducted using a maximum likelihood variance components decomposition-based method [51]. This approach was implemented in the computer program SOLAR, version 8.1.1 [51], which allows for an explicit test of whether phenotypic covariance among family members is in part due to genetic effects. A total of 2,428 SHFS participants were considered for linkage analysis (Arizona (AZ) = 286, Dakota (DK) = 1,066, Oklahoma (OK) = 1,076), as seen in Table 2, after excluding those individuals with missing covariate data and, as indicated below, those removed to normalize the phenotypic trait distribution. Because variance components methods are sensitive to kurtosis [56] and to avoid including those with an acute inflammatory process, phenotypic outliers (N = 195) with CRP levels > 16.0 (~3 standard deviations (SD) above the mean) were removed prior to analysis. In addition, CRP levels were natural log transformed. All analyses were conducted separately for each center and then on the combined data from all three centers. To maximize our power to detect genetic effects, a minimally adjusted model (Model 2, Table 3), incorporating age, age², age*sex, age²*sex, sex, and center covariates, was analyzed first. Secondary analyses considered adjustment for the linear fixed effects of the covariates listed in Table 2, which were previously shown to influence the trait in epidemiological studies [32,57-59]. We additionally confirmed the significance of Model 2 covariates while accounting for family relationships in SOLAR. Residuals were generated for Model 2 and used in all subsequent genetic analyses. Kurtosis values for CRP were < 0.50 for all analyses.
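The trait preparation described above (outlier removal, natural log transformation, kurtosis check) can be sketched as follows. This is a minimal illustration, not the study's pipeline: the function names and the example values are hypothetical, the covariate regression step is omitted, and the kurtosis estimator (population excess kurtosis) is an assumption, since the text does not say which estimator was used.

```python
import math


def prepare_trait(crp_values, cutoff=16.0):
    """Drop values above the cutoff, then natural-log transform the rest."""
    return [math.log(v) for v in crp_values if v <= cutoff]


def excess_kurtosis(values):
    """Population excess kurtosis: m4 / m2**2 - 3 (0 for a normal distribution)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n
    m4 = sum((v - mean) ** 4 for v in values) / n
    return m4 / m2 ** 2 - 3.0


# Made-up CRP values for illustration only (not study data).
example_crp = [0.4, 0.9, 1.5, 2.2, 3.1, 4.8, 7.5, 12.0, 25.0, 40.0]
ln_crp = prepare_trait(example_crp)
print(len(ln_crp), round(excess_kurtosis(ln_crp), 3))
```

In practice the kurtosis would be checked on the covariate-adjusted residuals rather than on the raw log-transformed values shown here.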
Table 2

Descriptive characteristics of SHFS participants stratified by study recruitment center.*

Characteristic | Linkage Study, AZ | Linkage Study, DK | Linkage Study, OK | SNP Association Study, AZ | SNP Association Study, DK | SNP Association Study, OK
Participants (N) | 286 | 1066 | 1076 | 195 | 901 | 796
 Gender (female) | 66.4% | 58.7% | 57.2% | 65.5% | 60.0% | 58.6%
 Age, years mean (± SD) | 37.7 (16.6) | 38.5 (16.8) | 43.3 (17.3) | 33.2 (14.6) | 36.5 (15.9) | 40.2 (16.1)
Pedigrees, N | 13 | 16 | 9 | | |
 Generations | 5 | 5 | 6 | | |
(ln)hsCRP, in mg/L, mean (± SD) | 1.267 (1.059) | 0.964 (1.109) | 0.976 (1.074) | 1.196 (1.044) | 0.904 (1.144) | 0.896 (1.092)
Diabetes** N (%) | 82 (29) | 140 (13) | 207 (19) | 4 (9) | 17 (2) | 24 (3)
Smoking,*** ever/current, N (%) | 133 (47) | 701 (66) | 625 (58) | 71 (41) | 556 (65) | 430 (56)
Waist, cm, mean (± SD) | 109.5 (20) | 99.0 (17) | 101.5 (17) | 107.6 (21.4) | 97.7 (16.2) | 99.7 (16.6)
Menopausal Yes, N (%)**** | 50 (26) | 157 (25) | 224 (36) | 16 (14) | 105 (20) | 127 (28)
Alcohol Current, N (%) | 177 (62) | 715 (67) | 518 (48) | 111 (65) | 598 (70) | 403 (52)
Total cholesterol mg/dl, mean (± SD) | 173.6 (33.1) | 181.7 (36.6) | 185.8 (37.1) | 171.9 (32.4) | 181.0 (36.0) | 184.7 (35.8)
HDL-Cholest, mg/dl mean (± SD) | 49.3 (14.3) | 50.7 (13.7) | 52.9 (15.5) | 48.8 (14.3) | 50.8 (13.8) | 54.0 (15.7)
LDL-Cholest, mg/dl mean (± SD) | 94.2 (25.8) | 100.9 (30.8) | 100.1 (30.2) | 94.6 (25.5) | 101.1 (30.6) | 100.1 (30.6)
Triglycerides, mg/dl mean (± SD) | 158.2 (111.0) | 157.3 (138.9) | 172.1 (176.4) | 147.8 (85.1) | 150.9 (129.0) | 157.1 (103.6)
Estrogen use Yes, N (%) | 40 (21) | 102 (16) | 92 (15) | 2 (2) | 40 (8) | 60 (13)
BMI, Kg/m² mean (± SD) | 34.4 (8.5) | 29.9 (6.6) | 30.9 (6.8) | 34.1 (9.0) | 29.8 (6.6) | 30.4 (6.8)
HbA1c (%, ± SD) | 7.0 (2.2) | 6.0 (1.6) | 6.5 (1.9) | 5.7 (1.3) | 5.5 (0.9) | 5.6 (0.9)
Systolic blood pressure (mmHg) | 119.7 (14.8) | 119.9 (16.0) | 126.4 (17.2) | 117.3 (13.0) | 118.5 (14.6) | 124.2 (16.2)

* Percentages and means calculated only from those with available measurements.

** Diabetes was determined using the American Diabetes Association criteria.

*** Smoking was defined as "current" or "ever" smokers.

**** Percentage of females.

Table 3

Overall heritability assuming various models.

Model | Covariates included in the final model | h² (SE) | P-value | Chrom (loc) | LOD score
1 | None | 0.29 (0.04) | 4. × 10⁻¹⁷ | 5 (25) | 0.074
 | | | | 6 (185) | 0.800
 | | | | 6 (189) | 1.037
 | | | | 6 (191) | 1.065
 | | | | 19 (93) | 1.611
2 | age, age², age x sex, age² x sex, sex, center | 0.33 (0.05) | 1.3 × 10⁻²⁰ | 5 (25) | 0.002
 | | | | 6 (185) | 1.615
 | | | | 6 (189) | 1.825
 | | | | 6 (191) | 1.824
 | | | | 19 (93) | 1.431
3 | age, age², age x sex, age² x sex, sex, center, smoking status | 0.33 (0.05) | 6.7 × 10⁻²¹ | 5 (25) | 0.002
 | | | | 6 (185) | 1.583
 | | | | 6 (189) | 1.880
 | | | | 6 (191) | 1.896
 | | | | 19 (93) | 1.315
4 | age, age², age x sex, age² x sex, sex, center, Waist circumference, Body fat, total cholesterol, triglycerides, HDL, LDL, HbA1C, Systolic blood pressure | 0.32 (0.09) | 9.7 × 10⁻⁶ | 5 (25) | 2.002
 | | | | 6 (185) | 0.012
 | | | | 6 (190)* | 0.000
 | | | | 6 (190) | 0.000
 | | | | 19 (90) | 0.192
5 | age, age², age x sex, age² x sex, sex, center, DM status | 0.32 (0.05) | 1.2 × 10⁻¹⁸ | 5 (25) | 0.002
 | | | | 6 (185) | 1.512
 | | | | 6 (189) | 1.807
 | | | | 6 (191) | 1.828
 | | | | 19 (93) | 1.463
6 | age, age², age x sex, age² x sex, sex, center, hypertension status | 0.33 (0.05) | 1.3 × 10⁻¹⁹ | 5 (25) | 0.000
 | | | | 6 (185) | 1.493
 | | | | 6 (189) | 1.714
 | | | | 6 (191) | 1.698
 | | | | 19 (93) | 1.332
7 | age, age², age x sex, age² x sex, sex, center, hormone replacement therapy status | 0.35 (0.07) | 2.2 × 10⁻⁹ | 5 (25) | 0.000
 | | | | 6 (185) | 1.561
 | | | | 6 (189) | 1.236
 | | | | 6 (191) | 1.002
 | | | | 19 (95) | 0.009

Highest LOD score in each center, compared with other centers
Center | Covariates included in the final model | h² (SE) | P-value | Chrom (loc) | LOD score
AZ | age, age², age x sex, age² x sex, sex | 0.70 (0.16) | 3.3 × 10⁻⁶ | 18 (36) | 2.360
DK | age, age², age x sex, age² x sex, sex | | | 18 (35) | 0.001
OK | age, age², age x sex, age² x sex, sex | | | 18 (35) | 0.000
AZ | age, age², age x sex, age² x sex, sex | | | 16 (50) | 0.000
DK | age, age², age x sex, age² x sex, sex | 0.33 (0.06) | 3.2 × 10⁻¹¹ | 16 (51) | 2.236
OK | age, age², age x sex, age² x sex, sex | | | 16 (50) | 0.000
AZ | age, age², age x sex, age² x sex, sex | | | 6 (190) | 0.000
DK | age, age², age x sex, age² x sex, sex | | | 6 (193) | 0.499
OK | age, age², age x sex, age² x sex, sex | 0.28 (0.07) | 2.0 × 10⁻⁷ | 6 (193) | 1.284

* closest available locus.

Genome-wide association analysis using MetaboChip array

MetaboChip genotyping was limited to the subset of SHFS participants without DM during the pilot (1997–1999) and the next phase (2001–2003); thus a total of 1,892 SHFS participants were included in the SNP association analysis. Linear regression models for CRP with each SNP were used under the assumption of an additive genetic model, and the analysis was performed using variance components decomposition-based models, as implemented in the SOLAR software package [51], to account for the non-independence among family members. Principal component analysis (PCA) was used to derive principal component scores (PCs) modeling differences in ancestral contributions among study participants. PCs were calculated using the unrelated SHFS founders (n = 644) and a subset of 15,158 selected SNPs (r² < 0.1; MAF > 0.05). PCA was performed on a matrix of “doses” (copies of minor allele) for the selected SNPs, using “prcomp” in R. The PCs were then predicted for all genotyped individuals using the PCA model fit to the founder data [60]. While no PC accounted for a large percentage of total variance in genotype scores, the first four PCs account for substantially more than the rest and were, therefore, included as additional covariates in association analyses. To minimize the problem of non-normality, the CRP data were log-transformed. All analyses involved adjustment for basic covariates (age, age², age*sex, age²*sex, sex2 and PCs). We stratified the association analysis by geographical location (Arizona, Oklahoma, North and South Dakota) to account for possible differences between the three locations. After consideration of linkage disequilibrium effects using the Moskvina and Schmidt method [61], the 120,972 analyzed SNPs had an effective size of 64,375 and the Bonferroni significance level was determined to be p < 7.77 × 10⁻⁷. When considering SNPs or gene regions previously shown to be significantly associated with CRP in the literature, a p-value of 0.05 was considered evidence of replication.
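The significance level quoted above follows from dividing a nominal alpha by the effective number of tests. A minimal sketch, assuming a nominal alpha of 0.05 (the text does not state it explicitly, but the reported threshold is consistent with it); the effective number itself is taken from the text, not recomputed:

```python
ALPHA = 0.05              # assumed nominal alpha
ANALYZED_SNPS = 120_972   # SNPs analyzed, from the text
EFFECTIVE_TESTS = 64_375  # effective number of tests reported in the text


def bonferroni_threshold(alpha, n_tests):
    """Per-test significance level for a family-wise error rate of alpha."""
    return alpha / n_tests


threshold = bonferroni_threshold(ALPHA, EFFECTIVE_TESTS)
naive = bonferroni_threshold(ALPHA, ANALYZED_SNPS)
print(f"effective-test threshold: {threshold:.2e}")
print(f"per-SNP threshold:        {naive:.2e}")
```

Correcting for the effective rather than the raw number of SNPs gives a less stringent threshold, because markers in LD do not represent independent tests.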

Metal

METAL software [62] was used to meta-analyze the association results from the three study centers. Each center contributed its own MetaboChip-wide association results, and each marker was then analyzed across all three centers to identify markers with significant results. The fixed-effect meta-analysis across the center-specific association results used I² to assess heterogeneity across centers.
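For readers unfamiliar with fixed-effect pooling and the I² statistic, the sketch below illustrates one common formulation: inverse-variance weighting with Cochran's Q. This is an assumption for illustration; the text does not state which METAL weighting scheme was used, and the per-center estimates shown are hypothetical, not study results.

```python
import math


def fixed_effect_meta(betas, ses):
    """Inverse-variance fixed-effect pooling with Cochran's Q and I^2 (percent)."""
    weights = [1.0 / se ** 2 for se in ses]
    total_w = sum(weights)
    pooled = sum(w * b for w, b in zip(weights, betas)) / total_w
    pooled_se = math.sqrt(1.0 / total_w)
    # Cochran's Q: weighted squared deviations of each estimate from the pooled one.
    q = sum(w * (b - pooled) ** 2 for w, b in zip(weights, betas))
    df = len(betas) - 1
    # I^2: share of total variation attributable to heterogeneity, floored at 0.
    i2 = max(0.0, (q - df) / q) * 100.0 if q > 0 else 0.0
    return pooled, pooled_se, q, i2


# Hypothetical per-center effect estimates and standard errors (AZ, DK, OK).
betas = [-0.20, -0.28, -0.25]
ses = [0.10, 0.06, 0.06]
pooled, pooled_se, q, i2 = fixed_effect_meta(betas, ses)
print(f"pooled beta = {pooled:.3f} (SE {pooled_se:.3f}), Q = {q:.2f}, I2 = {i2:.1f}%")
```

An I² near 0% indicates that the center-specific estimates agree about as well as sampling error alone would predict, whereas values approaching 100% indicate that most of the observed variation reflects real differences between centers.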

Additional analysis

We used fine mapping and conditional analysis to identify independent SNP associations within loci for CRP. We focused on regions of chromosomes 1, 12 and 19 (included in Table 1) due to better coverage of SNPs in these regions. For SNPs identified in the literature but not available on the MetaboChip, we used a strategy of examining all SNPs within a 1 Mb span of the published SNP, given that LD information for AI populations is not available. Conditional association analysis was then conducted, using the proxy SNPs as covariates. The significance threshold for a newly identified, secondary independent association was calculated as 0.05 divided by the number of independent SNPs in the region.

Results

The descriptive characteristics of CRP and other covariates, stratified by recruitment center, are displayed in Table 2. Women exhibited significantly higher mean CRP than men (3.64 ± 2.84 mg/dl vs 2.27 ± 2.90 mg/dl). The CRP levels were highest (3.61 ± 2.74 mg/dl) in the AZ center and lowest (2.58 ± 3.01 mg/dl) in the DK center. Individuals from the AZ center had the highest prevalence of DM and obesity compared to the other centers. In contrast, SHFS participants from the DK center had the highest prevalence of current smokers. The OK center had the highest prevalence of women with menopause. The descriptive characteristics of the subsample with genotypes (N = 1,892) included in the MetaboChip analysis are also given in Table 2. Essentially all of the MetaboChip cohort (99.6%) were included in the linkage analysis; but due to the exclusion of those with DM, only 70.1% of the linkage cohort were included in the MetaboChip analysis.

Genome-wide linkage analysis of CRP

To estimate the proportion of the CRP level variance due to genetic effects (heritability), we used the full SHFS population. Heritability was significant for lnCRP levels (h² = 0.33 ± 0.05 with adjustment for demographic covariates in model 2, p < 1.3 × 10⁻²⁰) (Table 3). Further adjustment for smoking (model 3), measures of obesity and serum lipid levels (model 4), DM (model 5), or hypertension (model 6) provided similar estimates. Heritability estimates from center-stratified analyses are shown in Table 3, with the highest at 0.71 ± 0.16 in the Arizona center. Using the best heritability model (model 3), we next examined the evidence of linkage for lnCRP. Logarithm of the odds (LOD) scores > 1.9 are generally accepted as suggestive genome-wide, multipoint linkage [63]. Thus there was suggestive evidence (LOD = 1.90) from model 3 for linkage of CRP to a locus on chr 6, 191 cM, near marker D6S281, corresponding to a physical position at approximately 169.6 Mb (GRCh38/hg38). The signal was maximal with these typical adjustments for smoking and demographic covariates, and was only slightly attenuated after inclusion of DM (LOD = 1.83) and hypertension (LOD = 1.71) in the model. Adjustment of model 2 for measures of adiposity, systolic blood pressure and serum lipids markedly attenuated the signal (LOD = 0.41) at this position. The LOD score was reduced in each center-specific analysis at this locus, with the highest being 1.36 for model 2 in the Dakota center and 1.28 for the OK center. The AZ center failed to show any sign of linkage at this locus, but was hindered by a small sample size at that center. At chr 16, the strongest linkage signal was found with model 2 at 51 cM (D16S3068) with a LOD score of 2.24 in the DK center. At this locus, however, the other centers show virtually no signal, suggestive of a population-specific association. No individual center showed any noteworthy signals on chr 19, with the maximum being 1.09 in the DK center. A maximum LOD score of 2.36 was noted in the AZ center on chr 18 at 36 cM; but there was no corresponding signal in the DK and OK centers, with the maximal LOD score of 1.30 in the DK center observed at 52 cM.
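To give a rough sense of scale for the LOD scores above, a LOD score can be converted to a likelihood-ratio chi-square and an approximate pointwise p-value. The sketch below is an illustration only and was not part of the analysis: it assumes the usual asymptotic result for testing a single variance component on the boundary (a 50:50 mixture of a point mass at zero and a 1-df chi-square), and the function names are hypothetical.

```python
import math


def lod_to_chi2(lod):
    """Likelihood-ratio chi-square equivalent of a LOD score: 2 * ln(10) * LOD."""
    return 2.0 * math.log(10.0) * lod


def lod_to_pointwise_p(lod):
    """Approximate pointwise p-value under a 50:50 mixture of 0 and chi-square(1 df)."""
    if lod <= 0:
        return 1.0
    chi2 = lod_to_chi2(lod)
    # Upper tail of chi-square with 1 df is erfc(sqrt(x / 2)); halve it for the mixture.
    return 0.5 * math.erfc(math.sqrt(chi2 / 2.0))


for lod in (1.90, 2.24, 2.36):
    print(f"LOD {lod:.2f} -> chi2 {lod_to_chi2(lod):.2f}, pointwise p ~ {lod_to_pointwise_p(lod):.1e}")
```

These are pointwise values; genome-wide interpretation, such as the suggestive threshold cited in the text, additionally accounts for the number of positions scanned.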

Genome-wide analysis of SNPs for association with CRP

The main findings from association analyses using the MetaboChip genotyping data and standard covariates (age, age², age*sex, age²*sex, sex2 and PCs) are summarized in Table 4 (restricted to those with a p-value less than 7.0 × 10⁻⁵). Considering all three centers together, no SNPs demonstrated a Bonferroni-corrected, MetaboChip-wide, statistically significant association (p-value < 7.77 × 10⁻⁷). In the interest of conserving space, Table 4 omits 2 additional SNPs at the PHACTR1 locus, two at TARID, one at RP1L1, one at TCF7L2 and two at HNF1A, which also had association p-values less than 7.0 × 10⁻⁵.
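The z-scores and p-values reported for the combined analysis in Table 4 can be related with a two-sided normal tail probability. A minimal sketch, assuming the z-scores are approximately standard normal under the null hypothesis; the function name is hypothetical and the SNP values are copied from Table 4:

```python
import math


def two_sided_p(z):
    """Two-sided p-value for a standard normal z-score: erfc(|z| / sqrt(2))."""
    return math.erfc(abs(z) / math.sqrt(2.0))


# (SNP, z-score) pairs taken from the combined-center rows of Table 4.
for snp, z in [("rs10733682", 4.81), ("rs2393791", 4.59), ("rs1127343", 4.54)]:
    print(snp, f"z = {z:+.2f}", f"p ~ {two_sided_p(z):.1e}")
```

Small differences from the tabulated p-values are expected, because the published z-scores are rounded to two decimal places.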
Table 4

MetaboChip results, combined center analysis, by chromosome, with association p-values (maximum of 7.0 X 10−5).

| SNP | Gene* | Chr | Min/maj allele** | z-score*** | P | Physical coordinates**** | MAF |
|---|---|---|---|---|---|---|---|
| rs7595184 | CENTG2 | 2 | A/G | 4.01 | 6.2 X 10−5 | 235,596,138 | 0.04 |
| rs7617596 | FOXP1 | 3 | A/G | -4.03 | 5.6 X 10−5 | 71,472,343 | 0.329 |
| rs1127343 | C3orf28 | 3 | A/G | 4.54 | 5.6 X 10−6 | 122,409,547 | 0.147 |
| rs7635320 | MDS1 | 3 | A/G | -3.99 | 6.7 X 10−5 | 169,246,830 | 0.356 |
| rs4583704 | ZNF509 | 4 | C/G | 4.03 | 5.5 X 10−5 | 4,372,749 | 0.486 |
| rs200200 | FARS2 | 6 | A/G | 4.34 | 1.4 X 10−5 | 5,443,407 | 0.03 |
| rs9472752 | PHACTR1 | 6 | A/G | 4.01 | 6.1 X 10−5 | 12,863,902 | 0.383 |
| rs7740975 | SLC35F1 | 6 | A/G | 4.24 | 2.2 X 10−5 | 118,158,542 | 0.112 |
| rs1966248 | TARID | 6 | A/T | -4.07 | 4.8 X 10−5 | 133,838,484 | 0.342 |
| rs1294948 | RAMP3 | 7 | A/C | -4.2 | 2.7 X 10−5 | 45,172,191 | 0.216 |
| rs7814795 | RP1L1 | 8 | A/G | -4.4 | 1.1 X 10−5 | 10,661,775 | 0.324 |
| rs2577888 | YTHDF3 | 8 | A/C | 4.08 | 4.5 X 10−5 | 64,364,912 | 0.257 |
| rs10739202 | PTPRD | 9 | A/G | -4 | 6.2 X 10−5 | 9,897,289 | 0.013 |
| rs10733682 | LMX1B | 9 | A/G | 4.81 | 1.5 X 10−6 | 126,698,635 | 0.243 |
| rs4132670 | TCF7L2 | 10 | A/G | -4.06 | 5.0 X 10−5 | 113,008,012 | 0.108 |
| rs2393791 | HNF1A | 12 | A/G | 4.59 | 4.5 X 10−6 | 120,986,153 | 0.409 |
| rs927791 | FGF9 | 13 | A/G | 4.03 | 5.6 X 10−5 | 21,734,552 | 0.114 |
| rs4531650 | EGLN3 | 14 | C/G | 4.07 | 4.7 X 10−5 | 34,047,374 | 0.478 |
| rs7143416 | SLC24A4 | 14 | A/G | 4.23 | 2.4 X 10−5 | 92,452,449 | 0.284 |
| rs3794808 | SLC6A4 | 17 | A/G | 4.13 | 3.7 X 10−5 | 30,204,775 | 0.385 |
| rs9902290 | SNIP | 17 | C/G | 3.98 | 7.0 X 10−5 | 38,651,325 | 0.044 |

Center specific analysis

| SNP | Gene | Chr | Min/maj allele | β | P | Physical coordinates | MAF |
|---|---|---|---|---|---|---|---|
| Arizona | | | | | | | |
| rs1877715 | CXCR1 | 2 | A/G | -1.02 | 1.2 X 10−6 | 218,187,823 | 0.07 |
| rs704951 | XYLB | 3 | G/A | -1.36 | 3.3 X 10−8 | 38,372,036 | 0.06 |
| rs12356821 | WBP1L | 10 | C/G | -1.93 | 1.1 X 10−8 | 102,804,051 | 0.03 |
| rs11195703 | ncRNA | 10 | G/A | -0.73 | 3.6 X 10−6 | 111,741,277 | 0.13 |
| rs72858840 | SBF2 | 11 | A/C | -2.15 | 5.6 X 10−7 | 10,033,699 | 0.02 |
| rs12939525 | CEP131 | 17 | G/A | -1.15 | 1.2 X 10−6 | 81,220,205 | 0.07 |
| Dakota | | | | | | | |
| rs12735411 | ATP1A1 | 1 | C/T | -0.48 | 1.3 X 10−5 | 116,353,438 | 0.07 |
| rs1205 | CRP | 1 | A/G | -0.23 | 2.4 X 10−5 | 159,712,443 | 0.52 |
| rs1341665 | CRP | 1 | A/G | 0.24 | 1.6 X 10−5 | 159,721,769 | 0.47 |
| rs11986935 | PINX1 | 8 | A/T | -0.27 | 4.4 X 10−5 | 10,834,039 | 0.25 |
| rs1328648 | DCLK1 | 13 | A/G | 0.23 | 3.2 X 10−5 | 36,148,879 | 0.43 |
| rs74876483 | SKOR1 | 15 | C/T | 0.34 | 2.0 X 10−5 | 67,861,793 | 0.14 |
| Oklahoma | | | | | | | |
| rs4895389 | TARID | 6 | C/T | -0.28 | 2.0 X 10−6 | 133,838,014 | 0.34 |
| rs1969783 | TARID | 6 | C/T | -0.28 | 2.6 X 10−6 | 133,838,261 | 0.35 |
| rs1966248 | TARID | 6 | A/T | -0.29 | 1.3 X 10−6 | 133,838,484 | 0.34 |
| rs231350 | KCNQ1 | 11 | A/C | -0.26 | 5.5 X 10−6 | 2,692,419 | 0.39 |
| rs2014429 | PAUPAR | 11 | A/G/T | 0.26 | 8.7 X 10−6 | 31,919,489 | 0.39 |

* Most proximal candidate gene.

** Minor allele is effect allele, major is referent.

*** inverse weighted average of three centers.

**** GRCh38.p7, dbSNP build 1.
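The relationship between the z-scores and p-values in Table 4 can be illustrated with a short sketch. The first function is the standard two-sided normal p-value. The second shows one common way to pool per-center estimates (fixed-effects, inverse-variance weighting); whether this matches the exact weighting used for the Table 4 z-scores is an assumption, and the per-center effect sizes and standard errors in the example are hypothetical.

```python
import math

def two_sided_p(z: float) -> float:
    """Two-sided p-value for a standard normal z-score."""
    return math.erfc(abs(z) / math.sqrt(2.0))

def inverse_variance_z(betas, ses):
    """Pooled z-score from per-center effect sizes and standard errors,
    using fixed-effects inverse-variance weights (w_i = 1 / se_i^2)."""
    weights = [1.0 / se ** 2 for se in ses]
    pooled_beta = sum(w * b for w, b in zip(weights, betas)) / sum(weights)
    pooled_se = math.sqrt(1.0 / sum(weights))
    return pooled_beta / pooled_se

# z-scores taken from Table 4 (combined center analysis)
for snp, z in [("rs1127343", 4.54), ("rs10733682", 4.81), ("rs2393791", 4.59)]:
    print(f"{snp}: z = {z:+.2f}, two-sided p = {two_sided_p(z):.1e}")

# Hypothetical per-center betas and standard errors, for illustration only
z_meta = inverse_variance_z([0.25, 0.20, 0.30], [0.10, 0.08, 0.15])
print(f"pooled z = {z_meta:.2f}, two-sided p = {two_sided_p(z_meta):.1e}")
```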

These results corroborate previous literature reports of association between CRP and the SNPs in Table 1. Of the 77 variants previously linked to CRP, 27 were also identified as having similar associations in the present study, including 4 SNPs in the CRP region, all with p-values less than 9 x 10−4, and 10 SNPs at HNF1A with p-values ranging from 0.07 to 4.4 x 10−6 for rs2393791. Of interest, a recent meta-analysis of over 200,000 Europeans found 58 novel variants significantly associated with CRP [64], including rs178810 on chr 17, which was replicated in the current study (p-value 0.038). At the previously reported chr 19 region spanning APOE, TOMM40 and APOC1 [19,65], there were 3 SNPs with duplicate genotyping in the current study, all with nominal significance and direction of effect concordant with the literature [19,65]. We found two additional, nominally significant SNPs relatively close to others reported to be associated with CRP level: rs7635320, within 1.5 Mb of reported rs1514895 on chr 3, and rs7143416, within 2 Mb of reported rs112635299 on chr 14. Alternatively, when the available MetaboChip SNPs were searched by region for the genes listed in Table 1, there were 3 SNPs intronic to JAK1 (and approximately 400 Kb from the reported LEPR gene) on chr 1, all with association p-values less than 4.5 x 10−3. Also on chromosome 1, we found 5 SNPs in the IL6R region with association p-values less than 0.05. At the GCKR locus of chromosome 2, seven SNPs of nominal significance were identified. The above-mentioned meta-analysis also found 2 additional variants (implicating FRK and ABO on chr 6 and 9, respectively) [64]; these regions have 7 and 4 SNPs represented in the present analysis, all with p-values less than 0.05. 
S2 Table highlights 3 clusters of nominally significant SNPs co-localizing with the CRP, HNF1A, and TOMM40/APOE/APOC1 regions, which include five SNPs within a 39 Kb region, 19 SNPs within 43 Kb, and 6 SNPs within 27 Kb, respectively. In addition, another 7 compact chromosomal regions are shown (S1 Table), suggesting that these SNPs are in linkage disequilibrium, potentially with a functional variant. These include a cluster of 6 SNPs within 28 Kb on chr 2, and a cluster of 5 SNPs within a 194 Kb region of chr 5. The latter group is within ~1 Mb of a novel SNP recently reported to be associated with CRP [64]. On chr 6 there are two groups, one of 12 SNPs within a 148 Kb region (centered at 12,825,000) and one of 5 SNPs within 55 Kb (centered at 133,850,000). The latter is within 2 and 4 Mb of two previously reported associated SNPs. A group of 5 SNPs clustered within 10 Kb is found on chr 9, within ~200 Kb of a newly reported SNP [64]. There are 10 SNPs centered around 113,120,000 and within 243 Kb on chr 10, and another 3 SNPs reside within a 17 Kb area of chr 17. SNP density in these clusters ranges from 1:1,800 bp to 1:39,000 bp; and except for the chr 19 cluster, all of the p-values are <1 x 10−3. See S1–S10 Figs for locus zoom (A) and linkage disequilibrium plots (B) for each of these clusters in regions of chromosomes 1, 2, 5, 6a/b, 9, 10, 12, 17 and 19.
Center-specific analysis (Table 4, last section) revealed considerable differences between centers in strength of association among SNPs. For example, the OK center showed consistently strong association (all p-values less than 2.6 x 10−6) for three chr 6 SNPs tightly clustered within only 470 bp of each other and within an intron of the TARID gene, which also encompasses the chr 6 SNP cluster at ~133,850,000 shown in Table 4. The DK center showed strongest association for 2 SNPs at the CRP locus, but essentially no apparent association with the top SNPs from the other centers. 
The AZ center also showed substantial association (p-values from 2.3 x 10−6 to 7 x 10−5) with SNPs at chr 6, but in loci quite distant from the clusters identified in the overall analysis. Although other SNPs showed association p-values below the genome-wide threshold, there were only 171 participants at the AZ center and there was minimal overlap with loci in the overall analysis or the other centers. Chromosome 11 also contained a large cluster of SNPs with maximum association p-values of ~1 x 10−6, but the minor allele frequencies were quite low, leaving the results dependent on as few as 6 individuals from the total of 171.
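The cluster sizes quoted above for the 7 novel regions can be tallied with a short sketch. The SNP counts and spans are taken directly from the text; treating span divided by SNP count as the "spacing" is an assumption about how such averages are formed, so the result is only an approximate cross-check of the SNP total and average spacing given in the abstract.

```python
# (number of SNPs, span in Kb) for each novel cluster, as quoted in the text
CLUSTERS = {
    "chr 2": (6, 28),
    "chr 5": (5, 194),
    "chr 6a": (12, 148),
    "chr 6b": (5, 55),
    "chr 9": (5, 10),
    "chr 10": (10, 243),
    "chr 17": (3, 17),
}

def kb_per_snp(n_snps: int, span_kb: float) -> float:
    """Average span per SNP within a cluster, in Kb."""
    return span_kb / n_snps

total_snps = sum(n for n, _ in CLUSTERS.values())
total_span_kb = sum(span for _, span in CLUSTERS.values())
mean_kb_per_snp = total_span_kb / total_snps

for name, (n, span) in CLUSTERS.items():
    print(f"{name}: {n} SNPs in {span} Kb, about {kb_per_snp(n, span):.1f} Kb per SNP")
print(f"Total: {total_snps} SNPs over {total_span_kb} Kb, "
      f"about {mean_kb_per_snp:.1f} Kb per SNP")
```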

Conditional analysis for fine mapping

We conducted linkage analysis conditional on our top SNPs on chromosomes 5 and 6. For model 4, our topmost signal was on chromosome 5 (LOD = 2.00). Our association analysis identified four SNPs on chromosome 5 (p < 7 x 10−4), however they were not in the same region as our linkage signal and thus adjusting for the SNPs did not change the linkage signal (LOD = 2.00). For model 3, our best linkage hit was on chromosome 6 (LOD = 1.9). There were two clusters of SNPs on chromosome 6 which were associated with CRP at a significance level of < 7 x 10−4. None of these regions overlapped with our linkage signal. The SNPs with at least nominal association with CRP and closest to our linkage region were about 34Mb apart. On chromosome 1, we identified a 5Mb region from 154,453,788 to 159,730,249 that contained most of the significant SNPs in the literature. Of the 20 SNPs in this region, only rs2794520, rs1205, rs1341665 and rs3091244 were genotyped in our dataset. For the rest of the SNPs we scanned the region plus and minus 500kb to find proxy SNPs. We considered proxy SNPs to be those that are significantly associated with CRP and are in LD (r2 ≥ 0.8) with one of the four SNPs in a European population. We found that except two SNPs of the IL6R gene and four SNPs (rs1417938, rs4131568, rs1800947, rs3093058) of CRP, the remaining 14 SNPs were in strong LD with each other. Since none of the other SNPs in the region were significantly associated with CRP, we conducted a conditional analysis with one variant (rs1205) of the LD block of 14 SNPs. With adjustment for rs1205, the CRP SNPs rs2592887, rs1470515, rs2794520 and rs1341665 became non-significant, with p-values of 0.26, 0.35, 0.41 and 0.63 respectively. For chromosome 12, we did not find any SNPs to be in LD with significant SNPs in other populations. However, since our best association signal was on chromosome 12 (HNF1A), we conducted additional analysis conditional on rs2393791 to identify secondary signals. 
Similar to previous analysis, loci on chromosome 12 were no longer significant and others showed minimal changes. For chromosome 19, we found rs8106922 to be a proxy SNP for rs769450 which was significantly associated with CRP in other populations but not genotyped in our cohort. Analysis conditional on rs8106922, however, showed no change in p-values. In summary, conditional analysis showed loss of nominal association when our initial chr 1 results were adjusted with rs1205 as a covariate; and similar loss of association for chr 12 SNPs in the HNF1A region, when adjusting for rs2393791. See S3 Table for details.
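The proxy-SNP criterion described above (LD r2 ≥ 0.8 with a lead SNP) can be sketched as follows. This version estimates r2 as the squared Pearson correlation of genotype dosages (0, 1 or 2 copies of the minor allele), which is a common approximation to haplotype-based r2; the genotype vectors and function names are hypothetical and do not represent the study data or the reference-panel lookup actually used.

```python
def ld_r2(g1, g2):
    """Squared Pearson correlation between two genotype dosage vectors."""
    if len(g1) != len(g2) or not g1:
        raise ValueError("genotype vectors must be non-empty and of equal length")
    n = len(g1)
    m1, m2 = sum(g1) / n, sum(g2) / n
    cov = sum((a - m1) * (b - m2) for a, b in zip(g1, g2))
    var1 = sum((a - m1) ** 2 for a in g1)
    var2 = sum((b - m2) ** 2 for b in g2)
    if var1 == 0 or var2 == 0:
        raise ValueError("monomorphic SNP: r2 is undefined")
    return cov * cov / (var1 * var2)

def is_proxy(lead, candidate, threshold=0.8):
    """True if the candidate SNP meets the r2 threshold with the lead SNP."""
    return ld_r2(lead, candidate) >= threshold

# Hypothetical dosages for eight individuals
lead_snp = [0, 1, 2, 1, 0, 2, 1, 0]
linked_snp = [0, 1, 2, 1, 0, 2, 1, 1]    # differs in one individual
unlinked_snp = [2, 0, 1, 1, 0, 1, 2, 0]  # little correlation with the lead

print(f"r2 (linked)   = {ld_r2(lead_snp, linked_snp):.2f}")
print(f"r2 (unlinked) = {ld_r2(lead_snp, unlinked_snp):.2f}")
```

A SNP in strong LD with the conditioning variant carries nearly the same information, which is why its association is expected to disappear once the lead SNP is included as a covariate.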

Functional annotation of identified variants

The associated SNPs are grouped into clusters based on their LD patterns (Table 4). We used RegulomeDB [66] and HaploReg [67,68] to functionally annotate CRP-associated variants. RegulomeDB showed that rs1969783 and rs1966248 of TARID and rs5629931 of TCF7L2 had a score of 3a (less likely to affect binding) and rs2592887 and rs1205 of CRP, rs4895389 of TARID and rs3405329 and rs6721844 of LDAH had lower scores of 4 (minimal binding evidence). The LDs shown by HaploReg (S4 Table), based on European populations by default, are very similar to the LD patterns found in the present study. In addition, several SNPs overlap with promoter and enhancer histone marks and DNase hypersensitivity regions in various tissues (S4 Table).

Discussion

The results reported here further inform our understanding of inherited genetic influences on baseline serum CRP levels, by examining a family-based sample with unique ethnic and environmental characteristics, through the use of linkage, focused SNP association analyses, and bio-informatic methods. Previously unrecognized loci suggesting an association with serum CRP levels include a linkage signal at chr 6, 181–194 cM and three SNPs with among the lowest association p-values within our study, located approximately 34 Mb centromeric to the linkage peak. Further support for an effect from this region is seen in center specific linkage and SNP association analysis from Oklahoma. The MetaboChip genotype association results also support earlier findings in proximity to the CRP gene (chr 1), the KCNE4 and GCKR genes (chr 2), HNF1A (chr 12), and TOMM40, APOE (chr 19) genes that have been previously associated with CRP expression, as noted in Table 1. In addition, the clustering of groups of highly associated SNPs within very limited regions on chromosomes 5, 6, 9, 10, and 17 suggests the existence of novel loci, even though the p-values are above the Bonferroni, genome-wide adjusted threshold. Fine mapping suggested lack of secondary associated SNPs at two regions near the CRP and HNF1A genes. The heritability estimate of CRP from the present study is 0.33 (p<1.3 x 10−20) compared with similarly-adjusted, previous findings for American Indians (0.38) [32], African Americans (0.45) [69], Chinese (0.38) [70] and non-Hispanic whites (0.40) [57]. The strongest linkage signal, across all centers, was between 189 and 191 cM on chr 6 with adjustment for typical demographic variables; and there was minimal attenuation with further adjustment for DM and hypertensive status. Center specific analyses in both DK and OK revealed linkage peaks in the same chr 6 region, however there was an absence of signal in the AZ center, perhaps due to the small number of participants there. 
While higher LOD scores were seen in some center-specific analyses, only a LOD of 1.09 in the DK center corresponded with the previously identified chr 19 locus [20,24,71,72], otherwise there seemed to be no correlation between centers, with the MetaboChip association results, or with reports in the literature, as summarized in Table 1. As CRP levels are clearly correlated with measures of obesity [73] and there is a genetic correlation between physical activity and LDL-C [44], it is possible that inclusion of covariates of adiposity, lipids and blood pressure could result in "over-adjustment", reducing power to detect linkage. While there have been relatively few linkage studies of CRP [74-76], Ding et al reported a LOD of 0.49 on chr 6, 187cM among African Americans [18], and a LOD score of 1.7 (107cM) on chr 10 [18]. While the present linkage results failed to replicate this chr 10 result in American Indians, a suggestive cluster of SNPs was found approximately 30 Mb proximal to this locus. Another region on chr 6 (6q16.1) has been associated with plasma CRP levels in a cohort of Filipino women [25], and lies within ~38 Mb of the present findings at 133,000,000. Inspecting loci where SNPs influencing CRP expression have been reported, for example, on chr 1 (at the CRP gene) [24,77], chromosomes 2 [21,64], 12 [72,77], and 19 [20,24], our results failed to show a linkage signal, with the highest LOD score (1.41) at chr 19, 96cM (APOE and TOMM40 genes). The MetaboChip data showing 12 SNPs clustered in a span of about 150Kb at position 12,800,000 and another 5 SNPs within 55Kb around 133,850,000 in the 6p22.3 and 6q22.31 regions may represent extended regions of LD which contain a functional variant influencing CRP levels. The first cluster of 12 SNPs is intronic to the PHACTR1 gene, which plays a role in endothelial cell survival and is associated with susceptibility to myocardial infarction and coronary artery disease [78]. 
The second cluster of 5 SNPs is within the TCF21 antisense RNA inducing promoter demethylation (TARID) gene [79]. Variants within TARID or its target (i.e., TCF21) are associated with coronary artery disease [80,81], blood pressure [82], cis-effects on circulating cytokines [83], and visceral fat [84]. Between the above two clusters lies rs7740975 (association p-value 2.2 X 10−5 in the present study), which is intronic to solute carrier family 35 member F1 (SLC35F1), a member of the SLC35 family of transporters which aid in the formation of glycoproteins in the Golgi apparatus and endoplasmic reticulum [85]. Variants of this member of the solute carrier gene family have been associated with a number of cardiovascular disease phenotypes related to hypertension [86], congestive heart failure [87], obesity [88], heart rate [89], and electrocardiographic QT interval [90], as have some polymorphisms of the CRP gene [17,91-93]. Of note, the gene G protein-coupled receptor class C group 6 member A (GPRC6A) is only 500 Kb distal to this SNP and has demonstrated effects on CRP levels [21]. A search of dbSNP failed to reveal any other significant citations of SNPs from this region on chr 6 in relation to effects on serum levels of CRP. On further examination of the MetaboChip results in relation to the apparent clusters of associated SNPs, or possible haplotypes, we find that the group on chr 1 between 159,683,149 and 159,721,769 very clearly overlaps the CRP gene and contains documented, functional SNPs such as the 3' UTR SNP, rs1205 [24,25]. The rs2592887 SNP in this region is in linkage disequilibrium with rs876537, which is associated with CRP in both European and African American populations [94]. The chr 2 cluster between 20,961,892 and 20,989,723 is approximately 25 Kb from the 3' end of the apolipoprotein B (APOB) gene. 
This gene is intricately involved with lipid metabolism and regulation, and is associated with a number of cardiovascular disease entities [95]. This set of SNPs is also 150 Kb 5' from the lipid droplet associated hydrolase (LDAH) gene, which is similarly involved in lipid metabolism and shows increased expression within the macrophages of human atherosclerotic lesions [96]. Five SNPs within 200 Kb on chr 5 reside very near the CPEB4 gene, variants of which have been related to control of inflammation and obesity [97]. Within 6 Kb of SLC2A6 on chr 9, a group of 5 SNPs is nominally associated with CRP. This gene is involved with hexose transport in brain, spleen and leukocytes [98], and shows increased expression in chronic lymphocytic leukemia [99], but a PubMed search reveals no apparent relevance to clinical inflammation or CRP expression. Another suggestive cluster is found on chr 10, comprising 10 SNPs within 243 Kb, all but one of which are intronic to the transcription factor 7 like 2 (TCF7L2) gene. TCF7L2 is instrumental in the Wnt signaling pathway [100] and variants are well known to be associated with risk of DM and its complications [101]. Variants of this gene also alter CRP levels in response to drug treatment [102]. Consistent with other studies, there are 19 SNPs within a 43 Kb span on chr 12 encompassing the HNF1A gene, a hepatic transcription factor which has been repeatedly found to affect CRP expression [24,77], as well as C12orf43, variants of which have been linked to cardiovascular disease [103] and CRP expression [20]. Mutations of HNF1A are known to cause maturity-onset diabetes of the young, type 3 (MODY3) [104], and polymorphisms are associated with risk of DM and atherosclerotic vascular disease [105]. Lastly, 3 SNPs on chr 17 lie within 17 Kb, within or between MYL4 and CDC27, the latter known to influence TGF-beta [106], a strong modulator of inflammatory response [107]. 
A rather extensive literature exists associating SNPs with serum CRP [20,21,24,25,72,77,108]. Some of the more compelling reports are summarized in Table 1. Besides CRP, the LEPR region on chr 1 has suggestive findings in the current study, and three SNPs intronic to JAK1 show nominal significance. The latter gene plays a key role in immune response pathways [109] and is within 400 Kb of LEPR. Conversely, the IL6R region fails to indicate any signal, including rs4129267 (p = 0.17). Our results in the GCKR region reveal a cluster of SNPs, with maximal association p-values of 3.5 X 10−3. Although 31 SNPs were genotyped in the EPHA and many in the IL6 regions, no indications of association with CRP were found. Current findings related to the HNF1A gene are noted above. Our results highlight 5 SNPs in the APOE, TOMM40, APOC1 area, with p-values for association all less than 6.8 X 10−3. The TOMM40 protein is a component of the mitochondrial membrane and mutations appear to contribute to risk of Alzheimer's disease and other aging phenomena [110,111]. The APOE and APOC1 genes play important roles in lipid metabolism and are associated with clinical conditions dependent on this function [112,113]. Like most genetic association studies, we identified several noncoding variants associated with CRP levels. Although noncoding regions do not affect mRNA sequence, they may regulate other factors involved in the transcription or regulation of the genes. We used RegulomeDB [66] and HaploReg [67,68] to functionally annotate variants; and several, such as rs1205, rs35131127, rs1969783, and rs1169310, showed potential evidence of functionality (S4 Table).
Limitations of this study include marginally significant findings after conservative, Bonferroni adjustment for multiple testing in a study population of a moderate sample size. 
This is ameliorated to some extent by the correlation between many groups of SNPs in the array and the fact that all of the MetaboChip SNPs were chosen for a priori evidence of association with cardiovascular phenotypes, which also relate to CRP. The somewhat indirect correspondence between the linkage and SNP association analyses shows different strengths of the methodologies, in that linkage appears more successful at identifying rare and family-specific variants whereas association analysis tends to rely more on common variants, as illustrated in an analysis by He et al [114]. Differences may also arise from the fact that the MetaboChip cohort excluded those with DM, perhaps minimizing the effect of variants that both predisposed to diabetes/obesity and increased CRP level. Our replication of an interaction (gender by microarray determined genotype) at certain loci (eg at rs12723357 on chr 1 and at rs17301021 on chr 15) is an important reminder that the potential for systematic, microarray genotyping errors is a problem that warrants careful attention [52]. An additional concern involved clusters of SNPs within constricted regions that showed strong association with CRP but also showed uniformly marginal HWE p-values (eg 9 SNPs on chr 6, all within 250 Kb and none with an HWE p-value greater than 2.2 X 10−4). In contrast to the clusters we thought pointed to regions harboring a functional variant (with HWE results well within an expected distribution), the anomalous clusters were interpreted as due to haplotypes identifying unique center background and thus spuriously associated with CRP due to the recognized differences in CRP by center. The underlying difference in CRP between centers could be due to genetic influences, but could also reflect environmental factors. In either case, this probably represents an example of population stratification when the analysis addresses all centers combined. 
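The Hardy-Weinberg equilibrium (HWE) check mentioned above can be illustrated with a minimal sketch using the classical 1-degree-of-freedom chi-square goodness-of-fit test. The source does not state which HWE test was applied (an exact test is a common alternative), so this choice is an assumption, and the genotype counts in the example are hypothetical.

```python
import math

def hwe_chi2_test(n_aa: int, n_ab: int, n_bb: int):
    """Chi-square test (1 df) for deviation from Hardy-Weinberg proportions.

    Returns (chi2, p_value) given counts of the three genotypes.
    """
    n = n_aa + n_ab + n_bb
    if n == 0:
        raise ValueError("no genotypes supplied")
    p = (2 * n_aa + n_ab) / (2 * n)  # frequency of allele A
    q = 1.0 - p
    if p == 0.0 or q == 0.0:
        raise ValueError("monomorphic SNP: HWE test is undefined")
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    # Upper tail of chi-square with 1 df: P(X > x) = erfc(sqrt(x / 2))
    return chi2, math.erfc(math.sqrt(chi2 / 2.0))

# Hypothetical genotype counts: one near HWE, one with a heterozygote deficit
for label, counts in [("near HWE", (250, 500, 250)),
                      ("heterozygote deficit", (300, 400, 300))]:
    chi2, p_value = hwe_chi2_test(*counts)
    print(f"{label}: chi2 = {chi2:.2f}, p = {p_value:.2g}")
```

A heterozygote deficit of this kind is the pattern expected when subpopulations with different allele frequencies are pooled, which is consistent with the population stratification interpretation given above.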
The strengths of this study include a population-based ascertainment of samples from communities with unique environmental and genetic backgrounds, extensive covariate information collected in a prospective manner, and the use of two complementary genetic analysis methods. From a broader perspective, the SHS [44] has represented a relatively successful collaboration with the participating tribal communities since inception in 1998. Sustaining a mutually beneficial engagement at this level requires considerable effort from both parties; but we feel the history of the Strong Heart Study can provide a useful model for this type of research.

Genotype by gender interaction for two anomalous SNPs.

SNP clusters of interest.

Association analysis of SNPs conditional on rs1205 and rs2393791 genotype.

Bioinformatic analysis of SNP clusters associated with serum CRP levels utilizing HaploReg and RegulomeDB.

Locus zoom (A) and linkage disequilibrium plots (B) for chromosome 1.

Locus zoom (A) and linkage disequilibrium plots (B) for chromosome 2.

Locus zoom (A) and linkage disequilibrium plots (B) for chromosome 5.

Locus zoom (A) and linkage disequilibrium plots (B) for chromosome 6a.

Locus zoom (A) and linkage disequilibrium plots (B) for chromosome 6b.

Locus zoom (A) and linkage disequilibrium plots (B) for chromosome 9.

Locus zoom (A) and linkage disequilibrium plots (B) for chromosome 10.

Locus zoom (A) and linkage disequilibrium plots (B) for chromosome 12.

Locus zoom (A) and linkage disequilibrium plots (B) for chromosome 17.

Locus zoom (A) and linkage disequilibrium plots (B) for chromosome 19.