Literature DB >> 30498476

Heritability and Genome-Wide Association Study of Plasma Cholesterol in Chinese Adult Twins.

Hui Liu1, Weijing Wang1, Caixia Zhang1, Chunsheng Xu2, Haiping Duan2, Xiaocao Tian2, Dongfeng Zhang1.   

Abstract

Dyslipidemia represents a strong and independent risk factor for cardiovascular disease. Plasma cholesterol, such as total cholesterol (TC), low density lipoprotein cholesterol (LDL-C), and high density lipoprotein cholesterol (HDL-C), is the common indicator of diagnosing dyslipidemia. Here based on 382 Chinese twin pairs, we explored the magnitude of genetic impact on TC, HDL-C, LDL-C variation and further searched for genetic susceptibility loci for them using genome-wide association study (GWAS). The ACE model was the best fit model with additive genetic parameter (A) accounting for 26.6%, common or shared environmental parameter (C) accounting for 47.8%, unique/non-shared environmental parameter (E) accounting for 25.6% for the variance in HDL-C. The AE model was the best fit model for TC (A: 61.4%; E: 38.6%) and LDL-C (A: 65.5%; E: 34.5%). While no SNPs reached the genome-wide significance level (P < 5 × 10-8), 8, 14, 9 SNPs exceeded the suggestive significance level (P < 1 × 10-5) for TC, HDL-C, LDL-C, respectively. The promising genetic regions for TC, HDL-C, LDL-C were on chromosome 11 around rs7107698, chromosome 5 around rs12518218, chromosome 2 around rs10490120, respectively. Gene-based analysis found 1038, 1033 and 1090 genes nominally associated with TC, HDL-C, LDL-C (P < 0.05), especially FAF1, KLKB1 for TC, KLKB1 for HDL-C, and NTRK1, FAF1, SNTB2 for LDL-C, respectively. The number of common related genes among TC, HDL-C and LDL-C was 71, including FAF1, KLKB1, etc. Pathway enrichment analysis discovered known related pathways-zinc transporters, metal ion SLC transporters for TC, cell adhesion molecules CAMs, IL-6 signaling for HDL, FC epsilon RI signaling pathway, NFAT pathway for LDL, respectively. In conclusion, the TC and LDL-C level is moderately heritable and the HDL-C level is lowly heritable in Chinese population. The genomic loci, functional genes and pathways are identified to account for the heritability of plasma cholesterol level. Our findings provide important insights into plasma cholesterol molecular physiology and expect future research to replicate and validate our results.

Entities:  

Keywords:  GWAS; HDL; LDL; cholesterol; dyslipidemias; genetics; heritability; twins

Year:  2018        PMID: 30498476      PMCID: PMC6249314          DOI: 10.3389/fendo.2018.00677

Source DB:  PubMed          Journal:  Front Endocrinol (Lausanne)        ISSN: 1664-2392            Impact factor:   5.555


Introduction

Dyslipidemia represents a strong and independent risk factor for cardiovascular disease (1), a leading cause of death worldwide (2, 3). Besides, many studies indicate that dyslipidemia can increase the risk of preeclampsia, colorectal cancer, diabetic macular edema and polycystic ovary syndrome (4–7). The prevalence of dyslipidemia is high and is increasingly prevalent in China (8, 9). Plasma cholesterol, such as total cholesterol (TC), low density lipoprotein cholesterol (LDL-C), high density lipoprotein cholesterol (HDL-C), is a common indicator of diagnosing dyslipidemia. Therefore, exploring factors affecting plasma cholesterol homeostasis is a crucial step toward providing early prevention and therapeutic targets for dyslipidemia. The plasma cholesterol level is mediated by a combination of genetic and environmental factors. At present, the heritability for plasma cholesterol level has been estimated in several studies, with the heritability of TC, HDL-C, LDL-C level ranging from 0 to 89% (10–13), 22 to 93% (13–16), and 22 to 91% (10, 13, 16, 17), respectively. Additionally, several genome-wide association studies (GWAS) have attempted to find susceptible genetic loci located in the corresponding gene affecting plasma cholesterol level. A GWAS conducted by Middelberg RP et al. (18) for plasma cholesterol found genes LPL, LIPC, CETP associated with HDL-C level and CELSR2, APOB, TOMM40 with LDL-C level. Another two GWAS studies identified SRGAP2, HOXC13 and CD47, DUSP4 that were associated with HDL-C and TC level, respectively (19, 20). However, the known genetic variations only explain a small proportion of the genetic contribution and many potential genetic genes and loci remain to be discovered. Besides, allele frequencies, life style and environmental contributions differ between Chinese and other ethnic populations. Twin samples will have a higher power in genetic study, especially in human complex diseases (21). Here based on 382 Chinese twin pairs, we explored the genetic effect on TC, HDL-C, LDL-C variation and further searched for genetic susceptibility loci for these traits using GWAS (22).

Materials and methods

Twin samples collection and phenotypic measurement

The twin samples were collected from Qingdao Twin Registry and the details can be found in the literatures (23, 24). Twins who were pregnancy and lactation, as well as incomplete co-twin pairs were excluded. Twin pairs that taking cholesterol-lowering drugs also were excluded. The study included 382 twin pairs for heritability analysis and 139 dizygotic (DZ) twin pairs for GWAS with a mean age of 51.6 ± 7.7. All twin samples undertook the blood sampling and a physical examination after a 10–12 h overnight fast and completed a questionnaire. The zygosity of same sex and blood type was identified by using 16 multiple short tandem sequence repeat DNA markers (25, 26). According to the standard procedure using an automatic biochemical analyzer (Hitachi 7600; Hitachi, Tokyo, Japan), we tested participants' fasting blood cholesterol sample in the Qingdao Diabetes Hospital. Friedewald equation was used to calculate the LDL-C level: LDL-C (mmol/L) = TC–HDL-C–(TG/2.2) (mmol/L) (27). The Regional Ethics Committee of the Qingdao CDC Institutional Review Boards approved this study and the ethical principles followed the Helsinki Declaration. All subjects signed the written informed consent.

Genotyping, imputation, and quality control

Genome-wide SNPs were genotyped using the Illumina's InfiniumOmni2.5Exome-8v1.2 Bead-Chip platform. We performed stringent genotype quality control procedures: locus missing (< 0.05), minor allele frequency (MAF > 0.05), call rate (>0.98), and Hardy-Weinberg Equilibrium (HWE > 1 × 10−4). The final number of SNPs included in the subsequent GWAS analysis was 1,365,181. We used the IMPUTE2 (28) software to impute un-typed SNPs using the LD information from 1,000 Genomes Project Phase 3 reference panel (29) (CHS N = 171 and CHB N = 142). We used R2 > 0.6, MAF > 0.05 and HWE > 1 × 10−4 to filter the imputed data, and the 7,405,822 SNPs were used to explore the association with plasma cholesterol level.

Statistical analysis

Heritability

Data preparation and statistical description were performed with SPSS version 22.0. We used the structural equation models (SEM) to evaluate the genetic variance components with Mx software. The Blom's formula was used to guarantee the normal distribution or approximate normal distribution because of the deviation of the distribution of all indicators. Pearson's product-moment correlation coefficient was used to calculate intra-class phenotypic correlations. The correlation in MZ twins was significantly higher than DZ twins, reflecting the importance of genetic effects in plasma cholesterol levels. The total phenotypic variance could be decomposed to additive genetic variance (A), common or shared environmental variance (C), and unique/non-shared environmental variance (E). The full ACE model was firstly fitted, the likelihood-ratio χ2-test was applied to test whether the contributions of A or C to the model had statistical significance by comparing the full model (ACE) and their nested models (CE and AE). The Akaike information criterion (AIC), which is equal to the goodness-of-fit χ2-value minus twice the degrees of freedom, was used to indicate the parsimony of each model and a lower AIC indicated a better fit. Heritability (h2) was calculated in the best-fitting model based on the ratio of additive genetic variation to total phenotypic variation, with adjusting for the effect of age, sex and education level. We calculated the power of twin pairs for additive genetic influences by Mx software, and the calculation results showed that the power of our heritability analysis was above 90%.

GWAS

SNPs-based analysis

We used the genome-wide efficient mixed-model association (GEMMA) to test the association between plasma cholesterol level and SNP genotypes after adjusting for the following covariates: sex, age, and education level (30). We used Quantile-quantile (Q-Q) and Manhattan plots to illustrate overall significance level (P < 5 × 10−8) and suggestive level (P < 1 × 10−5) autosomal and chromosomal X SNPs (31). In addition, we analyzed the enrichment of cell-type enhancers for the typed GWAS results of the regulatory domains outside the coding regions by using online HaploReg v4.1 software (32, 33). SNPs with P < 1 × 10−5 were selected as query SNPs, and an uncorrected P < 0.05 for enrichments of cell-type enhancers were reported.

Gene-based analysis

The gene-based analysis integrated all SNPs within a gene to increase the signal or strength of association, which correcting for linkage disequilibrium (LD) and gene size. The gene-based test was implemented in Versatile Gene-based Association Study-2 (VEGAS2) which uses 1000 Genomes data to simulate the correlations of SNPs across the autosomes and chromosome X. (34, 35). We used SNPs from “1000G East ASIAN Population”. The genome-wide significant gene for the association was defined as P < 2.63 × 10−6 (0.05/19,001) as 19,001 genes being evaluated.

pathway enrichment analysis

We used PASCAL to compute pathway-scored (36–38). In this approach, genetic markers SNPs were first mapped to genes, and the association scores of all genes in the pathway were computed. We then combined the genes scores of the same pathways to calculate the pathway scores. We used chi-squared or empirical score to evaluate pathway enrichment of high-scoring (possibly fused) genes, avoiding any standard binary enrichment tests with inherent P-value thresholds. Pathways and their corresponding gene annotation were obtained from KEGG, Reactome, and Biocarta (as defined in MSigDB).

Results

Heritability

The final sample consisted of 382 twin pairs for heritability analysis and 139 DZ twin pairs for GWAS analysis with a mean age of 51.6 ± 7.7. The mean value ± SD of TC, HDL-C, LDL-C level for all subjects was 4.9 ± 1.2 umol/L, 1.5 ± 0.5 umol/L, 2.8 ± 0.9 umol/L, respectively (Supplemental Table 1). MZ twin correlations for TC (rMZ = 0.61, 95%CI: 0.52–0.68), HDL-C (rMZ = 0.74, 95%CI: 0.68–0.79), LDL-C (rMZ = 0.65, 95%CI: 0.57–0.72) level were all larger than DZ twin correlations (rDZ = 0.35, 95%CI: 0.21–0.47; rDZ = 0.61, 95%CI: 0.50–0.70; rDZ = 0.35, 95%CI: 0.21–0.46), respectively, indicating the existence of genetic effects (Supplemental Table 2). The full ACE model was first determined and then the likelihood ratio test and AIC was applied to choose the nested models. Finally, the ACE model was the best fit model with A accounting for 26.6% (95% CI: 7.9–48.9), C accounting for 47.8% (95% CI: 26.3–64.6), E accounting for 25.6% (95% CI: 20.8–31.6) for the variance in HDL-C level. The best fit model for TC level was AE model with A accounting for 61.4% (95% CI: 53.2–68.3), E accounting for 38.6% (95% CI: 31.7–46.8) and for LDL-C level was also AE model with A accounting for 65.5% (95% CI: 57.9–71.8), E accounting for 34.5% (95% CI: 28.2–42.2) (Table 1).
Table 1

Model fit and proportion of variance for TC, HDL-C, LDL-C level accounted by genetic and environmental parameters.

VariableModelA (95%CI)C (95%CI)E (95%CI)−2LLdfAICχ2P
TCACE51.8(23.6–67.9)9.0(0–33.7)39.2(32.0–47.9)1996.9757482.9
CE49.5(41.5–56.8)50.5(43.2–58.5)2010.3758494.313.42.49E−04
AE*61.4(53.2–68.3)38.6(31.7–46.8)1997.4758481.40.45.13E01
HDL-CACE*26.6(7.9–48.9)47.8(26.3–64.6)25.6(20.8–31.6)1763.6757249.6
CE69.4(63.7–74.4)30.6(25.6–36.3)1771.7758255.78.14.48E−03
AE74.9(69.5–79.4)25.1(20.6–30.5)1778.5758262.514.91.15E−04
LDL-CACE61.5(34.9–71.7)3.8(0–27.4)34.7(28.3–42.7)2028.1760508.1
CE50.9(43.0–58.0)49.1(42.0–57.0)2049.4761527.421.24.06E−06
AE*65.5(57.9–71.8)34.5(28.2–42.2)2028.2761506.20.097.71E01

, The best fitted model; −2LL, −2 Log Likelihood; df, degree of freedom; χ.

Model fit and proportion of variance for TC, HDL-C, LDL-C level accounted by genetic and environmental parameters. , The best fitted model; −2LL, −2 Log Likelihood; df, degree of freedom; χ.

GWAS

SNPs-based analysis

TC level

A sample of 139 DZ twin pairs including 1,365,181 qualified SNPs was included for the present GWAS. The Q-Q plot about TC level illustrated the relationship between the observed and expected GWAS P-values (Supplemental Figure 1). The genomic inflation factor (λ-statistic = 1) revealed no evidence of inflation of the test statistics due to population stratification. And weak association was shown due to the slight deviation in the upper right tail from the null distribution. No SNPs reached the genome-wide significance level (P < 5 × 10−8) as illustrated in Manhattan plot (Supplemental Figure 2). However, there were 8 SNPs exceeding the threshold for suggestive significance level (P < 1 × 10−5) (Supplemental Table 3). The strongest association SNP was rs7107698 (P = 2.29 × 10−6). As the locus zoom plots illustrated, one chromosomal loci 11p15.4 showed nominal association with TC level (Supplemental Figure 3). Four SNPs (P = 2.29 × 10−6 to 2.45 × 10−6) were located at AMPD3 gene on chromosome 11p15.4. By HaploReg v4.1, three cell-type specific enhancers (uncorrected P < 0.05) of primary neutrophils from peripheral blood (P = 0.04), ovary (P = 0.04) and NHDF-Ad adult dermal fibroblast primary cells (P = 0.01) were identified for TC level (Supplemental Table 4).

HDL-C level

The Q-Q plot about HDL-C level was shown in Figure 1. None of the SNPs reached the genome-wide significance level (P < 5 × 10−8) as illustrated in Manhattan plot (Figure 2). However, there are 14 SNPs exceeding the threshold for suggestive significance level (P < 1 × 10−5) (Table 2). The strongest association SNP was kgp6737496 (rs199929635) (P = 7.10 × 10−7). Chromosomal loci 5q14.1 showed suggestive association with HDL-C level, which including kgp6737496 (rs199929635), rs12518218, rs7729225 near LOC101929154 genes (Figure 3).
Figure 1

Quantile-quantile plot for quality control check and visualizing crude association for genome-wide association study of HDL-C level. The x-axis shows the –log10 of expected P-values of association from chi-square distribution and the y-axis shows the –log10 of P-values from the observed chi-square distribution. The black dots represent the observed data, and the red line is the expectation under the null hypothesis of no association.

Figure 2

Manhattan plot for genome-wide association study of HDL-C level. The x-axis shows the numbers of autosomes and the X chromosome, and the y-axis shows the –log10 of P-values for statistical significance. The dots represent the SNPs. None of the SNPs reached the genome-wide significance level (P < 5 × 10−8).

Table 2

The summary of SNPs with P < 1 × 10−5 for association with HDL-C in typed GWAS data.

SNPChr bandCHRBPP-valueClosest genes or genesOfficial full name
kgp6737496 (rs199929635)5q14.1577,142,7887.10E-07LOC101929154Uncharacterized LOC101929154
rs284022134q244106,517,4497.51E-07ARHGEF38Rho guanine nucleotide exchange factor 38
rs64689098q22.38105,227,9851.65E-06RIMS2Regulating synaptic membrane exocytosis 2
rs726850708q22.38105,250,2371.71E-06RIMS2Regulating synaptic membrane exocytosis 2
rs125182185q14.1577,161,4892.22E-06LOC101929154Uncharacterized LOC101929154
rs2411788p21.1828,626,4182.67E-06INTS9Integrator complex subunit 9
rs5620711512q13.131252,994,8963.08E-06KRT72Keratin 72
rs109390124p15.2424,897,0324.43E-06CCDC149Coiled-coil domain containing 149
rs100530125q35.15171,522,2755.71E-06STK10Serine/threonine kinase 10
rs1241470910p131017,041,0835.90E-06CUBNCubilin
rs125110684p15.2424,896,6586.04E-06CCDC149Coiled-coil domain containing 149
rs77292255q14.1577,142,8296.05E-06LOC101929154Uncharacterized LOC101929154
kgp10999245 (rs201647698)8q22.38105,243,1236.36E-06RIMS2Regulating synaptic membrane exocytosis 2
rs1734599310p131017,032,8859.56E-06CUBNCubilin

kgp, 1,000 Genomes Project; CHR, chromosome. The content discussed in detail were in bold.

Figure 3

Regional association plot showing signal around chromosomal loci of 5q14.1 for genome-wide association study of HDL-C level.

Quantile-quantile plot for quality control check and visualizing crude association for genome-wide association study of HDL-C level. The x-axis shows the –log10 of expected P-values of association from chi-square distribution and the y-axis shows the –log10 of P-values from the observed chi-square distribution. The black dots represent the observed data, and the red line is the expectation under the null hypothesis of no association. Manhattan plot for genome-wide association study of HDL-C level. The x-axis shows the numbers of autosomes and the X chromosome, and the y-axis shows the –log10 of P-values for statistical significance. The dots represent the SNPs. None of the SNPs reached the genome-wide significance level (P < 5 × 10−8). The summary of SNPs with P < 1 × 10−5 for association with HDL-C in typed GWAS data. kgp, 1,000 Genomes Project; CHR, chromosome. The content discussed in detail were in bold. Regional association plot showing signal around chromosomal loci of 5q14.1 for genome-wide association study of HDL-C level.

LDL-C level

The Q-Q plot about LDL-C level was shown in Supplemental Figure 4. None of the SNPs reached the genome-wide significance level (P < 5 × 10−8) as illustrated in Manhattan plot (Supplemental Figure 5). However, there are 9 SNPs exceeding the threshold for suggestive significance level (P < 1 × 10−5) (Supplemental Table 5). The strongest association SNP was rs10490120 (P = 1.11 × 10−6). At chromosomal loci 2p16.3, four SNPs rs10490120, rs4953640, rs74263479, rs17037869 were positioned closest to FSHR gene that was involved in follicle stimulating hormone receptor (Supplemental Figure 6). HUES6 cells (P = 0.01), HUES64 cells (P = 0.01) and iPS-18 cells (P = 0.01) were confirm as cell-type specific enhancers for LDL-C level (Supplemental Table 6).

Imputation

To maximize the identification of new risk variants, we imputed typed SNPs using 1,000 Genomes Project Phase 3 as the reference panel. Manhattan plots for all post-imputation variants showed none evidence of genome-wide significance level (P < 5 × 10−8) (Figure 2; Supplementary Figures 2, 5). However, there were 39, 55, and 59 SNPs exceeding the threshold for suggestive significance level (P < 1 × 10−5) for TC, HDL-C, and LDL-C, respectively. The strongest associations were rs77348447, rs79775842, rs56047090 for TC, HDL-C, LDL-C, respectively (Supplemental Tables 7–9). We also compared our post-imputation results with 34,421 East Asians lipids GWAS meta-analysis results And 32 SNPs located in genes RGS5, OSBPL10, ADGRB3, PSMB7, SRSF8 for TC level, 50 SNPs located in genes KAZN, LOC105378657, LOC105373529, LOC105373941, BIN3, LINC02153, LDHAL6CP for HDL-C level, 39 SNPs located in genes VPS13D, IGF2BP2, PSMB7, CA10, NECTIN2, APOE, APOC1, TOMM40 for LDL-C level could be replicated (Supplemental Table 10).

Gene-based analysis

In the gene-based analysis, none genes were found to achieve genome-wide significance level and the number of genes that were nominally associated with TC, HDL-C, LDL-C level was 1,038, 1,033, and 1,090, respectively (P < 0.05). The top 20 genes were ranked by their P values (TC: Supplemental Table 11, HDL-C: Table 3, LDL-C: Supplemental Table 12). LOC101929154, RIMS2, INTS9, KRT72, CCDC149, CUBN genes for HDL-C level and AMPD3, ABR genes for TC level had already been shown in the suggestive level SNPs-based analysis, whereas the other genes were novel.
Table 3

The top 20 genes from VEGAS2 gene-based analysis showing the strongest association with HDL-C level (P < 0.05) in typed GWAS data.

ChrGeneNumbers of SNPsStart positionStop positionGene-based test statisticGeneP-valueTop-SNPTop-SNPP-value
11RRP896,621,1436,624,88057.804.00E-06rs178346921.58E-04
1RLF2340,627,04040,706,593164.271.20E-05rs168270792.88E-05
4MGARP7140,187,316140,201,49254.602.30E-05rs32089412.04E-05
12SLC26A10558,013,69258,019,93445.552.90E-05rs9238281.79E-04
5CTXN315126,984,712126,994,32299.493.80E-05rs2487094.16E-04
3UROC139126,200,007126,236,616243.374.10E-05rs10915531.99E-04
4KLKB119187,148,671187,179,625113.757.10E-05rs19128261.31E-04
17CBX4677,806,95477,813,21354.628.60E-05rs734221237.25E-05
19ACPT751,293,67151,298,48158.321.00E-04rs557355285.48E-04
20CSTL11023,420,32123,425,56752.621.30E-04rs37467373.40E-04
1ISG155948,846949,91936.631.80E-04rs1160026088.77E-05
19C19orf481351,300,94951,308,110112.721.90E-04rs48018532.61E-04
1TMCO2240,713,57240,717,36517.202.10E-04rs612006543.36E-05
4NDUFC14140,211,070140,223,70538.882.20E-04rs126426472.49E-05
2SOWAHC5110,371,910110,376,56443.042.40E-04rs67262522.79E-04
19ZNF776758,258,16358,269,52745.193.10E-04rs359194562.11E-04
1CD10124117,544,371117,579,173106.373.30E-04rs15557931.20E-04
10IFIT1B391,137,81291,144,96218.814.00E-04rs108879517.36E-04
1HECTD3645,468,21945,477,02742.864.20E-04rs75412072.66E-04
12FBXO2110117,581,584117,628,30090.534.40E-04rs22797665.40E-04

Chr, chromosome. The content discussed in detail were in bold.

The top 20 genes from VEGAS2 gene-based analysis showing the strongest association with HDL-C level (P < 0.05) in typed GWAS data. Chr, chromosome. The content discussed in detail were in bold. The number of common nominally associated genes (P < 0.05) between TC VS HDL-C level, TC VS LDL-C level, HDL-C VS LDL-C level was 195, 504, 91, respectively. And 71 common genes were found among the three traits (Supplemental Table 13).

Pathway enrichment analysis

PASCAL program was used in the pathway enrichment analysis. The number of pathways that were nominally associated with TC, HDL-C, and LDL-C level was 816, 529, and 813, respectively (P < 0.05). The top 20 pathways were ranked by their emp-P-values (TC: Supplemental Table 14; HDL-C: Table 4; LDL-C: Supplemental Table 15). The Q-Q plots were shown in Supplemental Figures 7–9. Pathway analysis revealed several biological processes significantly associated with TC level: zinc transporters, metal ion SLC transporters, ERKS ARE inactivated, MAPK targets nuclear events mediated by map kinases, ERK MAPK targets, transport of glucose and other sugars bile salts and organic acids metal ions and amine compounds, amino acid transport across the plasma membrane, etc.
Table 4

The top 20 pathway results-KEGG, Reactome, and Biocarta (emp-P < 0.05) using PASCAL program for HDL-C level in typed GWAS data.

Pathwaychisq-Pemp-P-log(chisqP)–log(empP)
REACTOME_CELL_SURFACE_INTERACTIONS_AT_THE_VASCULAR_WALL7.11E-042.94E-043.148183.53165
BIOCARTA_HER2_PATHWAY4.39E-044.35E-043.357363.36151
KEGG_CELL_ADHESION_MOLECULES_CAMS6.25E-044.44E-043.204213.35262
REACTOME_IL_6_SIGNALING6.27E-044.66E-043.202813.33161
KEGG_TIGHT_JUNCTION2.91E-036.20E-042.536493.20761
BIOCARTA_AHSP_PATHWAY8.78E-048.80E-043.056523.05552
REACTOME_BILE_SALT_AND_ORGANIC_ANION_SLC_TRANSPORTERS9.23E-049.30E-043.034633.03152
REACTOME_TRANSPORT_OF_GLUCOSE_AND_OTHER_SUGARS_BILE_SALTS_AND_ORGANIC_ACIDS_METAL_IONS_AND_AMINE_COMPOUNDS9.23E-049.50E-043.034633.02228
REACTOME_SIGNALING_BY_ILS1.35E-031.17E-032.870212.93181
BIOCARTA_CHREBP2_PATHWAY1.13E-031.30E-032.947432.88606
REACTOME_INTRINSIC_PATHWAY1.51E-031.38E-032.820862.86012
REACTOME_FORMATION_OF_FIBRIN_CLOT_CLOTTING_CASCADE1.51E-031.48E-032.820862.82974
REACTOME_NEPHRIN_INTERACTIONS4.77E-031.82E-032.321392.73993
BIOCARTA_IL6_PATHWAY2.04E-031.96E-032.690872.70774
KEGG_NATURAL_KILLER_CELL_MEDIATED_CYTOTOXICITY7.95E-032.15E-032.099382.66756
REACTOME_CHONDROITIN_SULFATE_BIOSYNTHESIS2.36E-032.31E-032.626342.63639
KEGG_SULFUR_METABOLISM2.36E-032.34E-032.626342.63078
KEGG_GLYCOSAMINOGLYCAN_BIOSYNTHESIS_CHONDROITIN_SULFATE2.36E-032.39E-032.626342.6216
REACTOME_APC_C_CDH1_MEDIATED_DEGRADATION_OF_CDC20_AND_OTHER_APC_C_CDH1_TARGETED_PROTEINS_IN_LATE_MITOSIS_EARLY_G15.66E-032.61E-032.247552.58336
KEGG_LEUKOCYTE_TRANSENDOTHELIAL_MIGRATION2.96E-032.62E-032.529122.5817

chisq-P, Chi-square p-value. Chi-squared method (gene-score p-value were ranked and transformed to a uniform distribution, these values were then transformed by a chi-square quantile function, and summed). The content discussed in detail were in bold.

emp-P, empirical p-value. Empirical sampling method (gene-scores are transformed with chi-square quantile function and summed, then Monte Carlo estimate of the p-values were obtained by sampling random sets of the same size).

The top 20 pathway results-KEGG, Reactome, and Biocarta (emp-P < 0.05) using PASCAL program for HDL-C level in typed GWAS data. chisq-P, Chi-square p-value. Chi-squared method (gene-score p-value were ranked and transformed to a uniform distribution, these values were then transformed by a chi-square quantile function, and summed). The content discussed in detail were in bold. emp-P, empirical p-value. Empirical sampling method (gene-scores are transformed with chi-square quantile function and summed, then Monte Carlo estimate of the p-values were obtained by sampling random sets of the same size). Several biological pathways significantly related to HDL-C levels emerged: cell adhesion molecules CAMs, IL-6 signaling, tight junction, bile salt and organic anion SLC transporters, transport of glucose and other sugars bile salts and organic acids metal ions and amine compounds, signaling by ILS, IL6 pathway, natural killer cell mediated cytotoxicity, sulfur metabolism, chondroitin sulfate biosynthesis, glycosaminoglycan biosynthesis chondroitin sulfate, leukocyte transendothelial migration, etc. The pathways associated with LDL-C involved: FC epsilon RI signaling pathway, NFAT pathway, metabolism of amino acids and derivatives, ERBB signaling pathway, IL1R pathway, HIV infection, etc.

Discussion

Our investigation was based on 382 twin pairs to explore the heritability of TC, HDL-C, LDL-C level, and confirmed the genetic variants underlying these trait by GWAS. The correlation in MZ twins was significantly higher than DZ twins, which reflected the existence of genetic variance in TC, HDL-C, LDL-C levels (Supplemental Table 2). The ACE model was the best fit model with A accounting for 26.6%, C accounting for 47.8%, E accounting for 25.6% for the variance in HDL-C level. The best fit model for TC level was AE model with A accounting for 61.4%, E accounting for 38.6% and for LDL-C level was also AE model with A accounting for 65.5%, E accounting for 34.5% (Table 1). The magnitude of the additive genetic components (A) for TC level obtained here was in line with several previous twin studies (15, 16, 27, 39–47). HDL-C level is close to some other twin studies (11, 14, 27, 48, 49). LDL-C level is similar to the others (27, 42, 43, 45, 47, 50).

SNP-based analysis

TC level.

While no genome-wide significant SNPs were found in GWAS for TC level, one SNP rs7107698 on chromosome 11 was found as promising genetic regions. The AMPD3 gene around the rs7107698 has been linked to the TC level. Adenosine monophosphate deaminase 3 (AMPD3) is an enzyme that catalyzes the hydrolytic deamination of adenosine monophosphate to inosine monophosphate and participates in the pathway of adenylate catabolic (51). The activation of AMP deaminase leads to the increase of uric acid and the production of mitochondrial oxidative stress, which further stimulates the formation of de novo lipogenesis and the activation of ATP-citrate lyase and the synthesis of long-chain saturated fatty acids and eventually affects TC level (52, 53). Additionally, three highly correlated SNPs (rs4909928, rs12184411, kgp6520322) were detected within AMPD3 gene, showing suggestive evidence of association with TC level.

HDL-C level.

This cluster of suggestive SNPs (kgp6737496, rs12518218, rs7729225) is linked to LOC101929154, a non-coding RNA. Although we have not fully elucidated the biological function of this ncRNA, mutations in base pairs may affect their regulatory domain to regulate HDL-C levels. Another gene CUBN including suggestive SNPs rs12414709 and rs17345993 has been linked to HDL-C metabolism (54). Cubilin (CUBN) encodes high affinity HDL-C and lipid-poor apoA-I endocytosis receptor, which participate in the renal clearance of filterable apolipoprotein AI/HDL-C (55, 56). SNPs rs10490120, rs4953640, rs74263479, rs17037869 were included in the FSHR gene. Follicle stimulating hormone receptor (FSHR) may interact with FSH in human hepatic tissue, which can elevate LDL-C level by blocking the expression of LDLR (57, 58). Another KCNK9 around the suggestive SNP rs13251143 also has evidence for dyslipidemia. A great deal of studies has shown that KCNK9 gene is associated with obesity, HDL-C, adiponectin levels and aldosterone production (59–61). All the above factors are related to dyslipidemia (62–64). As for cell-type enhancers, the relationship between TC level and ovary has been studied. Women with polycystic ovary syndrome had higher TC and LDL-C level, lower HDL-C level (65, 66). Many studies have showed that Estrogen Receptor 1 (ESR1) is associated with TC, HDL-C, LDL-C level (67). Genotype imputation substantially increases available SNPs for analysis in GWAS. Substantive SNPs introduced by imputation reached the suggestive level associated with cholesterol. SNPs rs17578959 and rs28845526 were located in the PID1 gene, which served as the regulator of the LDLR-related protein 1 (LRP1) function and controlled the processing of postprandial lipoproteins (68). Solute Carrier Family 13 Member 1 (SLC13A1) gene can regulate the cholesterol level by participating in transport of glucose and other sugars, bile salts and organic acids, metal ions and amine compounds pathways. As additional replication, we cross-referenced our post-imputation results with 34,421 East Asians lipids GWAS meta-analysis results (Supplemental Table 10). A list of SNPs could be replicated, especially the SNPs located in the relevant genes, eg. RGS5, OSBPL10, SRSF8 for TC level, KAZN, LOC105378657, LOC105373529, LOC105373941, BIN3, LINC02153, LDHAL6CP for HDL-C level, VPS13D, IGF2BP2, PSMB7, CA10, NECTIN2, APOE, APOC1, TOMM40 for LDL-C level. Fas Associated Factor 1 (FAF1) can inhibit the activation of NF-kB by interfering with the assembly of IêB kinase (IKK) complex and impeding the nuclear translocation of NF-kB RelA in stimulation-dependent manner. NF-kB signaling pathway is closely related to inflammation (69–71). Epidemiology investigations have revealed that inflammation is associated with a risk of dyslipidemia (72, 73). Lamina C et al. found that Kallikrein B1 (KLKB1) was associated with apolipoprotein A-IV concentrations in a genome-wide association meta-analysis. Apolipoprotein A-IV acting as a major component of HDL-C and chylomicron particles participates in reverse cholesterol transport (74). Kallikrein B1 (KLKB1) has been studied in a genome-wide association meta-analysis on apolipoprotein A-IV concentrations. Apolipoprotein A-IV acting as a major component of HDL-C and chylomicron particles participates in reverse cholesterol transport (74). Neurotrophic Receptor Tyrosine Kinase 1 (NTRK1) encodes the neurotrophic tyrosine kinase receptor (NTKR) family members, which are related to classical mitogen-activated protein kinases (MAPK) signaling pathways (75). Further, MAP signaling pathways participated in lipid metabolism (76). Syntrophin Beta 2 (SNTB2) suggestes a role in ERK and SR-BI level, and sphingomyelin metabolism in obesity and have been proved to affect the activity of ABCA1 by stabilizing ABCA1 protein, which affect the catabolism of LDL-C level (77–79). Fas Associated Factor 1 (FAF1) also plays a key role in regulating the LDL-C level (69–71). Pathway analysis revealed several biological processes significantly associated with TC level: zinc transporters, metal ion SLC transporters, immune system, regulation of transport. Other novel pathways especially top 20 may also be interesting potential candidates for future research and validation. (1) Zinc serves as a catalytic or structural cofactor for many different proteins and participates in the formation and function of enzyme. One study in young Finns (6–18 years) reported that serum Zinc was positively correlated with TC level (80). (2) Six SLC gene families encode proteins which mediate transport of metals. Transition metal ion such as vanadium, iron, can influence the serum lipid and lipoprotein profiles (81, 82). (3) ERKS ARE inactivated, MAPK targets nuclear events mediated by map kinases and ERK MAPK targets participate in the immune system pathway. Immune system is also associated with cholesterol level. Acute phase conditions and immune disorders promote the decrease of HDL-C and change the composition and size of HDL-C (83). (4) Transport of glucose and other sugars bile salts and organic acids metal ions and amine compounds and amino acid transport across the plasma membrane participate in regulation of transport. Several genes (GAS6, GPLD1, PRKCE, WNK1, ESR1, RAC2, FAF1, FFAR1, RNASEL, GCK, SLC30A8, RARRES2, TLR5, TLR1, ACSL3, ABCA7, Th17) enriching in pathway of regulation of transport have been clarified the association with cholesterol level.

HDL level

Several biological pathways significantly related to HDL-C levels include: cell adhesion molecules CAMs, tight junction, chondroitin sulfate biosynthesis, sulfur metabolism, immune system, regulation of transport. (1) An increased CAMs expression may be a mechanism that decreases plasma HDL-C level (84). (2) Zonulin, a physiological mediator of tight intercellular junctions reversibly regulates intestinal permeability to reduce the level of HDL-C (85). (3) Animal experiments have shown that chondroitin-6-sulfate can reduce plasma LDL-C (86). (4) Sulfur-containing compounds are generally associated with an unfavorable lipid profile (87). (5) IL-6 signaling, signaling by ILS, IL6 pathway, natural killer cell mediated cytotoxicity, leukocyte transendothelial migration constitute the immune system to regulate cholesterol levels. (6) Bile salt and organic anion SLC transporters, transport of glucose and other sugars bile salts and organic acids metal ions and amine compounds participate in regulation of transport.

LDL level

The pathways associated with LDL-C were: immune system, metabolism of amino acids and derivatives, ERBB signaling pathway, HIV infection. (1) FC epsilon RI signaling pathway, NFAT pathway, IL1R pathway constitute the immune system to regulate cholesterol levels. (2) Metabolism of amino acids and derivatives participates in the formation and function of enzyme to regulate cholesterol level. (3) ERBB4 activates sterol regulatory element binding protein-2 (SREBP-2) to enhance LDL-C uptake and cholesterol biosynthesis (88). (4) A cross-sectional epidemiological study in China showed that mean LDL-C was lower in HIV-positive than HIV-negative subjects (89).

Strengths and limitations

There are two advantages in this study. First, our results were based on the twin data of TC, HDL-C, LDL-C level. Associated individuals, such as twin pairs, will confer increased power in genetic association analysis due to their genetic association (21). And the power of our heritability analysis was above 90%, which suggested that the heritability findings were credible. Second, compared with most GWAS results from European populations, we sampled GWAS from the Qingdao twin population. Our GWAS results of TC, HDL-C, LDL-C level in East Asian population provided a basis for future molecular biology investigations. This study has two limitations that are noteworthy. First, our study had a relatively small sample size due to the challenges of recruiting and identifying qualified twin pairs. So it should be noted that the power of our analysis might not be sufficient to detect association between SNPs, pathways and cholesterol levels, and our results needed to be furtherly confirmed. Second, our study provided a lot of suggestive results, but no statistically significant results. However, many variants and genes had been confirmed, other novel variants and genes may also be interesting potential candidates for future research and validation.

Conclusions

In brief, we have verified genetic impact on TC, HDL-C, LDL-C variation through twin modeling. The promising genetic regions for TC, HDL-C, LDL-C level were on chromosome 11 around rs7107698, chromosome 5 around rs12518218, chromosome 2 around rs10490120, respectively. There are 8, 14, 9 SNPs exceeding the threshold for suggestive significance level for TC, HDL-C, LDL-C level, respectively. In the gene-based analysis, the number of genes that was nominally associated with TC, HDL-C, LDL-C level was 1,038, 1,033, and 1,090, respectively. Although our findings need to be replicated and validated, the data reported here could represent a useful reference for GWAS results of TC, HDL-C, LDL-C level in East Asian population and provide a basis for future molecular biology investigations.

Data availability

The SNPs datasets of this study have been deposited in the European Variation Archive (EVA) (Accession No. PRJEB23749).

Author contributions

HL and DZ designed the study. HD and CX collected samples and phenotypes. HL and CZ assisted in sample data and sequencing data management. HL and WW analyzed the sequencing data and interpreted the analysis results. HL and DZ drafted the manuscript, XT and HD participated in the discussion, and WW, CX, and CZ revised it. All the authors read the manuscript and agreed to publish. All the authors agreed to be responsible for all aspects of the work.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  89 in total

Review 1.  Adiponectin as a routine clinical biomarker.

Authors:  Ken Kishida; Tohru Funahashi; Iichiro Shimomura
Journal:  Best Pract Res Clin Endocrinol Metab       Date:  2013-08-20       Impact factor: 4.690

2.  Inheritance of high density lipoprotein and lipoprotein lipase and hepatic lipase activity.

Authors:  T Kuusi; Y A Kesäniemi; M Vuoristo; T A Miettinen; M Koskenvuo
Journal:  Arteriosclerosis       Date:  1987 Jul-Aug

3.  Suppression of atherogenesis in hypercholesterolemic rabbits by chondroitin-6-sulfate.

Authors:  T Matsushima; Y Nakashima; M Sugano; H Tasaki; A Kuroiwa; O Koide
Journal:  Artery       Date:  1987

Review 4.  Dyslipidemia and Diabetic Macular Edema: A Systematic Review and Meta-Analysis.

Authors:  Radha Das; Rebecca Kerr; Usha Chakravarthy; Ruth E Hogg
Journal:  Ophthalmology       Date:  2015-07-03       Impact factor: 12.079

5.  Genetic and environmental contributions to phenotypic components of metabolic syndrome: a population-based twin study.

Authors:  Shanchun Zhang; Xin Liu; Yunxian Yu; Xiumei Hong; Katherine K Christoffel; Binyan Wang; Hui-Ju Tsai; Zhiping Li; Xue Liu; Genfu Tang; Houxun Xing; Wendy J Brickman; Donald Zimmerman; Xiping Xu; Xiaobin Wang
Journal:  Obesity (Silver Spring)       Date:  2009-04-30       Impact factor: 5.002

6.  Zinc, iron and vitamins A, C and e are associated with obesity, inflammation, lipid profile and insulin resistance in Mexican school-aged children.

Authors:  Olga Patricia García; Dolores Ronquillo; María del Carmen Caamaño; Guadalupe Martínez; Mariela Camacho; Viridiana López; Jorge L Rosado
Journal:  Nutrients       Date:  2013-12-10       Impact factor: 5.717

7.  A global reference for human genetic variation.

Authors:  Adam Auton; Lisa D Brooks; Richard M Durbin; Erik P Garrison; Hyun Min Kang; Jan O Korbel; Jonathan L Marchini; Shane McCarthy; Gil A McVean; Gonçalo R Abecasis
Journal:  Nature       Date:  2015-10-01       Impact factor: 49.962

8.  Fast and Rigorous Computation of Gene and Pathway Scores from SNP-Based Summary Statistics.

Authors:  David Lamparter; Daniel Marbach; Rico Rueedi; Zoltán Kutalik; Sven Bergmann
Journal:  PLoS Comput Biol       Date:  2016-01-25       Impact factor: 4.475

9.  Heritability and Genome-Wide Association Analyses of Serum Uric Acid in Middle and Old-Aged Chinese Twins.

Authors:  Weijing Wang; Dongfeng Zhang; Chunsheng Xu; Yili Wu; Haiping Duan; Shuxia Li; Qihua Tan
Journal:  Front Endocrinol (Lausanne)       Date:  2018-03-06       Impact factor: 5.555

10.  Prevalence of Dyslipidemia Among Antiretroviral-Naive HIV-Infected Individuals in China.

Authors:  Yinzhong Shen; Jiangrong Wang; Zhenyan Wang; Tangkai Qi; Wei Song; Yang Tang; Li Liu; Renfang Zhang; Hongzhou Lu
Journal:  Medicine (Baltimore)       Date:  2015-12       Impact factor: 1.817

View more
  6 in total

1.  Exome sequencing in a familial form of anorexia nervosa supports multigenic etiology.

Authors:  Thierry Bienvenu; Nicolas Lebrun; Julia Clarke; Philibert Duriez; Philip Gorwood; Nicolas Ramoz
Journal:  J Neural Transm (Vienna)       Date:  2019-08-06       Impact factor: 3.575

2.  An Integrative Phenotype-Genotype Approach Using Phenotypic Characteristics from the UAE National Diabetes Study Identifies HSD17B12 as a Candidate Gene for Obesity and Type 2 Diabetes.

Authors:  Mahmood Y Hachim; Hayat Aljaibeji; Rifat A Hamoudi; Ibrahim Y Hachim; Noha M Elemam; Abdul Khader Mohammed; Albert Salehi; Jalal Taneera; Nabil Sulaiman
Journal:  Genes (Basel)       Date:  2020-04-23       Impact factor: 4.096

3.  Heritability and genome-wide association analyses of fasting plasma glucose in Chinese adult twins.

Authors:  Weijing Wang; Caixia Zhang; Hui Liu; Chunsheng Xu; Haiping Duan; Xiaocao Tian; Dongfeng Zhang
Journal:  BMC Genomics       Date:  2020-07-18       Impact factor: 3.969

4.  Gene-based association tests using GWAS summary statistics and incorporating eQTL.

Authors:  Xuewei Cao; Xuexia Wang; Shuanglin Zhang; Qiuying Sha
Journal:  Sci Rep       Date:  2022-03-03       Impact factor: 4.379

5.  Genetic and environmental determinants of variation in the plasma lipidome of older Australian twins.

Authors:  Anne Poljak; Perminder S Sachdev; Matthew Wk Wong; Anbupalam Thalamuthu; Nady Braidy; Karen A Mather; Yue Liu; Liliana Ciobanu; Bernhardt T Baune; Nicola J Armstrong; John Kwok; Peter Schofield; Margaret J Wright; David Ames; Russell Pickford; Teresa Lee
Journal:  Elife       Date:  2020-07-22       Impact factor: 8.140

6.  A Genome-Wide Association Study of Age-Related Hearing Impairment in Middle- and Old-Aged Chinese Twins.

Authors:  Haiping Duan; Wanxue Song; Weijing Wang; Hainan Cao; Bingling Wang; Yan Liu; Chunsheng Xu; Yili Wu; Zengchang Pang; Dongfeng Zhang
Journal:  Biomed Res Int       Date:  2021-07-17       Impact factor: 3.411

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.