Literature DB >> 31902109

Progressive effects of single-nucleotide polymorphisms on 16 phenotypic traits based on longitudinal data.

Donghe Li1, Hahn Kang2, Sanghun Lee3, Sungho Won4,5,6.   

Abstract

BACKGROUND: There are many research studies have estimated the heritability of phenotypic traits, but few have considered longitudinal changes in several phenotypic traits together.
OBJECTIVE: To evaluate the progressive effect of single nucleotide polymorphisms (SNPs) on prominent health-related phenotypic traits by determining SNP-based heritability ([Formula: see text]) using longitudinal data.
METHODS: Sixteen phenotypic traits associated with major health indices were observed biennially for 6843 individuals with 10-year follow-up in a Korean community-based cohort. Average SNP heritability and longitudinal changes in the total period were estimated using a two-stage model. Average and periodic differences for each subject were considered responses to estimate SNP heritability. Furthermore, a genome-wide association study (GWAS) was performed for significant SNPs.
RESULTS: Each SNP heritability for the phenotypic mean of all sixteen traits through 6 periods (baseline and five follow-ups) were significant. Gradually, the forced vital capacity in one second (FEV1) reflected the only significant SNP heritability among longitudinal changes at a false discovery rate (FDR)-adjusted 0.05 significance level ([Formula: see text], FDR = 0.0012). On estimating chromosomal heritability, chromosome 2 displayed the highest heritability upon periodic changes in FEV1. SNPs including rs2272402 and rs7209788 displayed a genome-wide significant association with longitudinal changes in FEV1 (P = 1.22 × 10-8 for rs2272402 and P = 3.36 × 10-7 for rs7209788). De novo variants including rs4922117 (near LPL, P = 2.13 × 10-15) of log-transformed high-density lipoprotein (HDL) ratios and rs2335418 (near HMGCR, P = 3.2 [Formula: see text] 10-9) of low-density lipoprotein were detected on GWAS.
CONCLUSION: Significant genetic effects on longitudinal changes in FEV1 among the middle-aged general population and chromosome 2 account for most of the genetic variance.

Entities:  

Keywords:  Genomic restricted maximum likelihood; Heritability; Longitudinal changes; Phenotypic trait

Year:  2020        PMID: 31902109      PMCID: PMC7113194          DOI: 10.1007/s13258-019-00902-x

Source DB:  PubMed          Journal:  Genes Genomics        ISSN: 1976-9571            Impact factor:   1.839


Introduction

Single nucleotide polymorphism (SNP)-based heritability () indicates the relative proportion of genetic variance explained based on SNPs used for genome-wide association studies (GWASs). For estimation, the genomic restricted maximum likelihood (GREML) for linear mixed models (LMMs) is often implemented in the genetic complex trait analysis (GCTA) tool (Yang et al. 2011). GREML first calculates the genetic relatedness matrices (GRM), which are used as variance–covariance matrices for random effects. The significance of estimates obtained through GREML depends on the study design; if it is applied to family-based samples, it displays pedigree-based heritability, but for unrelated subjects, it estimates (Kim et al. 2015; Yang et al. 2017). Estimation of involves considerable differences across not only methodologies but also procedures requiring careful interpretation of results (Evans et al. 2018; Ni et al. 2018). Moreover, the estimated heritability is potentially biased and misleading owing to measurement errors at various degrees. To overcome these challenges, the heritability determined from longitudinal data is more reliable than that determined from cross-sectional data. While most studies on focused on the primary effect of SNPs, significant effects of SNPs on the average annual differences indicate the SNP-by-age interaction. Numerous examples illustrate the importance of age on longitudinal changes (Genetic epidemiologic studies on age-specified traits. NIA Aging and Genetic Epidemiology Working Group 2000; Nishimura et al. 2012; van de Pol and Verhulst 2006). For instance, one study showed that an annual decline in lung function is associated with age (Kim et al. 2016), and another study reported a genetic influence on changes in both lipoprotein risk factors and systolic blood pressure over a decade (Friedlander et al. 1997). Therefore, the should be estimated on the basis of not only the mean of observed traits but also changes in the sufficient period. Hence, we applied a two-stage approach, which is a convenient method of analyzing longitudinal data by combining linear regression models to investigate the effect of SNPs on both average and longitudinal differences in phenotypic traits. In this study, we investigated the magnitude of SNPs effect on average and longitudinal differences by using both genomic data and 16 phenotypic traits associated with major health indices using a phenotype-genotype dataset of unrelated individuals in a community-based cohort and evaluated their importance. Except for baseline, each phenotype was objectively measured every 2 years for 10-year follow-up, and six repeated measurements (maximum) were obtained for each individual. For each subject, both the average phenotypic traits and their longitudinal changes were estimated via subject-specific regression analysis, using intercepts and coefficients of ages, respectively. Each value was estimated using GCTA. Our results show that lung function has the only significant for longitudinal changes, while all average phenotypes of the 16 traits yielded a significant value. Furthermore, the GWAS revealed certain novel genome-wide significant SNPs associated with the phenotypes analyzed herein.

Materials and Methods

Ethics approval and consent to participate

The respective Institutional Review Board (IRB) of Seoul National University reviewed and approved the informed consent, the study protocols and other documents (Permit No. E1605/002-003). All methods were performed in accordance with the relevant guidelines and regulations.

Korea Associated Resource (KARE) cohort data

Korea Associated Resource (KARE) data are based on a community-based epidemiological study and comprises subjects residing in Ansan (urban area) and Ansung (rural area) in the Gyeonggi Province of South Korea (Cho et al. 2009). A baseline survey was completed in 2001–2002, and 10,030 participants aged 40–69 years were recruited. After that, biennial repeated surveys were conducted, and the last survey were completed in 2013–2014 (Kim et al. 2017). Six different surveys were conducted in total. These measurement periods are indicated as periods 1–6 throughout this study, each with a different number of subjects (e.g., period 1: 8543 subjects [4052 male, 4491 female]; period 6: 5391 subjects [2502 male, 2889 female]). The number of overlapping subjects throughout the six periods was 4306 (2009 male, 2297 female). Among these, subjects whose traits were measured at least three times were considered, and 6843 participants (3273 male, 3570 female) were assessed in total. Many participant phenotypes were recorded by trained interviewers through questionnaires and clinical measurement, and we only considered the 16 quantitative traits that they were measured objectively and associated with major health indices; these were classified into four groups: anthropometric, biochemistry, cardiopulmonary, and red blood cell traits (Table 1). As glycated hemoglobin (HbA1c), fasting blood glucose (GLU0), high-density lipoprotein (HDL), triglycerides (TG), and systolic blood pressure (SBP) displayed skewed distributions, they were log-transformed and denoted by log(HbA1c), log(GLU0), log(HDL), log(TG), and log(SBP), respectively. The missing rate of HbA1c was larger than 0.5 at period 2 and was excluded from further analyses. For each trait, subjects with more than three measurement observations were assessed.
Table 1

Sixteen phenotypic traits associated with major health indices

Anthropomorphic traitsHeight, Waist, Weight, body-mass index (BMI)
Biochemistry traits
 GlucoseGlycated hemoglobin (HbA1c), Fasting blood glucose (GLU0)
 CholesterolLow-density lipoprotein (LDL), high-density lipoprotein (HDL), total cholesterol (TCHL), triglyceride (TG)
Cardiopulmonary traits
 Blood pressureSystolic blood pressure (SBP), diastolic blood pressure (DBP)
 Lung capacityPredicted forced vital capacity (FVC) %, predicted forced expiratory volume in one second (FEV1) %, predicted FEV1/FVC  %
Red blood cell traitsHemoglobin levels (Hb)
Sixteen phenotypic traits associated with major health indices

Genotypes

Genotype data for the KARE cohort were obtained using the Affymetrix Genome-Wide Human SNP array 5.0 (Cho et al. 2009). Quality control (QC) analysis of SNPs and subjects was conducted using PLINK (Purcell et al. 2007) and ONETOOL (Song et al. 2018). SNPs with P-values from Hardy–Weinberg equilibrium (HWE) analysis < 10−5, minor allele frequencies (MAFs) < 0.05, and genotype call rates < 95% were excluded. Furthermore, subjects with missing genotype call rates > 5% or sex-based inconsistencies were excluded. The missing genotypes for typed SNPs were imputed on the basis of the 1000-genome sequence reference data. After QC analysis, 305,158 SNPs were analyzed for SNP heritability and GWAS (Fig. 1).
Fig. 1

A schematic representation of heritability analysis and the genome-wide association study

A schematic representation of heritability analysis and the genome-wide association study

Data availability

The data used in this study underwent an application process according to the Korean Genome and Epidemiology study data access policy, and can be downloaded from following website: http://nih.go.kr/index.es?sid=a5#a.

Determination of phenotypic averages and longitudinal changes in each subject

The phenotypic averages and longitudinal changes for each subject were determined and used to estimate SNP heritability and for the GWAS. Significant differences in phenotypic variances were observed for each period, and such heteroscedasticity was considered for phenotypic averages and longitudinal changes for each subject as follows. First, the linear regression model for subjects of the same period was adjusted for traits. The effect of sex, age, and top 10 principal component (PC) scores estimated from the genetic relationship matrix were adjusted as covariates. Considering as the residual variances of trait k (k = 1, …, 16) at the period j, the following linear regression model was adjusted with repeated measures for each subject i as follows:Here, i indicates the ith subject, and indicates the mean of ages at the observed time points. In the regression model (1), is the jth period observed value of subject i for trait k, indicates the expected phenotypic mean of subject i for trait k when he or she is years old, and is the average longitudinal change in trait k. The estimated values of and were used to estimate the heritability and for GWAS analysis. For convenience, both are denoted by and , respectively.

Estimation of heritability

and were separately used to estimate SNP heritability. Analyses were conducted using GCTA (Yang et al. 2011), and the restricted maximum likelihood estimator was used. Effects of age and sex were adjusted as covariates.

GWAS analysis

and were separately analyzed to identify disease susceptibility loci for 16 different traits. The , sex, and 10 PC scores estimated from the genetic relationship matrix were included as covariates to adjust the population stratification.

Results

A schematic representation of the heritability analysis and genome-wide association study is shown in Fig. 1. For 16 different traits of 6843 subjects, the mean and standard deviation values of each trait at period 1 are shown in Table 2 (see Table 1 for detailed information). Some missing values resulted in differences in the total number of subjects depending on the phenotype, and the sample sizes of those traits and descriptive statistics including sex and age were summarized.
Table 2

Descriptive statistics of 16 traits

TraitTrait (baseline)Total (N)FemaleAge
MeanSDN%MeanSD
Height (cm)160.118.636823355752.13%51.908.69
Waist (cm)82.638.706835356752.19%51.908.69
Weight (kg)63.2410.106822355652.13%51.908.69
BMI (kg/m2)24.623.106822355652.13%51.908.69
HbA1c (%)5.740.826329332152.47%51.878.62
GLU0 (mg/dl)86.7319.416728351452.23%51.858.67
TG (mg/dl)161.47103.196840356852.16%51.918.70
LDL (mg/dl)115.0032.896840356852.16%51.918.70
HDL (mg/dl)44.699.916840356852.16%51.918.70
TCHL (mg/dl)191.9235.096840356852.16%51.918.70
SBP (mmHg)121.1218.106843357052.17%51.918.70
DBP (mmHg)80.1911.336843357052.17%51.918.70
Hb (g/dl)13.611.576840356852.16%51.918.70
FVC (%predicted)104.7614.174291213549.76%50.378.17
FEV1 (%predicted)112.2716.624290213449.74%50.378.16
FEV1/FVC (predicted)74.891.774291213549.76%50.378.17
Descriptive statistics of 16 traits For those 6843 subjects, a multidimensional scaling (MDS) plot was generated (Fig. 2). As shown in Fig. 2, subjects from the 1000 Genomes Project were also included, and our analyses were not affected by population stratification.
Fig. 2

Population structures identified via a multidimensional scaling (MDS) plot. This plot shows that our analyses (KARE) are not affected by population stratification. AFR, AMR, EAS, EUR, and SAS indicate African, Ad Mixed American, East Asian, European, and South Asian populations, respectively, from the 1000 Genomes Project

Population structures identified via a multidimensional scaling (MDS) plot. This plot shows that our analyses (KARE) are not affected by population stratification. AFR, AMR, EAS, EUR, and SAS indicate African, Ad Mixed American, East Asian, European, and South Asian populations, respectively, from the 1000 Genomes Project We calculated the descriptive statistics for and (Table 3, see “Materials and methods” for details). in Eq. (1) indicates the means of the predicted traits at years of age. stands for the longitudinal changes in the traits of each subject. Table 3 shows that the means of are similar to those of period 1. Means of were generally closer to 0. Figure 3 shows the estimates of heritability with as the response in the GREML model, and the estimated heritability of height for the data peaked at 0.318 (P = 1.665 10−16, FDR = 2.664 10−15). The subsequent three highest heritability traits were total cholesterol (TCHL), log(HDL), and low-density lipoprotein (LDL), with values of 0.265 (P = 3.895 10−12, FDR = 3.116 10−11), 0.241 (P = 8.911 10−10, FDR = 4.753 10−9), and 0.222 (P = 5.178 10−9, FDR = 1.657 10−8), respectively. These three traits are cholesterol-related. The heritability of waist was 0.218 (P = 5.016 10−9, FDR = 1.657 10−8) and that of weight was 0.196 (P = 2.046 10−7, FDR = 5.456 10−7). The heritability was 0.195 for Hb (P = 4.926 10−7, FDR = 9.852 10−7) and 0.192 for log(TG) (P = 4.419 10−7, FDR = 9.852 10−7). The heritability of the other traits with an FDR larger than 1 10−6 were less than 0.19.
Table 3

Summary of and of 16 traits

Trait\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$B_{0}$$\end{document}B0\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$B_{1}$$\end{document}B1
MeanSDMinMaxMeanSDMinMax
Height159.9068.724130.241187.866− 0.0600.139− 2.1680.747
Waist83.7438.48058.333121.5910.1840.692− 4.9686.904
Weight62.8609.93130.532105.355− 0.0940.480− 3.7392.657
BMI24.5312.99214.19738.831− 0.0190.185− 1.4861.048
log(HbA1c)1.7370.1071.2562.4410.0020.011− 0.0930.157
log(GLU0)4.5380.1494.2605.7330.0120.018− 0.1660.171
log(TG)4.8340.4233.5847.189− 0.0120.057− 0.4090.412
LDL120.11125.75711.833281.5900.1934.065− 29.21828.549
log(HDL)3.7820.1933.1004.567− 0.0010.024− 0.2110.135
TCHL194.20828.11497.986343.106− 0.1204.468− 34.59929.468
log(SBP)4.7680.1144.4615.156− 0.0010.017− 0.1220.086
DBP78.2528.23950.639111.556− 0.2591.358− 12.3308.066
Hb13.6951.3707.76418.8890.0220.147− 1.6411.468
FVC104.54113.46646.629162.844− 0.0902.478− 13.06513.685
FEV1111.12816.29538.951184.532− 0.2392.575− 16.02215.620
FEV1/FVC73.9451.80967.65478.000− 0.2130.127− 1.2461.244
Fig. 3

Single-nucleotide polymorphism heritability estimates of 16 traits with as the response. Error bars correspond to standard error values. The values above the error bar are P values and false discovery rate (FDR; bold)

Summary of and of 16 traits Single-nucleotide polymorphism heritability estimates of 16 traits with as the response. Error bars correspond to standard error values. The values above the error bar are P values and false discovery rate (FDR; bold) Figure 4 shows the heritability estimates for , which are generally less than those for , and we found that lung function traits FVC and FEV1, wasit, diastolic blood pressure (DBP), BMI, and log(SBP) are relatively high. The highest heritability estimate was observed for FEV1 (0.171) and its FDR-adjusted P value was 0.0189. The heritability estimates of other traits were less than 0.1. The second highest heritability was 0.0941 for FVC, and its FDR-adjusted P value was 0.166. The heritability of waist was also relatively higher than that of other traits. Waist heritability and the FDR-adjusted P values were 0.0082 and 0.0657, respectively. The higher heritability estimates for indicate that the decreasing/increasing rates were associated with genetic factors. Height displayed the highest heritability estimates for , but its estimate for was low (0.0297). Height does not usually change after the age of 20, and its lower value here is probably attributable to it. For the other traits including log(HbA1c), LDL, log(HDL), TCHL, and Hb levels, SNP heritability estimates tended towards 0.
Fig. 4

Single-nucleotide polymorphism heritability estimates of 16 traits with as the response. Error bars correspond to standard error values. The values above the error bar are P values and the false discovery rate (FDR; bold), and “*” indicates significant findings at an FDR of 0.05

Single-nucleotide polymorphism heritability estimates of 16 traits with as the response. Error bars correspond to standard error values. The values above the error bar are P values and the false discovery rate (FDR; bold), and “*” indicates significant findings at an FDR of 0.05 Furthermore, we determined chromosomal heritability estimates of FEV1, which displayed the highest heritability in the model. Consequently, chromosome 2 accounted for the highest proportion of phenotypic variance ( = 0.0397), albeit with a high standard error (Fig. 5).There was a significant positive correlation between chromosome length and heritability (r = 0.58, P = 0.0045) in FEV1 (Fig. 6).
Fig. 5

Single-nucleotide polymorphism heritability estimates of FEV1 based on chromosomes with as the response. Error bars correspond to standard error values. The values above the error bar are P values and false discovery rate (FDR; bold)

Fig. 6

Correlation between chromosome length and chromosome-specific heritability. Numbers near dots are the chromosome numbers. Black line is the estimated regression line

Single-nucleotide polymorphism heritability estimates of FEV1 based on chromosomes with as the response. Error bars correspond to standard error values. The values above the error bar are P values and false discovery rate (FDR; bold) Correlation between chromosome length and chromosome-specific heritability. Numbers near dots are the chromosome numbers. Black line is the estimated regression line

Genome-wide association studies

and were considered responses for GWAS. Tables 4 and 5 show genome-wide significant SNPs at a significance level of 1 × 10−7. Subsequent findings are summarized in the Supplementary material (Tables S1 and S2). Table 4 shows that SNPs have relatively lower P values for log(TG) and log(HDL) than any other trait. The most significant variant for log(TG) was rs6589566 in ZPR1 with a P-value of 7.9 10−38, the lowest P value among all 16 traits. Furthermore, ZPR1 is associated with TG (Coram et al. 2013). The most significant variant of log(HDL) was rs16940212 with a P value of 2.08 10−18 in ALDH1A2, which is associated with HDL (Spracklen et al. 2017). Certain other significant variants were significantly associated with proximal genes and with traits assessed herein. The variant rs180349 (P = 8.86 10−35) of log(TG) was proximal to BUD13, which is associated with TG (Hoffmann et al. 2018). The variant rs17482753 (P = 3.199 10−18) is proximal to LPL, which was strongly associated with HDL (Hoffmann et al. 2018). Herein, we also detected some de novo variants including rs4922117 (P = 2.13 10−15) of log(HDL) and rs2335418 (P = 3.2 10−9) of LDL, which were previously unknown; however, both their proximal genes LPL and HMGCR are significantly associated with each trait (Hoffmann et al. 2018). The Manhattan Plot and QQ plot for the model with as the response are provided in the Supplementary material (Figures S1 and S2).
Table 4

Results of the genome-wide association study with as the response

TraitSNPCHRBP1BP2A1A2GPGENEMAFHWE_PBETAP
log(GLU0)rs1799884744,229,06844,229,068AGUpstreamGCK0.18720.90510.018895.62E−09
log(GLU0)rs7754840620,661,25020,661,250CGIntronicCDKAL10.47810.014886.25E−09
Hbrs57565052237,467,35437,467,354CGIntronicTMPRSS60.49790.92290.11252.61E−13
Hbrs3768751246,346,71646,346,716GAIntronicPRKCE0.17961− 0.1131.79E−08
log(HbA1c)rs7754840620,661,25020,661,250CGIntronicCDKAL10.47810.012545.41E−11
log(HDL)rs169402121558,694,02058,694,020TGIntergenicALDH1A20.34140.97860.029882.08E−18
log(HDL)rs17482753819,832,64619,832,646TGIntergenicLPL(dist = 7876),SLC18A1(dist = 169,720)0.12410.13330.042323.20E−18
log(HDL)rs4922117819,852,58619,852,586GAIntergenicLPL(dist = 27,816),SLC18A1(dist = 149,780)0.20770.4180.03162.13E−15
LDLrs5998391109,822,166109,822,166GADownstreamPSRC10.064560.1925− 5.8861.53E−11
LDLrs12654264574,648,60374,648,603TAIntronicHMGCR0.47580.8085− 2.6251.41E−09
LDLrs2335418574,603,47974,603,479GAIntergenicANKRD31(dist = 70,776),HMGCR(dist = 29,514)0.42320.5689− 2.63.20E−09
LDLrs4045166574,909,44674,909,446GCIntronicANKDD1B0.33260.31372.7273.55E−09
LDLrs10942739574,786,08374,786,083TCIntronicCOL4A3BP0.33250.2762.7094.61E−09
LDLrs6881911,227,60211,227,602TCExonicLDLR0.1360.7973.4026.63E−08
TCHLrs5998391109,822,166109,822,166GADownstreamPSRC10.064560.1925− 6.8221.19E−12
TCHLrs780092227,743,15427,743,154GAIntronicGCKR0.32480.2948− 3.3654.63E−11
TCHLrs173215158126,486,409126,486,409TCIntergenicTRIB1(dist = 35,762),LINC00861(dist = 448,358)0.44250.041782.7824.20E−09
TCHLrs1881396227,844,60127,844,601GTUTR3ZNF5120.33130.8699− 2.7933.18E−08
TCHLrs6861279574,919,40974,919,409TCIntronicANKDD1B0.33860.17722.7843.99E−08
TCHLrs6734059227,808,15427,808,154CTIntronicZNF5120.33570.8921− 2.7246.36E−08
log(TG)rs658956611116,652,423116,652,423CTIntronicZPR10.21690.35450.11137.90E−38
log(TG)rs18034911116,611,827116,611,827ATIntergenicLINC00900(dist = 980,909),BUD13(dist = 7059)0.22650.7520.10658.86E−35
log(TG)rs10503669819,847,69019,847,690TGIntergenicLPL(dist = 22,920),SLC18A1(dist = 154,676)0.12070.003444− 0.089021.63E−16
log(TG)rs780094227,741,23727,741,237CTIntronicGCKR0.46260.9806− 0.056562.60E−15
log(TG)rs711524211116,908,283116,908,283TCIntronicSIK30.27960.1330.059878.66E−14

Only the significant variants with P values less than 1 × 10−7 in each trait are included. The more significant results are listed in the supplement material (Table S1)

Table 5

Results of the genome-wide association study with as the response

TraitSNPCHRBP1BP2A1A2GPGENEMAFHWE_PBETAP
FEV1rs2272402311,075,46111,075,461AGIntronicSLC6A10.073630.1473− 0.58231.22E−08
FVCrs2272402311,075,46111,075,461AGIntronicSLC6A10.073630.1473− 0.5951.40E−09

Only the variants with P values less than 1 × 10−7 are included. The more variants under suggestive threshold (P values less than 1 × 10−5) are listed in the supplement material (Table S2)

Results of the genome-wide association study with as the response Only the significant variants with P values less than 1 × 10−7 in each trait are included. The more significant results are listed in the supplement material (Table S1) Results of the genome-wide association study with as the response Only the variants with P values less than 1 × 10−7 are included. The more variants under suggestive threshold (P values less than 1 × 10−5) are listed in the supplement material (Table S2) Table 5 shows the results of GWAS for . Based on the results, rs2272402 (SLC6A1) was the most significant variant both in FEV1 (P = 1.22 10−8) and FVC (P = 1.40 10−9) (Regional plots are shown in Supplementary Figures S5 and S7), and the SLC6A1 enhancer was associated with lung function. Other variants including rs7209788 (NARF, P = 3.36 10−7) for FEV1 and rs2668162 (FAM19A1, P = 6.18 10−7) for FVC had P-values less than the 1 10−6 threshold (S2 Table). From the regional plot (Supplementary Figure S6), we found that rs4789777(HEXDC, P = 4.599 10−6) is highly correlated with rs7209788 of FEV1. The Manhattan Plot and QQ plot for the model with as the response are provided in the Supplementary material (Figures S3 and S4).

Discussion

In this study, SNP-based heritability estimates of 16 phenotypic traits were estimated longitudinal data from a 10-year follow-up of the KARE cohort. The GCTA tool was used with a two-stage approach to determine the heritability estimate of phenotypic mean and longitudinal changes in each trait. Moreover, chromosomal heritability estimates were determined and GWAS analyses were performed using the same approach. Overall, heritability estimates within the population-based cohort including KARE are potentially lower than those of pedigree or twin studies for all 16 traits, regardless of whether the response is that phenotypic mean of traits or which stands for the changes by time of traits. For example, the heritability of height herein was estimated to be approximately 0.318 with as the response, which is lower than the conventional heritability estimate of height of approximately 0.8 based on the assumption-free model (Visscher et al. 2006). In the case of TCHL and LDL, each heritability estimate was determined to be 0.265 and 0.22, respectively, which are also lower than the heritability estimates of 0.67 and 0.69 for TCHL and LDL, respectively, on familial and pedigree analysis (van Dongen et al. 2013). The underlying reason may be explained on the basis of the missing heritability, which describes the difference in values between heritability estimated via GWAS and via familial studies (Sandoval-Motta et al. 2017). However, systemic inflation of estimated heritability estimates of polygenic phenotypes in familial studies may be confounded owing to a shared environment or environment-dependent genetic effects (Robinson et al. 2017). Therefore, the population-based design similar to that of the present study potentially represents the average genetic effects regardless of various confounding environmental factors. Based on the present and model, the heritabilities of are markedly lower than those of , indicating that most of the genetic variance of traits are not temporally influenced. Here, was not determined from the baseline measurements of traits but rather the average values of repeated measurements to yield a more robust and reasonable result. If baseline measurement and longitudinal changes calculated from those were considered responses during the estimation of heritability, the estimate may have been potentially inaccurate owing to the correlation between baseline and values. Thus, by applying a regression model to estimate the average and longitudinal changes , the effect of on in each subject could be removed. On GWAS, the two-stage model elucidated significant variants associated with the traits and their changes in the longitudinal data. We confirmed several proven variants and identified some other significant unreported variants. In the case of the model, rs4922117 (P = 2.13 10−15) of log (HDL) and rs2335418 (P = 3.2 10−9) of LDL were both unreported; however, their proximal genes LPL and HMGCR, respectively, were significantly associated with each trait (Hoffmann et al. 2018). Furthermore, unreported genes, such as rs180349, including non-coding SNPs with a significant P value for TG are proximal to BUD13, which is strongly associated with TG (Hoffmann et al. 2018). Variants including rs17482753 also had significant P values and was proximal to LPL, which is strongly associated with the HDL trait (Hoffmann et al. 2018). In the model, rs2272402 (SLC6A1, P = 1.22 10−8) was significant in both FEV1 and FVC lung function. The SLC6A1 enhancer is associated with pulmonary function. Therefore, the present results are concurrent with previous findings regarding genes associated with each phenotype. Among the 16 phenotypic traits in this study, only FEV1 displayed longitudinally significant heritability herein (Fig. 4), thus reliably reflecting the physiological state of the lungs and airways and acting as a predictor of morbidity and mortality in the general population; FEV1 is also widely used to define chronic obstructive pulmonary disease (COPD) (Young et al. 2007). Lung function develops in early life, peaks at a specific time point in early adulthood, and subsequently declines with age. Therefore, the decline of lung function in middle-aged and older individuals is suggested to be heritable in the general population (Gottlieb et al. 2001). However, longitudinal studies on FEV1 and FEV1/FVC have suggested several significant genetic regions that markedly differ from the numerous genetic variants associated with lung function, with FEV1 being estimated at a single time point (John et al. 2017; Tang et al. 2014). Hence, gene-environment interactions and significant genetic heterogeneity in lung function have been  observed in diseases such as asthma or COPD (Hansel et al. 2013; Imboden et al. 2012). Accordingly, the present study included the middle-aged general population with similar environmental exposure without specific lung diseases, thus suggesting that intact FEV1 decreased due to aging. Therefore, the present results show that FEV1 has significant SNP heritability for longitudinal changes (FDR = 0.0012 for FEV1). This study has several limitations. First, the analysis of new variants in the present GWAS was not replicated for other cohorts. Second, the two-stage approach is statistically inefficient even though it is computationally fast. However, the sample size was very large, which hopefully minimized this problem. Furthermore, we considered subjects with at least three or more measurements, which potentially minimize statistical power loss. Third, gene-environment interactions were not analyzed, although the estimation of random effects in the mixed model was elusive. Fourth, GCTA itself has limitations for reasons such as data overfitting and skewed singular values (Kumar et al. 2016). Though our study optimized parameters to attain accurate results using GCTA, our sample size might have resulted in certain variations in comparison with other large studies. Furthermore, the issue regarding missing heritability was inevitable to an extent because the Affymetrix genotypic array represents only common variants for SNPs, while rare genetic SNP variants were not included herein (Bandyopadhyay et al. 2017). Despite the aforementioned limitations, our study elucidates heritability estimates via a two-stage approach using a mixed model in GCTA and a GWAS, which further determines longitudinal change effects independently with a linear model, followed by estimating heritability using regression coefficients. This approach provides a reasonable and easy method to estimate heritability in longitudinal data and potentially assess both heritability of the phenotypic mean and changes through several periods. Essentially, our results show that significant SNP heritability is objectively confirmed for longitudinal changes in lung function decline including FEV1 in comparison with other health-related indices. Therefore, genetic studies on longitudinal FEV1 decline among the middle-aged general population and chromosome 2, which attributes the most in genetic variance should be encouraged. Below is the link to the electronic supplementary material. Supplementary material 1 (DOCX 2019 kb)
  29 in total

1.  Heritability of longitudinal change in lung function. The Framingham study.

Authors:  D J Gottlieb; J B Wilk; M Harmon; J C Evans; O Joost; D Levy; G T O'Connor; R H Myers
Journal:  Am J Respir Crit Care Med       Date:  2001-11-01       Impact factor: 21.405

2.  Age-dependent traits: a new statistical model to separate within- and between-individual effects.

Authors:  M van de Pol; S Verhulst
Journal:  Am Nat       Date:  2006-03-20       Impact factor: 3.926

3.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

4.  Annual change in pulmonary function and clinical phenotype in chronic obstructive pulmonary disease.

Authors:  Masaharu Nishimura; Hironi Makita; Katsura Nagai; Satoshi Konno; Yasuyuki Nasuhara; Masaru Hasegawa; Kaoruko Shimizu; Tomoko Betsuyaku; Yoichi M Ito; Satoshi Fuke; Takeshi Igarashi; Yasushi Akiyama; Shigeaki Ogura
Journal:  Am J Respir Crit Care Med       Date:  2012-01-01       Impact factor: 21.405

5.  Concepts, estimation and interpretation of SNP-based heritability.

Authors:  Jian Yang; Jian Zeng; Michael E Goddard; Naomi R Wray; Peter M Visscher
Journal:  Nat Genet       Date:  2017-08-30       Impact factor: 38.330

6.  Heritability of metabolic syndrome traits in a large population-based sample.

Authors:  Jenny van Dongen; Gonneke Willemsen; Wei-Min Chen; Eco J C de Geus; Dorret I Boomsma
Journal:  J Lipid Res       Date:  2013-08-05       Impact factor: 5.922

7.  Genome-wide characterization of shared and distinct genetic components that influence blood lipid levels in ethnically diverse human populations.

Authors:  Marc A Coram; Qing Duan; Thomas J Hoffmann; Timothy Thornton; Joshua W Knowles; Nicholas A Johnson; Heather M Ochs-Balcom; Timothy A Donlon; Lisa W Martin; Charles B Eaton; Jennifer G Robinson; Neil J Risch; Xiaofeng Zhu; Charles Kooperberg; Yun Li; Alex P Reiner; Hua Tang
Journal:  Am J Hum Genet       Date:  2013-05-30       Impact factor: 11.025

8.  Genome-wide association study of lung function decline in adults with and without asthma.

Authors:  Medea Imboden; Emmanuelle Bouzigon; Ivan Curjuric; Adaikalavan Ramasamy; Ashish Kumar; Dana B Hancock; Jemma B Wilk; Judith M Vonk; Gian A Thun; Valerie Siroux; Rachel Nadif; Florent Monier; Juan R Gonzalez; Matthias Wjst; Joachim Heinrich; Laura R Loehr; Nora Franceschini; Kari E North; Janine Altmüller; Gerard H Koppelman; Stefano Guerra; Florian Kronenberg; Mark Lathrop; Miriam F Moffatt; George T O'Connor; David P Strachan; Dirkje S Postma; Stephanie J London; Christian Schindler; Manolis Kogevinas; Francine Kauffmann; Debbie L Jarvis; Florence Demenais; Nicole M Probst-Hensch
Journal:  J Allergy Clin Immunol       Date:  2012-03-16       Impact factor: 10.793

9.  Comparison of methods that use whole genome data to estimate the heritability and genetic architecture of complex traits.

Authors:  Luke M Evans; Rasool Tahmasbi; Scott I Vrieze; Gonçalo R Abecasis; Sayantan Das; Steven Gazal; Douglas W Bjelland; Teresa R de Candia; Michael E Goddard; Benjamin M Neale; Jian Yang; Peter M Visscher; Matthew C Keller
Journal:  Nat Genet       Date:  2018-04-26       Impact factor: 38.330

10.  A large electronic-health-record-based genome-wide study of serum lipids.

Authors:  Thomas J Hoffmann; Elizabeth Theusch; Tanushree Haldar; Dilrini K Ranatunga; Eric Jorgenson; Marisa W Medina; Mark N Kvale; Pui-Yan Kwok; Catherine Schaefer; Ronald M Krauss; Carlos Iribarren; Neil Risch
Journal:  Nat Genet       Date:  2018-03-05       Impact factor: 38.330

View more
  1 in total

1.  Heritability Analyses Uncover Shared Genetic Effects of Lung Function and Change over Time.

Authors:  Donghe Li; Woojin Kim; Jahoon An; Soriul Kim; Seungku Lee; Ahra Do; Wonji Kim; Sanghun Lee; Dankyu Yoon; Kwangbae Lee; Seounguk Ha; Edwin K Silverman; Michael Cho; Chol Shin; Sungho Won
Journal:  Genes (Basel)       Date:  2022-07-15       Impact factor: 4.141

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.