Literature DB >> 32522283

Whole-genome sequencing of Chinese centenarians reveals important genetic variants in aging WGS of centenarian for genetic analysis of aging.

Shuhua Shen1, Chao Li2, Luwei Xiao1, Xiaoming Wang1, Hang Lv3, Yuan Shi3, Yixue Li4,5, Qi Huang6.   

Abstract

BACKGROUND: Genetic research on longevity has provided important insights into the mechanism of aging and aging-related diseases. Pinpointing import genetic variants associated with aging could provide insights for aging research.
METHODS: We performed a whole-genome sequencing in 19 centenarians to establish the genetic basis of human longevity.
RESULTS: Using SKAT analysis, we found 41 significantly correlated genes in centenarians as compared to control genomes. Pathway enrichment analysis of these genes showed that immune-related pathways were enriched, suggesting that immune pathways might be critically involved in aging. HLA typing was next performed based on the whole-genome sequencing data obtained. We discovered that several HLA subtypes were significantly overrepresented.
CONCLUSIONS: Our study indicated a new mechanism of longevity, suggesting potential genetic variants for further study.

Entities:  

Keywords:  Centenarian; Longevity; WGS

Mesh:

Year:  2020        PMID: 32522283      PMCID: PMC7285530          DOI: 10.1186/s40246-020-00271-7

Source DB:  PubMed          Journal:  Hum Genomics        ISSN: 1473-9542            Impact factor:   4.639


Introduction

With the development of human genomics research, a large number of studies of the genetics of longevity have been conducted. Scientists from various countries have proposed many different theories concerning the mechanisms of aging from different perspectives, involving oxidative stress, energy metabolism, signal transduction pathways, immune response, etc. [1, 2]. These mechanisms interact with each other and are influenced by heredity to some degree [2, 3]. The identification of longevity-related biological markers is critical to an in-depth understanding of the mechanisms of carrier protection against common disease and/or of the retardation of the process of aging. Studies revealed from 300 to 750 genes related to longevity that are critically involved in a variety of life activities, such as growth and development, energy metabolism, oxidative stress, genomic stability maintenance, and neurocognition [4]. These candidate genes include mainly APOE, a gene involved in lipoprotein metabolism [5, 6]. Others are those involved in cell cycle regulation, cell growth and signal transduction, the maintenance of genome stability, and the endocrine-related pathway [7-9]. In addition, the candidates for longevity encompass genes related to drug metabolism, the ones involved in protein folding, stabilization, and degradation, as well those related to coagulation and regulation of circulation [10], etc. In most cases, these genes or their polymorphic sites were examined in multiple population replication studies, which discovered certain longevity-associated genes or pathways [4-10]. Besides, longevity is associated with immunity and inflammation [11, 12]. HLA gene, also known as the major histocompatibility complex gene, encodes the major histocompatibility complex (MHC), which is a gene family existing in most vertebrate genomes, closely related to the immune system [13]. Earlier study indicated that HLA may be the genetic basis for the specific response patterns of longevity and longevity immunity [14]. Inflammatory cytokines, such as TNF-α, IL-1β, and IL-6, may be key players [15]. IL-10 limits and terminates the inflammatory response by inhibiting the action of T cells, monocytes, and macrophages, and thus, the genetic variants of this gene may also affect the longevity phenotype [16]. However, at present, most of the investigations on longevity factors of centenarians are performed on a small number of candidate miRNAs, single tissues, or single samples, and only few studies have systematically conducted analyses of multiple tissues and copies at the whole-genome level [17-19]. Based on the results of previous cohort study, in this study, we aimed to use genome-wide sequencing technology to conduct genome-wide association studies and analysis of centenarians. Our findings would facilitate a more accurate focus on the most important genetic basis and molecular mechanisms associated with longevity. The conclusions of this study can serve as the basis for the public efforts towards the extension of the length of life. Moreover, they will provide a scientific reference for further clinical research on disease treatment and overall health care promotion.

Results

SKAT analysis revealed significantly correlated genes in aging

The sequencing platform Illumina XTen (Illumina, San Diego, CA, USA) was used for sequencing of the entire genome of 19 centenarians at an average depth of 30×. The sequencing quality metrics are provided in Supplementary Table 1. Baseline information of the centenarians is shown in Table 1. The identified variants were annotated, and non-synonymous variants affecting gene function were selected for association analysis.
Table 1

Characteristics of the centenarians

Age at drawSexBMIMajor age-related diseasesRace
104Male22.6NoneEastern Asiatic Mongoloid
103Male21.8NoneEastern Asiatic Mongoloid
102Female20.6NoneEastern Asiatic Mongoloid
105Female19.8NoneEastern Asiatic Mongoloid
116Female27.2NoneEastern Asiatic Mongoloid
101Female22.4NoneEastern Asiatic Mongoloid
102Female20.6NoneEastern Asiatic Mongoloid
104Male18.6NoneEastern Asiatic Mongoloid
103Male21.5NoneEastern Asiatic Mongoloid
103Female20.3NoneEastern Asiatic Mongoloid
101Male24.5NoneEastern Asiatic Mongoloid
105Female22.5NoneEastern Asiatic Mongoloid
103Male21.6NoneEastern Asiatic Mongoloid
104Female20.7NoneEastern Asiatic Mongoloid
102Female19.8NoneEastern Asiatic Mongoloid
100Female22.3NoneEastern Asiatic Mongoloid
103Male21.4NoneEastern Asiatic Mongoloid
114Female20.6NoneEastern Asiatic Mongoloid
100Male26.2NoneEastern Asiatic Mongoloid
Characteristics of the centenarians The experiment design is shown in Fig. 1. Association analysis is applied to WGS data to find important gene and pathways. All centenarian and controls were Eastern Asiatic Mongoloids ascertained to be of Chinese descent (Zhejiang Province, Southeast China). Correlative analysis involved mainly association analysis, SKAT, and Burden tests for rare variants. These methods are commonly employed for GWAS research, especially for case/control samples. They are identified by the difference in frequency of occurrence of variants between case and control samples. Variations associated with the phenotype of the disease, generally directed against common variants, are detected using this method, where the frequency of occurrence and the variation contribute to the phenotype of the disease. Annotations and literature surveys can be used for significantly related variants to further determine the effect of related genes and variants on gene function.
Fig. 1

Overall design of the study. WGS were applied to 19 centenarians. Association test was used to select candidate variants/genes. Function analysis was then applied

Overall design of the study. WGS were applied to 19 centenarians. Association test was used to select candidate variants/genes. Function analysis was then applied Based on the whole-genome sequencing data, this analysis was performed on the variants detected with MAF > 1%. The control sample consisted of selected 1000G East Asian population data, and the total number of the control samples is 208 [20]. PCA analysis was conducted to evaluate the stratification of the case and control group (supplementary Fig 1). For rare variants with a low frequency of variation, analysis using the method Rare variants case/control association test was performed. A total number of 41 (Supplementary Table 2) significantly correlated genes were obtained through SKAT analysis. The top 10 genes were as follows: PABPC3, BAGE2, HLA-DRB1, PDE4DIP, PADI4, CHI3L2, MUC17, WARS, HLA-DRB5, and SIRPB1 (Table 2).
Table 2

Significantly correlated genes in SKAT analysis

GeneSKAT_Common.OnlySKAT_Rare.OnlySKAT_allSKAT_Rare.proSKAT_Common.proBurden_allBurden__Rare.proBurden__Common.pro
PABPC31.97E−274.57E−324.41E−312.72E−482.26E−512.50E−301.46E−503.13E−51
BAGE21.70E−276.41E−218.76E−257.30E−361.52E−458.66E−262.20E−375.57E−30
HLA-DRB10.032542111.57E−142.66E−130.0291139044.24E−301.94E−131.33E−211.37E−20
PDE4DIP4.19E−071.71E−132.01E−121.13E−093.32E−325.43E−131.40E−168.32E−16
PADI40.3513949734.20E−104.84E−070.5641809133.26E−192.18E−083.17E−109.98E−11
CHI3L20.1826763242.30E−093.99E−099.37E−082.84E−243.99E−069.70E−096.54E−09
MUC170.6400112542.33E−094.68E−082.45E−083.01E−182.60E−062.54E−103.23E−10
WARS0.2482689832.07E−086.87E−066.88E−051.99E−158.07E−071.61E−082.34E−08
HLA-DRB50.0211025181.05E−065.24E−070.017766923.01E−128.72E−084.27E−115.39E−11
SIRPB10.4793936212.95E−063.42E−065.33E−103.26E−100.0013823030.0001846810.000282752
Significantly correlated genes in SKAT analysis

Immune system-related pathway was significantly enriched

The significant genes were subjected to differential pathway enrichment analysis. Then, MutsigDB was used to enrich the KEGG and Reactome pathways. As can be seen in Table 3, the associated genes were significantly enriched in the pathways related to immune and inflammatory responses, such as those of interferons, antibodies, and immunity.
Table 3

Significantly enriched pathway

Gene set name[#Genes(K)]Description#Genes in Overlap(k)p valueFDR q value
REACTOME_INTERFERON_GAMMA_SIGNALING[63]Interferon gamma signaling52.61E−092.81E−06
KEGG_ANTIGEN_PROCESSING_AND_PRESENTATION[89]Antigen processing and presentation51.52E−088.16E−06
KEGG_ALLOGRAFT_REJECTION[38]Allograft rejection43.55E−081.28E−05
KEGG_GRAFT_VERSUS_HOST_DISEASE[42]Graft-versus-host disease45.35E−081.40E−05
KEGG_TYPE_I_DIABETES_MELLITUS[44]Type I diabetes mellitus46.51E−081.40E−05
KEGG_CELL_ADHESION_MOLECULES_CAMS[134]Cell adhesion molecules (CAMs)51.19E−072.13E−05
KEGG_AUTOIMMUNE_THYROID_DISEASE[53]Autoimmune thyroid disease41.40E−072.15E−05
REACTOME_TRANSLOCATION_OF_ZAP_70_TO_IMMUNOLOGICAL_SYNAPSE[14]Translocation of ZAP-70 to immunological synapse32.21E−072.97E−05
REACTOME_INTERFERON_SIGNALING[159]Interferon signaling52.78E−073.32E−05
REACTOME_PHOSPHORYLATION_OF_CD3_AND_TCR_ZETA_CHAINS[16]Phosphorylation of CD3 and TCR zeta chains33.39E−073.66E−05
Significantly enriched pathway

HLA subtypes are correlated with aging

Based on the whole-genome sequencing data obtained, HLA-typing was performed. Through the analysis of HLA type distribution, as presented in Tables 3 and 4 and Fig. 2, we found that the type II HLA genes had an important relationship with longevity. Among them, the HLA DRB1 *13:02, HLA DRB1 *14:01, and HLA DRB1 *16:02 were significantly associated with longevity.
Table 4

HLA subtype percentage in the case and control groups

HLA typeRatioControl_PercentageCase_Percentage
A*02:033.70.030.11
DRB1*13:025.60.010.05
DRB1*14:014.20.020.08
DRB1*16:025.60.010.05
Fig. 2

HLA typing’s correlation with the centinarian group. Frequency ratio of every HLA type was plotted. Four HLA types with frequency ratio larger than 3 were marked

HLA subtype percentage in the case and control groups HLA typing’s correlation with the centinarian group. Frequency ratio of every HLA type was plotted. Four HLA types with frequency ratio larger than 3 were marked

Discussion

Researches on the genetic mechanisms of longevity have been conducted from many perspectives, including that on longevity-related genes, variants, and biological pathways [4, 10]. With the advancements in the NGS technology and analysis algorithms, increasingly more longevity-related genetic features could be found and would be useful for the understanding of mechanisms of longevity and related diseases [21, 22]. In our study, we used a small centenarian cohort to establish the association between the genetic variants and longevity. By SKAT analysis, rare variants were found that were related to the longevity phenotype. HLA-DRB1, HLA-DRB5, and PDE4 DIP have been reported to be associated with longevity [23-25]. Importantly, HLA-DRB1 variants have been specifically reported to have been significantly enriched in a French centenarian study [26]. Further, through the analysis of HLA type distribution, we found several subtypes of HLA DRB1 which have a closer relationship with longevity. Other significantly related genes that we found in this study, such as PABPC3, BAGE2, PADI4, CHI3L2, MUC17, WARS, and SIRPB1, have never been reported before, and their functions deserve further study. Pathway enrichment was performed and showed an important association of the immune-related pathway and the aging process. Previous examinations revealed that immune and inflammatory responses are closely related to ARD (aging-related diseases) [27, 28]. The significant differences in the gene enrichment of the related pathways suggest that a possible longevity mechanism may be associated with protective variants in genes that occur in the related pathways. Here, we have shown that genome-wide data can be further mined as compared with the findings of traditional SNP studies. For example, HLA-type analysis could also be associated with the phenotype, which appears to be a possibility for expanded mining of genome-wide data. Nonetheless, the relatively small sample size limited the power of our findings, and thus further validation by large cohort studies is required.

Conclusions

In conclusion, the findings of our study provide novel insights into aging mechanisms, suggesting the involvement of several genes, pathways, and specific HLA subtypes that are worth further investigations.

Materials and methods

Sample preparation

A random number table was used to randomly collect whole-blood samples from the centenarian cohort in Zhejiang Province, China. Ten percent of centenarians, that is, 19 centenarians, were chosen. All of them were free of major age-related diseases, i.e., cardiovascular or cerebrovascular disease, cancer, dementia, renal or hepatic failure, etc. They were informed about the study and signed a letter of consent, in accordance with the guidelines of the Ethics Committee of the First Affiliated Hospital of Zhejiang Chinese Medical University, Hangzhou, China (2015-KL-008-01). PBMC were isolated to be used for extraction of genomic DNA. Nineteen sex-matched samples with good physical condition were randomly selected from 1000G East Asian population data to serve as a control group [20]. All centenarian and controls were Eastern Asiatic Mongoloids ascertained to be of Chinese descent (Zhejiang Province, Southeast China).

Library preparation and whole-genome sequencing

Indexed Illumina NGS libraries were prepared from plasma DNA and germline genomic DNA. Next, an NGS library was prepared using the KAPA library preparation kit (Kapa Biosystems). Agencourt AMPure XP beads (Beckman-Coulter) were used to purify the extracted DNA. A 100-fold mole excess of ligation Illumina TruSeq adaptors was used for ligation at 16 °C for 16 h. Size selection of DNA fragments was performed in the 100-μL solution system, and then, the connected fragments were amplified with 500 μm Illumina backbone oligonucleotides for 4–9 rounds of PCR. After that, the DNA fragments were input. The library concentration was assessed by Qubit and Qpcr. The fragment length was determined using a 2100 Bioanalyzer with a DNA 1000 kit (Agilent). DNA fragments were mixed with HiFi Hot Start Ready Mix (1×), and 2 × 150 bp sequencing of multiple libraries was finally performed with Illumina HiSeq X10.

Data preprocessing

Paired reads were aligned to the hg19 reference genome using the BWA (V0.7.15-r1140)-mem command [29]. Then, they were sorted and indexed using SAMtools [30]. An in-house Python script was utilized to evaluate the various statistics collected, including mapping statistics, read quality, and panels capture efficiency. For each sample, the SAMtools pileup function was employed to generate variant candidates among the corresponding sites. We excluded the SNP sites and lower depth sites (≤ eu) among the candidates and removed the reads with low base (< Q30) and mapping qualities (< 40).

SKAT analysis

The file with the data of the centenarian variants was subjected to SKAT. Functional annotation of the genomic variation of each sample was performed, distinguishing between rare variants and variants affecting the protein function. Next, the influence of the variants in the gene was scored. We used the aforementioned three methods to test candidate genome variants to identify potential rare variants associated with the phenotype. Literature and databases were searched to find how the associated variants affected the biological processes, and speculation on disease mechanisms was carried out.

Pathway enrichment analysis

The KEGG pathway enrichment was conducted using the DAVID Functional Annotation Bioinformatics Microarray Analysis.

HLA typing

HLA typing was performed through the HLAscan algorithm [31]. HLAscan started with sequence reads in FASTQ format for mapping to IMGT/HLA data. For targeted sequencing data, sequence reads were used as direct input for HLAscan, whereas for WGS and WES data, we selected reads for HLA genes prior to running the HLAscan. In comparison with the targeted sequencing data, alignment of whole-genome/exome data directly to the IMGT/HLA database may lead to the omission of some HLA reads. Nonetheless, this algorithm was adopted because alignment of HLA reads to the IMGT/HLA database is advantageous in regard to both time and computational processing without loss of predictive accuracy. Initial alignment was performed using BWA-MEM (v0.7.10-r789) with default options. The alignment was the best fit for HLAscan in our investigation, which involved many allele sequences in IMGT/HLA and BWA-MEM. Sequence reads in the BAM file were sorted by reference coordinates using the FixMateInformation function, followed by removal of duplicate reads using MarkDuplicates in the Picard software package (version 1.68) (http://picard.sourceforge.net). Subsequently, identification of indels and re-alignment around these features were performed with the RealignerTargetCreator and IndelRealigner tools, respectively, and base-pair quality scores were recalibrated with BaseRecalibrator and PrintReads using the GATK software (version 3.3.0) [32]. Throughout these processes, sequence reads corresponding to the exonic regions of HLA genes were selected based on an initial alignment generated using GATK with a whole-genome reference (GRCh37.p13). This filtering step does not classify the sequence reads into specific HLA genes. Additional file 1: Supplementary Figure 1. PCA plot of the case and control group. PCA plot was made using the SNP genotype of centinarian group and the control group. Additional file 2: Supplementary Table 1. Sequencing quality metrics Additional file 3: Supplementary Table 2. Significantly correlated genes obtained through SKAT analysis.
  32 in total

Review 1.  Inflammaging and anti-inflammaging: a systemic perspective on aging and longevity emerged from studies in humans.

Authors:  Claudio Franceschi; Miriam Capri; Daniela Monti; Sergio Giunta; Fabiola Olivieri; Federica Sevini; Maria Panagiota Panourgia; Laura Invidia; Laura Celani; Maria Scurti; Elisa Cevenini; Gastone C Castellani; Stefano Salvioli
Journal:  Mech Ageing Dev       Date:  2006-11-20       Impact factor: 5.432

Review 2.  Exome and whole genome sequencing in aging and longevity.

Authors:  Erik B van den Akker; Joris Deelen; P Eline Slagboom; Marian Beekman
Journal:  Adv Exp Med Biol       Date:  2015       Impact factor: 2.622

3.  Association between the HLA-DR alleles and longevity: a study in Sardinian population.

Authors:  Domenico Lio; Giovanni Mario Pes; Ciriaco Carru; Florinda Listì; Viviana Ferlazzo; Giuseppina Candore; Giuseppina Colonna-Romano; Luigi Ferrucci; Luca Deiana; Giovannella Baggio; Claudio Franceschi; Calogero Caruso
Journal:  Exp Gerontol       Date:  2003-03       Impact factor: 4.032

4.  HLA-DR alleles display sex-dependent effects on survival and discriminate between individual and familial longevity.

Authors:  R Ivanova; N Hénon; V Lepage; D Charron; E Vicaut; F Schächter
Journal:  Hum Mol Genet       Date:  1998-02       Impact factor: 6.150

5.  The Sequence Alignment/Map format and SAMtools.

Authors:  Heng Li; Bob Handsaker; Alec Wysoker; Tim Fennell; Jue Ruan; Nils Homer; Gabor Marth; Goncalo Abecasis; Richard Durbin
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

6.  Genetic associations with human longevity at the APOE and ACE loci.

Authors:  F Schächter; L Faure-Delanef; F Guénot; H Rouger; P Froguel; L Lesueur-Ginot; D Cohen
Journal:  Nat Genet       Date:  1994-01       Impact factor: 38.330

7.  Innate immunity and inflammation in ageing: a key for understanding age-related diseases.

Authors:  Federico Licastro; Giuseppina Candore; Domenico Lio; Elisa Porcellini; Giuseppina Colonna-Romano; Claudio Franceschi; Calogero Caruso
Journal:  Immun Ageing       Date:  2005-05-18       Impact factor: 6.400

8.  Growth hormone secretion is diminished and tightly controlled in humans enriched for familial longevity.

Authors:  Evie van der Spoel; Steffy W Jansen; Abimbola A Akintola; Bart E Ballieux; Christa M Cobbaert; P Eline Slagboom; Gerard Jan Blauw; Rudi G J Westendorp; Hanno Pijl; Ferdinand Roelfsema; Diana van Heemst
Journal:  Aging Cell       Date:  2016-09-07       Impact factor: 9.304

9.  Fast and accurate short read alignment with Burrows-Wheeler transform.

Authors:  Heng Li; Richard Durbin
Journal:  Bioinformatics       Date:  2009-05-18       Impact factor: 6.937

Review 10.  Genetics, lifestyle and longevity: Lessons from centenarians.

Authors:  Diddahally Govindaraju; Gil Atzmon; Nir Barzilai
Journal:  Appl Transl Genom       Date:  2015-02-04
View more
  2 in total

1.  Data mining of human plasma proteins generates a multitude of highly predictive aging clocks that reflect different aspects of aging.

Authors:  Benoit Lehallier; Maxim N Shokhirev; Tony Wyss-Coray; Adiv A Johnson
Journal:  Aging Cell       Date:  2020-10-08       Impact factor: 9.304

2.  From mutation to mechanism: deciphering the molecular function of genetic variants linked to human ageing.

Authors:  Maarouf Baghdadi; Helena M Hinterding; Linda Partridge; Joris Deelen
Journal:  Brief Funct Genomics       Date:  2022-01-25       Impact factor: 4.241

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.