| Literature DB >> 35273064 |
Vincenzo Forgetta1, Rui Li2,3, Corinne Darmond-Zwaig2,3, Alexandre Belisle2,3, Cynthia Balion4, Delnaz Roshandel5, Christina Wolfson6, Guillaume Lettre7, Guillaume Pare4, Andrew D Paterson5,8, Lauren E Griffith9, Chris Verschoor9, Mark Lathrop2,3, Susan Kirkland10, Parminder Raina9, J Brent Richards1,6,11, Jiannis Ragoussis12,3,13.
Abstract
PURPOSE: The Canadian Longitudinal Study on Aging (CLSA) Comprehensive cohort was established to provide unique opportunities to study the genetic and environmental contributions to human disease as well as ageing process. The aim of this report was to describe the genomic data included in CLSA. PARTICIPANTS: A total of 26 622 individuals from the CLSA Comprehensive cohort of men and women aged 45-85 recruited between 2010 and 2015 underwent genome-wide genotyping of DNA samples collected from blood. Comprehensive quality control metrics were measured for genetic markers and samples, respectively. The genotypes were imputed to the TOPMed reference panel. Sex chromosome abnormalities were identified by copy number profiling. Classical human leukocyte antigen gene haplotypes were imputed at two-field (four-digit). FINDINGS TO DATE: Of the 26 622 genotyped participants, 24 655 (92.6%) were identified as having European ancestry. These genomic data were linked to physical, lifestyle, medical, economic, environmental and psychosocial factors collected longitudinally in CLSA. The combined analysis, including CLSA genomic data, uncovered over 100 novel loci associated with key parameters to define glaucoma. The CLSA genomic dataset validated the contribution of a polygenic risk score to screen individuals with high fracture risk. It is also a valuable resource to directly identify common genetic variations associated with conditions related to complex traits. Taking advantage of the comprehensive interview and physical information collected in CLSA, this genomic dataset has been linked to psychosocial factors to investigate both the independent and interactive effects on cardiovascular disease. FUTURE PLANS: The CLSA overall is ongoing. Follow-up data will continue to be collected from participants in the current genomic subcohort, including the DNA methylation and metabolomic data. Ongoing studies focus on elucidating the role of genetic factors in cognitive decline and cardiovascular diseases. This genomic data resource is available on request through the CLSA data access application process. © Author(s) (or their employer(s)) 2022. Re-use permitted under CC BY-NC. No commercial re-use. See rights and permissions. Published by BMJ.Entities:
Keywords: CLSA; HLA; aging; genome-wide genotyping
Mesh:
Year: 2022 PMID: 35273064 PMCID: PMC8915305 DOI: 10.1136/bmjopen-2021-059021
Source DB: PubMed Journal: BMJ Open ISSN: 2044-6055 Impact factor: 2.692
Count of Canadian Longitudinal Study on Aging genotyped participants by self-reported sex and sex chromosome composition
| Self-reported sex | Sex chromosome composition | Count |
| Male | Male | 13 324 |
| Female | Female | 13 250 |
| Female | Male | 17 |
| Male | Female | 16 |
| Female | Undefined | 10 |
| Male | Undefined | 5 |
Figure 1Pairwise plot of allele frequency of SNPs that pass all four tests from genotype batches 1–5. The SNPs are considered as passed if they have non-significant p value (Fisher’s p>3.5×10−10) below the multiple testing corrected threshold for the respective test on discordant genotype frequency between batch, departure from Hardy-Weinberg equilibrium, discordance between the positive control replicates and on discordant genotype frequency between male and female. SNP, single-nucleotide polymorphism
Count of kinship pairs per type of inferred relationship
| Inferred relationship | Count |
| Monozygotic twin | 1 |
| Full sibling | 357 |
| Parent/offspring | 176 |
| Second degree | 315 |
| Third degree | 1066 |
| Unrelated | 123 294 |
Figure 2Determining the CLSA European ancestry subset. (A) Top four principal components from all 1000 Genomes populations labelled and coloured (population code refers to https://wwwinternationalgenomeorg/category/population/). (B) Top four principal components from CLSA colour coded and labelled by cluster number. CLSA, Canadian Longitudinal Study on Aging.
Count of Canadian Longitudinal Study on Aging genotyped participants per self-reported ancestry and k-means cluster
| Self-reported ancestry* | k-means cluster | |||||
| 1 | 2 | 3 | 4 | 5 | 6 | |
| Black | 7 | 0 | 156 | 0 | 7 | 0 |
| East Asian | 0 | 214 | 1 | 2 | 0 | 3 |
| Latin American | 1 | 0 | 1 | 2 | 9 | 72 |
| Mixed | 11 | 11 | 7 | 207 | 61 | 21 |
| Other | 11 | 5 | 8 | 54 | 53 | 41 |
| South Asian | 211 | 5 | 0 | 0 | 7 | 0 |
| Southeast Asian | 20 | 61 | 0 | 0 | 1 | 1 |
| West Asian | 4 | 0 | 1 | 2 | 98 | 0 |
| White | 7 | 2 | 0 | 24 380 | 742 | 41 |
| White and Asian | 3 | 3 | 0 | 5 | 19 | 11 |
| White and black | 2 | 0 | 11 | 3 | 17 | 0 |
*The details of grouping self-reported cultural and racial category into fewer groups are in online supplemental table S2.
Figure 3BAF (top) and log2 ratio (bottom) of chromosomes X and Y are shown for sex chromosome abnormalities. (A) In 46, XY, the BAF is either 0 or 1, and the expected log2 ratio is less than 0 on chromosome X. However, in the PAR and the chrY11.2/chrXq21.3 homology block, there are heterozygous calls in male shown as BAF of 0.5. The red line shows the locally weighted scatterplot smoothing curve for log2 ratio. The BAF is either 0 or 1, and the expected log2 ratio is 0 on chromosome Y. (B) In 46, XX, the BAF is either 0 (AA), ½ (AB) or 1 (BB), and the expected log2 ratio is 0 on chromosome X as in a normal diploid cell. The BAF is between 0 and 1, and the log2 ratio is less than 0 on chromosome Y. (C) For Klinefelter syndrome (47, XXY), the log2 ratio is around 0 on chromosome X, which indicates ploidy as 2N. Compared with 46, XY, there are relatively lower peaks of log2 ratio at PAR and chrX21.3/chrY11.2 homology block region. Moreover, BAF of heterozygous calls at PAR and chrX21.3/chrY11.2 homology block region shifted from 0.5 to intermediate values. They both indicated an extra copy of chromosome X. Chromosome Y intensity profile showed a clear male pattern. (D) For Turner syndrome (45, X), on chromosome X, the log2 ratio is below 0, and there is no BAF bands of 0.5, which indicates one copy loss. Chromosome Y intensity profile showed a clear female pattern. (E) For 45, X/46, XX mosaicism, on chromosome X, there is a relatively smaller decrease of log2 ratio compared with one copy of chromosome X as in male. The BAF of heterozygous calls on chromosome X is split to intermediate values. They both indicate that the sample is mosaic for deletion of chromosome X. Chromosome Y intensity profile showed a clear female pattern. (F) For 45, X/46, XY mosaicism, the log2 ratio is less than 0, and no BAF 0.5 band on chromosome X indicates one copy. The log2 ratio shifts to below 0 and BAF values between 0 and 1 on chromosome Y indicate chromosome loss. However, the intermediate BAF values close to 0 or 1 at PAR and chrX21.3/chrY11.2 homology block region indicate the loss of chromosome Y existed in a larger proportion of cells. BAF, B allele frequency; PAR, pseudoautosomal region.