| Literature DB >> 33137877 |
Katherine Parker1, A Mesut Erzurumluoglu2, Santiago Rodriguez1,3.
Abstract
The Human Y chromosome (ChrY) has been demonstrated to be a powerful tool for phylogenetics, population genetics, genetic genealogy and forensics. However, the importance of ChrY genetic variation in relation to human complex traits is less clear. In this review, we summarise existing evidence about the inherent complexities of ChrY variation and their use in association studies of human complex traits. We present and discuss the specific particularities of ChrY genetic variation, including Y chromosomal haplogroups, that need to be considered in the design and interpretation of genetic epidemiological studies involving ChrY.Entities:
Keywords: Y chromosome; Y haplogroups; complex human traits; complex locus; genetic association analyses; genetic epidemiology
Mesh:
Year: 2020 PMID: 33137877 PMCID: PMC7693691 DOI: 10.3390/genes11111273
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1GWASs can be used to look for genetic associations with human disease traits across the human genome. This figure is a modified version of the GWAS diagram from the September 2019 version of the GWAS Catalog, a directory which contains more than 157,000 associations from 4220 publications [2]. The diagram shows hits identified on chromosomes 22 and X, with different categories of disease indicated by different coloured markers. There are no hits which correspond to ChrY. Legend has not been included for clarity, but the original diagram and legend can be found at https://www.ebi.ac.uk/gwas/diagram.
Chronological order of the discovery of the particularities of the Y chromosome from the discovery of the male-determining region to exploration of gene function.
| Year | Description | References |
|---|---|---|
| 1985 | ChrY utilised for evolutionary studies ‘genetic distance’ genealogy | [ |
| 1986 | First evidence that the male-determining region is located on the short arm of the Y chromosome | [ |
| 1989 | ChrY polymorphisms utilised in phylogenetics | [ |
| 1990 | First gene, sex-determining region Y (SRY), mapped onto ChrY | [ |
| 1990 | Animal models used to investigate the influence of ChrY on hypertension | [ |
| 1991 | Sry used in animal models to form transgenic mice | [ |
| 1997 | ChrY utilised for forensic science and paternity testing | [ |
| 2000 | ChrY utilised for phylogenetics | [ |
| 2000 onwards | Association between hind III restriction fragment polymorphism and cardiovascular disease detected | [ |
| 2002 | Y-chromosomal haplogroups established | [ |
| 2003 | MSY first sequenced | [ |
| 2005 onwards | Unconvincing evidence relating the MSY to cardiovascular disease risk is reported | [ |
| 2009 onwards | Y haplotypes utilised to investigate association between ChrY and other complex traits | [ |
| 2016 | Exploration of the function of genes identified on within MSY | [ |
Figure 2Schematic representation of the Y and X chromosomes. Pseudoautosomal regions 1 and 2, the MSY and the location of SRY are indicated.
Figure 3Haplogroups are groups of haplotypes that share a common ancestor. Individuals belonging to a specific haplogroup share the same derived allele for one informative SNP. A derived allele is a new variant in a locus, different from the original non-mutated allele (which is known as the ancestral allele). This figure shows three haplotypes. All of them share the derived allele for SNP1. Variation in other SNPs (in red) creates different haplotypes within this haplogroup.
Basis of different study designs used to investigate the association between ChrY and complex traits, with strengths and limitations for consideration when analysing findings.
| Study Design | Basis | Strengths | Limitations | Reference |
|---|---|---|---|---|
| Linkage analysis | First-degree relatives are compared in order to ascertain the potential for a genetic component of disease susceptibility | 1- Successful in the identification of highly penetrant genetic variants related to Mendelian traits or monogenic disorders | 1- Limited application for complex traits due to the use of individuals that share similar genetic and environmental constituents, establishing true effects in multifactorial traits is limited | [ |
| Candidate gene studies | A particular gene is studied based on biological plausibility. Variation at this gene is investigated in genetic association studies | 1- Highly specific for genetic variation which focuses on the MSY | 1- Locus selected in absence of understanding of its function and potential effects | [ |
| Animal models | Animal studies are used as a framework for looking at human disease | 1- Allows careful control and manipulation of both genetic and external environment to isolate the effects of the MSY | 1- Application of animal models to human disease makes two key assumptions that may be incorrect: -That disease process studied is the same in humans as in other animals -That the MSY in other animals and that in humans is equivalent | [ |
| GWAS | Case-control study design used to look for common genetic variants more frequently identified in those with particular diseases | 1- Allows genotype-first analysis of the MSY for which understanding of genetic content is limited | 1- Sex chromosomes are routinely excluded from these study types as the entirety of the MSY is effectively in linkage disequilibrium | [ |
Considerations and possible solutions for genetic association studies carried out using SNPs in the MSY.
| GWAS Features | Y Chromosome vs. Autosomes | Implication | Possible Solution(s) | References |
|---|---|---|---|---|
|
| Only men inherit a Y chromosome | ~50% reduction in sample size | 1- Include the Y chromosome in GWASs—could be mandated by funders | [ |
|
| All the common variants in the MSY are in LD | Identifying the causal variant is very difficult | 1- Larger sample sizes | [ |
|
| Principal components calculated using autosomal SNPs are not applicable | Potential overadjustment and loss of statistical power | 1- Sensitivity analyses with and without genetic principal components | [ |
|
| Many highly variable regions and repetitive sequences | Variant calling may not be accurate | Only include variants called with high confidence | [ |
|
| No eQTLs identified for the Y chromosome | GWAS-eQTL colocalisation analysis cannot be carried out at present | Initiate trans-ethnic eeGWASs and/or study rare variants on the Y chromosome | [ |
|
| No conclusive GWAS signal identified | Not much information to link potential findings with other biological pathways | 1- Initiate a consortium to identify associations on the MSY | [ |
|
| Very few examples identified in autosomes. None with SNPs on the Y chromosome | Almost no statistical power to detect small effects | Hypothesis driven approaches (e.g., between SNPs associated with obesity/CVD and SNPs in/near | [ |
|
| Many MSY genes/proteins are not expressed at detectable levels in non-gonadal tissues | Identified associations are likely to be biologically implausible if not expressed in disease-relevant tissue | Query the Human Protein Atlas to check whether the putatively causal gene/protein is active in a relevant tissue ( | [ |
Figure 4Protein-coding Y-chromosomal genes expressed in non-gonadal tissues (not comprehensive) as detected by the Human Protein Atlas. Protein labelled 1: DDX3Y; 2: EIF1AY; 3: NLGN4Y; 4: RPS4Y1; 5: UTY; 6: ZFY. Proteins highlighted in red have a low quality of supporting evidence. Image source: commons.wikimedia.org/wiki/Human_body_diagrams (Wikimedia CC0 licence).