Literature DB >> 27064253

Mosaic loss of chromosome Y is associated with common variation near TCL1A.

Weiyin Zhou1,2, Mitchell J Machiela1, Neal D Freedman1, Nathaniel Rothman1, Nuria Malats3, Casey Dagnall1,2, Neil Caporaso1, Lauren T Teras4, Mia M Gaudet4, Susan M Gapstur4, Victoria L Stevens4, Kevin B Jacobs2,5, Joshua Sampson1, Demetrius Albanes1, Stephanie Weinstein1, Jarmo Virtamo6, Sonja Berndt1, Robert N Hoover1, Amanda Black1, Debra Silverman1, Jonine Figueroa1, Montserrat Garcia-Closas1,7, Francisco X Real3,8, Julie Earl3, Gaelle Marenne3, Benjamin Rodriguez-Santiago8,9,10, Margaret Karagas11, Alison Johnson12, Molly Schwenn13, Xifeng Wu14, Jian Gu14, Yuanqing Ye14, Amy Hutchinson1,2, Margaret Tucker1, Luis A Perez-Jurado8,9,15, Michael Dean1,16, Meredith Yeager1,2, Stephen J Chanock1.   

Abstract

Mosaic loss of chromosome Y (mLOY) leading to gonosomal XY/XO commonly occurs during aging, particularly in smokers. We investigated whether mLOY was associated with non-hematological cancer in three prospective cohorts (8,679 cancer cases and 5,110 cancer-free controls) and genetic susceptibility to mLOY. Overall, mLOY was observed in 7% of men, and its prevalence increased with age (per-year odds ratio (OR) = 1.13, 95% confidence interval (CI) = 1.12-1.15; P < 2 × 10(-16)), reaching 18.7% among men over 80 years old. mLOY was associated with current smoking (OR = 2.35, 95% CI = 1.82-3.03; P = 5.55 × 10(-11)), but the association weakened with years after cessation. mLOY was not consistently associated with overall or specific cancer risk (for example, bladder, lung or prostate cancer) nor with cancer survival after diagnosis (multivariate-adjusted hazard ratio = 0.87, 95% CI = 0.73-1.04; P = 0.12). In a genome-wide association study, we observed the first example of a common susceptibility locus for genetic mosaicism, specifically mLOY, which maps to TCL1A at 14q32.13, marked by rs2887399 (OR = 1.55, 95% CI = 1.36-1.78; P = 1.37 × 10(-10)).

Entities:  

Mesh:

Substances:

Year:  2016        PMID: 27064253      PMCID: PMC4848121          DOI: 10.1038/ng.3545

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


Mosaic loss of the Y chromosome (mLOY) refers to the loss of the Y chromosome in a subset of cells, while the remainder of cells retains the normal chromosome. For more than four decades, it has been noted that a fraction of healthy men lose all or some portion of the Y chromosome over the course of their lifetime[1]. Moreover, several studies have reported mLOY in males of advanced age suggesting mLOY is associated with aging and increasing hypodiploidy[1-4]. Other studies suggested mLOY is associated with specific hematologic disorders including acute myelogenous leukemia (AML), myelodysplastic syndrome (MDS), and preleukemia[5-9]. Single nucleotide polymorphism (SNP) genotyping arrays have become an important tool for discovering common variants that contribute to human diseases[10]. Two widely-used applications of this technology are genome-wide association studies (GWAS) and copy number variant (CNV) analyses for large-scale mosaic autosomal aberrations[11-18]. SNP microarray data has also been used to investigate mosaicism on the sex chromosomes[19,20]. Other studies have evaluated next-generation sequencing data to detect mosaic mutations at the base pair level[21-23]. While each of these studies of mosaic CNVs, mosaic uniparental disomies, mosaic single nucleotide variants (SNVs) and mLOY ascertain different aspects of the biological process of clonal expansion, taken together these studies suggest that mosaic events, both large and small, increase with age. This trend could reflect either a deterioration in the capacity to maintain a stable genome or, alternatively, a decline in stem-cell diversity[22-25]. We investigated the association between mLOY and age at DNA collection, smoking status, DNA source (derived from blood or buccal material), inferred ancestry, genetic susceptibility to mLOY, non-hematologic cancer risk and cancer-specific survival in subjects from three prospective cohorts: the Alpha-Tocopherol, Beta-Carotene Cancer Prevention Study (ATBC), Prostate, Lung, Colorectal, Ovarian Cancer Screening Trial (PLCO), and Cancer Prevention Study-II (CPS-II) (Supplementary Table 1). SNP microarray data generated using Illumina Infinium arrays for genome-wide association studies (GWAS) that contained sufficient coverage of chromosome Y (Human Hap 610K, Hap1M, Omni1M, and Omni2.5M) were used to detect mLOY in DNA isolated from blood or buccal cells from 8,679 males with non-hematologic cancer and 5,110 cancer-free adult male controls. The hybridization data from all subjects were examined for deviations from expected log2 intensity ratio (see Online Methods) for evidence of loss or gain of the male specific region of chromosome Y (MSY) (chrY:6,671,498-22,919,969; hg18/build36) (Supplementary Figure 1A). We observed 970 men (7.03%) with detectable mLOY and the estimated fraction of cells was 22.7% to 73.4% (Supplementary Figure 1C and Online Methods), which differs from autosomes (7–95% mosaicism[14,15]). No significant difference in mLOY frequency was observed by DNA source, either blood or buccal origin (P=0.33), or by genotyping array (P=0.14, Supplementary Table 2). The accuracy of our mLOY detection method was validated by quantitative PCR (qPCR) markers for 15 genes distributed across MSY (Supplementary Figure 1B)[26]. We selected 124 subjects from the cohorts with probable mLOY for validation by qPCR assays. The concordance rate between the SNP microarray and qPCR was 87.9% (Supplementary Table 3), with no differences observed by array type. Among the 13,789 males scanned, we found evidence of chromosome Y gain for 133 males (0.96%). In the males with suspected Y gain, we had DNA available from 69 individuals for qPCR validation and the concordance rate was only 49% for mosaic gains (Supplementary Table 3), with validation corresponding to substantial LRR deviations from baseline. After removal of 34 men with validated Y gain, we excluded 26 participants with similarly large positive deviations from baseline after manual review. The resulting dataset included 8,632 cases and 5,097 controls for Y loss analysis. Of the 8,632 cancer cases, 5,545 had their blood or buccal cells collected at least one year prior to cancer diagnosis. The majority of these participants (n=5,369) had been diagnosed with bladder, lung or prostate cancer. The most significant association for mLOY was with increasing age; in 13,729 men, the frequency of detectable mLOY increased from 1.18% under 60 to 18.71% over 80 years of age. The proportion of individuals with mLOY increased for every age stratum until 80 years of age (P<2.2×10−16); for older participants, the estimates became less stable due to small numbers (Figure 1A and 1B, Supplementary Table 4. Adjustment for smoking status (current smoking and for former smokers, years since cessation), ancestry, source of DNA and contributing studies, we observed evidence for an association between mLOY and age (OR per year of age=1.13, 95%CI=1.12–1.15; P<2.00×10−16) (Table 1, Supplementary Table 5). This association with age has a substantially greater magnitude than that observed previously for large-scale structural autosomal mosaic events (OR=1.05, 95%CI=1.04–1.07)[15] and for mosaic SNVs (OR=1.08, 95%CI=1.07–1.09)[22].
Figure 1

Fraction of men with mLOY across 5-year age groups for all subjects (n = 13,729)

Fraction of men with Y loss is calculated as men in the age group with Y loss divided by the total number of men in that age group. Error bars represent 95% Jeffery’s confidence intervals around the proportion estimate. (A) Mosaic chromosome Y loss is associated with older age at DNA collection, with frequencies of 1.18% in individuals less than 60 years and 18.71% in those 80 years or older (OR per year of age = 1.13, 95% CI = 1.12–1.15; P < 2×10−16). Chi-squared test for trend among 6 age groups shows there is significant evidence that the fraction of men with Y loss increases with age (P < 2.2×10−16). (B) Scatterplot for age versus fraction of men with Y loss. There is an overall increasing trend for the fraction of men with Y loss until age 80. After age 80, the trend became unstable reflecting the limited number of subjects in this age group in our study (Supplementary Table 4). All statistical tests are two-sided.

Table 1

Predictors of mLOY

Age refers to the age at DNA collection and the odds ratio estimate is for a one-year increase in age. For the association between age and mLOY, the analysis was adjusted for smoking status (current smoking and for former smokers, the number of years since cessation), estimated admixture proportion, DNA source, and contributing study. For the association between mLOY and the number of years since cessation and number of cigarettes smoked per day, analyses were adjusted for age, estimated admixture proportion, DNA source, and contributing study.

# of Y loss# of NormalTotalProportion with mLOYOR95% CIp-value
Age (ATBC+PLCO+CPSII)97012759137297.07%1.131.12–1.15<2.00×10−16
Smoking use (PLCO+CPSII)
Never smoking209319934086.13%reference
Former smoking446496454108.24%1.33(1.12–1.57)0.001
Current smoking10093410349.67%2.35(1.82–3.03)5.55×10−11
Years since quitting (PLCO+CPSII)
1–4 years quit3833237010.27%2.15(1.49–3.10)3.83×10−5
5–10 since quit6456763110.14%1.92(1.43–2.58)1.26×10−5
11–20 since quit107116712748.40%1.49(1.17–1.90)0.001
> 20 since quit231280930407.60%1.10(0.91–1.34)0.333
Cigarettes per day in current smokers (ATBC+PLCO+CPSII)
1 to 10617277887.74%reference
11 to 20154220123556.54%1.04(0.76–1.43)0.800
21 to 3066120612725.19%0.92(0.63–1.33)0.648
>30264294555.71%0.95(0.58–1.55)0.835
The frequency of individuals exhibiting age-related mLOY was nearly 10-fold greater than that observed in the autosomes (0.7–2.0%)[14-16]. A detectable mosaic autosomal abnormality (>2 MB) was observed in 130 (0.95%) men[16]. This is substantially lower than what was observed for mLOY in the same individuals (7.07%), indicating that mLOY is the most frequent large-scale chromosomal somatic event. Of 970 mLOY males, 18 (1.86%) also had evidence of detectable autosomal mosaicism (Supplementary Table 6 and 7), suggesting men with mLOY are more likely to harbor large autosomal mosaic events than men without mLOY (OR=2.13, 95% CI=1.22–3.55; P=0.005). We investigated the association with smoking status in 9,859 subjects (1,034 current, 5,410 former, 3,408 never smokers, 7 unknown) from the PLCO and CPSII studies, excluding participants from ATBC, which recruited only current smokers. The frequency of mLOY was higher in ever smokers (65.4%) than never smokers (34.6%), with the highest frequency among current smokers. Relative to never-smokers who were less than 65 years old, current smokers over 75 years old had an increase of mLOY (OR=13.9, 95%CI=6.60–29.26; P=4.13 × 10−12) (Figure 2A, Supplementary Table 8). Adjusting for age, ancestry, DNA source and study, we observed an association of mLOY with current smoking (OR=2.35, 95%CI=1.82–3.03; P=5.55 × 10−11) and former smoking (OR=1.33, 95%CI=1.12–1.57; P=0.001) (Table 1, Supplementary Table 9). This result is consistent with a prior report that men who smoke are at greater risk of mLOY, which observed odds ratios for current versus non-smoking that ranged from 2.4 (95%CI=1.6–3.6) to 4.3 (95%CI=2.8–6.7)[19] among three included studies. Mosaic SNVs have also been associated with smoking with comparable odds ratio estimates (OR=2.2)[23]. It is notable that prior studies have not reported an association between smoking and large-scale (>2 Mb) autosomal mosaicism[14,15].
Figure 2

mLOY and Smoking Analysis

(A) Proportion of males with mLOY across strata of 5-year age group and smoking status for subjects from PLCO and CPSII studies (n = 9,859). Non-smokers are in grey, former smokers are in blue, and current smokers are in red. Error bars represent 95% Jeffery’s confidence intervals around the proportion estimate. Current smoking men in the 75+ age group are at a 13.90 times increased odds of having mosaic Y loss as compared to non-smoking men less than 65 years old (95% CI =6.60–29.26, P = 4.13 × 10−12). (B) Association between current smoking and years since cessation and mosaic Y loss from adjusted logistic regression models (n=8,825). The dotted line reflects an odds ratio of 1.0 for the referent never smokers. All statistical tests are two-sided.

Due to differences in mLOY between current and former smokers, the risk of mLOY progressively declined with years after cessation: quitting smoking within 1 to 4 years (OR=2.15, 95%CI=1.49–3.10; P=3.83 × 10−5), within 5 to 10 years (OR=1.92, 95%CI=1.43–2.58; P=1.26 × 10−5) and within 11–20 years (OR=1.49, 95%CI=1.17–1.90; P=0.001), relative to never-smoking. By 20 years after cessation, there was no evidence for association (OR=1.10, 95% CI= 0.91–1.34, P=0.33) (Figure 2B, Table 1, Supplementary Table 10). Our findings suggest that smoking has a long lasting impact on mLOY, perhaps more than a decade after quitting, but the association wanes with long-term cessation. Among 4,904 current smokers, we observed no association for smoking intensity, measured as cigarettes per day (Table 1, Supplementary Table 11). It has been proposed that autosomal mosaicism may be associated with risk for certain solid tumors but not definitely established[14-16]. Moreover, others have suggested that mLOY is associated with cancer risk overall[20]. We investigated the frequency of mLOY in blood or buccal DNA and solid tumor risk in 5,545 cancer subjects from whom DNA was collected at least 1 year prior to cancer diagnosis and in 5,097 cancer-free individuals (Supplementary Table 12). Overall, mLOY was slightly more common in men who went on to develop cancer (6.67%) than in cancer-free controls (5.49%) (unadjusted OR=1.23, 95%CI=1.04–1.45; P=0.012; multivariate adjusted OR=1.19, 95%CI=1.00–1.42; P=0.047) (Table 2, Supplementary Figure 2, Supplementary Table 13). Furthermore, we investigated a possible association between cancer risk and mLOY for cancer types with sufficient sample size including bladder, lung, and prostate cancer in adjusted analyses (using continuous age; smoking status-current smoking and for former smokers, years since quitting; pack years; ancestry; source of DNA and study). In the cohort studies with DNA collected one year or more before diagnosis, we observed a possible association between mLOY and the risk of bladder cancer (OR=1.47, 95%CI=1.09–1.99; P=0.011) and risk of prostate cancer (OR=1.35, 95%CI=1.04–1.74; P=0.024), but no evidence for a relationship with lung cancer (OR=0.90, 95%CI=0.69–1.18, P=0.45) (Table 2). This latter point is striking given that lung cancer is more strongly associated with smoking than either bladder or prostate cancer. We examined cases diagnosed at or before biospecimen sampling. For bladder and prostate but not lung cancer, when examining DNA obtained contemporaneously with cancer diagnosis, we observed somewhat higher risk estimates that could reflect effects of treatment modalities (chemotherapy, surgery and/or radiation therapy). We additionally examined the possible association between mLOY and bladder cancer risk in three case-control studies (total of 2,062 cases and 2,064 controls), but found no association (OR=1.17, 95%CI=0.93–1.48; P=0.18; Supplementary Table 14). Together, these results provide limited support for the hypothesis that mLOY is a strong risk factor for common solid tumors and larger studies will be needed to investigate further based on current estimated effect sizes.
Table 2

Association between mLOY and incident cancer overall and stratified by date of DNA collection

The analysis used mLOY as the predictor variable, cancer type as response variable, and was adjusted for continuous age, smoking status (current smoking and for former smokers, the number of years since cessation), pack years, estimated admixture proportion, DNA source, and contributing study. A total of 5,097 cancer-free controls were used as a reference. Other cancers in addition to bladder, lung, and prostate cancer were included when performing the combined cancer analysis.

DNA 1 year before cancer diagnosis
DNA at and after cancer diagnosis
All
NOR95% CIP ValueNOR95% CIP ValueNOR95% CIP Value
Bladder7311.47(1.09–1.99)0.0115582.01(1.47–2.75)1.40×10−512891.69(1.33–2.13)1.26×10−5
Lung19080.9(0.69–1.18)0.4503810.81(0.48–1.38)0.4422890.90(0.70–1.17)0.44
Prostate27301.35(1.04–1.74)0.02420931.53(1.17–2.01)1.84×10−348231.43(1.19–1.73)1.40×10−4

Combined Cancer55451.19(1.00–1.42)0.04730871.52(1.23–1.87)8.40×10−586321.31(1.13–1.53)4.79×10−4
A recent report suggested that mLOY was associated with all-cause and cancer mortality among a cohort of 982 participants who were cancer-free at study baseline[20]. As cancer mortality reflects both developing cancer and dying from it, we examined whether mLOY may be associated with subsequent overall and cancer-specific mortality in our cancer cases, restricting our analysis to cases with DNA collected at least one year prior to diagnosis and available follow-up (N=5,340). We observed little evidence for an association with either endpoint, whether in Kaplan-Meier survival curves (Figure 3A–B) or in unadjusted or multivariate-adjusted Cox proportional-hazard models (Supplementary Table 15). After adjusting for age at diagnosis, smoking status (current smokers and for former smokers, the number of years since quitting), pack years, body mass index, and contributing study; the HR for mortality from all causes was 0.89 (95%CI=0.76–1.04; P=0.15) and the HR for mortality from cancer was 0.87 (95%CI=0.73–1.04; P=0.12). Similar findings were observed for bladder, lung, and prostate cancer separately (Figure 3C–E, Supplementary Table 15).
Figure 3

Association between mLOY and overall and cancer survival among participants with DNA collected at least one year prior to cancer diagnosis

Kaplan-Meier survival curves are for (A) all-cause mortality for all cancer cases (n=5,340), (B) overall cancer survival (died of cancer) for all cancer cases (n=5,340), (C) cancer survival of bladder cancer cases (n=721), (D) cancer survival of lung cancer cases (n=1744), and (E) cancer survival of prostate cancer cases (n=2711). Subjects without Y loss are in blue and with Y loss are in red. All statistical tests are two-sided.

We conducted a genome-wide association study (GWAS) to identify regions associated with risk for mLOY in the three cohorts, analyzed separately and in a meta-analysis. We adjusted for smoking status (ever versus never smoker) and principal components significantly associated with mLOY in each cohort (reported previously to account for subtle differences in population substructure in participants of European background). The analysis included 895 men with detected mLOY and 11,474 men with no detected mLOY. The p-value distribution from the combined meta-analysis had a λGC of 1.015, as depicted in the quantile-quantile plot (Supplementary Figure 3). A significant association was observed with SNP rs2887399 at 14q32.13 (OR=1.55, 95%CI=1.36–1.78; p=1.37×10−10) (Figure 4, Supplementary Figure 4). The major risk allele (G) has a frequency of 0.77 in CEU. The effect estimates are consistent across the three studies and there was no evidence for heterogeneity (P=0.86, Supplementary Figure 5). The relationship remained robust when not adjusting for smoking status (OR=1.57, 95%CI=1.36–1.80; P=6.46×10−11). The rs2887399 variant maps to the 5′ end of the T-cell leukemia/lymphoma 1A gene (TCL1A), which functions as a coactivator of the cell survival kinase AKT and has been implicated in T-cell and B-cell hematological malignancies[27], mainly because recurrent chromosomal rearrangements bring TCL1A in close proximity to the T-cell antigen receptors gene[28]. The rs2887399 variant is the first common variant associated with a clonal-expansion phenotype beyond the known association of a JAK2 haplotype with mosaicism for the common JAK2 V617F mutation[29-32]. This genetic finding linking germline variation to somatic mosaicism could lead to understanding how clonal hematopoiesis relates many chronic diseases; it is plausible that the susceptibility haplotype could contribute to actual loss of the Y chromosome or it could be permissive for clonal expansion. Further work is needed to fine map the region and investigate its biological underpinnings on development of mLOY.
Figure 4

Regional plot for chromosome 14 mLOY meta-analysis association p-values

The GWAS for mLOY included 895 men with detected mLOY and 11,474 men with no detected mLOY. The top panel is a Manhattan plot showing −log10 P-values on the Y axis and chromosomal position on the X axis. A 500 Kb region around the top SNP (rs2887399, blue) is highlighted (orange) and zoomed in using LDlink[33] to investigate LD (middle panel) and nearby genes (bottom panel). Numbers encapsulated within points in the middle plot represent RegulomeDB values. All statistical tests are two-sided.

In summary, mLOY is the most common large-scale detectable mosaic chromosomal event in males and it has a striking association with aging and cigarette smoking, which is attenuated by years after cessation. We observed limited evidence for mLOY as a strong risk factor for three common cancers in men, and did not observe an association with survival after cancer diagnosis. Contrary to prior evidence suggesting that men with mLOY were at substantially higher likelihood of dying from cancer, our study provides little evidence for this hypothesis. Together, these data suggest that age and smoking have a substantial effect on the development of mLOY, but that there is insufficient evidence to conclude mLOY is a major risk factor for non-hematologic cancer in men. Lastly, our GWAS of mLOY identified a locus on 14q32.13, which may provide insight into the biological basis of mLOY in relation to smoking and aging in men.

ONLINE METHODS

Study overview

The analyzed data consists of 13,789 males drawn from cancer genome-wide association studies (GWAS) conducted within three prospective cohorts. The mean age at DNA collection was 67.26 years for all participants. The studies were approved by the institutional ethics committees of each participating hospital and the Institutional Review Board (IRB) of the United States National Cancer Institute (NCI). Written informed consent was obtained from all individuals. DNA was extracted from peripheral circulating leukocytes (70.38%) and buccal samples (29.62%) for men drawn from the three cohorts (Supplementary Table 1). Genomic DNA was screened and analyzed at the NCI according to the standard sample handling process of the Cancer Genomics Research Laboratory (CGR), Division of Cancer Epidemiology and Genetics (DCEG). AmpFlSTR Identifiler assays confirmed each sample had gender concordance and removed samples with evidence for contamination. Genotyping was carried out on one of four Illumina Infinium SNP arrays (e.g., Hap610K, HapIM, Omni1M, and Omin2.5M), each of which has an adequate number of Y specific probes. Additional scanning on other microarray chips was not used because of the inadequate Y probe coverage. Of all participants, 94.99% were detected to have >=80% of European ancestry and 2.39% were detected to have >=80% of African ancestry. Case-control studies for bladder cancer were also included and drawn from two studies carried out in Spain and New England Bladder Cancer Study previously scanned with the Hap1M and Hap610K chip[34], respectively. The MD Anderson bladder cancer study was initially scanned on a chip with inadequate Y chromosome probe coverage[35], and so we detected mLOY in this study using the qPCR assay described below (Supplementary Table 14).

Intensity Analysis: Log2 Intensity Ratio (LRR) and B-Allele Frequency (BAF) generation

Sample intensity files (two files per sample, for red and green channels) were loaded into the Illumina GenomeStudio software. The intensity data were normalized using the Illumina five-step self-normalization procedures, which used information contained in the array itself to convert raw X and Y (allele A and allele B) signal intensities to normalized values. The LRR and BAF values for each assay were exported from GenomeStudio software using the “Genotype Final Report” (GFR) format.

Test region for detecting Y chromosome abnormality

We extensively examined loci across the Y chromosome for the four commercial chip types with adequate Y chromosome coverage and used the male specific region of chromosome Y between 6,671,498-22,919,969 (hg18/build36) as the test region for detecting Y chromosome mosaicism, since this region provides relatively stable signal intensity and was outside of majority of the regions containing genes with multiple copies[26] (Supplementary Figure 1A).

Y Chromosome mosaicism detection method

The Y chromosome mosaicism was detected using log R ratio (LRR), which is the normalized measure of total signal intensity and provides data on relative copy number. Subjects were examined for deviations from expected log2 intensity ratio for evidence of loss of the male specific region of chromosome Y (MSY). A minimum mean threshold of LRR <= −0.15 was used for identifying a Y chromosome loss event. A minimum mean threshold of LRR >= 0.15 was used to define a Y chromosome gain event. Samples with mean LRR values falling below or above these thresholds were called as mosaic Y losses and gains, respectively. To minimize the false discovery of Y chromosome mosaicism, the ratio of the mean LRR to the standard deviation for the test region was calculated. A minimum threshold for the ratio was set to 0.25 to filter out samples with excessive noise in LRR values. For potential mLOY, each chromosome Y plot was then manually reviewed and suspect events were further excluded from subsequent analyses.

qPCR Validation

Quantitative PCR (qPCR) was used to evaluate the ratio of Y chromosome signal to an autosomal single copy gene signal. 15 qPCR gene assays spanning the p and q arms of the Y chromosome (Supplementary Figure 1B), were run in duplex with RNase P as the reference gene, known to be single copy[26] (Supplementary Table 16). 5 ng of sample DNA, according to Quant-iT PicoGreen dsDNA quantitation (Life Technologies, Grand Island, NY), was transferred to LightCycler-compatible 384-well plates (Roche, Indianapolis, IN) and dried down. An internal standard curve (serial dilution, to 7 target ratios, of pooled male gDNA samples, with no detectable Y chromosome loss, with a pool of female gDNA samples) and assay control samples (3 target ratios, prepared similarly to the standard curve) were applied to the assay plates to guide analysis and indicate overall quality of assay performance. All experimental and control samples were assayed in triplicate on each plate. PCR was performed using 5 uL reaction volumes consisting of: 2.5 uL of LightCycler 480 Probes Master Mix (Roche, Indianapolis, IN), 2.0 uL of MBG Water, 0.25 uL of 20X TaqMan® Copy Number Reference Assay, human, RNase P (Life Technologies, Grand Island, NY), and 0.25 uL of specific 20X TaqMan® Copy Number Assay (Life Technologies, Grand Island, NY). Thermal cycling was performed on a LightCycler 480 (Roche) where PCR conditions consisted of: 95°C hold for 5 min, denature at 95°C for 15 sec, anneal at 60°C for 30 sec, with fluorescence data collection, 45 cycles. The LightCycler software (Release 1.5.0) was used for initial analysis of raw data. Utilizing absolute quantification analysis with the second derivative maximum method and high confidence detection algorithm, single target sequences were quantified and expressed as a ratio (Target/Reference) based on the internal standard curve of known ratios. The ratios of the 15 assays were averaged to yield an overall Y chromosome signal ratio.

Estimation of the proportion of cells with Y loss

A quadratic regression model was used to fit the average qPCR ratio and mean LRR data pairs with mean LRR as predictor variable (X) and the average qPCR ratio as the response variable (Y) to create a predictive polynomial equation: Y = ao + a1×X + a2×X2. Only the data points from subjects having consensus event calls between qPCR and chip data for Y loss and normal, with CV% <=10% from qPCR data were used to generate such relationship (n = 98 subjects). For qPCR, the standard curve utilized simulates known ratios of Y chromosome ranging from 10% to 100% (90% loss to no loss). This is achieved by diluting a pool of male samples with no mLOY with a pool of female samples to simulate these percentages of loss. Y gain data was not included when building the predictive model since the quantitative qPCR ratio for the estimated amount of gain is not precise since the data is extrapolated outside the experimental copy number range of 0.1–1 defined by the standard curve. For each mean LRR, the corresponding copy number can be predicted by inserting the mean LRR into the quadratic equation. The percent of cells with Y loss equals to 1- average Y signal ratio (Supplementary Figure 1C), for example a mean LRR of −0.15 corresponds to a frequency of Y loss of 22.7%. We also performed this analysis for the case-controls studies and found similar results (data not shown).

Logistic Regression Analysis

All of the logistic regression models were generated in R using the glm function with quasi-binomial family and logit as link function. To determine the relationship between individuals having Y chromosome loss and their age at DNA collection, smoking behavior, DNA source, ancestry, and study cohort, we fit several models that regressed the presence of Y loss for each individual on relevant covariates in a logistic model. To determine the relationship between individuals having Y chromosome loss and cancer diagnosis, we fit several models that regressed the presence of cancer diagnosis for each individual on relevant covariates in a logistic model. The following covariate terms were defined for each individual: (i) age of DNA (a continuous measure of age at DNA collection); (ii-a) smoking status (categorical variable with three levels: current, former, and never smokers (reference group)); (ii-b) number of years since stopped smoking (categorical variable with six levels: current smokers, 1–4, 5–10, 11–20, >20 years since quitting smoking, never smokers (reference group)); (iii) DNA source (categorical variable with 2 levels: individuals who contributed DNA derived from a buccal sample and for blood sample (reference group)); (iv) East Asian ancestry (a continuous measure of admixture estimate); (v) African ancestry (a continuous measure of admixture estimate); and (vi) study (categorical variable with 3 levels: ATBC, PLCO, CPSII (reference group)).

Survival analysis

Cox proportional-hazard models were generated in R using the Survival package. Using age as the time-scale, the start year is the year of cancer diagnosis, the end year is the year of death or censorship. For total survival analysis, the event indicator equals 1 for those who died during the study follow-up and 0 for those who did not. For cancer survival analysis, the event indicator equals 1 for those with a cancer cause of death during the period of the study and 0 for those who were alive at the end of the study or had a non-cancer cause of death. We observed similar associations when we started follow-up time at year of DNA collection. All case subjects used in the analysis had DNA collected at least 1 year before a cancer diagnosis. The following covariate terms were defined for each individual: (i) age of diagnosis (a continuous measure of age at diagnosis); (ii) BMI (a continuous measure of body mass index); (iii) pack years (categorical variable with 4 levels: >60, >60–40, >40–20, (>20 + never smokers (reference group)); (iv) mosaic Y loss (categorical variable with 2 levels: 1 for individuals identified as Y loss and 0 for individuals without Y loss. All other covariates were defined as in the logistic regression models.

GWAS Analysis

A genome wide association scan (GWAS) was conducted adjusting for smoking status (ever versus never smoker) and principal components that were significantly associated with mLOY in each cohort. The combined meta-analysis of men from the three cohorts consisted of 928 men with detectable mLOY and 12,118 men without evidence of mLOY (Supplementary Figure 5). To ensure a robust association, a further analysis was carried out that did not adjust for smoking status.

Other analysis

Frequency plots were generated using R. Confidence intervals (CI) on frequencies were reported using 95% confidence bounds from the Jefferys interval method and were generated in R using the binom package. The 95% CI unadjusted analysis of count data and frequencies was performed using the Fisher’s Exact test for contingency tables, as implemented in the R software package. GLU software package was used to estimate admixture coefficients for each subject.
  34 in total

1.  The male-specific region of the human Y chromosome is a mosaic of discrete sequence classes.

Authors:  Helen Skaletsky; Tomoko Kuroda-Kawaguchi; Patrick J Minx; Holland S Cordum; LaDeana Hillier; Laura G Brown; Sjoerd Repping; Tatyana Pyntikova; Johar Ali; Tamberlyn Bieri; Asif Chinwalla; Andrew Delehaunty; Kim Delehaunty; Hui Du; Ginger Fewell; Lucinda Fulton; Robert Fulton; Tina Graves; Shun-Fang Hou; Philip Latrielle; Shawn Leonard; Elaine Mardis; Rachel Maupin; John McPherson; Tracie Miner; William Nash; Christine Nguyen; Philip Ozersky; Kymberlie Pepin; Susan Rock; Tracy Rohlfing; Kelsi Scott; Brian Schultz; Cindy Strong; Aye Tin-Wollam; Shiaw-Pyng Yang; Robert H Waterston; Richard K Wilson; Steve Rozen; David C Page
Journal:  Nature       Date:  2003-06-19       Impact factor: 49.962

2.  Change of human chromosome count distribution with age: evidence for a sex differences.

Authors:  P A JACOBS; M BRUNTON; W M COURT BROWN; R DOLL; H GOLDSTEIN
Journal:  Nature       Date:  1963-03-16       Impact factor: 49.962

3.  Age-associated aneuploidy: loss of Y chromosome from human bone marrow cells with aging.

Authors:  R V Pierre; H C Hoagland
Journal:  Cancer       Date:  1972-10       Impact factor: 6.860

4.  Loss of Y chromosome. An age-related event or a cytogenetic marker of a malignant clone?

Authors:  D Abeliovich; O Yehuda; S Ben-Neriah; R Or
Journal:  Cancer Genet Cytogenet       Date:  1994-08

5.  Loss of the Y chromosome in bone marrow cells: results on 1907 consecutive cases of leukaemia and preleukaemia.

Authors:  C Herens; E Brasseur; M Jamar; L Vierset; I Schoenen; L Koulischer
Journal:  Clin Lab Haematol       Date:  1999-02

6.  Clinical significance of Y chromosome loss in hematologic disease.

Authors:  A Wiktor; B A Rybicki; Z S Piao; M Shurafa; B Barthel; K Maeda; D L Van Dyke
Journal:  Genes Chromosomes Cancer       Date:  2000-01       Impact factor: 5.006

7.  Deregulated expression of TCL1 causes T cell leukemia in mice.

Authors:  L Virgilio; C Lazzeri; R Bichi; K Nibu; M G Narducci; G Russo; J L Rothstein; C M Croce
Journal:  Proc Natl Acad Sci U S A       Date:  1998-03-31       Impact factor: 11.205

8.  Loss of the Y chromosome from normal and neoplastic bone marrows. United Kingdom Cancer Cytogenetics Group (UKCCG)

Authors: 
Journal:  Genes Chromosomes Cancer       Date:  1992-07       Impact factor: 5.006

9.  Sex chromosome loss and aging: in situ hybridization studies on human interphase nuclei.

Authors:  M Guttenbach; B Koschorz; U Bernthaler; T Grimm; M Schmid
Journal:  Am J Hum Genet       Date:  1995-11       Impact factor: 11.025

Review 10.  Detectable clonal mosaicism in the human genome.

Authors:  Mitchell J Machiela; Stephen J Chanock
Journal:  Semin Hematol       Date:  2013-10       Impact factor: 3.851

View more
  60 in total

1.  Insights into the loss of the Y chromosome with age in control individuals and in patients with age-related macular degeneration using genotyping microarray data.

Authors:  Felix Grassmann; Bernhard H F Weber; Reiner A Veitia
Journal:  Hum Genet       Date:  2019-05-27       Impact factor: 4.132

2.  Back to the drawing board-loss of chromosome Y (LOY) in leukocytes is associated with age-related macular degeneration.

Authors:  Lars A Forsberg
Journal:  Eur J Hum Genet       Date:  2018-10-23       Impact factor: 4.246

Review 3.  The ageing genome, clonal mosaicism and chronic disease.

Authors:  Mitchell J Machiela; Stephen J Chanock
Journal:  Curr Opin Genet Dev       Date:  2017-01-06       Impact factor: 5.578

Review 4.  Mosaicism, aging and cancer.

Authors:  Mitchell J Machiela
Journal:  Curr Opin Oncol       Date:  2019-03       Impact factor: 3.645

Review 5.  Mosaicism in health and disease - clones picking up speed.

Authors:  Lars A Forsberg; David Gisselsson; Jan P Dumanski
Journal:  Nat Rev Genet       Date:  2016-12-12       Impact factor: 53.242

6.  Characterization of breakpoint regions of large structural autosomal mosaic events.

Authors:  Mitchell J Machiela; Lea Jessop; Weiyin Zhou; Meredith Yeager; Stephen J Chanock
Journal:  Hum Mol Genet       Date:  2017-11-15       Impact factor: 6.150

Review 7.  Mosaic loss of human Y chromosome: what, how and why.

Authors:  Xihan Guo; Xueqin Dai; Tao Zhou; Han Wang; Juan Ni; Jinglun Xue; Xu Wang
Journal:  Hum Genet       Date:  2020-02-04       Impact factor: 4.132

8.  Why Y? Downregulation of Chromosome Y Genes Potentially Contributes to Elevated Cancer Risk.

Authors:  Derek W Brown; Mitchell J Machiela
Journal:  J Natl Cancer Inst       Date:  2020-09-01       Impact factor: 13.506

9.  LDtrait: An Online Tool for Identifying Published Phenotype Associations in Linkage Disequilibrium.

Authors:  Shu-Hong Lin; Derek W Brown; Mitchell J Machiela
Journal:  Cancer Res       Date:  2020-06-30       Impact factor: 12.701

Review 10.  Human Y-chromosome variation in the genome-sequencing era.

Authors:  Mark A Jobling; Chris Tyler-Smith
Journal:  Nat Rev Genet       Date:  2017-05-30       Impact factor: 53.242

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.