Literature DB >> 22561516

Detectable clonal mosaicism from birth to old age and its relationship to cancer.

Cathy C Laurie1, Cecelia A Laurie, Kenneth Rice, Kimberly F Doheny, Leila R Zelnick, Caitlin P McHugh, Hua Ling, Kurt N Hetrick, Elizabeth W Pugh, Chris Amos, Qingyi Wei, Li-e Wang, Jeffrey E Lee, Kathleen C Barnes, Nadia N Hansel, Rasika Mathias, Denise Daley, Terri H Beaty, Alan F Scott, Ingo Ruczinski, Rob B Scharpf, Laura J Bierut, Sarah M Hartz, Maria Teresa Landi, Neal D Freedman, Lynn R Goldin, David Ginsburg, Jun Li, Karl C Desch, Sara S Strom, William J Blot, Lisa B Signorello, Sue A Ingles, Stephen J Chanock, Sonja I Berndt, Loic Le Marchand, Brian E Henderson, Kristine R Monroe, John A Heit, Mariza de Andrade, Sebastian M Armasu, Cynthia Regnier, William L Lowe, M Geoffrey Hayes, Mary L Marazita, Eleanor Feingold, Jeffrey C Murray, Mads Melbye, Bjarke Feenstra, Jae H Kang, Janey L Wiggs, Gail P Jarvik, Andrew N McDavid, Venkatraman E Seshan, Daniel B Mirel, Andrew Crenshaw, Nataliya Sharopova, Anastasia Wise, Jess Shen, David R Crosslin, David M Levine, Xiuwen Zheng, Jenna I Udren, Siiri Bennett, Sarah C Nelson, Stephanie M Gogarten, Matthew P Conomos, Patrick Heagerty, Teri Manolio, Louis R Pasquale, Christopher A Haiman, Neil Caporaso, Bruce S Weir.   

Abstract

We detected clonal mosaicism for large chromosomal anomalies (duplications, deletions and uniparental disomy) using SNP microarray data from over 50,000 subjects recruited for genome-wide association studies. This detection method requires a relatively high frequency of cells with the same abnormal karyotype (>5-10%; presumably of clonal origin) in the presence of normal cells. The frequency of detectable clonal mosaicism in peripheral blood is low (<0.5%) from birth until 50 years of age, after which it rapidly rises to 2-3% in the elderly. Many of the mosaic anomalies are characteristic of those found in hematological cancers and identify common deleted regions with genes previously associated with these cancers. Although only 3% of subjects with detectable clonal mosaicism had any record of hematological cancer before DNA sampling, those without a previous diagnosis have an estimated tenfold higher risk of a subsequent hematological cancer (95% confidence interval = 6-18).

Entities:  

Mesh:

Year:  2012        PMID: 22561516      PMCID: PMC3366033          DOI: 10.1038/ng.2271

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


INTRODUCTION

Chromosomal mosaicism is the presence of different karyotypes in two or more cell lineages within an individual derived from a single zygote[1,2]. This karyotypic variation may arise early in development and involve both the soma and the germline or it may occur later and be restricted to one or more specific cell types. In cancer, chromosomal anomalies can initiate a neoplastic clone or arise during clonal evolution and serve as clonal markers[3]. Here we consider such clonal variation as a form of mosaicism, since the cancer cells may have acquired one or more chromosomal abnormalities, while other cells in the same tissue, or elsewhere in the body, retain the normal karyotype. Chromosomal mosaicism in humans has been well studied in embryos[4,5], fetuses from spontaneous abortions[6], children with birth defects or developmental delay[7,8] and cancer patients[9]. However, little is known about the type, frequency and age distribution of acquired chromosomal anomalies in large samples from the general population[9,10]. Data from genome-wide association studies now provide an opportunity to detect chromosomal variation in tens of thousands of people of all ages and to investigate the association of mosaicism with disease. Single nucleotide polymorphism (SNP) microarray data are used routinely to detect chromosomal anomalies (copy number variants (CNV) and uniparental disomy (UPD)) in clinical cytogenetic laboratories[11,12] and to detect small CNVs in population studies[13-15]. However, the analytical methods used in population studies are not optimized for detecting large anomalies or mosaicism. Therefore, we developed an efficient method to identify and localize large (50 kb to whole-chromosome) anomalies and mosaicism within a single DNA sample. This method requires a relatively high frequency of cells (>5–10%) with the same abnormal karyotype (presumably of clonal origin) in the presence of normal cells. Therefore, we use the term ‘detectable clonal mosaicism’, rather than simply ‘chromosomal mosaicism’, to emphasize the observation of clones of cells with abnormal karyotype that occur at a frequency sufficient for detection using SNP microarray data. DNA samples (primarily from peripheral blood) from over 50,000 people genotyped for the Gene-Environment Association Studies (GENEVA) consortium[16] were analyzed to detect clonal mosaicism. The GENEVA studies include all ages from birth to old age, several major ethnic groups, and a variety of different health conditions, including healthy controls (Table 1, Supplementary Table 1, Supplementary Fig. 1). Here we characterize the types of chromosomal anomalies detected, show how the prevalence of detectable clonal mosaicism within blood cells increases with age, and examine the association between mosaic anomalies and hematological cancer.
Table 1

Summary of GENEVA study characteristics.

GENEVA studyIllumina arrayaRelatednessDesignEthnicitybMean Agec% MaleNd
MelanomaOmni1Mmostly unrelatedecase-controlEur52582,947
Lung Health660Wmostly unrelatedcohortEur54634,087
Cleft Lip/Palate610trioscase-parent trioEur & Asian33526,860
Addiction1Mmostly unrelatedcase-controlEur & AA39462,790
Lung Cancer550mostly unrelatedcase-controlEur66725,518
Blood clottingOmni1Msib pairspopulation sampleEur21381,158
Prostate cancer Japanese/Latino660Wmostly unrelatedcase-controlAsian+ Hisp711004,281
Prostate cancer African-American1Mmostly unrelatedcase-controlAA691004,338
Venous Thrombo-embolism660Wmostly unrelatedcase-controlEur55492,591
Birth weight Afro-Caribbean1Mmother-offspring duospopulation sampleAC25252,254
Birth weight European610mother-offspring duospopulation sampleEur31252,712
Birth weight Hispanic1Mmother-offspring duospopulation sampleHisp29211,419
Dental caries610Families and singletonspopulation sampleEur36453,841
Prematurity660Wmother-offspring duoscase-controlEur30263,725
Glaucoma660Wmostly unrelatedcase-controlEur67421,977

See Supplementary Table 1 for a full description of the array type.

Predominant ethnicity; most studies have small numbers of other ethnicities. Eur=European-ancestry, AA=African-American, Hisp=Hispanic, AC=Afro-Caribbean.

Mean age of participants older than 15 years. The Cleft, Birth Weight, Dental Caries and prematurity studies also have substantial numbers of infants and children less than 15 years old.

Total number of subjects with genotyped samples analyzed in this study.

“Unrelated” here means less relatedness than second degree relatives, based on estimation of identity-by-descent coefficients

RESULTS

Types of anomalies detected

This report deals with autosomal anomalies, defined here as deviations from the normal biparental disomic state. Anomalies were detected using log R ratio (LRR) and B Allele Frequency (BAF)[17]. LRR is a measure of relative signal intensity (log2 of the ratio of observed to expected intensity, where the expectation is based on other samples). BAF is an estimate of the frequency of the B allele of a given SNP in the population of cells from which the DNA was extracted. In a normal cell, the B allele frequency at any locus is either 0 (AA), ½ (AB) or 1 (BB) and the expected LRR is 0. Both copy number changes and copy-neutral changes from biparental to uniparental disomy (UPD) result in changes in BAF, while copy number changes also affect LRR (Figures 1 and 2). Our detection method identifies both non-mosaic (constitutional) and clonal mosaic anomalies, which were distinguished subsequently using standards based on parent-offspring transmission in family studies and polymorphic CNVs in non-family studies. Three types of clonal mosaics were detected: mixtures of disomic and monosomic cells (deletions), mixtures of disomic and trisomic cells (duplications), and copy-neutral mixtures of biparental and acquired uniparental disomy (aUPD) (see examples in Figure 3 and Supplementary Figure 2). The aUPDs are primarily terminal segments, as expected for an origin through mitotic crossing over (Supplementary Fig 3), while some cases of whole-chromosome aUPD may be due to aneuploidy rescue (Supplementary Fig. 4).
Figure 1

Expected values of B Allele Frequency (BAF) and Log R Ratio (LRR) for discrete copy number states. Mosaics have intermediate positions between these discrete states. Copy number is given above the horizontal lines in the LRR plot, while SNP genotypes are given in the BAF plot. M=maternal and P=paternal chromosome. States with M and P reversed are also possible. The scatter of points for copy number = 0 (homozygous deletion) represents background signal noise.

Figure 2

Expected and observed values of B Allele Frequency (BAF) and Log R Ratio (LRR) metrics for clonal mosaic anomalies detected in GENEVA subjects. (a) Expected. (b) Observed (N=514). In (b), “med”=median, “anom”=within the anomaly, “nonanom”=non-anomalous autosomal regions of the same sample. The solid purple and red circles represent the mean values of non-mosaic anomalies and the solid black and cyan circles are theoretical.

Figure 3

B Allele Frequency (BAF) and Log R Ratio (LRR) plots of four representative mosaic anomalies. Each pair of plots is for a different sample-chromosome combination and each point is a single SNP. Points in BAF plots are color-coded by genotype (red=AA, cyan=AB, purple-blue=BB, black=missing call). The vertical black lines indicate the breakpoint(s) of the anomaly. The vertical gray rectangle is the centromeric gap. Horizontal pink lines are drawn at 0, 1/3, 1/2, 2/3 and 1 in the BAF plots. The solid horizontal red line in each plot is the median value for non-anomalous regions of the autosomes. The horizontal dashed red line is the median value within the anomaly. (a) Mosaic acquired uniparental disomy for distal 12q is indicated by the split in the intermediate BAF band along with the lack of change in LRR. A non-mosaic uniparental isodisomy would have only two BAF bands (at 0 and 1). (b) Mosaic trisomy for chromosome 19 is indicated by a narrow split in the intermediate BAF band along with a small elevation of LRR. A non-mosaic trisomy would have a wider BAF split (at 1/3 and 2/3) and a larger elevation of LRR. (c) A mosaic deletion on 20q is indicated by a narrow split in the intermediate BAF band along with a small decrease in LRR. A non-mosaic heterozygous deletion would have no intermediate BAF bands and a larger decrease in LRR. (d) A mosaic deletion on 6q is indicated by a wide split in the intermediate BAF band along with a large decrease in LRR. The mosaic deletion in (d) has a greater proportion of cells containing the deletion than the one in (c). See Supplementary Figure 2 for additional examples.

Using a method optimized to detect large anomalies (50 kb to whole chromosome), we identified at least one non-mosaic anomaly (i.e. large CNV) in 75% of all subjects, at least one clonal mosaic anomaly in 0.80%, and both types in 0.69%. The median size of all anomalies detected is 150 kb (Supplementary Fig. 5) and the mean number per subject is 1.5, with a range of 0 to 13. There were 514 mosaic anomalies in 404 of 50,222 subjects analyzed. The reproducibility (in 568 duplicate sample pairs) of all anomalies analyzed for mosaic status is 82% (with >80% overlap; see Methods and Supplementary Table 2 for details). For clonal mosaic anomalies in duplicate samples, the reproducibility is 15/22 = 68% and all discordant calls appear to be false negatives, based on examination of BAF/LRR plots. We also assessed the reproducibility of clonal mosaic anomaly calls in comparison with the results of Jacobs et al.[18], who analyzed the same raw data for 5,510 subjects from the GENEVA Lung Cancer study. While both methods detected 83 mosaics, the GENEVA method described here detected an additional 28 mosaics (8 > 2 Mb) and the Jacobs method detected an additional 20 mosaics (all > 2 Mb). The overall reproducibility is 63% or, when considering only anomalies greater than 2 Mb (the size-detection limit of the Jacobs method), 75%. Both estimates are considerably greater than the 25–50% reproducibility across methods estimated for several common CNV-calling algorithms[19]. All of the discordant mosaic detections appear to be due to false negatives. The Jacobs method is more conservative with respect to size threshold (2 Mb), while our method is more conservative with respect to sample quality (but calling mosaics involving segments less than 2 Mb when sample quality is sufficient). Therefore, the false negative rate of both methods appears to be high and the prevalence of clonal mosaic anomalies detected here is likely to be underestimated. Mosaic detection is difficult when the fraction of abnormal cells is extreme, when the anomaly length is small or when sample quality is low (i.e. high BAF/LRR variability). The clonal mosaic anomalies detected in GENEVA subjects were classified as 15.6% duplications, 50.4% deletions and 34.0% aUPDs. All three classes of mosaic anomalies are large (Figure 4 and Supplementary Fig. 6). Median lengths are 34.1 Mb for duplications, 3.8 Mb for deletions and 39.8 Mb for aUPD. Mosaic aneuploidies include +8, +9, +12, +14, +15, +18, +19, −21, and +22, while whole-chromosome mosaic UPDs include chromosomes 2, 3, 13, 14, and 15. Plots of the breakpoints of all mosaic anomalies are provided in Supplementary Figure 7 and genomic coordinates (along with other information) are provided in Supplementary Table 3.
Figure 4

The lengths and chromosomal positions of the 514 clonal mosaic anomalies detected in GENEVA subjects. An ideogram of each autosome is shown with scaled and color-coded representations of each mosaic anomaly to the right.

There is a highly significant excess of subjects with multiple clonal mosaics, compared to the Poisson distribution expected if the anomalies occurred independently. The multiples are of two kinds: (a) ‘compound’ sets of anomalies adjacent to one another on a single chromosome, suggesting a single event or related mechanism of origin (e.g. Supplementary Figure 2g) and (b) non-adjacent sets. Among the 404 mosaic subjects, 64 had multiple mosaics of one or both types (while 2.6 were expected) and 55 had only non-adjacent sets (2.4 expected). The excess of multiple mosaics occurs for both CNVs and aUPD. The age of subjects with multiple anomalies is not significantly different than those with a single anomaly (p=0.99).

The frequency of detectable clonal mosaicism increases with age

The observed frequency of subjects with one or more clonal mosaic anomalies detected (‘mosaic status’) is shown in Figure 5 and Supplementary Table 4. It is low (< 0.5%) in subjects less than 50 years old, but increases thereafter to 2.7% in subjects over 80. The mosaic frequency is 0.2% in both the 0–14 (15/8535) and 15–29 year old group (16/6739), despite the fact that approximately half of the 0–14 year old subjects have a phenotypic abnormality (non-syndromic cleft lip/palate, prematurity or low birth weight). Excluding subjects less than 15 years old, in multiple logistic regression of mosaic status on age at DNA sampling, and adjusting for several covariates (study, sex, DNA source, and ethnicity), age is a highly significant predictor of mosaic status (p = 2 × 10−16, odds ratio=1.05, 95% confidence interval (CI)=1.04 – 1.07). Among the covariates, only study is significant (p=0.01) and a subsequent test of age-by-study interaction was not significant. It is notable that DNA source (92% from blood, 8% from saliva/buccal swabs) was not a significant predictor (p=0.45). When only blood samples are analyzed, the age effect estimate is the same (to three decimal places) and the p-value is only slightly higher (4 × 10−15). Copy-number mosaics and aUPD, when tested separately, each have a significant age effect and similar odds ratios (p-value for gain=0.01, loss=5 × 10−11, aUPD=6 × 10−8; OR (95% CI) for gain = 1.032 (1.005 – 1.061), loss = 1.057 (1.039 – 1.075), aUPD = 1.056 (1.035 – 1.077).
Figure 5

The percentage of subjects having one or more mosaic anomalies within 5-year age bins. Vertical bars are 95% confidence intervals. For two cells with zero counts, the upper bar connects zero to the frequency with a lower 95% confidence interval of zero given the sample size. Expected leukemia values are given for reference and calculated using age- and sex-specific prevalence estimates (http://seer.cancer.gov).

This age effect is specific for mosaic anomalies. The same logistic regression performed with the non-mosaic anomalies did not have a significant age effect (p=0.11) and the sign of the regression coefficient estimate was reversed (Supplementary Figure 8). This result indicates that our classification method distinguishes effectively between acquired and constitutional anomalies. To further explore the robustness of the age effect on clonal mosaicism, additional analyses were performed with each of the seven studies having more than 1,000 subjects over 50 years old (using both blood and saliva/buccal samples). Only the age effect was significant (p=8 × 10−16) in a combined logistic regression of mosaic status on study, sex, DNA source, ethnicity and smoking status (separately testing either ‘ever’ smoker or ‘never’ smoker). When only controls from these studies were analyzed together, the age effect remained highly significant (p=7 × 10−11). We also analyzed each study separately, with age and the case status specific to each study. A meta-analysis shows a highly significant effect of age (Figure 6), which is very robust to differences in both study and subject characteristics.
Figure 6

Fixed-effects meta-analysis for effect of age at DNA sampling on mosaic status. Effect estimates are from logistic regression of mosaic status on age at DNA sampling, with adjustment for case status specific to each study. The summary estimate of the log odds ratio is 0.05 (95% CI=0.04 – 0.07) and the corresponding odds ratio is 1.06 (95% CI=1.04 – 1.07). Cochran’s Q test of heterogeneity has p-value=0.89. The sizes of the black boxes are proportional to the inverse of the squared standard error and the gray lines are 95% confidence intervals. The horizontal points of the diamond span the 95% confidence interval of the summary estimate. See Table 1 for study descriptions. AA = African American and JL = Japanese/Latino.

These cross-sectional analyses strongly suggest that most of the mosaic anomalies detectable by SNP microarrays appear late in life, because they arise more frequently and/or because they are more readily detected due to clonal expansion. This suggestion is supported by longitudinal observation in one GENEVA subject (the only subject sampled twice who had mosaicism in at least one sample). This subject was sampled at age 66 and again at age 72 (both with DNA from saliva). No mosaic anomalies were detected in the earlier sample, but the later sample contained 5 mosaic deletions, each on a different chromosome. Additional studies with subjects sampled at multiple ages are needed to evaluate the temporal origin and stability of mosaic anomalies. In some GENEVA subjects, anomalies appear to have occurred early enough in development to be mosaic in both the soma and germline. In 35 parent-offspring pairs in which a mosaic anomaly was detected in the parent, there are three cases in which the offspring appears to be non-mosaic for the same anomaly (one deletion and two duplications), while there is no corresponding anomaly (mosaic or otherwise) in the remaining 32 offspring. Although this result suggests that a fairly large fraction of cases have mosaicism shared by the germline and soma, it may not be representative of the more frequent mosaics that occur in older subjects because parents in the family studies were sampled in their 20s and 30s (Table 1). The mosaics that appear in subjects less than 50 years of age may have different origins than those that appear later, when the frequency increases rapidly.

Mosaic anomalies characteristic of hematological cancers

The clonal mosaic anomalies detected in this study tend to cluster in location both within and among chromosome arms (Figure 4; Supplementary Fig. 7 and 9). Regions with multiple overlapping anomalies frequently coincide with regions of copy number change or aUPD characteristic of hematological cancers. Using the Mitelman "Recurrent Chromosome Aberrations in Cancer Database" (http://cgap.nci.nih.gov/Chromosomes/Mitelman), we found that 222 of 669 recurrent duplications and deletions found in hematological cancers have >80% overlap with at least one mosaic CNV in GENEVA subjects. Also, 77% of GENEVA mosaic CNVs have >80% overlap with the Mitelman aberrations and 48% overlap both cytological bands defining the limits of the aberration. The most common overlaps are 20q-, 13q-, 11q-, 17p-, 12+ and 8+. Common deleted regions (CDR) of mosaic anomalies in different GENEVA subjects often pinpoint genes previously associated with the hematological cancers. The following examples are shown in Supplementary Figure 7: (1) On 13q, 31 deletions have a CDR of 299 kb, containing only one gene, DLEU7, which is thought to be a tumor suppressor[20]. In addition, 18 deletions on 13q cover RB1 and 24 cover MiR15a and MiR16-1. Deletions in this region (13q14) represent the most common cytogenetic abnormality in chronic lymphocytic leukemia (CLL)[21], which is the most common leukemia in older adults (http://seer.cancer.gov). (2) On 4q, 14 deletions have a CDR of 214 kb containing only one gene, the TET2 oncogene, which is commonly deleted in myelodysplastic syndrome (MDS), myeloproliferative disorder (MPD) and acute myeloid leukemia (AML)[22]. (3) On 2p, 17 deletions have a CDR of 194 kb, which contains two genes, one of which is DNMT3A, recently found to be commonly mutated in AML-M5[23]. (4) On 22q, 11 deletions have a CDR of 153 kb, which includes three genes, one of which is PRAME, which is frequently deleted in CLL[24]. (5) On 20q, 46 deletions have a CDR of 965 kb, containing 7 genes including L3MBTL1, which is a candidate tumor suppressor in del(20q12) myeloid disorders[25]. Long (multi-megabase) segments of aUPD are frequently observed in cancers of many types[26]. In most cases, the UPD occurs on a terminal segment of one arm, consistent with origin by a single mitotic crossover, followed by outgrowth of one of the daughter cells. Acquired UPDs are frequently observed in hematological cancers such as MDS, MPD and AML and are associated with homozygosity of mutations in several tumor suppressors and oncogenes[27,28]. All autosomes (except chromosome 10) have at least one clonal mosaic aUPD in GENEVA subjects. Chromosomes 9 (with 24), 14 (with 21) and 11 (with 19) have the most aUPDs, which greatly exceed the expected number based on arm length (Supplementary Figure 9). Despite the observation that many of the clonal mosaic anomalies observed here are characteristic of hematological cancer, the fraction of subjects with one or more mosaics who have a record of hematological cancer before DNA sampling is low. This fraction was estimated as 2.8% (95% CI=1.0 – 4.7%) in 291 mosaic subjects (with DNA from blood; from 13 GENEVA studies; using medical records, self-reported conditions and study exclusion criteria, as described in the Supplementary Note).

Hematological cancer incidence

We investigated whether detectable clonal mosaicism predisposes to incident hematological cancer after DNA sampling by using three GENEVA studies, which included cohorts with cancer diagnosis records both before and after DNA sampling. From the following studies, we analyzed 8,562 subjects who had DNA derived from blood and no record of hematological cancer prior to DNA sampling: (1) Glaucoma study, with subjects from the Nurses Health Study (NHS, N=363) and Health Professionals Follow-up Study (HPFS, N=285), (2) Lung Cancer study, with subjects from the Prostate, Lung, Colorectal and Ovarian Cancer Screening Trial (PLCO, N= 1600) and (3) Prostate Cancer study, with subjects from the Multiethnic Cohort (MEC, N=6314). Among the 8,562 subjects analyzed for incident hematological cancer, 8,323 were non-mosaics with no events, 90 were non-mosaics with events, 134 were mosaics with no events, and 15 were mosaics with events (where ‘event’ is a hematological cancer diagnosis). To test for an association between mosaic status and incident hematological cancer, we used a cause-specific Cox proportional hazards model to analyze time to a hematological cancer diagnosis from the date of DNA sampling, with right censoring at death and the endpoint of follow-up data. We performed a stratified analysis of the four cohorts, which included mosaic status and adjusted for age at DNA sampling, non-hematological cancer status (as a time-dependent covariate), ethnicity (two principal components) and sex (within the PLCO stratum). The hazard ratio estimate for mosaic status is 10.1 (95% CI=5.8 – 17.7) with a p-value of 3 × 10−10. A meta-analysis showed consistent results among cohorts and gave a very similar effect estimate (Supplementary Figure 10). These results estimate that the risk of hematological cancer is ten-fold higher for mosaic than for non-mosaic subjects. Because both cancer and the clonal mosaic anomalies detected in this study increase with age, the adjustment for age at time of DNA sampling in the Cox regression model is critical. We modeled the age covariate as either a linear effect or as a non-linear effect (spline smoothing with 5 degrees of freedom) and found that the mosaic effect estimates and p-values are essentially identical. Among the 15 mosaic subjects who had a hematological diagnosis after DNA sampling, four had myeloid leukemia, six had chronic lymphocytic leukemia, one had multiple myeloma, one had MDS, one had MPD and two had non-Hodgkin lymphoma. Thus, the 15 cases are about evenly divided between mature B-cell neoplasms and myeloid malignancies. Not surprisingly, the leukemias are over-represented among mosaic compared with non-mosaic subjects (p-value=0.005, Supplementary Tables 5 and 6). A variety of chromosomal anomalies were found in the mosaic subjects (Supplementary Table 7). Deletions covering the CDRs described above were found in several of these subjects: 13q- in five CLLs, 4q- in one chronic myelogenous leukemia (CML), 20q- in one multiple myeloma and one AML, and 22q- in one CLL. Five of the 15 mosaic subjects with incident hematological cancer had more than one mosaic anomaly, which is higher than in the remaining subjects within this set of cohort samples (25/134), although not significantly so (p=0.18). Although the risk of incident hematological cancer is estimated as 10-fold higher for mosaic than for non-mosaic subjects (95% CI=5.8 – 17.7), it is important to note that the incidence rate in mosaics is low (10 year event rate of 0.143, 95% CI=0.065 – 0.214, Figure 7) and that only a small fraction of GENEVA mosaic subjects have a record of hematological cancer before DNA sampling (2.8%, 95% CI=1.0 – 4.7%). The period between first appearance of detectable clonal mosaicism and incidence of hematological cancer is of interest, but cannot be estimated from our data since mosaicism was present for an unknown period of time prior to DNA sampling. However, the median time of 3.5 years between DNA sampling and hematological cancer diagnosis provides a very rough minimum estimate (range 3.5 months to 10.7 years with N=15; see Figure 7).
Figure 7

A Kaplan-Meier plot of the proportion of living subjects who remain free of hematological cancer as a function of time since the time of DNA sampling and determination of mosaic status. Estimates for mosaic (red) and non-mosaic (black) subjects are given separately (solid lines), each with their 95% confidence intervals (dashed lines). The vertical ticks represent censoring times. For the 15 mosaic subjects with incident cancer, the times between DNA sampling and diagnosis are 3.5, 6.1, 12.7, 18.8, 25.0, 36.9, 37.5, 42.9, 44.0, 46.2, 48.0, 60.4, 61.8, 91.1, and 130.5 months.

Non-hematological cancer

To investigate the relationship between mosaic status and non-hematological cancer, two types of analyses were done. First, in each of the three GENEVA case-control cancer studies (Lung Cancer, Prostate Cancer, Melanoma), we did logistic regression of mosaic status on case status and age at DNA sampling. Case status was not significant in any of the three studies or in a meta-analysis (one-tailed p=0.06). The estimated odds of having a clonal mosaic anomaly was higher among cancer cases than controls in the lung and prostate cancer studies, but lower in the melanoma study (Supplementary Fig. 11). Second, in the cohort studies (PLCO, HPFS, NHS and MEC), we did logistic regression of mosaic status on whether or not the subject had a non-hematological cancer prior to DNA sampling (excluding any hematological cancer cases). In these analyses the relationship is consistently positive, but small and not significant overall (one-tailed p=0.11, Supplementary Figure 12). In summary, the evidence hints at a positive relationship between mosaic status and non-hematological cancer, but lacks statistical significance. Therefore, further work is needed in larger sets of non-hematological cancer studies, including data on potential exposure, disease and treatment effects.

DISCUSSION

Here we have shown that the frequency of subjects with detectable clonal mosaicism for large chromosomal anomalies in peripheral blood is low (<0.5%) from birth until 50 years of age, after which it rises rapidly. This relationship between mosaicism and age is very robust to both study and subject characteristics. Among the covariates sex, ethnicity, smoking and disease status (exclusive of hematological cancer), none had a significant effect on mosaic status. The age effect in GENEVA subjects is consistent with a recent study showing that acquired differences in structural chromosome variants between members of monozygotic twin pairs (including clonal mosaic anomalies) are observed in pairs >55 years of age but not in younger pairs[29]. Nevertheless, longitudinal studies are required to rule out the possibility that a trend in environmental exposures across birth cohorts may contribute to the increase in mosaicism with age. The observed increase in detectable clonal mosaicism late in life may be due to a change in the frequency with which chromosomal anomalies occur (i.e. increased somatic mutation rate) and/or their ability to form large clones (i.e. clonal expansion). Previous work has shown that the occurrence of chromosomal anomalies (rearrangements and aneuploidies) during cell division increases with age in cultured lymphocytes and fibroblasts[30,31], that DNA damage accumulates with age in mouse hematopoietic stem cells[32], and that mitotic recombination (leading to uniparental disomy) increases with replicative age in yeast[33]. This apparent increase in somatic mutation may result from age-related decline in genomic maintenance mechanisms (such as telomere attrition[34]). Clonal expansion of cells containing chromosomal anomalies could be due to either positive selection or to random changes in the frequencies of hematopoietic stem cell descendants. In principle, stem cell senescence and age-related decline in replicative function[35] could result in a decrease in the effective population size of stem cells, leading to shifts in clonal composition analogous to random drift in small populations of individuals[36]. However, analyses of the clonal composition of blood cells, based on X-inactivation markers in healthy women, suggest stability over time and between lymphoid and myeloid lineages, even in the elderly[37,38]. Therefore, in most cases, positive selection may be required to establish clones of cells with chromosomal anomalies that are sufficiently large for detection with SNP microarrays. The potential for positive selection may increase with age as somatic mutations accumulate in genes that regulate cellular proliferation. For example, a highly proliferative clone may arise when a recessive tumor suppressor mutation becomes hemizygous in combination with a deletion, or homozygous due to aUPD. This suggestion is supported by the observation that acquired anomalies tend to cluster in certain genomic regions and that common deleted regions pinpoint genes previously associated with hematological cancer. In the mosaics described in this study, the chromosomally abnormal cells constitute a significant fraction of white blood cells, since a minimum of 5–10% is required for detection by our method and many abnormal clones are substantially larger (Figure 2). The blood samples used for DNA extraction were not fractionated by white blood cell type. The abnormal blood cells within an individual may include multiple cell types if the anomaly arose in a multipotent hematopoietic stem cell that became predominant due to senescence or positive selection within the stem cell population. Alternatively, the abnormal cells may include a restricted set of cell types, particularly when the normal composition of blood (i.e. 60–70% of neutrophils and 20–40% of lymphocytes[39]) is altered by unregulated proliferation[40]. There is a strong association between the clonal mosaic anomalies detected in our study and hematological cancer. We estimate the risk of acquiring a hematological cancer diagnosis as 10-fold higher for subjects with mosaic anomalies. This association is strongly supported by finding that many of the mosaic anomalies are characteristic of those found in hematological cancers. Nevertheless, the event numbers analyzed here are small and additional studies are needed across a broader diversity of cohorts to establish the clinical significance of these findings. Notwithstanding the strong association with hematological cancer, we estimated that ~97% of subjects with clonal mosaic anomalies did not have a record of a hematological cancer prior to DNA sampling and the incidence rate is low (~ 14% over ten years in subjects who survive and are not lost to follow-up during this period). These results suggest that the clonal mosaicism observed in elderly subjects may be an asymptomatic condition with a predisposition to hematological cancer that is often not realized. It is possible that many of the subjects with detectable clonal mosaicism in our study have monoclonal B-cell lymphocytosis (MBL), an asymptomatic condition with an estimated prevalence of 3–5% in the elderly. MBL is characterized by a clonal population of B lymphocytes with an immunophenotype similar to CLL or other B-cell malignancy[41]. Most, if not all, cases of CLL are preceded by MBL, but most cases of MBL do not progress to malignancy[42,43]. However, 85% of MBL detected in population screening studies have a B-cell count below 500/μl[43], which is less than 10% of the normal white blood cell count. Since 10% is near the lower limit of detection for chromosomal mosaicism using our methods, the two types of clones may not be closely related. Nevertheless, further work on the relationship between B cell immunophenotypes and mosaic anomalies is warranted. Although it appears that most of the clonal mosaicism observed in GENEVA subjects represents a non-malignant condition, further work is needed to evaluate the fraction of subjects who might have unrecorded malignant conditions such as MDS and MPD, or undiagnosed CLL. MDS and MPD were added to the Surveillance, Epidemiology, and End Results (SEER) cancer registries in 2001 and may still be under-recorded because they are often managed outside of the hospital setting[44]. Therefore, accurate prevalence data from widespread populations are not available, but local population estimates (0.1% MDS[45] and 0.5% MPD[46] in the elderly) are substantially less than the ~2.5% of GENEVA subjects with mosaic anomalies in the over 75 age. This survey is the first large-scale study of acquired chromosomal anomalies in people of all ages and various states of health. Previously, the extent of chromosomal variation within developmentally normal individuals, in the absence of overt cancer, was largely unknown. The results presented here indicate that a significant fraction of blood cells in people without a prior history of hematological cancer may contain large chromosomal anomalies, including multi-megabase deletions, duplications and aUPD. The frequency of people with such clonal anomalies in a mosaic state is low up to about 50 years of age and then increases rapidly up to 2–3%. We find that these anomalies are associated with an approximately ten-fold higher risk of hematological cancer, but subjects with detectable clonal mosaicism may survive for years without having a hematological cancer diagnosis. Further work is needed to determine the stability of the mosaic state over time, to replicate and improve estimates of the predisposition to hematological cancer, and to identify anomalies associated with asymptomatic cancer precursor conditions. It also will be important to explore the health consequences of these anomalies for conditions other than cancer, such as immune system function.

METHODS

Study subjects, phenotypic data and genotyping

Subjects were recruited for 15 different studies belonging to the Gene Environment Association Studies (GENEVA) consortium[16] (Table 1). Each study was approved by the institutional review board of the study investigator’s institution, and all subjects provided written informed consent for participation in the study. The Supplementary Note describes the phenotypes. Each study was genotyped on one of five different Illumina array types at the Center for Inherited Disease Research (CIDR), the Broad Institute Center for Genotyping and Analysis, or the University of Southern California (Supplementary Table 1). DNA samples were derived from blood (92%) or saliva/buccal swabs (8%). No lymphoblastoid cell line or whole-genome amplified samples were included in the analyses described here. Because cell lines may have artifactual mosaic anomalies[47], mis-identification of DNA source is a concern. However, only the Addiction study had both cell line and non-cell line samples and the non-cell line samples analyzed here did not have an unusual frequency of mosaic anomalies. Genotypic data cleaning and calculation of BAF and LRR are described in the Supplementary Note. Sample sizes for analyses vary (as stated in Results) because a small proportion of the subjects are missing data for age at DNA sampling or other variables.

Anomaly detection and quality control

The method of anomaly detection is described in detail in the Supplementary Note and summarized here. Detection of anomalies (both mosaic and non-mosaic) was based on BAF and LRR metrics. The primary focus for detecting anomalies was BAF, because we wanted to identify copy-neutral events (mosaic UPD) and because BAF is much less noisy and prone to artifacts (such as GC waves[48]) than LRR. The main approach was to detect a split in the BAF intermediate band, which in normal (biparental disomic) samples is centered at 1/2 and corresponds to AB heterozygotes (Figure 1). In trisomic samples, this band splits into two components (AAB and ABB) at BAF= 1/3 and 2/3. In disomic-trisomic mosaics, the width of the split varies from zero to one third and LRR varies from zero to a theoretical value of log2(3/2). In disomic-monosomic mosaics, the width of the split varies from zero to one and LRR varies from 0 to a theoretical value of log2(1/2). In biparental-uniparental disomic mosaics, the width of the split varies from zero to one, while LRR remains constant at zero. These transitions are shown in Figure 2 as deviations from expected. In chromosomal regions containing heterozygous SNPs, use of BAF alone can detect duplications (both mosaic and non-mosaic), mosaic deletions, mosaic uniparental disomy and homozygous deletions. LRR is required to detect monosomic regions and duplications in regions lacking heterozygosity. Therefore, we implemented two separate but complementary methods, called ‘BAF’ and ‘LOH’ (the latter for LRR change detection in regions lacking heterozygosity). Anomalies detected by the BAF method were classified as mosaic or non-mosaic. Anomalies detected by the LOH method were used here only to define the BAF/LRR position of heterozygous deletions and not for mosaic detection. We did not attempt to identify non-mosaic segments of uniparental isodisomy, which have no heterozygosity and normal LRR. In the BAF method, Circular Binary Segmentation (CBS)[49] was used to detect change points in a metric modified from Itsara et al.[15]: sqrt(min(BAF,1-BAF,|BAF -median(BAF)|)) for SNPs called as missing or heterozygotes (i.e. excluding homozygotes). The use of missing calls allows detection of wide splits (e.g. Figure 3d). In the LOH method, CBS was applied to LRR values and combined with overlapping runs of homozygosity. By focusing on regions of homozygosity, we avoided a high false positive rate associated with a genome-wide search for changes in LRR. In both methods, the identification of anomalous segments involved establishing a non-anomalous baseline, choosing anomalous segments based on deviation from baseline, and applying quality control filters. Computations were done using the Bioconductor packages DNACopy and GWASTools. The latter was developed by our group; relevant functions are described in the Supplementary Note. Quality control (QC) was done at the sample and anomaly level. Low quality samples (with high variance of BAF and/or LRR metrics or a high level of segmentation) were removed differentially for the two methods. Supplementary Table 1 shows the percentage of samples that passed QC for the BAF method (mean = 99.1%) and the LOH method (86.8%). In some studies, a high fraction of samples failed QC for LOH detection (maximum 47%), but the failure rates for BAF-detection (from which all mosaics were identified) are all low (maximum 8%). Anomaly-level QC involved several steps, including manual curation of all anomalies designated as mosaic and all other anomalies greater than 2 Mb in length. (see Supplementary Note). Manual curation involved evaluation of BAF/LRR plots, as shown in Figure 3 and Supplementary Fig. 2. Note that Supplementary Fig 2(m-t) shows a sample of eight of the smallest mosaic deletions. Features that distinguish mosaic from non-mosaic are described in the Figure 3 legend. The reproducibility of anomaly detection was assessed using samples genotyped in duplicate (N = 568 pairs). For each sample pair, we defined a unit of observation as a contiguous chromosomal region containing an anomaly in one or both samples. Each unit is given a score equal to the length of the intersection divided by the length of the union of anomalies in that unit. A reproducibility measure was defined as the fraction of units with a score greater than either 0.30 or 0.80 (chosen for comparison with published CNV studies). We also calculated the average of the scores that were greater than zero. Supplementary Table 2 summarizes these quantities for each study. For BAF, the mean reproducibility measure was 90% with a 30% overlap threshold and 82% with 80% overlap. For LOH, the means were 71% (30% overlap) and 67% (80% overlap). The mean of scores greater than zero was 95% for BAF- and 96% for LOH-detected anomalies (30% threshold), indicating that when an anomaly is detected in both scans, the breakpoints are highly reproducible. These reproducibility estimates are higher than the 40–60% that is typical for detecting CNVs using Hidden Markov Models (HMM)[19,50], perhaps in large part because we do not attempt to detect small anomalies (the 5th percentile of anomalies we detect is 35 kb). In our experience, standard methods of CNV detection, such as PennCNV[51], tend to break up large anomalies into many segments and to miss large mosaics.

Identifying and classifying clonal mosaic anomalies

Clonal mosaic anomalies were identified in GENEVA family studies by using transmitted anomalies to characterize the bivariate BAF/LRR distribution expected for non-mosaic (constitutional) anomalies. For transmitted anomalies, this distribution is approximately bivariate normal within a study and we used this distribution to estimate a 95% prediction ellipse[52], which defines an area likely to contain most of the constitutional anomalies (Supplementary Figure 13). Among the anomalies used to identify mosaics, the majority are 3N duplications. There is also a small cluster of 4N anomalies, but we did not attempt to detect 3N/4N mosaics. Anomalies outside of these two clusters contain mosaics and artifacts. The latter consist of false positives and anomalies with inaccurate breakpoints (which distorts the median BAF/LRR values). To distinguish between the mosaics and artifacts, we performed a manual review of BAF/LRR plots for all anomalies that fell outside of the 95% prediction ellipse and below the mean LRR for anomalies used to define the ellipse. The non-family studies were analyzed in a similar way, except that we replaced the class of transmitted anomalies with polymorphic CNVs. The latter were defined by hierarchical clustering to identify sets of anomalies with similar breakpoints. We then defined polymorphic sets as those with at least 4 members (but excluding sets with mean anomaly length greater than 10 Mb). We also included in the mosaic class three whole-autosome anomalies (12, 8, 22) that fell within the 3N ellipse, because constitutional trisomies for these chromosomes are not compatible with normal development[1]. Although we did not have access to biospecimens necessary for experimental validation of mosaics (i.e. live cells or those preserved for cytology), all anomalies classified as mosaics were manually reviewed and the BAF/LRR patterns that we observed are very similar to those reported by Peiffer[17], Rodriguez-Santiago[53] and Conlin[7], who performed cytological validation for a variety of mosaic types. Classification of clonal mosaic anomalies as duplication, deletion or aUPD was done using the median LRR and BAF deviations from non-anomalous segments (Figure 2b). Deviations from non-anomalous segments within the same sample were used to control for overall LRR variation among samples and for BAF asymmetry that occurs in some samples. Anomalies that are either terminal segments or whole chromosome and that have an LRR deviation within a ‘neutral zone’ (|LRR|<0.05) were classified as aUPD. This neutral zone was chosen because it includes nearly all of the wide splits (BAF deviation > 0.25) that have much smaller LRR deviations than expected for disomic/trisomic or disomic/monosomic transitions, while including very few interstitial anomalies (Supplementary Figure 3). All other anomalies (except for a few outliers) were classified as either duplications or deletions, depending on the sign of their LRR deviation. There is some ambiguity in classifying anomalies near the tip of the arrow, where the three transition zones intersect. This ambiguity is noted as ‘intensity.flag’ in Supplementary Table 3. Mixture proportions in mosaics can be estimated as position along the transitional line that connects the two constitutional states (Figure 2; see Supplementary Note). All anomalies discussed in this paper are autosomal in the reference genome. Detection of X chromosome mosaics is complicated by the fact that LRR is a measure of the intensity of a sample relative to other samples. X chromosome LRR values (calculated in the standard way) are affected by the sex ratio in the sample set and are not comparable to those for the autosomes.

Statistical analysis

All statistical analyses were done in the R statistical package (http://www.R-project.org) using functions described in the Supplementary Note.

URLs

http://cgap.nci.nih.gov/Chromosomes/Mitelman, Mitelman, F., Johansson, B. & Mertens, F. (eds.). Mitelman Database of Chromosome Aberrations and Gene Fusions in Cancer, (2011). http://seer.cancer.gov, SEER. US Estimated 33-Year L-D Prevalence Counts on 1/1/2008. (ed. Surveillance, E., and End Results (SEER) Program, National Cancer Institute, DCCPS, Surveillance Research Program, Statistical Research and Applications Branch, released April 2011, based on the November 2010 SEER data submission.) (2011). http://www.R-project.org, The R Development Core Team, R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0, URL. (2006).
  46 in total

1.  Age-related somatic structural changes in the nuclear genome of human blood cells.

Authors:  Lars A Forsberg; Chiara Rasi; Hamid R Razzaghian; Geeta Pakalapati; Lindsay Waite; Krista Stanton Thilbeault; Anna Ronowicz; Nathan E Wineinger; Hemant K Tiwari; Dorret Boomsma; Maxwell P Westerman; Jennifer R Harris; Robert Lyle; Magnus Essand; Fredrik Eriksson; Themistocles L Assimes; Carlos Iribarren; Eric Strachan; Terrance P O'Hanlon; Lisa G Rider; Frederick W Miller; Vilmantas Giedraitis; Lars Lannfelt; Martin Ingelsson; Arkadiusz Piotrowski; Nancy L Pedersen; Devin Absher; Jan P Dumanski
Journal:  Am J Hum Genet       Date:  2012-02-02       Impact factor: 11.025

2.  High-resolution genomic profiling of chromosomal aberrations using Infinium whole-genome genotyping.

Authors:  Daniel A Peiffer; Jennie M Le; Frank J Steemers; Weihua Chang; Tony Jenniges; Francisco Garcia; Kirt Haden; Jiangzhen Li; Chad A Shaw; John Belmont; Sau Wai Cheung; Richard M Shen; David L Barker; Kevin L Gunderson
Journal:  Genome Res       Date:  2006-08-09       Impact factor: 9.043

3.  Detection of low-level mosaicism by array CGH in routine diagnostic specimens.

Authors:  Blake C Ballif; Emily A Rorem; Kyle Sundin; Matt Lincicum; Shannon Gaskin; Justine Coppinger; Catherine D Kashork; Lisa G Shaffer; Bassem A Bejjani
Journal:  Am J Med Genet A       Date:  2006-12-15       Impact factor: 2.802

4.  Copy number variation accuracy in genome-wide association studies.

Authors:  Peng Lin; Sarah M Hartz; Jen-Chyong Wang; Robert F Krueger; Tatiana M Foroud; Howard J Edenberg; John I Nurnberger; Andrew I Brooks; Jay A Tischfield; Laura Almasy; Bradley T Webb; Victor M Hesselbrock; Bernice Porjesz; Alison M Goate; Laura J Bierut; John P Rice
Journal:  Hum Hered       Date:  2011-07-20       Impact factor: 0.444

5.  An investigation of the age-dependency of chromosome abnormalities in human populations exposed to low-dose ionising radiation.

Authors:  I Vorobtsova; A Semenov; N Timofeyeva; A Kanayeva; I Zvereva
Journal:  Mech Ageing Dev       Date:  2001-09-15       Impact factor: 5.432

6.  Incidence and prevalence of myelodysplastic syndromes: data from the Düsseldorf MDS-registry.

Authors:  Judith Neukirchen; Wilma M Schoonen; Corinna Strupp; Norbert Gattermann; Carlo Aul; Rainer Haas; Ulrich Germing
Journal:  Leuk Res       Date:  2011-06-25       Impact factor: 3.156

Review 7.  Mechanisms of aneuploidy induction in human oogenesis and early embryogenesis.

Authors:  J D A Delhanty
Journal:  Cytogenet Genome Res       Date:  2005       Impact factor: 1.636

8.  Incidence of the myelodysplastic syndromes using a novel claims-based algorithm: high number of uncaptured cases by cancer registries.

Authors:  Christopher R Cogle; Benjamin M Craig; Dana E Rollison; Alan F List
Journal:  Blood       Date:  2011-04-29       Impact factor: 22.113

9.  Genome-wide SNP assay reveals structural genomic variation, extended homozygosity and cell-line induced alterations in normal individuals.

Authors:  Javier Simon-Sanchez; Sonja Scholz; Hon-Chung Fung; Mar Matarin; Dena Hernandez; J Raphael Gibbs; Angela Britton; Fabienne Wavrant de Vrieze; Elizabeth Peckham; Katrina Gwinn-Hardy; Anthony Crawley; Judith C Keen; Josefina Nash; Digamber Borgaonkar; John Hardy; Andrew Singleton
Journal:  Hum Mol Genet       Date:  2006-11-20       Impact factor: 6.150

10.  Detectable clonal mosaicism and its relationship to aging and cancer.

Authors:  Kevin B Jacobs; Meredith Yeager; Weiyin Zhou; Sholom Wacholder; Zhaoming Wang; Benjamin Rodriguez-Santiago; Amy Hutchinson; Xiang Deng; Chenwei Liu; Marie-Josephe Horner; Michael Cullen; Caroline G Epstein; Laurie Burdett; Michael C Dean; Nilanjan Chatterjee; Joshua Sampson; Charles C Chung; Joseph Kovaks; Susan M Gapstur; Victoria L Stevens; Lauren T Teras; Mia M Gaudet; Demetrius Albanes; Stephanie J Weinstein; Jarmo Virtamo; Philip R Taylor; Neal D Freedman; Christian C Abnet; Alisa M Goldstein; Nan Hu; Kai Yu; Jian-Min Yuan; Linda Liao; Ti Ding; You-Lin Qiao; Yu-Tang Gao; Woon-Puay Koh; Yong-Bing Xiang; Ze-Zhong Tang; Jin-Hu Fan; Melinda C Aldrich; Christopher Amos; William J Blot; Cathryn H Bock; Elizabeth M Gillanders; Curtis C Harris; Christopher A Haiman; Brian E Henderson; Laurence N Kolonel; Loic Le Marchand; Lorna H McNeill; Benjamin A Rybicki; Ann G Schwartz; Lisa B Signorello; Margaret R Spitz; John K Wiencke; Margaret Wrensch; Xifeng Wu; Krista A Zanetti; Regina G Ziegler; Jonine D Figueroa; Montserrat Garcia-Closas; Nuria Malats; Gaelle Marenne; Ludmila Prokunina-Olsson; Dalsu Baris; Molly Schwenn; Alison Johnson; Maria Teresa Landi; Lynn Goldin; Dario Consonni; Pier Alberto Bertazzi; Melissa Rotunno; Preetha Rajaraman; Ulrika Andersson; Laura E Beane Freeman; Christine D Berg; Julie E Buring; Mary A Butler; Tania Carreon; Maria Feychting; Anders Ahlbom; J Michael Gaziano; Graham G Giles; Goran Hallmans; Susan E Hankinson; Patricia Hartge; Roger Henriksson; Peter D Inskip; Christoffer Johansen; Annelie Landgren; Roberta McKean-Cowdin; Dominique S Michaud; Beatrice S Melin; Ulrike Peters; Avima M Ruder; Howard D Sesso; Gianluca Severi; Xiao-Ou Shu; Kala Visvanathan; Emily White; Alicja Wolk; Anne Zeleniuch-Jacquotte; Wei Zheng; Debra T Silverman; Manolis Kogevinas; Juan R Gonzalez; Olaya Villa; Donghui Li; Eric J Duell; Harvey A Risch; Sara H Olson; Charles Kooperberg; Brian M Wolpin; Li Jiao; Manal Hassan; William Wheeler; Alan A Arslan; H Bas Bueno-de-Mesquita; Charles S Fuchs; Steven Gallinger; Myron D Gross; Elizabeth A Holly; Alison P Klein; Andrea LaCroix; Margaret T Mandelson; Gloria Petersen; Marie-Christine Boutron-Ruault; Paige M Bracci; Federico Canzian; Kenneth Chang; Michelle Cotterchio; Edward L Giovannucci; Michael Goggins; Judith A Hoffman Bolton; Mazda Jenab; Kay-Tee Khaw; Vittorio Krogh; Robert C Kurtz; Robert R McWilliams; Julie B Mendelsohn; Kari G Rabe; Elio Riboli; Anne Tjønneland; Geoffrey S Tobias; Dimitrios Trichopoulos; Joanne W Elena; Herbert Yu; Laufey Amundadottir; Rachael Z Stolzenberg-Solomon; Peter Kraft; Fredrick Schumacher; Daniel Stram; Sharon A Savage; Lisa Mirabello; Irene L Andrulis; Jay S Wunder; Ana Patiño García; Luis Sierrasesúmaga; Donald A Barkauskas; Richard G Gorlick; Mark Purdue; Wong-Ho Chow; Lee E Moore; Kendra L Schwartz; Faith G Davis; Ann W Hsing; Sonja I Berndt; Amanda Black; Nicolas Wentzensen; Louise A Brinton; Jolanta Lissowska; Beata Peplonska; Katherine A McGlynn; Michael B Cook; Barry I Graubard; Christian P Kratz; Mark H Greene; Ralph L Erickson; David J Hunter; Gilles Thomas; Robert N Hoover; Francisco X Real; Joseph F Fraumeni; Neil E Caporaso; Margaret Tucker; Nathaniel Rothman; Luis A Pérez-Jurado; Stephen J Chanock
Journal:  Nat Genet       Date:  2012-05-06       Impact factor: 38.330

View more
  274 in total

1.  Exploring the variation within.

Authors:  Evan Z Macosko; Steven A McCarroll
Journal:  Nat Genet       Date:  2012-05-29       Impact factor: 38.330

2.  Genetic Diversity and Association Studies in US Hispanic/Latino Populations: Applications in the Hispanic Community Health Study/Study of Latinos.

Authors:  Matthew P Conomos; Cecelia A Laurie; Adrienne M Stilp; Stephanie M Gogarten; Caitlin P McHugh; Sarah C Nelson; Tamar Sofer; Lindsay Fernández-Rhodes; Anne E Justice; Mariaelisa Graff; Kristin L Young; Amanda A Seyerle; Christy L Avery; Kent D Taylor; Jerome I Rotter; Gregory A Talavera; Martha L Daviglus; Sylvia Wassertheil-Smoller; Neil Schneiderman; Gerardo Heiss; Robert C Kaplan; Nora Franceschini; Alex P Reiner; John R Shaffer; R Graham Barr; Kathleen F Kerr; Sharon R Browning; Brian L Browning; Bruce S Weir; M Larissa Avilés-Santa; George J Papanicolaou; Thomas Lumley; Adam A Szpiro; Kari E North; Ken Rice; Timothy A Thornton; Cathy C Laurie
Journal:  Am J Hum Genet       Date:  2016-01-07       Impact factor: 11.025

Review 3.  Somatic mosaicism: on the road to cancer.

Authors:  Luis C Fernández; Miguel Torres; Francisco X Real
Journal:  Nat Rev Cancer       Date:  2015-12-18       Impact factor: 60.716

4.  Genome-wide Association Study of Platelet Count Identifies Ancestry-Specific Loci in Hispanic/Latino Americans.

Authors:  Ursula M Schick; Deepti Jain; Chani J Hodonsky; Jean V Morrison; James P Davis; Lisa Brown; Tamar Sofer; Matthew P Conomos; Claudia Schurmann; Caitlin P McHugh; Sarah C Nelson; Swarooparani Vadlamudi; Adrienne Stilp; Anna Plantinga; Leslie Baier; Stephanie A Bien; Stephanie M Gogarten; Cecelia A Laurie; Kent D Taylor; Yongmei Liu; Paul L Auer; Nora Franceschini; Adam Szpiro; Ken Rice; Kathleen F Kerr; Jerome I Rotter; Robert L Hanson; George Papanicolaou; Stephen S Rich; Ruth J F Loos; Brian L Browning; Sharon R Browning; Bruce S Weir; Cathy C Laurie; Karen L Mohlke; Kari E North; Timothy A Thornton; Alex P Reiner
Journal:  Am J Hum Genet       Date:  2016-01-21       Impact factor: 11.025

5.  Different mutational rates and mechanisms in human cells at pregastrulation and neurogenesis.

Authors:  Taejeong Bae; Livia Tomasini; Jessica Mariani; Bo Zhou; Tanmoy Roychowdhury; Daniel Franjic; Mihovil Pletikos; Reenal Pattni; Bo-Juen Chen; Elisa Venturini; Bridget Riley-Gillis; Nenad Sestan; Alexander E Urban; Alexej Abyzov; Flora M Vaccarino
Journal:  Science       Date:  2017-12-07       Impact factor: 47.728

Review 6.  The role of replicates for error mitigation in next-generation sequencing.

Authors:  Kimberly Robasky; Nathan E Lewis; George M Church
Journal:  Nat Rev Genet       Date:  2013-12-10       Impact factor: 53.242

7.  Confidence intervals for heritability via Haseman-Elston regression.

Authors:  Tamar Sofer
Journal:  Stat Appl Genet Mol Biol       Date:  2017-09-26

Review 8.  Clonal expansion in non-cancer tissues.

Authors:  Nobuyuki Kakiuchi; Seishi Ogawa
Journal:  Nat Rev Cancer       Date:  2021-02-24       Impact factor: 60.716

Review 9.  Clinical consequences of clonal hematopoiesis of indeterminate potential.

Authors:  David P Steensma
Journal:  Blood Adv       Date:  2018-11-27

10.  Genome-wide consequences of deleting any single gene.

Authors:  Xinchen Teng; Margaret Dayhoff-Brannigan; Wen-Chih Cheng; Catherine E Gilbert; Cierra N Sing; Nicola L Diny; Sarah J Wheelan; Maitreya J Dunham; Jef D Boeke; Fernando J Pineda; J Marie Hardwick
Journal:  Mol Cell       Date:  2013-11-07       Impact factor: 17.970

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.