
Assessment of polygenic architecture and risk prediction based on common variants across fourteen cancers.

Nilanjan Chatterjee^1,2, Montserrat Garcia-Closas³, Yan Dora Zhang^4,5, Amber N Hurson^3,6, Haoyu Zhang^3,7, Parichoy Pal Choudhury³, Douglas F Easton^8,9, Roger L Milne^10,11,12, Jacques Simard¹³, Per Hall^14,15, Kyriaki Michailidou^9,16, Joe Dennis⁹, Marjanka K Schmidt^17,18, Jenny Chang-Claude^19,20, Puya Gharahkhani²¹, David Whiteman²², Peter T Campbell²³, Michael Hoffmeister²⁴, Mark Jenkins¹¹, Ulrike Peters²⁵, Li Hsu²⁵, Stephen B Gruber²⁶, Graham Casey²⁷, Stephanie L Schmit²⁸, Tracy A O'Mara²⁹, Amanda B Spurdle²⁹, Deborah J Thompson⁹, Ian Tomlinson^30,31, Immaculata De Vivo^32,33, Maria Teresa Landi³, Matthew H Law²¹, Mark M Iles³⁴, Florence Demenais³⁵, Rajiv Kumar³⁶, Stuart MacGregor²¹, D Timothy Bishop³⁷, Sarah V Ward³⁸, Melissa L Bondy³⁹, Richard Houlston⁴⁰, John K Wiencke⁴¹, Beatrice Melin⁴², Jill Barnholtz-Sloan⁴³, Ben Kinnersley⁴⁰, Margaret R Wrensch⁴¹, Christopher I Amos⁴⁴, Rayjean J Hung⁴⁵, Paul Brennan⁴⁶, James McKay⁴⁶, Neil E Caporaso³, Sonja I Berndt³, Brenda M Birmann³², Nicola J Camp⁴⁷, Peter Kraft⁴⁸, Nathaniel Rothman³, Susan L Slager⁴⁹, Andrew Berchuck⁵⁰, Paul D P Pharoah^8,9, Thomas A Sellers²⁸, Simon A Gayther⁵¹, Celeste L Pearce^26,52, Ellen L Goode⁵³, Joellen M Schildkraut⁵⁴, Kirsten B Moysich⁵⁵, Laufey T Amundadottir⁵⁶, Eric J Jacobs²³, Alison P Klein⁵⁷, Gloria M Petersen⁵³, Harvey A Risch⁵⁸, Rachel Z Stolzenberg-Solomon³, Brian M Wolpin⁵⁹, Donghui Li⁶⁰, Rosalind A Eeles⁶¹, Christopher A Haiman²⁶, Zsofia Kote-Jarai⁶¹, Fredrick R Schumacher⁶², Ali Amin Al Olama^63,64, Mark P Purdue³, Ghislaine Scelo⁴⁶, Marlene D Dalgaard^65,66, Mark H Greene⁶⁷, Tom Grotmol⁶⁸, Peter A Kanetsky²⁸, Katherine A McGlynn³, Katherine L Nathanson⁶⁹, Clare Turnbull⁴⁰, Fredrik Wiklund¹⁴, Stephen J Chanock³.

Abstract

Genome-wide association studies (GWAS) have led to the identification of hundreds of susceptibility loci across cancers, but the impact of further studies remains uncertain. Here we analyse summary-level data from GWAS of European ancestry across fourteen cancer sites to estimate the number of common susceptibility variants (polygenicity) and underlying effect-size distribution. All cancers show a high degree of polygenicity, involving, at a minimum, thousands of loci. We project that sample sizes required to explain 80% of GWAS heritability vary from 60,000 cases for testicular to over 1,000,000 cases for lung cancer. The maximum relative risk achievable for subjects at the 99th risk percentile of underlying polygenic risk scores (PRS), compared to average risk, ranges from 12 for testicular to 2.5 for ovarian cancer. We show that PRS have potential for risk stratification for cancers of the breast, colon and prostate, but less so for others because of modest heritability and lower incidence.

Year: 2020 PMID： 32620889 PMCID： PMC7335068 DOI： 10.1038/s41467-020-16483-3

Source DB: PubMed Journal: Nat Commun ISSN： 2041-1723 Impact factor: 14.919

Introduction

Genome-wide association studies (GWASs) have led to the identification of hundreds of independent cancer susceptibility loci containing common, low-risk variants[1,2]. The number of discoveries varies widely across cancers, largely driven by available sample size, which reflects, in part, disease incidence in the general population. However, specific cancers, e.g., chronic lymphocytic leukemia (CLL)[3] and testicular cancer[4], are notable for unexpectedly high numbers of genome-wide significant discoveries from GWASs of relatively small sample size. Previous studies have also reported that these two cancers have high heritability[5]. Across cancer types, polygenic risk scores (PRSs) show varying levels of risk stratification depending on the heritability explained by the identified variants and the disease incidence rates in the population[6-12]. Their potential clinical utility would depend not only on the level of risk stratification but also on other factors such as the availability of appropriate risk-reducing interventions for those identified as at high risk. Estimates of heritability due to additive effects of all single-nucleotide polymorphisms (SNPs) included in GWAS arrays[13], referred to as GWAS heritability in this article, have shown that common variants have substantial potential to identify individuals at different levels of risk for many cancer types[14]. It remains unclear, however, how large the sample sizes of GWAS need to be to reap the full potential of PRS-based risk prediction. Herein we apply our recently published method[15] to estimate the degree of polygenicity and the effect-size distribution associated with common variants (minor allele frequency (MAF) > 0.05) across 14 different cancer types, based on summary-level association statistics from available GWASs[16-28] from populations of European ancestry (Supplementary Table 1).
From these inferred parameters, we then provide projections of the expected number of common variants to be discovered and predictive performance of associated PRS as a function of increasing sample size for future GWASs. Finally, by incorporating age-specific incidence[29] from population-based cancer registries, we explore the magnitude of absolute risk stratification potentially achievable by PRS.

Results

Cancer polygenicity

We found that cancers are highly polygenic, like other complex traits[15,30,31]. Estimates of the number of susceptibility variants with independent risk associations vary from ~1000 to 7500 across the 14 cancer sites (Table 1). For comparability, effect-size distributions are shown in groups of similarly sized GWASs with similar power for detecting associations (Fig. 1). For GWASs with <10,000 cancer cases (group 1), CLL and testicular cancer are each associated with 2000–2500 variants and characterized by a much larger proportion of variants with larger estimated effect sizes than the other group 1 cancers, as reflected by wider effect-size distributions with heavier tails (Fig. 1, Table 1). GWAS heritability estimates indicate that, in aggregate, common variants explain a high degree of variation of risk for these two cancers. In contrast, esophageal and oropharyngeal cancers, also in group 1, are associated with a larger proportion of variants with substantially smaller effect sizes than CLL and testicular cancer.

Table 1

Estimated number of independent common susceptibility variants and heritability across 14 cancer sites.

Number of cases in the analysis	Cancer site^a	Total number of susceptibility SNPs (SE)	Total heritability, in log-OR scale^b (SE)	Average heritability explained per susceptibility SNP^c (SE), in units of $10^{-4}$	Number of SNPs associated with larger variance component (SE)	% of heritability explained by SNPs with larger variance component	AUC associated with the best PRS^d (SE)
<10,000	CLL	2025 (1501)	1.62 (0.37)	7.2 (4.4)	52 (15)	41	0.82 (0.03)
<10,000	Esophageal	3641 (2515)	1.24 (0.36)	3.4 (1.9)	NA^e	NA	0.78 (0.03)
<10,000	Testicular	2598 (2088)	2.81 (0.40)	9.2 (6.6)	196 (75)	54	0.88 (0.02)
<10,000	Oropharyngeal	3623 (2060)	0.68 (0.27)	1.9 (0.5)	NA	NA	0.72 (0.04)
<10,000	Pancreas	1757 (1490)	0.60 (0.16)	3.2 (2.2)	47 (27)	31	0.71 (0.03)
10,000–25,000	Renal	2220 (1555)	0.57 (0.12)	2.4 (1.4)	46 (36)	24	0.70 (0.02)
10,000–25,000	Glioma	2364 (1593)	0.87 (0.11)	2.2 (1.2)	61 (25)	55	0.75 (0.01)
10,000–25,000	Melanoma	1098 (533)	0.65 (0.09)	4.4 (1.6)	106 (58)	52	0.72 (0.01)
10,000–25,000	Colorectal	1484 (696)	0.43 (0.10)	2.9 (0.8)	14 (11)	7	0.68 (0.02)
10,000–25,000	Endometrial	1052 (772)	0.27 (0.07)	2.5 (1.3)	46 (34)	26	0.64 (0.02)
10,000–25,000	Ovarian	1015 (715)	0.24 (0.06)	2.2 (1.1)	49 (31)	36	0.64 (0.02)
>25,000	Lung	6096 (2750)	0.39 (0.06)	0.6 (0.2)	15 (7)	15	0.67 (0.01)
>25,000	Prostate	4530 (1052)	0.77 (0.04)	1.1 (0.2)	276 (99)	51	0.73 (0.01)
>25,000	Breast	7599 (1615)	0.60 (0.03)	0.6 (0.1)	587 (133)	56	0.71 (0.00)

SNP single-nucleotide polymorphism, SE standard errors, CLL chronic lymphocytic leukemia.

^a All results are reported using the best fitted (two- or three-component) normal mixture model for effect-size distributions, with respect to a reference panel of 1.07 million common SNPs included in the HapMap3 panel after removal of the MHC region.

^b Total heritability is characterized by the population variance of the underlying true PRS as $h^2 = M\pi_c E(\beta^2)$, where $M\pi_c$ is the number of non-null (susceptibility) SNPs and $\beta$ denotes the per-SNP effect size of the non-null SNPs in the log-odds-ratio scale.

^c Average heritability explained per susceptibility SNP excludes SNPs with extremely large effects (see “Methods”).

^d Area under the curve (AUC) associated with the best PRS is calculated using the formula $\mathrm{AUC} = \Phi\left(\sqrt{h^2/2}\right)$, where $h^2$ is the total heritability and $\Phi$ is the cumulative distribution function of the standard normal distribution.

^e NA indicates that a two-component model is favored over a three-component model.

Fig. 1

Estimated effect-size distributions for susceptibility SNPs across 14 cancer sites.

Effect-size distribution of susceptibility SNPs is modeled using a two-component normal mixture model for all sites, except esophageal and oropharyngeal cancers. For these sites, effect sizes are modeled using a single normal distribution that provided a fit similar to that of the two-component normal mixture model (see Supplementary Figs. 1 and 2). SNPs with extremely large effects are excluded for effect-size distribution estimation (see “Methods”). Plots are stratified by sample size of the GWAS for comparability. Distributions with fatter tails imply that the underlying traits have a relatively greater number of susceptibility SNPs with larger effects. Note that the effect-size distribution is plotted on the log scale of odds ratio (x-axis). CLL chronic lymphocytic leukemia.

For GWASs with 10,000–25,000 cases (group 2), melanoma is noteworthy because it is associated with a wider effect-size distribution than other group 2 cancers. The estimated number of susceptibility variants in this group ranges from 1000 to 2000. GWAS heritability estimates indicate that aggregated common variants make a relatively small contribution to ovarian and endometrial cancer susceptibility. Finally, for the three GWASs with >25,000 cases each (group 3), prostate cancer is remarkable for having more variants with large effect sizes, that is, its underlying effect-size distribution has a heavier tail than those of cancers of the breast and lung (Fig. 1). In this group, all three cancer types tend to have large numbers of associated variants (>4500) compared with cancer sites in other groups, but this pattern could partially be due to the very large sample sizes of group 3 GWASs[15].
For a large majority of the 14 cancer sites, a two-component normal-mixture model for non-null effects provides a substantially better fit to observed summary statistics than a single normal distribution; this indicates the presence of a fraction of variants with distinctly larger effect sizes than the remainder (Supplementary Figs. 1 and 2). In contrast, a single normal distribution appears to be adequate for esophageal and oropharyngeal cancer, indicating the presence of a large number of variants with a continuum of small effects, similar to our previous findings for traits related to mental health and abilities[15]. Across all 14 cancers, the predicted number of discoveries and their associated genetic variance explained for current GWAS sample sizes match well those observed empirically (Supplementary Table 2), indicating good fit of our model to the observed data.

Future GWAS projections

GWAS heritability estimates indicate that the potential of PRS for risk discrimination in the population varies widely among cancer types (Table 1). The area under the curve (AUC) statistic associated with the best achievable PRS varies from 64% (endometrial and ovarian cancer) to 88% (testicular cancer) and is in the range of 70–80% for most cancers. The percentage of GWAS heritability explained by known variants varies widely, depending on study sample size and the underlying trait genetic architecture (Fig. 2). Known variants explain more than a quarter of heritability for cancer sites based on very large sample sizes (e.g., breast and prostate cancer) or for cancer sites that have susceptibility variants with relatively large effect sizes (e.g., CLL, melanoma, and testicular cancer). Oropharyngeal cancer, in contrast, has both a small sample size and small effect sizes; its percentage heritability currently explained is almost zero.

Fig. 2

Projections of percentage of GWAS heritability explained by SNPs as sample size for GWAS increases.

Results are shown for projections including SNPs at the optimized p value threshold (solid curve) and at the genome-wide significance level (p < $5 \times 10^{-8}$; dashed curve). Colored dots correspond to the sample size of the largest published GWAS and to doubled and quadrupled sizes. For oropharyngeal cancer, the projections at the “current sample size” are based on a sample size of 25K cases and 25K controls. For breast and esophageal cancer, the projections at the “current sample size” are based on the current largest GWAS sample sizes: 123K cases and 106K controls and 10K cases and 17K controls, respectively. For all other cancer sites, the projections at the “current sample size” are based on the GWAS sample sizes in Supplementary Table 1. CLL chronic lymphocytic leukemia.

The sample size needed to identify common variants that could explain approximately 80% of the total GWAS heritability for the cancers evaluated is generally very large, requiring 200,000–1,000,000 cancer cases, with a comparable number of controls (Fig. 2). However, for three sites, namely, testicular cancer, CLL, and melanoma, the required sample size is smaller (60,000, 80,000, and 110,000 cases, respectively) due to the large effect sizes of their associated variants. By quadrupling the sample sizes of currently published GWASs, the percentage of GWAS heritability explained would rise to >40% across all cancers, except for oropharyngeal cancer. Such sample size increases would also lead to appreciable improvements in PRS discriminatory power across all these sites (Figs. 3 and 4). For cancers that were found to be the most polygenic and that had small effect sizes (e.g., cancers of breast, lung, and oropharynx), improvement would occur at a slower rate as sample sizes increase, and these sites would require the largest sample sizes to generate PRSs with discriminatory power close to theoretical limits.
Of note, for a number of cancers, the achievable relative risks for subjects at the 99th percentile of the PRS distribution, compared with those at average risk, are comparable to those for monogenic disorders[32] (e.g., relative risk >3–4-fold) (Fig. 4). Across all 14 cancer types, inclusion of SNPs using more liberal but optimized p value thresholds (see “Methods”) would improve performance of PRS-based risk prediction versus using the stringent genome-wide significance level, but the anticipated gains would be generally modest (Supplementary Figs. 3 and 4).

Fig. 3

Projections of area under the curve (AUC) characterizing predictive performance of PRS as sample size for GWAS increases.

Results are shown for PRS including SNPs at the optimized p value threshold. The dotted horizontal red line indicates the maximum AUC achievable according to the estimate of GWAS heritability. Colored dots correspond to the sample size of the largest published GWAS and to doubled and quadrupled sizes. For oropharyngeal cancer, the projections at the “current sample size” are based on a sample size of 25K cases and 25K controls. For breast and esophageal cancer, the projections at the “current sample size” are based on the current largest GWAS sample sizes: 123K cases and 106K controls and 10K cases and 17K controls, respectively. For all other cancer sites, the projections at the “current sample size” are based on the GWAS sample sizes in Supplementary Table 1. CLL chronic lymphocytic leukemia.

Fig. 4

Projections of relative risks for individuals at or higher than 99th percentile of PRS as sample size for GWAS increases.

Results are shown where the PRS is built from SNPs at the optimized p value threshold. The dotted horizontal red line indicates the maximum relative risk achievable according to the estimate of GWAS heritability. Colored dots correspond to the sample size of the largest published GWAS and to doubled and quadrupled sizes. The y-axis is presented in log10 scale. For oropharyngeal cancer, the projections at the “current sample size” are based on a sample size of 25K cases and 25K controls. For breast and esophageal cancer, the projections at the “current sample size” are based on the current largest GWAS sample sizes: 123K cases and 106K controls and 10K cases and 17K controls, respectively. For all other cancer sites, the projections at the “current sample size” are based on the GWAS sample sizes in Supplementary Table 1. CLL chronic lymphocytic leukemia.

Projections of residual lifetime cancer risks for the US non-Hispanic white population show that the discriminatory power of PRS built from current or foreseeable studies will depend heavily on the underlying cancer incidence in the population (Fig. 5, Supplementary Figs. 5–7). The potential clinical utility of PRS depends on the degree of risk stratification and on specific prevention or early detection strategies for a given cancer, should they exist. For common cancers, such as breast, colorectal, and prostate, a PRS with even modest discriminatory power (maximum AUC of approximately 70%, Fig. 3) can provide substantial stratification of absolute risk in the population. In contrast, for CLL and testicular cancer, even though their PRSs could achieve a higher AUC (e.g., in the range 80–90%, Fig. 3), the degree of absolute risk stratification will be modest because of the infrequency of these cancers. Thus a PRS by itself has the least impact on risk stratification for cancer sites that are infrequent and/or have low heritability.
However, it is possible that PRS could have clinical utility for some of these cancers in the presence of, or in combination with, other risk factors and biomarkers. For example, a PRS for lung cancer may provide larger stratification of absolute risk among smokers than among never smokers because of the higher baseline risk in smokers.

Fig. 5

Projected distribution of average residual lifetime risk in the US population of non-Hispanic whites aged 30–75 years.

The risk is obtained according to variation of polygenic risk scores. The projections are shown for PRS built based on GWAS with current, doubled and quadrupled sample sizes and the best PRS that corresponds to limits defined by heritability. The projections are obtained by combining information on projected population variance of PRS, age-specific population incidence rate, competing risk of mortality and current distribution of age according to US 2016 census. For oropharyngeal cancer, the projections at the “current sample size” are based on a sample size of 25K cases and 25K controls. For breast and esophageal cancer, the projections at the “current sample size” are based on the current largest GWAS sample sizes: 123K cases and 106K controls and 10K cases and 17K controls, respectively. For all other cancer sites, the projections at the “current sample size” are based on the GWAS sample sizes in Supplementary Table 1. CLL chronic lymphocytic leukemia.

Discussion

Our study is subject to several limitations. We may have underestimated the number of underlying common susceptibility loci, especially for those cancers for which current GWAS have small sample sizes[15]. Thus the interpretation of comparisons of the underlying genetic architecture across cancer types with very different sample sizes requires caution. Nevertheless, the major patterns are unlikely to be due to differences in sample size. For example, we estimated oropharyngeal and esophageal cancers to be two of the most polygenic sites, though the GWAS sample sizes for these two sites were relatively small. Further, Q–Q plots of observed and expected p values indicate that the inferred models for effect-size distributions explain observed GWAS summary statistics well, regardless of GWAS sample size. Another important limitation is that we only included data from subjects of European ancestry, since GWAS data for other ancestries are currently too small to permit reliable projections for most cancer sites. In addition, several cancers (e.g., lung, ovary, glioma, and breast) consist of etiologically heterogeneous subtypes that were not considered in our analyses due to lack of adequate sample sizes for appropriate subtypes for most of these cancer sites. Further studies of ancestry- and subtype-specific genetic architectures are needed to address these limitations. In our projections, we assume standard agnostic association analysis of SNPs without incorporating any external information on population genetics or functional characteristics of SNPs. It is, however, possible to incorporate various types of external information to improve power for discovery of associations[33-36] and genetic risk prediction[37]. We have evaluated the merit of future GWAS only in terms of their ability to explain heritability and improve risk prediction. 
However, current and future discoveries have other major implications, including providing insights into biological pathways and mechanisms, potential gene–environment interactions, and understanding causal relationships through Mendelian randomization analyses[38]. A number of these cancers are known to have rare high-penetrance risk variants, but for this study we have focused on estimating the effect-size distribution associated with common variants. Furthermore, heritability analyses indicate that uncommon and rare variants could explain a substantial fraction of the variation of complex traits[39], and thus it is likely that there are many unknown uncommon and rare variants associated with these cancers as well. In the future, characterization of heritability and effect-size distribution associated with the full spectrum of allele frequencies will require individual-level sequencing data on a substantially larger number of cases and controls. The observed differences in the underlying genetic architecture of susceptibility across cancers could be due to various factors, including the effect of negative selection[30,40], tissue-specific genetic regulation of gene expression[41], cell of origin[42], the number of biological steps needed to transition from normal to malignant tissue[43], mediation of genetic effects by underlying environmental exposures[44], and the presence of heterogeneous cancer-specific subtypes[21,25,27,28]. A number of cancer types, including those of lung, oropharynx, and esophagus, which were associated with large numbers of SNPs with small average effect sizes, have known strong environmental risk factors and distinct etiologic subtypes. It is also noteworthy that testicular cancer stands out for a large number of discoveries in cross-tissue expression quantitative trait loci analyses, likely indicating a stronger association of SNPs with gene expression levels for this tissue compared to others[41].
In conclusion, our comprehensive analysis of 14 cancer sites in adults of European ancestry reveals that, while all sites have polygenic influences, there is substantial diversity observed in their underlying genetic architectures, which reflects important biology and also influences the utility of polygenic risk prediction for individual cancers. Our projections for future yields of GWAS across these cancers provide a roadmap for important returns from future investment in research, including the potential clinical utility of polygenic risk prediction for stratification of absolute risks in the population.

Methods

Description of GWAS studies

We analyzed summary data from GWASs across 14 cancer types. For select cancer sites[26,28], we downloaded publicly available genome-wide summary-level statistics from the latest consortium-based analyses. For others, we obtained access to data through collaborative efforts with individual consortia. Details about individual studies, including the number of cases and controls, are provided in Supplementary Table 1.

Linkage disequilibrium (LD) reference panel selection

We considered a reference panel of ~1.07 million SNPs that are included in HapMap3 and that had MAF > 0.05 in the 1000 Genomes European ancestry sample. Based on known LD among common variants, we expect this set of variants to provide high coverage of all common variants in populations of European ancestry, and thus the loss of information due to imperfect tagging of causal variants to be fairly minimal.

Quality control for summary GWAS data

Across all cancers, we applied several filtering steps analogous to those used earlier for estimation of heritability[45,46] and effect-size distribution using summary-level data[15]. First, we restricted analysis to SNPs within the reference set of ~1.07 million SNPs that are included in HapMap3 and that had MAF > 0.05 in the 1000 Genomes European ancestry sample. Second, we excluded SNPs having substantial amounts of missing genotype data: sample sizes <0.67 times the 90th percentile of the distribution of sample sizes across all SNPs. Third, we excluded SNPs within the major histocompatibility complex region (i.e., SNPs between 26,000,000 and 34,000,000 base pairs on chromosome 6), which is known to have very complex allelic architecture and can have uncharacteristically large effects on some traits. Fourth, we removed regions that have SNPs with extremely large effect sizes to reduce their possible undue influence on estimation of parameters associated with overall effect-size distributions. Using PLINK --clump, we identified all top SNPs that have associated chi-square statistics >80 (i.e., odds ratio (in standardized scale) >2.19) and removed all SNPs that were within 1 Mb of, or had an estimated squared LD >0.1 with, those top SNPs. We added back the contribution of these top independent SNPs in the final reporting of the total number of susceptibility SNPs, estimates of total heritability, and various projections we made as a function of sample size of the GWAS.
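The four filtering steps can be sketched in a few lines of Python. This is a simplified illustration, not the pipeline used in the study: the `SnpStat` record, its field names, and the `qc_filter` function are hypothetical; the LD-based exclusion (squared LD >0.1) is omitted because it requires an LD reference; and "top" SNPs here are simply all SNPs passing the chi-square cutoff rather than LD-clumped lead SNPs from PLINK.

```python
import math
from dataclasses import dataclass


@dataclass
class SnpStat:
    """Hypothetical per-SNP summary record (field names are assumptions)."""
    snp_id: str
    chrom: int
    pos: int            # base-pair position
    n: int              # sample size available for this SNP
    chi2: float         # association chi-square statistic
    in_reference: bool  # True if the SNP is in the reference panel


def percentile(values, q):
    """Nearest-rank percentile for q in (0, 100]."""
    ordered = sorted(values)
    rank = max(1, math.ceil(q / 100 * len(ordered)))
    return ordered[rank - 1]


def qc_filter(snps, window=1_000_000, chi2_max=80.0):
    """Return (SNPs kept for model fitting, top SNPs set aside for add-back)."""
    # Step 2 cutoff: 0.67 times the 90th percentile of per-SNP sample sizes.
    n_cut = 0.67 * percentile([s.n for s in snps], 90)
    kept = [
        s for s in snps
        if s.in_reference                                   # step 1
        and s.n >= n_cut                                    # step 2
        and not (s.chrom == 6
                 and 26_000_000 <= s.pos <= 34_000_000)     # step 3 (MHC)
    ]
    # Step 4: set aside very large effects and drop everything within 1 Mb.
    top = [s for s in kept if s.chi2 > chi2_max]

    def near_top(s):
        return any(s.chrom == t.chrom and abs(s.pos - t.pos) <= window
                   for t in top)

    return [s for s in kept if not near_top(s)], top
```

The second return value holds the SNPs whose contribution is added back when reporting totals.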

Statistical model

We inferred common variant genetic architecture of the different cancers using GENESIS[15], a method we recently developed to characterize underlying effect-size distributions in terms of the total number of susceptibility SNPs (polygenicity) and a normal mixture model for the distribution of their effects. Specifically, it is assumed that the standardized effects $\beta_m$ of common SNPs in an underlying logistic regression model for the risk of a cancer follow a mixture distribution of the form $\beta_m \sim \pi_c N(0, \sigma^2) + (1 - \pi_c)\delta_0$ (two-component model) or $\beta_m \sim \pi_c \{p_1 N(0, \sigma_1^2) + (1 - p_1) N(0, \sigma_2^2)\} + (1 - \pi_c)\delta_0$ (three-component model), where $\delta_0$ is the Dirac delta function indicating that a fraction, $1 - \pi_c$, of the SNPs have null effects and the remaining fraction, $\pi_c$, of SNPs have non-null effects. Under the three-component model, $p_1$ denotes the proportion of susceptibility SNPs allocated to the mixture component with the larger variance (assuming $\sigma_1^2 > \sigma_2^2$). Under these models, $M\pi_c$, where $M$ is the number of SNPs in the reference panel, characterizes the degree of polygenicity, i.e., the number of susceptibility SNPs with independent effects on disease risk. Under both models, we defined “GWAS heritability” of a disease as $h^2 = M\pi_c E(\beta^2)$, where $E(\beta^2)$ denotes the average effect-size variance of the non-null SNPs. We observed that, under the above model, $h^2$ is also the population variance of the underlying “true” PRS, defined as $\mathrm{PRS} = \sum_{m=1}^{M} \beta_m G_m$, where $G_m$ denotes the standardized genotype associated with the $m$th SNP. Under the two-component model, which assumes a single normal distribution for the effects of all susceptibility SNPs, $E(\beta^2) = \sigma^2$. Under the three-component model, which allows a mixture of two normal distributions with distinct variance components and thus can better accommodate the presence of a group of susceptibility SNPs with much larger effects than others, we have $E(\beta^2) = p_1\sigma_1^2 + (1 - p_1)\sigma_2^2$. Under the three-component model, we use the fraction $p_1\sigma_1^2 / \{p_1\sigma_1^2 + (1 - p_1)\sigma_2^2\}$ to characterize the proportion of heritability explained by SNPs associated with the larger variance component parameter.
As we removed SNPs with extremely large effects ($\chi^2 > 80$) and the associated regions from the analysis, in reporting the final heritability estimates, we added back the contribution of the independent top SNPs from these excluded regions as $\sum_k (\hat{\beta}_k^2 - \tau_k^2)$, where $\hat{\beta}_k$ is the estimate of the log odds ratio (in standardized scale) and $\tau_k$ is the corresponding standard error for the $k$th SNP.
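As a worked illustration of these definitions, the sketch below computes GWAS heritability as (number of susceptibility SNPs) × (average effect-size variance), the share of heritability attributable to the larger variance component, and a bias-corrected add-back for excluded top SNPs. The function names and all numeric inputs are hypothetical, and the add-back form (squared estimate minus squared standard error, summed over top SNPs) is an assumption about how the contribution is computed.

```python
def mean_effect_variance(sigma1_sq, sigma2_sq=None, p1=1.0):
    """Average variance of non-null effects.

    With sigma2_sq=None this is the two-component model (one normal for all
    susceptibility SNPs); otherwise it is the three-component model, where p1
    is the proportion of susceptibility SNPs in the larger-variance component.
    """
    if sigma2_sq is None:
        return sigma1_sq
    return p1 * sigma1_sq + (1.0 - p1) * sigma2_sq


def gwas_heritability(n_reference_snps, pi_c, sigma1_sq, sigma2_sq=None, p1=1.0):
    """Heritability = number of susceptibility SNPs x average effect variance."""
    n_susceptibility = n_reference_snps * pi_c
    return n_susceptibility * mean_effect_variance(sigma1_sq, sigma2_sq, p1)


def share_from_larger_component(p1, sigma1_sq, sigma2_sq):
    """Fraction of heritability explained by the larger-variance component."""
    large = p1 * sigma1_sq
    return large / (large + (1.0 - p1) * sigma2_sq)


def top_snp_add_back(estimates):
    """Assumed add-back: sum of (beta_hat^2 - se^2) over excluded top SNPs."""
    return sum(beta_hat ** 2 - se ** 2 for beta_hat, se in estimates)


# Hypothetical inputs, chosen only to exercise the formulas.
h2 = gwas_heritability(1_070_000, 0.002, sigma1_sq=1e-3, sigma2_sq=1e-4, p1=0.1)
share = share_from_larger_component(0.1, 1e-3, 1e-4)
```

With these illustrative inputs, about 2140 SNPs are non-null, and roughly half of the heritability comes from the 10% of susceptibility SNPs in the larger-variance component, which mirrors the pattern reported for several cancers in Table 1.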

Genetic variance projection

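Given the estimated effect-size distribution, we calculated expected discoveries and genetic variance explained using $M\pi_c \int \mathrm{pow}_\alpha(\beta) f(\beta)\, d\beta$ and $M\pi_c \int \beta^2\, \mathrm{pow}_\alpha(\beta) f(\beta)\, d\beta$, respectively, at $\alpha = 5 \times 10^{-8}$ for a GWAS of sample size $n$, where $M\pi_c$ is the number of susceptibility SNPs, $f$ is the estimated density of their effects, and $\mathrm{pow}_\alpha(\beta) = 1 - \Phi(z_{\alpha/2} - \sqrt{n}\beta) + \Phi(-z_{\alpha/2} - \sqrt{n}\beta)$, with $\Phi$ the standard normal cumulative distribution function and $z_{\alpha/2}$ the upper $\alpha/2$ quantile of the standard normal distribution. Similar to the heritability calculations, we added back the contributions of independent top SNPs with very large effects to the number of expected discoveries and associated variances explained through the quantities $\sum_k \mathrm{pow}_\alpha(\hat{\beta}_k)$ and $\sum_k (\hat{\beta}_k^2 - \tau_k^2)\, \mathrm{pow}_\alpha(\hat{\beta}_k)$, where $\hat{\beta}_k$ and $\tau_k$ are the estimated standardized log odds ratio and its standard error for the $k$th top SNP. We observed that, for projections involving sample sizes larger than that of the current study, $\mathrm{pow}_\alpha(\hat{\beta}_k)$ for the large-effect SNPs will all be very close to 1.0.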
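The projection can be sketched numerically as follows. This is a minimal illustration under stated assumptions, not the GENESIS implementation: it assumes a single normal distribution for non-null effects (the two-component model), a two-sided test whose z statistic has mean √n·β on the standardized scale, and simple midpoint integration. The function names `power` and `project`, and all parameter values, are hypothetical.

```python
from math import sqrt
from statistics import NormalDist

STD_NORMAL = NormalDist()


def power(beta, n, alpha=5e-8):
    """Two-sided power to detect standardized effect beta at level alpha."""
    z = STD_NORMAL.inv_cdf(1 - alpha / 2)
    shift = sqrt(n) * beta
    return 1 - STD_NORMAL.cdf(z - shift) + STD_NORMAL.cdf(-z - shift)


def project(n, n_susceptibility, sigma_sq, alpha=5e-8, steps=4000):
    """Expected discoveries and variance explained for a GWAS of size n.

    Integrates power(beta) and beta^2 * power(beta) against a N(0, sigma_sq)
    effect-size density using the midpoint rule over +/- 8 standard deviations.
    """
    sigma = sqrt(sigma_sq)
    effects = NormalDist(0.0, sigma)
    lower = -8 * sigma
    width = 16 * sigma / steps
    discoveries = 0.0
    variance = 0.0
    for i in range(steps):
        beta = lower + (i + 0.5) * width
        weight = power(beta, n, alpha) * effects.pdf(beta) * width
        discoveries += weight
        variance += beta * beta * weight
    return n_susceptibility * discoveries, n_susceptibility * variance


# Hypothetical architecture: 2000 susceptibility SNPs, effect variance 2e-4.
d_small, v_small = project(50_000, 2000, 2e-4)
d_large, v_large = project(200_000, 2000, 2e-4)
```

Dividing the projected variance by the total heritability (here 2000 × 2e-4 = 0.4) gives the percentage of GWAS heritability explained at a given sample size, which is the quantity plotted against sample size in Fig. 2.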

Projection for AUC and relative risk at top 1%

As we quantify heritability in terms of the variability of the underlying “true” PRS, we used the formula[12,47,48] $\mathrm{AUC} = \Phi\left(\sqrt{h^2/2}\right)$, where $h^2$ is the GWAS heritability and $\Phi$ is the standard normal cumulative distribution function, to characterize the best discriminatory power achievable in the limit using a common-variant PRS. We used the same formula to calculate the AUC associated with PRSs that could be built using SNPs either reaching genome-wide significance (p value < $5 \times 10^{-8}$) or a weaker but optimized threshold for a GWAS of given sample size, based on the projected variance of the respective PRS. Given the sample size of a GWAS and an effect-size distribution for the underlying cancer, an optimal threshold for SNP selection that will maximize the expected predictive performance of the PRS is calculated using an analytic formula we have derived earlier[48]. The relative risk for those estimated to be at the 99th percentile or higher of the distribution of a PRS (compared to the average risk of the population) was calculated from the population variance of the PRS using a previously published formula[12].
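The AUC calculation is easy to check against Table 1. The short sketch below evaluates the standard normal cumulative distribution function at the square root of half the PRS variance; the function name is hypothetical, and the heritability inputs are the point estimates reported in Table 1.

```python
from math import sqrt
from statistics import NormalDist


def auc_from_prs_variance(prs_variance):
    """AUC of a PRS whose population variance (log-OR scale) is prs_variance."""
    return NormalDist().cdf(sqrt(prs_variance / 2))


# Total heritability estimates (log-OR scale) taken from Table 1.
table1_heritability = {
    "Testicular": 2.81,
    "CLL": 1.62,
    "Breast": 0.60,
    "Ovarian": 0.24,
}
best_auc = {site: auc_from_prs_variance(h2)
            for site, h2 in table1_heritability.items()}
```

Rounded to two decimals, these values agree with the "AUC associated with the best PRS" column of Table 1 (0.88, 0.82, 0.71, and 0.64, respectively). Passing a projected PRS variance instead of the total heritability gives the AUC for a GWAS of a given sample size.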

Absolute risk projection

For each cancer site, we projected the distribution of residual lifetime risk (up to age 80 years) for non-Hispanic white individuals in the general US population according to PRSs that could be built from GWASs of different sample sizes. For any given age, we first obtained the distribution of residual lifetime risks based on a model for absolute risks developed using the iCARE tool that we have described earlier[12,29]. The iCARE tool uses projected standard deviations of the PRS at different GWAS sample sizes and age-specific cancer incidence rates available from the US National Cancer Institute Surveillance, Epidemiology, and End Results Program (NCI-SEER) (2015) to obtain absolute risk distributions. In deriving absolute risks, we adjusted for the competing risk of mortality due to other causes using age-specific mortality rates from the Centers for Disease Control and Prevention WONDER database (2016). We then weighted the projected residual lifetime risk distributions at different baseline ages (in 5-year categories) by the US population age distribution within 30–75 years, as estimated for 2016 by the US Census. For cancers of the reproductive system, weights were based on the age distributions among males or females, as appropriate.
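The logic of an absolute risk calculation with competing mortality, followed by weighting over baseline ages, can be sketched as below. This is not the iCARE implementation or its interface. It is a simplified illustration that assumes piecewise-constant annual hazards, a fixed relative risk applied to the cancer hazard, and made-up rates rather than SEER or WONDER data. All names are hypothetical.

```python
from math import exp


def residual_risk(start_age, end_age, incidence, mortality, rel_risk=1.0):
    """Probability of developing cancer between start_age and end_age.

    incidence(age) and mortality(age) return annual hazards; mortality is the
    competing hazard of death from other causes. rel_risk multiplies the
    cancer hazard (for example, the relative risk of a PRS category).
    """
    event_free = 1.0  # probability of being alive and cancer-free so far
    risk = 0.0
    for age in range(start_age, end_age):
        h_cancer = incidence(age) * rel_risk
        h_total = h_cancer + mortality(age)
        if h_total > 0:
            # Share of this year's events that are cancer diagnoses.
            risk += event_free * (h_cancer / h_total) * (1 - exp(-h_total))
        event_free *= exp(-h_total)
    return risk


def age_weighted_risk(baseline_ages, weights, end_age, incidence, mortality,
                      rel_risk=1.0):
    """Average residual risk over baseline ages, weighted by population share."""
    total = sum(weights)
    return sum(
        w * residual_risk(a, end_age, incidence, mortality, rel_risk)
        for a, w in zip(baseline_ages, weights)
    ) / total


# Illustrative (made-up) hazards and age weights.
flat_incidence = lambda age: 0.001
flat_mortality = lambda age: 0.01
avg = age_weighted_risk([30, 50, 70], [0.4, 0.4, 0.2], 80,
                        flat_incidence, flat_mortality, rel_risk=2.0)
```

A full model would instead calibrate the baseline hazard so that the population-average incidence matches the registry rates and would integrate over the PRS distribution. The sketch only shows how competing mortality lowers the residual risk and how baseline ages are combined.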
