Literature DB >> 28604732

Meta-analysis of five genome-wide association studies identifies multiple new loci associated with testicular germ cell tumor.

Zhaoming Wang1, Katherine A McGlynn1, Ewa Rajpert-De Meyts2, D Timothy Bishop3, Charles C Chung1, Marlene D Dalgaard2,4, Mark H Greene1, Ramneek Gupta4, Tom Grotmol5, Trine B Haugen6, Robert Karlsson7, Kevin Litchfield8, Nandita Mitra9, Kasper Nielsen4, Louise C Pyle10,11, Stephen M Schwartz12, Vésteinn Thorsson13, Saran Vardhanabhuti14, Fredrik Wiklund7, Clare Turnbull8,15, Stephen J Chanock1, Peter A Kanetsky16, Katherine L Nathanson10,17.   

Abstract

The international Testicular Cancer Consortium (TECAC) combined five published genome-wide association studies of testicular germ cell tumor (TGCT; 3,558 cases and 13,970 controls) to identify new susceptibility loci. We conducted a fixed-effects meta-analysis, including, to our knowledge, the first analysis of the X chromosome. Eight new loci mapping to 2q14.2, 3q26.2, 4q35.2, 7q36.3, 10q26.13, 15q21.3, 15q22.31, and Xq28 achieved genome-wide significance (P < 5 × 10-8). Most loci harbor biologically plausible candidate genes. We refined previously reported associations at 9p24.3 and 19p12 by identifying one and three additional independent SNPs, respectively. In aggregate, the 39 independent markers identified to date explain 37% of father-to-son familial risk, 8% of which can be attributed to the 12 new signals reported here. Our findings substantially increase the number of known TGCT susceptibility alleles, move the field closer to a comprehensive understanding of the underlying genetic architecture of TGCT, and provide further clues to the etiology of TGCT.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28604732      PMCID: PMC5490654          DOI: 10.1038/ng.3879

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


In Europe and the United States, testicular germ cell tumors (TGCT) are the most common cancers in young men aged 20 to 39 years1. The incidence of TGCT is rising, and is highest in men of Northern European and lowest in men of African ancestry1–3. Risk factors for TGCT include cryptorchidism, adult height, prior diagnosis and familial history of TGCT4–8; its heritability ranges from 37% to 49%9,10. Despite the multiple lines of evidence demonstrating a considerable genetic component of TGCT risk, linkage and candidate gene approaches to find rare, highly-penetrant susceptibility genes involved in TGCT etiology were unsuccessful11. In contrast, genome wide association studies (GWAS) of TGCT have had remarkable success in identifying susceptibility loci with strong effects. Of the 27 replicated loci, most were discovered using GWAS chip-based microarray platforms12–18, with 13 identified after replication on the iCOGs array19–21 and one identified as a candidate region22. The genes mapping at or near identified susceptibility loci have revealed several biological themes that are highly likely to be important to TGCT development, including male germ cell maturation and differentiation, KIT-MAPK signaling, DNA damage response, and chromosomal segregation. We imputed each of five published TGCT GWAS scans12,18,20,23, and combined the association test statistics for a total of 8,960,654 autosomal and 249,696 chromosome X single nucleotide polymorphisms (SNPs), after excluding those with INFO score < 0.3 or minor allele frequency (MAF) < 0.01. We conducted a fixed-effects meta-analysis for 3,558 cases and 13,970 controls (Methods and Supplementary Table 1). The genomic control factor λ = 1.037 suggests little systematic inflation from population stratification (Supplementary Fig. 1). We identified eight new TGCT susceptibility loci surpassing genome-wide significance, and an additional four novel independent loci in two previously established regions (9p24.3 and 19p12) (Table 1). Two of these loci (rs6837349 and rs12912292) showed evidence of effect measure heterogeneity (I2 > 0.50) across the five sample sets. We also determined the Bayes false discovery probability (BFDP)24 for these 12 loci using a prior probability of 0.0001 and odds ratio of 1.2 (Supplementary Table 2). Two loci, rs61408740 and rs17336718, failed to surpass a BFDP < 0.05, likely because of their low minor allele frequencies (0.023 and 0.053, respectively).
Table 1

TGCT meta-analysis association results for novel loci and new independent SNPs in established loci

CytobandGene NeighborhoodSNPPositionStudyInfoControlsCasesReference alleleEffect alleleEffect allele frequencyORCIPPhetI2
ControlCase
2q14.2TFCP2L1rs2713206122007941NCI0.921055581CT0.150.201.45(1.18-1.79)4.19E-04
UK0.984945985CT0.150.171.15(1.00-1.31)4.43E-02
PENN0.90918480CT0.160.201.32(1.06-1.64)1.25E-02
Norway/Sweden0.9366871326CT0.140.171.25(1.09-1.44)1.74E-03
Denmark0.85363183CT0.170.211.41(0.98-2.03)6.39E-02
Combined1396835551.26(1.16-1.36)1.68E-080.393.6
3q26.2GPR160rs3755605169756119NCI0.971055581CT0.410.441.14(0.98-1.33)9.31E-02
UK0.984946985CT0.390.431.21(1.09-1.33)2.25E-04
PENN0.97918480CT0.410.431.08(0.92-1.27)3.54E-01
Norway/Sweden0.9866871326CT0.400.441.24(1.12-1.37)2.08E-05
Denmark0.99363183CT0.390.431.19(0.91-1.55)1.94E-01
Combined1396935551.19(1.12-1.26)3.87E-090.670.0
4q35.2ZFP42rs6837349188921355NCI0.991055581GT0.660.630.84(0.72-0.98)2.71E-02
UK0.994945985GT0.660.620.83(0.75-0.92)3.98E-04
PENN0.69918480GT0.660.681.15(0.94-1.40)1.80E-01
Norway/Sweden0.9966871326GT0.680.630.77(0.70-0.86)1.18E-06
Denmark0.74363183GT0.660.650.96(0.70-1.32)8.00E-01
Combined1396835550.84(0.79-0.89)3.13E-080.0267.4
7q36.3NCAPG2rs11769858158501492NCI0.931055581TC0.670.650.89(0.76-1.06)1.86E-01
UK0.954945985TC0.690.650.84(0.76-0.94)1.51E-03
PENN0.88918480TC0.640.600.81(0.68-0.96)1.69E-02
Norway/Sweden0.9166871326TC0.700.660.82(0.73-0.91)3.89E-04
Denmark0.90363183TC0.680.640.82(0.62-1.08)1.59E-01
Combined1396835550.84(0.79-0.89)2.38E-080.920.0
9p24.3*DMRT1rs55873183878563NCI0.731055581AG0.060.071.34(0.93-1.93)1.14E-01
UK0.824945985AG0.060.091.90(1.52-2.38)1.54E-08
PENN0.73918480AG0.060.091.97(1.36-2.84)3.05E-04
Norway/Sweden0.8066871326AG0.070.112.08(1.70-2.54)1.21E-12
Denmark0.81363183AG0.070.111.84(1.11-3.03)1.75E-02
Combined1396835551.89(1.67-2.14)2.18E-230.368.1
10q26.13LHPPrs61408740126274612NCI0.991056582CG0.020.041.68(1.09-2.60)1.89E-02
UK0.954945985CG0.030.041.64(1.22-2.20)1.05E-03
PENN0.94919480CG0.030.041.92(1.22-3.03)4.92E-03
Norway/Sweden1.0066871326CG0.020.031.53(1.12-2.09)7.79E-03
Denmark0.96363183CG0.020.031.61(0.69-3.76)2.75E-01
Combined1397035561.65(1.38-1.96)1.75E-080.950.0
15q21.3PRTGrs1291229256038707NCI0.951055581GA0.510.551.25(1.07-1.46)4.76E-03
UK0.994946985GA0.530.551.09(0.99-1.20)9.18E-02
PENN0.95918480GA0.470.561.44(1.23-1.70)8.74E-06
Norway/Sweden0.9566871326GA0.520.581.26(1.14-1.39)3.42E-06
Denmark0.95363183GA0.510.571.30(1.00-1.69)4.81E-02
Combined1396935551.22(1.15-1.29)1.38E-110.0361.4
15q22.31MAP2K1, TIPINrs6018074766663261NCI0.991055582AC0.260.301.27(1.07-1.50)5.59E-03
UK0.994946986AC0.260.301.23(1.10-1.37)2.34E-04
PENN0.99919481AC0.270.341.37(1.15-1.63)4.53E-04
Norway/Sweden1.0066871326AC0.260.281.18(1.05-1.31)3.89E-03
Denmark1.00363183AC0.270.311.22(0.93-1.60)1.59E-01
Combined1397035581.23(1.16-1.32)1.10E-100.700.0
19p12*ZNF728rs5852126223205184NCI0.991056581GA0.1510.1060.69(0.55-0.85)6.64E-04
UK1.004946985GA0.1400.1060.74(0.64-0.86)4.83E-05
PENN0.98919481GA0.1540.1230.78(0.63-0.98)2.95E-02
Norway/Sweden0.9966871326GA0.1730.1360.74(0.65-0.84)4.90E-06
Denmark0.99363183GA0.1830.1370.71(0.51-1.00)5.10E-02
Combined1397135560.74(0.68-0.80)4.87E-140.950.0
19p12*ZNF726rs3460137624050828NCI0.841055581AT0.2160.2201.04(0.85-1.28)6.77E-01
UK0.924945985AT0.2020.2421.29(1.14-1.46)5.08E-05
PENN0.83918480AT0.1910.2291.32(1.07-1.64)1.02E-02
Norway/Sweden0.8866871326AT0.1950.2381.39(1.22-1.58)4.19E-07
Denmark0.69363183AT0.1460.1681.32(0.85-2.05)2.23E-01
Combined1396835551.29(1.20-1.39)2.40E-110.2329.2
19p12*ZNF257rs7301987622267849NCI0.931055581TG0.450.420.89(0.76-1.04)1.35E-01
UK0.964946985TG0.450.400.83(0.75-0.91)1.51E-04
PENN0.95918480TG0.510.490.91(0.78-1.07)2.59E-01
Norway/Sweden0.9566871326TG0.430.410.85(0.77-0.94)1.22E-03
Denmark0.94363183TG0.440.360.72(0.55-0.93)1.25E-02
Combined1396935550.85(0.80-0.90)2.04E-080.540.0
Xq28TKTL1rs17336718153536119NCI0.971056582CT0.050.091.33(1.07-1.64)8.85E-03
UK0.634945986CT0.050.071.46(1.19-1.80)3.71E-04
PENN0.78918480CT0.050.091.59(1.23-2.06)4.06E-04
Denmark0.84363183CT0.060.081.15(0.78-1.69)4.71E-01
Combined728222311.41(1.25-1.59)3.84E-080.510.0

New independent SNPs in established loci

Prior reports identified 27 independent SNPs, at 25 distinct regions, and the gr/gr deletion associated with TGCT susceptibility (Fig. 1). In our current study, 19 of the SNPs (at 17 loci) reached a level of genome-wide significance (Table 2). Eight previously reported susceptibility markers were not identified in our meta-analysis likely related to limited study power, staged replication of GWAS chip-based array results in published studies, and possible residual population substructure. Considering these limitations and to further place context around our findings, we calculated the BFDP for these 27 loci using a prior probability of 0.10, which assumes 10% of the previously established loci are true positives (Supplementary Table 2). This threshold is more liberal than one that would be used to identify novel loci, but still may be too conservative for the re-identification of previously identified susceptibility markers. Only one locus of all 27, rs11705932, failed to surpass a BFDP < 0.05.
Figure 1

All identified SNP markers associated with TGCT susceptibility to date

In the ideogram, red dots and red rs number annotation indicate SNPs identified and described in the current study (P ≤ 1 × 10-8); blue dots and blue rs number annotation represent previously identified SNP markers achieving genome wide significance (P ≤ 1 × 10-8) in the current study; and gray dots and gray rs number annotation are previously identified SNPs that fail to achieve genome wide significance in this study (P > 1 × 10-8).

Table 2

TGCT meta-analysis association results for previously published susceptibility loci

CytobandGene NeighborhoodSNPPositionORCIPPhetI2
1q22KIAA0446 SLC25A44rs20724991561696101.20(1.13-1.27)1.63E-090.780.0
1q24.1UCK2rs37906721658733921.27(1.20-1.35)2.15E-140.840.0
3p24.3DAZLrs10510452166250480.82(0.77-0.87)3.36E-100.760.0
3q23*TFDP2 DKFZp434G222rs117059321418188500.88(0.82-0.94)3.51E-040.900.0
3q25.31rs15102721563007240.83(0.78-0.88)7.15E-090.450.0
4q22.3*SMARCAD1 HPGDSrs17021463952248120.87(0.82-0.92)1.36E-060.0656.5
4q24CENPE CENPE variant proteinrs27204601040546860.78(0.74-0.83)9.88E-170.920.0
5p15.33TERT hTERTrs273610012865161.29(1.22-1.37)7.69E-200.410.0
5p15.33CLPTM1Lrs463596913085521.46(1.37-1.57)2.83E-270.1541.0
5q31.1*CATSPER3 PITX1 AK026965rs38056631343662000.88(0.83-0.93)9.16E-060.368.2
5q31.3SPRY4rs46248201416817881.51(1.42-1.59)2.59E-460.510.0
6p21.31BAK1 AY383626 C6orf227rs210138335425381.55(1.44-1.66)2.51E-340.660.0
7p22.3MAD1L1rs1269947719689531.21(1.14-1.28)2.24E-100.1343.7
8q13.3*PRDM14rs7010162709765050.86(0.81-0.91)1.42E-070.3313.5
9p24.3DMRT1rs7040024**8455160.67(0.62-0.71)1.21E-320.0459.3
9p24.3DMRT1rs755383**8636351.49(1.41-1.58)6.52E-410.520.0
11q14.1*GAB2rs7107174779979361.19(1.10-1.29)6.35E-060.368.7
12p13.1ATF7IP PLBD1rs2900333146538670.85(0.80-0.90)2.71E-080.2033.6
12q21.32KITLGrs3782181889535612.02(1.88-2.18)1.32E-760.900.0
16p13.13*BCAR4 CATX-11 RSL1D1rs4561483119200370.86(0.81-0.91)4.19E-070.450.0
16q12.1HEATR3 AF086132rs8046148501429441.24(1.15-1.33)3.15E-090.2132.2
16q23.1RFWD3rs4888262746704580.83(0.78-0.88)5.65E-110.0852.7
16q24.2*ZFPM1rs55637647885492641.18(1.11-1.26)1.33E-070.401.4
17q12HNF1Brs7501939361011561.26(1.19-1.34)1.27E-140.420.0
17q22TEX14rs9905704566325431.27(1.19-1.35)1.99E-140.680.0
19p12AK125686rs2195987241495451.23(1.15-1.32)1.21E-090.890.0
21q22.3*MCM3APAS MCM3APrs2839186476900681.13(1.07-1.20)2.00E-050.0267.1

Indicates sub genome-wide statistical significance.

Pairwise r2=0.38.

rs12912292 is the most statistically significant novel SNP marker in our study, (OR=1.22; P = 1.38 ×10-11), marking a 131 kb haploblock on 15q21.3 (Table 1 and Supplementary Fig. 2a). This region contains only a single gene, PRTG, a member of the immunoglobulin superfamily implicated in neurogenesis25. PRTG is highly expressed in thyroid, testes and uterus (Supplementary Fig. 3a). No variant in this region is an eQTL in either normal testes or TGCT. The SNP marker rs60180747 (OR=1.23; P = 1.10 ×10-10) marks a 261 kb haploblock on 15q22.31 that contains several genes, including TIPIN (TIMELESS-interacting protein) MAP2K1 (mitogen-activated protein kinase kinase 1), DIS3L (DIS3 like exosome 3'-5' exoribonuclease), SNAPC5 (small nuclear RNA activating complex, polypeptide 5), and LCTL (lactose-like) (Table 1 and Supplementary Fig. 2b). Several of these proteins, particularly ZWILCH and TIPIN, have high and somewhat specific expression in testes (Supplementary Fig. 3b-g). TIPIN coordinates the DNA replication checkpoint by interacting with Replication protein A26; ZWILCH is a kinetochore protein important for proper chromatid alignment during cell division27. A missense mutation in ZWILCH, p.Ser230Gly, lies within the LD block (Supplementary Table 3). The LD block also contains a single eQTL to RP11-653J6.1, which is a long non-coding (lnc) RNA highly specific to testes (Supplemental Fig. 3h). RP11-653J6.1 levels and eQTLs were not measured in TGCT Cancer Genome Atlas (TCGA). Further dissection of this signal is needed to pinpoint the candidate gene. The SNP marker rs3755605 (OR=1.19; P = 3.87 ×10-9) identifies a 213 kb haploblock containing three genes, GPR160 (G protein-coupled receptor 160), PHC3 (polyhomeotic homolog 3), and PRKCI (protein kinase C, iota form) on 3q26.2 (Table 1 and Supplementary Fig. 2c). Several SNPs across this block are eQTLs for GPR160 (Supplementary Table 3, Supplementary Fig. 4a); the risk allele is associated with increased expression in both normal testes and TGCT (Supplementary Fig. 5a and Supplementary Fig. 6a). Several SNPs in the haploblock also are eQTLs in normal testes with RP11-469J4.3, a lncRNA of unknown function. RP11-469J4.3 is highly expressed in normal testes, but lies outside the haploblock (Supplementary Fig. 3l, Supplementary Fig. 4b, and Supplementary Fig. 5b). RP11-469J4.3 expression was not measured in the TCGA. The SNP marker rs2713206 (OR = 1.26; P = 1.68 ×10-8) lies within a smaller LD region of only 48 kb on 2q14.2. The gene TFCP2L1 (transcription factor CP2-like 1) overlies the entirety of the haploblock (Table 1 and Supplementary Fig. 2d); SNPs in the region are eQTLs (Supplementary Table 3) with the risk allele associated with decreased expression of TFCP2L1 in TGCT (Supplementary Fig. 6b). TFCP2L1 is not expressed in normal adult testes (Supplementary Fig. 3m) but it is highly expressed in fetal gonocytes and in germ cell neoplasia in situ, the precursor of TGCT28,29. TFCP2L1 is upregulated in human primordial germ cells during embryogenesis at the time of epigenetic reprogramming30, but downregulated during transition from fetal gonocytes into spermatogonia29. The SNP marker rs6837349 (OR = 0.84; P = 3.13 ×10-8) localizes to an intron of ZFP42 (zinc finger protein 42) on 4q35.2, and marks a small 11 kb haploblock containing no other genes (Table 1 and Supplementary Fig. 2e). The region has no eQTLs in either normal testes or TGCT, although ZFP42 is expressed exclusively in normal testes (Supplementary Fig. 3n), specifically in human spermatogonia and TGCT29,31. Additionally, both ZFP42 and TFCP2L1 are involved in embryonal stem cell pluripotency30,32. The SNP marker rs61408740 (OR= 1.65; P = 1.75 ×10-8) localizes to an intron of LHPP (phospholysine phosphohistidine inorganic pyrophosphate phosphatase) on 10q26.13 (Table 1 and Supplementary Fig. 2f). Only two SNPs were identified with pair-wise r2>0.4, one in the region of the second gene, FAM175B (Supplementary Table 3); neither are eQTLs. LHPP encodes an inorganic diphosphatase that functions in oxidative phosphorylation33. The SNP marker rs11769858 (OR = 0.84; P = 2.38 ×10-8) identifies an 82 kb LD block on 7q36.3 that contains a large portion of NCAPG2 (non-SMC condensin II complex subunit G2) (Table 1 and Supplementary Fig. 2g). NCAPG2 encodes a regulatory subunit of the condensin II complex, which is highly expressed in testes (Supplementary Fig. 3q), and plays a role in chromosome assembly and segregation during mitosis34. We also identified a locus marked by SNP rs17336718 (OR = 1.41; P = 3.84 ×10-8) on chromosome Xq28 (Table 1 and Supplementary Fig. 2h) in an intron of TKTL1 (transketolase-like 1), which is highly expressed in normal testes. The SNP is an eQTL for TKTL1 in the TGCT TCGA data (Supplementary Figure 6c). TKTL1 converts sedoheptulose to ribose and glyceraldehyde to xylulose, linking the pentose phosphate pathway to the glycolytic pathway. Interestingly, although overexpression of TKTL1 is associated with the Warburg effect and poor prognosis in several cancer types35–38, the risk allele is associated with lower expression. At the previously-reported TGCT susceptibility locus DMRT1 on chromosome 9, we identified a third independent signal, rs55873183 (OR = 1.89; P = 2.18 ×10-23) (Table 1, Fig. 2a, and Supplementary Table 4a). This intronic SNP marker has an r2 of 0.03 and 0.06 with the two previously published SNP markers, rs7040024 and rs75538314,17, respectively; it retained genome-wide significance in conditional analysis (Table 1, Supplementary Table 4b, Supplementary Fig. 2i). We also identified three additional independent signals at 19p12: rs58521262, rs34601376 and rs73019876 (Table 1, Fig. 2b, Supplementary Tables 5a and 5b, and Supplementary Figs. 2j, 2k, 2l). We identified a SNP, rs2194275 (P = 9.23 ×10-12; OR = 0.76), in moderate LD (r2 = 0.7) with the previously published rs2195987 (P = 1.21 ×10-9; OR = 0.81), which was more statistically significant in this meta-analysis (Supplementary Table 5b). 19p12 contains a cluster of Krüppel-associated box zinc finger genes (KRAB-ZFPs)39. KRAB-ZFPs are highly and differentially expressed in germ cells, and important for the epigenetic reprogramming requisite for normal germ cell development30. A number of different GWA studies, including one for telomere length40, have identified significant SNPs in this region. The 19p12 LD blocks are large: rs58521262 marks a 219 kb block, and rs73019876 marks a 184 kb block, each containing over 200 relevant SNPs and several genes. The rs73019876 haploblock is extremely eQTL rich, with ZNF729 and ZNF676 both eQTLs in normal testes (Supplemental Figs. 4c and 4d). rs58521262 is an eQTL with ZNF728 and CTD-2291D10.2, which like ZNF729 (Supplemental Figs. 3t), are only expressed in normal testes (Supplemental Fig. 3v). Associated SNPs in the LD block also include two putative missense mutations in ZNF728 (Supplemental Table 3) not predicted to be deleterious. Given the multiple independent signals in this region, further study will be required to determine which, if any, are causally involved in TGCT.
Figure 2

Genetic association between SNP markers and TGCT risk for regions with multiple independent signals

The strength of the association signals (−log10 P-values) for individual SNPs at (a) 9p24.3 and (b) 19p12-11 are plotted on the Y-axis relative to their genomic locations (GRCh37) along the X-axis. Red diamonds are the newly identified independent SNPs, blue diamonds are previously reported SNP markers, and all other SNPs are colored gray. The line graph shows likelihood ratio statistics (right Y-axis) for recombination hotspots calculated with SequenceLDhot software using 1000 Genomes Project CEU population data. Gene annotation along the X-axis is based on NCBI RefSeq genes from the UCSC Genome Browser.

Similar to prior reports, several of the loci identified in the current study contain biologically plausible genes implicating pathways involved in male germ cell development and pluripotency (TFCP2L1, ZFP42), kinetochore function (ZWILCH), DNA damage response (TIPIN), and metabolic mitochondrial function (TKTL1 and LHPP). We identified eight such loci in novel genomic regions and four in previously identified regions. These additional loci bring the cumulative total of independent susceptibility alleles for TGCT to 40. Interestingly, racial differences in risk allele frequencies that parallel population-specific TGCT risk also continue to be apparent. Of the 40 identified susceptibility loci, the allele frequencies of all but one differ significantly across continents based on analysis of data from the AFR, AMR, ASN and EUR populations available in the 1000 Genomes project41, with most comparisons having a P-value surpassing strict Bonferroni correction (P < 0.00125) (Supplementary Table 6). The 12 newly identified susceptibility alleles account for 5.3% of the genetic risk to the brothers and 8.0% of risk to the sons of TGCT patients, increasing estimates of heritability to 25% and 37% of the risk to brothers and sons, respectively. The newly identified TGCT susceptibility markers continue to demonstrate moderate effects with ORs that range from 1.17 to 1.89. In comparison with other cancer types, we have accounted for a high proportion of site-specific heritability with fewer loci42–44.

Online Methods

Studies

Detailed characteristics and genotype quality control metrics of the study populations (Denmark, NCI [STEED, FTCS], Norway/Sweden, Penn, UK) have been previously described12,15,18,19,23. Subjects used in the current study are all of European descent, and data from each study were collected and analyzed in accordance with local ethical permissions and informed consent.

Genotype imputation

Genotype imputation was conducted by each center following a similar protocol. SNPs with a call rate < 95% or Hardy-Weinberg proportion test P-value < 0.000001 or MAF < 1% were removed prior to imputation. Imputation was conducted using IMPUTE2 software version 2.2.2 and version 3 of the 1,000 Genomes Project Phase 1 data as the reference set. First, the genomic coordinates were lifted over from NCBI human genome build 36 to build 37 using the UCSC lift over tool. Second, the strand of the inference data was aligned with the 1,000 Genomes data informed by allele state comparison or allele frequency matching for A/T and G/C SNPs. A pre-phasing strategy with SHAPEIT software version 1 was adopted to improve the imputation performance. The phased haplotypes from SHAPEIT were imported directly into the IMPUTE2 program. We applied sliding windows of 4Mb with 250kb as an overlapping buffer and generated 744 segments for imputing autosomes. For Chromosome X, the pseudoautosomal region 1 (PAR1), PAR2, and the remaining region, which was split into 37 sections, were imputed separately. We excluded imputed loci with INFO score < 0.3 or MAF < 0.01 from further association analysis. Further, we acknowledge the limitations of imputation, including that the accuracy of imputation depends on the linkage disequilibrium between markers in the reference panel and markers to be imputed, and that the quality of imputation across scans differs because of imperfect population matching of data sets to the reference panel.

Statistical analysis

Within each data set, a test for trend was performed for each SNP using SNPTEST software version 2.2 or 2.5. Fixed-effects meta-analysis was used to combine individual within-study association estimates from five imputed GWAS scans. Genetic effect heterogeneity across studies was assessed by using I2 and P-value calculated from the Cochran's Q statistic. To refine the association signals of each risk region, we first performed LD pruning using pairwise R2 > 0.3 and then conducted the conditional association analyses to estimate the independent effect of each SNP by simultaneously including all specified SNPs from the same region and with their unconditional P-values < 5 × 10-8 into the same logistic regression model.

Heritability analysis

To evaluate the familial risk explained by the new loci identified in our study, we estimated the contribution of each SNP based on the formula h2SNP = β2 x 2f (1-f), where β is the log per allele odds ratio and f is the risk allele frequency45. We calculated the proportion of familial risk explained by dividing the summed contribution of all h2SNP by the total heritability, which was derived from the log relative risk (RR), where RR = 4 for affected father and RR = 8 for affected brothers46.

In silico bioinformatics analysis

We used HaploReg v4.1 and RegulomeDB v1.1 to explore potential non-coding functional annotation within the ENCODE database in the genomic region surrounding our SNPs of interest, with particular attention to annotations in induced pluripotent stem cells (IPSC) and embryonic stem cells (ESC), as we considered these tissue types as best proxies for TGCT. Specifically, we interrogated the linkage disequilibrium (LD) block of neighboring SNPs in a haploblock defined as pair-wise r2 > 0.4 with the index SNP (Supplementary Table 4). We also searched the GTEx v6 database to determine whether the haploblock SNPs were implicated as eQTLs in their sample of 157 normal adult testis tissues with available genotype. Of note, the normal testis contains an abundance of stromal cells (i.e., Sertoli and Leydig cells), so may not be an exact surrogate for germ cells, and in particular the primordial germ cells from which TGCT is believed to develop. Finally, we assessed our 12 novel susceptibility loci for eQTLs among the 128 cases of TGCT with linked genotype data available in The Cancer Genome Atlas. For the correlation between genotype and expression data from the TGCT TCGA data set, the genotype data was downloaded from the NCI’s Genomic Data Commons (https://gdc.nci.nih.gov/). Data was converted to PLINK v1.0747 format. Subjects were screened for discordant sex, insufficient genotype call rate (>0.05), and excessive heterozygosity (> +/- 3 SD from the mean). SNPs were screened for MAF (>0.01), Hardy-Weinberg equilibrium violations (p<0.0001), and missingness (>0.01). All quality control steps were performed in PLINK. A total of five subjects were removed (all for heterozygosity violations), leaving 145 valid for analysis. 1000 Genomes Phase 341 was used as the reference set. Alignment to the reference set and haplotype estimation was performed using Shapeit v248, and additional SNPs were imputed using Impute249. Imputed SNPs with an info score <0.4 were discarded. For the 12 SNPs of interest, the risk allele was calculated as the allele with increased odds of TGCT (OR>1). For each subject, the zygosity with respect to the risk allele was calculated, and genotypes were tabulated. All available TCGA TGCT data were retreived from the TCGA Data Coordinating Center and processed through the TCGA pipeline at the TCGA Genome Data Analysis Center at the Institute for Systems Biology. Gene expression matrices were generated for 133 primary tumor samples using available (TCGA Level 3) gene expression values from RNA sequencing, expressed as RSEM values50. Imputed genotypes for all novel SNPs reported in this paper were related to gene expression, for the 128 cases with both genotype and gene expression levels available. Assocations were tested using a linear regression model (using the lm function in R).

Technical validation of imputed SNPs

To technically validate our imputation findings, we optimized TaqMan assays (Applied Biosystems) for 12 loci based on the standard pipeline at the Cancer Genomics Research Laboratory at National Cancer Institute (Supplementary Table 7). For six loci that failed initial TaqMan assay design, LD surrogate SNPs were used. We randomly selected about 1000 samples previously scanned in one of three GWAS (~300 each from NCI, Penn and Norway/Sweden) for TaqMan genotyping. For the imputed probabilistic genotypes, a threshold of 0.80 was applied to derive the discrete genotypes. The average concordance rates are 0.98, 0.97 and 0.93 for NCI, Penn and Norway/Sweden respectively (Supplementary Table 7).
  49 in total

1.  Familial testicular cancer in a single-centre population.

Authors:  D J Sonneveld; D T Sleijfer; H Schrafford Koops; R H Sijmons; W T van der Graaf; W J Sluiter; H J Hoekstra
Journal:  Eur J Cancer       Date:  1999-09       Impact factor: 9.162

2.  Estimating the heritability of colorectal cancer.

Authors:  Shuo Jiao; Ulrike Peters; Sonja Berndt; Hermann Brenner; Katja Butterbach; Bette J Caan; Christopher S Carlson; Andrew T Chan; Jenny Chang-Claude; Stephen Chanock; Keith R Curtis; David Duggan; Jian Gong; Tabitha A Harrison; Richard B Hayes; Brian E Henderson; Michael Hoffmeister; Laurence N Kolonel; Loic Le Marchand; John D Potter; Anja Rudolph; Robert E Schoen; Daniela Seminara; Martha L Slattery; Emily White; Li Hsu
Journal:  Hum Mol Genet       Date:  2014-02-21       Impact factor: 6.150

3.  Improved whole-chromosome phasing for disease and population genetic studies.

Authors:  Olivier Delaneau; Jean-Francois Zagury; Jonathan Marchini
Journal:  Nat Methods       Date:  2013-01       Impact factor: 28.547

4.  Estimation of effect size distribution from genome-wide association studies and implications for future discoveries.

Authors:  Ju-Hyun Park; Sholom Wacholder; Mitchell H Gail; Ulrike Peters; Kevin B Jacobs; Stephen J Chanock; Nilanjan Chatterjee
Journal:  Nat Genet       Date:  2010-06-20       Impact factor: 38.330

5.  Poor outcome in primary non-small cell lung cancers is predicted by transketolase TKTL1 expression.

Authors:  Gian Kayser; Wulf Sienel; Britta Kubitz; Dominik Mattern; Elmar Stickeler; Bernward Passlick; Martin Werner; Axel Zur Hausen
Journal:  Pathology       Date:  2011-12       Impact factor: 5.306

6.  Molecular cloning of a cDNA for the human phospholysine phosphohistidine inorganic pyrophosphate phosphatase.

Authors:  Fumiaki Yokoi; Hiroyuki Hiraishi; Kenji Izuhara
Journal:  J Biochem       Date:  2003-05       Impact factor: 3.387

7.  Variants near DMRT1, TERT and ATF7IP are associated with testicular germ cell cancer.

Authors:  Clare Turnbull; Elizabeth A Rapley; Sheila Seal; David Pernet; Anthony Renwick; Deborah Hughes; Michelle Ricketts; Rachel Linger; Jeremie Nsengimana; Panagiotis Deloukas; Robert A Huddart; D Timothy Bishop; Douglas F Easton; Michael R Stratton; Nazneen Rahman
Journal:  Nat Genet       Date:  2010-06-13       Impact factor: 38.330

8.  Genome-wide association analysis of more than 120,000 individuals identifies 15 new susceptibility loci for breast cancer.

Authors:  Kyriaki Michailidou; Jonathan Beesley; Sara Lindstrom; Sander Canisius; Joe Dennis; Michael J Lush; Mel J Maranian; Manjeet K Bolla; Qin Wang; Mitul Shah; Barbara J Perkins; Kamila Czene; Mikael Eriksson; Hatef Darabi; Judith S Brand; Stig E Bojesen; Børge G Nordestgaard; Henrik Flyger; Sune F Nielsen; Nazneen Rahman; Clare Turnbull; Olivia Fletcher; Julian Peto; Lorna Gibson; Isabel dos-Santos-Silva; Jenny Chang-Claude; Dieter Flesch-Janys; Anja Rudolph; Ursula Eilber; Sabine Behrens; Heli Nevanlinna; Taru A Muranen; Kristiina Aittomäki; Carl Blomqvist; Sofia Khan; Kirsimari Aaltonen; Habibul Ahsan; Muhammad G Kibriya; Alice S Whittemore; Esther M John; Kathleen E Malone; Marilie D Gammon; Regina M Santella; Giske Ursin; Enes Makalic; Daniel F Schmidt; Graham Casey; David J Hunter; Susan M Gapstur; Mia M Gaudet; W Ryan Diver; Christopher A Haiman; Fredrick Schumacher; Brian E Henderson; Loic Le Marchand; Christine D Berg; Stephen J Chanock; Jonine Figueroa; Robert N Hoover; Diether Lambrechts; Patrick Neven; Hans Wildiers; Erik van Limbergen; Marjanka K Schmidt; Annegien Broeks; Senno Verhoef; Sten Cornelissen; Fergus J Couch; Janet E Olson; Emily Hallberg; Celine Vachon; Quinten Waisfisz; Hanne Meijers-Heijboer; Muriel A Adank; Rob B van der Luijt; Jingmei Li; Jianjun Liu; Keith Humphreys; Daehee Kang; Ji-Yeob Choi; Sue K Park; Keun-Young Yoo; Keitaro Matsuo; Hidemi Ito; Hiroji Iwata; Kazuo Tajima; Pascal Guénel; Thérèse Truong; Claire Mulot; Marie Sanchez; Barbara Burwinkel; Frederik Marme; Harald Surowy; Christof Sohn; Anna H Wu; Chiu-chen Tseng; David Van Den Berg; Daniel O Stram; Anna González-Neira; Javier Benitez; M Pilar Zamora; Jose Ignacio Arias Perez; Xiao-Ou Shu; Wei Lu; Yu-Tang Gao; Hui Cai; Angela Cox; Simon S Cross; Malcolm W R Reed; Irene L Andrulis; Julia A Knight; Gord Glendon; Anna Marie Mulligan; Elinor J Sawyer; Ian Tomlinson; Michael J Kerin; Nicola Miller; Annika Lindblom; Sara Margolin; Soo Hwang Teo; Cheng Har Yip; Nur Aishah Mohd Taib; Gie-Hooi Tan; Maartje J Hooning; Antoinette Hollestelle; John W M Martens; J Margriet Collée; William Blot; Lisa B Signorello; Qiuyin Cai; John L Hopper; Melissa C Southey; Helen Tsimiklis; Carmel Apicella; Chen-Yang Shen; Chia-Ni Hsiung; Pei-Ei Wu; Ming-Feng Hou; Vessela N Kristensen; Silje Nord; Grethe I Grenaker Alnaes; Graham G Giles; Roger L Milne; Catriona McLean; Federico Canzian; Dimitrios Trichopoulos; Petra Peeters; Eiliv Lund; Malin Sund; Kay-Tee Khaw; Marc J Gunter; Domenico Palli; Lotte Maxild Mortensen; Laure Dossus; Jose-Maria Huerta; Alfons Meindl; Rita K Schmutzler; Christian Sutter; Rongxi Yang; Kenneth Muir; Artitaya Lophatananon; Sarah Stewart-Brown; Pornthep Siriwanarangsan; Mikael Hartman; Hui Miao; Kee Seng Chia; Ching Wan Chan; Peter A Fasching; Alexander Hein; Matthias W Beckmann; Lothar Haeberle; Hermann Brenner; Aida Karina Dieffenbach; Volker Arndt; Christa Stegmaier; Alan Ashworth; Nick Orr; Minouk J Schoemaker; Anthony J Swerdlow; Louise Brinton; Montserrat Garcia-Closas; Wei Zheng; Sandra L Halverson; Martha Shrubsole; Jirong Long; Mark S Goldberg; France Labrèche; Martine Dumont; Robert Winqvist; Katri Pylkäs; Arja Jukkola-Vuorinen; Mervi Grip; Hiltrud Brauch; Ute Hamann; Thomas Brüning; Paolo Radice; Paolo Peterlongo; Siranoush Manoukian; Loris Bernard; Natalia V Bogdanova; Thilo Dörk; Arto Mannermaa; Vesa Kataja; Veli-Matti Kosma; Jaana M Hartikainen; Peter Devilee; Robert A E M Tollenaar; Caroline Seynaeve; Christi J Van Asperen; Anna Jakubowska; Jan Lubinski; Katarzyna Jaworska; Tomasz Huzarski; Suleeporn Sangrajrang; Valerie Gaborieau; Paul Brennan; James McKay; Susan Slager; Amanda E Toland; Christine B Ambrosone; Drakoulis Yannoukakos; Maria Kabisch; Diana Torres; Susan L Neuhausen; Hoda Anton-Culver; Craig Luccarini; Caroline Baynes; Shahana Ahmed; Catherine S Healey; Daniel C Tessier; Daniel Vincent; Francois Bacot; Guillermo Pita; M Rosario Alonso; Nuria Álvarez; Daniel Herrero; Jacques Simard; Paul P D P Pharoah; Peter Kraft; Alison M Dunning; Georgia Chenevix-Trench; Per Hall; Douglas F Easton
Journal:  Nat Genet       Date:  2015-03-09       Impact factor: 38.330

9.  Transketolase-like 1 ectopic expression is associated with DNA hypomethylation and induces the Warburg effect in melanoma cells.

Authors:  Aparna Jayachandran; Pu-Han Lo; Anderly C Chueh; Prashanth Prithviraj; Ramyar Molania; Mercedes Davalos-Salas; Matthew Anaka; Marzena Walkiewicz; Jonathan Cebon; Andreas Behren
Journal:  BMC Cancer       Date:  2016-02-22       Impact factor: 4.430

10.  Meta-analysis identifies four new loci associated with testicular germ cell tumor.

Authors:  Charles C Chung; Peter A Kanetsky; Zhaoming Wang; Michelle A T Hildebrandt; Roelof Koster; Rolf I Skotheim; Christian P Kratz; Clare Turnbull; Victoria K Cortessis; Anne C Bakken; D Timothy Bishop; Michael B Cook; R Loren Erickson; Sophie D Fosså; Kevin B Jacobs; Larissa A Korde; Sigrid M Kraggerud; Ragnhild A Lothe; Jennifer T Loud; Nazneen Rahman; Eila C Skinner; Duncan C Thomas; Xifeng Wu; Meredith Yeager; Fredrick R Schumacher; Mark H Greene; Stephen M Schwartz; Katherine A McGlynn; Stephen J Chanock; Katherine L Nathanson
Journal:  Nat Genet       Date:  2013-05-12       Impact factor: 38.330

View more
  38 in total

1.  Radiogenomic Predictors of Adverse Effects following Charged Particle Therapy.

Authors:  Lindsay M Morton; Luisel Ricks-Santi; Catharine M L West; Barry S Rosenstein
Journal:  Int J Part Ther       Date:  2018-09-21

2.  Increasing Incidence of Testicular Germ Cell Tumors among Racial/Ethnic Minorities in the United States.

Authors:  Armen A Ghazarian; Katherine A McGlynn
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2020-05-08       Impact factor: 4.254

3.  LDtrait: An Online Tool for Identifying Published Phenotype Associations in Linkage Disequilibrium.

Authors:  Shu-Hong Lin; Derek W Brown; Mitchell J Machiela
Journal:  Cancer Res       Date:  2020-06-30       Impact factor: 12.701

4.  EnsembleCNV: an ensemble machine learning algorithm to identify and genotype copy number variation using SNP array data.

Authors:  Zhongyang Zhang; Haoxiang Cheng; Xiumei Hong; Antonio F Di Narzo; Oscar Franzen; Shouneng Peng; Arno Ruusalepp; Jason C Kovacic; Johan L M Bjorkegren; Xiaobin Wang; Ke Hao
Journal:  Nucleic Acids Res       Date:  2019-04-23       Impact factor: 16.971

5.  Divergent roles for KLF4 and TFCP2L1 in naive ground state pluripotency and human primordial germ cell development.

Authors:  G V Hancock; W Liu; L Peretz; D Chen; J J Gell; A J Collier; J R Zamudio; K Plath; A T Clark
Journal:  Stem Cell Res       Date:  2021-08-08       Impact factor: 2.020

Review 6.  Update on epidemiologic considerations and treatment trends in testicular cancer.

Authors:  Solomon L Woldu; Aditya Bagrodia
Journal:  Curr Opin Urol       Date:  2018-09       Impact factor: 2.309

7.  Mammalian germ cells are determined after PGC colonization of the nascent gonad.

Authors:  Peter K Nicholls; Hubert Schorle; Sahin Naqvi; Yueh-Chiang Hu; Yuting Fan; Michelle A Carmell; Ina Dobrinski; Adrienne L Watson; Daniel F Carlson; Scott C Fahrenkrug; David C Page
Journal:  Proc Natl Acad Sci U S A       Date:  2019-11-21       Impact factor: 11.205

Review 8.  Human germ cell tumours from a developmental perspective.

Authors:  J Wolter Oosterhuis; Leendert H J Looijenga
Journal:  Nat Rev Cancer       Date:  2019-08-14       Impact factor: 60.716

Review 9.  Testicular cancer in 2017: Sequencing advances understanding.

Authors:  Matthew J Murray; Clare Turnbull
Journal:  Nat Rev Urol       Date:  2017-12-19       Impact factor: 14.432

10.  Subphenotype meta-analysis of testicular cancer genome-wide association study data suggests a role for RBFOX family genes in cryptorchidism susceptibility.

Authors:  Yanping Wang; Dione R Gray; Alan K Robbins; Erin L Crowgey; Stephen J Chanock; Mark H Greene; Katherine A McGlynn; Katherine Nathanson; Clare Turnbull; Zhaoming Wang; Marcella Devoto; Julia Spencer Barthold
Journal:  Hum Reprod       Date:  2018-05-01       Impact factor: 6.918

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.