Literature DB >> 29719630

Genetic susceptibility to bone and soft tissue sarcomas: a field synopsis and meta-analysis.

Clara Benna1,2, Andrea Simioni1, Sandro Pasquali1,3, Davide De Boni1, Senthilkumar Rajendran1, Giovanna Spiro1, Chiara Colombo3, Calogero Virgone4, Steven G DuBois5, Alessandro Gronchi3, Carlo Riccardo Rossi1,6, Simone Mocellin1,6.   

Abstract

BACKGROUND: The genetic architecture of bone and soft tissue sarcomas susceptibility is yet to be elucidated. We aimed to comprehensively collect and meta-analyze the current knowledge on genetic susceptibility in these rare tumors.
METHODS: We conducted a systematic review and meta-analysis of the evidence on the association between DNA variation and risk of developing sarcomas through searching PubMed, The Cochrane Library, Scopus and Web of Science databases. To evaluate result credibility, summary evidence was graded according to the Venice criteria and false positive report probability (FPRP) was calculated to further validate result noteworthiness. Integrative analysis of genetic and eQTL (expression quantitative trait locus) data was coupled with network and pathway analysis to explore the hypothesis that specific cell functions are involved in sarcoma predisposition.
RESULTS: We retrieved 90 eligible studies comprising 47,796 subjects (cases: 14,358, 30%) and investigating 1,126 polymorphisms involving 320 distinct genes. Meta-analysis identified 55 single nucleotide polymorphisms (SNPs) significantly associated with disease risk with a high (N=9), moderate (N=38) and low (N=8) level of evidence, findings being classified as noteworthy basically only when the level of evidence was high. The estimated joint population attributable risk for three independent SNPs (rs11599754 of ZNF365/EGR2, rs231775 of CTLA4, and rs454006 of PRKCG) was 37.2%. We also identified 53 SNPs significantly associated with sarcoma risk based on single studies.Pathway analysis enabled us to propose that sarcoma predisposition might be linked especially to germline variation of genes whose products are involved in the function of the DNA repair machinery.
CONCLUSIONS: We built the first knowledgebase on the evidence linking DNA variation to sarcomas susceptibility, which can be used to generate mechanistic hypotheses and inform future studies in this field of oncology.

Entities:  

Keywords:  SNP; meta-analysis; polymorphisms; risk; sarcoma

Year:  2018        PMID: 29719630      PMCID: PMC5915097          DOI: 10.18632/oncotarget.24719

Source DB:  PubMed          Journal:  Oncotarget        ISSN: 1949-2553


INTRODUCTION

Sarcomas are a family of rare malignant tumors arising from bone and soft tissues with more than 50 different histologies accounting for about 1-2% of cancers in adults and 15-20% in children (worldwide incidence: approximately 200,000 cases per year). The pathogenesis of sarcomas is multifactorial including environmental (such as exposure to ionizing radiations or chemical carcinogens) and genetic components, although the disease rarity represents an objective hurdle to the research in this field of investigation. Significant advances have been made in the understanding of the acquired genetic events leading to sarcomagenesis. It has been recognized that three types of somatic DNA alterations, translocations, mutations, and copy number variations, play a key role in these tumors [1]. As a consequence, sarcomas are grouped into two categories: balanced translocation associated sarcomas (BATS) and complex genotype/karyotype sarcomas (CGKS), which are characterized by a stable genome and genomic instability, respectively [2]. A potential therapeutic implication of such genetic taxonomy classification is that some recurrent chromosomal translocations might be exploited for the development of drugs targeting the protein products of fusion oncogenes [1]. Conversely, knowledge on the role of germline DNA variations in sarcomagenesis is sparse and limited. Although a minority of sarcomas arise within well characterized heritable cancer predisposition syndromes (e.g., osteosarcoma and Bloom syndrome, desmoid tumors and familial adenomatous polyposis) [3], the vast majority of sarcomas occur sporadically and the role of the genetic background in their pathogenesis is to be uncovered. Recent advances in molecular high-throughput technology, which conduct of genome wide association studies (GWAS), is accelerating the pace of discovery of sarcoma predisposition loci. Looking at the already existing international literature, some investigators have meta-analyzed the evidence regarding a handful of SNPs such as XRCC3 rs861539 [4], MDM2 rs2279744 [5, 6], and CTLA4 rs231775 [7]: however, to the best of our knowledge no comprehensive collection of the available data in this field of oncology has been published thus far. With the present work we systematically reviewed and meta-analyzed the available evidence in this field in order to: 1) provide readers with the first knowledgebase dedicated to the relationship between germline DNA variation and sarcoma risk; 2) identify areas lacking of meaningful information thus helping to inform future studies; and 3) suggest a biological interpretation of current findings utilizing network and pathway analysis [8] after integrating multiple sources of biological data [9].

RESULTS

Characteristics of the eligible studies

We identified 90 eligible articles, comprising 47,796 subjects, 14,358 cases and 33,438 controls. The details of the literature search are summarized in Figure 1.
Figure 1

Flow diagram summarizing the search strategy and the study selection process

Based on the prevalent ancestry (ie. the race of at least 80% of the enrolled subjects) the majority of the studies were Asian (N=57 studies) the rest being Caucasian (N=25 studies), or mixed (N=8 studies). Based on study design, half of included studies were population based case-controls studies (N=40 studies), the remaining were hospital based (N=39 studies), with a few (N=11) being mixed or not specified. Two studies were GWAS [10, 11]. According to histology, the majority of the eligible studies investigated bone tumors (N=65) and the remaining investigated Ewing's sarcoma (N=9), soft tissue sarcomas (N=6), chordoma (N=4), hemangiosarcoma (N=1), and mixed sarcomas (N=5). Thirteen studies investigated pediatric subjects or young adults. Although pediatric/young age ranged from 0 to 35 years old in eligible studies, most of the studies considered subjects < 20 years old. We evaluated the included studies following the criteria of the Newcastle-Ottawa scale (NOS) scoring system. The mean score was 7.8. The main features of all the eligible studies and the NOS score are available on Table 1.
Table 1

Characteristics of the included studies and Newcastle-Ottawa quality assessment (NOS) evaluation

Included articles referencesSubjects characteristicsNOS
First AuthorJournalYearCancer TypeCasesControlsAgeEthnicitySource of ControlsNOS 123NOS [0–9]
Adiguzel M. [12]Indian J Exp Biol2016Bone tumors5481AdultCaucasianPopulation4138
Alhopuro P. [13]J Med Genet2005Soft tissue sarcoma68185AdultCaucasianPopulation4138
Almeida PSR. [14]Genet Mol Res2008Soft tissue sarcoma10085AdultMixednot specified2136
Aoyama T. [15]Cancer Letters2002Bone tumors3872AdultAsianPopulation3137
Barnette P. [16]Cancer Epidemiol Biomarkers Prev2004Mixed42326Pediat/YoungCaucasianPopulation3238
Biason P. [17]Pharmacogenomics J2012Bone tumors130250AdultCaucasianHospital3238
Bilbao-Aldaiturriaga N. [18]Pediatr Blood Cancer2015Bone tumors99387Pediat/YoungCaucasianHospital3238
Chen Y. [19]Tumor Biol2016Bone tumors190190AdultAsianHospital3238
Cong Y. [20]Tumor Biol2015Bone tumors203406AdultAsianHospital3238
Cui Y. [21]Biomarkers2016Bone tumors251251AdultAsianHospital3238
Cui Y. [22]Tumor Biol2016Bone tumors260260AdultAsianHospital3238
Dong YZ. [23]Genet Mol Res2015Bone tumors185201AdultAsianHospital3238
DuBois SG. [24]Pediatr Blood Cancer2011Ewing's sarcoma135200Pediat/YoungCaucasianHospital2136
Ergen A. [25]Mol Biol Rep2011Bone tumors5050AdultCaucasiannot specified3137
Feng D. [26]Genet Test Mol Biomarkers2013Ewing's sarcoma308362AdultAsianHospital3238
Gloudemans T. [27]Cancer Res1993Soft tissue sarcoma926AdultCaucasianPopulation3036
Grochola LF. [28]Clin Cancer Res2009Soft tissue sarcoma130497AdultCaucasianPopulation3137
Grünewald TG. [29]Nat Genet2015Ewing's sarcoma343251AdultCaucasianPopulation4239
Guo J. [30]Genet Mol Res2015Bone tumors136136AdultAsianHospital3137
He J. [31]Endocr J2013Bone tumors415431AdultAsianHospital3238
He J. [32]Endocrine2014Bone tumors415431AdultAsianHospital3238
He M. [33]Tumor Biol2014Bone tumors189195AdultAsianHospital3238
He ML. [34]Asian Pac J Cancer Prev2013Bone tumors5963AdultAsianHospital3137
He Y. [35]Int Orthop2014Bone tumors120120AdultAsianHospital3238
Hu GL. [36]Genet Mol Res2015Bone tumors130130AdultAsianHospital3238
Hu YS. [37]BMC Cancer2010Bone tumors168168AdultAsianPopulation4239
Hu YS. [38]Med Oncol2011Bone tumors168168AdultAsianPopulation4239
Hu Z. [39]Genet Test Mol Biomarkers2015Bone tumors368370AdultAsiannot specified2136
Ito M. [40]Clin Cancer Res2010Soft tissue sarcoma15537AdultMixedHospital2035
Jiang C. [41]Med Oncol2014Bone tumors168216AdultAsianHospital3238
Kelley MJ. [42]Hum Genet2014Chordoma103160AdultAsianPopulation4138
Koshkina NV. [43]J Pediatr Hematol Oncol2007Bone tumors123510Pediat/YoungMixedPopulation4138
Le Morvan V. [44]Int J Cancer2006Mixed9353AdultCaucasianPopulation4037
Li L. [45]Genet Mol Res2015Bone tumors52100AdultAsianHospital3126
Liu Y. [46]DNA Cell Biol2011Bone tumors267282AdultAsianPopulation3137
Liu Y. [47]PloSONE2012Bone tumors326433AdultAsianPopulation4239
Lu H. [48]Tumor Biol2015Bone tumors388388AdultAsianHospital3238
Lu XF. [49]Asian Pac J Cancer Prev2011Bone tumors110226AdultAsianHospital3137
Lv H. [50]Mol Med Rep2014Bone tumors103201AdultAsianHospital2136
Ma X. [51]Genet Mol Res2016Bone tumors141282AdultAsianHospital2237
Martinelli M. [52]Oncotarget2016Ewing's sarcoma100147Pediat/YoungCaucasianPopulation4239
Mei JW. [99]Int J Clin Exp Pathol2016Bone tumors97120AdultAsianPopulation3137
Miao C.[53]Sci Rep2015Soft tissue sarcoma138131AdultAsianHospital2237
Mirabello L. [54]Carcinogenesis2010Bone tumors991430AdultCaucasianmixed3238
Mirabello L. [55]BMC Cancer2011Bone tumors961426AdultCaucasianmixed3238
Nakayama R. [56]Cancer Sci2008Mixed5441378AdultAsianmixed3238
Naumov VA. [57]Bull Exp Biol Med2012Bone tumors6896AdultCaucasiannot specified3137
Oliveira ID. [58]J Pediatr Hematol Oncol2007Bone tumors80160Pediat/YoungMixedHospital3238
Ozger H. [59]Folia Biologica (Praha)2008Mixed5644AdultCaucasianPopulation4037
Patino-Garcia A. [60]J Med Genet2000Bone tumors110111Pediat/YoungCaucasiannot specified3238
Pillay N. [61]Nat Genet2012Chordoma40358AdultCaucasianpopulation3238
Postel-Vinay S. [10]Nat Genet2012Ewing's sarcoma4014352AdultCaucasianpopulation4239
Qi Y. [62]Tumor Biol2016Bone tumors206206AdultAsianHospital3238
Qu WR. [63]Genetic Mol Res2016Bone tumors153252AdultAsianHospital3238
Ru JY. [64]Int J Clin Exp Pathol2015Bone tumors210420AdultAsianHospital3238
Ruza E. [65]J Pediatr Hematol Oncol2003Mixed125143Pediat/YoungCaucasiannot specified3227
Saito T. [66]Int J Cancer2000Hemangiosarcoma2284AdultMixedPopulation2136
Salinas-Souza C. [67]Pharmacogenet Genomics2010Bone tumors80160Pediat/YoungMixedHospital3238
Savage SA. [68]Cancer Epidemiol Biomarkers Prev2007Bone tumors10474Pediat/YoungCaucasianHospital2136
Savage SA. [69]Pediatr Blood Cancer2007Bone tumors10474Pediat/YoungCaucasianHospital2136
Savage SA. [11]Nat Genet2013Bone tumors9413291AdultCaucasianPopulation4239
Shi ZW. [70]Cancer Biomark2016Bone tumors174150AdultAsianHospital3137
Silva DS. [71]Gene2012Ewing's sarcoma24200AdultMixedPopulation3238
Tang YJ. [72]Medicine2014Bone tumors160250AdultAsianPopulation4239
Thurow HS. [73]Mol Biol Rep2013Ewing's sarcoma2491AdultMixedPopulation3238
Tian Q. [74]Eur J Surg Oncol2013Bone tumors133133AdultAsianPopulation4239
Tie Z. [75]Int J Clin Exp Pathol2014Bone tumors165330AdultAsianPopulation4239
Toffoli G. [76]Clin Cancer Res2009Bone tumors201250AdultCaucasianPopulation4239
Walsh KM. [77]Carcinogenesis2016Bone tumors6606892Pediat/YoungCaucasianPopulation4239
Wang J. [78]DNA Cell Biol2012Ewing's sarcoma158212AdultAsianPopulation3238
Wang J. [79]DNA Cell Biol2013Bone tumors106210AdultAsianPopulation3238
Wang K. [80]Biomed Rep2014Chordoma6565AdultAsianPopulation3137
Wang K. [81]Tumor Biol2016Bone tumors126168AdultAsianHospital3238
Wang W. [82]DNA Cell Biol2011Bone tumors205216AdultAsianHospital3238
Wang W. [83]Genet Test Mol Biomarkers2011Bone tumors205215AdultAsianHospital3238
Wang Z. [84]Tumor Biol2014Bone tumors330342AdultAsianPopulation4239
Wu Y. [85]Tumor Biol2015Bone tumors124136AdultAsianHospital3238
Wu Z. [86]Int J Mol Sci2013Chordoma65120AdultAsiannot specified3137
Xin DJ. [87]Int J Clin Exp Pathol2015Bone tumors90100AdultAsianPopulation4138
Xu H. [88]Med Sci Monit2016Bone tumors279286Pediat/YoungAsianHospital3238
Xu S. [89]DNA Cell Biol2014Bone tumors202216AdultAsianPopulation4239
Yang L. [90]Int J Clin Exp Pathol2015Bone tumors152304AdultAsianPopulation4239
Yang S. [91]Genet Test Mol Biomarkers2012Ewing's sarcoma223302AdultAsianPopulation4239
Yang W. [92]Med Oncol2014Bone tumors118126AdultAsiannot specified3238
Zhang G. [93]Genet Mol Res2015Bone tumors180360AdultAsianPopulation4239
Zhang HF. [94]Genet Mol Res2015Bone tumors182182AdultAsianPopulation4239
Zhang N. [95]Onco Targets Ther2016Bone tumors276286AdultAsianHospital3238
Zhang Y. [96]Tumor Biol2014Bone tumors610610AdultAsianPopulation4239
Zhao J. [97]BioMed Res Int2014Bone tumors247428AdultAsianPopulation4239
Zhi LQ. [98]Tumor Biol2014Bone tumors212240AdultAsianHospital3238

NOS: Newcastle-Ottawa quality assessment scale evaluation (0-9). NOS1: selection of the study groups (0-4); NOS2: comparability of the groups (0-2); NOS3: ascertainment of the exposure or outcome (0-3).

NOS: Newcastle-Ottawa quality assessment scale evaluation (0-9). NOS1: selection of the study groups (0-4); NOS2: comparability of the groups (0-2); NOS3: ascertainment of the exposure or outcome (0-3).

Characteristics of the retrieved genetic variants

Overall, data on 1,126 polymorphisms involving 320 genes were retrieved. Variations were mainly SNPs, only six being insertion/deletions of more than one nucleotide. Based on the number of different genetic variations studied, the 11 most studied genes were the following: EGR2 (179 different SNPs), ADO (58 different SNPs), ZNF365 (40 different SNPs), TRAPPC9 (28 different SNPs), CASC8 (23 different SNPs), CD99 (20 different SNPs), EWSR1 (16 different SNPs) TP53, HSD17B2 (15 different SNPs each) and UGT1A8, LOC107984012 (12 different SNPs each). Thirty-seven of these genetic variants were located no more than 2kb upstream the relevant gene, ten no more than 500bp downstream the relevant gene, 493 in introns, 100 in exons (non-UTRs), 19 in the 3’-UTR, seven in the 5’-UTR. Moreover, 413 SNPs were located in intergenic regions more than 2kb upstream or more than 500 bp downstream the relevant gene and 41 in non-coding transcripts. Among the exonic SNPs, 63 had a missense functional effect, while 37 were synonymous. Detailed information on all SNPs is reported in Supplementary Table 1.

Meta-analysis findings

At least two independent datasets were available for 51 genetic variations allowing us to perform 118 meta-analyses, 16 of them were histology-based meta-analysis on osteosarcoma and Ewing's sarcoma. Moreover, 13 sensitivity analysis were performed considering the ethnicity of the different datasets. The results of data meta-analyses are comprehensively reported in Supplementary Table 2. Polymorphism “rs” identifier, nucleotide change and amino acid change are reported in Supplementary Table 3. The eight most studied genetic variants were the following: TP53 rs1042522 (6 datasets), VEGF rs3025039 and GSTM1 deletion (5 datasets each), CTLA4 rs231775, CTLA4 rs5742909, MDM2 rs2279744, rs10434 VEGF and GSTT1 deletion (4 datasets each). The number of subject (cases plus controls) enrolled in the 118 meta-analyses ranged from 144 to 5,347 (median: 1,195). Based on the number of subjects, the 10 most studied genetic variants, all with 5,347 subjects, were the following: EGR2 rs224292 and rs224278, ADO rs1848797 and rs1509966, MDM2 rs1690916, LOC107984012 rs9633562, rs944684 and rs6479860, ZNF365 rs11599754 and rs10761660. Of the 118 meta-analyses and 13 sensitivity analysis (131 total analyses) performed, 55 resulted to be statistically significant (P-value <0.05). The level of summary evidence, among the significant associations identified by meta-analysis, was high, intermediate, and low in 9, 38, and 8 analyses respectively. The most frequent single cause of non-high-quality level of evidence was between-study heterogeneity followed by the small sample size. Considering all statistically significant meta-analyses FPRP was optimal (<0.2) at least at the 10E3 level for 10/55 analysis, 9 of them with high level of summary evidence. The details of significant associations are reported in Table 2.
Table 2

Meta-analysis results: genetic variants significantly associated with sarcoma risk

SNP IDGenesAnalysisModelSarcoma typedatasetsMeta-analysis EthnicityOR [95% CI]I 2 %P valueCasesControlsRef/ AltVenice CriteriaFPRP (E-03)Level of Evidence
rs11599754ZNF365, ADOprimaryPer alleleEwing's2Caucasian1.48 [1.32, 1.66]0<0.000017444603T/CAAAYHIGH
rs1509966ADO, EGR2primaryPer alleleEwing's2Caucasian1.58 [1.42, 1.77]0<0.000017444603A/GAAAYHIGH
rs1848797ADO, EGR2primaryPer alleleEwing's2Caucasian1.57 [1.4, 1.77]0<0.000017444603G/AAAAYHIGH
rs224278EGR2primaryPer alleleEwing's2Caucasian1.73 [1.49, 2.02]0<0.000017444603T/CAAAYHIGH
rs9633562EGR2, LOC107984012primaryPer alleleEwing's2Caucasian1.46 [1.29, 1.65]0<0.000017444603A/CAAAYHIGH
rs10761660ADO, EGR2primaryPer alleleEwing's2Caucasian1.39 [1.21, 1.6]0<0.000017444603T/CAAAYHIGH
rs224292ADO, EGR2primaryPer alleleEwing's2Caucasian1.67 [1.42, 1.96]0<0.000017444603A/GAAAYHIGH
rs231775CTLA4primaryPer alleleMixed4Asian1.36 [1.2, 1.54]0<0.0000110031162G/AAAAYHIGH
rs454006PRKCGprimaryPer alleleOsteo2Asian1.35 [1.18, 1.54]0<0.0001998998T/CAAAYHIGH
rs944684LOC107984012primaryPer alleleEwing's2Caucasian1.73 [1.4, 2.14]49<0.000017444603C/TABAYINTERM
rs2305089TsensitivityPer alleleChordoma2Caucasian3.91 [2.4, 6.38]47<0.00001163881G/AABANINTERM
rs1042522TP53primaryDominantMixed6Mixed0.67 [0.53, 0.84]00.0007788950G/CAAANINTERM
rs1042522TP53subgroupDominantOsteo3Mixed0.6 [0.43, 0.84]150.002509737G/CAAANINTERM
rs1129055CD86primaryRecessiveMixed2Asian0.6 [0.41, 0.88]00.008363428A/GBAANINTERM
rs11737764NUDT6primaryDominantBone tumor2Caucasian2.12 [1.34, 3.37]00.0011641522A/CAAANINTERM
rs1690916MDM2primaryPer alleleEwing's2Caucasian0.62 [0.46, 0.83]00.0011641522C/TAAANINTERM
rs17206779ADAMTS6primaryPer alleleOsteo2Mixed0.79 [0.67, 0.93]350.00411093507C/TABANINTERM
rs17655ERCC5primaryRecessiveMixed2Caucasian2.04 [1.07, 3.9]00.03223515G/CBAANINTERM
rs1799793ERCC2primaryPer alleleOsteo2Mixed0.75 [0.58, 0.97]230.03271532G/ABAANINTERM
rs1799793ERCC2primaryDominantOsteo2Mixed0.63 [0.44, 0.89]00.009271532G/ABAANINTERM
rs1800896IL10primaryPer alleleOsteo2Mixed1.33 [1.06,1.66]00.01340420A/GBAANINTERM
rs1906953GRM4sensitivityPer alleleOsteo2Asian0.68 [0.55, 0.84]00.0004294384G/ABAANINTERM
rs2279744MDM2primaryPer alleleMixed4Mixed1.36 [1.06, 1.76]260.02448563T/GABANINTERM
rs2279744MDM2primaryRecessiveMixed4Mixed1.58 [1.03, 2.42]200.04448563T/GAAANINTERM
rs2279744MDM2primaryDominantMixed4Mixed1.55 [1.05, 2.29]360.03448563T/GABANINTERM
rs231775CTLA4primaryRecessiveMixed4Asian2 [1.53, 2.62]0<0.0000110031162G/AAAANINTERM
rs231775CTLA4primaryDominantMixed4Asian1.35 [1.14, 1.61]00.000710031162G/AAAANINTERM
rs231775CTLA4subgroupPer alleleEwing's2Asian1.36 [1.15, 1.61]00.0003531664G/AAAANINTERM
rs231775CTLA4subgroupRecessiveEwing's2Asian2 [1.39, 2.89]00.0002531664G/AAAANINTERM
rs231775CTLA4subgroupDominantEwing's2Asian1.36 [1.07, 1.72]00.01531664G/AAAANINTERM
rs231775CTLA4subgroupPer alleleOsteo2Asian1.36 [1.13, 1.64]00.001472498G/AABANINTERM
rs231775CTLA4subgroupRecessiveOsteo2Asian2 [1.34, 2.98]00.0007472498G/AABANINTERM
rs231775CTLA4subgroupDominantOsteo2Asian1.35 [1.04, 1.75]00.02472498G/AABANINTERM
rs3025039VEGFAprimaryPer alleleOsteo5Asian1.28 [1.12, 1.47]00.00049871344C/TAAANINTERM
rs3025039VEGFAprimaryRecessiveOsteo5Asian1.65 [1.19, 2.27]60.0029871344C/TAAANINTERM
rs3025039VEGFAprimaryDominantOsteo5Asian1.24 [1.04, 1.47]00.029871344C/TAAANINTERM
rs454006PRKCGprimaryRecessiveOsteo2Asian1.99 [1.54, 2.58]0<0.0001998998T/CAAANINTERM
rs6599400FGFR3primaryPer alleleOsteo2Caucasian1.53 [1.19, 1.97]00.0011641522C/AAAANINTERM
rs699947VEGFAprimaryPer alleleOsteo2Asian1.46 [1.19, 1.79]00.0003347512C/ABAANINTERM
rs699947VEGFAprimaryRecessiveOsteo2Asian1.73 [1.17, 2.55]00.006347512C/ABAANINTERM
rs699947VEGFAprimaryDominantOsteo2Asian1.51 [1.14, 2]00.004347512C/ABAANINTERM
rs820196RECQL5primaryRecessiveOsteo2Asian2.15 [1.41, 3.29]00.0004397441T/CBAANINTERM
rs820196RECQL5primaryDominantOsteo2Asian1.49 [1.12, 1.98]00.006397441T/CBAANINTERM
rs861539XRCC3, KLC1primaryPer alleleOsteo2Asian1.57 [1.25, 1.97]00.0001288440C/TBAANINTERM
rs861539XRCC3, KLC1primaryRecessiveOsteo2Asian2.23 [1.4, 3.57]00.0008288440C/TBAANINTERM
rs861539XRCC3, KLC1primaryDominantOsteo2Asian1.57 [1.16, 2.13]00.003288440C/TBAANINTERM
deletionGSTT1primaryRecessiveMixed4Mixed1.32 [1.01, 1.73]40.04355938non-null/ nullAAANINTERM
rs1042522TP53primaryPer alleleMixed6Mixed0.6 [0.39, 0.93]840.02788950G/CACANLOW
rs1042522TP53subgroupPer alleleOsteo3Mixed0.47 [0.23, 0.95]930.04509737G/CACANLOW
rs1129055CD86primaryPer alleleMixed2Asian0.33 [0.11, 1.01]930.05363428A/GBCANLOW
rs2305089TprimaryPer alleleChordoma3Mixed2.87 [1.35, 6.08]860.0062281001G/AACANLOW
rs2305089TprimaryRecessiveChordoma2Mixed4.16 [1.21, 14.25]820.02125841G/ABCANLOW
rs6479860LOC107984012 NRBF2primaryPer alleleEwing's2Caucasian1.79 [1.36, 2.34]66<0.00017444603C/TACANLOW
rs7591996GRM4primaryPer alleleOsteo2Mixed1.28 [1.02, 1.61]530.0311093507A/CACANLOW
deletionGSTM1sensitivityRecessiveBone tumor3Asian1.69 [1.02, 2.81]660.04315578non-null/ nullBCANLOW

OR [95%CI]: Summary Odds Ratio [95% Confidence Interval]; Ref: reference allele; Alt: alternative allele; Venice criteria: A (high), B (moderate), C (weak) credibility for three parameters (amount of evidence, heterogeneity and bias); FPRP: false positive report probability at a prior probability of 10E-3; Y: noteworthy association (FPRP cut-off value 0.2), N: non noteworthy association; Level of evidence: overall level of summary evidence according to the Venice criteria and FPRP.

OR [95%CI]: Summary Odds Ratio [95% Confidence Interval]; Ref: reference allele; Alt: alternative allele; Venice criteria: A (high), B (moderate), C (weak) credibility for three parameters (amount of evidence, heterogeneity and bias); FPRP: false positive report probability at a prior probability of 10E-3; Y: noteworthy association (FPRP cut-off value 0.2), N: non noteworthy association; Level of evidence: overall level of summary evidence according to the Venice criteria and FPRP. In order to provide an estimate of the impact of germline variants on sarcoma risk, the PAR (population attributable risk) was calculated. As an example, we considered the following three independent SNPs with high quality evidence on their relationship with sarcoma risk: rs11599754 of ZNF365/EGR2 (chromosome 10, risk allele: C, risk allele frequency in European ancestry population: 0.39, meta-analysis OR: 1.48); rs231775 of CTLA4 (chromosome 2, risk allele: A, risk allele frequency in European ancestry population: 0.65, meta-analysis OR: 1.36); and rs454006 of PRKCG (chromosome 19, risk allele: C, risk allele frequency in European ancestry population: 0.25, meta-analysis OR: 1.35). The PAR resulted equal to 37.2%.

Associations based on single studies

Beside the variations resulted to be statistically significantly associated with sarcoma risk in this meta-analysis, we retrieved from the included articles 906 SNPs statistically significantly associated with sarcoma risk (P-value <0.05) based on single-study analysis. In Table 3 are reported 53 SNPs strongly associated with Ewing's sarcoma or osteosarcoma risk (P-value
Table 3

Statistically significant associations based on single studies (P-value threshold E-06)

ReferenceCancer typeGenesSNP IDRef/AltChrOR [95%CI]P-valuelocationeQTLeQTL P-value skeletal muscle
Postel-Vinay S. [10]Ewing'sC1orf127, TARDBPrs9430161T/G12.20 [1.80, 2.70]1.40E-20intergene
Postel-Vinay S. [10]Ewing'sC1orf127rs2003046A/C11.80 [1.50, 2.20]1.30E-14intron
Postel-Vinay S. [10]Ewing'sC1orf127rs11576658T/C11.80 [1.40, 2.30]9.40E-11intron
Postel-Vinay S. [10]Ewing'sSRP14-AS1rs4924410C/A151.50 [1.30, 1.70]6.60E-09intronRP11-521C20.21.60E-07
Grünewald TG. [29]Ewing'sADO, EGR2rs10995305G/A101.59 [1.26, 2.00]4.38E-07intergeneADO1.40E-16
Zhao J. [97]OsteoARHGAP35rs1052667C/T192.25 [1.64, 3.09]4.43E-07utr 3 primeARHGAP35Other tissue
Grünewald TG. [29]Ewing'sADO, EGR2rs224290G/C100.55 [0.43, 0.70]7.80E-07intergeneADO7.50E-14
Grünewald TG. [29]Ewing'sADO, EGR2rs224291G/A100.55 [0.43, 0.70]7.80E-07intergeneADO7.20E-14
Grünewald TG. [29]Ewing'sADO, EGR2rs224296C/T100.55 [0.43, 0.70]7.80E-07intergeneADO2.90E-14
Grünewald TG. [29]Ewing'sADO, EGR2rs224297T/C100.55 [0.43, 0.70]7.80E-07intergeneADO2.80E-14
Grünewald TG. [29]Ewing'sADO, EGR2rs224298G/A100.55 [0.43, 0.70]7.80E-07intergeneADO2.90E-14
Grünewald TG. [29]Ewing'sADO, EGR2rs224294C/T100.54 [0.43, 0.69]1.01E-06intergeneADO5.60E-14
Grünewald TG. [29]Ewing'sADO, EGR2rs224293G/A100.55 [0.44, 0.71]1.02E-06intergeneADO7.20E-14
Grünewald TG. [29]Ewing'sEGR2, ADOrs1848796C/T101.80 [1.42, 2.29]1.08E-06intergeneADO2.90E-14
Grünewald TG. [29]Ewing'sADO, EGR2rs224282A/G100.55 [0.44, 0.71]1.08E-06intergeneADO7.20E-14
Savage SA. [11]OsteoADAMTS17rs2086452T/C151.35 [1.19, 1.52]1.12E-06intron
Grünewald TG. [29]Ewing'sEGR2rs648746G/T100.56 [0.44, 0.71]1.21E-06upstreamADO5.10E-15
Grünewald TG. [29]Ewing'sEGR2rs648748G/A100.56 [0.44, 0.71]1.21E-06upstreamADO5.10E-15
Grünewald TG. [29]Ewing'sEGR2rs7076924A/G101.79 [1.41, 2.28]1.21E-06upstreamADO5.50E-15
Grünewald TG. [29]Ewing'sEGR2rs224277T/C100.56 [0.44, 0.71]1.40E-06upstreamADO3.30E-15
Grünewald TG. [29]Ewing'sADO, EGR2rs224289T/C100.56 [0.44, 0.71]1.42E-06intergeneADO7.20E-14
Grünewald TG. [29]Ewing'sADO, EGR2rs7096645G/T101.78 [1.40, 2.27]1.54E-06intergeneADO8.60E-14
Grünewald TG. [29]Ewing'sLOC107984012, NRBF2rs10740101A/G102.07 [1.55, 2.76]2.29E-06intergeneADO4.90E-10
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs7079482C/T102.06 [1.54, 2.76]2.69E-06intergeneADO1.70E-10
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs1115705T/C102.07 [1.55, 2.77]2.73E-06intergeneADO9.40E-11
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs983319A/T102.07 [1.55, 2.77]2.99E-06intergeneADO4.10E-10
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs1571918A/G102.05 [1.54, 2.74]3.44E-06intergeneADO2.80E-10
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs1888968C/T102.05 [1.54, 2.74]3.44E-06intergeneADO1.90E-10
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs1912369G/A102.05 [1.54, 2.74]3.44E-06intergeneADO3.50E-10
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs4147153A/G102.05 [1.54, 2.74]3.44E-06intergeneADO3.50E-10
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs4237316C/T102.05 [1.54, 2.74]3.44E-06intergeneADO1.90E-10
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs4746746C/T102.05 [1.54, 2.74]3.44E-06intergeneADO7.20E-10
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs6479854C/T102.05 [1.54, 2.74]3.44E-06intergeneADO1.50E-10
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs7100213T/C102.05 [1.54, 2.74]3.44E-06intergeneADO2.10E-10
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs4746745T/C102.03 [1.52, 2.72]3.48E-06intergeneADO5.90E-11
Grünewald TG. [29]Ewing'sADO, EGR2rs224301G/A100.60 [0.47, 0.76]3.67E-06intergeneADO1.20E-10
Grünewald TG. [29]Ewing'sADO, EGR2rs224302G/A100.60 [0.47, 0.76]3.67E-06intergeneADO3.70E-10
Grünewald TG. [29]Ewing'sADO, EGR2rs10822056C/T101.65 [1.31, 2.09]3.70E-06intergeneADO3.00E-13
Grünewald TG. [29]Ewing'sADO, EGR2rs224295A/C100.60 [0.48, 0.76]4.80E-06intergeneADO1.50E-10
Grünewald TG. [29]Ewing'sADO, EGR2rs224299T/C100.60 [0.48, 0.76]4.80E-06intergeneADO1.50E-10
Savage SA. [11]OsteoLOC105373401, LOC105373402rs13403411C/T21.30 [1.16, 1.46]5.20E-06intergene
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs1509952C/T102.06 [1.54, 2.76]5.28E-06intergeneADO3.50E-10
Grünewald TG. [29]Ewing'sLOC107984012rs10740095T/C102.03 [1.52, 2.72]5.50E-06intronADO4.20E-11
Grünewald TG. [29]Ewing'sLOC107984012rs925307T/C102.03 [1.52, 2.72]5.50E-06intronADO6.00E-11
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs7073383A/G102.01 [1.50, 2.69]5.98E-06intergeneADO1.60E-10
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs10733780G/T102.01 [1.50, 2.69]6.90E-06intergeneADO2.90E-10
Grünewald TG. [29]Ewing'sLOC107984012rs7071512T/C102.01 [1.50, 2.69]6.90E-06intronADO4.20E-11
Savage SA. [11]OsteoFAM208B, GDI2rs2797501A/G100.62 [0.51, 0.77]7.88E-06missense, downstream
Savage SA. [11]OsteoDLEU1, LOC107984568rs573666G/A130.77 [0.68, 0.86]8.59E-06intergeneEBPLOther tissue
Grünewald TG. [29]Ewing'sEGR2, LOC107984012rs10740097C/T102.03 [1.51, 2.72]9.03E-06intergeneADO1.20E-10
Grünewald TG. [29]Ewing'sLOC107984012rs6479848T/C102.01 [1.50, 2.69]9.16E-06intronADO2.70E-11
Grünewald TG. [29]Ewing'sZNF365, ADO, EGR2rs224079C/T101.58 [1.25, 2.01]9.24E-06intergeneADO5.00E-22
Grünewald TG. [29]Ewing'sLOC107984012rs965128C/T101.99 [1.49, 2.66]9.48E-06intronADO3.10E-11

OR [95%CI]: Odds Ratio [95% Confidence Interval]; Ref: reference allele; Alt: alternative allele; eQTL: expression quantitative trait locus.

OR [95%CI]: Odds Ratio [95% Confidence Interval]; Ref: reference allele; Alt: alternative allele; eQTL: expression quantitative trait locus. One dataset was available for each of those genetic variants. Although it was not possible to perform a meta-analysis, a strong association with sarcoma risk was found (P-values range from E-20 to E-06). Ewing's sarcoma associations in European and US European-descendant population mainly involved the candidate risk loci at 1p36.22, 10q21 reported by Postel-Vinay et al [10] GWAS and in the following related study of Grünewald et al [29]. The 1p36.22 variants associated with Ewing's sarcoma are located 25 kb proximal to the TARDBP gene. TARDBP (Tat activating regulatory DNA-binding protein, or TDP-43, transactive response DNA-binding protein) is a highly conserved DNA- and RNA-binding protein involved in RNA transcription and splicing. The 10q21 variants strongly associated with Ewing's sarcoma are located in a block containing four genes: ADO (encoding cysteamine dioxygenase), ZNF365 (encoding zinc-finger protein 365), EGR2 (encoding early growth response protein 2) and LOC107984012 (unknown function). A further association with osteosarcoma in Guangxi population was studied by Zhao et al [97] regarding the Rho GTPase-activating protein 35 (ARHGAP35), a Rho family GTPase-activating protein. Finally Savage et al [11] GWAS found associations with osteosarcoma and GMR4 (glutamate receptor metabotropic 4), which were part of our meta-analysis and ADAMTS protein family, as ADAM Metallopeptidase with Thrombospondin Type 1 Motif 17. Of note, most statistically significant associations based on single studies did not have a statistically significant eQTL effect.

Network and pathway analysis findings

Using the 36 genes whose SNPs were significantly associated with sarcoma risk (including data from both meta-analysis and single studies) and were also characterized by a significant eQTL effect, we found that the corresponding protein products interact with each other beyond chance (observed edges: 120; expected edges: 12; PPI enrichment P-value <10E-20), with an average node degree equal to 6.7 (see Figure 2). Such enrichment indicates that the input molecules - as a whole group - are at least partially biologically connected. This high connectivity prompted us to conduct pathway analysis, which showed that the identified network is significantly enriched in DNA repair proteins, as shown in Table 4.
Figure 2

Network analysis of proteins encoded by genes whose variants associated with sarcoma risk and characterized by an expression quantitative trait locus effect (eQTL)

The figure illustrates the high degree of connectivity of these proteins, which result to be enriched in DNA repair pathway components.

Table 4

Pathway analysis main findings: gene set enrichment analysis based on 36 sarcoma risk genes. Enrichments with at least ten overlapping genes are shown

PathwayOverlapFDRGenesDatabase
Base excision repair (BER)11/1390.002374441BLM;RAD50; PARP4; RECQL5; LIG1; MPG; PARP2; ERCC4; PNKP; FANCG; POLHGO biol process
DNA 3' dephosphorylation involved in DNA repair10/1200.002376199BLM; RAD50; PARP4; RECQL5; LIG1; PARP2; ERCC4; PNKP; FANCG; POLHGO biol process
DNA dealkylation involved in DNA repair12/1280.000983329BLM; RAD50; PARP4; RECQL5; LIG1; MPG; MGMT; PARP2; ERCC4; PNKP; FANCG; POLHGO biol process
DNA ligation involved in DNA repair11/1320.002374441BLM; RAD50; PARP4; RECQL5; LIG1; MGMT; PARP2; ERCC4; PNKP; FANCG; POLHGO biol process
DNA repair18/2858.22494E-05BLM; LIG1; CCNH; XRCC5; PARP2; MGMT; MPG; POLM; PNKP; FANCG; BRIP1; RAD50; NEIL2; ERCC4; ERCC2; ATM; ERCC5; POLHReactome
DNA synthesis involved in DNA repair12/1420.001514863BLM; BRIP1; RAD50; PARP4; RECQL5; LIG1; PARP2; ERCC4; PNKP; ATM; FANCG; POLHGO biol process
Double-strand break repair (DSBR)12/1640.002374441BLM; BRIP1; RAD50; PARP4; RECQL5; LIG1; XRCC5; PARP2; ERCC4; PNKP; FANCG; POLHGO biol process
Mismatch repair (MMR)10/1400.005835867BLM; RAD50; PARP4; RECQL5; LIG1; PARP2; ERCC4; PNKP; FANCG; POLHGO biol process
Mitochondrial DNA repair10/1230.002552586BLM; RAD50; PARP4; RECQL5; LIG1; PARP2; ERCC4; PNKP; FANCG; POLHGO biol process
Non homologous end joining (NHEJ)10/1200.002376199BLM; RAD50; PARP4; RECQL5; LIG1; PARP2; ERCC4; PNKP; FANCG; POLHGO biol process
Nucleotide excision repair (NER)11/1380.002374441BLM; RAD50; PARP4; RECQL5; LIG1; PARP2; ERCC4; PNKP; ERCC2; FANCG; POLHGO biol process
Nucleotide phosphorylation involved in DNA repair10/1200.002376199BLM; RAD50; PARP4; RECQL5; LIG1; PARP2; ERCC4; PNKP; FANCG; POLHGO biol process
Homologous recombination (HR)10/1320.00369711BLM; RAD50; PARP4; RECQL5; LIG1; PARP2; ERCC4; PNKP; FANCG; POLHGO biol process
Single strand break repair (SSBR)11/1240.001805921BLM; RAD50; PARP4; RECQL5; LIG1; PARP2; ERCC4; PNKP; APTX; FANCG; POLHGO biol process
UV-damage excision repair11/1580.003533915BLM; RAD50; PARP4; RECQL5; LIG1; PARP2; ERCC4; PNKP; EIF2AK4; FANCG; POLHGO biol process
XPC complex (NER)15/1608.19637E-06WWOX; CCNH; XRCC5; MGMT; CD3EAP; FANCG; POC5; ERCC4; ERCC2; MDM2; OBFC1; ATM; ERCC5; POLH; UGT1A6Jenesen compartments

FDR: false discovery rate.

Network analysis of proteins encoded by genes whose variants associated with sarcoma risk and characterized by an expression quantitative trait locus effect (eQTL)

The figure illustrates the high degree of connectivity of these proteins, which result to be enriched in DNA repair pathway components. FDR: false discovery rate. In particular, many sarcoma risk genes appear to be involved in all main DNA repair pathways, including single strand break repair pathways (base excision repair [BER], nucleotide excision repair [NER], mismatch repair [MMR]) and double strand repair pathways (non homologous end joining [NHEJ], homologous recombination [HR]).

DISCUSSION

We described the findings of the first field synopsis and meta-analysis dedicated to the relationship between germline DNA variation and risk of developing bone and soft tissue sarcomas, which is based on genotyping data from 90 studies enrolling almost 48,000 people with a control-to-case ratio equal to 2. The resulting knowledgebase will be hosted by our cancer-dedicated website (at www.mmmp.org) [100] as a freely available online data repository that will be annually updated. Overall, our findings support the hypothesis that genetic polymorphism does contribute to sarcoma susceptibility. This is exemplified by the population attributable risk (PAR=37.2%) calculated for three SNPs associated with the risk of sarcoma at a high level of evidence (rs11599754 of ZNF365/EGR2, rs231775 of CTLA4, and rs454006 of PRKCG), which indicates that more than one third of sarcoma cases would not occur in a hypothetical population where these three risk variants were absent. This remarkable influence of just three SNPs is linked not only to the high frequency of the risk alleles but also to the interesting fact that the risk, defined as odds ratio, associated with single variants ranged between 1.35 and 1.48, which are values higher than those usually observed for other malignancies such as breast [101], colorectal [102], and gastric carcinomas [103], which generally include odds ratios between 1.10 and 1.30. Considering that the mean risk among variants significantly associated with sarcoma predisposition was even higher (approximately 1.70, see Table 2), one might speculate that germline DNA variation is especially important in the determinism of the susceptibility to this family of tumors. Overall, the quality of the available data, which was thoroughly assessed by means of both Venice criteria and false positive report probability (FPRP), was satisfactory considering that the statistically significant evidence on 47 of 55 variants for which a meta-analysis was feasible was classified as high to moderate level of quality with 10 SNPs considered adequate according to the FPRP (Table 2). A statistically significant association was also demonstrated for additional 906 SNPs, for which only a single data source was available, which pinpoints the urgent need for replication studies in order to validate or refute these findings. Conventional meta-analysis of single variants led us to identify 55 SNPs significantly associated with sarcoma risk (Table 2), and additional 53 SNPs were reported in single studies (Table 3): these variants are linked to a variety of genes whose protein products are involved in several cell activities. Therefore, we tried to provide readers with a preliminary interpretation of these findings from the functional biology viewpoint. Using modern SNP-to-gene and gene-to-function approaches such as integrative analysis of genetic variation with expression quantitative trait locus (eQTL) data [9] and respectively pathway/network analysis [8], we hypothesize that germline variation of the DNA repair machinery might be of special relevance for the development of this type of cancer (Figure 2). This finding – which has been very recently confirmed in patients with Ewing's sarcoma [104] - is in line with the complex gene and chromosome abnormalities that characterized some sarcoma histologies, as well as with the epidemiological observation that people accidentally [105] or therapeutically [106] exposed to ionizing radiations and thus prone to develop DNA damage are at higher risk of different types of sarcomas. In this regard, it is interesting to note that peripheral blood mononuclear cells of patients diagnosed with sarcomas show a higher sensitivity to mutagens in vitro as compared to controls [107], which supports the hypothesis that the genetic background can make the difference on an individual basis in terms of response to environmental carcinogens potentially involved in sarcomagenesis. Finally, also somatic DNA alterations appear to confer a defective DNA repair capability to some sarcoma types such as Ewing's sarcoma [108], and thus the combinatory study of germline and somatic DNA variations characterizing sarcomas might lead to better understand the cascade of molecular events underlying sarcomagenesis, as recently proposed for the EWSR1-FLI1 fusion gene and the SNPs near EGR2 in Ewing's sarcoma patients [29]. Overall, these converging data suggest that more investigation aimed to fully elucidate whether the germline individual capacity of repairing genomic damage can actually affect the predisposition to a complex and heterogeneous trait such as sarcomas might be particularly fruitful. In our work we also confirmed the association between sarcoma risk and variants of single genes, such as ZNF365, ADO, EGR2, CTLA4, TP53, CD86, NUDT6, MDM2, ERCC5 and ADAMTS6 just to mention the top ten by statistical significance. Many of these genes are not known to be involved in DNA repair and thus the relationship between these single gene findings and network/pathway analysis might appear of unclear interpretation and doubtful importance. However, we must remember that current evidence (and thus our analysis) is based on 88 candidate gene studies and only two GWAS: therefore, more extensive investigation is needed on the variation of pathways for which data on single genes are currently available. In this regard, our meta-analysis data can be utilized to inform future studies on candidate pathways whose genetic variation could affect sarcoma susceptibility. This systematic review also underscores the main limitation of the evidence on the genetic susceptibility of sarcomas. In fact, most of current information is driven by data from studies investigating bone tumors (78 of 90, 86.6%). Studies focusing on soft tissue sarcomas are thus eagerly awaited, the formation of international consortia being advocated in order to overcome the hurdle of disease rarity. Hopefully, technological improvements in direct DNA sequencing such as next generation sequencing (NGS) methods will further accelerate the discovery pace in this field of investigation, as recently reported [104]. Nevertheless, we also recognize some limitations of this synopsis: data from different tumor types and population ethnicity were pooled together to find associations despite the diversity of sarcoma histologies, leading to high level of between-study heterogeneity. To overcome to this limitation we performed subgroup and sensitivity analysis whenever possible. Moreover, despite our efforts to avoid the issue of overlapping series, it is always possible that partial overlaps between multiple series published by the same research groups that cannot be detected by full text reading did remain included in pooled analyses: however, we believe that the influence of this potential residual overlapping on the overall results is reasonably low. In conclusion, we hope that the creation of the first knowledgebase dedicated to the relationship between germline DNA variation and sarcoma risk can not only represent a valuable reference for investigators involved in sarcoma research but also inform future studies based on the gaps of the current literature.

MATERIALS AND METHODS

Search strategy, eligibility criteria, quality score assessment and data extraction

This study followed the principles proposed by the Human Genome Epidemiology Network (HuGeNet) for the systematic review of molecular association studies [109]. We considered eligible all the studies concerning the association between any genetic variant and the predisposition to sarcoma in humans, providing the raw data necessary to calculate risk of developing a sarcoma or the summary data. Exclusion criteria were: virus-induced sarcomas (HHV8 - Kaposi sarcoma); sarcomas secondary to radiation therapy; sarcomas secondary to burns/scars/surgery; associations between mitochondrial DNA variations and sarcomas; gastrointestinal stromal tumors (GIST). Database search of original articles analyzing the association between any genetic variant and susceptibility to sarcoma was conducted independently by two investigators though the following database: MEDLINE (via the PubMed gateway); The Cochrane Library; Scopus; Web of Science. The search included the following three groups of keywords: 1) sarcoma, solitary fibrous tumor, chordoma, tenosynovitis, fibromatosis, desmoids, myofibroblastic, myopericytoma, myxoma, Ewing, desmoplastic, PEComa, haemangioendothelioma, lymphangioma, myoepithelioma; 2) risk, sarcomagenesis, tumorigenesis, predisposition, susceptibility; 3) polymorphism, SNP, variant, genome wide association study and its acronym GWAS. Searches were conducted using all combinations of at least one keyword from each group. References from eligible articles were also used to refine the literature search. The quality of the studies was evaluated according to Newcastle-Ottawa quality assessment scale (NOS) [110]. In brief, the following three parameters were evaluated with a “star system”: the selection of the study groups (0 to 4 “stars”), the comparability of the groups (0 to 2 “stars”), and the ascertainment of either the exposure or outcome of interest for case-control or cohort studies respectively (0 to 3 “stars”). The maximum total score was 9 “stars” and represented the highest quality. Data were extracted independently by two investigators using a template. Every disagreement was resolved by a third investigator in order to reach consensus. Authors were contacted whenever unreported data were potentially useful to enable the inclusion of the study into the systematic review. The data extracted from eligible studies were: authors, journal, year of publication, region or country where the study was conducted, hospital where the patients were diagnosed, number of patients with sarcoma enrolled and healthy control subjects, period of enrolment, prevalent ethnicity (>80%, categorized in Caucasian, Asian, African and mixed), subjects age, genetic polymorphisms and allelic frequency in both cases and controls (if no raw data were available, summary data were collected, i.e. odds ratios and confidence intervals), study design (population-based versus hospital-based), statistical methods used, and sarcoma histology. We considered data published in different articles by the same Author/s with the same (or similar) number of subjects enrolled in the same period of time in the same hospital, to be derived by the same group of patients. In publications with either overlapping cases or controls, the most recent or largest population was chosen. For analysis purposes, the search was closed in August 2017.

Statistical analysis

We calculated summary odds ratios (ORs) and their corresponding 95% confidence intervals (95%CI) starting from raw data to measure the strength of association between each polymorphism and sarcoma risk. Whenever possible, we calculated the pooled ORs assuming 3 different genetic models: per-allele (additive), dominant and recessive. If the included studies reported exclusively per-allele ORs, as in GWAS, we calculated the pooled OR assuming the per-allele (additive) model. Random effects meta-analysis based on the inverse variance method was used to calculate summary ORs; this model reduces to a fixed effect meta-analysis if between-study heterogeneity is absent. We chose this model for the large between-study heterogeneity usually expected in genetic association studies. A meta-analysis was performed only if at least two independent data sources were available. In case of GWAS, we considered as data source the joint analysis between the discovery and the validation phases. Subgroup analysis by histological subtype (Ewing's sarcoma vs osteosarcoma) was planned if data permitted. Regarding ethnicity, analyses were divided in 4 groups: African (if the datasets were all African population-based), Asian (if the datasets were all Asian population-based), Caucasian (if the datasets were all Caucasian population-based), and mixed (if the datasets were African, Asian and Caucasian or if the datasets were from mixed ethnicity). In order to test any dominant study driving effect, sensitivity analysis by ethnicity (Asian vs Caucasian/other) was performed in mixed meta-analyses, with more than two datasets, excluding either the Asian study or the Caucasian study from the meta-analysis. Between-study heterogeneity was formally assessed by the Cochran Q-test and the I-squared statistic, the latter indicating the proportion of the variability in effect estimates linked to true between-study heterogeneity as opposed to within-study sampling error. All statistical analyses were performed with RevMan 5 (Review Manager computer program, version 5.3; Copenhagen, The Nordic Cochrane Centre, The Cochrane Collaboration, 2014).

Assessment of cumulative evidence

With the aim to assess the credibility of statistically significant associations based on the results of data meta-analysis, we used the Venice criteria [111]. In brief, we defined credibility levels based on the strength (classified as A=strong, B=moderate or C=weak) of three following parameters: amount of the evidence, replication of the association and protection from bias. We graded the amount of evidence, which approximately depends on the study sample size, based on the sum of cases and controls. Grade A, B or C was assigned to meta-analyses with total sample size >1000, 100–1000 and <100, respectively. Also, the replication of the association was graded considering the amount of between-study heterogeneity. We assigned grade A, B or C to meta-analyses with I-squared <25%, 25–50% and >50%, respectively. We graded protection from bias as A if no bias was observed, B if bias was potentially present or C if bias was evident. While assessing protection from bias we also considered the magnitude of the association. We assigned a score of C to an association characterized by a summary OR<1.15 or a summary OR>0.87 if the effect of the polymorphism was protective. In addition to the Venice criteria, we assessed the noteworthiness of significant findings by calculating the false positive report probability (FPRP) [112], which is defined as the probability of no true association between a genetic variant and disease (null hypothesis) given a statistically significant finding. FPRP is based not only on the observed P-value of the association test but also on the statistical power of the test and on the prior probability that the molecular association is real following a Bayesian approach. We calculated FPRP values for two levels of prior probabilities: at a low prior (10E-3) that would be similar to what is expected for a candidate variant, and at a very low prior (10E-6) that would be similar to what would be expected for a random variant. To classify a significant association as ‘noteworthy’, we used a FPRP cut-off value of 0.2. Overall, we defined the credibility level of the cumulative evidence as high (Venice criteria A grades only coupled with “noteworthy” finding at FPRP analysis), low (one or more C grades combined with lack of noteworthiness), or intermediate (for all other combinations). To estimate the impact of genetic variation on the risk of sarcomas, we calculated the so called population attributable risk (PAR) using the following formula: Pr (RR − 1)/[1 + Pr (RR − 1)], where Pr is the proportion of control subjects exposed to the allele of interest and the relative risk (RR) was estimated using the summary estimates (i.e. ORs) calculated by the meta-analysis. The joint PAR for combinations of polymorphisms was calculated as follows: 1 − (∏1→n[1 − PARi]), where PARi corresponds to the individual PAR of the ith polymorphism and n is the number of polymorphisms considered [113].

Network and pathway analysis

In order to explore the mechanisms underlying the pathogenesis of sarcomas, we utilized network and pathway analysis to test the hypothesis that genes whose variations are associated with sarcoma risk interact with each other possibly within the frame of some specific molecular pathways [8]. To this aim, we first selected SNPs significantly associated with sarcoma risk. In case of SNPs located in intergenic regions we selected the first closest and the second closest genes, not necessarily upstream and downstream of the SNPs of interest. Since most SNPs are intergenic or intronic and thus no obvious functional effect can be inferred, expression quantitative trait locus (eQTL) analysis was used to identify genes whose expression is affected by DNA variants [114]. The resulting gene list was the input for both network and pathway analysis. For the former, the STRING web server was employed to study protein-protein interaction (PPI) across the selected genes [115], the confidence score being set >0.4. As a measure of across network connectivity STRING provides the average node degree, where degree is the conceptually simplest centrality measure as it measures the number of edges between protein connections attached to a protein; moreover, STRING computes the PPI enrichment P-value, which is significant when input proteins have more interactions among themselves than what would be expected for a random set of proteins of similar size, drawn from the genome. As regards pathway analysis, the Enrichr web server was utilized to identify in our list over-representation of genes involved in specific pathways described in dedicated databases [116]. Hypergeometric distribution with Fisher's exact test was used to calculate the statistical significance of gene overlapping, followed by correction for multiple hypotheses testing using the false discovery rate [FDR] method.

Declarations

Ethics approval and consent to participate: Not applicable Consent for publication: Not applicable Availability of data and material: All data generated or analysed during this study are included in this published article [and its supplementary information files].
  114 in total

1.  Correlation between TGF-β1 gene 29 T > C single nucleotide polymorphism and clinicopathological characteristics of osteosarcoma.

Authors:  Yang Wu; Jinmin Zhao; Maolin He
Journal:  Tumour Biol       Date:  2015-02-10

2.  Association of interleukin-12 polymorphisms and serum IL-12p40 levels with osteosarcoma risk.

Authors:  Junli Wang; Legen Nong; Yesheng Wei; Shanyu Qin; You Zhou; Yujin Tang
Journal:  DNA Cell Biol       Date:  2013-08-30       Impact factor: 3.311

3.  Targeting the DNA repair pathway in Ewing sarcoma.

Authors:  Elizabeth Stewart; Ross Goshorn; Cori Bradley; Lyra M Griffiths; Claudia Benavente; Nathaniel R Twarog; Gregory M Miller; William Caufield; Burgess B Freeman; Armita Bahrami; Alberto Pappo; Jianrong Wu; Amos Loh; Åsa Karlström; Chris Calabrese; Brittney Gordon; Lyudmila Tsurkan; M Jason Hatfield; Philip M Potter; Scott E Snyder; Suresh Thiagarajan; Abbas Shirinifard; Andras Sablauer; Anang A Shelat; Michael A Dyer
Journal:  Cell Rep       Date:  2014-10-23       Impact factor: 9.423

4.  Single nucleotide polymorphisms in VEGF gene are associated with an increased risk of osteosarcoma.

Authors:  Zhang Tie; Rui Bai; Zhongwen Zhai; Gang Zhang; Hong Zhang; Zhenqun Zhao; Deshan Zhou; Wanlin Liu
Journal:  Int J Clin Exp Pathol       Date:  2014-10-15

5.  Cytotoxic T-lymphocyte antigen-4 polymorphisms and susceptibility to osteosarcoma.

Authors:  Yang Liu; Zhimin He; Dapeng Feng; Guodong Shi; Rui Gao; Xiaodong Wu; Weiguo Song; Wen Yuan
Journal:  DNA Cell Biol       Date:  2011-05-25       Impact factor: 3.311

6.  Effect of CYP1A1 and GSTM1 genetic polymorphisms on bone tumor susceptibility.

Authors:  L Li; J G Li; C Y Liu; Y J Ding
Journal:  Genet Mol Res       Date:  2015-12-11

7.  Comprehensive mapping of p53 pathway alterations reveals an apparent role for both SNP309 and MDM2 amplification in sarcomagenesis.

Authors:  Moriko Ito; Louise Barys; Terence O'Reilly; Sophie Young; Bella Gorbatcheva; John Monahan; Sabine Zumstein-Mecker; Peter F Choong; Ian Dickinson; Philip Crowe; Christine Hemmings; Jayesh Desai; David M Thomas; Joanna Lisztwan
Journal:  Clin Cancer Res       Date:  2010-12-15       Impact factor: 12.531

8.  Cytotoxic T-lymphocyte antigen-4 genetic variants and risk of Ewing's sarcoma.

Authors:  Dapeng Feng; Xinghai Yang; Shufeng Li; Tielong Liu; Zhipeng Wu; Yuanlin Song; Jian Wang; Wenwu Gao; Quan Huang; Wending Huang; Wei Zheng; Jianru Xiao
Journal:  Genet Test Mol Biomarkers       Date:  2013-03-12

9.  Association of a missense single nucleotide polymorphism, Cys1367Arg of the WRN gene, with the risk of bone and soft tissue sarcomas in Japan.

Authors:  Robert Nakayama; Yasunori Sato; Mitsuko Masutani; Hideki Ogino; Fumihiko Nakatani; Hirokazu Chuman; Yasuo Beppu; Hideo Morioka; Hiroo Yabe; Hiroshi Hirose; Haruhiko Sugimura; Hiromi Sakamoto; Tsutomu Ohta; Yoshiaki Toyama; Teruhiko Yoshida; Akira Kawai
Journal:  Cancer Sci       Date:  2008-02       Impact factor: 6.716

10.  An avaII restriction fragment length polymorphism in the insulin-like growth factor II gene and the occurrence of smooth muscle tumors.

Authors:  T Gloudemans; I Pospiech; L T Van der Ven; C J Lips; W Den Otter; J S Sussenbach
Journal:  Cancer Res       Date:  1993-12-01       Impact factor: 12.701

View more
  3 in total

1.  Associations of clock genes polymorphisms with soft tissue sarcoma susceptibility and prognosis.

Authors:  Clara Benna; Senthilkumar Rajendran; Giovanna Spiro; Saveria Tropea; Paolo Del Fiore; Carlo Riccardo Rossi; Simone Mocellin
Journal:  J Transl Med       Date:  2018-12-05       Impact factor: 5.531

2.  Systematic Review of the Current Status of Human Sarcoma Cell Lines.

Authors:  Emi Hattori; Rieko Oyama; Tadashi Kondo
Journal:  Cells       Date:  2019-02-13       Impact factor: 6.600

3.  Long Leukocyte Telomere Length Is Associated with Increased Risks of Soft Tissue Sarcoma: A Mendelian Randomization Study.

Authors:  Yifan Xu; Junfeng Xu; Haidee Chancoco; Maosheng Huang; Keila E Torres; Jian Gu
Journal:  Cancers (Basel)       Date:  2020-03-05       Impact factor: 6.639

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.