| Literature DB >> 32211398 |
Aayushi Srivastava1,2,3,4, Sara Giangiobbe1,4, Abhishek Kumar1, Nagarajan Paramasivam5, Dagmara Dymerska6, Wolfgang Behnisch7, Mathias Witzens-Harig4, Jan Lubinski6, Kari Hemminki1,8, Asta Försti1,2,3, Obul Reddy Bandapalli1,2,3,4.
Abstract
Hodgkin lymphoma (HL) is a lymphoproliferative malignancy of B-cell origin that accounts for 10% of all lymphomas. Despite evidence suggesting strong familial clustering of HL, there is no clear understanding of the contribution of genes predisposing to HL. In this study, whole genome sequencing (WGS) was performed on 7 affected and 9 unaffected family members from three HL-prone families and variants were prioritized using our Familial Cancer Variant Prioritization Pipeline (FCVPPv2). WGS identified a total of 98,564, 170,550, and 113,654 variants which were reduced by pedigree-based filtering to 18,158, 465, and 26,465 in families I, II, and III, respectively. In addition to variants affecting amino acid sequences, variants in promoters, enhancers, transcription factors binding sites, and microRNA seed sequences were identified from upstream, downstream, 5' and 3' untranslated regions. A panel of 565 cancer predisposing and other cancer-related genes and of 2,383 potential candidate HL genes were also screened in these families to aid further prioritization. Pathway analysis of segregating genes with Combined Annotation Dependent Depletion Tool (CADD) scores >20 was performed using Ingenuity Pathway Analysis software which implicated several candidate genes in pathways involved in B-cell activation and proliferation and in the network of "Cancer, Hematological disease and Immunological Disease." We used the FCVPPv2 for further in silico analyses and prioritized 45 coding and 79 non-coding variants from the three families. Further literature-based analysis allowed us to constrict this list to one rare germline variant each in families I and II and two in family III. Functional studies were conducted on the candidate from family I in a previous study, resulting in the identification and functional validation of a novel heterozygous missense variant in the tumor suppressor gene DICER1 as potential HL predisposition factor. We aim to identify the individual genes responsible for predisposition in the remaining two families and will functionally validate these in further studies.Entities:
Keywords: familial Hodgkin lymphoma; genetic predisposition to disease; germline variants; next generation sequencing; predisposing genes; variant prioritization; whole genome sequencing
Year: 2020 PMID: 32211398 PMCID: PMC7067901 DOI: 10.3389/fbioe.2020.00179
Source DB: PubMed Journal: Front Bioeng Biotechnol ISSN: 2296-4185
Figure 1Pedigrees of the three HL families analyzed in this study. (A) Family 1, (B) Family 2, and (C) Family 3.
Top missense variants prioritized using the FCVPPv2.
| Family_1 | 17_48746518_C_T | ABCC3 | p.P652L | 22.6 | 2 | 7 | Uncertain significance |
| Family_1 | 1_49052793_G_A | AGBL4 | p.R384C | 35 | 2 | 7 | Uncertain significance |
| Family_1 | 5_139909090_A_G | ANKHD1, ANKHD1-EIF4EBP3 | p.N2187D | 25.2 | . | 6 | Uncertain significance |
| Family_1 | 1_160164884_T_C | CASQ1 | p.I183T | 26.5 | 2 | 10 | Likely Benign |
| Family_1 | 6_159206584_G_A | EZR | p.P75L | 32 | 3 | 9 | Uncertain significance |
| Family_1 | 12_8192537_G_A | FOXJ2 | p.G37R | 29.9 | 3 | 9 | Uncertain significance |
| Family_1 | 14_88729713_C_T | KCNK10 | p.A79T | 27.1 | 3 | 6 | Uncertain significance |
| Family_1 | 10_88705360_G_A | MMRN2 | p.P58L | 29.8 | 2 | 7 | Uncertain significance |
| Family_1 | 5_36962227_G_A | NIPBL | p.R154Q | 27.5 | 3 | 8 | Uncertain significance |
| Family_1 | 2_206614449_A_G | NRP2 | p.D596G | 23.1 | 2 | 9 | Uncertain significance |
| Family_1 | 6_52372363_G_C | TRAM2 | p.A205G | 29.8 | 3 | 10 | Uncertain significance |
| Family_1 | 22_18613830_C_T | TUBA8 | p.A450V | 24.7 | 3 | 10 | Uncertain significance |
| Family_1 | X_47272364_G_A | ZNF157 | p.G298R | 27.6 | 2 | 6 | Uncertain significance |
| Family_2 | 9_116151739_G_C | ALAD | p.I243M | 22.9 | 2 | 6 | Uncertain significance |
| Family_2 | 9_131196759_G_T | CERCAM | p.A468S | 24.5 | 2 | 7 | Uncertain significance |
| Family_3 | 9_139917418_C_T | ABCA2 | p.G83S | 26.3 | 2 | 7 | Uncertain significance |
| Family_3 | 17_40971572_G_C | BECN1 | p.P85R | 23.4 | 3 | 6 | Uncertain significance |
| Family_3 | 8_67968830_G_T | COPS5 | p.P131T | 23.9 | 3 | 9 | Uncertain significance |
| Family_3 | 3_5246773_C_T | EDEM1 | p.T160M | 34 | 3 | 9 | Uncertain significance |
| Family_3 | 6_131191103_G_A | EPB41L2 | p.S736F | 22.1 | 3 | 7 | Uncertain significance |
| Family_3 | 8_28575243_G_A | EXTL3 | p.R172H | 23 | 3 | 6 | Likely Benign |
| Family_3 | 14_74970734_C_T | LTBP2 | p.G1493R | 27.7 | 3 | 10 | Uncertain significance |
| Family_3 | 3_196730925_C_A | MFI2 | p.D662Y | 34 | 3 | 6 | Uncertain significance |
| Family_3 | 17_27441099_G_A | MYO18A | p.A843V | 24.3 | 3 | 6 | Uncertain significance |
| Family_3 | 19_14584756_A_G | PTGER1 | p.L126P | 25.9 | 2 | 6 | Uncertain significance |
| Family_3 | 3_49138083_G_A | QARS | p.R301C | 34 | 2 | 9 | Uncertain significance |
| Family_3 | 11_9838541_C_T | SBF2 | p.R1275H | 33 | 2 | 9 | Likely Benign |
| Family_3 | 20_35467682_G_A | SOGA1 | p.R46C | 32 | 3 | 7 | Uncertain significance |
| Family_3 | 1_43891311_G_A | SZT2 | p.A96T | 31 | 3 | 6 | Uncertain significance |
Chromosomal positions, classifications, PHRED-like CADD scores, protein changes and the number of positive intolerance (Int) and deleteriousness (Del) scores are shown for each variant. Variants of interest that were validated by Sanger sequencing in the provided family samples have been shown in bold.
Top non-sense variants prioritized using the FCVPPv2.
| Family_1 | 10_88911115_AGT_A | FAM35A | Frameshift deletion | p.2_2del | 25.8 | 2 | PM2 |
| Family_1 | 1_177923437_CTG_C | SEC16B | Frameshift deletion | p.481_481del | 36 | 0 | Uncertain significance |
| Family_1 | 15_91546350_TG_T | VPS33B | Frameshift deletion | p.P321fs | 36 | 3 | PVS1 |
| Family_3 | 7_31683260_AT_A | CCDC129 | Frameshift deletion | p.D611fs | 34 | 0 | Uncertain significance |
| Family_3 | 1_21267855_C_T | EIF4G3 | Stopgain SNV | p.W7X | 14.54 | 2 | PVS1, PP3 (1) |
| Family_3 | 15_24921469_G_A | NPAP1 | Stopgain SNV | p.W152X | 24.8 | 0 | PM2, PP3 (3) |
| Family_3 | 1_241958547_CAG_C | WDR64 | Frameshift deletion | p.836_836del | 37 | 0 | Uncertain significance |
[I]VarSome Scores.
PM2, Pathogenic Moderat; PP3, Pathogenic Supporting (no. of scores predicting pathogenicity); Uncertain Significance: No scores could be found for the variant in question, PVS1, Pathogenic Very Strong.
Chromosomal positions, classifications, PHRED-like CADD scores, protein changes, the number of positive intolerance (Int) and VarSome prediction scores are included for each variant. Variants of interest that were validated by Sanger sequencing in the provided family samples have been shown in bold.
Variants corresponding to genes present in the panel of 565 known cancer predisposition genes from a study by Zhang et al. (2015).
| 10499 | 1 | NCOA2 | 8_71316112_T_TCCTCCTCCC | Indel | Upstream | Nuclear receptor coactivator 2 | 15.56 | Other CancerGene | |
| 8880 | 1 | FUBP1 | 1_78414225_A_G | SNVs | UTR3 | Far upstream element (FUSE) binding protein 1 | 13.59 | Other CancerGene | |
| 23157 | 1 | SEPT6 | X_118751062_CGTGT_C | Indel | UTR3 | Septin 6 | 10.56 | Other CancerGene | |
| 23405 | 1 | DICER1 | 14_95560456_A_C | SNVs | Non-synonymous SNV | Dicer 1, ribonuclease type III | 24 | DICER1 syndrome, Familial Multinodular Goiter | Autosomal Dominant |
| 7430 | 1 | EZR | 6_159206584_G_A | SNVs | Non-synonymous SNV | Ezrin | 32 | Other CancerGene | |
| 604 | 3 | BCL6 | 3_187463568_C_A | SNVs | Upstream; downstream | B-cell CLL/lymphoma 6 | 13 | Other CancerGene | |
| 4026 | 3 | LPP | 3_188123978_G_T | SNVs | Stopgain SNV | LIM domain containing preferred translocation partner in lipoma | 40 | Other CancerGene | |
| 3 | LPP | 3_188123979_A_T | SNVs | Non-synonymous SNV | LIM domain containing preferred translocation partner in lipoma | 32 | Other CancerGene | ||
| 3 | LPP | 3_188608373_A_T | SNVs | UTR3 | LIM domain containing preferred translocation partner in lipoma | 10.5 | Other CancerGene | ||
| 5892 | 3 | RAD51D | 17_33428327_G_A | SNVs | Non-synonymous SNV | RAD51 paralog D | 27 | Other CancerGene | |
| 5727 | 3 | PTCH1 | 9_98270531_C_A | SNVs | Non-synonymous SNV | Patched 1 | 20.4 | Gorlin syndrome | Autosomal Dominant |
Variants corresponding to genes intersecting with the list of 2,383 high-risk HL genes from a study by Rotunno et al. (2016).
| ABHD16A | 6_31670740_A_T | 3 | 6_31671105_G_A | 13.23 | SNVs | UTR5 | – | |
| C6orf62 | 6_24719009_T_C | rs147402940 | 3 | 6_24705773_T_C | 12.31 | SNVs | UTR3 | – |
| CEP120 | 5_122758609_G_T | rs141808885 | 1 | 5_122681069_C_T | 12.31 | SNVs | UTR3 | – |
| EDEM1 | 3_5257909_A_G | rs139745426 | 3 | 3_5246773_C_T | 34 | SNVs | Non-synonymous SNV | p.T160M |
| EIF4G3 | 1_21494519_T_C | 3 | 1_21267855_C_T | 14.54 | SNVs | stopgain SNV | p.W7X | |
| EPB41L2 | 6_131202023_A_G | 3 | 6_131191103_G_A | 22.1 | SNVs | Non-synonymous SNV | p.S736F | |
| EXTL3 | 8_28609630_G_A | rs191528081 | 3 | 8_28575243_G_A | 23 | SNVs | Non-synonymous SNV | p.R172H |
| FAM35A | 10_88917757_A_G | rs371636091 | 1 | 10_88911115_AGT_A | 25.8 | Indel | Frameshift deletion | p.2_2del |
| FUK | 16_70507153_G_A | 1 | 16_70501193_C_T | 10.08 | SNVs | UTR3 | - | |
| HLTF | 3_148757909_A_G | rs61750365 | 1 | 3_148802664_C_T | 37 | SNVs | stopgain SNV | p.W11X |
| LPP | 3_188464149_C_T | 3 | 3_188608373_A_T | 10.5 | SNVs | UTR3 | – | |
| LPP | 3_188464149_C_T | 3 | 3_188123978_G_T | 40 | SNVs | stopgain SNV | p.E24X | |
| LPP | 3 | 3_188123979_A_T | 32 | SNVs | Non-synonymous SNV | p.E24V | ||
| LRP6 | 12_12419973_G_T | 3 | 12_12272924_AATATATATATATATATATATATATATATATATATATAT_A | 12.55 | Indel | UTR3 | – | |
| LTBP2 | 14_74983553_G_A | rs145851939 | 3 | 14_74970734_C_T | 27.7 | SNVs | Non-synonymous SNV | p.G1493R |
| MAPKAP1 | 9_128199718_AT_ | 3 | 9_128199770_TAA_T | 14.3 | Indel | UTR3 | – | |
| MARCH10 | 17_60827878_G_A | rs112201730 | 3 | 17_60885673_G_A | 12.03 | SNVs | UTR5 | – |
| MYO18A | 17_27448659_C_T | rs371862120 | 3 | 17_27441099_G_A | 24.3 | SNVs | Non-synonymous SNV | p.A843V |
| NCAM1 | 11_113113556_A_G | 1 | 11_113134920_C_A | 11.07 | SNVs | UTR3 | – | |
| NIPBL | 5_36876673_G_A | 1 | 5_36962227_G_A | 27.5 | SNVs | Non-synonymous SNV | p.R154Q | |
| PHC2 | 1_33820711_G_A | 3 | 1_33896663_C_A | 15.12 | SNVs | upstream | – | |
| RCN1 | 11_32126524_C_T | 3 | 11_32112681_C_T | 10.66 | SNVs | UTR5 | – | |
| SBF2 | 11_9985135_TAAT_ | 3 | 11_9838541_C_T | 33 | SNVs | Non-synonymous SNV | p.R1275H | |
| SLMAP | 3_57914019_A_C | rs191613999 | 1 | 3_57742023_C_G | 13.51 | SNVs | UTR5 | – |
| SZT2 | 1_43885320_C_T | 3 | 1_43891311_G_A | 31 | SNVs | Non-synonymous SNV | p.A96T | |
Variant details from both databases the present study and the study by Rotunno et al. (.
Figure 2The top three molecular networks identified by Ingenuity Pathway Analysis: (A) Network 1. Cancer, hematological disease, immunological disease; (B) Network 2. developmental disorder, endocrine system disorders, hereditary disorder; (C) Network 3. RNA post-transcriptional modification, cell death and survival, cellular movement. Genes from our input-data are shown in gray, genes from our prioritized candidate list are highlighted in peach.