| Literature DB >> 31053132 |
Jason E Miller1, Raghu P Metpally2, Thomas N Person2, Sarathbabu Krishnamurthy3, Venkata Ramesh Dasari3, Manu Shivakumar2, Daniel R Lavage2, Adam M Cook3, David J Carey3, Marylyn D Ritchie1, Dokyoon Kim2,4,5,6, Radhika Gogoi7.
Abstract
BACKGROUND: Endometrial cancer (EMCA) is the fifth most common cancer among women in the world. Identification of potentially pathogenic germline variants from individuals with EMCA will help characterize genetic features that underlie the disease and potentially predispose individuals to its pathogenesis.Entities:
Keywords: DiscovEHR; Endometrial Cancer; Germline variants; TCGA; Uterine Cancer; Whole exome sequencing
Mesh:
Year: 2019 PMID: 31053132 PMCID: PMC6499978 DOI: 10.1186/s12920-019-0504-9
Source DB: PubMed Journal: BMC Med Genomics ISSN: 1755-8794 Impact factor: 3.063
Summary of rare pathogenic variant distribution across cohorts
| EMCA | NCC | OHRM | |
|---|---|---|---|
| Participantsa | 297 | 2120 | 1486 |
| Participants with variants after filterb | 85 (28.6%) | 628 (29.6%) | 462 (31.1%) |
| Genes with variants after filterc | 62 | 211 | 205 |
| Loci with variantsd | 73 | 485 | 371 |
| Rare variant burden across participants in each cohorte | 99 | 791 | 593 |
Summary level data of participants from WES and rare variant analysis. The total number of participants included in each cohort (a). The total number of participants from “a” that had at least one rare variant that met criteria from Fig. 1 workflow, (b). The number of genes with at least one rare variant that met workflow criteria (c). The number of unique variants present after filtering using the bioinformatics pipeline in Fig. 1 (d). The total number of unique and non-unique rare variants present across the participants in the cohort (e)
Fig. 1Waterfall plot of all genes with pathogenic variants. Waterfall plot of all EMCA samples that contained rare variants that passed the filter from Additional file 1: Figure S1. The main heatmap contains columns which represent an individual participant (N = 86), and rows that represent genes, while the color that fills in the cell represents the type of variant present for a specific participant in a specific gene. The heatmap below illustrates that histology, cancer stage and patient survival status, each column represents a different participant. “Undiff” refers to undifferentiated histology. The graph to the left shows the percentage of participants who have a rare variant in a gene, relative to all participants with variants, while the bar plot above the main graph represents the variant burden for each participant
Fig. 2Distribution of types of rare variants across cohorts. The total number of unique variants are represented by their VEP annotation. Additionally, each type of variant is grouped by the variant category (a). The percentage of each variant category represented in each cohort (b). The overlap between genes with variants represented in each cohort (c). The number of unique loci (variants) that overlap between each cohort (d)
Fig. 3Non-synonymous and pLoF variants among EMCA to non-cancer control cohort. a For each gene with at least two variants in both EMCA and NCC, the ratio of non-synonymous variants across the EMCA cohort was divided by those in the NCC after adjusting for differences in cohort size. Orange, blue and red lines are used to delineate 2, 1 and 0.5 fold EMCA burden relative to the NCC cohort. b The total number of rare non-synonymous variants from each cohort for each gene. c and d the same as a and b, respectively, except pLoF variants were used
Fig. 4Overlap between DiscovEHR and TCGA germline variants from EMCA and uterine cancer samples. Rare potentially pathogenic variants were identified from the germline uterine cancer cohort in TCGA. The overlap between TCGA variants and those from this study (DiscovEHR) is illustrated in the Venn diagram (a). The overlap of genes from TCGA and the genes with variants from this EMCA cohort from this study (b)