| Literature DB >> 31439692 |
Michael N Edmonson1, Aman N Patel1, Dale J Hedges1, Zhaoming Wang1, Evadnie Rampersaud1, Chimene A Kesserwan2, Xin Zhou1, Yanling Liu1, Scott Newman1, Michael C Rusch1, Clay L McLeod1, Mark R Wilkinson1, Stephen V Rice1, Thierry Soussi3,4,5, J Paul Taylor6,7, Michael Benatar8, Jared B Becksfort1, Kim E Nichols2, Leslie L Robison9, James R Downing10, Jinghui Zhang1.
Abstract
Variant interpretation in the era of massively parallel sequencing is challenging. Although many resources and guidelines are available to assist with this task, few integrated end-to-end tools exist. Here, we present the Pediatric Cancer Variant Pathogenicity Information Exchange (PeCanPIE), a web- and cloud-based platform for annotation, identification, and classification of variations in known or putative disease genes. Starting from a set of variants in variant call format (VCF), variants are annotated, ranked by putative pathogenicity, and presented for formal classification using a decision-support interface based on published guidelines from the American College of Medical Genetics and Genomics (ACMG). The system can accept files containing millions of variants and handle single-nucleotide variants (SNVs), simple insertions/deletions (indels), multiple-nucleotide variants (MNVs), and complex substitutions. PeCanPIE has been applied to classify variant pathogenicity in cancer predisposition genes in two large-scale investigations involving >4000 pediatric cancer patients and serves as a repository for the expert-reviewed results. PeCanPIE was originally developed for pediatric cancer but can be easily extended for use for nonpediatric cancers and noncancer genetic diseases. Although PeCanPIE's web-based interface was designed to be accessible to non-bioinformaticians, its back-end pipelines may also be run independently on the cloud, facilitating direct integration and broader adoption. PeCanPIE is publicly available and free for research use.Entities:
Mesh:
Year: 2019 PMID: 31439692 PMCID: PMC6724669 DOI: 10.1101/gr.250357.119
Source DB: PubMed Journal: Genome Res ISSN: 1088-9051 Impact factor: 9.043
Figure 1.Overview of variant classification using PeCanPIE. (A) Overview of processing steps from VCF through ACMG-based classification. Variant counts at each processing step for whole-exome sequencing data generated from a germline sample of a patient with acute lymphoblastic leukemia (ALL), SJNORM015857_G1 (Methods) (B) and whole-genome sequencing data generated from Genome in a Bottle normal sample NA12878_HG001 (Methods) (C).
Figure 2.Design of the MedalCeremony pipeline for automated germline variant classification. Truncating variants in loss-of-function genes (e.g., tumor suppressors) and those matching highly curated databases receive gold medals. Truncations in non-loss-of-function genes, in-frame indels, variants predicted to be damaging, and matches to additional databases receive silver medals. Otherwise, variants predicted to be tolerated by damage-prediction algorithms receive bronze medals. Imperfect database matches receive a lower-grade medal than exact matches. Variants not meeting any of the prior criteria are labeled “unknown.”
Databases used in classification
Figure 3.Annotation interface. Excerpts of PeCanPIE annotation interface. (A) Results for Genome in a Bottle WGS data set. Variant page details for NOTCH1 R1350L: (B) variant population frequency detail from ExAC non-TCGA database; (C) functional predictions. (D) Functional data display for TP53 gene: Functional assay results for N235S and G245S show that N235S appears functionally benign, whereas G245S appears functionally damaging.
Figure 4.ACMG classification on ETV6. (Top) ProteinPaint display of somatic ETV6 variants across 11 subtypes of pediatric leukemia, showing enrichment of loss-of-function mutations (frameshifts in red, nonsense variants in orange). Arrow indicates position of germline R359* variant. (Bottom) Detail of PeCanPIE ACMG classification interface for R359* variant.
Figure 5.ACMG classification of NRAS G12S. (Top) ProteinPaint display of somatic NRAS variants across 28 cancer subtypes, showing hotspot at G12. Arrow indicates position of germline G12S variant. (Bottom) Detail of PeCanPIE ACMG classification interface for G12S variant. Automated classification detected a pathogenic ClinVar variant at the same amino acid position as well as a hotspot in COSMIC data.
Figure 6.ACMG classification of SOD1 Ala5Val. (Top) ProteinPaint display of somatic SOD1 variants; arrow indicates position of Ala5Val. (Bottom) Detail of PeCanPIE ACMG classification interface for this variant.