
REDBot: Natural language process methods for clinical copy number variation reporting in prenatal and products of conception diagnosis.

Mengmeng Liu, Yunshan Zhong, Hongqian Liu, Desheng Liang, Erhong Liu, Yu Zhang, Feng Tian, Qiaowei Liang, David S Cram, Hua Wang, Lingqian Wu, Fuli Yu.

Abstract

BACKGROUND: Current copy number variation (CNV) identification methods have matured rapidly. However, postdetection processes such as variant interpretation and reporting remain inefficient. To address this, we developed REDBot as an automated software package for accurate and direct generation of clinical diagnostic reports for prenatal and products of conception (POC) samples.
METHODS: We applied natural language processing (NLP) methods to analyze 30,235 in-house historical clinical reports through active learning, and then developed clinical knowledge bases, evidence-based interpretation methods, and reporting criteria to support the whole postdetection pipeline.
RESULTS: Of the 30,235 reports, we obtained 37,175 CNV-paragraph pairs. For these pairs, the active learning approaches achieved a 0.9466 average F1-score in sentence classification. The overall accuracy for variant classification was 95.7%, 95.2%, and 100.0% in retrospective, prospective, and clinical utility experiments, respectively.
CONCLUSION: By integrating NLP methods into the CNV postdetection pipeline, REDBot is a robust and rapid tool with clinical utility for prenatal and POC diagnosis.
© 2020 The Authors. Molecular Genetics & Genomic Medicine published by Wiley Periodicals LLC.
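
The RESULTS report an average F1-score for sentence classification. The abstract does not say how the average was taken, so the sketch below assumes a macro average (the unweighted mean of per-class F1). The class labels and the example data are hypothetical and are not taken from the paper.

```python
from collections import Counter


def per_class_f1(y_true, y_pred):
    """Return {label: F1} from paired true and predicted labels."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted label p was wrong
            fn[t] += 1  # true label t was missed
    scores = {}
    for label in set(y_true) | set(y_pred):
        # F1 = 2TP / (2TP + FP + FN); defined as 0.0 when the denominator is 0
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores[label] = 2 * tp[label] / denom if denom else 0.0
    return scores


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (assumed averaging scheme)."""
    scores = per_class_f1(y_true, y_pred)
    return sum(scores.values()) / len(scores)


# Hypothetical sentence labels, for illustration only.
y_true = ["evidence", "conclusion", "conclusion", "other", "evidence", "conclusion"]
y_pred = ["evidence", "conclusion", "other", "other", "conclusion", "conclusion"]

print(round(macro_f1(y_true, y_pred), 4))
```

A weighted or micro average would give a different number whenever class sizes are unbalanced, so the reported figure should be read with the paper's own definition in mind.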


Year:  2020        PMID: 32961042      PMCID: PMC7667294          DOI: 10.1002/mgg3.1488

Source DB:  PubMed          Journal:  Mol Genet Genomic Med        ISSN: 2324-9269            Impact factor:   2.183


