OBJECTIVE: To extract disorder-associated genes from the scientific literature in PubMed with greater sensitivity for literature-based support than existing methods. METHODS: We developed a PubMed query to retrieve disorder-related, original research articles. Then we applied a rule-based text-mining algorithm with keyword matching to extract target disorders, genes with significant results, and the type of study described by the article. RESULTS: We compared our resulting candidate disorder genes and supporting references with existing databases. We demonstrated that our candidate gene set covers nearly all genes in manually curated databases, and that the references supporting the disorder-gene link are more extensive and accurate than other general purpose gene-to-disorder association databases. CONCLUSIONS: We implemented a novel publication search tool to find target articles, specifically focused on links between disorders and genotypes. Through comparison against gold-standard manually updated gene-disorder databases and comparison with automated databases of similar functionality we show that our tool can search through the entirety of PubMed to extract the main gene findings for human diseases rapidly and accurately.
OBJECTIVE: To extract disorder-associated genes from the scientific literature in PubMed with greater sensitivity for literature-based support than existing methods. METHODS: We developed a PubMed query to retrieve disorder-related, original research articles. Then we applied a rule-based text-mining algorithm with keyword matching to extract target disorders, genes with significant results, and the type of study described by the article. RESULTS: We compared our resulting candidate disorder genes and supporting references with existing databases. We demonstrated that our candidate gene set covers nearly all genes in manually curated databases, and that the references supporting the disorder-gene link are more extensive and accurate than other general purpose gene-to-disorder association databases. CONCLUSIONS: We implemented a novel publication search tool to find target articles, specifically focused on links between disorders and genotypes. Through comparison against gold-standard manually updated gene-disorder databases and comparison with automated databases of similar functionality we show that our tool can search through the entirety of PubMed to extract the main gene findings for human diseases rapidly and accurately.
Authors: Nicole C Allen; Sachin Bagade; Matthew B McQueen; John P A Ioannidis; Fotini K Kavvoura; Muin J Khoury; Rudolph E Tanzi; Lars Bertram Journal: Nat Genet Date: 2008-07 Impact factor: 38.330
Authors: Stuart I Davidson; Yu Liu; Patrick A Danoy; Xin Wu; Gethin P Thomas; Lei Jiang; Linyun Sun; Niansong Wang; Jun Han; Huanxing Han; Peter M Visscher; Matthew A Brown; Huji Xu Journal: Ann Rheum Dis Date: 2010-11-10 Impact factor: 19.103
Authors: Mariken de Krom; Wouter G Staal; Roel A Ophoff; Judith Hendriks; Jan Buitelaar; Barbara Franke; Maretha V de Jonge; Patrick Bolton; David Collier; Sarah Curran; Herman van Engeland; Jan M van Ree Journal: Biol Psychiatry Date: 2008-12-05 Impact factor: 13.382
Authors: Casper Shyr; Maja Tarailo-Graovac; Michael Gottlieb; Jessica J Y Lee; Clara van Karnebeek; Wyeth W Wasserman Journal: BMC Med Genomics Date: 2014-12-03 Impact factor: 3.063
Authors: Maude M David; David Enard; Alp Ozturk; Jena Daniels; Jae-Yoon Jung; Leticia Diaz-Beltran; Dennis P Wall Journal: PLoS One Date: 2016-07-14 Impact factor: 3.240