Literature DB >> 25484339

DISEASES: text mining and data integration of disease-gene associations.

Sune Pletscher-Frankild1, Albert Pallejà2, Kalliopi Tsafou1, Janos X Binder3, Lars Juhl Jensen4.   

Abstract

Text mining is a flexible technology that can be applied to numerous different tasks in biology and medicine. We present a system for extracting disease-gene associations from biomedical abstracts. The system consists of a highly efficient dictionary-based tagger for named entity recognition of human genes and diseases, which we combine with a scoring scheme that takes into account co-occurrences both within and between sentences. We show that this approach is able to extract half of all manually curated associations with a false positive rate of only 0.16%. Nonetheless, text mining should not stand alone, but be combined with other types of evidence. For this reason, we have developed the DISEASES resource, which integrates the results from text mining with manually curated disease-gene associations, cancer mutation data, and genome-wide association studies from existing databases. The DISEASES resource is accessible through a web interface at http://diseases.jensenlab.org/, where the text-mining software and all associations are also freely available for download.
Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Data integration; Information extraction; Named entity recognition; Text mining; Web resource

Mesh:

Year:  2014        PMID: 25484339     DOI: 10.1016/j.ymeth.2014.11.020

Source DB:  PubMed          Journal:  Methods        ISSN: 1046-2023            Impact factor:   3.608


  183 in total

1.  Data integration of structured and unstructured sources for assigning clinical codes to patient stays.

Authors:  Elyne Scheurwegs; Kim Luyckx; Léon Luyten; Walter Daelemans; Tim Van den Bulcke
Journal:  J Am Med Inform Assoc       Date:  2015-08-27       Impact factor: 4.497

2.  Study of genetic correlation between children's sleep and obesity.

Authors:  Hao Mei; Fan Jiang; Lianna Li; Michael Griswold; Shijian Liu; Thomas Mosley
Journal:  J Hum Genet       Date:  2020-06-18       Impact factor: 3.172

3.  PertInInt: An Integrative, Analytical Approach to Rapidly Uncover Cancer Driver Genes with Perturbed Interactions and Functionalities.

Authors:  Shilpa Nadimpalli Kobren; Bernard Chazelle; Mona Singh
Journal:  Cell Syst       Date:  2020-07-14       Impact factor: 10.304

Review 4.  Exploring the dark genome: implications for precision medicine.

Authors:  Tudor I Oprea
Journal:  Mamm Genome       Date:  2019-07-04       Impact factor: 2.957

5.  A network-based machine-learning framework to identify both functional modules and disease genes.

Authors:  Kuo Yang; Kezhi Lu; Yang Wu; Jian Yu; Baoyan Liu; Yi Zhao; Jianxin Chen; Xuezhong Zhou
Journal:  Hum Genet       Date:  2021-01-07       Impact factor: 4.132

6.  GWAB: a web server for the network-based boosting of human genome-wide association data.

Authors:  Jung Eun Shim; Changbae Bang; Sunmo Yang; Tak Lee; Sohyun Hwang; Chan Yeong Kim; U Martin Singh-Blom; Edward M Marcotte; Insuk Lee
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

Review 7.  Ecological Sensing Through Taste and Chemosensation Mediates Inflammation: A Biological Anthropological Approach.

Authors:  Cristina Giuliani; Claudio Franceschi; Donata Luiselli; Paolo Garagnani; Stanley Ulijaszek
Journal:  Adv Nutr       Date:  2020-11-16       Impact factor: 8.701

8.  Proteins Altered by Surgical Weight Loss Highlight Biomarkers of Insulin Resistance in the Community.

Authors:  Ravi V Shah; Shih-Jen Hwang; Ashish Yeri; Kahraman Tanriverdi; Alexander R Pico; Chen Yao; Venkatesh Murthy; Jennifer Ho; Olga Vitseva; Danielle Demarco; Sajani Shah; Mark D Iafrati; Daniel Levy; Jane E Freedman
Journal:  Arterioscler Thromb Vasc Biol       Date:  2019-01       Impact factor: 8.311

9.  Cytoscape StringApp: Network Analysis and Visualization of Proteomics Data.

Authors:  Nadezhda T Doncheva; John H Morris; Jan Gorodkin; Lars J Jensen
Journal:  J Proteome Res       Date:  2018-12-05       Impact factor: 4.466

10.  RGS6 Mediates Effects of Voluntary Running on Adult Hippocampal Neurogenesis.

Authors:  Yu Gao; Minjie Shen; Jose Carlos Gonzalez; Qiping Dong; Sudharsan Kannan; Johnson T Hoang; Brian E Eisinger; Jyotsna Pandey; Sahar Javadi; Qiang Chang; Daifeng Wang; Linda Overstreet-Wadiche; Xinyu Zhao
Journal:  Cell Rep       Date:  2020-08-04       Impact factor: 9.423

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.