Phrase mining of textual data to analyze extracellular matrix protein patterns across cardiovascular disease.

David A Liem, Sanjana Murali, Dibakar Sigdel, Yu Shi, Xuan Wang, Jiaming Shen, Howard Choi, John H Caufield, Wei Wang, Peipei Ping, JiaWei Han.

Abstract

Extracellular matrix (ECM) proteins have been shown to play important roles in regulating multiple biological processes in an array of organ systems, including the cardiovascular system. Using a novel bioinformatics text-mining tool, we studied six categories of cardiovascular disease (CVD), namely, ischemic heart disease, cardiomyopathies, cerebrovascular accident, congenital heart disease, arrhythmias, and valve disease, anticipating novel ECM protein-disease and protein-protein relationships hidden within vast quantities of textual data. We conducted a phrase-mining analysis, delineating the relationships of 709 ECM proteins with the 6 groups of CVDs reported in 1,099,254 abstracts. The technology pipeline known as Context-Aware Semantic Online Analytical Processing was applied to semantically rank the association of proteins with each CVD and with all six CVDs, performing analyses to quantify each protein-disease relationship. We performed principal component analysis and hierarchical clustering of the data, where each protein was represented as a six-dimensional vector. We found that ECM proteins display variable degrees of association with the six CVDs; certain CVDs share groups of associated proteins, whereas others have divergent protein associations. We identified 82 ECM proteins sharing associations with all 6 CVDs. Our bioinformatics analysis ascribed distinct ECM pathways (via Reactome) to this subset of proteins, namely, insulin-like growth factor regulation and interleukin-4 and interleukin-13 signaling, suggesting their contribution to the pathogenesis of all six CVDs. Finally, we performed hierarchical clustering analysis and identified protein clusters predominantly associated with a targeted CVD; analyses of these proteins revealed unexpected insights underlying the key ECM-related molecular pathogenesis of each CVD, including virus assembly and release in arrhythmias.
NEW & NOTEWORTHY The present study is the first application of a text-mining algorithm to characterize the relationships of 709 extracellular matrix-related proteins with 6 categories of cardiovascular disease described in 1,099,254 abstracts. Our analysis informed unexpected extracellular matrix functions, pathways, and molecular relationships implicated in the six cardiovascular diseases.
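
The analysis step described in the abstract, in which each protein is treated as a six-dimensional vector of CVD association scores and then examined by principal component analysis and hierarchical clustering, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the protein labels (P1 to P6), the score values, the CVD abbreviations, the choice of Euclidean distance with average linkage, and the helper names `pca_project` and `average_linkage`. It is not the study's pipeline and does not reproduce its data or results.

```python
import numpy as np

# Hypothetical abbreviations for the six CVD categories (column order).
CVDS = ["IHD", "CM", "CVA", "CHD", "ARR", "VD"]

# Hypothetical proteins and made-up association scores (rows = proteins).
proteins = ["P1", "P2", "P3", "P4", "P5", "P6"]
scores = np.array([
    [0.9, 0.8, 0.1, 0.0, 0.1, 0.0],
    [0.8, 0.9, 0.0, 0.1, 0.0, 0.1],
    [0.1, 0.0, 0.9, 0.8, 0.1, 0.0],
    [0.0, 0.1, 0.8, 0.9, 0.0, 0.1],
    [0.1, 0.0, 0.1, 0.0, 0.9, 0.8],
    [0.0, 0.1, 0.0, 0.1, 0.8, 0.9],
])


def pca_project(data, n_components=2):
    """Project rows onto the top principal components using SVD."""
    centered = data - data.mean(axis=0)
    # Rows of vt are the principal axes, ordered by decreasing variance.
    _, singular_values, vt = np.linalg.svd(centered, full_matrices=False)
    explained = singular_values**2 / np.sum(singular_values**2)
    return centered @ vt[:n_components].T, explained[:n_components]


def average_linkage(data, n_clusters):
    """Naive agglomerative clustering (Euclidean distance, average linkage)."""
    clusters = [[i] for i in range(len(data))]
    dist = np.linalg.norm(data[:, None, :] - data[None, :, :], axis=2)
    while len(clusters) > n_clusters:
        best = (np.inf, 0, 1)
        # Find the pair of clusters with the smallest mean pairwise distance.
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                d = dist[np.ix_(clusters[a], clusters[b])].mean()
                if d < best[0]:
                    best = (d, a, b)
        _, a, b = best
        clusters[a] = clusters[a] + clusters[b]
        del clusters[b]
    labels = np.empty(len(data), dtype=int)
    for label, members in enumerate(clusters):
        labels[members] = label
    return labels


projected, explained = pca_project(scores, n_components=2)
labels = average_linkage(scores, n_clusters=3)

for name, point, label in zip(proteins, projected, labels):
    print(f"{name}: PC1={point[0]:+.2f} PC2={point[1]:+.2f} cluster={label}")
```

The naive clustering loop is adequate for a toy matrix; for a matrix on the scale of 709 proteins, a library routine for hierarchical clustering would be the more practical choice.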

Keywords:  big data; machine learning; relationship discovery; text mining

Year:  2018        PMID: 29775406      PMCID: PMC6230912          DOI: 10.1152/ajpheart.00175.2018

Source DB:  PubMed          Journal:  Am J Physiol Heart Circ Physiol        ISSN: 0363-6135            Impact factor:   4.733


