Jing Xu1, Xiangdong Liu2, Qiming Dai3. 1. Department of Clinical Laboratory, ZhongDa Hospital, Southeast University, Nanjing, China. 2. Institute of Life Science, Southeast University, Nanjing, China. 3. Department of Cardiology, ZhongDa Hospital, Southeast University, Nanjing, China. qimingdai@icloud.com.
Abstract
BACKGROUND: Hypertrophic cardiomyopathy (HCM) represents one of the most common inherited heart diseases. To identify key molecules involved in the development of HCM, gene expression patterns of the heart tissue samples in HCM patients from multiple microarray and RNA-seq platforms were investigated. METHODS: The significant genes were obtained through the intersection of two gene sets, corresponding to the identified differentially expressed genes (DEGs) within the microarray data and within the RNA-Seq data. Those genes were further ranked using minimum-Redundancy Maximum-Relevance feature selection algorithm. Moreover, the genes were assessed by three different machine learning methods for classification, including support vector machines, random forest and k-Nearest Neighbor. RESULTS: Outstanding results were achieved by taking exclusively the top eight genes of the ranking into consideration. Since the eight genes were identified as candidate HCM hallmark genes, the interactions between them and known HCM disease genes were explored through the protein-protein interaction (PPI) network. Most candidate HCM hallmark genes were found to have direct or indirect interactions with known HCM diseases genes in the PPI network, particularly the hub genes JAK2 and GADD45A. CONCLUSIONS: This study highlights the transcriptomic data integration, in combination with machine learning methods, in providing insight into the key hallmark genes in the genetic etiology of HCM.
BACKGROUND:Hypertrophic cardiomyopathy (HCM) represents one of the most common inherited heart diseases. To identify key molecules involved in the development of HCM, gene expression patterns of the heart tissue samples in HCM patients from multiple microarray and RNA-seq platforms were investigated. METHODS: The significant genes were obtained through the intersection of two gene sets, corresponding to the identified differentially expressed genes (DEGs) within the microarray data and within the RNA-Seq data. Those genes were further ranked using minimum-Redundancy Maximum-Relevance feature selection algorithm. Moreover, the genes were assessed by three different machine learning methods for classification, including support vector machines, random forest and k-Nearest Neighbor. RESULTS: Outstanding results were achieved by taking exclusively the top eight genes of the ranking into consideration. Since the eight genes were identified as candidate HCM hallmark genes, the interactions between them and known HCM disease genes were explored through the protein-protein interaction (PPI) network. Most candidate HCM hallmark genes were found to have direct or indirect interactions with known HCM diseases genes in the PPI network, particularly the hub genes JAK2 and GADD45A. CONCLUSIONS: This study highlights the transcriptomic data integration, in combination with machine learning methods, in providing insight into the key hallmark genes in the genetic etiology of HCM.
Authors: Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras Journal: Bioinformatics Date: 2012-10-25 Impact factor: 6.937
Authors: Matthew E Ritchie; Belinda Phipson; Di Wu; Yifang Hu; Charity W Law; Wei Shi; Gordon K Smyth Journal: Nucleic Acids Res Date: 2015-01-20 Impact factor: 16.971
Authors: Perry M Elliott; Aris Anastasakis; Michael A Borger; Martin Borggrefe; Franco Cecchi; Philippe Charron; Albert Alain Hagege; Antoine Lafont; Giuseppe Limongelli; Heiko Mahrholdt; William J McKenna; Jens Mogensen; Petros Nihoyannopoulos; Stefano Nistri; Petronella G Pieper; Burkert Pieske; Claudio Rapezzi; Frans H Rutten; Christoph Tillmanns; Hugh Watkins Journal: Eur Heart J Date: 2014-08-29 Impact factor: 29.983
Authors: Daniel Castillo; Juan Manuel Gálvez; Luis Javier Herrera; Belén San Román; Fernando Rojas; Ignacio Rojas Journal: BMC Bioinformatics Date: 2017-11-21 Impact factor: 3.169
Authors: Andrea Martins-da-Silva; Mirella Baroni; Karina Bezerra Salomão; Pablo Ferreira das Chagas; Ricardo Bonfim-Silva; Lenisa Geron; Gustavo Alencastro Veiga Cruzeiro; Wilson Araújo da Silva; Carolina Alves Pereira Corrêa; Carlos Gilberto Carlotti; Rosane Gomes de Paula Queiroz; Suely Kazue Nagahashi Marie; Silvia Regina Brandalise; José Andrés Yunes; Carlos Alberto Scrideli; Elvis Terci Valera; Luiz Gonzaga Tone Journal: Cell Mol Neurobiol Date: 2022-04-02 Impact factor: 5.046