BACKGROUND: Despite the large increase of transcriptomic studies that look for gene signatures on diseases, there is still a need for integrative approaches that obtain separation of multiple pathological states providing robust selection of gene markers for each disease subtype and information about the possible links or relations between those genes. RESULTS: We present a network-oriented and data-driven bioinformatic approach that searches for association of genes and diseases based on the analysis of genome-wide expression data derived from microarrays or RNA-Seq studies. The approach aims to (i) identify gene sets associated to different pathological states analysed together; (ii) identify a minimum subset within these genes that unequivocally differentiates and classifies the compared disease subtypes; (iii) provide a measurement of the discriminant power of these genes and (iv) identify links between the genes that characterise each of the disease subtypes. This bioinformatic approach is implemented in an R package, named geNetClassifier, available as an open access tool in Bioconductor. To illustrate the performance of the tool, we applied it to two independent datasets: 250 samples from patients with four major leukemia subtypes analysed using expression arrays; another leukemia dataset analysed with RNA-Seq that includes a subtype also present in the previous set. The results show the selection of key deregulated genes recently reported in the literature and assigned to the leukemia subtypes studied. We also show, using these independent datasets, the selection of similar genes in a network built for the same disease subtype. CONCLUSIONS: The construction of gene networks related to specific disease subtypes that include parameters such as gene-to-gene association, gene disease specificity and gene discriminant power can be very useful to draw gene-disease maps and to unravel the molecular features that characterize specific pathological states. The application of the bioinformatic tool here presented shows a neat way to achieve such molecular characterization of the diseases using genome-wide expression data.
BACKGROUND: Despite the large increase of transcriptomic studies that look for gene signatures on diseases, there is still a need for integrative approaches that obtain separation of multiple pathological states providing robust selection of gene markers for each disease subtype and information about the possible links or relations between those genes. RESULTS: We present a network-oriented and data-driven bioinformatic approach that searches for association of genes and diseases based on the analysis of genome-wide expression data derived from microarrays or RNA-Seq studies. The approach aims to (i) identify gene sets associated to different pathological states analysed together; (ii) identify a minimum subset within these genes that unequivocally differentiates and classifies the compared disease subtypes; (iii) provide a measurement of the discriminant power of these genes and (iv) identify links between the genes that characterise each of the disease subtypes. This bioinformatic approach is implemented in an R package, named geNetClassifier, available as an open access tool in Bioconductor. To illustrate the performance of the tool, we applied it to two independent datasets: 250 samples from patients with four major leukemia subtypes analysed using expression arrays; another leukemia dataset analysed with RNA-Seq that includes a subtype also present in the previous set. The results show the selection of key deregulated genes recently reported in the literature and assigned to the leukemia subtypes studied. We also show, using these independent datasets, the selection of similar genes in a network built for the same disease subtype. CONCLUSIONS: The construction of gene networks related to specific disease subtypes that include parameters such as gene-to-gene association, gene disease specificity and gene discriminant power can be very useful to draw gene-disease maps and to unravel the molecular features that characterize specific pathological states. The application of the bioinformatic tool here presented shows a neat way to achieve such molecular characterization of the diseases using genome-wide expression data.
Authors: Kira Orlovsky; Alexander Kalinkovich; Tanya Rozovskaia; Elias Shezen; Tomer Itkin; Hansjuerg Alder; Hatice Gulcin Ozer; Letizia Carramusa; Abraham Avigdor; Stefano Volinia; Arthur Buchberg; Alex Mazo; Orit Kollet; Corey Largman; Carlo M Croce; Tatsuya Nakamura; Tsvee Lapidot; Eli Canaani Journal: Proc Natl Acad Sci U S A Date: 2011-04-25 Impact factor: 11.205
Authors: C M Woolthuis; L Han; R N Verkaik-Schakel; D van Gosliga; P M Kluin; E Vellenga; J J Schuringa; G Huls Journal: Leukemia Date: 2011-10-11 Impact factor: 11.528
Authors: Torsten Haferlach; Alexander Kohlmann; Lothar Wieczorek; Giuseppe Basso; Geertruy Te Kronnie; Marie-Christine Béné; John De Vos; Jesus M Hernández; Wolf-Karsten Hofmann; Ken I Mills; Amanda Gilkes; Sabina Chiaretti; Sheila A Shurtleff; Thomas J Kipps; Laura Z Rassenti; Allen E Yeoh; Peter R Papenhausen; Wei-Min Liu; P Mickey Williams; Robin Foà Journal: J Clin Oncol Date: 2010-04-20 Impact factor: 44.544
Authors: Aedín C Culhane; Markus S Schröder; Razvan Sultana; Shaita C Picard; Enzo N Martinelli; Caroline Kelly; Benjamin Haibe-Kains; Misha Kapushesky; Anne-Alyssa St Pierre; William Flahive; Kermshlise C Picard; Daniel Gusenleitner; Gerald Papenhausen; Niall O'Connor; Mick Correll; John Quackenbush Journal: Nucleic Acids Res Date: 2011-11-21 Impact factor: 16.971
Authors: Maria Mesuraca; Emanuela Chiarella; Stefania Scicchitano; Bruna Codispoti; Marco Giordano; Giovanna Nappo; Heather M Bond; Giovanni Morrone Journal: Biomed Res Int Date: 2015-12-16 Impact factor: 3.411
Authors: Beatriz Roson-Burgo; Fermin Sanchez-Guijo; Consuelo Del Cañizo; Javier De Las Rivas Journal: BMC Genomics Date: 2016-11-21 Impact factor: 3.969
Authors: Jorge Martinez-Romero; Santiago Bueno-Fortes; Manuel Martín-Merino; Ana Ramirez de Molina; Javier De Las Rivas Journal: BMC Genomics Date: 2018-12-11 Impact factor: 3.969