Robust and efficient identification of biomarkers by classifying features on graphs.

TaeHyun Hwang, Hugues Sicotte, Ze Tian, Baolin Wu, Jean-Pierre Kocher, Dennis A Wigle, Vipin Kumar, Rui Kuang.

Abstract

MOTIVATION: A central problem in biomarker discovery from large-scale gene expression or single nucleotide polymorphism (SNP) data is the computational challenge of taking into account the dependence among all the features. Methods that ignore the dependence usually identify biomarkers that are not reproducible across independent datasets. We introduce a new graph-based semi-supervised feature classification algorithm to identify discriminative disease markers by learning on bipartite graphs. Our algorithm directly classifies the feature nodes in a bipartite graph as positive, negative or neutral with network propagation to capture the dependence among both samples and features (clinical and genetic variables) by exploring bi-cluster structures in a graph. Two features of our algorithm are: (1) it can find a globally optimal labeling that captures the dependence among all the features and thus generates highly reproducible results across independent microarray or other high-throughput datasets; (2) it is capable of handling hundreds of thousands of features and thus is particularly useful for biomarker identification from high-throughput gene expression and SNP data. In addition, although designed for classifying features, our algorithm can also simultaneously classify test samples for disease prognosis/diagnosis.
RESULTS: We applied the network propagation algorithm to study three large-scale breast cancer datasets. Our algorithm achieved competitive classification performance compared with SVMs and other baseline methods, and identified several markers with clinical or biological relevance to the disease. More importantly, our algorithm also identified highly reproducible marker genes and enriched functions from the independent datasets.
AVAILABILITY: Supplementary results and source code are available at http://compbio.cs.umn.edu/Feature_Class.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
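
The abstract does not give the update rule, normalization, or thresholds, so the following is only a minimal sketch of generic label propagation on a sample-feature bipartite graph, not the authors' implementation. It assumes non-negative edge weights (for example, expression levels), symmetric degree normalization, and a hypothetical mixing parameter `alpha`; the function names `propagate_bipartite` and `label_features` and the toy data are illustrative only. Labeled samples start at +1 or -1, test samples and all features start at 0, and scores are passed back and forth across the graph until they stabilize. Features are then called positive, negative, or neutral by thresholding their scores.

```python
import numpy as np

def propagate_bipartite(W, y_samples, alpha=0.5, n_iter=200, tol=1e-9):
    """Generic label propagation on a bipartite graph (illustrative sketch).

    W         : (n_samples, n_features) non-negative edge weights.
    y_samples : initial sample labels, +1 / -1 for training, 0 for test.
    alpha     : assumed mixing weight between propagated and initial labels.
    Returns (sample_scores, feature_scores).
    """
    W = np.asarray(W, dtype=float)
    y = np.asarray(y_samples, dtype=float)

    # Symmetric degree normalization: S = D_s^{-1/2} W D_f^{-1/2}.
    d_s = W.sum(axis=1)
    d_f = W.sum(axis=0)
    d_s[d_s == 0] = 1.0  # avoid division by zero for isolated nodes
    d_f[d_f == 0] = 1.0
    S = W / np.sqrt(d_s)[:, None] / np.sqrt(d_f)[None, :]

    f_s = y.copy()
    f_f = np.zeros(W.shape[1])
    for _ in range(n_iter):
        # Features carry no initial labels, so they only receive
        # propagated scores from the samples they are connected to.
        new_f = alpha * (S.T @ f_s)
        # Samples mix propagated feature scores with their initial labels.
        new_s = alpha * (S @ new_f) + (1 - alpha) * y
        delta = max(np.abs(new_f - f_f).max(), np.abs(new_s - f_s).max())
        f_f, f_s = new_f, new_s
        if delta < tol:
            break
    return f_s, f_f

def label_features(scores, threshold):
    """Map scores to +1 (positive), -1 (negative), or 0 (neutral)."""
    scores = np.asarray(scores, dtype=float)
    return np.where(scores > threshold, 1, np.where(scores < -threshold, -1, 0))

# Toy data: 4 labeled samples, 1 unlabeled test sample, 3 features.
W_demo = [[5, 1, 3],
          [5, 1, 3],
          [1, 5, 3],
          [1, 5, 3],
          [5, 1, 3]]
y_demo = [1, 1, -1, -1, 0]
sample_scores, feature_scores = propagate_bipartite(W_demo, y_demo)
feature_labels = label_features(feature_scores, threshold=0.05)
```

In the toy data the first feature is weighted toward the positive samples, the second toward the negative samples, and the third is uniform, so their scores separate accordingly. The unlabeled fifth sample also receives a score, which mirrors the abstract's remark that test samples can be classified simultaneously.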

Year:  2008        PMID: 18653521     DOI: 10.1093/bioinformatics/btn383

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937

