Literature DB >> 31271844

Cancer classification and pathway discovery using non-negative matrix factorization.

Zexian Zeng1, Andy H Vo2, Chengsheng Mao1, Susan E Clare3, Seema A Khan4, Yuan Luo5.   

Abstract

OBJECTIVES: Extracting genetic information from a full range of sequencing data is important for understanding disease. We propose a novel method to effectively explore the landscape of genetic mutations and aggregate them to predict cancer type.
DESIGN: We applied non-smooth non-negative matrix factorization (nsNMF) and support vector machine (SVM) to utilize the full range of sequencing data, aiming to better aggregate genetic mutations and improve their power to predict disease type. More specifically, we introduce a novel classifier to distinguish cancer types using somatic mutations obtained from whole-exome sequencing data. Mutations were identified from multiple cancers and scored using SIFT, PP2, and CADD, and collapsed at the individual gene level. nsNMF was then applied to reduce dimensionality and obtain coefficient and basis matrices. A feature matrix was derived from the obtained matrices to train a classifier for cancer type classification with the SVM model.
RESULTS: We have demonstrated that the classifier was able to distinguish four cancer types with reasonable accuracy. In five-fold cross-validations using mutation counts as features, the average prediction accuracy was 80% (SEM = 0.1%), significantly outperforming baselines and outperforming models using mutation scores as features.
CONCLUSION: Using the factor matrices derived from the nsNMF, we identified multiple genes and pathways that are significantly associated with each cancer type. This study presents a generic and complete pipeline to study the associations between somatic mutations and cancers. The proposed method can be adapted to other studies for disease status classification and pathway discovery.
Copyright © 2019 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Cancer; Classification; Non-negative matrix factorization; Pathway; Somatic mutation; Whole-exome sequencing

Year:  2019        PMID: 31271844      PMCID: PMC6697569          DOI: 10.1016/j.jbi.2019.103247

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  44 in total

1.  Learning the parts of objects by non-negative matrix factorization.

Authors:  D D Lee; H S Seung
Journal:  Nature       Date:  1999-10-21       Impact factor: 49.962

2.  Metagenes and molecular pattern discovery using matrix factorization.

Authors:  Jean-Philippe Brunet; Pablo Tamayo; Todd R Golub; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2004-03-11       Impact factor: 11.205

3.  Elevated frequency and functional activity of a specific germ-line p53 intron mutation in familial breast cancer.

Authors:  T A Lehman; B G Haffty; C J Carbone; L R Bishop; A A Gumbs; S Krishnan; P G Shields; R Modali; B C Turner
Journal:  Cancer Res       Date:  2000-02-15       Impact factor: 12.701

Review 4.  Genome-wide association studies for common diseases and complex traits.

Authors:  Joel N Hirschhorn; Mark J Daly
Journal:  Nat Rev Genet       Date:  2005-02       Impact factor: 53.242

5.  PLINK: a tool set for whole-genome association and population-based linkage analyses.

Authors:  Shaun Purcell; Benjamin Neale; Kathe Todd-Brown; Lori Thomas; Manuel A R Ferreira; David Bender; Julian Maller; Pamela Sklar; Paul I W de Bakker; Mark J Daly; Pak C Sham
Journal:  Am J Hum Genet       Date:  2007-07-25       Impact factor: 11.025

6.  Intron 3 16 bp duplication polymorphism of p53 is associated with an increased risk for breast cancer by the age of 50 years.

Authors:  Shan Wang-Gohrke; Heiko Becher; Rolf Kreienberg; Ingo B Runnebaum; Jenny Chang-Claude
Journal:  Pharmacogenetics       Date:  2002-04

Review 7.  Searching for genetic determinants in the new millennium.

Authors:  N J Risch
Journal:  Nature       Date:  2000-06-15       Impact factor: 49.962

Review 8.  A census of human cancer genes.

Authors:  P Andrew Futreal; Lachlan Coin; Mhairi Marshall; Thomas Down; Timothy Hubbard; Richard Wooster; Nazneen Rahman; Michael R Stratton
Journal:  Nat Rev Cancer       Date:  2004-03       Impact factor: 60.716

9.  Patterns of somatic mutation in human cancer genomes.

Authors:  Christopher Greenman; Philip Stephens; Raffaella Smith; Gillian L Dalgliesh; Christopher Hunter; Graham Bignell; Helen Davies; Jon Teague; Adam Butler; Claire Stevens; Sarah Edkins; Sarah O'Meara; Imre Vastrik; Esther E Schmidt; Tim Avis; Syd Barthorpe; Gurpreet Bhamra; Gemma Buck; Bhudipa Choudhury; Jody Clements; Jennifer Cole; Ed Dicks; Simon Forbes; Kris Gray; Kelly Halliday; Rachel Harrison; Katy Hills; Jon Hinton; Andy Jenkinson; David Jones; Andy Menzies; Tatiana Mironenko; Janet Perry; Keiran Raine; Dave Richardson; Rebecca Shepherd; Alexandra Small; Calli Tofts; Jennifer Varian; Tony Webb; Sofie West; Sara Widaa; Andy Yates; Daniel P Cahill; David N Louis; Peter Goldstraw; Andrew G Nicholson; Francis Brasseur; Leendert Looijenga; Barbara L Weber; Yoke-Eng Chiew; Anna DeFazio; Mel F Greaves; Anthony R Green; Peter Campbell; Ewan Birney; Douglas F Easton; Georgia Chenevix-Trench; Min-Han Tan; Sok Kean Khoo; Bin Tean Teh; Siu Tsan Yuen; Suet Yi Leung; Richard Wooster; P Andrew Futreal; Michael R Stratton
Journal:  Nature       Date:  2007-03-08       Impact factor: 49.962

10.  bioNMF: a versatile tool for non-negative matrix factorization in biology.

Authors:  Alberto Pascual-Montano; Pedro Carmona-Saez; Monica Chagoyen; Francisco Tirado; Jose M Carazo; Roberto D Pascual-Marqui
Journal:  BMC Bioinformatics       Date:  2006-07-28       Impact factor: 3.169

View more
  12 in total

Review 1.  A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases.

Authors:  I S Stafford; M Kellermann; E Mossotto; R M Beattie; B D MacArthur; S Ennis
Journal:  NPJ Digit Med       Date:  2020-03-09

Review 2.  Application of non-negative matrix factorization in oncology: one approach for establishing precision medicine.

Authors:  Ryuji Hamamoto; Ken Takasawa; Hidenori Machino; Kazuma Kobayashi; Satoshi Takahashi; Amina Bolatkan; Norio Shinkai; Akira Sakai; Rina Aoyama; Masayoshi Yamada; Ken Asada; Masaaki Komatsu; Koji Okamoto; Hirokazu Kameoka; Syuzo Kaneko
Journal:  Brief Bioinform       Date:  2022-07-18       Impact factor: 13.994

3.  Using an Unsupervised Clustering Model to Detect the Early Spread of SARS-CoV-2 Worldwide.

Authors:  Yawei Li; Qingyun Liu; Zexian Zeng; Yuan Luo
Journal:  Genes (Basel)       Date:  2022-04-07       Impact factor: 4.141

4.  Tumor immune microenvironment-based classifications of bladder cancer for enhancing the response rate of immunotherapy.

Authors:  Jialin Meng; Xiaofan Lu; Yujie Zhou; Meng Zhang; Qintao Ge; Jun Zhou; Zongyao Hao; Shenglin Gao; Fangrong Yan; Chaozhao Liang
Journal:  Mol Ther Oncolytics       Date:  2021-02-04       Impact factor: 7.200

5.  Unsupervised phenotyping of sepsis using nonnegative matrix factorization of temporal trends from a multivariate panel of physiological measurements.

Authors:  Menghan Ding; Yuan Luo
Journal:  BMC Med Inform Decis Mak       Date:  2021-04-09       Impact factor: 2.796

6.  Deep learning for cancer type classification and driver gene identification.

Authors:  Zexian Zeng; Chengsheng Mao; Andy Vo; Xiaoyu Li; Janna Ore Nugent; Seema A Khan; Susan E Clare; Yuan Luo
Journal:  BMC Bioinformatics       Date:  2021-10-25       Impact factor: 3.169

7.  Comprehensive Genomic and Epigenomic Analyses on Transcriptomic Regulation in Stomach Adenocarcinoma.

Authors:  Junxing Chen; Weinan Liu; Jiabin Du; Pengcheng Wang; Jintian Wang; Kai Ye
Journal:  Front Genet       Date:  2022-02-11       Impact factor: 4.599

8.  Radiomics of Contrast-Enhanced Computed Tomography: A Potential Biomarker for Pretreatment Prediction of the Response to Bacillus Calmette-Guerin Immunotherapy in Non-Muscle-Invasive Bladder Cancer.

Authors:  Lei Ye; Yuntian Chen; Hui Xu; Zhaoxiang Wang; Haixia Li; Jin Qi; Jing Wang; Jin Yao; Jiaming Liu; Bin Song
Journal:  Front Cell Dev Biol       Date:  2022-02-25

9.  Performance-weighted-voting model: An ensemble machine learning method for cancer type classification using whole-exome sequencing mutation.

Authors:  Yawei Li; Yuan Luo
Journal:  Quant Biol       Date:  2020-12-07

Review 10.  A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases.

Authors:  I S Stafford; M Kellermann; E Mossotto; R M Beattie; B D MacArthur; S Ennis
Journal:  NPJ Digit Med       Date:  2020-03-09
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.