Literature DB >> 28921531

A naive Bayes algorithm for tissue origin diagnosis (TOD-Bayes) of synchronous multifocal tumors in the hepatobiliary and pancreatic system.

Weiqin Jiang1, Yifei Shen2,3, Yongfeng Ding4, Chuyu Ye2,3, Yi Zheng1, Peng Zhao1,5, Lulu Liu1, Zhou Tong1, Linfu Zhou6, Shuo Sun2,3, Xingchen Zhang2,3, Lisong Teng4,5, Michael P Timko7, Longjiang Fan2,3, Weijia Fang1,5.   

Abstract

Synchronous multifocal tumors are common in the hepatobiliary and pancreatic system but because of similarities in their histological features, oncologists have difficulty in identifying their precise tissue clonal origin through routine histopathological methods. To address this problem and assist in more precise diagnosis, we developed a computational approach for tissue origin diagnosis based on naive Bayes algorithm (TOD-Bayes) using ubiquitous RNA-Seq data. Massive tissue-specific RNA-Seq data sets were first obtained from The Cancer Genome Atlas (TCGA) and ∼1,000 feature genes were used to train and validate the TOD-Bayes algorithm. The accuracy of the model was >95% based on tenfold cross validation by the data from TCGA. A total of 18 clinical cancer samples (including six negative controls) with definitive tissue origin were subsequently used for external validation and 17 of the 18 samples were classified correctly in our study (94.4%). Furthermore, we included as cases studies seven tumor samples, taken from two individuals who suffered from synchronous multifocal tumors across tissues, where the efforts to make a definitive primary cancer diagnosis by traditional diagnostic methods had failed. Using our TOD-Bayes analysis, the two clinical test cases were successfully diagnosed as pancreatic cancer (PC) and cholangiocarcinoma (CC), respectively, in agreement with their clinical outcomes. Based on our findings, we believe that the TOD-Bayes algorithm is a powerful novel methodology to accurately identify the tissue origin of synchronous multifocal tumors of unknown primary cancers using RNA-Seq data and an important step toward more precision-based medicine in cancer diagnosis and treatment.
© 2017 UICC.

Entities:  

Keywords:  RNA-Seq; hepatobiliary and pancreatic system; naive Bayes algorithm; synchronous multifocal tumors; tissue origin

Mesh:

Substances:

Year:  2017        PMID: 28921531     DOI: 10.1002/ijc.31054

Source DB:  PubMed          Journal:  Int J Cancer        ISSN: 0020-7136            Impact factor:   7.396


  3 in total

1.  ADMETlab: a platform for systematic ADMET evaluation based on a comprehensively collected ADMET database.

Authors:  Jie Dong; Ning-Ning Wang; Zhi-Jiang Yao; Lin Zhang; Yan Cheng; Defang Ouyang; Ai-Ping Lu; Dong-Sheng Cao
Journal:  J Cheminform       Date:  2018-06-26       Impact factor: 5.514

2.  Bayesian Network as a Decision Tool for Predicting ALS Disease.

Authors:  Hasan Aykut Karaboga; Aslihan Gunel; Senay Vural Korkut; Ibrahim Demir; Resit Celik
Journal:  Brain Sci       Date:  2021-01-23

3.  Optimization of anesthetic decision-making in ERAS using Bayesian network.

Authors:  Yuwen Chen; Yiziting Zhu; Kunhua Zhong; Zhiyong Yang; Yujie Li; Xin Shu; Dandan Wang; Peng Deng; Xuehong Bai; Jianteng Gu; Kaizhi Lu; Ju Zhang; Lei Zhao; Tao Zhu; Ke Wei; Bin Yi
Journal:  Front Med (Lausanne)       Date:  2022-09-14
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.