Literature DB >> 31977216

Boosting Tree-Assisted Multitask Deep Learning for Small Scientific Datasets.

Jian Jiang1,2, Rui Wang2, Menglun Wang2, Kaifu Gao2, Duc Duy Nguyen2, Guo-Wei Wei2,3,4.   

Abstract

Machine learning approaches have had tremendous success in various disciplines. However, such success highly depends on the size and quality of datasets. Scientific datasets are often small and difficult to collect. Currently, improving machine learning performance for small scientific datasets remains a major challenge in many academic fields, such as bioinformatics or medical science. Gradient boosting decision tree (GBDT) is typically optimal for small datasets, while deep learning often performs better for large datasets. This work reports a boosting tree-assisted multitask deep learning (BTAMDL) architecture that integrates GBDT and multitask deep learning (MDL) to achieve near-optimal predictions for small datasets when there exists a large dataset that is well correlated to the small datasets. Two BTAMDL models are constructed, one utilizing purely MDL output as GBDT input while the other admitting additional features in GBDT input. The proposed BTAMDL models are validated on four categories of datasets, including toxicity, partition coefficient, solubility, and solvation. It is found that the proposed BTAMDL models outperform the current state-of-the-art methods in various applications involving small datasets.

Entities:  

Mesh:

Year:  2020        PMID: 31977216      PMCID: PMC7350172          DOI: 10.1021/acs.jcim.9b01184

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  15 in total

1.  Reoptimization of MDL keys for use in drug discovery.

Authors:  Joseph L Durant; Burton A Leland; Douglas R Henry; James G Nourse
Journal:  J Chem Inf Comput Sci       Date:  2002 Nov-Dec

2.  Estimation of ADME properties with substructure pattern recognition.

Authors:  Jie Shen; Feixiong Cheng; You Xu; Weihua Li; Yun Tang
Journal:  J Chem Inf Model       Date:  2010-06-28       Impact factor: 4.956

Review 3.  Molecular fingerprint similarity search in virtual screening.

Authors:  Adrià Cereto-Massagué; María José Ojeda; Cristina Valls; Miquel Mulero; Santiago Garcia-Vallvé; Gerard Pujadas
Journal:  Methods       Date:  2014-08-15       Impact factor: 3.608

4.  Multi-Stage Multi-Task Feature Learning.

Authors:  Pinghua Gong; Jieping Ye; Changshui Zhang
Journal:  Adv Neural Inf Process Syst       Date:  2013-10

5.  An integrated iterative annotation technique for easing neural network training in medical image analysis.

Authors:  Brendon Lutnick; Brandon Ginley; Darshana Govind; Sean D McGarry; Peter S LaViolette; Rabi Yacoub; Sanjay Jain; John E Tomaszewski; Kuang-Yu Jen; Pinaki Sarder
Journal:  Nat Mach Intell       Date:  2019-02-11

6.  Handling limited datasets with neural networks in medical applications: A small-data approach.

Authors:  Torgyn Shaikhina; Natalia A Khovanova
Journal:  Artif Intell Med       Date:  2017-01-02       Impact factor: 5.326

7.  DG-GL: Differential geometry-based geometric learning of molecular datasets.

Authors:  Duc Duy Nguyen; Guo-Wei Wei
Journal:  Int J Numer Method Biomed Eng       Date:  2019-02-07       Impact factor: 2.747

8.  Molecular fingerprint-based artificial neural networks QSAR for ligand biological activity predictions.

Authors:  Kyaw-Zeyar Myint; Lirong Wang; Qin Tong; Xiang-Qun Xie
Journal:  Mol Pharm       Date:  2012-08-31       Impact factor: 4.939

Review 9.  Machine learning in chemoinformatics and drug discovery.

Authors:  Yu-Chen Lo; Stefano E Rensi; Wen Torng; Russ B Altman
Journal:  Drug Discov Today       Date:  2018-05-08       Impact factor: 7.851

10.  Quantitative structure-activity relationship modeling of rat acute toxicity by oral exposure.

Authors:  Hao Zhu; Todd M Martin; Lin Ye; Alexander Sedykh; Douglas M Young; Alexander Tropsha
Journal:  Chem Res Toxicol       Date:  2009-12       Impact factor: 3.739

View more
  8 in total

1.  Designing the ultrasonic treatment of nanoparticle-dispersions via machine learning.

Authors:  Christina Glaubitz; Barbara Rothen-Rutishauser; Marco Lattuada; Sandor Balog; Alke Petri-Fink
Journal:  Nanoscale       Date:  2022-09-15       Impact factor: 8.307

2.  Computed tomography radiomics-based distinction of invasive adenocarcinoma from minimally invasive adenocarcinoma manifesting as pure ground-glass nodules with bubble-like signs.

Authors:  Yining Jiang; Ziqi Xiong; Wenjing Zhao; Jingyu Zhang; Yan Guo; Guosheng Li; Zhiyong Li
Journal:  Gen Thorac Cardiovasc Surg       Date:  2022-03-18

3.  Large-Scale Modeling of Multispecies Acute Toxicity End Points Using Consensus of Multitask Deep Learning Methods.

Authors:  Sankalp Jain; Vishal B Siramshetty; Vinicius M Alves; Eugene N Muratov; Nicole Kleinstreuer; Alexander Tropsha; Marc C Nicklaus; Anton Simeonov; Alexey V Zakharov
Journal:  J Chem Inf Model       Date:  2021-02-03       Impact factor: 4.956

4.  AweGNN: Auto-parametrized weighted element-specific graph neural networks for molecules.

Authors:  Timothy Szocinski; Duc Duy Nguyen; Guo-Wei Wei
Journal:  Comput Biol Med       Date:  2021-05-12       Impact factor: 6.698

5.  Decision Trees for Predicting Mortality in Transcatheter Aortic Valve Implantation.

Authors:  Marco Mamprin; Jo M Zelis; Pim A L Tonino; Sveta Zinger; Peter H N de With
Journal:  Bioengineering (Basel)       Date:  2021-02-09

6.  A machine learning model for predicting deterioration of COVID-19 inpatients.

Authors:  Omer Noy; Dan Coster; Maya Metzger; Itai Atar; Shani Shenhar-Tsarfaty; Shlomo Berliner; Galia Rahav; Ori Rogowski; Ron Shamir
Journal:  Sci Rep       Date:  2022-02-16       Impact factor: 4.379

7.  Comparative Study of Repertoire Classification Methods Reveals Data Efficiency of k -mer Feature Extraction.

Authors:  Yotaro Katayama; Tetsuya J Kobayashi
Journal:  Front Immunol       Date:  2022-07-20       Impact factor: 8.786

8.  Automated Classification of 6-n-Propylthiouracil Taster Status with Machine Learning.

Authors:  Lala Chaimae Naciri; Mariano Mastinu; Roberto Crnjar; Iole Tomassini Barbarossa; Melania Melis
Journal:  Nutrients       Date:  2022-01-07       Impact factor: 5.717

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.