Literature DB >> 35106553

Vec2image: an explainable artificial intelligence model for the feature representation and classification of high-dimensional biological data by vector-to-image conversion.

Hui Tang1, Xiangtian Yu2, Rui Liu1,3, Tao Zeng4,5.   

Abstract

Feature representation and discriminative learning are proven models and technologies in artificial intelligence fields; however, major challenges for machine learning on large biological datasets are learning an effective model with mechanistical explanation on the model determination and prediction. To satisfy such demands, we developed Vec2image, an explainable convolutional neural network framework for characterizing the feature engineering, feature selection and classifier training that is mainly based on the collaboration of principal component coordinate conversion, deep residual neural networks and embedded k-nearest neighbor representation on pseudo images of high-dimensional biological data, where the pseudo images represent feature measurements and feature associations simultaneously. Vec2image has achieved better performance compared with other popular methods and illustrated its efficiency on feature selection in cell marker identification from tissue-specific single-cell datasets. In particular, in a case study on type 2 diabetes (T2D) by multiple human islet scRNA-seq datasets, Vec2image first displayed robust performance on T2D classification model building across different datasets, then a specific Vec2image model was trained to accurately recognize the cell state and efficiently rank feature genes relevant to T2D which uncovered potential T2D cellular pathogenesis; and next the cell activity changes, cell composition imbalances and cell-cell communication dysfunctions were associated to our finding T2D feature genes from both population-shared and individual-specific perspectives. Collectively, Vec2image is a new and efficient explainable artificial intelligence methodology that can be widely applied in human-readable classification and prediction on the basis of pseudo image representation of biological deep sequencing data.
© The Author(s) 2022. Published by Oxford University Press.

Entities:  

Keywords:  classification; deep residual neural network; explainable artificial intelligence; feature selection; single-cell sequencing; type 2 diabetes

Mesh:

Year:  2022        PMID: 35106553      PMCID: PMC8921615          DOI: 10.1093/bib/bbab584

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  58 in total

Review 1.  Big data and machine learning algorithms for health-care delivery.

Authors:  Kee Yuan Ngiam; Ing Wei Khor
Journal:  Lancet Oncol       Date:  2019-05       Impact factor: 41.316

2.  Knock-down of LAR protein tyrosine phosphatase induces insulin resistance.

Authors:  Ann Mander; Conrad P Hodgkinson; Graham J Sale
Journal:  FEBS Lett       Date:  2005-06-06       Impact factor: 4.124

3.  Endoplasmic reticulum stress links obesity, insulin action, and type 2 diabetes.

Authors:  Umut Ozcan; Qiong Cao; Erkan Yilmaz; Ann-Hwee Lee; Neal N Iwakoshi; Esra Ozdelen; Gürol Tuncman; Cem Görgün; Laurie H Glimcher; Gökhan S Hotamisligil
Journal:  Science       Date:  2004-10-15       Impact factor: 47.728

4.  Single-Cell Transcriptomics of the Human Endocrine Pancreas.

Authors:  Yue J Wang; Jonathan Schug; Kyoung-Jae Won; Chengyang Liu; Ali Naji; Dana Avrahami; Maria L Golson; Klaus H Kaestner
Journal:  Diabetes       Date:  2016-06-30       Impact factor: 9.461

Review 5.  Computational deconvolution: extracting cell type-specific information from heterogeneous samples.

Authors:  Shai S Shen-Orr; Renaud Gaujoux
Journal:  Curr Opin Immunol       Date:  2013-10-19       Impact factor: 7.486

6.  Single-cell transcriptomes identify human islet cell signatures and reveal cell-type-specific expression changes in type 2 diabetes.

Authors:  Nathan Lawlor; Joshy George; Mohan Bolisetty; Romy Kursawe; Lili Sun; V Sivakamasundari; Ina Kycia; Paul Robson; Michael L Stitzel
Journal:  Genome Res       Date:  2016-11-18       Impact factor: 9.043

7.  An entropy-based metric for assessing the purity of single cell populations.

Authors:  Baolin Liu; Chenwei Li; Ziyi Li; Dongfang Wang; Xianwen Ren; Zemin Zhang
Journal:  Nat Commun       Date:  2020-06-22       Impact factor: 14.919

8.  Pathway Commons 2019 Update: integration, analysis and exploration of pathway data.

Authors:  Igor Rodchenkov; Ozgun Babur; Augustin Luna; Bulent Arman Aksoy; Jeffrey V Wong; Dylan Fong; Max Franz; Metin Can Siper; Manfred Cheung; Michael Wrana; Harsh Mistry; Logan Mosier; Jonah Dlin; Qizhi Wen; Caitlin O'Callaghan; Wanxin Li; Geoffrey Elder; Peter T Smith; Christian Dallago; Ethan Cerami; Benjamin Gross; Ugur Dogrusoz; Emek Demir; Gary D Bader; Chris Sander
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

9.  Dual graph convolutional neural network for predicting chemical networks.

Authors:  Shonosuke Harada; Hirotaka Akita; Masashi Tsubaki; Yukino Baba; Ichigaku Takigawa; Yoshihiro Yamanishi; Hisashi Kashima
Journal:  BMC Bioinformatics       Date:  2020-04-23       Impact factor: 3.169

10.  SCENIC: single-cell regulatory network inference and clustering.

Authors:  Sara Aibar; Carmen Bravo González-Blas; Thomas Moerman; Vân Anh Huynh-Thu; Hana Imrichova; Gert Hulselmans; Florian Rambow; Jean-Christophe Marine; Pierre Geurts; Jan Aerts; Joost van den Oord; Zeynep Kalender Atak; Jasper Wouters; Stein Aerts
Journal:  Nat Methods       Date:  2017-10-09       Impact factor: 28.547

View more
  1 in total

1.  Noninvasive detection and interpretation of gastrointestinal diseases by collaborative serum metabolite and magnetically controlled capsule endoscopy.

Authors:  Xiang-Tian Yu; Ming Chen; Jingyi Guo; Jing Zhang; Tao Zeng
Journal:  Comput Struct Biotechnol J       Date:  2022-10-06       Impact factor: 6.155

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.