Literature DB >> 25592575

Unsupervised feature construction and knowledge extraction from genome-wide assays of breast cancer with denoising autoencoders.

Jie Tan1, Matthew Ung, Chao Cheng, Casey S Greene.   

Abstract

Big data bring new opportunities for methods that efficiently summarize and automatically extract knowledge from such compendia. While both supervised learning algorithms and unsupervised clustering algorithms have been successfully applied to biological data, they are either dependent on known biology or limited to discerning the most significant signals in the data. Here we present denoising autoencoders (DAs), which employ a data-defined learning objective independent of known biology, as a method to identify and extract complex patterns from genomic data. We evaluate the performance of DAs by applying them to a large collection of breast cancer gene expression data. Results show that DAs successfully construct features that contain both clinical and molecular information. There are features that represent tumor or normal samples, estrogen receptor (ER) status, and molecular subtypes. Features constructed by the autoencoder generalize to an independent dataset collected using a distinct experimental platform. By integrating data from ENCODE for feature interpretation, we discover a feature representing ER status through association with key transcription factors in breast cancer. We also identify a feature highly predictive of patient survival and it is enriched by FOXM1 signaling pathway. The features constructed by DAs are often bimodally distributed with one peak near zero and another near one, which facilitates discretization. In summary, we demonstrate that DAs effectively extract key biological principles from gene expression data and summarize them into constructed features with convenient properties.

Entities:  

Mesh:

Substances:

Year:  2015        PMID: 25592575      PMCID: PMC4299935     

Source DB:  PubMed          Journal:  Pac Symp Biocomput        ISSN: 2335-6928


  28 in total

1.  TIP: a probabilistic method for identifying transcription factor target genes from ChIP-seq binding profiles.

Authors:  Chao Cheng; Renqiang Min; Mark Gerstein
Journal:  Bioinformatics       Date:  2011-10-29       Impact factor: 6.937

2.  Potential tumor suppressor role for the c-Myb oncogene in luminal breast cancer.

Authors:  Aaron R Thorner; Joel S Parker; Katherine A Hoadley; Charles M Perou
Journal:  PLoS One       Date:  2010-10-07       Impact factor: 3.240

3.  Supervised risk predictor of breast cancer based on intrinsic subtypes.

Authors:  Joel S Parker; Michael Mullins; Maggie C U Cheang; Samuel Leung; David Voduc; Tammi Vickery; Sherri Davies; Christiane Fauron; Xiaping He; Zhiyuan Hu; John F Quackenbush; Inge J Stijleman; Juan Palazzo; J S Marron; Andrew B Nobel; Elaine Mardis; Torsten O Nielsen; Matthew J Ellis; Charles M Perou; Philip S Bernard
Journal:  J Clin Oncol       Date:  2009-02-09       Impact factor: 44.544

4.  Architecture of the human regulatory network derived from ENCODE data.

Authors:  Mark B Gerstein; Anshul Kundaje; Manoj Hariharan; Stephen G Landt; Koon-Kiu Yan; Chao Cheng; Xinmeng Jasmine Mu; Ekta Khurana; Joel Rozowsky; Roger Alexander; Renqiang Min; Pedro Alves; Alexej Abyzov; Nick Addleman; Nitin Bhardwaj; Alan P Boyle; Philip Cayting; Alexandra Charos; David Z Chen; Yong Cheng; Declan Clarke; Catharine Eastman; Ghia Euskirchen; Seth Frietze; Yao Fu; Jason Gertz; Fabian Grubert; Arif Harmanci; Preti Jain; Maya Kasowski; Phil Lacroute; Jing Jane Leng; Jin Lian; Hannah Monahan; Henriette O'Geen; Zhengqing Ouyang; E Christopher Partridge; Dorrelyn Patacsil; Florencia Pauli; Debasish Raha; Lucia Ramirez; Timothy E Reddy; Brian Reed; Minyi Shi; Teri Slifer; Jing Wang; Linfeng Wu; Xinqiong Yang; Kevin Y Yip; Gili Zilberman-Schapira; Serafim Batzoglou; Arend Sidow; Peggy J Farnham; Richard M Myers; Sherman M Weissman; Michael Snyder
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

5.  The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups.

Authors:  Christina Curtis; Sohrab P Shah; Suet-Feung Chin; Gulisa Turashvili; Oscar M Rueda; Mark J Dunning; Doug Speed; Andy G Lynch; Shamith Samarajiwa; Yinyin Yuan; Stefan Gräf; Gavin Ha; Gholamreza Haffari; Ali Bashashati; Roslin Russell; Steven McKinney; Anita Langerød; Andrew Green; Elena Provenzano; Gordon Wishart; Sarah Pinder; Peter Watson; Florian Markowetz; Leigh Murphy; Ian Ellis; Arnie Purushotham; Anne-Lise Børresen-Dale; James D Brenton; Simon Tavaré; Carlos Caldas; Samuel Aparicio
Journal:  Nature       Date:  2012-04-18       Impact factor: 49.962

6.  ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia.

Authors:  Stephen G Landt; Georgi K Marinov; Anshul Kundaje; Pouya Kheradpour; Florencia Pauli; Serafim Batzoglou; Bradley E Bernstein; Peter Bickel; James B Brown; Philip Cayting; Yiwen Chen; Gilberto DeSalvo; Charles Epstein; Katherine I Fisher-Aylor; Ghia Euskirchen; Mark Gerstein; Jason Gertz; Alexander J Hartemink; Michael M Hoffman; Vishwanath R Iyer; Youngsook L Jung; Subhradip Karmakar; Manolis Kellis; Peter V Kharchenko; Qunhua Li; Tao Liu; X Shirley Liu; Lijia Ma; Aleksandar Milosavljevic; Richard M Myers; Peter J Park; Michael J Pazin; Marc D Perry; Debasish Raha; Timothy E Reddy; Joel Rozowsky; Noam Shoresh; Arend Sidow; Matthew Slattery; John A Stamatoyannopoulos; Michael Y Tolstorukov; Kevin P White; Simon Xi; Peggy J Farnham; Jason D Lieb; Barbara J Wold; Michael Snyder
Journal:  Genome Res       Date:  2012-09       Impact factor: 9.043

7.  IMP: a multi-species functional genomics portal for integration, visualization and prediction of protein functions and networks.

Authors:  Aaron K Wong; Christopher Y Park; Casey S Greene; Lars A Bongo; Yuanfang Guan; Olga G Troyanskaya
Journal:  Nucleic Acids Res       Date:  2012-06-07       Impact factor: 16.971

8.  FOXA1 is a key determinant of estrogen receptor function and endocrine response.

Authors:  Antoni Hurtado; Kelly A Holmes; Caryn S Ross-Innes; Dominic Schmidt; Jason S Carroll
Journal:  Nat Genet       Date:  2010-12-12       Impact factor: 38.330

9.  Comprehensive molecular portraits of human breast tumours.

Authors: 
Journal:  Nature       Date:  2012-09-23       Impact factor: 49.962

10.  PID: the Pathway Interaction Database.

Authors:  Carl F Schaefer; Kira Anthony; Shiva Krupa; Jeffrey Buchoff; Matthew Day; Timo Hannay; Kenneth H Buetow
Journal:  Nucleic Acids Res       Date:  2008-10-02       Impact factor: 16.971

View more
  41 in total

1.  Unsupervised Extraction of Stable Expression Signatures from Public Compendia with an Ensemble of Neural Networks.

Authors:  Jie Tan; Georgia Doing; Kimberley A Lewis; Courtney E Price; Kathleen M Chen; Kyle C Cady; Barret Perchuk; Michael T Laub; Deborah A Hogan; Casey S Greene
Journal:  Cell Syst       Date:  2017-07-12       Impact factor: 10.304

2.  Deep learning in biomedicine.

Authors:  Michael Wainberg; Daniele Merico; Andrew Delong; Brendan J Frey
Journal:  Nat Biotechnol       Date:  2018-09-06       Impact factor: 54.908

3.  A DEEP LEARNING APPROACH FOR CANCER DETECTION AND RELEVANT GENE IDENTIFICATION.

Authors:  Padideh Danaee; Reza Ghaeini; David A Hendrix
Journal:  Pac Symp Biocomput       Date:  2017

4.  Synthetic image augmentation with generative adversarial network for enhanced performance in protein classification.

Authors:  Rohit Verma; Raj Mehrotra; Chinmay Rane; Ritu Tiwari; Arun Kumar Agariya
Journal:  Biomed Eng Lett       Date:  2020-07-13

5.  Unsupervised classification of multi-omics data during cardiac remodeling using deep learning.

Authors:  Neo Christopher Chung; Bilal Mirza; Howard Choi; Jie Wang; Ding Wang; Peipei Ping; Wei Wang
Journal:  Methods       Date:  2019-03-07       Impact factor: 3.608

Review 6.  Integrating Artificial Intelligence and Nanotechnology for Precision Cancer Medicine.

Authors:  Omer Adir; Maria Poley; Gal Chen; Sahar Froim; Nitzan Krinsky; Jeny Shklover; Janna Shainsky-Roitman; Twan Lammers; Avi Schroeder
Journal:  Adv Mater       Date:  2019-07-09       Impact factor: 30.849

Review 7.  Providing data science support for systems pharmacology and its implications to drug discovery.

Authors:  Thomas Hart; Lei Xie
Journal:  Expert Opin Drug Discov       Date:  2016-01-09       Impact factor: 6.098

8.  Extracting a biologically relevant latent space from cancer transcriptomes with variational autoencoders.

Authors:  Gregory P Way; Casey S Greene
Journal:  Pac Symp Biocomput       Date:  2018

9.  COMPUTATIONAL APPROACHES TO STUDY MICROBES AND MICROBIOMES.

Authors:  Casey S Greene; James A Foster; Bruce A Stanton; Deborah A Hogan; Yana Bromberg
Journal:  Pac Symp Biocomput       Date:  2016

10.  Deep Learning-Based Multi-Omics Integration Robustly Predicts Survival in Liver Cancer.

Authors:  Kumardeep Chaudhary; Olivier B Poirion; Liangqun Lu; Lana X Garmire
Journal:  Clin Cancer Res       Date:  2017-10-05       Impact factor: 12.531

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.