Siwen Xu1,2, Zixiao Lu3, Wei Shao4, Christina Y Yu4,5, Jill L Reiter6, Qianjin Feng3, Weixing Feng7, Kun Huang8,9, Yunlong Liu10,11. 1. Institute of Intelligent System and Bioinformatics, College of Automation, Harbin Engineering University, Harbin, Heilongjiang, China. 2. Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, USA. 3. Guangdong Provincial Key Laboratory of Medical Image Processing, School of Biomedical Engineering, Southern Medical University, Guangzhou, China. 4. Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, USA. 5. Department of Biomedical Informatics, The Ohio State University, Columbus, OH, USA. 6. Department of Medical & Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA. 7. Institute of Intelligent System and Bioinformatics, College of Automation, Harbin Engineering University, Harbin, Heilongjiang, China. fengweixing@hrbeu.edu.cn. 8. Department of Medicine, Indiana University School of Medicine, Indianapolis, IN, USA. kunhuang@iu.edu. 9. Regenstrief Institute, Indianapolis, IN, USA. kunhuang@iu.edu. 10. Center for Computational Biology and Bioinformatics, Indiana University School of Medicine, Indianapolis, IN, USA. yunliu@iu.edu. 11. Department of Medical & Molecular Genetics, Indiana University School of Medicine, Indianapolis, IN, USA. yunliu@iu.edu.
Abstract
BACKGROUND: Existing studies have demonstrated that the integrative analysis of histopathological images and genomic data can be used to better understand the onset and progression of many diseases, as well as identify new diagnostic and prognostic biomarkers. However, since the development of pathological phenotypes are influenced by a variety of complex biological processes, complete understanding of the underlying gene regulatory mechanisms for the cell and tissue morphology is still a challenge. In this study, we explored the relationship between the chromatin accessibility changes and the epithelial tissue proportion in histopathological images of estrogen receptor (ER) positive breast cancer. METHODS: An established whole slide image processing pipeline based on deep learning was used to perform global segmentation of epithelial and stromal tissues. We then used canonical correlation analysis to detect the epithelial tissue proportion-associated regulatory regions. By integrating ATAC-seq data with matched RNA-seq data, we found the potential target genes that associated with these regulatory regions. Then we used these genes to perform the following pathway and survival analysis. RESULTS: Using canonical correlation analysis, we detected 436 potential regulatory regions that exhibited significant correlation between quantitative chromatin accessibility changes and the epithelial tissue proportion in tumors from 54 patients (FDR < 0.05). We then found that these 436 regulatory regions were associated with 74 potential target genes. After functional enrichment analysis, we observed that these potential target genes were enriched in cancer-associated pathways. We further demonstrated that using the gene expression signals and the epithelial tissue proportion extracted from this integration framework could stratify patient prognoses more accurately, outperforming predictions based on only omics or image features. CONCLUSION: This integrative analysis is a useful strategy for identifying potential regulatory regions in the human genome that are associated with tumor tissue quantification. This study will enable efficient prioritization of genomic regulatory regions identified by ATAC-seq data for further studies to validate their causal regulatory function. Ultimately, identifying epithelial tissue proportion-associated regulatory regions will further our understanding of the underlying molecular mechanisms of disease and inform the development of potential therapeutic targets.
BACKGROUND: Existing studies have demonstrated that the integrative analysis of histopathological images and genomic data can be used to better understand the onset and progression of many diseases, as well as identify new diagnostic and prognostic biomarkers. However, since the development of pathological phenotypes are influenced by a variety of complex biological processes, complete understanding of the underlying gene regulatory mechanisms for the cell and tissue morphology is still a challenge. In this study, we explored the relationship between the chromatin accessibility changes and the epithelial tissue proportion in histopathological images of estrogen receptor (ER) positive breast cancer. METHODS: An established whole slide image processing pipeline based on deep learning was used to perform global segmentation of epithelial and stromal tissues. We then used canonical correlation analysis to detect the epithelial tissue proportion-associated regulatory regions. By integrating ATAC-seq data with matched RNA-seq data, we found the potential target genes that associated with these regulatory regions. Then we used these genes to perform the following pathway and survival analysis. RESULTS: Using canonical correlation analysis, we detected 436 potential regulatory regions that exhibited significant correlation between quantitative chromatin accessibility changes and the epithelial tissue proportion in tumors from 54 patients (FDR < 0.05). We then found that these 436 regulatory regions were associated with 74 potential target genes. After functional enrichment analysis, we observed that these potential target genes were enriched in cancer-associated pathways. We further demonstrated that using the gene expression signals and the epithelial tissue proportion extracted from this integration framework could stratify patient prognoses more accurately, outperforming predictions based on only omics or image features. CONCLUSION: This integrative analysis is a useful strategy for identifying potential regulatory regions in the human genome that are associated with tumor tissue quantification. This study will enable efficient prioritization of genomic regulatory regions identified by ATAC-seq data for further studies to validate their causal regulatory function. Ultimately, identifying epithelial tissue proportion-associated regulatory regions will further our understanding of the underlying molecular mechanisms of disease and inform the development of potential therapeutic targets.
Authors: Andrew H Beck; Ankur R Sangoi; Samuel Leung; Robert J Marinelli; Torsten O Nielsen; Marc J van de Vijver; Robert B West; Matt van de Rijn; Daphne Koller Journal: Sci Transl Med Date: 2011-11-09 Impact factor: 17.956
Authors: Sarah K Denny; Dian Yang; Chen-Hua Chuang; Jennifer J Brady; Jing Shan Lim; Barbara M Grüner; Shin-Heng Chiou; Alicia N Schep; Jessika Baral; Cécile Hamard; Martine Antoine; Marie Wislez; Christina S Kong; Andrew J Connolly; Kwon-Sik Park; Julien Sage; William J Greenleaf; Monte M Winslow Journal: Cell Date: 2016-06-30 Impact factor: 41.582
Authors: Alice Pajoro; Pedro Madrigal; Jose M Muiño; José Tomás Matus; Jian Jin; Martin A Mecchia; Juan M Debernardi; Javier F Palatnik; Salma Balazadeh; Muhammad Arif; Diarmuid S Ó'Maoiléidigh; Frank Wellmer; Pawel Krajewski; José-Luis Riechmann; Gerco C Angenent; Kerstin Kaufmann Journal: Genome Biol Date: 2014-03-03 Impact factor: 13.583
Authors: C J H Kramer; K M H Vangangelt; G W van Pelt; T J A Dekker; R A E M Tollenaar; W E Mesker Journal: Breast Cancer Res Treat Date: 2018-10-09 Impact factor: 4.872