Zhi Liu1, Tuantuan Gui1, Zhen Wang1, Hong Li1, Yunhe Fu2, Xiao Dong3, Yixue Li4. 1. Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China. 2. Key Laboratory of Systems Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China. 3. Department of Genetics, Albert Einstein College of Medicine, Bronx, NY 10461, USA. 4. Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China School of Life Science and Technology, Shanghai Jiaotong University, Shanghai 200240, China Shanghai Center for Bioinformation Technology, Shanghai Industrial Technology Institute, Shanghai 201203, China and Collaborative Innovation Center for Genetics and Development, Fudan University, Shanghai 200438, China.
Abstract
MOTIVATION: Allele-specific expression (ASE) is a useful way to identify cis-acting regulatory variation, which provides opportunities to develop new therapeutic strategies that activate beneficial alleles or silence mutated alleles at specific loci. However, multiple problems hinder the identification of ASE in next-generation sequencing (NGS) data. RESULTS: We developed cisASE, a likelihood-based method for detecting ASE on single nucleotide variant (SNV), exon and gene levels from sequencing data without requiring phasing or parental information. cisASE uses matched DNA-seq data to control technical bias and copy number variation (CNV) in putative cis-regulated ASE identification. Compared with state-of-the-art methods, cisASE exhibits significantly increased accuracy and speed. cisASE works moderately well for datasets without DNA-seq and thus is widely applicable. By applying cisASE to real datasets, we identified specific ASE characteristics in normal and cancer tissues, thus indicating that cisASE has potential for wide applications in cancer genomics. AVAILABILITY AND IMPLEMENTATION: cisASE is freely available at http://lifecenter.sgst.cn/cisASE CONTACT: biosinodx@gmail.com or yxli@sibs.ac.cnSupplementary information: Supplementary data are available at Bioinformatics online.
MOTIVATION: Allele-specific expression (ASE) is a useful way to identify cis-acting regulatory variation, which provides opportunities to develop new therapeutic strategies that activate beneficial alleles or silence mutated alleles at specific loci. However, multiple problems hinder the identification of ASE in next-generation sequencing (NGS) data. RESULTS: We developed cisASE, a likelihood-based method for detecting ASE on single nucleotide variant (SNV), exon and gene levels from sequencing data without requiring phasing or parental information. cisASE uses matched DNA-seq data to control technical bias and copy number variation (CNV) in putative cis-regulated ASE identification. Compared with state-of-the-art methods, cisASE exhibits significantly increased accuracy and speed. cisASE works moderately well for datasets without DNA-seq and thus is widely applicable. By applying cisASE to real datasets, we identified specific ASE characteristics in normal and cancer tissues, thus indicating that cisASE has potential for wide applications in cancer genomics. AVAILABILITY AND IMPLEMENTATION: cisASE is freely available at http://lifecenter.sgst.cn/cisASE CONTACT: biosinodx@gmail.com or yxli@sibs.ac.cnSupplementary information: Supplementary data are available at Bioinformatics online.