Literature DB >> 32277818

Blood-based multi-tissue gene expression inference with Bayesian ridge regression.

Wenjian Xu1, Xuanshi Liu1, Fei Leng1, Wei Li1.   

Abstract

MOTIVATION: Gene expression profiling is widely used in basic and cancer research but still not feasible in many clinical applications because tissues, such as brain samples, are difficult and not ethnical to collect. Gene expression in uncollected tissues can be computationally inferred using genotype and expression quantitative trait loci. No methods can infer unmeasured gene expression of multiple tissues with single tissue gene expression profile as input.
RESULTS: Here, we present a Bayesian ridge regression-based method (B-GEX) to infer gene expression profiles of multiple tissues from blood gene expression profile. For each gene in a tissue, a low-dimensional feature vector was extracted from whole blood gene expression profile by feature selection. We used GTEx RNAseq data of 16 tissues to train inference models to capture the cross-tissue expression correlations between each target gene in a tissue and its preselected feature genes in peripheral blood. We compared B-GEX with least square regression, LASSO regression and ridge regression. B-GEX outperforms the other three models in most tissues in terms of mean absolute error, Pearson correlation coefficient and root-mean-squared error. Moreover, B-GEX infers expression level of tissue-specific genes as well as those of non-tissue-specific genes in all tissues. Unlike previous methods, which require genomic features or gene expression profiles of multiple tissues, our model only requires whole blood expression profile as input. B-GEX helps gain insights into gene expressions of uncollected tissues from more accessible data of blood.
AVAILABILITY AND IMPLEMENTATION: B-GEX is available at https://github.com/xuwenjian85/B-GEX. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Year:  2020        PMID: 32277818     DOI: 10.1093/bioinformatics/btaa239

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  5 in total

1.  A polygenic stacking classifier revealed the complicated platelet transcriptomic landscape of adult immune thrombocytopenia.

Authors:  Chengfeng Xu; Ruochi Zhang; Meiyu Duan; Yongming Zhou; Jizhang Bao; Hao Lu; Jie Wang; Minghui Hu; Zhaoyang Hu; Fengfeng Zhou; Wenwei Zhu
Journal:  Mol Ther Nucleic Acids       Date:  2022-04-06       Impact factor: 10.183

2.  Deep Large-Scale Multitask Learning Network for Gene Expression Inference.

Authors:  Kamran Ghasedi Dizaji; Wei Chen; Heng Huang
Journal:  J Comput Biol       Date:  2021-05       Impact factor: 1.479

3.  EnRank: An Ensemble Method to Detect Pulmonary Hypertension Biomarkers Based on Feature Selection and Machine Learning Models.

Authors:  Xiangju Liu; Yu Zhang; Chunli Fu; Ruochi Zhang; Fengfeng Zhou
Journal:  Front Genet       Date:  2021-04-27       Impact factor: 4.599

4.  Gene expression imputation and cell-type deconvolution in human brain with spatiotemporal precision and its implications for brain-related disorders.

Authors:  Guangsheng Pei; Yin-Ying Wang; Lukas M Simon; Yulin Dai; Zhongming Zhao; Peilin Jia
Journal:  Genome Res       Date:  2020-12-03       Impact factor: 9.043

5.  Diurnal and circadian rhythmicity of the human blood transcriptome overlaps with organ- and tissue-specific expression of a non-human primate.

Authors:  Carla S Möller-Levet; Emma E Laing; Simon N Archer; Derk-Jan Dijk
Journal:  BMC Biol       Date:  2022-03-09       Impact factor: 7.431

  5 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.