Literature DB >> 33371881

The shape of gene expression distributions matter: how incorporating distribution shape improves the interpretation of cancer transcriptomic data.

Laurence de Torrenté1, Samuel Zimmerman1, Masako Suzuki2, Maximilian Christopeit3, John M Greally2, Jessica C Mar4,5,6.   

Abstract

BACKGROUND: In genomics, we often assume that continuous data, such as gene expression, follow a specific kind of distribution. However we rarely stop to question the validity of this assumption, or consider how broadly applicable it may be to all genes that are in the transcriptome. Our study investigated the prevalence of a range of gene expression distributions in three different tumor types from the Cancer Genome Atlas (TCGA).
RESULTS: Surprisingly, the expression of less than 50% of all genes was Normally-distributed, with other distributions including Gamma, Bimodal, Cauchy, and Lognormal also represented. Most of the distribution categories contained genes that were significantly enriched for unique biological processes. Different assumptions based on the shape of the expression profile were used to identify genes that could discriminate between patients with good versus poor survival. The prognostic marker genes that were identified when the shape of the distribution was accounted for reflected functional insights into cancer biology that were not observed when standard assumptions were applied. We showed that when multiple types of distributions were permitted, i.e. the shape of the expression profile was used, the statistical classifiers had greater predictive accuracy for determining the prognosis of a patient versus those that assumed only one type of gene expression distribution.
CONCLUSIONS: Our results highlight the value of studying a gene's distribution shape to model heterogeneity of transcriptomic data and the impact on using analyses that permit more than one type of gene expression distribution. These insights would have been overlooked when using standard approaches that assume all genes follow the same type of distribution in a patient cohort.

Entities:  

Keywords:  Cancer genomics; Gene expression; Multi-modality; Non-normal distribution; Survival analysis

Year:  2020        PMID: 33371881      PMCID: PMC7768656          DOI: 10.1186/s12859-020-03892-w

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  21 in total

1.  Mature B-cell acute lymphoblastic leukaemia associated with a rare MLL-FOXO4 fusion gene.

Authors:  Leon Lim; Kok-Siong Chen; Shekhar Krishnan; Leena Gole; Hany Ariffin
Journal:  Br J Haematol       Date:  2012-03-19       Impact factor: 6.998

Review 2.  The prognostic and functional role of microRNAs in acute myeloid leukemia.

Authors:  Guido Marcucci; Krzysztof Mrózek; Michael D Radmacher; Ramiro Garzon; Clara D Bloomfield
Journal:  Blood       Date:  2010-11-02       Impact factor: 22.113

3.  Dynamic transcriptomes of human myeloid leukemia cells.

Authors:  Hai Wang; Haiyan Hu; Qian Zhang; Yadong Yang; Yanming Li; Yang Hu; Xiuyan Ruan; Yaran Yang; Zhaojun Zhang; Chang Shu; Jiangwei Yan; Edward K Wakeland; Quanzhen Li; Songnian Hu; Xiangdong Fang
Journal:  Genomics       Date:  2013-06-24       Impact factor: 5.736

Review 4.  MicroRNA biogenesis pathways in cancer.

Authors:  Shuibin Lin; Richard I Gregory
Journal:  Nat Rev Cancer       Date:  2015-06       Impact factor: 60.716

5.  Association of a leukemic stem cell gene expression signature with clinical outcomes in acute myeloid leukemia.

Authors:  Andrew J Gentles; Sylvia K Plevritis; Ravindra Majeti; Ash A Alizadeh
Journal:  JAMA       Date:  2010-12-22       Impact factor: 56.272

6.  HSC commitment-associated epigenetic signature is prognostic in acute myeloid leukemia.

Authors:  Boris Bartholdy; Maximilian Christopeit; Britta Will; Yongkai Mo; Laura Barreyro; Yiting Yu; Tushar D Bhagat; Ujunwa C Okoye-Okafor; Tihomira I Todorova; John M Greally; Ross L Levine; Ari Melnick; Amit Verma; Ulrich Steidl
Journal:  J Clin Invest       Date:  2014-03       Impact factor: 14.808

7.  Stem cell gene expression programs influence clinical outcome in human leukemia.

Authors:  Kolja Eppert; Katsuto Takenaka; Eric R Lechman; Levi Waldron; Björn Nilsson; Peter van Galen; Klaus H Metzeler; Armando Poeppl; Vicki Ling; Joseph Beyene; Angelo J Canty; Jayne S Danska; Stefan K Bohlander; Christian Buske; Mark D Minden; Todd R Golub; Igor Jurisica; Benjamin L Ebert; John E Dick
Journal:  Nat Med       Date:  2011-08-28       Impact factor: 53.440

8.  Identification of a 24-gene prognostic signature that improves the European LeukemiaNet risk classification of acute myeloid leukemia: an international collaborative study.

Authors:  Zejuan Li; Tobias Herold; Chunjiang He; Peter J M Valk; Ping Chen; Vindi Jurinovic; Ulrich Mansmann; Michael D Radmacher; Kati S Maharry; Miao Sun; Xinan Yang; Hao Huang; Xi Jiang; Maria-Cristina Sauerland; Thomas Büchner; Wolfgang Hiddemann; Abdel Elkahloun; Mary Beth Neilly; Yanming Zhang; Richard A Larson; Michelle M Le Beau; Michael A Caligiuri; Konstanze Döhner; Lars Bullinger; Paul P Liu; Ruud Delwel; Guido Marcucci; Bob Lowenberg; Clara D Bloomfield; Janet D Rowley; Stefan K Bohlander; Jianjun Chen
Journal:  J Clin Oncol       Date:  2013-02-04       Impact factor: 44.544

9.  Prognostically useful gene-expression profiles in acute myeloid leukemia.

Authors:  Peter J M Valk; Roel G W Verhaak; M Antoinette Beijen; Claudia A J Erpelinck; Sahar Barjesteh van Waalwijk van Doorn-Khosrovani; Judith M Boer; H Berna Beverloo; Michael J Moorhouse; Peter J van der Spek; Bob Löwenberg; Ruud Delwel
Journal:  N Engl J Med       Date:  2004-04-15       Impact factor: 91.245

10.  Common mechanism for oncogenic activation of MLL by forkhead family proteins.

Authors:  Chi Wai So; Michael L Cleary
Journal:  Blood       Date:  2002-08-22       Impact factor: 22.113

View more
  1 in total

1.  An introduction to new robust linear and monotonic correlation coefficients.

Authors:  Mohammad Tabatabai; Stephanie Bailey; Zoran Bursac; Habib Tabatabai; Derek Wilus; Karan P Singh
Journal:  BMC Bioinformatics       Date:  2021-03-31       Impact factor: 3.169

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.