Literature DB >> 33571253

A mixture model to detect edges in sparse co-expression graphs with an application for comparing breast cancer subtypes.

Haim Bar1, Seojin Bang2.   

Abstract

We develop a method to recover a gene network's structure from co-expression data, measured in terms of normalized Pearson's correlation coefficients between gene pairs. We treat these co-expression measurements as weights in the complete graph in which nodes correspond to genes. To decide which edges exist in the gene network, we fit a three-component mixture model such that the observed weights of 'null edges' follow a normal distribution with mean 0, and the non-null edges follow a mixture of two lognormal distributions, one for positively- and one for negatively-correlated pairs. We show that this so-called L2 N mixture model outperforms other methods in terms of power to detect edges, and it allows to control the false discovery rate. Importantly, our method makes no assumptions about the true network structure. We demonstrate our method, which is implemented in an R package called edgefinder, using a large dataset consisting of expression values of 12,750 genes obtained from 1,616 women. We infer the gene network structure by cancer subtype, and find insightful subtype characteristics. For example, we find thirteen pathways which are enriched in each of the cancer groups but not in the Normal group, with two of the pathways associated with autoimmune diseases and two other with graft rejection. We also find specific characteristics of different breast cancer subtypes. For example, the Luminal A network includes a single, highly connected cluster of genes, which is enriched in the human diseases category, and in the Her2 subtype network we find a distinct, and highly interconnected cluster which is uniquely enriched in drug metabolism pathways.

Entities:  

Mesh:

Substances:

Year:  2021        PMID: 33571253      PMCID: PMC7877669          DOI: 10.1371/journal.pone.0246945

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


  34 in total

1.  Lethality and centrality in protein networks.

Authors:  H Jeong; S P Mason; A L Barabási; Z N Oltvai
Journal:  Nature       Date:  2001-05-03       Impact factor: 49.962

2.  The Gene Ontology (GO) database and informatics resource.

Authors:  M A Harris; J Clark; A Ireland; J Lomax; M Ashburner; R Foulger; K Eilbeck; S Lewis; B Marshall; C Mungall; J Richter; G M Rubin; J A Blake; C Bult; M Dolan; H Drabkin; J T Eppig; D P Hill; L Ni; M Ringwald; R Balakrishnan; J M Cherry; K R Christie; M C Costanzo; S S Dwight; S Engel; D G Fisk; J E Hirschman; E L Hong; R S Nash; A Sethuraman; C L Theesfeld; D Botstein; K Dolinski; B Feierbach; T Berardini; S Mundodi; S Y Rhee; R Apweiler; D Barrell; E Camon; E Dimmer; V Lee; R Chisholm; P Gaudet; W Kibbe; R Kishore; E M Schwarz; P Sternberg; M Gwinn; L Hannick; J Wortman; M Berriman; V Wood; N de la Cruz; P Tonellato; P Jaiswal; T Seigfried; R White
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  Sparse inverse covariance estimation with the graphical lasso.

Authors:  Jerome Friedman; Trevor Hastie; Robert Tibshirani
Journal:  Biostatistics       Date:  2007-12-12       Impact factor: 5.899

4.  Cross-linking of CD53 promotes activation of resting human B lymphocytes.

Authors:  A M Rasmussen; H K Blomhoff; T Stokke; V Horejsi; E B Smeland
Journal:  J Immunol       Date:  1994-12-01       Impact factor: 5.422

5.  Structure of the complex between HER2 and an antibody paratope formed by side chains from tryptophan and serine.

Authors:  Robert D Fisher; Mark Ultsch; Andreas Lingel; Gabriele Schaefer; Lily Shao; Sara Birtalan; Sachdev S Sidhu; Charles Eigenbrot
Journal:  J Mol Biol       Date:  2010-07-21       Impact factor: 5.469

Review 6.  Network medicine: a network-based approach to human disease.

Authors:  Albert-László Barabási; Natali Gulbahce; Joseph Loscalzo
Journal:  Nat Rev Genet       Date:  2011-01       Impact factor: 53.242

7.  Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles.

Authors:  Aravind Subramanian; Pablo Tamayo; Vamsi K Mootha; Sayan Mukherjee; Benjamin L Ebert; Michael A Gillette; Amanda Paulovich; Scott L Pomeroy; Todd R Golub; Eric S Lander; Jill P Mesirov
Journal:  Proc Natl Acad Sci U S A       Date:  2005-09-30       Impact factor: 11.205

8.  SERPINA6, BEX1, AGTR1, SLC26A3, and LAPTM4B are markers of resistance to neoadjuvant chemotherapy in HER2-negative breast cancer.

Authors:  Jorma J de Ronde; Esther H Lips; Lennart Mulder; Andrew D Vincent; Jelle Wesseling; Marja Nieuwland; Ron Kerkhoven; Marie-Jeanne T F D Vrancken Peeters; Gabe S Sonke; Sjoerd Rodenhuis; Lodewyk F A Wessels
Journal:  Breast Cancer Res Treat       Date:  2012-12-01       Impact factor: 4.872

9.  A network of SCOP hidden Markov models and its analysis.

Authors:  Liqing Zhang; Layne T Watson; Lenwood S Heath
Journal:  BMC Bioinformatics       Date:  2011-05-23       Impact factor: 3.169

10.  Pharmacometabolomics study identifies circulating spermidine and tryptophan as potential biomarkers associated with the complete pathological response to trastuzumab-paclitaxel neoadjuvant therapy in HER-2 positive breast cancer.

Authors:  Gianmaria Miolo; Elena Muraro; Donatella Caruso; Diana Crivellari; Anthony Ash; Simona Scalone; Davide Lombardi; Flavio Rizzolio; Antonio Giordano; Giuseppe Corona
Journal:  Oncotarget       Date:  2016-06-28
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.