| Literature DB >> 23308210 |
Huan Cai1, Hongyu Chen, Tie Yi, Caitlin M Daimon, John P Boyle, Chris Peers, Stuart Maudsley, Bronwen Martin.
Abstract
With the development of increasingly large and complex genomic and proteomic data sets, an enhancement in the complexity of available Venn diagram analytical programs is becoming increasingly important. Current freely available Venn diagram programs often fail to represent extra complexity among datasets, such as regulation pattern differences between different groups. Here we describe the development of VennPlex, a program that illustrates the often diverse numerical interactions among multiple, high-complexity datasets, using up to four data sets. VennPlex includes versatile output features, where grouped data points in specific regions can be easily exported into a spreadsheet. This program is able to facilitate the analysis of two to four gene sets and their corresponding expression values in a user-friendly manner. To demonstrate its unique experimental utility we applied VennPlex to a complex paradigm, i.e. a comparison of the effect of multiple oxygen tension environments (1-20% ambient oxygen) upon gene transcription of primary rat astrocytes. VennPlex accurately dissects complex data sets reliably into easily identifiable groups for straightforward analysis and data output. This program, which is an improvement over currently available Venn diagram programs, is able to rapidly extract important datasets that represent the variety of expression patterns available within the data sets, showing potential applications in fields like genomics, proteomics, and bioinformatics.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23308210 PMCID: PMC3538763 DOI: 10.1371/journal.pone.0053388
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1VennPlex data input and output scenarios.
(A) Data input in Excel™ files. Two gene sets and corresponding z-ratios were organized into an Excel™ input file and highlighted in yellow and blue color separately. Three set input data were highlighted in yellow, blue, and green accordingly. Four set input data were highlighted in yellow, blue, green, and red, respectively. (B) Venn diagram drawn by VennPlex program. The numbers in the different sections of 2, 3, 4-set Venn diagram indicate the output set orders in the resultant extracted datasheet in panel (C). To indicate the polarity of numerical regulation of each factor within each set or intersection we have employed a two or three-way key system. The number of factors unique to a set, with a positive regulation polarity are identified with an italic numeral. The number of factors unique to a set, with a negative regulation polarity are identified with an underlined numeral. The number of factors common between multiple sets are indicated with a red colored numeral. This is indicated in a box key for each panel in (B). (C) Data output in Excel™ files.
Figure 2Three-set Venn diagram analysis of multiple oxygen tension-response transcriptional alterations in astrocytes.
(A) 3-set Venn diagram drawn by Venny. 1%, 4%, 9% represent 1%, 4%, 9% O2 conditions, compared to the 20% O2 condition. (B) 3-set Venn diagram generated by VennPlex. The numbers in each section show the number and type (up-, down- or contra-regulated) transcriptional alterations. As stated in Figure 1 the boxed key helps indicate the number of factors in each set or intersection that are upregulated (italic), downregulated (underlined) or contra-regulated (red). For example, there are 60 transcripts common to all three sets, 27 are upregulated (italic) in all three sets, 23 are downregulated (underlined) in all three sets, and 10 (red) demonstrate a mixture of regulation polarities (contra-regulated) between two of the input sets.