Literature DB >> 21410990

AnyExpress: integrated toolkit for analysis of cross-platform gene expression data using a fast interval matching algorithm.

Jihoon Kim1, Kiltesh Patel, Hyunchul Jung, Winston P Kuo, Lucila Ohno-Machado.   

Abstract

BACKGROUND: Cross-platform analysis of gene express data requires multiple, intricate processes at different layers with various platforms. However, existing tools handle only a single platform and are not flexible enough to support custom changes, which arise from the new statistical methods, updated versions of reference data, and better platforms released every month or year. Current tools are so tightly coupled with reference information, such as reference genome, transcriptome database, and SNP, which are often erroneous or outdated, that the output results are incorrect and misleading.
RESULTS: We developed AnyExpress, a software package that combines cross-platform gene expression data using a fast interval-matching algorithm. Supported platforms include next-generation-sequencing technology, microarray, SAGE, MPSS, and more. Users can define custom target transcriptome database references for probe/read mapping in any species, as well as criteria to remove undesirable probes/reads. AnyExpress offers scalable processing features such as binding, normalization, and summarization that are not present in existing software tools. As a case study, we applied AnyExpress to published Affymetrix microarray and Illumina NGS RNA-Seq data from human kidney and liver. The mean of within-platform correlation coefficient was 0.98 for within-platform samples in kidney and liver, respectively. The mean of cross-platform correlation coefficients was 0.73. These results confirmed those of the original and secondary studies. Applying filtering produced higher agreement between microarray and NGS, according to an agreement index calculated from differentially expressed genes.
CONCLUSION: AnyExpress can combine cross-platform gene expression data, process data from both open- and closed-platforms, select a custom target reference, filter out undesirable probes or reads based on custom-defined biological features, and perform quantile-normalization with a large number of microarray samples. AnyExpress is fast, comprehensive, flexible, and freely available at http://anyexpress.sourceforge.net.

Entities:  

Mesh:

Year:  2011        PMID: 21410990      PMCID: PMC3076267          DOI: 10.1186/1471-2105-12-75

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.307


  42 in total

1.  Singular value decomposition for genome-wide expression data processing and modeling.

Authors:  O Alter; P O Brown; D Botstein
Journal:  Proc Natl Acad Sci U S A       Date:  2000-08-29       Impact factor: 11.205

2.  Adjustment of systematic microarray data biases.

Authors:  Monica Benito; Joel Parker; Quan Du; Junyuan Wu; Dong Xiang; Charles M Perou; J S Marron
Journal:  Bioinformatics       Date:  2004-01-01       Impact factor: 6.937

3.  Exploration, normalization, and summaries of high density oligonucleotide array probe level data.

Authors:  Rafael A Irizarry; Bridget Hobbs; Francois Collin; Yasmin D Beazer-Barclay; Kristen J Antonellis; Uwe Scherf; Terence P Speed
Journal:  Biostatistics       Date:  2003-04       Impact factor: 5.899

4.  CrossChip: a system supporting comparative analysis of different generations of Affymetrix arrays.

Authors:  Sek Won Kong; Kyu-Baek Hwang; Richard D Kim; Byoung-Tak Zhang; Steven A Greenberg; Isaac S Kohane; Peter J Park
Journal:  Bioinformatics       Date:  2005-01-31       Impact factor: 6.937

5.  A comparison of meta-analysis methods for detecting differentially expressed genes in microarray experiments.

Authors:  Fangxin Hong; Rainer Breitling
Journal:  Bioinformatics       Date:  2008-01-18       Impact factor: 6.937

6.  DSGeo: software tools for cross-platform analysis of gene expression data in GEO.

Authors:  Ronilda Lacson; Erik Pitzer; Jihoon Kim; Pedro Galante; Christian Hinske; Lucila Ohno-Machado
Journal:  J Biomed Inform       Date:  2010-05-07       Impact factor: 6.317

7.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Authors:  Ben Langmead; Cole Trapnell; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-03-04       Impact factor: 13.583

8.  Evolving gene/transcript definitions significantly alter the interpretation of GeneChip data.

Authors:  Manhong Dai; Pinglang Wang; Andrew D Boyd; Georgi Kostov; Brian Athey; Edward G Jones; William E Bunney; Richard M Myers; Terry P Speed; Huda Akil; Stanley J Watson; Fan Meng
Journal:  Nucleic Acids Res       Date:  2005-11-10       Impact factor: 16.971

9.  Novel definition files for human GeneChips based on GeneAnnot.

Authors:  Francesco Ferrari; Stefania Bortoluzzi; Alessandro Coppe; Alexandra Sirota; Marilyn Safran; Michael Shmoish; Sergio Ferrari; Doron Lancet; Gian Antonio Danieli; Silvio Bicciato
Journal:  BMC Bioinformatics       Date:  2007-11-15       Impact factor: 3.169

10.  Improved precision and accuracy for microarrays using updated probe set definitions.

Authors:  Rickard Sandberg; Ola Larsson
Journal:  BMC Bioinformatics       Date:  2007-02-08       Impact factor: 3.169

View more
  8 in total

1.  iDASH: integrating data for analysis, anonymization, and sharing.

Authors:  Lucila Ohno-Machado; Vineet Bafna; Aziz A Boxwala; Brian E Chapman; Wendy W Chapman; Kamalika Chaudhuri; Michele E Day; Claudiu Farcas; Nathaniel D Heintzman; Xiaoqian Jiang; Hyeoneui Kim; Jihoon Kim; Michael E Matheny; Frederic S Resnic; Staal A Vinterbo
Journal:  J Am Med Inform Assoc       Date:  2011-11-10       Impact factor: 4.497

Review 2.  Genomic Approaches to Posttraumatic Stress Disorder: The Psychiatric Genomic Consortium Initiative.

Authors:  Caroline M Nievergelt; Allison E Ashley-Koch; Shareefa Dalvie; Michael A Hauser; Rajendra A Morey; Alicia K Smith; Monica Uddin
Journal:  Biol Psychiatry       Date:  2018-02-02       Impact factor: 13.382

3.  A regression framework for assessing covariate effects on the reproducibility of high-throughput experiments.

Authors:  Qunhua Li; Feipeng Zhang
Journal:  Biometrics       Date:  2017-11-29       Impact factor: 2.571

4.  Microarray profiling reveals the integrated stress response is activated by halofuginone in mammary epithelial cells.

Authors:  Yana G Kamberov; Jihoon Kim; Ralph Mazitschek; Winston P Kuo; Malcolm Whitman
Journal:  BMC Res Notes       Date:  2011-10-05

5.  virtualArray: a R/bioconductor package to merge raw data from different microarray platforms.

Authors:  Andreas Heider; Rüdiger Alt
Journal:  BMC Bioinformatics       Date:  2013-03-02       Impact factor: 3.169

6.  AmalgamScope: merging annotations data across the human genome.

Authors:  Georgia Tsiliki; Konstantinos Tsaramirsis; Sophia Kossida
Journal:  Biomed Res Int       Date:  2014-05-20       Impact factor: 3.411

7.  A semi-parametric statistical model for integrating gene expression profiles across different platforms.

Authors:  Yafei Lyu; Qunhua Li
Journal:  BMC Bioinformatics       Date:  2016-01-11       Impact factor: 3.169

8.  Revealing post-transcriptional microRNA-mRNA regulations in Alzheimer's disease through ensemble graphs.

Authors:  Rubén Armañanzas
Journal:  BMC Genomics       Date:  2018-09-24       Impact factor: 3.969

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.