Audrey Kauffmann, Tim F Rayner, Helen Parkinson, Misha Kapushesky, Margus Lukk, Alvis Brazma, Wolfgang Huber.
Abstract
SUMMARY: ArrayExpress is one of the largest public repositories of microarray datasets. R/Bioconductor provides a comprehensive suite of microarray analysis and integrative bioinformatics software. However, easy ways to import datasets from ArrayExpress into R/Bioconductor have been lacking. Here, we present such a tool, which is suitable for both interactive and automated use. AVAILABILITY: The ArrayExpress package is available from the Bioconductor project at http://www.bioconductor.org. A user's guide and examples are provided with the package.
Year: 2009 PMID: 19505942 PMCID: PMC2723004 DOI: 10.1093/bioinformatics/btp354
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Application of the ArrayExpress package to the ArrayExpress database in March 2009
| Category | Count | % of datasets |
| Number of accessions | 6117 | |
| Number of datasets | 6891 | |
| Objects created fully automatically | 5550 | 81% |
|   Complete objects created | 4017 | 58% |
|     Affymetrix | 3407 | |
|     Two-colour | 89 | |
|     One-colour | 521 | |
|   Incomplete objects | 1533 | 22% |
|     Missing feature annotation | 1121 | |
|     Missing sample annotation | 466 | |
| Objects created with manual selection of columns | 619 | 9% |
| Object creation failed | 722 | 10% |
The number of datasets is higher than the number of accessions because some accessions store multiple datasets (we consider measurements made with different arrays as different datasets). Manual setting of column names was necessary for 1082 (16%) of the 6891 datasets, and we were successful in 619 (9%) cases.
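The percentages in the table can be checked with a few lines of arithmetic. The sketch below is not part of the ArrayExpress package; it copies the counts from the table, and the dictionary keys and the `share` helper are names chosen here for illustration. It assumes, as the figures suggest, that every percentage is taken relative to the 6891 datasets.

```python
# Counts copied from the table above. The key names are illustrative labels,
# not identifiers used by the ArrayExpress package.
TOTAL_DATASETS = 6891

counts = {
    "automatic": 5550,   # objects created fully automatically
    "complete": 4017,    # complete objects
    "incomplete": 1533,  # incomplete objects
    "manual": 619,       # created with manual selection of columns
    "failed": 722,       # object creation failed
}

# Platform breakdown of the complete objects.
platforms = {"Affymetrix": 3407, "Two-colour": 89, "One-colour": 521}


def share(n, total=TOTAL_DATASETS):
    """Return n as a percentage of total."""
    return 100.0 * n / total


for label, n in counts.items():
    print(f"{label:<12}{n:>6}{share(n):>7.1f}%")

# Internal consistency of the table:
# the three outcomes partition all datasets,
assert counts["automatic"] + counts["manual"] + counts["failed"] == TOTAL_DATASETS
# automatic objects are either complete or incomplete,
assert counts["complete"] + counts["incomplete"] == counts["automatic"]
# and the platform counts add up to the complete objects.
assert sum(platforms.values()) == counts["complete"]
```

The two "missing annotation" rows (1121 and 466) sum to more than the 1533 incomplete objects, which is consistent with some datasets lacking both feature and sample annotation; the sketch therefore does not assert that they add up.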