Mike L Smith, Keith A Baggerly, Henrik Bengtsson, Matthew E Ritchie, Kasper D Hansen.
Abstract
The IDAT file format is used to store BeadArray data from the many genome-wide profiling platforms offered by Illumina Inc. This proprietary format is output directly from the scanner and compactly stores summary intensities for each probe type on an array. A lack of open-source tools to process IDAT files has hampered their uptake by the research community beyond the standard step of using the vendor's software to extract the data they contain into a human-readable text format. To fill this void, we have developed the illuminaio package, which parses IDAT files from any BeadArray platform, including the decryption of files from Illumina's gene expression arrays. illuminaio provides the first open-source package for this task, and will promote wider uptake of the IDAT format as a standard for sharing Illumina BeadArray data in public databases, in the same way that the CEL file serves as the standard for the Affymetrix platform.
Year: 2013 PMID: 24701342 PMCID: PMC3968891 DOI: 10.12688/f1000research.2-264.v1
Source DB: PubMed Journal: F1000Res ISSN: 2046-1402
Figure 1. A typical BeadArray analysis workflow.
Scanning of BeadChips is performed using the iScan or BeadScan control software, producing IDAT files. Currently, these are read by GenomeStudio, where the user can either perform an analysis within that software or export the intensities to a text file for processing by alternative means. illuminaio removes this reliance on GenomeStudio. The intention is for analysis packages to incorporate its routines, effectively merging the dashed region and simplifying the analysis process for end users.
Summarising the file formats for various BeadArray platforms.
| Array type | File format | No. data fields |
|---|---|---|
| SNP genotyping | Binary | 4 |
| Methylation | Binary | 4 |
| Gene expression | Encrypted XML | 10 |
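
The split between binary and encrypted XML files in the table can, in principle, be detected from the first few bytes of a file. The following Python sketch is illustrative only and is not part of illuminaio. It assumes a header layout that is not stated in this abstract: a four-byte ASCII magic string `IDAT` followed by a little-endian 32-bit version number, with version 1 taken to mean encrypted XML and version 3 taken to mean binary. The function names `sniff_idat_header` and `sniff_idat_file` are hypothetical.

```python
import struct

# ASSUMPTION (not stated in the abstract): the file starts with the ASCII
# magic "IDAT", followed by a little-endian 32-bit version number, and the
# version distinguishes the two storage formats listed in the table.
ASSUMED_VERSIONS = {
    1: "encrypted XML (gene expression)",
    3: "binary (SNP genotyping / methylation)",
}


def sniff_idat_header(header: bytes) -> str:
    """Classify an IDAT file from its first 8 bytes (hypothetical helper)."""
    if len(header) < 8:
        raise ValueError("need at least 8 header bytes")
    if header[:4] != b"IDAT":
        raise ValueError("missing 'IDAT' magic; not an IDAT file")
    # Unpack the assumed little-endian signed 32-bit version field.
    (version,) = struct.unpack("<i", header[4:8])
    return ASSUMED_VERSIONS.get(version, f"unknown (version {version})")


def sniff_idat_file(path: str) -> str:
    """Read the first 8 bytes of a file on disk and classify them."""
    with open(path, "rb") as fh:
        return sniff_idat_header(fh.read(8))
```

A check of this kind only selects a parsing route; the actual decoding of the data fields, and the decryption of gene expression files, is what the package itself provides.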