Ernur Saka, Benjamin J Harrison, Kirk West, Jeffrey C Petruska, Eric C Rouchka.
Abstract
BACKGROUND: Since the introduction of microarrays in 1995, researchers worldwide have used both commercial and custom-designed microarrays to study differential expression of transcribed genes. Public databases such as ArrayExpress and the Gene Expression Omnibus (GEO) have made millions of samples readily available. One main drawback of microarray data analysis is the selection of probes to represent a specific transcript of interest, particularly because transcript-specific knowledge (notably of alternative splicing) is dynamic.
Keywords: Affymetrix®; Custom CDF; Microarrays; Probe group; Probe set; Probeset
Year: 2017 PMID: 29244006 PMCID: PMC5731501 DOI: 10.1186/s12864-017-4266-5
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Release dates of databases used by NetAffx v35 annotations and current database versions
| GEO Platform | Organism | UniGene (NetAffx) | UniGene (Current) |
|---|---|---|---|
| GPL570 | Homo sapiens | Mar-10 | Nov-12 |
| GPL1261 | Mus musculus | Jan-10 | Jul-12 |
| GPL1355 | Rattus norvegicus | Mar-10 | Nov-12 |
| GPL198 | Arabidopsis thaliana | May-09 | Jul-12 |

Databases common to all four GEO platforms
| Annotation Source | Ensembl | RefSeq | GenBank | Entrez Gene | miRBase |
|---|---|---|---|---|---|
| NetAffx | Aug-14 | Jul-14 | Jun-14 | May-14 | Jul-12 |
| Current | Mar-16 | Mar-16 | Apr-16 | May-16 | Jun-14 |
Top Affymetrix® in situ oligonucleotide arrays found in GEO
| GEO Platform | Title | Number of Probes (PM) | Number of Probe Sets | Number of Samples |
|---|---|---|---|---|
| GPL570 | Human Genome U133 Plus 2.0 Array | 604,258 | 54,675 | 120,920 |
| GPL1261 | Mouse Genome 430 2.0 Array | 496,468 | 45,101 | 48,087 |
| GPL1355 | Rat Genome 230 2.0 Array | 342,410 | 31,099 | 18,912 |
| GPL198 | Arabidopsis ATH1 Genome Array | 251,078 | 22,810 | 12,624 |
Alternative CDFs for the top Affymetrix® in situ oligonucleotide arrays found in GEO
| GEO Platform | Number of Alternative CDFs | Number and Percent of Samples Using Alternative CDFs |
|---|---|---|
| GPL570 | 54 | 6,403 (5.0%) |
| GPL1261 | 36 | 1,984 (4.0%) |
| GPL1355 | 12 | 460 (2.4%) |
| GPL198 | 9 | 642 (4.8%) |
Fig. 1 Flow chart for region-based probe annotation framework
Fig. 2 Creating probe sets for different types of custom CDF based on probe mapping to gene regions
Custom CDF naming examples
| CDF Type | Probe Set Name |
|---|---|
| Region-based | ENSG00000001036_exon_- |
| Region-based | ENSG00000001084_UTR_- |
| Region-based | ENSG00000001167_CDS_+ |
| Gene-based | ENSG00000001461 |
| Transcript-based | ENST00000489806 |
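
The naming scheme in the table above can be read mechanically. The following is a minimal sketch, assuming that region-based names always take the form `<Ensembl gene ID>_<region>_<strand>` and that gene-based and transcript-based names are bare Ensembl gene and transcript IDs, as the five examples suggest. The function `parse_probe_set_name` and the `ProbeSetName` fields are hypothetical names chosen for illustration and are not part of the authors' software.

```python
import re
from typing import NamedTuple, Optional


class ProbeSetName(NamedTuple):
    cdf_type: str            # "region", "gene", or "transcript"
    ensembl_id: str          # e.g. "ENSG00000001036"
    region: Optional[str]    # "exon", "UTR", "CDS", ...; None unless region-based
    strand: Optional[str]    # "+" or "-"; None unless region-based


# Assumption: IDs look like Ensembl stable IDs (ENSG.../ENST..., optionally with
# a species infix such as ENSMUSG...). This is inferred from the table examples.
_REGION = re.compile(r"(ENS[A-Z]*G\d+)_([A-Za-z0-9]+)_([+-])")
_GENE = re.compile(r"ENS[A-Z]*G\d+")
_TRANSCRIPT = re.compile(r"ENS[A-Z]*T\d+")


def parse_probe_set_name(name: str) -> ProbeSetName:
    """Classify a custom CDF probe set name and split it into its parts."""
    match = _REGION.fullmatch(name)
    if match:
        gene_id, region, strand = match.groups()
        return ProbeSetName("region", gene_id, region, strand)
    if _GENE.fullmatch(name):
        return ProbeSetName("gene", name, None, None)
    if _TRANSCRIPT.fullmatch(name):
        return ProbeSetName("transcript", name, None, None)
    raise ValueError(f"unrecognized probe set name: {name!r}")


if __name__ == "__main__":
    for example in ("ENSG00000001036_exon_-", "ENSG00000001461", "ENST00000489806"):
        print(parse_probe_set_name(example))
```

Keeping the strand as part of the region-based name means two regions of the same gene ID on opposite strands would still parse to distinct probe sets under this sketch.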
Summary of probes used for gene and transcript based custom CDFs
| | Human Genome U133 Plus 2.0 Array (Gene) | Human Genome U133 Plus 2.0 Array (Transcript) | Rat Genome 230 2.0 Array (Gene) | Rat Genome 230 2.0 Array (Transcript) | Mouse Genome 430 2.0 Array (Gene) | Mouse Genome 430 2.0 Array (Transcript) |
|---|---|---|---|---|---|---|
| Number of Probes Used | 414,701 | 504,419 | 162,356 | 205,671 | 323,917 | 395,884 |
| Number of Probe Sets Constructed | 22,651 | 26,096 | 13,150 | 14,466 | 19,282 | 20,980 |
| Average Number of Probes Per Probe Set | 18 | 18 | 12 | 14 | 16 | 18 |
Summary of probes used for region based custom CDFs
| | Human Genome U133 Plus 2.0 Array | Rat Genome 230 2.0 Array | Mouse Genome 430 2.0 Array |
|---|---|---|---|
| Number of Probes Aligned to Genome | 822,681 | 321,905 | 637,942 |
| Number of Probes Used | 414,701 | 162,356 | 323,917 |
| Number of Probe Sets Constructed | 33,916 | 19,839 | 28,963 |
| Average Number of Probes Per Probe Set | 12 | 8 | 11 |
Number of mapped probes for custom CDF construction
| GeneChip® | Number of PM Probes | Number of PM Probes Mapped Uniquely | Number of PM Probes Mapped to Multiple Locations | Number of PM Probes Not Aligned |
|---|---|---|---|---|
| Human Genome U133 Plus 2.0 Array | 603,158 | 525,985 | 36,493 | 40,680 |
| Rat Genome 230 2.0 Array | 341,459 | 288,319 | 26,027 | 27,113 |
| Mouse Genome 430 2.0 Array | 495,374 | 427,758 | 28,444 | 39,173 |
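
To compare the three arrays on a common scale, the counts above can be expressed as fractions of each array's PM probes. The sketch below is illustrative only: the counts are copied from the table, while the dictionary `MAPPING_COUNTS` and the function `mapping_fractions` are hypothetical names introduced here.

```python
# PM probe counts copied from the table above, per array:
# (total PM probes, uniquely mapped, mapped to multiple locations, not aligned)
MAPPING_COUNTS = {
    "Human Genome U133 Plus 2.0 Array": (603_158, 525_985, 36_493, 40_680),
    "Rat Genome 230 2.0 Array": (341_459, 288_319, 26_027, 27_113),
    "Mouse Genome 430 2.0 Array": (495_374, 427_758, 28_444, 39_173),
}


def mapping_fractions(total: int, unique: int, multiple: int, unaligned: int) -> dict:
    """Return each mapping category as a fraction of the reported PM probe total."""
    return {
        "unique": unique / total,
        "multiple": multiple / total,
        "unaligned": unaligned / total,
    }


if __name__ == "__main__":
    for array, counts in MAPPING_COUNTS.items():
        fractions = mapping_fractions(*counts)
        summary = ", ".join(f"{key} {value:.1%}" for key, value in fractions.items())
        print(f"{array}: {summary}")
```

Only uniquely mapped probes are candidates for the custom probe sets summarized in the earlier tables, so the `unique` fraction is the upper bound on probe retention for each array.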
Fig. 3 Number of shared and distinct differentially expressed genes identified with our custom region-based and gene-based CDFs compared with brain array custom CDFs. a Day 7 versus naïve. b Day 14 versus naïve
DEGs detected by our gene based CDF and GPL570
| Cases | Our Gene Based CDF | GPL570 | Common |
|---|---|---|---|
| DS1 versus Ts21 | 810 | 2,421 | 616 |
| DS2 versus Ts21 | 668 | 1,840 | 337 |
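
The degree of agreement between the two annotations can be quantified from the counts above. This is a minimal sketch, assuming that "Common" counts the genes present in both DEG lists; the function `overlap_summary` and its key names are hypothetical, and the Jaccard index is one convenient choice of overlap measure rather than a statistic reported in the source.

```python
def overlap_summary(ours: int, reference: int, common: int) -> dict:
    """Summarize overlap between two DEG lists given their sizes and intersection."""
    union = ours + reference - common  # inclusion-exclusion
    return {
        "share_of_ours": common / ours,            # our DEGs also found with GPL570
        "share_of_reference": common / reference,  # GPL570 DEGs also found with ours
        "jaccard": common / union,                 # intersection over union
    }


# Counts copied from the table above: (our gene-based CDF, GPL570, common)
DEG_COUNTS = {
    "DS1 versus Ts21": (810, 2421, 616),
    "DS2 versus Ts21": (668, 1840, 337),
}

if __name__ == "__main__":
    for case, counts in DEG_COUNTS.items():
        summary = overlap_summary(*counts)
        print(case, {key: round(value, 3) for key, value in summary.items()})
```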
Fig. 4 GRIK4 probe set expression levels within the gene, exon, and 3′ UTR regions
Fig. 5 VEGFA probe set expression levels within the gene, exon, and 3′ UTR regions