| Literature DB >> 21961731 |
Nanette Coetzer1, Alexander A Myburg, Dave K Berger.
Abstract
BACKGROUND: Microarray technology has matured over the past fifteen years into a cost-effective solution with established data analysis protocols for global gene expression profiling. The Agilent-016047 maize 44 K microarray was custom-designed from EST sequences, but only reporter sequences with EST accession numbers are publicly available. The following information is lacking: (a) reporter - gene model match, (b) number of reporters per gene model, (c) potential for cross hybridization, (d) sense/antisense orientation of reporters, (e) position of reporter on B73 genome sequence (for eQTL studies), and (f) functional annotations of genes represented by reporters. To address this, we developed a strategy to annotate the Agilent-016047 maize microarray, and built a publicly accessible annotation database. DESCRIPTION: Genomic annotation of the 42,034 reporters on the Agilent-016047 maize microarray was based on BLASTN results of the 60-mer reporter sequences and their corresponding ESTs against the maize B73 RefGen v2 "Working Gene Set" (WGS) predicted transcripts and the genome sequence. The agreement between the EST, WGS transcript and gDNA BLASTN results were used to assign the reporters into six genomic annotation groups. These annotation groups were: (i) "annotation by sense gene model" (23,668 reporters), (ii) "annotation by antisense gene model" (4,330); (iii) "annotation by gDNA" without a WGS transcript hit (1,549); (iv) "annotation by EST", in which case the EST from which the reporter was designed, but not the reporter itself, has a WGS transcript hit (3,390); (v) "ambiguous annotation" (2,608); and (vi) "inconclusive annotation" (6,489). Functional annotations of reporters were obtained by BLASTX and Blast2GO analysis of corresponding WGS transcripts against GenBank.The annotations are available in the Maize Microarray Annotation Database http://MaizeArrayAnnot.bi.up.ac.za/, as well as through a GBrowse annotation file that can be uploaded to the MaizeGDB genome browser as a custom track.The database was used to re-annotate lists of differentially expressed genes reported in case studies of published work using the Agilent-016047 maize microarray. Up to 85% of reporters in each list could be annotated with confidence by a single gene model, however up to 10% of reporters had ambiguous annotations. Overall, more than 57% of reporters gave a measurable signal in tissues as diverse as anthers and leaves.Entities:
Year: 2011 PMID: 21961731 PMCID: PMC3198759 DOI: 10.1186/1746-4811-7-31
Source DB: PubMed Journal: Plant Methods ISSN: 1746-4811 Impact factor: 4.993
Figure 1Strategy followed to assign genomic and functional annotations to the reporters on the Agilent-016047 maize microarray. Using BLASTN and exonerate software, the 42,034 60-mer reporters were matched to available EST sequences, the maize B73 RefGen v2 genome and the WGS predicted transcripts. BLASTN and exonerate results were filtered and compared to test agreement between EST, WGS transcript and gDNA hits. Based on the agreement analysis, one of six genomic annotation groups was assigned for each reporter. Functional annotations of reporters were based on the functional annotations of their corresponding WGS transcripts. The data has been made accessible from the Maize Microarray Annotation Database http://MaizeArrayAnnot.bi.up.ac.za/.
Number of reporters placed in the genomic annotation groups of the maize Agilent-016047 microarray
| Annotation groups | |||
|---|---|---|---|
| sense gene model | 23668 (56.3%) | 20752 | 88% |
| antisense gene model | 4330 (10.3%) | 2470 | 57% |
| gDNA | 1549 (3.7%) | 985 | 64% |
| EST | 3390 (8.1%) | 1920 | 60% |
| ambiguous | 2608 (6.2%) | 2208 | 84% |
| inconclusive | 6489 (15.4%) | 4038 | 61% |
| Total | 42034 (100%) | 32373 | 77% |
The number and percentage of reporters in each annotation group.
Hybridization to fifty Agilent-016047 arrays by maize leaf cDNA from a segregating population
Percentage calculated from the number of reporters with signal/noise > 3 in maize leaves divided by the total number of reporters in each annotation group.
List of studies using the Agilent-016047 maize microarray
| Publication | Maize Tissue | Signal | No Signal | Criteria to define a 'Signal' |
|---|---|---|---|---|
| Ma | anther and pollen | > 24 K | < 18 K | 3 out of 4 hybridization signals > background (99% confidence) |
| Casati | adult leaves | 26 K | 16 K | median reporter intensity > background |
| Skibbe | developing anthers | 30 K | 14 K | reporter intensity > 2.6 × SD of background |
| Wang | fertile anthers | 32 K | 10 K | not specified |
| Rajhi | roots | no info | no info | not specified |
| Current study | adult leaves | 32 K | 10 K | signal/noise > 3 |
Case studies using the Maize Microarray Annotation Database
| Reference | Description * | Table (see Reference) | Total | Genomic Annotation Groups# | |||||
|---|---|---|---|---|---|---|---|---|---|
| GMS | GMA | G | E | AM | I | ||||
| Ma | a | Table S3 | 2285 | 1614 (70.6%) | 141 (6.2%) | 54 (2.4%) | 108 (4.7%) | 148 (6.5%) | 220 (9.6%) |
| Ma | b | Table S4 | 281 | 209 (74.4%) | 11 (3.9%) | 7 (2.5%) | 13 (4.6%) | 27 (9.6%) | 14 (5.0%) |
| Casati | c | Table S1 | 2092 | 1373 (65.6%) | 142 (6.8%) | 58 (2.8%) | 111 (5.3%) | 134 (6.4%) | 274 (13.1%) |
| Skibbe | d | Table S1 | 449 | 329 (73.3%) | 6 (1.3%) | 18 (4.0%) | 16 (3.6%) | 35 (7.8%) | 45 (10.0%) |
| Skibbe | e | Table S1 | 399 | 279 (69.9%) | 15 (3.8%) | 16 (4.0%) | 18 (4.5%) | 23 (5.8%) | 48 (12.0%) |
| Wang | f | Table S3 | 416 | 159 (38.2%) | 58 (13.9%) | 22 (5.3%) | 57 (13.7%) | 14 (3.4%) | 106 (25.5%) |
| Rajhi | g | Table S2 | 239 | 177 (74.1%) | 15 (6.3%) | 5 (2.1%) | 10 (4.2%) | 12 (5.0%) | 20 (8.4%) |
| Rajhi | h | Table S3 | 336 | 278 (82.7%) | 7 (2.1%) | 8 (2.4%) | 14 (4.2%) | 15 (4.5%) | 14 (4.2%) |
* Description
a = Transcripts differentially expressed between meiotic and post-meiotic stages
b = Zinc finger-related proteins
c = Transcripts that are expressed differentially between mbd101 and chc101 RNAi transgenic plants and WT non transgenic siblings under control and/or UV-B conditions.
d = Up-regulated genes between Mu-active vs inactive lines (Mitotic stage)
e = Down-regulated genes between Mu-active vs inactive lines (Mitotic stage)
f = 1.0 mm stage-specific genes of ms8 anthers expressed at later stages in normal anthers
g = Genes up-regulated in maize root cortex during aerenchyma formation
h = Genes down-regulated in maize root cortex during aerenchyma formation
Genomic annotations:
GMS = Annotation by sense gene model
GMA = Annotation by antisense gene model
G = Annotation by gDNA (Genomic position)
E = Annotation by EST
AM = Ambiguous annotation
I = Inconclusive annotation