| Literature DB >> 20400755 |
Pierre Montalent1, Johann Joets.
Abstract
MOTIVATION: A large part of the maize B73 genome sequence is now available and emerging sequencing technologies will offer cheap and easy ways to sequence areas of interest from many other maize genotypes. One of the steps required to turn these sequences into valuable information is gene content prediction. To date, there is no publicly available gene predictor specifically trained for maize sequences. To this end, we have chosen to train the EuGène software that can combine several sources of evidence into a consolidated gene model prediction. AVAILABILITY: http://genome.jouy.inra.fr/eugene/cgi-bin/eugene_form.pl.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20400755 PMCID: PMC2859131 DOI: 10.1093/bioinformatics/btq123
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Comparison of several maize gene set statistics
| Training set | Curated set | Maize all | Maize cDNA | |
|---|---|---|---|---|
| Genes | 247 | 330 | 32 540 | 20 867 |
| Av. gene size (kb) | 3.5 | 4 | 3.7 | 3.5 |
| Exons | 1321 | 1520 | – | – |
| Av. no. of exon/gene | 5.4 | 4.6 | 5.3 | 4.7 |
| Av. exon size (kb) | 0.22 | 0.25 | 0.3 | 0.3 |
| Av. intron size (kb) | 0.52 | 0.6 | 0.52 | 0.58 |
| G + C gene (%) | 47.6 | – | 47.1 | 47.1 |
| G + C exon (%) | 53.4 | 55.4 | 52.7 | 53.4 |
| G + C intron (%) | 42.3 | 42.3 | 42.1 | 42.5 |
aThe curated maize gene training set built in this study.
bA maize set of curated genes from Haberer et al. (2005).
cAll maize genes in the B73 RefGen_v1 filtered set.
dMaize gene models supported by FLcDNAs are from Schnable et al. (2009).
EuGène-maize and GeneBuilder assessment comparison
| Missed Loci | Exon | ||
|---|---|---|---|
| Se (%) | Sp (%) | ||
| Genebuilder B73 RefGen_v1 | 4 | 79 | 74 |
| EuGène-maize | 1 | 73 | 84 |
Se, sensitivity (fraction of actual exons predicted among total actual exons); Sp, specificity (fraction of actual exons predicted among total predicted exons).