| Literature DB >> 15128450 |
Jeoffrey J Schageman1, Christopher J Horton, Sijing Niu, Harold R Garner, Alexander Pertsemlidis.
Abstract
ELXR (Exon Locator and Extractor for Resequencing) streamlines the process of determining exon/intron boundaries and designing PCR and sequencing primers for high-throughput resequencing of exons. We have pre-computed ELXR primer sets for all exons identified from the human, mouse, and rat mRNA reference sequence (RefSeq) public databases curated by the National Center for Biotechnology Information. The resulting exon-flanking PCR primer pairs have been compiled into a system called ELXRdb, which may be searched by keyword, gene name or RefSeq accession number.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15128450 PMCID: PMC416472 DOI: 10.1186/gb-2004-5-5-r36
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1ELXR sequence processing flow for each mRNA/EST sequence query. HTGS, high-throughput genomic sequence.
Figure 2Sample output from ELXR. Graphical depiction of the human apolipoprotein M gene structure derived from ELXR's Sim4 component.
Time and performance comparison
| Method | N | PT | PS | APT/gene |
| Manual | 864 | 891 | 84.5% | 1.50 h |
| ELXR | 154 | 164 | 89.3% | 0.18 h |
N indicates the number of exons and PT the number of associated primer pairs chosen and tested for PCR and sequencing for each primer-picking method. PS indicates the percentage of PCR products that resulted in high-quality sequence products and subsequent SNP detection. APT/gene indicates the average processing time per gene using each method. This table is not intended to describe the performance of Primer3 (which is used in both methods), but only to illustrate that whereas success was comparable with both methods, exon identification and primer-pair design was more than eight times faster using ELXR compared to nonautomated processing methods.
Statistical assessment of ELXRdb for mouse and human compared with analogous statistics initially reported by the public human genome sequencing project
| Mouse | Human | HGC [ | |
| Total RefSeq mRNAs processed | 5,110 | 12,260 | 1,804 |
| Number of primer pairs successfully designed | 59,874 | 179,065 | |
| Number of exons processed | 44,715 | 123,757 | |
| Single-exon genes | 348 | 853 | |
| Intron size | 3,644 bp | 4,573 bp | 3,365 bp |
| Internal intron size | 3,057 bp | 3,876 bp | |
| Exon size | 218 bp | 244 bp | |
| Exon number | 8.7 | 10 | 8.8 |
| Initial exon size | 204 bp | 240 bp | |
| Terminal exon size | 633 bp | 874 bp | |
| Internal exon size | 143 bp | 151 bp | 145 bp |
| Coding-sequence nucleotides | 1,315 bp | 1,558 bp | 1,340 bp |
| Coding sequence amino acids | 438 | 520 | 447 |
| Genomic extent | 38 kb | 62 kb | 27 kb |
Exon statistics were compiled from Sim4. Coding sequences are as defined in GenBank annotation. Empty fields in the HGC column indicate that there were no values for these measures provided in [1].
Figure 3The ELXRdb entry retrieval interface.