| Literature DB >> 15473908 |
Feng-Mao Lin1, Hsien-Da Huang, Yu-Chung Chang, Jorng-Tzong Horng.
Abstract
BACKGROUND: Information on the occurrence of sequence features in genomes is crucial to comparative genomics, evolutionary analysis, the analyses of regulatory sequences and the quantitative evaluation of sequences. Computing the frequencies and the occurrences of a pattern in complete genomes is time-consuming.Entities:
Mesh:
Substances:
Year: 2004 PMID: 15473908 PMCID: PMC526275 DOI: 10.1186/1471-2164-5-78
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Applications and the relevant data in the database.
| Oligonucleotide occurrences | Positions | 1. Oligonucleotide analysis for regulatory sequences |
| Oligonucleotide frequencies | Counts | 1. Oligonucleotide analysis for regulatory sequences |
| Gene coding regions | Positions | 1. Oligonucleotide analysis for regulatory sequences |
| Repetitive element frequencies (LINE, SINE, Alu, and so on) | Counts | Evolutionary analysis |
| Repetitive element occurrences | Positions | Evolutionary analysis |
| Tandem repeats | Positions | Prediction for genetic disease marker |
Number of occurrences of the repetitive oligonucleotides in yeast genome
| ACCCTA | 2,724 | 822 | 793 | CAATCCA | 1,895 | 655 | 343 |
| ACCCTC | 2,917 | 881 | 795 | CGTCTCC | 592 | 199 | 148 |
| AGTACT | 3,073 | 933 | 879 | CGTCTGA | 652 | 196 | 165 |
| AGTAGA | 6,673 | 1,970 | 1,798 | ACAAACTA | 594 | 179 | 183 |
| AGTAGC | 4,912 | 1,545 | 1,299 | ACAAACTC | 514 | 175 | 112 |
| GATACC | 4,829 | 1,638 | 1,005 | CACAGAAAC | 146 | 38 | 46 |
| GATAGA | 7,030 | 2,163 | 1,807 | CACAGAAGA | 164 | 57 | 39 |
| TGGTAA | 10,513 | 3,493 | 2,214 | ACATATAAAAA | 54 | 9 | 29 |
| TGTAAA | 11,364 | 3,439 | 3,418 | ACATATAAAAC | 139 | 34 | 56 |
| AAGGGGA | 1,172 | 299 | 421 | ACATATAAAAG | 36 | 7 | 22 |
| AAGGGGC | 626 | 142 | 256 | ACTTATGTCATC | 57 | 17 | 23 |
| AGAGTGG | 983 | 310 | 271 | ACTTCTAGTATA | 159 | 44 | 67 |
| AGAGTTA | 1,859 | 610 | 441 | ACTTTTTTTTCT | 32 | 5 | 21 |
| CAATCAG | 1,358 | 445 | 320 | ACTTTTTTTTTC | 50 | 6 | 33 |
Output styles of the database.
| Oligonucleotide occurrences | Positions | Web interface |
| Oligonucleotide frequencies | Counts | Web interface and flat-file |
| Gene coding regions | Positions | Web interface |
| Repetitive element frequencies (LINE, SINE, Alu, and so on) | Counts | Web interface and flat-file |
| Repetitive element occurrences | Positions | Web interface |
| Tandem repeats | Positions | Web interface |
Figure 1Web query interface (1/2).
Figure 2Web query interface (2/2).
Figure 3Database entries in flat-file format.
Figure 4The occurrence positions of the oligonucleotide are found by Oligos Locator.