| Literature DB >> 23653640 |
Manrique Mata-Montero1, Nabil Shalaby, Bradley Sheppard.
Abstract
Obtaining unique oligos from an EST database is a problem of great importance in bioinformatics, particularly in the discovery of new genes and the mapping of the human genome. Many algorithms have been developed to find unique oligos, many of which are much less time consuming than the traditional brute force approach. An algorithm was presented by Zheng et al. (2004) which finds the solution of the unique oligos search problem efficiently. We implement this algorithm as well as several new algorithms based on some theorems included in this paper. We demonstrate how, with these new algorithms, we can obtain unique oligos much faster than with previous ones. We parallelize these new algorithms to further improve the time of finding unique oligos. All algorithms are run on ESTs obtained from a Barley EST database.Entities:
Year: 2013 PMID: 23653640 PMCID: PMC3638690 DOI: 10.1155/2013/793130
Source DB: PubMed Journal: Adv Bioinformatics ISSN: 1687-8027
Algorithm 1Algorithm for the unique oligos problem.
Algorithm 2Algorithm for the unique oligos problem.
Algorithm 3Algorithm for the unique oligos problem.
Results of serial algorithms.
| Algorithm |
|
|
| Dataset | Time taken (secs) | Non-unique oligos |
|---|---|---|---|---|---|---|
|
| 28 | 6 | 4 | 1 (78 ESTs) | 163 | 46,469 |
|
| 28 | 6 | 7 | 1 (78 ESTs) | 131 | 46,469 |
|
| 27 | 6 | 9 | 1 (78 ESTs) | 231 | 46,564 |
|
| 28 | 6 | 4 | 2 (2838 ESTs) | 197, 500 | 1,611,241 |
|
| 28 | 6 | 7 | 2 (2838 ESTs) | 117, 714 | 1,611,241 |
|
| 27 | 6 | 9 | 2 (2838 ESTs) | 94, 317 | 1,614,235 |
Results of parallel algorithms on 12 processors.
| Algorithm |
|
|
| Dataset | Time taken (secs) | Non-unique oligos |
|---|---|---|---|---|---|---|
|
| 28 | 6 | 4 | 1 (78 ESTs) | 33 | 46,469 |
|
| 28 | 6 | 7 | 1 (78 ESTs) | 29 | 46,469 |
|
| 27 | 6 | 9 | 1 (78 ESTs) | 66 | 46,564 |
|
| 28 | 6 | 4 | 2 (2838 ESTs) | 40, 420 | 1,611,241 |
|
| 28 | 6 | 7 | 2 (2838 ESTs) | 22, 848 | 1,611,241 |
|
| 27 | 6 | 9 | 2 (2838 ESTs) | 18, 375 | 1,614,235 |
Algorithm 4Findqmers (q).
Algorithm 5Map (string s, i, j).
Algorithm 6Substring (string s, i, j).
Algorithm 7goo2(q, l, d, posi, mut).
Algorithm 8goo(q, l, d, posi).