| Literature DB >> 24866698 |
Emily Lu1, Miguel-Angel Elizondo-Riojas, Jeffrey T Chang, David E Volk.
Abstract
Next-generation sequencing results from bead-based aptamer libraries have demonstrated that traditional DNA/RNA alignment software is insufficient. This is particularly true for X-aptamers containing specialty bases (W, X, Y, Z, ...) that are identified by special encoding. Thus, we sought an automated program that uses the inherent design scheme of bead-based X-aptamers to create a hypothetical reference library and Markov modeling techniques to provide improved alignments. Aptaligner provides this feature as well as length error and noise level cutoff features, is parallelized to run on multiple central processing units (cores), and sorts sequences from a single chip into projects and subprojects.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24866698 PMCID: PMC4059528 DOI: 10.1021/bi500443e
Source DB: PubMed Journal: Biochemistry ISSN: 0006-2960 Impact factor: 3.162
Top X-Aptamer Sequences Aligned against a Library (not each other) Based on a Markov Model
| T1 | T2 | T3 | T4 | T5 | T6 | T7 | T8 | T9 | T10 | T11 | T12 | all | R1 | R2 | R3 | R4 | R5 | R6 | R7 | R8 | R9 | R10 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | 2 | 0 | 0 | 3426 | 0 | 28 | 0 | 2 | 6 | 0 | 2 | 3467 | GC | GTG | GT | GX | -- | TTC | GTG | --- | TX | GCC |
| 0 | 0 | 0 | 0 | 1391 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 1392 | XT | TGG | -- | GG | AT | TTC | GTG | GTG | GT | GGC |
| 1466 | 0 | 0 | 0 | 1 | 0 | 1 | 0 | 0 | 1 | 0 | 0 | 1469 | XT | GCG | XC | AC | AT | GCC | GTT | CCG | TG | GCC |
| 7 | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 1150 | 0 | 0 | 1159 | XT | TGG | XC | GG | CC | --- | GTT | --- | -- | GCC |
| 9 | 2 | 0 | 0 | 102 | 0 | 0 | 0 | 972 | 59 | 0 | 0 | 1144 | GC | TX- | TA | AC | CC | TTC | GTG | GTG | GT | GCG |
| 0 | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 1080 | 3 | 0 | 0 | 1085 | TG | TXG | TG | GX | AT | TGC | GTT | GTG | -- | GXG |
| 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 1 | 1001 | 0 | 0 | 1004 | TC | TXG | GT | GX | AT | TGC | GTT | --- | -- | GCC |
| 0 | 0 | 0 | 0 | 2 | 1 | 2 | 0 | 0 | 982 | 0 | 0 | 987 | TC | TXG | GT | GX | -- | TTC | GTG | CCG | TX | GCC |
| 2 | 1 | 0 | 0 | 0 | 0 | 3 | 0 | 1 | 959 | 0 | 0 | 966 | GC | GTG | -- | GG | AT | TGC | GTT | GTG | TG | GXG |
| 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 877 | 1 | 0 | 0 | 878 | -- | --- | TA | -- | CC | TGC | GTT | GAG | -- | GXG |
| 0 | 1 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 824 | 0 | 0 | 826 | TC | GTG | XC | GX | GT | XAC | GTG | GTG | GT | GCC |
| 722 | 0 | 0 | 0 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 774 | TG | GTG | TG | GG | CC | TTC | GTT | GXC | GT | GCC |
Sequence Frequencies Determined by Aptaligner and Exact Matcha
| sequence | Exact Match | Aptaligner | |
|---|---|---|---|
| 1 | GCGTGGTGTTTCGTGTTGCC | 5328 | 7602 |
| 2 | TTTGGGGATTTCGTGGTGGTGGC | 2664 | 3662 |
| 3 | GCGTGTGGTATTGCGTTGTCTTGGC | 2260 | 3616 |
| 4 | TTTGGTCGGCCGTTGCC | 2099 | 2686 |
| 5 | TTGCGTCACATGCCGTTCCGTGGCC | 2080 | 3245 |
| 6 | GCTTTAACCCTTCGTGGTGGTGCG | 1921 | 3793 |
| 7 | TGTTGTGGTATTGCGTTGTGGTG | 1837 | 2819 |
| 8 | TCTTGGTGTATTGCGTTGCC | 1785 | 2487 |
| 9 | TTTGGTCGTATTGCGTAGTCGTGGC | 1615 | 2718 |
| 10 | GCGGGATCCGTTGTG | 1615 | 2965 |
| 11 | GCGTGGGATTGCGTTGTGTGGTG | 1593 | 2836 |
| 12 | TCTTGGTGTTTCGTGCCGTTGCC | 1566 | 2298 |
Confidential sequences were scrambled postanalysis. There was an increase in sequence frequency for Aptaligner compared to Exact Match. These are the same sequences listed in Table 1, but they were scrambled after alignment.