| Literature DB >> 27973417 |
Nam Nguyen Quang1,2, Gérald Perret3, Frédéric Ducongé4,5.
Abstract
Aptamers are identified through an iterative process of evolutionary selection starting from a random pool containing billions of sequences. Simultaneously to the amplification of high-affinity candidates, the diversity in the pool is exponentially reduced after several rounds of in vitro selection. Until now, cloning and Sanger sequencing of about 100 sequences was usually used to identify the enriched candidates. However, High-Throughput Sequencing (HTS) is now extensively used to replace such low throughput sequencing approaches. Providing a deeper analysis of the library, HTS is expected to accelerate the identification of aptamers as well as to identify aptamers with higher affinity. It is also expected that it can provide important information on the binding site of the aptamers. Nevertheless, HTS requires handling a large amount of data that is only possible through the development of new in silico methods. Here, this review presents these different strategies that have been recently developed to improve the identification and characterization of aptamers using HTS.Entities:
Keywords: SELEX; aptamers; evolution; fitness landscape; high-throughput sequencing
Year: 2016 PMID: 27973417 PMCID: PMC5198051 DOI: 10.3390/ph9040076
Source DB: PubMed Journal: Pharmaceuticals (Basel) ISSN: 1424-8247
Figure 1Improvements of aptamers identification and characterization with the use of High-Throughput Sequencing. HTS analysis can investigate faster the enrichment of (sub-)sequences or predicted (sub-)structures. Variants of a same family sequence can be compared in order to extract mutants with higher affinity than the most abundant sequence. It is possible to more precisely characterize the interaction between aptamers and their targets by analyzing different conditions of selection (for instance, incubation with target’s variants or varying the composition of binding buffer). It is also possible to find aptamers specific for a defined condition. Finally, the impact of each selection parameter can be studied which could help to improve the SELEX experiments in the future.
Comparison of several programs dedicated to HTS analysis for SELEX experiments.
| Name | System | Rounds Analyzed | Clustering Based on Primary Sequence | Clustering Based on Secondary Predicted Structure | References |
|---|---|---|---|---|---|
| Not determined | Several | Clustering of the most enriched subsequences using hamming distance. | - | [ | |
| Mac/Linux/PC | Several | Alignment on a genome | - | [ | |
| Mac/Linux; Galaxy web platform | 2 | Levenshtein distance on sequence ‘seeds’; possibility to look for a known motif | - | [ | |
| Linux | Several | LSH method followed by k-mer distance on sequence ‘seeds’ | - | [ | |
| Available through services | Several | Levenshtein distance on sequence ‘seeds’ | Look for predicted structure motifs shared by several primary clusters | [ | |
| Available through services | Several | k-mer distance or shannon’s information entropy | Can detect stem loops shared by several primary clusters | [ | |
| Mac/Linux/PC | Several | Look for enrichment of k-mer with a predicted structure; then, primary alignment of k-mers with the same predicted structure is realized to form clusters | [ | ||
| Linux | Several | Look for four kinds of sub-structures in each sequence; primary alignment of the sub-structures to form clusters | [ | ||
| Mac/Linux | 2 | Rank sequences based on k-mer enrichment | - | [ | |
| Mac | Several | Hamming distance on two sequences (seed and control) to obtain (x,y) coordinates of all sequences to build the empirical ‘landscape’ | - | [ | |
| Linux | 2 | Rank variants of a primary sequence family based on their enrichment | - | [ | |