| Literature DB >> 28453688 |
Martin Šošic1, Mile Šikic1,2.
Abstract
Summary: We present Edlib, an open-source C/C ++ library for exact pairwise sequence alignment using edit distance. We compare Edlib to other libraries and show that it is the fastest while not lacking in functionality and can also easily handle very large sequences. Being easy to use, flexible, fast and low on memory usage, we expect it to be easily adopted as a building block for future bioinformatics tools. Availability and Implementation: Source code, installation instructions and test data are freely available for download at https://github.com/Martinsos/edlib, under the MIT licence. Edlib is implemented in C/C ++ and supported on Linux, MS Windows, and Mac OS. Contact: mile.sikic@fer.hr. Supplementary information: Supplementary data are available at Bioinformatics online.Entities:
Mesh:
Year: 2017 PMID: 28453688 PMCID: PMC5408825 DOI: 10.1093/bioinformatics/btw753
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Run time comparison of finding global alignment edit distance and alignment path for different sequence lengths and similarities
| Seq. sizes | Similarity (%) | Edlib (path) | SeqAn (path) | Parasail |
|---|---|---|---|---|
| 106 × 106 | 99 | 1.1s (2.58s) | 111.83s (252.71s) | 1234.5s |
| 106 × 106 | 90 | 7.16s (17.35s) | 111.51s (253.32s) | 1212.37s |
| 106 × 106 | 80 | 14.42s (34.6s) | 111.7s (252.95s) | 1247.44s |
| 106 × 106 | 70 | 33.8s (65s) | 112s (253.1s) | 1205.16s |
| 106 × 106 | 60 | 30.75s (71.59s) | 111.44s (252.61s) | 1212s |
| 105 × 105 | 99 | 0.01s (0.06s) | 1.01s (2.27s) | 4.79s |
| 105 × 105 | 90 | 0.13s (0.24s) | 0.98s (2.32s) | 4.68s |
| 105 × 105 | 80 | 0.2s (0.45s) | 0.98s (2.31s) | 4.79s |
| 105 × 105 | 70 | 0.16s (0.49s) | 1s (2.29s) | 4.76s |
| 105 × 105 | 60 | 0.4s (0.83s) | 1s (2.3s) | 4.76s |
The similarity of two sequences was calculated as . Two different DNA sequences were used for these tests. We artificially mutated them to achieve different similarities. Myers’s implementation is not included in this comparison as it does not support global alignment. For SeqAn and Edlib, time needed for finding of not only score but also of alignment path is provided in parentheses.