| Literature DB >> 29020925 |
Scott Christley1, Mikhail K Levin2, Inimary T Toby1, John M Fonner3, Nancy L Monson4,5, William H Rounds1, Florian Rubelt6, Walter Scarborough3, Richard H Scheuermann7,8,9, Lindsay G Cowell10.
Abstract
BACKGROUND: Pre-processing of high-throughput sequencing data for immune repertoire profiling is essential to insure high quality input for downstream analysis. VDJPipe is a flexible, high-performance tool that can perform multiple pre-processing tasks with just a single pass over the data files.Entities:
Keywords: Bioinformatics; Immune repertoire analysis; Rep-seq
Mesh:
Substances:
Year: 2017 PMID: 29020925 PMCID: PMC5637252 DOI: 10.1186/s12859-017-1853-z
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
List of analysis tools providing pre-processing functions
| Tool | Category | Composition statistics calculation | Merging paired-end reads | Barcode demultiplexing | PCR primer removal | Length filtering | Quality filtering | Homopolymer filtering | UMI consensus determination | Collapsing duplicate reads |
|---|---|---|---|---|---|---|---|---|---|---|
| AbStar [ | 1 | No | Yes | Yes | No | Yes | Yes | No | No | No |
| Decombinator [ | 2 | No | No | Yes | No | No | No | No | Yes | No |
| Ig-HTS cleaner [ | 3 | Yes | No | Yes | Yes | Yes | Yes | No | No | No |
| IgRepertoire constructor [ | 1 | No | Yes | Yes | No | No | No | No | Yes | No |
| ImmunediveRsity [ | 1 | No | No | No | No | Yes | Yes | No | No | No |
| iMonitor [ | 1 | No | Yes | No | No | Yes | Yes | No | No | No |
| IMSEQ [ | 1 | No | Yes | Yes | No | Yes | Yes | No | No | No |
| MIGEC [ | 2 | No | Yes | Yes | No | No | Yes | No | Yes | No |
| pRESTO [ | 3 | No | Yes | Yes | Yes | Yes | Yes | No | Yes | Yes |
| RTCR [ | 1 | No | No | Yes | No | Yes | Yes | No | No | No |
| VDJPipe | 3 | Yes | Yes | Yes | Yes | Yes | Yes | Yes | No | Yes |
| TCRklass [ | 1 | No | Yes | No | No | Yes | Yes | No | No | No |
| Abmining [ | 3 | No | Yes | Yes | No | Yes | Yes | Yes | No | No |
| ClonoCalc [ | 1 | No | Yes | Yes | No | No | No | No | No | No |
Execution times for the Greiff et al., [14] data set
| |Sequences| | VDJPipe (1 core) | pRESTO (4 cores) |
|---|---|---|
| 25,000 | 5.7 s (0.1 s) | 1 m 43.2 s (2.5 s) |
| 250,000 | 1 m 7.5 s (0.5 s) | 17 m 35.1 s (15.6 s) |
| 1,085,869 | 8 m 22.5 s (4.6 s) | 89 m 14.9 s (2 m 18.5 s) |
Execution times are the average of ten runs with the standard deviation in parentheses
Execution times for the Jiang et al., [15] data set
| |Sequences| | VDJPipe (1 core) | pRESTO (4 cores) |
|---|---|---|
| 50,000 | 6.5 s (0.2 s) | 1 m 30.5 s (1.2 s) |
| 277,826 | 40.3 s (0.2 s) | 15 m 5.1 s (4.9 s) |
Execution times are the average of ten runs with the standard deviation in parentheses