| Literature DB >> 25359895 |
Gabriel Renaud1, Udo Stenzel1, Tomislav Maricic1, Victor Wiebe1, Janet Kelso1.
Abstract
MOTIVATION: Pooling multiple samples increases the efficiency and lowers the cost of DNA sequencing. One approach to multiplexing is to use short DNA indices to uniquely identify each sample. After sequencing, reads must be assigned in silico to the sample of origin, a process referred to as demultiplexing. Demultiplexing software typically identifies the sample of origin using a fixed number of mismatches between the read index and a reference index set. This approach may fail or misassign reads when the sequencing quality of the indices is poor.Entities:
Mesh:
Year: 2014 PMID: 25359895 PMCID: PMC4341068 DOI: 10.1093/bioinformatics/btu719
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Fig. 1.Correlation between the Z1 score for reads aligned to the PhiX genome and the observed misassignment rate. Error bars were obtained using Wilson score intervals
Number of sequences demultiplexed by deML and deindexer in terms of TP, FP and FDR for 12 374 149 sequences
| Average error | deML | deindexer | CASAVA | |||||
|---|---|---|---|---|---|---|---|---|
| Rate per base | TP | FP | FDR | TP | FP | FDR | 0 mm | 1 mm |
| 0.002408 | 12 374 119 | 1 | (0.00%) | 12 372 007 | 0 | (0.00%) | 11 962 540 | 405 318 |
| 0.101145 | 11 898 460 | 205 | (0.00%) | 9 784 321 | 146 | (0.00%) | 2 783 384 | 4 381 588 |
| 0.196708 | 9 779 898 | 2761 | (0.03%) | 5 659 886 | 1683 | (0.03%) | 577 456 | 1 978 848 |
Note: The remaining columns present the number that could be identified using an approach allowing 1 mismatch (such as CASAVA).