| Literature DB >> 21756356 |
Victor Missirian1, Luca Comai, Vladimir Filkov.
Abstract
BACKGROUND: TILLING (Targeting induced local lesions IN genomes) is an efficient reverse genetics approach for detecting induced mutations in pools of individuals. Combined with the high-throughput of next-generation sequencing technologies, and the resolving power of overlapping pool design, TILLING provides an efficient and economical platform for functional genomics across thousands of organisms.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21756356 PMCID: PMC3150297 DOI: 10.1186/1471-2105-12-287
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1Bi-dimensional arrangement of the overlapping pools experiments. There are 96 wells and 20 pools (12 column- and 8 row-pools) in our bidimensional pooling scheme. Thus, each individual is present in two pools.
Figure 2Example base positions with mutations in the data of varying difficulty for identification. Three mutations ordered, left to right, by increasing difficulty to identify visually. Left and middle, C → T mutations at positions 552 and 677, respectively, in wheat genes APHYC and AVRN, resp. Right, an A → G mutation at position 838 in rice gene OsRDR2.Each dot in the plots is a library pool, and on the y-axis is the frequency of the base to which the reference has been mutated.
Performance of CAMBa at various thresholds on the Rice and Wheat data sets
| Rice | Wheat | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Pred | Conf | FP | FN | Sens | Spec | Pred | Conf | FP | FN | Sens | Spec | ||
| 0 | 308 | 11 | 233 | 0 | 100.00 | 98.71 | -10 | 310 | 37 | 208 | 5 | 95.33 | 95.44 |
| 1 | 131 | 11 | 56 | 0 | 100.00 | 99.69 | -5 | 172 | 36 | 73 | 8 | 92.52 | 98.40 |
| 2 | 75 | 10 | 7 | 7 | 90.67 | 99.96 | 0 | 107 | 36 | 8 | 8 | 92.52 | 99.82 |
| 3 | 54 | 10 | 0 | 21 | 72.00 | 100.00 | 5 | 92 | 33 | 1 | 16 | 85.05 | 99.98 |
| 4 | 46 | 9 | 0 | 29 | 61.33 | 100.00 | 10 | 81 | 31 | 0 | 26 | 75.70 | 100.00 |
| 5 | 40 | 7 | 0 | 35 | 53.33 | 100.00 | 15 | 59 | 21 | 1 | 49 | 54.21 | 99.98 |
Performance of CAMBa at various thresholds on the Rice (left) and Wheat data sets. F(t) = CAMBa threshold, Pred = # predicted mutations at that threshold, Conf = # of the predicted mutations which overlap with the confirmed mutations (11 in rice and 39 in wheat), FP = # false positives, FN = # false negatives, Sens = Sensitivity, Spec = Specificity. We underscore the line associated with the recommended number of predictions for both Rice and Wheat. We restrict our estimate of the number of False Positives to be at least 0.
Performance comparison of CAMBa to other methods using default parameters on Rice and Wheat
| Rice | Wheat | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Method | Pred | Conf | FP | FN | Sens | Spec | Pred | Conf | FP | FN | Sens | Spec |
| CAMBa | 75 | 10 | 7 | 7 | 90.67 | 99.96 | 107 | 36 | 8 | 8 | 92.52 | 99.82 |
| Outlier | 75 | 8 | 20 | 20 | 73.33 | 99.89 | 107 | 36 | 8 | 8 | 92.52 | 99.82 |
| CRISP | 0 | 0 | 0 | 75 | 0.00 | 100.00 | 121 | 32 | 33 | 19 | 82.24 | 99.28 |
| VarScan | 3 | 0 | 3 | 75 | 0.00 | 99.98 | 30 | 10 | 3 | 80 | 25.23 | 99.93 |
| ComSeq | 9599 | 10 | 9531 | 7 | 90.67 | 47.15 | 154 | 36 | 55 | 8 | 92.52 | 98.79 |
| Poisson | 6889 | 8 | 6834 | 20 | 73.33 | 62.10 | 218 | 35 | 122 | 11 | 89.72 | 97.33 |
Performance comparison of CAMBa to other methods using default parameters, on the Rice (left) and the Wheat data sets. The column abbreviations are as in Table 1. All methods are evaluated on their ability to predict the carrier of each candidate mutation, except CRISP, which cannot make such predictions.
Figure 3Variance in coverage across libraries in the data. Normalized variance of coverage levels across libraries in TILLING genes in rice (black) and wheat (gray). HLP1 is on top.
Performance comparison of CAMBa to other methods using default parameters on Wheat with increased variance
| Method | Wheat with increased variance | |||||
|---|---|---|---|---|---|---|
| Pred | Conf | FP | FN | Sens | Spec | |
| CAMBa | 113 | 28 | 36 | 30 | 71.96 | 99.21 |
| Outlier | 113 | 20 | 58 | 52 | 51.40 | 98.73 |
| CRISP | 86 | 25 | 17 | 38 | 64.49 | 99.63 |
| VarScan | ||||||
| ComSeq | 743 | 30 | 661 | 25 | 76.64 | 85.51 |
| Poisson | 60 | 18 | 11 | 58 | 45.79 | 99.76 |
Performance comparison of CAMBa to other methods using default parameters, on the Wheat data set after artificially increasing the coverage variance. The column abbreviations are as in Table 1. All methods are evaluated on their ability to predict the carrier of each candidate mutation, except CRISP, which cannot make such predictions. We were unable to report results for VarScan due to technical issues in generating a modified pileup of reads.
Performance comparison of CAMBa to other methods using default parameters on Rice with lowered variance
| Method | Rice with lowered variance | |||||
|---|---|---|---|---|---|---|
| Pred | Conf | FP | FN | Sens | Spec | |
| CAMBa | 73 | 10 | 9 | 6 | 91.43 | 99.95 |
| Outlier | 73 | 10 | 9 | 6 | 91.43 | 99.95 |
| CRISP | 0 | 0 | 0 | 70 | 0.00 | 100.00 |
| VarScan | 3 | 0 | 3 | 70 | 0.00 | 99.98 |
| ComSeq | 8106 | 10 | 8042 | 6 | 91.43 | 51.25 |
| Poisson | 6247 | 8 | 6196 | 19 | 72.86 | 62.44 |
Performance comparison of CAMBa to other methods using default parameters, on the Rice data set, after excluding HLP1 to lower the mean coverage variance. The column abbreviations are as in Table 1. All methods are evaluated on their ability to predict the carrier of each candidate mutation, except CRISP, which cannot make such predictions.