Qing Zhan, Yongtao Ye, Tak-Wah Lam, Siu-Ming Yiu, Yadong Wang, Hing-Fung Ting.
Abstract
Progressive sequence alignment is one of the most commonly used methods for multiple sequence alignment (MSA). Roughly speaking, the method first builds a guide tree and then aligns the sequences progressively according to the topology of the tree. Guide trees are believed to be very important to progressive alignment: a better guide tree gives an alignment with higher accuracy. Recently, we proposed an adaptive method for constructing guide trees. This paper studies the quality of the guide trees constructed by that method. Our study shows that the adaptive method can be used to improve the accuracy of many different progressive MSA tools. In fact, we give evidence that the guide trees constructed by the adaptive method are among the best.
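To make the role of the guide tree concrete, here is a minimal sketch of the two steps described in the abstract: build a tree from pairwise distances, then read off the order in which sequences (or groups of sequences) would be merged. It uses plain average-linkage (UPGMA) clustering, which is a generic textbook choice and not the adaptive method studied in the paper. The function names `upgma`, `leaves`, and `merge_order`, and the toy distances, are illustrative assumptions.

```python
from itertools import combinations


def upgma(names, dist):
    """Build a guide tree as nested tuples using average-linkage clustering.

    names: list of sequence labels (strings)
    dist:  dict mapping frozenset({a, b}) -> pairwise distance
    NOTE: generic UPGMA, not the paper's adaptive method.
    """
    clusters = {name: 1 for name in names}  # subtree -> number of leaves
    d = dict(dist)
    while len(clusters) > 1:
        # Pick the closest pair of current clusters.
        a, b = min(combinations(clusters, 2), key=lambda p: d[frozenset(p)])
        size_a, size_b = clusters.pop(a), clusters.pop(b)
        merged = (a, b)
        # Distance to the new cluster is the size-weighted average.
        for c in clusters:
            d[frozenset((merged, c))] = (
                size_a * d[frozenset((a, c))] + size_b * d[frozenset((b, c))]
            ) / (size_a + size_b)
        clusters[merged] = size_a + size_b
    return next(iter(clusters))


def leaves(tree):
    """Return the sequence labels under a subtree, left to right."""
    return [tree] if isinstance(tree, str) else leaves(tree[0]) + leaves(tree[1])


def merge_order(tree):
    """List the merge steps in post-order (children before parents)."""
    if isinstance(tree, str):
        return []
    left, right = tree
    return merge_order(left) + merge_order(right) + [(leaves(left), leaves(right))]


# Toy distances (hypothetical): A/B are close, C/D are close.
names = ["A", "B", "C", "D"]
dist = {frozenset(p): 10.0 for p in combinations(names, 2)}
dist[frozenset(("A", "B"))] = 2.0
dist[frozenset(("C", "D"))] = 4.0

tree = upgma(names, dist)
steps = merge_order(tree)
```

A progressive aligner would perform one pairwise (sequence or profile) alignment per entry in `steps`, so a different tree changes which groups are aligned first.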
Year: 2015 PMID: 25859903 PMCID: PMC4402577 DOI: 10.1186/1471-2105-16-S5-S4
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1. GLProbs vs GLProbs-Random on OXBench in terms of SP and TC scores.
Mean SP and TC scores on BAliBASE, OXBench and SABmark
| Benchmark | Aligner | SP | TC |
|---|---|---|---|
| BAliBASE | ClustalW | 69.578 | 49.121 |
| BAliBASE | MSAProbs | 82.370 | 67.274 |
| BAliBASE | Probalign | 82.991 | 67.691 |
| BAliBASE | ProbCons | 81.541 | 65.618 |
| BAliBASE | T-Coffee | 80.759 | 64.894 |
| OXBench | ClustalW | 89.446 | 80.189 |
| OXBench | MSAProbs | 90.062 | 81.696 |
| OXBench | Probalign | 89.966 | 81.642 |
| OXBench | ProbCons | 89.680 | 80.880 |
| OXBench | T-Coffee | 89.519 | 80.513 |
| SABmark | ClustalW | 51.957 | 31.495 |
| SABmark | MSAProbs | 60.245 | 39.946 |
| SABmark | Probalign | 59.532 | 38.626 |
| SABmark | ProbCons | 59.690 | 39.166 |
| SABmark | T-Coffee | 59.158 | 39.291 |
Average SP and TC scores of the alignments on the three benchmarks generated by the five aligners, using guide trees built by the adaptive method and by each aligner's own method. Values are the average sum-of-pairs (SP) and total column (TC) scores multiplied by 100. The best result in each pair is shown in bold.
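The SP and TC scores reported in these tables can be illustrated with a small sketch. It assumes a simplified reading of the two measures: SP is the fraction of residue pairs aligned in the reference that are also aligned in the test alignment, and TC is the fraction of reference columns reproduced exactly in the test alignment, both multiplied by 100. Benchmark scoring tools may restrict scoring to core regions or treat gapped columns differently. The function names and the toy alignments are hypothetical.

```python
from itertools import combinations


def columns(alignment):
    """Convert gapped rows into columns of residue indices (None marks a gap)."""
    counters = [0] * len(alignment)
    cols = []
    for chars in zip(*alignment):
        col = []
        for i, ch in enumerate(chars):
            if ch == "-":
                col.append(None)
            else:
                col.append(counters[i])
                counters[i] += 1
        cols.append(tuple(col))
    return cols


def aligned_pairs(cols):
    """Collect every pair of residues that share a column."""
    pairs = set()
    for col in cols:
        residues = [(i, r) for i, r in enumerate(col) if r is not None]
        pairs.update(combinations(residues, 2))
    return pairs


def sp_tc(test, reference):
    """Return (SP, TC) of `test` against `reference`, each scaled to 0-100."""
    ref_cols, test_cols = columns(reference), columns(test)
    ref_pairs = aligned_pairs(ref_cols)
    matched_pairs = len(ref_pairs & aligned_pairs(test_cols))
    test_col_set = set(test_cols)
    matched_cols = sum(col in test_col_set for col in ref_cols)
    return 100 * matched_pairs / len(ref_pairs), 100 * matched_cols / len(ref_cols)


# Toy alignments (hypothetical): the test misplaces one gap in the first row.
reference = ["AC-GT", "ACAGT", "A--GT"]
test = ["ACG-T", "ACAGT", "A--GT"]
sp, tc = sp_tc(test, reference)
```

In this toy case 8 of the 10 reference residue pairs and 3 of the 5 reference columns are recovered, which shows why TC is typically the harsher of the two measures.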
Mean SP and TC scores on OXBench
| Score | Subset | ClustalW | MSAProbs | Probalign | ProbCons | T-Coffee |
|---|---|---|---|---|---|---|
| SP | 0%-100% | 89.446 | 90.062 | 89.966 | 89.680 | 89.519 |
| SP | 0%-20% | 42.944 | 44.840 | 43.576 | 44.140 | 43.818 |
| SP | 20%-40% | 77.061 | 77.839 | 77.259 | 77.026 | 76.545 |
| SP | 40%-70% | 93.778 | 94.542 | 94.688 | 94.220 | 94.139 |
| SP | 70%-100% | 99.236 | 99.25 | 99.055 | | |
| TC | 0%-100% | 80.189 | 81.696 | 81.642 | 80.880 | 80.513 |
| TC | 0%-20% | 18.233 | 20.462 | 20.301 | 19.108 | |
| TC | 20%-40% | 57.255 | 59.817 | 59.107 | 58.277 | 57.938 |
| TC | 40%-70% | 86.363 | 87.933 | 88.204 | 87.307 | 86.941 |
| TC | 70%-100% | 97.913 | 97.972 | 97.402 | | |
Average SP and TC scores of the alignments on OXBench generated by the five aligners, using guide trees built by the adaptive method and by each aligner's own method. Rows show the average sum-of-pairs (SP) and total column (TC) scores multiplied by 100. The best result in each pair is shown in bold.
Mean SP and TC scores on BAliBASE
| Score | Subset | ClustalW | MSAProbs | Probalign | ProbCons | T-Coffee |
|---|---|---|---|---|---|---|
| SP | 0%-60% | 69.578 | 82.370 | 82.991 | 81.541 | 80.759 |
| SP | 0%-30% | 51.573 | 68.584 | 69.826 | 67.300 | 65.676 |
| SP | 30%-60% | 84.382 | 93.704 | 93.816 | 93.251 | 93.160 |
| TC | 0%-60% | 49.121 | 67.274 | 67.691 | 65.618 | 64.894 |
| TC | 0%-30% | 26.014 | 46.143 | 47.178 | 43.332 | 42.346 |
| TC | 30%-60% | 68.120 | 84.636 | 84.551 | 83.942 | 83.433 |
Average SP and TC scores of the alignments on BAliBASE3 generated by the five aligners, using guide trees built by the adaptive method and by each aligner's own method. Rows show the average sum-of-pairs (SP) and total column (TC) scores multiplied by 100. The best result in each pair is shown in bold.
Mean SP and TC scores on SABmark
| Score | Subset | ClustalW | MSAProbs | Probalign | ProbCons | T-Coffee |
|---|---|---|---|---|---|---|
| SP | 0%-60% | 51.957 | 60.245 | 59.532 | 59.690 | 59.158 |
| SP | 0%-30% | 43.907 | 51.996 | 51.161 | 51.459 | 50.799 |
| SP | 30%-60% | 81.739 | 90.616 | 90.507 | 90.146 | 89.388 |
| TC | 0%-60% | 31.495 | 39.946 | 38.626 | 39.166 | 39.291 |
| TC | 0%-30% | 22.020 | 29.043 | 27.702 | 28.198 | 28.518 |
| TC | 30%-60% | 66.551 | 79.954 | 79.044 | 79.687 | 79.151 |
Average SP and TC scores of the alignments on SABmark generated by the five aligners, using guide trees built by the adaptive method and by each aligner's own method. Rows show the average sum-of-pairs (SP) and total column (TC) scores multiplied by 100. The best result in each pair is shown in bold.
Figure 2. GLProbs vs GLProbs-Reference on BAliBASE, OXBench, and SABmark. Dots above the diagonal represent cases where GLProbs outperformed GLProbs-Reference.