| Literature DB >> 24564280 |
Linyuan Guo, Cheng Luo, Shanfeng Zhu.
Abstract
BACKGROUND: Computational methods for the prediction of Major Histocompatibility Complex (MHC) class II binding peptides play an important role in facilitating the understanding of immune recognition and the process of epitope discovery. To develop an effective computational method, we need to consider two important characteristics of the problem: (1) the length of binding peptides is highly flexible; and (2) MHC molecules are extremely polymorphic and for the vast majority of them there are no sufficient training data.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24564280 PMCID: PMC3852073 DOI: 10.1186/1471-2164-14-S5-S11
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Five-fold cross validation performance of MHC2SK method compared to GS and SRBF methods on NielsenSet1. For each allele, we display the largest value in boldface.
| AUC | PCC | ||||||
|---|---|---|---|---|---|---|---|
| DRB1*01:01 | 1203 | 0.766 | 0.791 | 0.504 | 0.519 | ||
| DRB1*03:01 | 474 | 0.712 | 0.735 | 0.423 | 0.473 | ||
| DRB1*04:01 | 457 | 0.728 | 0.754 | 0.428 | 0.481 | ||
| DRB1*04:04 | 168 | 0.653 | 0.757 | 0.254 | 0.411 | ||
| DRB1*04:05 | 171 | 0.683 | 0.648 | 0.409 | 0.273 | ||
| DRB1*07:01 | 310 | 0.773 | 0.745 | 0.502 | 0.464 | ||
| DRB1*08:02 | 174 | 0.766 | 0.782 | 0.452 | 0.461 | ||
| DRB1*09:01 | 117 | 0.623 | 0.656 | 0.290 | 0.269 | ||
| DRB1*11:01 | 359 | 0.724 | 0.737 | 0.427 | 0.463 | ||
| DRB1*13:02 | 179 | 0.846 | 0.817 | 0.617 | 0.662 | ||
| DRB1*15:01 | 365 | 0.798 | 0.786 | 0.582 | 0.566 | ||
| DRB3*01:01 | 102 | 0.428 | 0.650 | -0.082 | 0.015 | ||
| DRB4*01:01 | 181 | 0.716 | 0.738 | 0.437 | 0.447 | ||
| DRB5*01:01 | 343 | 0.681 | 0.675 | 0.363 | 0.363 | ||
| average | 4603 | 0.718 | 0.727 | 0.419 | 0.411 | ||
LOO benchmark comparison of MHC2SKpan with four well-known pan-specific methods on NielsenSet2. MRTA, Tepan, Pan1.0, Pan2.0 and MKpan are the abbreviations for MultiRTA, TEPITOPEpan, MetaMHCIIpan-1.0, MetaMHCIIpan-2.0 and MHC2SKpan, respectively. For each allele, we display the largest value in boldface.
| AUC | PCC | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| DRB1*01:01 | 5166 | 0.801 | 0.726 | 0.778 | 0.794 | 0.619 | 0.447 | 0.571 | 0.627 | ||
| DRB1*03:01 | 1020 | 0.751 | 0.663 | 0.746 | 0.778 | 0.438 | 0.277 | 0.465 | 0.543 | ||
| DRB1*04:01 | 1024 | 0.763 | 0.724 | 0.775 | 0.801 | 0.534 | 0.423 | 0.591 | 0.652 | ||
| DRB1*04:04 | 663 | 0.835 | 0.783 | 0.852 | 0.862 | 0.623 | 0.504 | 0.693 | 0.714 | ||
| DRB1*04:05 | 630 | 0.808 | 0.760 | 0.808 | 0.823 | 0.566 | 0.456 | 0.594 | 0.626 | ||
| DRB1*07:01 | 853 | 0.817 | 0.759 | 0.825 | 0.886 | 0.620 | 0.499 | 0.655 | 0.753 | ||
| DRB1*08:02 | 420 | 0.786 | 0.773 | 0.841 | 0.851 | 0.523 | 0.452 | 0.637 | 0.679 | ||
| DRB1*09:01 | 530 | 0.674 | 0.615 | 0.653 | 0.674 | 0.380 | 0.259 | 0.406 | 0.471 | ||
| DRB1*11:01 | 950 | 0.819 | 0.726 | 0.799 | 0.875 | 0.603 | 0.450 | 0.580 | 0.721 | ||
| DRB1*13:02 | 498 | 0.661 | 0.658 | 0.648 | 0.639 | 0.326 | 0.323 | 0.337 | 0.341 | ||
| DRB1*15:01 | 934 | 0.729 | 0.694 | 0.738 | 0.763 | 0.513 | 0.437 | 0.533 | 0.597 | ||
| DRB3*01:01 | 549 | 0.675 | 0.716 | 0.733 | 0.70 | 0.332 | 0.449 | 0.474 | 0.423 | ||
| DRB4*01:01 | 446 | 0.746 | 0.694 | 0.724 | 0.762 | 0.508 | 0.370 | 0.448 | 0.515 | ||
| DRB5*01:01 | 924 | 0.788 | 0.680 | 0.831 | 0.879 | 0.543 | 0.421 | 0.627 | 0.722 | ||
| average | 14607 | 0.773 | 0.710 | 0.767 | 0.795 | 0.531 | 0.404 | 0.541 | 0.605 | ||
Five-fold cross validation comparison of MHC2SKpan and NetMHCIIpan-2.0 on NielsenSet3. For each allele, we display the largest value in boldface.
| AUC | PCC | ||||||
|---|---|---|---|---|---|---|---|
| DRB1*01:01 | 7685 | 0.731 | 0.845 | 0.433 | 0.702 | ||
| DRB1*03:01 | 2505 | 0.718 | 0.853 | 0.346 | 0.672 | ||
| DRB1*03:02 | 148 | 0.603 | 0.755 | 0.227 | 0.447 | ||
| DRB1*04:01 | 3116 | 0.765 | 0.840 | 0.438 | 0.647 | ||
| DRB1*04:04 | 577 | 0.758 | 0.816 | 0.496 | 0.622 | ||
| DRB1*04:05 | 1582 | 0.783 | 0.858 | 0.491 | 0.698 | ||
| DRB1*07:01 | 1745 | 0.781 | 0.864 | 0.533 | 0.740 | ||
| DRB1*08:02 | 1520 | 0.650 | 0.780 | 0.294 | 0.526 | ||
| DRB1*08:06 | 118 | 0.870 | 0.912 | 0.602 | 0.749 | ||
| DRB1*08:13 | 1370 | 0.747 | 0.885 | 0.337 | 0.746 | ||
| DRB1*08:19 | 116 | 0.714 | 0.808 | 0.537 | 0.608 | ||
| DRB1*09:01 | 1520 | 0.683 | 0.818 | 0.340 | 0.634 | ||
| DRB1*11:01 | 1794 | 0.797 | 0.877 | 0.514 | 0.764 | ||
| DRB1*12:01 | 117 | 0.831 | 0.876 | 0.627 | 0.754 | ||
| DRB1*12:02 | 117 | 0.843 | 0.898 | 0.640 | 0.762 | ||
| DRB1*13:02 | 1580 | 0.602 | 0.811 | 0.238 | 0.591 | ||
| DRB1*14:02 | 118 | 0.724 | 0.860 | 0.445 | 0.694 | ||
| DRB1*14:04 | 30 | 0.683 | 0.621 | 0.489 | 0.418 | ||
| DRB1*14:12 | 116 | 0.805 | 0.894 | 0.517 | 0.742 | ||
| DRB1*15:01 | 1769 | 0.739 | 0.819 | 0.465 | 0.653 | ||
| DRB3*01:01 | 1501 | 0.671 | 0.832 | 0.289 | 0.636 | ||
| DRB3*03:01 | 160 | 0.771 | 0.853 | 0.403 | 0.702 | ||
| DRB4*01:01 | 1521 | 0.685 | 0.837 | 0.351 | 0.675 | ||
| DRB5*01:01 | 3106 | 0.764 | 0.875 | 0.445 | 0.736 | ||
| average | 33931 | 0.738 | 0.843 | 0.437 | 0.669 | ||
The AUC performance comparison of MHC2SKpan with MutliRTA, TEPITOPEpan, NetMHCIIpan-1.0 and NetMHCIIpan-2.0 on EpanSet4. For each allele, we display the largest value in boldface. The last row is the average result by excluding two alleles in NielsenSet3, DRB1*03:02 and DRB1*12:01.
| allele | count | MultiRTA | TEPITOPEpan | NetMHCIIpan-1.0 | NetMHCIIpan-2.0 | MHC2SKpan |
|---|---|---|---|---|---|---|
| DRB1*01:02 | 92 | 0.749 | 0.758 | 0.746 | 0.752 | |
| DRB1*01:03 | 52 | 0.772 | 0.756 | 0.772 | 0.798 | |
| DRB1*03:02 | 88 | 0.733 | 0.823 | 0.775 | 0.761 | |
| DRB1*04:03 | 63 | 0.611 | 0.659 | 0.678 | 0.714 | |
| DRB1*04:06 | 92 | 0.519 | 0.501 | 0.486 | 0.489 | |
| DRB1*11:02 | 65 | 0.591 | 0.738 | 0.738 | 0.766 | |
| DRB1*11:03 | 64 | 0.585 | 0.726 | 0.623 | 0.785 | |
| DRB1*11:04 | 73 | 0.618 | 0.654 | 0.639 | 0.737 | |
| DRB1*12:01 | 719 | 0.673 | 0.659 | 0.721 | 0.740 | |
| DRB1*13:01 | 302 | 0.567 | 0.516 | 0.494 | 0.485 | |
| DRB1*14:01 | 43 | 0.785 | 0.761 | 0.676 | 0.721 | |
| DRB1*15:02 | 47 | 0.777 | 0.742 | 0.762 | 0.888 | |
| DRB1*16:01 | 56 | 0.789 | 0.644 | 0.793 | 0.814 | |
| DRB3*02:02 | 656 | 0.680 | 0.686 | 0.732 | 0.789 | |
| Average | 2412 | 0.677 | 0.712 | 0.701 | 0.732 | |
| Average* | 1605 | 0.672 | 0.707 | 0.693 | 0.722 | |
Figure 1The performance of MHC2SKpan under different setting of σ. The performance of MHC2SKpan on DRB1*03:02, DRB1*12:01, DRB1*13:01 and DRB3*02:02 in EpanSet4 under different settings of σ.