| Literature DB >> 29746699 |
Layla Hirsh1,2, Lisanna Paladin1, Damiano Piovesan1, Silvio C E Tosatto1,3.
Abstract
RepeatsDB-lite (http://protein.bio.unipd.it/repeatsdb-lite) is a web server for the prediction of repetitive structural elements and units in tandem repeat (TR) proteins. TRs are a widespread but poorly annotated class of non-globular proteins carrying heterogeneous functions. RepeatsDB-lite extends the prediction to all TR types and strongly improves the performance both in terms of computational time and accuracy over previous methods, with precision above 95% for solenoid structures. The algorithm exploits an improved TR unit library derived from the RepeatsDB database to perform an iterative structural search and assignment. The web interface provides tools for analyzing the evolutionary relationships between units and manually refine the prediction by changing unit positions and protein classification. An all-against-all structure-based sequence similarity matrix is calculated and visualized in real-time for every user edit. Reviewed predictions can be submitted to RepeatsDB for review and inclusion.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29746699 PMCID: PMC6031040 DOI: 10.1093/nar/gky360
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Validation rules for RepeatsDB-lite predictions. The criteria used to include structural alignments in RepeatsDB-lite predictions are shown per repeat class. Coverage is the fraction of residues covered compared to the reference structure. RMSD is the root mean square deviation. The TM-score method is used to calculate the RMSD and TM-score values.
| Alignment type | Class | Min coverage | Max RMSD | Min TM-score |
|---|---|---|---|---|
|
| III—Elongated | 0.8 | 2.3 | 0.35 |
| IV—Toroid | 0.6 | 2.8 | 0.4 | |
| V—Beads on a string | 0.6 | 3.5 | 0.4 | |
|
| III—Elongated | — | 2 | 0.25 |
| IV—Toroid | — | 4 | 0.28 | |
| V—Beads on a string | — | 4 | 0.28 |
Figure 1.RepeatsDB-lite pages. (A) Result page. The header provides summarizing information about the job (PDB code: 3vbn). The tabs below allow the navigation between chain predictions. Each chain tab shows some general information about the chain and a specific card for each region. Download buttons allow the retrieval of text file results, while sequence, structure and alignment viewer guide data visualization. The unit sequence similarity matrix shows the relationship between units in the region. The orange button redirects to the form for annotation editing. (B) Annotation editing page. The user can modify region classification and the unit start end position. Changes are reflected in the viewers on the right. Reviewed annotation can be submitted directly to RepeatsDB maintainers to be included in the database. (C) Results after resubmission. By saving the repeat annotation edits, the user is redirected to RepeatsDB output page where he is provided a detailed visualization of new results to evaluate the annotation quality.
Comparison with other methods. The regions column corresponds to the number of evaluated TR regions, i.e. for which a predictor provides an output, including fully negative predictions (zero units). Sensitivity (Sn), specificity (Sp), precision (Pr), balanced accuracy (Acc) and F-measure (F) values are in the range [0, 1]. Best values are in bold.
| Method | Regions | Sn | Sp | Pr | Acc |
| ||
|---|---|---|---|---|---|---|---|---|
| III.1 | β-solenoid | TAPO | 31 | 0.546 | 0.802 | 0.851 | 0.674 | 0.665 |
| ConSole | 31 | 0.510 | 0.811 | 0.848 | 0.661 | 0.637 | ||
| RepeatsDB-lite 40 | 31 | 0.398 | 0.959 | 0.952 | 0.678 | 0.561 | ||
| RepeatsDB-lite 60 | 31 | 0.543 | 0.962 | 0.967 | 0.752 | 0.695 | ||
| RepeatsDB-lite 80 | 31 | 0.560 | 0.912 | 0.929 | 0.736 | 0.699 | ||
| RepeatsDB-lite | 31 |
|
|
|
|
| ||
| III.2 | α/β solenoid | TAPO | 18 | 0.692 | 0.699 | 0.925 | 0.696 | 0.792 |
| ConSole | 18 | 0.644 | 0.834 | 0.954 | 0.739 | 0.769 | ||
| RepeatsDB-lite 40 | 18 | 0.558 | 0.912 | 0.967 | 0.735 | 0.707 | ||
| RepeatsDB-lite 60 | 18 | 0.790 | 0.851 | 0.961 | 0.820 | 0.867 | ||
| RepeatsDB-lite 80 | 18 | 0.788 | 0.847 | 0.960 | 0.818 | 0.866 | ||
| RepeatsDB-lite | 18 |
|
|
|
|
| ||
| III.3 | α-solenoid | TAPO | 38 | 0.665 | 0.577 | 0.916 | 0.621 | 0.771 |
| ConSole | 38 | 0.552 |
|
| 0.686 | 0.700 | ||
| RepeatsDB-lite 40 | 38 | 0.747 | 0.561 | 0.914 | 0.654 | 0.822 | ||
| RepeatsDB-lite 60 | 38 | 0.859 | 0.595 | 0.930 | 0.727 | 0.893 | ||
| RepeatsDB-lite 80 | 38 | 0.839 | 0.668 | 0.946 | 0.754 | 0.889 | ||
| RepeatsDB-lite | 38 |
| 0.684 | 0.951 |
|
| ||
| III | TAPO | 87 | 0.630 | 0.747 | 0.898 | 0.688 | 0.740 | |
| ConSole | 87 | 0.556 | 0.823 | 0.917 | 0.690 | 0.693 | ||
| RepeatsDB-lite 40 | 87 | 0.589 | 0.849 | 0.932 | 0.719 | 0.722 | ||
| RepeatsDB-lite 60 | 87 | 0.737 | 0.850 | 0.945 | 0.793 | 0.828 | ||
| RepeatsDB-lite 80 | 87 | 0.733 | 0.844 | 0.944 | 0.788 | 0.825 | ||
| RepeatsDB-lite | 87 |
|
|
|
|
|
RepeatsDB-lite performance against RepeatsDB reviewed entries. Columns headers have the same meaning of Table 2.
| Classification | Regions | Sn | Sp | Pr | Acc |
| |
|---|---|---|---|---|---|---|---|
| II.1 | Collagen triple-helix | 3 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 |
| II.2 | α helical coiled coil | 9 | 0.594 | 0.865 | 0.875 | 0.730 | 0.708 |
| II | Fibrous repeats | 12 | 0.545 | 0.865 | 0.875 | 0.705 | 0.672 |
| III.1 | β-Solenoid | 325 | 0.561 | 0.926 | 0.911 | 0.743 | 0.694 |
| III.2 | α/β solenoid | 350 | 0.797 | 0.907 | 0.984 | 0.852 | 0.881 |
| III.3 | α-Solenoid | 888 | 0.784 | 0.628 | 0.943 | 0.706 | 0.856 |
| III.4 | β trefoil / β hairpins | 76 | 0.583 | 0.960 | 0.968 | 0.772 | 0.728 |
| III.5 | Anti-parallel β layer / β hairpins | 63 | 0.642 | 0.723 | 0.869 | 0.683 | 0.739 |
| III | Elongated repeats | 1702 | 0.750 | 0.819 | 0.948 | 0.785 | 0.838 |
| IV.1 | TIM-barrel | 538 | 0.669 | 0.731 | 0.932 | 0.700 | 0.778 |
| IV.2 | β-Barrel / β hairpins | 77 | 0.682 | 0.863 | 0.970 | 0.772 | 0.801 |
| IV.3 | β-Trefoil | 24 | 0.449 | 0.754 | 0.731 | 0.602 | 0.556 |
| IV.4 | β-propeller | 849 | 0.677 | 0.845 | 0.968 | 0.761 | 0.797 |
| IV.5 | α/β prism | 185 | 0.782 | 0.956 | 0.997 | 0.869 | 0.876 |
| IV.6 | α-Barrel | 18 | 0.419 | 0.931 | 0.917 | 0.675 | 0.576 |
| IV.7 | α/β barrel | 5 | 0.986 | 0.000 | 0.995 | 0.493 | 0.991 |
| IV.8 | α/β propeller | 117 | 0.591 | 0.836 | 0.933 | 0.713 | 0.723 |
| IV.9 | α/β trefoil | 70 | 0.836 | 0.929 | 0.973 | 0.883 | 0.899 |
| IV.10 | Aligned prism | 45 | 0.856 | 0.978 | 0.998 | 0.917 | 0.921 |
| IV | Closed repeats | 1928 | 0.685 | 0.826 | 0.961 | 0.755 | 0.800 |
| V.1 | α-Beads | 13 | 0.758 | 0.652 | 0.980 | 0.705 | 0.855 |
| V.2 | β-Beads | 42 | 0.813 | 0.779 | 0.975 | 0.796 | 0.887 |
| V.3 | α/β-beads | 14 | 0.296 | 0.864 | 0.990 | 0.580 | 0.456 |
| V.4 | β sandwich beads | 37 | 0.429 | 0.850 | 0.984 | 0.639 | 0.597 |
| V.5 | α/β sandwich beads | 48 | 0.452 | 0.698 | 0.969 | 0.575 | 0.616 |
| V | Beads on a string | 154 | 0.537 | 0.759 | 0.975 | 0.648 | 0.692 |
| All | 3796 | 0.706 | 0.821 | 0.956 | 0.764 | 0.812 | |