| Literature DB >> 18986514 |
Tiandi Wei1, Jing Gong, Ferdinand Jamitzky, Wolfgang M Heckl, Robert W Stark, Shaila C Rössle.
Abstract
BACKGROUND: Leucine-rich repeats (LRRs) are present in more than 6000 proteins. They are found in organisms ranging from viruses to eukaryotes and play an important role in protein-ligand interactions. To date, more than one hundred crystal structures of LRR containing proteins have been determined. This knowledge has increased our ability to use the crystal structures as templates to model LRR proteins with unknown structures. Since the individual three-dimensional LRR structures are not directly available from the established databases and since there are only a few detailed annotations for them, a conformational LRR database useful for homology modeling of LRR proteins is desirable. DESCRIPTION: We developed LRRML, a conformational database and an extensible markup language (XML) description of LRRs. The release 0.2 contains 1261 individual LRR structures, which were identified from 112 PDB structures and annotated manually. An XML structure was defined to exchange and store the LRRs. LRRML provides a source for homology modeling and structural analysis of LRR proteins. In order to demonstrate the capabilities of the database we modeled the mouse Toll-like receptor 3 (TLR3) by multiple templates homology modeling and compared the result with the crystal structure.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18986514 PMCID: PMC2645405 DOI: 10.1186/1472-6807-8-47
Source DB: PubMed Journal: BMC Struct Biol ISSN: 1472-6807
Consensus sequences of the eight canonical LRR classes [4,5].
| Classes | HCS | VS |
| Typical type (T) | LxxLxLxxNxL | xxLxxxxLxxLxx |
| Bacterial type (S) | LxxLxLxxNxL | xxLPx(x)LPxx |
| Ribonuclease inhibitor-like type (RI) | LxxLxLxxNxL | xxxxxxxLxxxLxxxxx |
| SDS22-like type (SDS22) | LxxLxLxxNxL | xxLxxLxxLxx |
| Cysteine-containing type (CC) | LxxLxLxxCxxL | TDxxxxxLxxxCxx |
| Plant-specific type (PS) | LxxLxLxxNxL | xxxLPxxLGxLxx |
| Treponema pallidum type (Tp) | LxxLxLPxxLxx | LxxxAFxxCxx |
| CD42b type (CD42b) | LxxLxLxxNxL | xxLPxxxxxxxxx |
L: Leu, Ile, Val, Phe; N: Asn, Thr, Ser, Cys; P: Pro; T: Thr; D: Asp; G: Gly; A: Ala; F: Phe; C: Cys; x: random residues.
Figure 1LRR entry length distribution. The most common entry lengths vary from 20 to 29. Each LRR class has a characteristic length distribution. Some entries have a sequence length larger than 29 due to insertions.
Numbers of LRR and PDB entries (release 0.2) in the nine LRR classes.
| T | S | RI | SDS22 | CC | PS | Tp | CD42b | Other | Total | |
| LRR structures | 272 | 72 | 151 | 372 | 184 | 10 | 0 | 0 | 200 | 1261 |
| LRR entries | 169 | 40 | 59 | 114 | 28 | 10 | 0 | 0 | 128 | 548 |
| PDB entries | 32 | 13 | 50 | 16 | 1 | 0 | 0 | - | 112 | |
Up to present, no crystal structures for LRR proteins of Tp/CD42b types are determined. Different from other LRR types, the S type and T type LRRs evolved from a common precursor [1] and thus can exist in the same PDB entry simultaneously.
Coverage of LRR proteins with PDB structures of different databases.
| Databases | Numbers of LRR proteins with PDB structures | Numbers of identified LRRs |
| InterPro | 62 | 325 |
| Swiss-Prot | 98 | 997 |
| Pfam | 48 | 173 |
| SMART | 84 | 547 |
| LRRML | 112 | 1261 |
The results were obtained on October 13, 2008.
Comparison of LRR numbers of different LRR proteins by different databases.
| PDB codes | Protein functions | LRR classes | InterPro | Swiss-Prot | Pfam | SMART | LRRML |
| Immune System | T | 18 | 22 | 7 | 20 | 25 | |
| Toxin | S | 7 | 15 | 1 | 0 | 15 | |
| Structural Protein | T+S | 8 | 8 | 5 | 9 | 12 | |
| Signaling Protein | RI | 2 | 8 | 0 | 0 | 11 | |
| Glycoprotein | SDS22 | 6 | 6 | 4 | 7 | 8 | |
| Signaling Protein | CC | 2 | 16 | 0 | 6 | 18 | |
| Inhibitor | PS | 7 | 10 | 2 | 0 | 10 |
All listed LRR numbers include N-/C-terminal LRRs. To date, only the LRRML database contains the complete set of LRRs of all LRR proteins with known structures. The results were obtained on October 13, 2008.
Figure 2The LRRML description of a LRR structure. This entry is a 24 residue long typical LRR. The first 11 residues compose its HCS and the last 13 residues compose its VS (no insertions). It is contained only in the chain A of PDB structure 2O6S (a protein involved in the immune system). It is the third one of the 7 LRRs of 2O6S, from position 77 to 100. Its secondary structure was extracted from DSSP and its three-dimensional coordinate file is available though the hyperlink on the corresponding web page.
Sequence identities (%) of target-template LRR pairs.
| NT | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | 20 | 21 | 22 | 23 | CT | Avg | |
| LRRML ID | 406 | 65 | 212 | 465 | 151 | 177 | 110 | 259 | 8 | 203 | 64 | 293 | 270 | 357 | 65 | 152 | 259 | 316 | 152 | 239 | 239 | 92 | 80 | 101 | 173 | --- |
| PDB source | --- | |||||||||||||||||||||||||
| Identity (%) | 47.60 | 45.83 | 45.83 | 41.67 | 50.00 | 50.00 | 46.15 | 41.67 | 41.38 | 42.31 | 50.00 | 50.00 | 33.33 | 41.67 | 42.86 | 50.00 | 40.00 | 37.50 | 38.46 | 50.00 | 45.83 | 50.00 | 41.67 | 40.00 | 39.29 | 44.12 |
In the header line, 1–25 denote canonical LRRs; NT and CT denote N-/C-terminal LRRs.
Figure 3Comparison of model and crystal structure of mouse TLR3 ectodomain at the two ligand interaction regions. Blue: structure obtained by homology modeling; orange: crystal structure (PDB code: 3CIG). (A) The modeled backbone structure of mouse TLR3 ectodomain. (B) Model and crystal structure superimposed at the N-terminal interaction region. The root mean square deviation is 1.96 Å. (C) Superimposition at the C-terminal interaction region. The root mean square deviation is 1.9 Å. The reported interacting residues are presented with side chain and labelled with residue name and position in (B) and (C).
Figure 4Ramachandran plot of model and crystal structure of mouse TLR3 ectodomain. (A) Predicted model of mouse TLR3 ectodomain. (B) Crystal structure of mouse TLR3 ectodomain. The different colored areas indicate 'disallowed' (white), 'generously allowed' (light yellow), 'additional allowed' (yellow), and 'most favored' (red) regions.