| Literature DB >> 21647378 |
Assaf Gottlieb1, Milana Frenkel-Morgenstern, Mark Safro, David Horn.
Abstract
BACKGROUND: Aminoacyl tRNA synthetases (aaRSs) constitute an essential enzyme super-family, providing fidelity of the translation process of mRNA to proteins in living cells. They are common to all kingdoms and are of utmost importance to all organisms. It is thus of great interest to understand the evolutionary relationships among them and underline signature motifs defining their common domains.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21647378 PMCID: PMC3103580 DOI: 10.1371/journal.pone.0020361
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Properties of aaRS, ordered by class.
| EC | Name | Class | # of sequences | # of MEX CPs | # of observed CPs | # of specific CPs |
| 6.1.1.1 | Tyrosyl tRNA synthetase (TyrRS) | I | 261 | 400 | 758 | 239 |
| 6.1.1.2 | Tryptophanyl tRNA synthetase (TrpRS) | I | 121 | 163 | 323 | 102 |
| 6.1.1.4 | Leucyl tRNA synthetase (LeuRS) | I | 344 | 1031 | 1730 | 591 |
| 6.1.1.5 | Isoleucyl tRNA synthetase (IleRS) | I | 271 | 871 | 1608 | 568 |
| 6.1.1.9 | Valyl tRNA synthetase (ValRS) | I | 211 | 641 | 1293 | 378 |
| 6.1.1.10 | Methionyl tRNA synthetase (MetRS) | I | 248 | 634 | 1121 | 386 |
| 6.1.1.16 | Cysteinyl tRNA synthetase (CysRS) | I | 362 | 505 | 998 | 301 |
| 6.1.1.17 | Glutamyl tRNA synthetase (GluRS) | I | 373 | 645 | 1237 | 407 |
| 6.1.1.18 | Glutaminyl tRNA synthetase (GlnRS) | I | 37 | 96 | 178 | 50 |
| 6.1.1.19 | Arginyl tRNA synthetase (ArgRS) | I | 327 | 677 | 1275 | 421 |
| 6.1.1.3 | Threonyl tRNA synthetase (ThrRS) | II | 279 | 671 | 1128 | 431 |
| 6.1.1.6 | Lysyl tRNA synthetase (LysRS) | II (I) | 192 | 340 | 651 | 175 |
| 6.1.1.7 | Alanyl tRNA synthetase (AlaRS) | II | 193 | 506 | 1019 | 314 |
| 6.1.1.11 | Seryl tRNA synthetase (SerRS) | II | 345 | 489 | 874 | 264 |
| 6.1.1.12 | Aspartatyl tRNA synthetase (AspRS) | II | 294 | 586 | 992 | 341 |
| 6.1.1.14 | Glycyl tRNA synthetase (GlyRS) | II | 226 | 432 | 773 | 265 |
| 6.1.1.15 | Prolyl tRNA synthetase (ProRS) | II | 369 | 792 | 1313 | 475 |
| 6.1.1.20 | Phenylalanyl tRNA synthetase (PheRS) | II | 495 | 682 | 1576 | 418 |
| 6.1.1.21 | Histidyl tRNA synthetase (HisRS) | II | 312 | 393 | 838 | 246 |
| 6.1.1.22 | Asparaginyl tRNA synthetase (AsnRS) | II | 124 | 213 | 402 | 130 |
| 6.1.1.n2 | O-Phosphoseryl-tRNA synthetase (SepRS) | II | 17 | 31 | 87 | 20 |
| 6.1.1.26 | Pyrrolysyl tRNA synthetase (PylRS) | II | 5 | 16 | 20 | 11 |
*Some of the LysRS sequences are class I.
Top 10 most frequent CPs.
| CP | sequence occurrences (percentage) | # of aaRSs | Bacteria (percentage) | Eukarya (percentage) | Archaea (percentage) | mitochondria (percentage) |
| KMSKS | 1364 (25%) | 9 | 1196 (26.4%) | 24 (14.5%) | 102 (17.1%) | 42 (39.6%) |
| KSLGN | 502 (9%) | 9 | 440 (9.7%) | 5 (3.0%) | 40 (6.7%) | 17 (16.0%) |
| ISRQR | 345 (6%) | 3 | 296 (6.5%) | 0 (0.0%) | 36 (6.0%) | 13 (12.3%) |
| GRPGWH | 333 (6%) | 1 | 297 (6.5%) | 6 (3.6%) | 24 (4.0%) | 6 (5.7%) |
| PSPTG | 329 (6%) | 2 | 319 (7.0%) | 3 (1.8%) | 0 (0.0%) | 7 (6.6%) |
| FPHHE | 327 (6%) | 1 | 294 (6.5%) | 0 (0.0%) | 28 (4.7%) | 5 (4.7%) |
| PYANG | 318 (6%) | 2 | 295 (6.5%) | 0 (0.0%) | 18 (3.0%) | 5 (4.7%) |
| RQRYWG | 310 (6%) | 2 | 283 (6.2%) | 0 (0.0%) | 22 (3.7%) | 5 (4.7%) |
| SKSKG | 299 (6%) | 8 | 266 (5.9%) | 1 (0.6%) | 32 (5.4%) | 0 (0.0%) |
| PYPSG | 294 (5%) | 2 | 284 (6.3%) | 0 (0.0%) | 4 (0.7%) | 6 (5.7%) |
Figure 1Relative abundance of CPs in class I and class II synthetases.
Venn diagram showing relative abundance of CPs in class I and class II synthetases.
Novel class-specific CPs.
| CP | # of class I aaRSs | # of class I occurrences | # of class II aaRSs | # of class II occurrences |
| TADEI | 8 | 47 | 1 | 1 |
| ALADE | 8 | 37 | 1 | 2 |
| KSLGN | 7 | 500 | 2 | 2 |
| SKSKG | 7 | 299 | 1 | 0 |
| SKGNV | 7 | 182 | 1 | 0 |
| DVIAR | 7 | 73 | 0 | 0 |
| DVVAR | 7 | 60 | 1 | 2 |
| ADAIR | 7 | 39 | 1 | 0 |
| GLDLL | 7 | 35 | 1 | 1 |
| GVERL | 0 | 0 | 8 | 92 |
| DLVEE | 1 | 3 | 7 | 66 |
| GLDRI | 1 | 1 | 7 | 43 |
| AEAVL | 1 | 2 | 7 | 24 |
| ERISA | 0 | 0 | 7 | 24 |
| LRLAE | 0 | 0 | 6 | 38 |
| AAGVR | 2 | 2 | 6 | 47 |
*Overlap a binding site.
**less than four residues apart from a binding site.
†GVERL may be part of more general class II motifs [3], [22].
CP frequency in class I aaRS families predicted to share the same origin.
| CP | LeuRS | IleRS | ValRS | MetRS | sum |
| GNVISP | 2 | 11 | 2 | 1 | 16 |
| AEELW | 59 | 1 | 3 | 1 | 64 |
| GKNVL | 44 | 4 | 11 | 2 | 61 |
| HRMRG | 3 | 1 | 15 | 1 | 20 |
| KRMQG | 42 | 2 | 31 | 3 | 78 |
| LYNKG | 5 | 4 | 14 | 2 | 25 |
| NTVDP | 24 | 1 | 3 | 2 | 30 |
| NVVDP | 45 | 32 | 4 | 24 | 105 |
| RMQGY | 40 | 3 | 20 | 4 | 67 |
| RYHRM | 9 | 4 | 25 | 2 | 40 |
| RYKRM | 24 | 3 | 27 | 4 | 58 |
Figure 2BLAST similarities between different aaRSs (log-scaled).
E-values>0.01 are in blue.
CP Statistics for each kingdom of life (Mitochondria separated from the Eukaryotes).
| Kingdom | # of proteins | Percentage of CPs | Specific CPs out of all observed in kingdom |
| Bacteria | 4538 | 94.7% | 62.8% |
| Eukarya | 166 | 13.1% | 3.7% |
| Archaea | 596 | 28.9% | 13.5% |
| Mitochondria | 106 | 11.1% | 1.3% |
Figure 3Distribution of different aaRS families across kingdoms according to the CP origin.
Distribution of different aaRS families according to the CP origins: appearing in all 3 kingdoms, only in Eukarya, only in Archaea, Archaea and Eukarya together, and only in Bacteria.
CPs specific to the Mitochondria.
| CP | # of enzymes appears in |
| TTPIFYVN | 9 |
| SLESGH | 7 |
| VHSHW | 7 |
| ELADALGGLLNRCTA | 5 |
| QWGNYFLH | 5 |
| STWELLD | 5 |
| KIQQAA | 5 |
| CVRQTNGFVQRHAPWKL | 4 |
| ITNCGSGF | 4 |
| YKALEAVS | 4 |
| GTLLQPV | 4 |
| KLPEFNR | 4 |
| AVQHFW | 4 |
| QQQQQ | 4 |
| VLQWL | 4 |
Frequent CPs common to aaRSs and birA.
| CP (alternatives) | Structural properties | # of birA occurrences | # of aaRSs occurrences | # of class I occurrences | # of class II occurrences |
| GILIE(GILVEGILTE) | Biotin binding site in birA,ThrAMS binding site in ThrRS | 198 | 2 | 2 | 113 |
| GALRL(GALLL) | Binding site for 4AD in AspRS | 32 | 1 | 0 | 42 |
| GEALG(GETLG) | Helix-turn-Helix in birA | 22 | 4 | 1 | 31 |
| LRAAL | α-helix in birA | 86 | 13 | 43 | 7 |
*Alternatives in brackets mark one-mutation-distant CPs that were selected on birA but not in aaRSs.
4AD = 4-Amino-1,4-Dioxobutan-2-Aminium Adenosine- 5′-Monophosphate.