David A Coil, Liesbeth Vandersmissen, Christophe Ginevra, Sophie Jarraud, Elke Lammertyn, Jozef Anné.
Abstract
BACKGROUND: Bacterial genomes harbour a large number of tandem repeats, yet the possible phenotypic effects of those found within the coding region of genes are only beginning to be examined. Evidence exists from other organisms that these repeats can be involved in the evolution of new genes, gene regulation, adaptation, resistance to environmental stresses, and avoidance of the immune system.
Year: 2008 PMID: 19077205 PMCID: PMC2639597 DOI: 10.1186/1471-2180-8-218
Source DB: PubMed Journal: BMC Microbiol ISSN: 1471-2180 Impact factor: 3.605
Genes containing intragenic tandem repeat arrays in the Philadelphia strain
| Gene | Length of repeat (bp) | Copy number | Identity† (%) | Genome annotation‡ |
| LPG0451* | 30 | 18 | 75.6 | IcmE (DotG) |
| LPG0451* | 30 | 7 | 72.9 | IcmE (DotG) |
| LPG0688 | 9 | 5 | 88.9 | Hsp60, 60 K |
| LPG1035 | 102 | 3 | 93.8 | |
| LPG1038 | 12 | 4 | 89.6 | Vrrb |
| LPG1062 | 108 | 8 | 71.1 | TPR repeat protein |
| LPG1172* | 108 | 8 | 69.9 | TPR repeat protein |
| LPG1172* | 108 | 4 | 75.0 | TPR repeat protein |
| LPG1299 (Lpms35)§ | 18 | 3 | 87.0 | Transmembrane Tfp pilus assembly protein FimV |
| LPG1356* | 108 | 4 | 72.2 | TPR repeat protein |
| LPG1356* | 108 | 3 | 74.4 | TPR repeat protein |
| LPG1421 | 261 | 3 | 71.4 | 30S Ribosomal protein S1 |
| LPG1555 | 21 | 2 | 100 | Arginine 3rd transport system periplasmic binding protein ArtJ |
| LPG1602 | 90 | 10 | 65.1 | FLJ00180 protein |
| LPG1948 | 90 | 7 | 75.6 | FLJ00180 protein |
| LPG1958 | 87 | 13 | 74.3 | FLJ00180 protein |
| LPG1976 | 171 | 3 | 73.7 | |
| LPG2222 | 108 | 6 | 70.7 | TPR repeat protein, protein-protein interaction |
| LPG2224 | 171 | 5 | 73.5 | UVB-resistance protein UVR8 |
| LPG2392 | 87 | 6 | 72.2 | Leucine rich repeat protein family |
| LPG2416 | 105 | 3 | 81.9 | Ankyrin repeat family protein |
| LPG2485 | 102 | 9 | 58.5 | TPR domain protein |
| LPG2559 | 12 | 4 | 91.7 | ATP-dependent DNA helicase RecG |
| LPG2639 | 108 | 5 | 65.9 | |
| LPG2644 (Lpms31)§ | 45 | 19 | 85.4 | Tail fiber protein SclB (collagen-like) |
| LPG2793 (Lpms3)§ | 96 | 7 | 74.0 | LepA, interaptin |
*Genes containing two distinct repeat arrays. †Percentage identity between repeats. ‡All gene annotations are from the published sequence of Legionella pneumophila, Philadelphia strain (GenBank accession no. AE017354). §Annotation used in MLVA typing scheme proposed by [17]. IcmE: intracellular multiplication protein E, DotG: defect in organelle trafficking protein G, Vrrb: variable region with repetitive sequence B, TPR: tetratricopeptide repeat, FimV: fimbriae protein V, Tfp: type IV pili, SclB: Streptococcus pyogenes collagen-like protein B, LepA: Legionella effector protein A.
Subcellular localization of proteins containing tandem repeats
| Localization | All proteins | Proteins containing repeats (n = 23) | p-value |
| Cytoplasmic | 31.1% (n = 915) | 21.7% (n = 5) | 0.33 |
| Inner Membrane | 18.2% (n = 534) | 0% (n = 0) | |
| Extracellular | 0.8% (n = 22) | 21.7% (n = 5) | |
| Outer Membrane | 1.8% (n = 53) | 4.3% (n = 1) | 0.359 |
| Periplasm | 1.2% (n = 36) | 8.7% (n = 2) | |
| Unknown | 46.9% (n = 1381) | 43.4% (n = 10) | 0.738 |
Comparison of average tandem repeat copy number between strain types
| Environmental (E) | 2.78 | 14.52 | 1.02 | 4.40 | 2.04 | 11.80 | 6.19 | |
| Clinical (C) | 3.5 | 17.78 | 1.10 | 4.71 | 2.28 | 12.70 | 6.45 | |
| Hot springs (H) | 3.00 | 9.65 | 1.00 | 5.00 | 2.41 | 14.12 | 7.00 | |
| E versus C | 0.13 | 0.34 | 0.15 | |||||
| E versus H | 0.096 | 0.32 | 0.070 | |||||
| C versus H | 0.17 | 0.36 | ||||||
*Values in bold were considered significant (p < .05). The mean of each population (Environmental, Clinical, Hot Springs) was compared using a two-tailed heteroscedastic t-test.
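A two-tailed heteroscedastic t-test is commonly known as Welch's unequal-variance t-test. The sketch below shows how its test statistic and Welch-Satterthwaite degrees of freedom can be computed. It is an illustration only, not the authors' analysis code: the function name `welch_t` and the sample copy numbers are hypothetical and are not taken from the study.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(a, b):
    """Return (t, df) for Welch's unequal-variance t-test on two samples."""
    na, nb = len(a), len(b)
    # Squared standard error of each sample mean (sample variance / n).
    se2_a = variance(a) / na
    se2_b = variance(b) / nb
    # Test statistic: difference in means over the combined standard error.
    t = (mean(a) - mean(b)) / sqrt(se2_a + se2_b)
    # Welch-Satterthwaite approximation of the degrees of freedom.
    df = (se2_a + se2_b) ** 2 / (se2_a ** 2 / (na - 1) + se2_b ** 2 / (nb - 1))
    return t, df


# Hypothetical repeat copy numbers per isolate, NOT data from the study.
environmental = [2, 3, 3, 2, 4, 3]
clinical = [3, 4, 4, 3, 5, 3]

t, df = welch_t(environmental, clinical)
print(f"t = {t:.3f}, df = {df:.2f}")
```

The two-tailed p-value follows from Student's t distribution with `df` degrees of freedom. If SciPy is available, `scipy.stats.ttest_ind(a, b, equal_var=False)` performs the same test and returns the p-value directly.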