| Literature DB >> 19622168 |
Maribel Farfán1, David Miñana-Galbis, M Carmen Fusté, J Gaspar Lorén.
Abstract
BACKGROUND: The bacterial flagellum is the most important organelle of motility in bacteria and plays a key role in many bacterial lifestyles, including virulence. The flagellum also provides a paradigm of how hierarchical gene regulation, intricate protein-protein interactions and controlled protein secretion can result in the assembly of a complex multi-protein structure tightly orchestrated in time and space. As if to stress its importance, plants and animals produce receptors specifically dedicated to the recognition of flagella. Aside from motility, the flagellum also moonlights as an adhesion and has been adapted by humans as a tool for peptide display. Flagellar sequence variation constitutes a marker with widespread potential uses for studies of population genetics and phylogeny of bacterial species.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19622168 PMCID: PMC2724415 DOI: 10.1186/1745-6150-4-23
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Origin of the Aeromonas strains used and characteristics of their flaA sequences
| Species | Reference strain | Source/Country | G+C content (% mol) | GenBank accession n°. | |
| CECT 4199T | Diseased eel/Spain | 870 | 50.7 | ||
| CECT 4227T | Fish | 912 | 55.0 | ||
| CECT 7113T | Cockle/Spain | 918 | 58.3 | ||
| CECT 838T | Guinea pig | 906 | 57.0 | ||
| CIP 107763T | Mosquito/India | 915 | 50.9 | ||
| CECT 4342T | Healthy eel/Spain | 900 | 56.1 | ||
| CECT 4224T | Freshwater fish | 900 | 57.7 | ||
| CECT 4221T | Used oil-emulsion | 921 | 55.8 | ||
| CECT 5744T | Human faeces/Bangladesh | 912 | 57.8 | ||
| CECT 839T | Milk | 909 | 50.7 | ||
| CIP 107985T | Frog/Thailand | 912 | 56.2 | ||
| CECT 4228T | Human faeces/USA | 915 | 49.2 | ||
| CECT 5864T | Wedge-shell/Spain | 906 | 57.6 | ||
| LMG 17541T | Drinking water/Belgium | 921 | 52.0 | ||
| CECT 894T | Atlantic salmon/UK | 918 | 52.4 | ||
| CECT 5752T | River water/Argentina | 912 | 54.8 | ||
| CECT 4245T | Fish | 903 | 51.1 | ||
| CECT 4246T | Diseased frog | 915 | 50.9 |
Acronims: CECT, Spanish Type Culture Collection; CIP, Collection de l'Institut Pasteur; LMG, Belgian co-ordinated collections of microorganisms.
Abbreviations: T, type strain; bp, base pairs; % mol, mole percent of G+C.
Figure 1Saturation plot of . Number of transitions (circles) and transversions (triangles) from pairwise comparisons of 18 Aeromonas species and subspecies sequences plotted against their maximum likelihood distances calculated under the GTR + I + G substitution model (t).
Figure 2Plot of amino acid substitutions against transitions and transversions for . Number of amino acid substitutions plotted against transversions (triangles) and transitions (circles) from pairwise comparisons of sequences of 18 Aeromonas species and subspecies.
Figure 3The . Bootstrap values (1,000 replicates) are indicated in the nodes. Estimates of ω ratios, obtained under the "free-ratios" model, which assumes an independent ω ratio (dN/dS) for each branch in the tree, are shown in bold along branches. Ancestral sequences were reconstructed from Bayes Empirical Bayes analysis under a model that assumed a background ω ratio for branches of Group 1 and 2 (background branches) and another ω ratio for the branch connecting the two groups (foreground branch). Changes in the ancestral amino acids are indicated in the table (lower right position in the figure). Shadowed amino acid changes identify positions under positive selection, inferred from the M8 model (Table 6).
Figure 4Pairwise comparisons of . Plot of maximum likelihood estimates of nonsynonymous (dN) against synonymous (dS) substitution rates (Fig. 4A), the straight line represents the neutral expectation (dN = dS). The ω ratio (dN/dS) plotted against the maximum likelihood estimates of sequence divergence (t), defined as the expected number of nucleotide substitutions per codon (Fig. 4B).
Log-likelihood, AIC and parameter estimates under random-site models for flaA sequences
| Model | ωb | ℓc | AICd | κe | Estimates of parameters | Positively selected sites | |
| M0: one ratio | 1 | 0.062 ± 0.000 | -6830.350 | 13662.700 | 2.093 | ω = 0.062 | None |
| M1a: nearly neutral | 2 | 0.114 ± 0.015 | -6689.544 | 13383.089 | 2.192 | Not allowed | |
| M2a: positive selection | 4 | 0.114 ± 0.015 | -6689.544 | 13387.092 | 2.192 | 179 I (at | |
| M3: discrete (K = 3) | 5 | 0.071 ± 0.008 | -6601.883 | 13213.766 | 2.085 | None | |
| M7: beta | 2 | 0.071 ± 0.009 | -6603.940 | 13211.879 | 2.087 | Not allowed | |
| M8: beta & ω > 1 | 4 | 0.072 ± 0.008 | -6600.818 | 13209.635 | 2.082 | ||
| M8a: beta & ωs = 1 | 4 | 0.072 ± 0.008 | -6601.490 | 13210.978 | 2.086 | Not allowed | |
a p, number of free parameters for each model
b ω, averaged ω ± standard error of dN/dS ratio over all sites in the flaA gene alignment
c ℓ, log-likelihood value for each model
d AIC, Akaike's Information Criterion value
e κ, transition/transversion rate [27]
Values of ω > 1 and sites inferred to be under positive selection are in bold
Likelihood Ratio Test statistics (LRT) for random-site models
| Model 1a | Model 2b | 2Δℓc | dfd | |
| M3 | M0 | 456.93 | 4 | 1.4 10-97 *** |
| M1a | M2a | 0 | 2 | 1 |
| M8 | M7 | 6.24 | 2 | 0.044 * |
| M8a | M8 | 1.34 | 1 | 0.246 |
a Alternative model; b null model; c 2Δℓ = 2(ℓ1 - ℓ0), ddegrees of freedom (see Table 2).
*** significant at P < 0.001
* significant at P < 0.05
Figure 5Distribution of the posterior means of the ω ratio. Posterior means of ω ratio under the M8 (beta & ω) model. In this model 10 equal-probability categories are used to approximate the beta distribution [27], giving 11 categories. The ordinate represents the average of ω over the 11 site classes, weighted by the posterior probabilities [30].
Log-likelihood values and parameter estimates under fixed-site models for flaA sequences
| Modela | npa | ℓ | AIC | κb | ωb | |
| A (homogeneous model) | 44 | -6830.350 | 13748.700 | 1 | 2.093 | 0.062 |
| B (different | 46 | -6645.595 | 13383.190 | 2.286 | 0.049 | |
| C (different | 64 | -6613.420 | 13354.840 | 2.262 | 0.046 | |
| D (different | 50 | -6613.537 | 13327.074 | ω1 = 0.025 | ||
| D2 (different | 47 | -6617.228 | 13328.456 | 2.286 | ω1 = 0.025 | |
| E (different | 68 | -6580.030 | 13296.060 | rs2 = 3.944 | ω1 = 0.023 |
The three partitions correspond to the 1: N-terminal extreme (104 codons), 2: the central region (101 codons) and 3: the C-terminal extreme (80 codons).
a np, number of parameters including 33 branch lengths.
b rs, nucleotide substitution rate. The subscript corresponds to the three partitions of flaA sequences (rs1 is 1 in all models and it is not shown); κ: transition/transversion ratio; ω: dN/dS ratio.
c πs, codon frequencies (data not shown).
d Model D but with κ fixed to 2.286.
Likelihood Ratio Test statistics for fixed-site models
| Model 1a | Model 2b | Parameter contrasted | 2Δℓc | dfd | |
| B | A | different | 369.51 | 2 | 5.8 10-81 *** |
| C | B | different πs | 64.35 | 18 | 3.9 10-7 *** |
| D | C | different κ, ω | 0.23 | 14 | > 0.9999 |
| D2 | B | different ω | 56.73 | 1 | 0.5 10-13 *** |
| E | D | different πs | 67.01 | 18 | 1.4 10-7 *** |
a Alternative model; b null model; c 2Δℓ = 2(ℓ1 - ℓ0), d degrees of freedom (see Table 4).
*** significant at P < 0.001
Parameter estimates for branch-site model A
| np | ℓ | Site class | Proportion | Background | Foreground | Selected sites under positive selectionc | |
| Model Aa | 4 | -6631.240 | 0 | 0.779 | 0.027 | 0.027 | |
| 1 | 0.065 | 1 | 1 | ||||
| 2a | 0.144 | 0.027 | 39.95 | ||||
| 2b | 0.012 | 1 | 39.95 | ||||
| Model A with ω = 1b | 3 | -6638.340 | 0 | 0.712 | 0.026 | 0.026 | |
| 1 | 0.060 | 1 | 1 | ||||
| 2a | 0.210 | 0.026 | 1 | Not allowed | |||
| 2b | 0.018 | 1 | 1 | ||||
a Alternative model; b null model
c Sites inferred under positive selection at the 99% level are in bold, and those at the 95% in italics