| Literature DB >> 16083508 |
Mircea Podar1, Jonathan R Eads, Toby H Richardson.
Abstract
BACKGROUND: Completed genomes and environmental genomic sequences are bringing a significant contribution to understanding the evolution of gene families, microbial metabolism and community eco-physiology. Here, we used comparative genomics and phylogenetic analyses in conjunction with enzymatic data to probe the evolution and functions of a microbial nitrilase gene family. Nitrilases are relatively rare in bacterial genomes, their biological function being unclear.Entities:
Mesh:
Substances:
Year: 2005 PMID: 16083508 PMCID: PMC1199592 DOI: 10.1186/1471-2148-5-42
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Figure 1Maximum likelihood tree of nitrilases from known bacterial species (accession numbers are in parentheses). Bootstrap support values are indicated for the major groups only. The schematic organization of the gene clusters that contain a nitrilase ORF is shown for species where that sequence information is available.
Figure 2Organization of gene clusters around the subfamily 1 nitrilases in sequenced bacterial genomes. The highly conserved gene cluster Nit1C is flanked by unrelated genomic neighbourhoods in the different species. Gene names are based on the available genomic annotation.
Figure 3(A). Protein maximum likelihood tree of subfamily 1 nitrilases. The tree was arbitrarily rooted with sequences from the two most closely related subfamilies 2 and 3. Numbers at nodes represent bootstrap support (not shown if <50). (B). Diagram of the gene clusters that include the nitrilase ORF. For environmental genes, the information was limited by the size of the genomic insert. (C). Histogram representing enzymatic enantioselectivity (R or S) on hydroxyglutaronitrile, based on data from [9](na, not assayed; x, not active).
Parameter estimates, likelihood scores and identified selected sites under various models. Branch numbers refer to Figure 4A. Parameters indicating positive selection are in bold. A likelihood ratio test (LRT) is used to compare a pair of nested models: one which accounts for sites with ω > 1 and one which does not (the null model). To accept or reject the ω > 1 hypothesis, twice the log-likelihood difference in the scores is compared with a χ2 distribution with the degrees of freedom equal to the difference in the numbers of parameters between the two models. When ML detects lineages with ω > 1, an empirical Bayes analysis identifies sites under positive selection and calculate posterior probabilities that provide a measure of confidence for that prediction.
| M0:one ratio | 1 | -11903.5 | ω = 0.0418 | none | |
| Site-specific models | |||||
| M1:neutral (K = 2) | 1 | -13195.5 | p0 = 0.298, p1 = 0.702 | not allowed | |
| M3:discrete (K = 2) | 3 | -11627.6 | p0 = 0.6, p1 = 0.4, ω0 = 0.012, ω1 = 0.098 | none | |
| Branch-site models | |||||
| Branch 1 | |||||
| Model A | 3 | -13160.0 | p0 = 0.3, p1 = 0.70, p2+p3 = 0, ω2 = 0 | none | |
| Model B | 5 | -11627.6 | p0 = 0.4, p1 = 0.6, p2+p3 = 0 | none | |
| Branch 2 | |||||
| Model A | 3 | -13188.7 | p0 = 0.296, p1 = 0.688, | Q157 (P = 0.77), Q203 (P = 0.999), T41, Q157, Y184, N200, Q203, R284 (P > 0.9) | LRT vs. M1 2Δl = 6.8, P = 0.03, df = 2 |
| Model B | 5 | -11621.4 | p0 = 0.356, p1 = 0.59, | LRT vs. M3 (K = 2) 2Δl = 6.2, P = 0.04, df = 2 | |
Figure 4(A) Maximum likelihood tree for subfamily 1 nitrilases used to test for positive selection. Branch lengths are scaled to the mean number of substitutions per codon site under model M3. Branches 1 and 2 indicate lineages tested for positive selection signal, following the transition event. The sequences illustrate the variability across the clade at positions identified under positive selection. (B). A three dimensional model of the 1A21 nitrilase dimer. Shown are the catalytic triad (blue) and the residues under positive selection (red). Residue 39, invariant before and after the transition event, is shown in green.