| Literature DB >> 17893075 |
Jin-Yu Wu1, Fang-Qing Zhao, Jie Bai, Gang Deng, Song Qin, Qi-Yu Bao.
Abstract
The cry gene family, produced during the late exponential phase of growth in Bacillus thuringiensis, is a large, still-growing family of homologous genes, in which each gene encodes a protein with strong specific activity against only one or a few insect species. Extensive studies are mostly focusing on the structural and functional relationships of Cry proteins, and have revealed several residues or domains that are important for the target recognition and receptor attachment. In this study, we have employed a maximum likelihood method to detect evidence of adaptive evolution in Cry proteins, and have identified 24 positively selected residues, which are all located in Domain II or III. Combined with known data from mutagenesis studies, the majority of these residues, at the molecular level, contribute much to the insect specificity determination. We postulate that the potential pressures driving the diversification of Cry proteins may be in an attempt to adapt for the "arm race" between delta-endotoxins and the targeted insects, or to enlarge their target spectra, hence result in the functional divergence. The sites identified to be under positive selection would provide targets for further structural and functional analyses on Cry proteins.Entities:
Mesh:
Substances:
Year: 2007 PMID: 17893075 PMCID: PMC5054088 DOI: 10.1016/S1672-0229(07)60020-5
Source DB: PubMed Journal: Genomics Proteomics Bioinformatics ISSN: 1672-0229 Impact factor: 7.691
Fig. 1Multiple alignment of Domains II and III of 18 Cry proteins. The shading reflects the conservation profile at 60% consensus of amino acids.
Fig. 2Phylogenetic tree of the 18 sequences constructed by the ME method. Numbers under the nodes indicate bootstrap values. Scale bars represent level of amino acid sequence divergence.
Parameter estimates, log likelihood scores, and positively selected sites for the Cry proteins from B. thuringiensis*
| Model | Parameter estimate | ln | 2Δ ln | AICc | Positively selected sites |
|---|---|---|---|---|---|
| M0 | −42,707.52 | 85,417.29 | None | ||
| M1a | −41,394.82 | 82,794.44 | Not allowed | ||
| M2a | −41,369.92 | (M1a vs M2a) | 82,750.92 | 400, | |
| M3 | −41,138.80 | (M0 vs M3) | 82,292.60 | 384, 385, 396, 400, 407, 424, | |
| M7 | −41,237.62 | 82,480.04 | Not allowed | ||
| M8 | −41,146.47 | (M7 vs M8) | 82,304.02 |
The maximum likelihood estimates showing the presence of positive selection are in boldface. The parameters p and q describe the shape of the beta distribution of ω, and p0, p1, and p2 are the proportions of codons belonging to each category. Proportions that are not free parameters are in parentheses. Sites inferred to be under positive selection with probabilities > 99% are in boldface.
Fig. 3Mapping of the residues identified to be under positive selection on the tertiary structure of Cry1Aa protein (PDB code: 1ciy). A and B: Ribbon diagram of Cry1Aa. C and D: The molecular surface of Cry1Aa. Sites identified to be under positive selection in the best approximation model M3 with posterior probabilities > 90% are shown in blue color. Three loops in Domain II mentioned in the text are shown in red color.
Accession numbers of the 18 sequences used in this study
| Sequence | Accession number |
|---|---|
| Cry1Ab | AAA22330 |
| Cry1Ac | AAA22331 |
| Cry1Ae | AAA22410 |
| Cry1Ba | CAA29898 |
| Cry1Ca | CAA30396 |
| Cry1Ea | CAA37933 |
| Cry7Aa | AAA22351 |
| Cry7Ab | AAA21120 |
| Cry8Aa | AAA21117 |
| Cry8Bc | CAD57543 |
| Cry8Da | BAC07226 |
| Cry9Ba | CAA52927 |
| Cry9Da | BAA19948 |
| Cry9Db | AAX78439 |
| Cry9Ea | BAA34908 |
| Cry9Eb | CAC50780 |
| Cry9Ec | AAC63366 |