| Literature DB >> 27832206 |
Shalini Kalra1, Mangottil Ayyappan Pradeep1, Ashok K Mohanty1, Jai K Kaushik1.
Abstract
Sperm lysozyme-like proteins belonging to c-type lysozyme family evolved in multiple forms. Lysozyme-like proteins, viz., LYZL2, LYZL3 or SLLP1, LYZL4, LYZL5 and LYZL6 are expressed in the testis of mammals. Not all members of LYZL family have been uniformly and unambiguously identified in the genome and proteome of mammals. Some studies suggested a role of SLLP1 and LYZL4 in fertilization; however, the function of other LYZL proteins is unknown. We identified all known forms of LYZL proteins in buffalo sperm by LC-MS/MS. Cloning and sequence analysis of the Lyzl cDNA showed 38-50% identity at amino acid level among the buffalo LYZL paralogs, complete conservation of eight cysteines and other signature sequences of c-type lysozyme family. Catalytic residues in SLLP1, LYZL4 and LYZL5 have undergone replacement. The substrate binding residues showed significant variation in LYZL proteins. Residues at sites 62, 101, 114 in LYZL4; 101 in SLLP1; 37, 62, and 101 in LYZL6 were more variable among diverse species. Sites 63 and 108 occupied by tryptophan were least tolerant to variation. Site 37 also showed lower tolerance to substitution in SLLP1, LYZL4 and LYZL5, but more variable in non-testicular lysozymes. Models of LYZL proteins were created by homology modeling and the substrate binding pockets were analyzed in term of binding energies and contacting residues of LYZL proteins with tri-N-acetylglucosamine (NAG)3 in the A-B-C and B-C-D binding mode. Except LYZL6, LYZL proteins did not show significant difference in binding energies in comparison to hen egg white lysozyme in the A-B-C mode. (NAG)3 binding energy in the B-C-D mode was higher by 1.3-2.2 kcal/mol than in A-B-C mode. Structural analysis indicated that (NAG)3 was involved in making more extensive interactions including hydrogen bonding with LYZL proteins in B-C-D mode than in A-B-C mode. Despite large sequence divergence among themselves and with respect to c-type lysozymes, substrate binding residues as well as hydrogen bonding network between (NAG)3 and proteins were mostly conserved. LYZL5 in buffalo and other mammalian species contained additional 10-12 amino acid sequence at c-terminal that matched with ankyrin repeat domain-containing protein 27. Phylogenetic analysis indicated LYZL2 to be most ancient among all the LYZL proteins and that the evolution of LYZL proteins occurred through several gene duplications preceding the speciation of mammals from other vertebrates as distant as reptiles and amphibians.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27832206 PMCID: PMC5104373 DOI: 10.1371/journal.pone.0166321
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Multiple sequence alignment of deduced amino acid sequences of buffalo matured LYZL2, SLLP1, LYZL4, LYZL5, LYZL6 and HEWL.
The sequence shown within the red color box indicates specific signature of lysozyme family. The conserved cysteine and tryptophan residues are highlighted with yellow and green color bars, respectively. The catalytic residues corresponding to positions 35 and 52 of c-type lysozyme are shown in red color and underlined. The residues marked with diamond (♦) in blue color represent substrate binding sites in c-type lysozymes.
Substrate binding residues in c-type lysozyme family.
| Residue Number (according to HEWL seq) | Non-Testicular c-type Lysozymes | Testicular Lysozyme-Like Proteins from several species | ||||||
|---|---|---|---|---|---|---|---|---|
| LYZL2 | SLLP1 | SLLP1 (Reptiles) | LYZL4 | LYZL5 | LYZL5 (Frog) | LYZL6 | ||
| 37 | N | G | G | G (Alligator, Anole)/H (Python, Garter snake)/R (Turtle) | K | G | R | K |
| 62 | W | T | K | L | D | W | W | Y |
| 63 | W | W | W | W | W | W | W | W |
| 101 | D | E | E | G | G | S | D | G |
| 108 | W | W | W | W | W | W | W | W |
| 114 | H | H | H | N | N | H | Y | H |
* The most occurring residues in species at respective positions, while those occurring in one or two species only are shown with species name in parenthesis.
Fig 2Structural models of lysozyme-like proteins complexed with (NAG)3 based on several template structures.
PDB IDs of template are provided in material method part. For clarity only one model for each LYZL protein is shown. Panels a–LYZL2, b–SLLP1, c–LYZL4, d–LYZL5, e–LYZL6 and f– 1JEF (TEWL as one of the template). The protein part is shown in gray color and (NAG)3 molecule in ball and stick style has been shown in elemental colors. The substrate binding residues interacting with (NAG)3 are shown in three letter amino acid codes, while catalytic residues (residues at position 35 and 52 or 53) are labelled with single letter code. The NAG monomer binding subsites are represented by capital letters A, B and C in panel f.
Binding energy, contacting residues and hydrogen bonding residues of LYZL with (NAG)3 in their complex structures.
| LYZL2-(NAG)3 | 5.50 ± 0.42 | Asp52, Gln57, Phe56, Ile58, Asn59, Thr62, Trp63, Lys72, Arg74, Ile97, Glu100, Thr101, Asp102, Tyr106, Trp107, Gln108 | Asn59, Thr62(8) Trp63, Glu100, Thr101(15), Asp102(8), Tyr106, Gln108(10) |
| SLLP1-(NAG)3 | 5.52 ± 0.45 | Gln57, Ile58, Asn59, Arg61, Lys62, Trp63, Leu74, Ile97, Glu100, Pro101, Gln102, Ser106, Trp107 | Asn59, Arg61(11), Trp63, Glu100, Gln102, Ser106 |
| LYZL4-(NAG)3 | 5.43 ± 0.60 | Phe57, Gln58, Ile59, Arg60, His62, Asp63, Trp64, Arg69, Arg71, Ile94, Gly97, Lys98, Arg99, Ala103, Trp104, Pro105 | Arg60, Trp64, Arg71(14), Gly97(16), Arg99, Ala103 |
| LYZL5-(NAG)3 | 5.32 ± 0.41 | Glu52, Phe56, Gln57, Leu58, Asn59, Trp62, Trp63, Val97, Ser100, Glu101, Ser102, Ala106, Trp107, Asp108 | Asn59, Trp62, Trp63, Glu101(25), Ala106, Asp108(15) |
| LYZL6-(NAG)3 | 4.83 ± 0.44 | Asp52, Phe56, Gln57, Ile58, Asn59, Tyr62, Trp63, Asn74, Ile97, Gly100, Ala101, Gly102, Asn106, Trp107, Val108 | Asn59, Trp63, Gly100(10), Gly102(8), Asn106 |
| Template (1LZB) | 6.02 ± 0.25 | Asn46, Asp52, Gln57, Ile58, Asn59, Arg61, Trp62, Trp63, Arg73, Leu75, Ile98, Asp101, Gly102, Asn103, Ala107, Trp108 | Asn59, Trp62, Trp63, Asp101, Asn103, Ala107 |
| LYZL2-(NAG)3 | 6.84 ± 0.39 | Glu35, Leu46, Asp52, Gln57, Ile58, Asn59, Thr62, Trp63, Arg74, Ile97, Glu100, Tyr106, Trp107, Gln108, Gly109 | Glu35, Asp52, Gln57(10), Asn59, Thr62(11), Trp63, Glu100, Tyr106, Gln108 |
| SLLP1-(NAG)3 | 7.34 ± 0.47 | Glu46, Asn52, Gln57, Ile58, Asn59, Arg61, Lys62, Trp63, Leu74,0 Ile97, Glu100, Gln102, Ser106, Trp107, Glu108, Ala109 | Glu46, Asn52(21), Gln57(17), Asn59, Arg61(13), Trp63, Glu100(18), Gln102, Ser106, Glu108 |
| LYZL4-(NAG)3 | 7.62 ± 0.47 | Glu35, Tyr44, Asn46, Arg48, Phe57, Gln58, Ile59, Arg60, His62, Asp63, Trp64, Arg71, Ile94, Arg99, Ala103, Trp104, Pro105, Ser106 | Glu35, Asn46(25), Gln58, Arg60, Asp63(21), Trp64, Arg71(13), Arg99, Ala103 |
| LYZL5-(NAG)3 | 6.93 ± 0.33 | Glu35, Asn44, Asn46, Glu52, Phe56, Gln57, Leu58, Asn59, Trp62 Trp63, Val97, Ser100, Ser102, Ala106, Trp107, Asp108, Ser109 | Glu35, Glu52, Asn59, Trp62(16), Trp63, Ser102(8), Ala106, Asp108 |
| LYZL6-(NAG)3 | 6.69 ± 0.41 | Glu35, Asn44, Asn46, Asp52, Phe56, Gln57, Ile58, Asn59, Tyr62, Trp63, Asn74, Ile97, Asn106, Trp107, Val108, Lys109 | Glu35, Asn46(17), Asp52, Gln57(9), Asn59, Tyr62(6), Trp63, Asn106, Val108 |
| Template (1LMP) | 7.0 ± 0.34 | Glu35, Asn44, Asn46, Asp52, Gln57, Ile58, Asn59, Tyr62, Trp63, Lys73, Val75, Val98, Asp101, Asn103, Ala107, Trp108, Val109, Ala110 | Glu35, Asp52, Asn59, Tyr62, Trp63, Asp101, Asn103, Ala107, Val109 |
sd—standard deviation.
The listed residues were observed interacting with (NAG)3 in majority of the model structures. The residue numbering differ among the LYZL proteins due to insertions or deletions, see sequence alignment in Fig 1 for corresponding residues in various LYZL proteins. The data are based on analysis of 60 structures in each case of LYZL proteins in each of the binding mode.
Residues shown are those involved in making hydrogen bond with (NAG)3 in most of the model structures. The numbers in parenthesis indicate the number of models in which a particular residue was observed in only a few models making hydrogen bond with (NAG)3 out of total 60 models.
Fig 3Phylogenetic map of lysozyme family.
The phylogenetic tree was constructed by using ML method. The nodes with diamond (♦) in blue color represent duplication event. LYZ-lysozyme, LYZL-lysozyme-like proteins, BoSt-cattle stomach, BuSt-buffalo stomach, ShSt-sheep stomach, BoMlk-cattle milk, BuMlk-buffalo milk, Hu-human, Ch-chimpanzee, Mc-monkey, Rt-rat, Mu-mouse, Bo-cattle, Bu-buffalo, Sh-sheep, Gt-goat, Py-python, Gr-garter snake. Lysozymes from non-mammals such as drosophila (Dr) and Bombyx mori (Bm) were used as outgroups.