| Literature DB >> 28054982 |
Lander Bauters1, Diana Naalden2, Godelieve Gheysen3.
Abstract
Nematodes are a very diverse phylum that has adapted to nearly every ecosystem. They have developed specialized lifestyles, dividing the phylum into free-living, animal, and plant parasitic species. Their sheer abundance in numbers and presence in nearly every ecosystem make them the most prevalent animals on earth. In this research nematode-specific profiles were designed to retrieve predicted lectin-like domains from the sequence data of nematode genomes and transcriptomes. Lectins are carbohydrate-binding proteins that play numerous roles inside and outside the cell depending on their sugar specificity and associated protein domains. The sugar-binding properties of the retrieved lectin-like proteins were predicted in silico. Although most research has focused on C-type lectin-like, galectin-like, and calreticulin-like proteins in nematodes, we show that the lectin-like repertoire in nematodes is far more diverse. We focused on C-type lectins, which are abundantly present in all investigated nematode species, but seem to be far more abundant in free-living species. Although C-type lectin-like proteins are omnipresent in nematodes, we have shown that only a small part possesses the residues that are thought to be essential for carbohydrate binding. Curiously, hevein, a typical plant lectin domain not reported in animals before, was found in some nematode species.Entities:
Keywords: C-type lectin; bioinformatic analysis; hevein; nematode
Mesh:
Substances:
Year: 2017 PMID: 28054982 PMCID: PMC5297725 DOI: 10.3390/ijms18010091
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Results of the profile hidden Markov model (HMMer) search. Numbers indicate the number of sequences with one or more domains of a type of lectin domain in each nematode species. The total number of sequences per lectin domain is indicated in the bottom row. Numbers in brackets give an indication of the number of potential carbohydrate-binding lectins, a question mark (?) indicates that we are unable to draw conclusions regarding carbohydrate binding properties. CE: Caenorhabditis elegans, AS: Ascaris suum, BM: Brugia malayi, BX: Bursaphelenchus xylophilus, GP: Globodera pallida, HA: Heterodera avenae, MI: Meloidogyne incognita, MH: Meloidogyne hapla, MG: Meloidogyne graminicola, HO: Hirschmanniella oryzae, HG: Heterodera glycines, PC: Pratylenchus coffeae, PT: Pratylenchus thornei, #Seq: the number of protein sequences that was scanned for potential lectin domains. # indicates the total number of potential lectins for each class.
| Species | #Seq | Class V Chitinase | Hevein | Legume | LysM | F-Type Lectin | Ricin-B | Calnexin/Calreticulin | M-Type Lectin | C-Type Lectin | Galectin | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Genomes | CE | 26150 | 34 (30) | 3 | 2 (2) | 8 (?) | 1 (0) | 14 (6) | 2 (2) | 3 | 252 (52) | 38 (18) |
| AS | 18542 | 8 (4) | 0 | 3 (2) | 2 (?) | 1 (0) | 9 (8) | 3 (2) | 5 | 36 (8) | 11 (6) | |
| BM | 17938 | 12 (7) | 0 | 2 (2) | 4 (?) | 0 | 11 (10) | 8 (5) | 4 | 18 (1) | 13 (10) | |
| BX | 18074 | 7 (1) | 0 | 2 (2) | 1 (?) | 0 | 11 (7) | 3 (1) | 4 | 15 (0) | 14 (4) | |
| GP | 16417 | 9 (6) | 0 | 3 (2) | 0 | 0 | 9 (6) | 3 (2) | 5 | 22 (4) | 38 (7) | |
| MI | 20359 | 2 (0) | 0 | 1 (1) | 3 (?) | 0 | 7 (5) | 4 (3) | 6 | 57 (8) | 70 (9) | |
| MH | 14421 | 4 (3) | 0 | 2 (2) | 4 (?) | 0 | 8 (5) | 1 (0) | 4 | 29 (2) | 53 (3) | |
| Transcriptomes | MG | 10973 | 3 | 0 | 0 | 0 | 0 | 1 | 4 | 2 | 48 | 13 |
| HO | 17061 | 5 | 0 | 2 | 2 | 0 | 2 | 5 | 2 | 33 | 28 | |
| HA | 10070 | 3 | 0 | 2 | 2 | 0 | 6 | 1 | 4 | 1 | 9 | |
| HG | 12262 | 4 | 0 | 0 | 1 | 0 | 0 | 5 | 0 | 4 | 9 | |
| PC | 11507 | 0 | 0 | 0 | 1 | 0 | 2 | 1 | 0 | 3 | 7 | |
| PT | 19533 | 0 | 0 | 0 | 1 | 0 | 4 | 2 | 1 | 0 | 5 | |
| # | 91 | 3 | 19 | 29 | 2 | 84 | 42 | 40 | 518 | 308 | ||
Figure 1Alignment of the eight discovered hevein domains in Caenorhabditis elegans with the original hevein peptide from Hevea brasiliensis. Asterisks indicate the eight conserved cysteine residues forming four disulfide bridges; dots show the aromatic amino acids and serine residue involved in chitin binding.
Organization of proteins from the investigated nematode species containing CTLDs. An asterisk (*) indicates a group that was not previously characterized [20]. CTLD: C-type lectin-like domain, CE: C. elegans, AS: A. suum, BM: B. malayi, GP: G. pallida, MI: M. incognita, MH: M. hapla, BX: B. xylophilus. CUB: Complement C1r/C1s, Uegf, Bmp1 domain, CW: conserved cysteine and trypthophan residues, also known as PAN-3 domain, VWA: Von Willebrand factor type A domain. Complex structure: CTLD-containing gene with multiple domains where CTLD might not determine the actual function of the protein.
| Class | Description | CE | AS | BM | GP | MI | MH | BX |
|---|---|---|---|---|---|---|---|---|
| I | 1 CTLD | 141 | 24 | 15 | 17 | 27 | 17 | 9 |
| II | 2–3 CTLD | 46 | 0 | 0 | 1 | 24 | 10 | 1 |
| III | 1–3 CTLD + 1–3 CUB | 28 | 0 | 1 | 0 | 0 | 1 | 1 |
| IV | 1–2 CTLD + 1–2 CW | 16 | 0 | 0 | 0 | 0 | 0 | 0 |
| V | 1 CTLD + 1–2 VWA | 14 | 9 | 0 | 0 | 0 | 0 | 0 |
| VI | Complex structure | 7 | 2 | 1 | 4 | 6 | 1 | 4 |
| VII* | 4 CTLD | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
Presence of conserved residues in nematode CTLDs. The first column shows the nine different conserved residues. The second column indicates the share of sequences that contain the respective residue. The last three rows show the share of the different combinations of disulfide bridges in the investigated nematode sequences. C1-C4: conserved cysteine residues forming the disulfide bridges (C1-C4 and C2-C3 bridge). W, G, and L: conserved residues from the “WIGL” motif. α1A: conserved alanine residue in the first α-helix of the C-type lectin fold. β1’L: conserved leucine residue in the first β-sheet of the C-type lectin fold.
| Conserved Residue | % Residue |
|---|---|
| C1 | 93.17 |
| C2 | 68.45 |
| C3 | 70.30 |
| C4 | 77.49 |
| W | 70.66 |
| G | 90.59 |
| L | 42.80 |
| α1A | 80.07 |
| β1’L | 64.94 |
| Both C bridges | 48.71 |
| Only C1-C4 bridge | 24.35 |
| Only C2-C3 bridge | 11.99 |
Figure 2Midpoint rooted phylogenetic tree of 77 nematode C-type lectin-like domains (CTLDs) predicted with the maximum likelihood algorithm. Taxon labels give the full genome accession code of the gene containing the CTLD, followed by a number indicating which CTLD of that particular gene is used in this analysis. Explanation of prefixes: Bm: Brugia malayi, GPLIN: Globodera pallida, Mh10: Meloidogyne hapla, Mincdb: Meloidogyne incognita, GS: Ascaris suum. Taxon labels without any of the previous prefixes originate from Caenorhabditis elegans.
Figure 3Schematic overview of the different types of potential lectins present in the seven investigated nematode genomes. Eight lectin classes are represented here, each divided into several subclasses (denoted by letters) according to the presence of other predicted protein domains. The number of sequences present in each subclass is indicated in brackets.