| Literature DB >> 22039562 |
Yovany Moreno1, Pierre-Paul Gros, Mifong Tam, Mariela Segura, Rajesh Valanparambil, Timothy G Geary, Mary M Stevenson.
Abstract
The murine parasite Heligmosomoides polygyrus is a convenient experimental model to study immune responses and pathology associated with gastrointestinal nematode infections. The excretory-secretory products (ESP) produced by this parasite have potent immunomodulatory activity, but the protein(s) responsible has not been defined. Identification of the protein composition of ESP derived from H. polygyrus and other relevant nematode species has been hampered by the lack of genomic sequence information required for proteomic analysis based on database searches. To overcome this, a transcriptome next generation sequencing (RNA-seq) de novo assembly containing 33,641 transcripts was generated, annotated, and used to interrogate mass spectrometry (MS) data derived from 1D-SDS PAGE and LC-MS/MS analysis of ESP. Using the database generated from the 6 open reading frames deduced from the RNA-seq assembly and conventional identification programs, 209 proteins were identified in ESP including homologues of vitellogenins, retinol- and fatty acid-binding proteins, globins, and the allergen V5/Tpx-1-related family of proteins. Several potential immunomodulators, such as macrophage migration inhibitory factor, cysteine protease inhibitors, galectins, C-type lectins, peroxiredoxin, and glutathione S-transferase, were also identified. Comparative analysis of protein annotations based on the RNA-seq assembly and proteomics revealed processes and proteins that may contribute to the functional specialization of ESP, including proteins involved in signalling pathways and in nutrient transport and/or uptake. Together, these findings provide important information that will help to illuminate molecular, biochemical, and in particular immunomodulatory aspects of host-H. polygyrus biology. In addition, the methods and analyses presented here are applicable to study biochemical and molecular aspects of the host-parasite relationship in species for which sequence information is not available.Entities:
Mesh:
Substances:
Year: 2011 PMID: 22039562 PMCID: PMC3201918 DOI: 10.1371/journal.pntd.0001370
Source DB: PubMed Journal: PLoS Negl Trop Dis ISSN: 1935-2727
Summary of the transcriptomic assembly and ESP subset annotations.
|
| ||
| Number of transcripts (Isoforms) >100 bp | 33,641 | |
| Number of Loci >100 bp | 29,918 | |
| Average length (SD) | 561.0+(566.2) | |
| Blastx (E>10−5) | ||
|
| 18,816 (55.9%) | |
|
| 15,338 (45.6%) | |
| non-redundant | 19,196 (57.1%) | |
|
|
| |
|
|
| |
|
| ||
| Number of Seqs with at least 1 GO term (%) | 14,094 (41.9) | 113 (54.3) |
|
| 85,884 (6,647) | 642 (306) |
| Cellular Part terms (unique) | 17,180 (764) | 99 (52) |
|
| 8,694 (25.8) | 47 (22.6) |
| Molecular Function terms (unique) | 25,715 (1,805) | 210 (87) |
|
| 11,671 (34.7) | 107 (51.4) |
| Biological Process terms (unique) | 42,989 (4,078) | 333 (167) |
|
| 11,296 (33.6) | 89 (42.8) |
|
| ||
| Seqs with at least 1 protein signature (%) | 17,342 (51.5) | 158 (76.0) |
| Annotated InterPro Entries | 4,547 | 134 |
|
| 2,204 | 70 |
|
| 1,896 | 41 |
| Seqs with annotated InterPro Entries (%) | 12,126 (36) | 157 (75.4) |
|
| 15,435 (45.9) | 104 (50) |
|
| 5,760 (17.1) | 70 (33.7) |
Most abundant GO terms mapped and annotated using Blast2GO in the transcriptome and ESP datasets.
| Transcriptome | ESP subset | ||||||
| GO code | GO Term | Seqs (%) | GO code | GO Term | Seqs (%) | ||
|
| 1 | GO:0016021 | integral to membrane | 1,513 (4.5) | GO:0005737 | cytoplasm | 14 (6.7) |
| 2 | GO:0005634 | nucleus | 1,129 (3.4) | GO:0005576 | extracellular region | 6 (2.9) | |
| 3 | GO:0005737 | cytoplasm | 990 (2.9) | GO:0005615 | extracellular space | 6 (2.9) | |
| 4 | GO:0016020 | membrane | 743 (2.2) | GO:0016021 | integral to membrane | 4 (1.9) | |
| 5 | GO:0005829 | cytosol | 669 (2.0) | GO:0016020 | membrane | 4 (1.9) | |
|
| 1 | GO:0005515 | protein binding | 3,782 (11.2) | GO:0005515 | protein binding | 43 (20.7) |
| 2 | GO:0005524 | ATP binding | 1,327 (3.9) | GO:0008233 | peptidase activity | 8 (3.8) | |
| 3 | GO:0005488 | binding | 937 (2.8) | GO:0020037 | heme binding | 7 (3.4) | |
| 4 | GO:0008270 | zinc ion binding | 643 (1.9) | GO:0008237 | metallopeptidase activity | 7 (3.4) | |
| 5 | GO:0046872 | metal ion binding | 590 (1.8) | GO:0019825 | oxygen binding | 7 (3.4) | |
|
| 1 | GO:0009792 | embryonic development ending in birth or egg hatching | 2,239 (6.7) | GO:0008340 | determination of adult lifespan | 28 (13.5) |
| 2 | GO:0002119 | nematode larval development | 1,655 (4.9) | GO:0009792 | embryonic development ending in birth or egg hatching | 27 (13.0) | |
| 3 | GO:0040010 | positive regulation of growth rate | 1,513 (4.5) | GO:0040010 | positive regulation of growth rate | 26 (12.5) | |
| 4 | GO:0000003 | reproduction | 1,311 (3.9) | GO:0040011 | locomotion | 10 (4.8) | |
| 5 | GO:0040011 | locomotion | 992 (2.9) | GO:0000003 | reproduction | 10 (4.8) | |
*Indicates GO term enrichment in the ESP subset when compared to the transcriptome dataset (FDR<0.5).
Figure 1Molecular function and biological process GO terms (level 2) for the transcriptome and ESP subset.
Five most abundant domains and families inferred from InterProScan in the transcriptome and ESP datasets.
| Transcriptome | Secretome | |||||
| InterPro Accession | Name | Seqs (%) | InterPro Accession | Name | Seqs (%) | |
|
| ||||||
| 1 | IPR011009 | Protein kinase-like domain | 360 (1.1) | IPR014044 | CAP domain | 25 (12.0) |
| 2 | IPR013783 | Immunoglobulin-like fold | 288 (0.9) | IPR006026 | Peptidase, metallopeptidase | 6 (2.9) |
| 3 | IPR015943 | WD40/YVTN repeat-like-containing domain | 235 (0.7) | IPR000436 | Sushi/SCR/CCP | 5 (2.4) |
| 4 | IPR015880 | Zinc finger, C2H2-like | 223 (0.7) | IPR001747 | Lipid transport protein, N-terminal | 5 (2.4) |
| 5 | IPR016024 | Armadillo-type fold | 176 (0.5) | IPR008753 | Peptidase M13 | 5 (2.4) |
|
| ||||||
| 1 | IPR001757 | ATPase, P-type, K/Mg/Cd/Cu/Zn/Na/Ca/Na/H-transporter | 80 (0.2) | IPR001283 | Allergen V5/Tpx-1-related | 18 (8.7) |
| 2 | IPR002198 | Short-chain dehydrogenase/reductase SDR | 69 (0.2) | IPR000718 | Peptidase M13, neprilysin | 8 (3.8) |
| 3 | IPR001283 | Allergen V5/Tpx-1-related | 68 (0.2) | IPR001534 | Transthyretin-like | 5 (2.4) |
| 4 | IPR020685 | Tyrosine-protein kinase | 61 (0.2) | IPR008632 | Nematode fatty acid retinoid binding | 3 (1.4) |
| 5 | IPR020636 | Calcium/calmodulin-dependent protein kinase-like | 56 (0.2) | IPR009283 | Apyrase | 3 (1.4) |
*Indicates enrichment of the domain or the family in the ESP subset when compared to the transcriptome dataset (FDR<0.5).
Most abundant proteins based on the total spectra identified in ESP from adult H. polygyrus.
| Protein Name | Blast2GO Seq. Description | Accession Numbers | P (%) | SC | UP | Closest Blastx Hit Description | Closest Blast HitSpecie | Closest Blast Hit Accession Number | Blastx Hit eValue |
|
| venom-allergen-like protein family member (vap-1) | L_272_T_4_0.700_3 | 100 | 132 | 19 | two-domain activation associated secreted protein ASP4 precursor |
| CAO00417.1 | 2E-89 |
|
| globin | L_566_T_2_0.476_3 L_566_T_3_0.571_3 L_566_T_4_0.524_3 L_566_T_5_0.762_2 L_566_T_6_0.762_3 | 100 | 82 | 5 | Globin-like host-protective antigen |
| P27613.1 | 3E-30 |
|
| secreted protein asp-2 | L_120_T_1_0.410_1 L_120_T_5_0.436_1 L_120_T_8_0.462_1 | 100 | 74 | 6 | secreted protein 4 precursor |
| AAO63576.1 | 3E-26 |
|
| globin | L_384_T_8_0.306_5 | 100 | 73 | 10 | Globin-like host-protective antigen |
| P27613.1 | 3E-55 |
|
| secreted protein asp-2 | L_2405_T_4_0.700_4 | 100 | 53 | 11 | secreted protein 6 precursor |
| AAO63578.1 | 8E-20 |
|
| globin | L_2261_T_1_1.000_4 | 100 | 47 | 7 | Globin-like host-protective antigen |
| P27613.1 | 4E-39 |
|
| vit-2 | L_73_T_5_0.619_4 | 100 | 46 | 10 | C. briggsae CBR-VIT-6 protein |
| XP_002634040.1 | 6E-37 |
|
| C. elegans protein confirmed by T evidence | L_94_T_3_0.429_4 L_94_T_4_0.429_1 | 100 | 46 | 8 | LYSozyme family member (lys-8) |
| NP_495083.1 | 5E-30 |
|
| vitellogenin structural genes (yolk protein genes) family member (vit-2) | L_122_T_10_0.312_3 | 100 | 45 | 10 | VITellogenin structural genes (yolk protein genes) family member (vit-1) |
| NP_509305.1 | 1E-45 |
|
| ll20 15kda ladder antigen | L_2367_T_1_0.375_1 | 100 | 44 | 17 | DVA-1 polyprotein |
| Q24702.1 | 2E-145 |
|
| vitellogenin structural genes (yolk protein genes) family member (vit-2) | L_93_T_5_0.444_2 | 100 | 43 | 10 | VITellogenin structural genes (yolk protein genes) family member (vit-2) |
| NP_001123117.1 | 4E-53 |
|
| briggsae cbr-vit-5 protein | L_207_T_17_0.196_1 | 100 | 41 | 7 | VITellogenin structural genes (yolk protein genes) family member (vit-4) |
| NP_508612.1 | 3E-35 |
|
| briggsae cbr-vit-2 protein | L_290_T_3_0.833_3 | 100 | 35 | 4 | C. briggsae CBR-VIT-2 protein |
| XP_002644638.1 | 4E-32 |
|
| vitellogenin structural genes (yolk protein genes) family member (vit-6) | L_211_T_5_0.765_5 | 100 | 30 | 7 | Vitellogenin-6 |
| Q94637.1 | 8E-39 |
|
| vitellogenin structural genes (yolk protein genes) family member (vit-2) | L_3_T_3_0.875_4 | 100 | 28 | 7 | Vitellogenin-6 |
| Q94637.1 | 6E-31 |
|
| globin | L_2733_T_1_1.000_3 | 100 | 27 | 7 | Globin-like host-protective antigen |
| P27613.1 | 5E-56 |
|
| ancylostoma-secreted protein 1 precursor | L_6303_T_1_1.000_1 | 100 | 27 | 2 | ancylostoma-secreted protein 1 precursor |
| AAD13339.1 | 2E-04 |
|
| NA | L_927_T_1_1.000_6 L_927_T_2_1.000_4 | 100 | 25 | 2 | ||||
|
| secreted protein asp-2 | L_160_T_5_0.733_4 | 100 | 23 | 7 | secreted protein 5 precursor |
| AAO63577.1 | 3E-38 |
|
| vitellogenin structural genes (yolk protein genes) family member (vit-6) | L_407_T_1_1.000_5 | 100 | 23 | 3 | Vitellogenin-6 |
| Q94637.1 | 4E-15 |
|
| acetylcholinesterase 2 | L_7106_T_1_1.000_2 | 100 | 23 | 14 | putative neuromuscular acetylcholinesterase |
| AAS49411.1 | 6E-169 |
Protein name identifiers are derived from the original transcriptome assembly nomenclature (L = locus, T = transcript, C = confidence) from which the conceptual translation frame number was added to the identifiers. P (%) = Protein identification probability (%); SC = Number of spectral counts; UP = Number of unique peptides.