| Literature DB >> 27503142 |
Abstract
BACKGROUND: Prior to egg laying the parasitoid wasp Nasonia vitripennis envenomates its pupal host with a complex mixture of venom peptides. This venom induces several dramatic changes in the host, including developmental arrest, immunosuppression, and altered metabolism. The diverse and potent bioactivity of N. vitripennis venom provides opportunities for the development of novel acting pharmaceuticals based on these molecules. However, currently very little is known about the specific functions of individual venom peptides or what mechanisms underlie the hosts response to envenomation. Many of the venom peptides also lack bioinformatically derived annotations because no homologs can be identified in the sequences databases. The RNA interference system of N. vitripennis provides a method for functional characterisation of venom protein encoding genes, however working with the current list of 79 candidates represents a daunting task. For this reason we were interested in determining the expression levels of venom encoding genes in the venom gland, as this information could be used to rank candidates for further study. To do this we carried out deep transcriptome sequencing of the venom gland and ovary tissue and used RNA-seq to rank the venom protein encoding genes by expression level. The generation of a specific venom gland transcriptome dataset also provides further opportunities to investigate novel features of this specialised organ.Entities:
Keywords: Drug development; Ovary; RNA-seq; Serine protease; Transcriptomics; Venom gene
Mesh:
Substances:
Year: 2016 PMID: 27503142 PMCID: PMC4977848 DOI: 10.1186/s12864-016-2924-7
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
RNA-seq statistics of Nasonia vitripennis ovary and VG tissue
| Sample | Total Reads | Mapped (%) | Assigned (%)a | Multiple Alignments (%)b | No Feature (%)c |
|---|---|---|---|---|---|
| VG 1 | 2.89E + 07 | 95.62 | 88.08 | 3.63 | 1.99 |
| VG 2 | 3.24E + 07 | 96.15 | 87.31 | 3.37 | 1.97 |
| VG 3 | 3.14E + 07 | 67.03 | 59.65 | 3.54 | 4.32 |
| Ovary 1 | 3.31E + 07 | 95.17 | 85.51 | 1.43 | 5.95 |
| Ovary 2 | 3.42E + 07 | 95.16 | 84.06 | 1.46 | 6.34 |
| Ovary 3 | 3.06E + 07 | 95.16 | 84.18 | 1.43 | 6.13 |
aProportion of reads which could be aligned to a gene model
bProportion of reads assigned to multiple gene models
cProportion of aligned reads that did not overlap a gene model
Fig. 1RNA-seq highlights expression differences between ovary and VG tissue. a Principal component analysis of expression profiles for ovary (green) and VG (blue) replicates showing the samples clustering by treatment group. The percentage on the axis labels represents the total variance explained by that component (b) Volcano plot of the RNA-seq data with positive fold change representing up regulated genes in the VG and vice versa for ovary. Genes coloured as red were differentially expressed at an adjusted p-value < 0.05. c Pie chart summarising the proportion of genes that were part of each expression category (adjusted p-value < 0.05). d Semi-quantitative RT-PCR targeting the highest expressed annotated venom encoding gene Nasvi2EG007167. Amplification of Nasvi2EG007167 cDNA could be detected at 24 cycles in the venom gland cDNA replicates, whilst only a faint band could be detected at 30 cycles for the ovary samples. The house-keeping gene RP49 was used as a loading control. VG1-3 and O1-3 represent independent replicates for venom gland and ovary samples, respectively
Fig. 2GO terms related to ribosomal function, translation, and protein processing are enriched in the VG. Enriched GO terms amongst the subset of genes significantly upregulated in the ovary (a) and VG (b). The bar graph represents the proportion (in percentage) of sequences in the differentially expressed subset (blue) with the indicated functional annotation, compared to the proportion of these annotations assigned to the entire gene set (red)
Summary of VG and ovary RNA-seq data for the 79 Nasonia venom protein encoding genes
| Gene ID | Ovary read countsb | VG read countsb | Log2 Fold Changec | Adj p-val | Classification | |
|---|---|---|---|---|---|---|
| Higha | Nasvi2EG013868 | 141 | 4600307 | 15.00 | 2.33E-127 | Venom protein Y |
| Nasvi2EG009648 | 76 | 2859048 | 15.20 | 7.49E-129 | Venom protein G | |
| Nasvi2EG009662 | 93 | 2701601 | 14.82 | 7.75E-126 | Venom protein Z | |
| Nasvi2EG009647 | 90 | 2310849 | 14.65 | 2.5E-124 | Venom protein Q | |
| Nasvi2EG014072 | 75 | 2063715 | 14.75 | 4.41E-85 | Venom protein X | |
| Nasvi2EG007167 | 43 | 1532232 | 15.11 | 7.49E-129 | Serine protease | |
| Mediuma | Nasvi2EG003930 | 49 | 1218607 | 14.61 | 5.88E-123 | Serine protease inhibitor 2 |
| Nasvi2EG014069 | 26 | 976928 | 15.20 | 1.2E-106 | Venom protein L | |
| Nasvi2EG006243 | 25 | 819734 | 15.03 | 5.88E-111 | γ-glutamyltranspeptidase | |
| Nasvi2EG012348 | 23 | 756942 | 15.04 | 1.14E-125 | Venom protein H | |
| Nasvi2EG009035 | 22 | 666528 | 14.92 | 5.13E-127 | Endonuclease-like | |
| Nasvi2EG036525 | 18 | 660159 | 15.17 | 4.8E-129 | Aminotransferase-like | |
| Nasvi2EG008596 | 25 | 595426 | 14.56 | 7.8E-121 | Venom protein P | |
| Nasvi2EG009661 | 21 | 590933 | 14.80 | 1.37E-124 | Venom protein V | |
| Nasvi2EG002524 | 11 | 386873 | 15.11 | 1.45E-74 | Venom protein I | |
| Nasvi2EG001168 | 68 | 385586 | 12.47 | 3.96E-92 | Immunoglobulin-like | |
| Nasvi2EG014575 | 10 | 312136 | 14.94 | 1.22E-99 | Venom protein K | |
| Nasvi2EG010245 | 10 | 257730 | 14.72 | 4.26E-84 | Odorant-binding protein | |
| Nasvi2EG019091 | 48 | 245164 | 12.31 | 4.4E-13 | Venom protein M | |
| Nasvi2EG010351 | 7 | 197286 | 14.79 | 1.09E-124 | Glucose dehydrogenase-like | |
| Nasvi2EG004342 | 6 | 181642 | 14.97 | 9.19E-125 | Lipase-like | |
| Nasvi2EG016543 | 12 | 155729 | 13.63 | 3.31E-110 | Venom protein N | |
| Nasvi2EG007615 | 4 | 153498 | 15.08 | 1.03E-125 | Lipase | |
| Nasvi2EG021024 | 737 | 149042 | 7.66 | 3.33E-50 | Serine protease homologue | |
| Nasvi2EG009991 | 4 | 145328 | 15.06 | 3.41E-127 | Acid phosphatase | |
| Nasvi2EG020297 | 387 | 141654 | 8.52 | 6.11E-59 | Serine protease/CLIP | |
| Nasvi2EG022918 | 4 | 130887 | 14.97 | 7.1E-125 | Serine protease | |
| Nasvi2EG008779 | 4 | 115742 | 14.82 | 1.21E-124 | Aminotransferase-like | |
| Nasvi2EG000667 | 5 | 104214 | 14.48 | 1.04E-119 | Serine proteinase inhibitor | |
| Nasvi2EG004628 | 8 | 98159 | 13.59 | 4.58E-65 | Venom protein D | |
| Nasvi2EG000354 | 7 | 91416 | 13.72 | 2.06E-19 | Cysteine-rich/TIL 1 | |
| Nasvi2EG016421 | 91 | 90433 | 9.96 | 9.19E-65 | α-Esterase | |
| Nasvi2EG004144 | 5 | 81771 | 14.02 | 6.13E-19 | Venom protein T | |
| Nasvi2EG037101 | 6 | 77272 | 13.57 | 2.54E-48 | Chitin binding protein-like | |
| Nasvi2EG019611 | 2 | 75339 | 15.19 | 5.42E-125 | Laccase | |
| Nasvi2EG020586 | 2 | 66726 | 15.06 | 9.42E-120 | Serine protease | |
| Nasvi2EG013736 | 54 | 64975 | 10.24 | 2.62E-45 | Serine protease inhibitor 1 | |
| Nasvi2EG020997 | 1 | 63251 | 15.64 | 7.83E-121 | Venom protein J | |
| Nasvi2EG026553 | 4 | 46698 | 13.56 | 3.89E-109 | Lipoprotein receptor-like | |
| Nasvi2EG020296 | 716 | 42847 | 5.90 | 1.43E-32 | Serine protease/CLIP | |
| Nasvi2EG006920 | 3 | 42299 | 13.91 | 9.42E-113 | Arylsulphatase b | |
| Nasvi2EG002112 | 224 | 38349 | 7.42 | 1.67E-47 | β-1,3-Glucan recognition | |
| Nasvi2EG022916 | 2 | 38281 | 14.41 | 4.45E-79 | Serine protease | |
| Nasvi2EG000112 | 18 | 34599 | 10.87 | 3.97E-83 | Apyrase | |
| Nasvi2EG022914 | 1 | 33717 | 14.50 | 1.78E-111 | Serine protease | |
| Nasvi2EG037342 | 3926 | 33363 | 3.09 | 3.21E-04 | Calreticulin | |
| Nasvi2EG004152 | 18 | 26602 | 10.57 | 2.07E-79 | Venom protein R | |
| Nasvi2EG020295 | 1 | 24624 | 14.74 | 1.28E-113 | Serine protease/CLIP | |
| Nasvi2EG005749 | 1 | 23005 | 14.93 | 1.58E-59 | Serine protease | |
| Nasvi2EG013838 | 711 | 22562 | 4.99 | 5.35E-11 | Venom protein F | |
| Nasvi2EG011463 | 1 | 20535 | 14.43 | 9.18E-110 | Inositol phosphatase | |
| Nasvi2EG004824 | 3 | 19029 | 12.49 | 6.8E-51 | Venom protein E | |
| Lowa | Nasvi2EG006071 | 1 | 18923 | 14.14 | 9.03E-97 | Venom protein S |
| Nasvi2EG023753 | 103 | 12408 | 6.91 | 3.68E-42 | Dipeptidylpeptidase IV | |
| Nasvi2EG013885 | 1 | 9901 | 13.00 | 2.6E-102 | γ-Glutamyl transpeptidase 2 | |
| Nasvi2EG005790 | 1 | 8481 | 13.24 | 6.37E-25 | Antigen-5 like | |
| Nasvi2EG006543 | 30 | 7665 | 8.00 | 2.54E-40 | γ-Glutamyl cyclotransferase- | |
| Nasvi2EG016379 | 0 | 7408 | 15.24 | 3.96E-78 | Venom protein U | |
| Nasvi2EG005645 | 0 | 5919 | 13.75 | 5.86E-31 | Venom protein O | |
| Nasvi2EG007282 | 390 | 4909 | 3.65 | 1.54E-09 | Serine protease | |
| Nasvi2EG000351 | 84 | 3807 | 5.51 | 1.44E-01 | Cysteine-rich/TIL 2 | |
| Nasvi2EG008007 | 5 | 1805 | 8.46 | 3.43E-27 | Angiotensin-converting | |
| Nasvi2EG022626 | 326 | 532 | 0.71 | 6.18E-01 | Serine protease/CUB | |
| Nasvi2EG009433 | 47 | 426 | 3.17 | 1.90E-04 | Cysteine-rich/KU | |
| Nasvi2EG000909 | 93 | 147 | 0.66 | 4.82E-01 | C1q-like venom protein | |
| Nasvi2EG009664 | 1 | 110 | 6.65 | 1.19E-03 | Cysteine-rich/Pacifastin 1 | |
| Nasvi2EG009665 | 4 | 93 | 4.62 | 2.44E-03 | Cysteine-rich/Pacifastin 2 | |
| Nasvi2EG012510 | 5 | 83 | 4.17 | 1.23E-04 | Chitinase 5 | |
| Nasvi2EG012285 | 9 | 77 | 3.15 | 1.94E-05 | Acid phosphatase | |
| Nasvi2EG015708 | 929 | 68 | −3.76 | 1.14E-15 | Aspartylglucosaminidase | |
| Nasvi2EG008324 | 200 | 62 | −1.70 | 4.03E-04 | Trehalase | |
| Nasvi2EG007347 | 18 | 41 | 1.19 | 1.00E + 00 | Serine protease | |
| Nasvi2EG005784 | 1 | 21 | 4.63 | 3.49E-02 | Antigen 5-like protein | |
| Nasvi2EG010516 | 3 | 20 | 2.88 | 1.03E-02 | Metalloprotease | |
| Nasvi2EG007166 | 0 | 16 | N/A | 4.85E-06 | Serine protease | |
| Nasvi2EG015589 | 1 | 8 | 3.37 | 7.42E-03 | Laccase | |
| Nasvi2EG021414 | 0 | 3 | 4.23 | 3.08E-01 | Nucleoside hydrolase | |
| Nasvi2EG011314 | 0 | 1 | N/A | 7.67E-01 | Venom protein W | |
| Nasvi2EG011442 | 1 | 0 | N/A | 1.00E + 00 | Serine protease |
aExpression category, see methods for details
bLibrary size normalised read counts from DESeq divided by transcript length (in kb) and sorted in descending order based on the venom gland values
cFold changes calculated directly from library size normalised read counts by DESeq
Highest 15 expressed genes in the VG and ovary sorted in descending order by normalised read counts
| Venom gland data | |||||
|---|---|---|---|---|---|
| Gene ID | Ovary normalised countsb | VG normalised countsb | Log2 Fold Changec | Adj p-val | Classification |
| Nasvi2EG013868a | 141 | 4600307 | 15.00 | 2.33E-127 | Venom protein Y |
| Nasvi2EG009648a | 76 | 2859048 | 15.20 | 7.49E-129 | Venom protein G |
| Nasvi2EG009662a | 93 | 2701601 | 14.82 | 7.75E-126 | Venom protein Z |
| Nasvi2EG009647a | 90 | 2310849 | 14.65 | 2.50E-124 | Venom protein Q |
| Nasvi2EG014072a | 75 | 2063715 | 14.75 | 4.41E-85 | Venom protein X |
| Nasvi2EG021198 | 62 | 1557446 | 14.61 | 6.16E-55 | Unknown |
| Nasvi2EG007167a | 43 | 1532232 | 15.11 | 7.49E-129 | Serine Protease |
| Nasvi2EG003930a | 49 | 1218607 | 14.61 | 5.88E-123 | Serine protease inhibitor 2 |
| Nasvi2EG009363 | 54 | 1217247 | 14.45 | 5.14E-121 | Unknown |
| Nasvi2EG014069a | 26 | 976927 | 15.20 | 1.20E-106 | Venom protein L |
| Nasvi2EG006243a | 25 | 819734 | 15.03 | 5.88E-111 | γ-glutamyltranspeptidase |
| Nasvi2EG012348a | 23 | 756942 | 15.04 | 1.14E-125 | Venom protein H |
| Nasvi2EG011080 | 20 | 722740 | 15.15 | 3.50E-126 | Unknown |
| Nasvi2EG009035a | 22 | 666528 | 14.92 | 5.13E-127 | Endonuclease-like |
| Nasvi2EG036525a | 18 | 660159 | 15.17 | 4.80E-129 | Aminotransferase-like 2 |
| Ovary data | |||||
| Nasvi2EG004175 | 35984 | 37 | −9.92 | 4.96E-65 | Unknown |
| Nasvi2EG002830 | 34980 | 10187 | −1.78 | 7.40E-05 | HEAT shock 70 |
| Nasvi2EG000341 | 29283 | 27085 | −0.11 | 1 | CCHC-type zinc finger |
| Nasvi2EG007353 | 29093 | 708.41 | −5.36 | 2.33E-32 | ATP-dependent RNA helicase |
| Nasvi2EG002390 | 25623 | 73530 | 1.52 | 2.81E-3 | 40S ribosomal protein |
| Nasvi2EG006904 | 25600 | 45210 | 0.82 | 2.70 E-1 | Elongation factor 1-alpha |
| Nasvi2EG011214 | 25321 | 70701 | 1.48 | 2.69E-2 | 40S ribosomal protein |
| Nasvi2EG020214 | 25158 | 139912 | 2.48 | 1.38E-4 | 60S acidic ribosomal protein |
| Nasvi2EG004598 | 25153 | 72614 | 1.53 | 2.70E-3 | 60S ribosomal protein |
| Nasvi2EG007380 | 23250 | 117959 | 2.34 | 1.28E-3 | 60S acidic ribosomal protein |
| Nasvi2EG019818 | 23062 | 85908 | 1.90 | 1.68E-4 | 40S ribosomal protein |
| Nasvi2EG003839 | 22711 | 103851 | 2.20 | 1.39E-05 | 60S ribosomal protein |
| Nasvi2EG007413 | 22602 | 92995 | 4.11 | 5.36E-05 | 40S ribosomal protein |
| Nasvi2EG011575 | 21836 | 57655 | 2.64 | 2.21E-2 | 60S ribosomal protein |
| Nasvi2EG014261 | 21805 | 219749 | 10.08 | 3.29E-11 | Unknown |
aIndicates venom protein encoding gene
bLibrary size normalised read counts from DESeq, divided by transcript length (in kb) and sorted in descending order based on the venom gland values
cFold changes calculated from library size normalised read counts by DESeq
Fig. 3Venom genes were highly expressed in the VG. Log2 normalised gene counts for the 40 highest expressed (a), and 39 lowest expressed (b), venom protein encoding genes (sorted by highest observed loci count). VG counts are shown in blue and ovary counts in orange
Fig. 4Heatmap of normalised counts for the 100 highest expressed novel genes identified by cufflinks in the ovary and VG expression data. The top 100 novel genes identified by cufflinks based on mean expression in the VG and Ovary colour coded by expression level. VG1-3 and O1-3 represent data from the three VG and ovary replicates, respectively. The dendrogram shows the clustering of samples based on euclidian distance
Fig. 5Differential splicing of VG specific novel gene XLOC_013208. The coverage of mapped reads over the two predicted gene models for XLOC_013208 the highest expressed novel gene in the VG. The coverage range is displayed in the square brackets in the top left corner of the figure. The blue bar represents the gene model as predicted by cufflinks. The thick sections represent exons and the thin lines represent introns. The rightward pointing arrows represent the orientation of the gene and the numbers above are for reference to the base pair position on scaffold 29
Gene annotations of novel detected transcripts from Nasonia vitripennis VG and Ovary samples
| Gene ID | Classification | GO terms |
|---|---|---|
| XLOC_005019 | Atp-dependent dna helicase q4 | metal ion binding, helicase activity, heterocyclic compound binding, organic cyclic compound binding |
| XLOC_010355 | Signal peptidase complex catalytic subunit sec11a | signal peptide processing, integral to membrane, proteolysis, serine-type peptidase activity |
| XLOC_012101 | N-acetylneuraminate lyase-like | lyase activity, metabolic process |
| XLOC_013961 | NADH dehydrogenase subunit 2 | mitochondrial electron transport, NADH to ubiquinone, mitochondrial inner membrane, respiratory chain, respiratory chain, NADH dehydrogenase (ubiquinone) activity, integral to membrane |
| XLOC_015645 | NADH dehydrogenase subunit partial | integral to membrane, mitochondrion, oxidation-reduction process, NADH dehydrogenase (ubiquinone) activity |
| XLOC_016115 | Uncharacterized aarf domain-containing protein kinase 1 | protein phosphorylation,oxidation-reduction process, protein kinase activity, ubiquinone biosynthetic process, ATP binding, flavin adenine dinucleotide binding oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, NAD(P)H as one donor, and incorporation of one atom of oxygen |
| XLOC_016465 | 60s ribosomal protein | ribosome, structural constituent of ribosome, translation |
| XLOC_019886 | 28 s ribosomal protein mitochondrial | ribosome, structural constituent of ribosome, translation |
| XLOC_020689 | signal peptidase complex subunit 3 | peptidase activity, signal peptide processing, integral to membrane, signal peptidase complex |
| XLOC_022279 | inosine-5 -monophosphate dehydrogenase 1b | oxidation-reduction process, adenyl nucleotide binding, metal ion binding, GMP biosynthetic process, cytoplasm, IMP dehydrogenase activity |
Fig. 6Differences in metatranscriptomic composition of the VG and ovary. Distribution of reads that did not map to the Nasonia genome assigned to the phylum level from the VG a and ovary b RNA-seq data by the program Megan
Fig. 7Proteobacteria species identified in the venom gland and ovary. Distribution of reads that did not map to the Nasonia genome, which can be assigned to Proteobacteria species in the ovary and VG data by Megan