| Literature DB >> 32028881 |
Shane Denecke1, Panagiotis Ioannidis2, Benjamin Buer3, Aris Ilias4, Vassilis Douris4,5, Pantelis Topalis4, Ralf Nauen3, Sven Geibel3, John Vontas4,6.
Abstract
BACKGROUND: Stink bugs are an emerging threat to crop security in many parts of the globe, but there are few genetic resources available to study their physiology at a molecular level. This is especially true for tissues such as the midgut, which forms the barrier between ingested material and the inside of the body.Entities:
Keywords: Midgut; Nezara viridula; P450; Proteomics; Southern green stink bug; Transcriptomics; Transporter
Mesh:
Substances:
Year: 2020 PMID: 32028881 PMCID: PMC7006211 DOI: 10.1186/s12864-020-6459-6
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Comparative gene sets among insects. a A phylogeny is shown constructed from 221 single-copy genes present in all species included in this analysis. The Pentatomidae (red) form a cluster within the Hemiptera (yellow) order which forms a sister clade to Holometabola (blue). The tree is rooted with the crustacean Daphnia pulex (not shown). Black dots indicate nodes with bootstrap support > 75%, whereas gray dots indicate nodes with bootstrap support between 50 and 75%. The scale bar is in substitutions per site. b Orthology profile of stinkbugs (names shown in red), compared to other insects. Note the large fraction of species-specific genes in N. viridula (Nviri) which is very similar to what has been previously documented for the pea aphid A. pisum (Apisu). Species names prefixed with “[T]” indicate that the unigene set was obtained from a transcriptome assembly; for the remaining insect species the data were obtained from a genome assembly. Species names abbreviations: Nviri – N. viridula; Ahila – A. hilare; Pstal – P. stali; Hhaly – H. halys; Cruti – C. rutilans; Ofasc – Oncopeltus fasciatus; Rprol – Rhodnius prolixus; Clect – Cimex lectularius; Dcitr – Diaphorina citri; Apisu – A. pisum; Tcast – Tribolium castaneum; Dmela – Drosophila melanogaster; Dplex – Danaus plexippus; Amell – Apis mellifera
Statistics of transcriptome and proteome: An overview of the transcriptome and proteome is given in terms of total reads, contigs, unigenes, and detected proteins
| Transcriptome | ||||
| Total Reads | Total Contigs | Total unigenes | Total non-bacterial unigenes | |
| 1,426,685,586 | 314,260 | 28,402 | 25,890 | |
| Proteome with bacterial-like transcripts | ||||
| Total M1 | Total M2 | Total M3 | Total M4 | Total proteins |
| 2401 | 1992 | 2472 | 2370 | 3472 |
| Proteome without bacterial-like transcripts | ||||
| Total M1 | Total M2 | Total M3 | Total M4 | Total proteins |
| 2377 | 1968 | 2102 | 1945 | 3027 |
Fig. 2Shared transcript and protein expression. Venn diagrams are shown for both detected transcripts (a) and proteins (b), showing expression > 1 transcript per million (TPM) in each tissue. In each case, a sizable portion of the detected features are found across all midgut compartments, indicating a that many genes are expressed across the anterior-posterior axis. In both cases the M1 and M4 region display the most distinctiveness. The relatively lower number of proteins detected in the proteome compared to transcripts in the transcriptome is reflective of the sensitivities of these two technologies
Fig. 3Principle Component Analysis. The results of a principle component analysis of the expression of all unigenes is shown. The first two principle components explain a total of 78% of the total variation detected within the RNA-seq data. Each shape and color represent a distinct sample (M1: Blue triangle, M2: Green square, M3: Black cross, M4: purple crossed square, carcass: red circle). The variation in each sample is shown with an ellipse which encompass all replicates in that sample
Fig. 4Gene expression patterns along the midgut. The results of the fuzzy-C means clustering is shown. Transcripts were grouped into eight categories based on their relative expression pattern, and all members with membership values > 0.6 were plotted. The darker shading on the plot indicates a larger number of individual transcripts which show that expression pattern. The top four clusters are composed of more complicated patterns, whereas the bottom four clusters display transcripts enriched specifically in one compartment
Table 2 A table is shown reflecting all of the significantly enriched GO terms from the gene groups produced from either the Fuzzy C-means clustering or the Top 500 most highly expressed genes in each sample. For each GO term, supporting information is provided such as the Gene Group from which it was identified the Code name, False discovery rate and information about how many genes with this term were identified compared to the total number in the transcriptome
| Gene Group | Code Name | False Discovery Rate | # of Genes in Group with Term | # of Genes in Transcriptome with Term | GO Term Annotation |
|---|---|---|---|---|---|
| Continual_increase_fuzzycluster | PF13909 | 0.000130095 | 23 | 189 | C2H2-type zinc-finger domain |
| Continual_increase_fuzzycluster | GO:0003676 | 0.000187859 | 69 | 1115 | nucleic acid binding, molecular_function |
| M1_fuzzycluster | GO:0006412 | 1.39345E-06 | 52 | 548 | translation, biological_process |
| M1_fuzzycluster | PF02958 | 1.62758E-06 | 19 | 85 | Ecdysteroid kinase |
| M1_fuzzycluster | PF00112 | 0.000255183 | 17 | 110 | Papain family cysteine protease |
| M1_fuzzycluster | PF00504 | 0.000255183 | 11 | 43 | Chlorophyll A-B binding protein |
| M1_fuzzycluster | GO:0008234 | 0.000504127 | 17 | 121 | cysteine-type peptidase activity, molecular_function |
| M1_fuzzycluster | GO:0006414 | 0.000780785 | 11 | 56 | translational elongation, biological_process |
| M1_Top 500 | PF00112 | 1.58324E-11 | 19 | 110 | Papain family cysteine protease |
| M1_Top 500 | GO:0008234 | 2.43945E-11 | 19 | 121 | cysteine-type peptidase activity, molecular_function |
| M1_Top 500 | PF00135 | 1.63518E-08 | 15 | 100 | Carboxylesterase family |
| M1_Top 500 | PF02958 | 1.13244E-06 | 12 | 85 | Ecdysteroid kinase |
| M1_Top 500 | PF00067 | 4.40651E-06 | 14 | 146 | Cytochrome P450 |
| M1_Top 500 | GO:0055114 | 0.000163275 | 37 | 1078 | oxidation-reduction process, biological_process |
| M1_Top 500 | GO:0006508 | 0.000818748 | 18 | 394 | proteolysis, biological_process |
| M2_fuzzycluster | PF00112 | 5.48524E-05 | 11 | 110 | Papain family cysteine protease |
| M2_fuzzycluster | GO:0008234 | 0.000114662 | 11 | 121 | cysteine-type peptidase activity, molecular_function |
| M2_Top 500 | GO:0055114 | 3.88423E-08 | 53 | 1078 | oxidation-reduction process, biological_process |
| M2_Top 500 | PF00112 | 2.93311E-07 | 15 | 110 | Papain family cysteine protease |
| M2_Top 500 | GO:0008234 | 7.96227E-07 | 15 | 121 | cysteine-type peptidase activity, molecular_function |
| M2_Top 500 | PF00067 | 9.31946E-07 | 16 | 146 | Cytochrome P450 |
| M3_Top500 | GO:0055114 | 1.54695E-07 | 51 | 1078 | oxidation-reduction process, biological_process |
| M3_Top500 | PF00067 | 4.50598E-05 | 14 | 146 | Cytochrome P450 |
| M3_Top500 | PF00135 | 4.50598E-05 | 12 | 100 | Carboxylesterase family |
| M4_fuzzycluster | GO:0055085 | 1.64321E-13 | 137 | 572 | transmembrane transport, biological_process |
| M4_fuzzycluster | GO:0016021 | 1.03943E-10 | 122 | 543 | integral component of membrane, cellular_component |
| M4_fuzzycluster | PF12796 | 2.91941E-05 | 83 | 387 | Ankyrin repeats (3 copies) |
| M4_fuzzycluster | GO:0008061 | 4.19023E-05 | 29 | 85 | chitin binding, molecular_function |
| M4_fuzzycluster | GO:0016020 | 5.10354E-05 | 95 | 517 | membrane, cellular_component |
| M4_fuzzycluster | PF00083 | 0.000120073 | 36 | 117 | Sugar (and other) transporter |
| M4_fuzzycluster | GO:0005515 | 0.000164315 | 307 | 2221 | protein binding, molecular_function |
| M4_fuzzycluster | GO:0006468 | 0.000464805 | 57 | 282 | protein phosphorylation, biological_process |
Fig. 5Phylogeny of identified P450s in N. viridula. A phylogenetic tree constructed of N. viridula P450s is shown along with other reference species. The names of each of the four CYP clans is shown at the outer rim. Individual P450s used in the tree were those N. viridula P450s identified in the current study (black), or P450s from D. melanogaster (dark red)
Nutrient transporter expression in the midgut
| Measuremnet | POT | APC | AAAP | NSS | SWEET | SSS | SP |
|---|---|---|---|---|---|---|---|
| Average M1 Expression (TPM) | 32.73 | 12.60 | 22.67 | 2.60 | 56.08 | 0.84 | 36.89 |
| Average M2 Expession (TPM) | 103.55 | 18.06 | 49.53 | 2.08 | 46.17 | 0.26 | 36.73 |
| Average M3 Expression (TPM) | 152.57 | 17.88 | 30.03 | 2.01 | 40.04 | 0.70 | 26.94 |
| Average M4H Expression (TPM) | 33.04 | 48.00 | 85.90 | 9.07 | 34.44 | 5.17 | 54.84 |
| Average Carcass Expression (TPM) | 2.98 | 15.86 | 30.02 | 3.90 | 15.23 | 19.24 | 16.54 |
| # of M1 Transcripts over 50 TPM | 1 | 1 | 1 | 0 | 1 | 0 | 9 |
| # of M2 Transcripts over 50 TPM | 1 | 1 | 2 | 0 | 1 | 0 | 7 |
| # of M3 Transcripts over 50 TPM | 1 | 1 | 1 | 0 | 1 | 0 | 4 |
| # of M4H Transcripts over 50 TPM | 0 | 3 | 6 | 0 | 1 | 0 | 16 |
| # of Carcass Transcripts over 50 TPM | 0 | 0 | 1 | 0 | 0 | 1 | 4 |
After filtering for transcripts with <1 TPM in all compartments, average expression values were displayed along with the total number of highly expressed > 5 and > 50 > 50TPM genes for each transporter family in each compartment
Fig. 6Enrichment of AAAP and APC transporters in the M4 midgut region. A boxplot is shown comparing the expression of all AAAP and APC transporters showing along the midgut. Only genes showing expression above >1TPM in either the M4 region or the anterior region (average of the M1-M3 region) were plotted. The M4 region of the gut displays an enrichment in both APC and AAAP expression compared to the anterior section of the midgut