Aphids are phloem-feeding insects known as major pests in agriculture that are able to transmit hundreds of plant viruses. The majority of these viruses, classified as noncirculative, are retained and transported on the inner surface of the cuticle of the needle-like mouthparts while the aphids move from plant to plant. Identification of receptors of viruses within insect vectors is a key challenge because they are promising targets for alternative control strategies. The acrostyle, an organ discovered earlier within the common food/salivary canal at the tip of aphid maxillary stylets, displays proteins at the cuticle-fluid interface, some of which are receptors of noncirculative viruses. To assess the presence of stylet- and acrostyle-specific proteins and identify putative receptors, we have developed a comprehensive comparative analysis of the proteomes of four cuticular anatomical structures of the pea aphid, stylets, antennae, legs, and wings. In addition, we performed systematic immunolabeling detection of the cuticular proteins identified by mass spectrometry in dissected stylets. We thereby establish the first proteome of stylets of an insect and determine the minimal repertoire of the cuticular proteins composing the acrostyle. Most importantly, we propose a short list of plant virus receptor candidates, among which RR-1 proteins are remarkably predominant. The data are available via ProteomeXchange (PXD016517).
Aphids are phloem-feeding insects known as major pests in agriculture that are able to transmit hundreds of plant viruses. The majority of these viruses, classified as noncirculative, are retained and transported on the inner surface of the cuticle of the needle-like mouthparts while the aphids move from plant to plant. Identification of receptors of viruses within insect vectors is a key challenge because they are promising targets for alternative control strategies. The acrostyle, an organ discovered earlier within the common food/salivary canal at the tip of aphid maxillary stylets, displays proteins at the cuticle-fluid interface, some of which are receptors of noncirculative viruses. To assess the presence of stylet- and acrostyle-specific proteins and identify putative receptors, we have developed a comprehensive comparative analysis of the proteomes of four cuticular anatomical structures of the pea aphid, stylets, antennae, legs, and wings. In addition, we performed systematic immunolabeling detection of the cuticular proteins identified by mass spectrometry in dissected stylets. We thereby establish the first proteome of stylets of an insect and determine the minimal repertoire of the cuticular proteins composing the acrostyle. Most importantly, we propose a short list of plant virus receptor candidates, among which RR-1 proteins are remarkably predominant. The data are available via ProteomeXchange (PXD016517).
Aphids
are phloem-feeding insects, well-known as major pests in
agriculture. More than 5,000 aphid species have been described. They
colonize countless plant species and have been reported on 300 plant
families ranging from gymnosperms to angiosperms.[1,2] Aphids
have a complex life cycle alternating between sexual and asexual reproduction
and seasonal host changes. They are and have long been extensively
studied, not only because of interesting life traits such as reproductive
and wing polyphenisms but also because they transmit numerous plant
diseases.[3] With more than 300 species transmitted,
aphids are one of the most efficient and important vectors of plant
viruses and are the best studied-model to characterize the mechanisms
of vector-mediated virus transmission (recently reviewed by Whitfield
and colleagues).[4−7] Most aphid-transmitted viruses bind reversibly to retention sites
on the inner cuticle of insect mouthparts to which they remain attached
during transport to a new host plant.[4] This
so-called noncirculative virus transmission is particularly difficult
to limit and control in field conditions, as the viruses can be acquired
and inoculated by nonresident aphids mostly within a single puncture
lasting only a few seconds.[8] During this
process, an intimate association occurs between viruses and their
vectors. This interaction is highly specific and involves the capsid
proteins or viral-encoded proteins called helper components, together
with poorly characterized molecular compounds in the insect cuticle
acting as receptors.[9−12] These vector molecules are promising targets for alternative viral
control strategies, and their identification would help characterizing
further the molecular mechanisms of virus–vector interaction.Aphids have piercing-sucking mouthparts, composed of the short
triangular labrum covering the base of the stylet bundle, and the
labium, a segmented organ which contracts to facilitate stylets penetration
into plant tissues. The stylet bundle arises in the head from its
secreting glands and extends outside the head in a dorsal groove of
the labium.[13,14] A pair of external mandibular
stylets innervated by two dendrites, surround two (noninnervated)
inner maxillary stylets and together form the stylet bundle. Specific
anatomical features are visible on both types of stylets. Barb-like
ridges are present at the tip of mandibular stylets to facilitate
penetration into plant tissues.[15,16] Maxillary stylets are
tightly interlocked by a series of longitudinal ridges and grooves,
which enclose the food and salivary canals formed by opposing grooves.
Food and salivary canals fuse a few micrometers from the tip, leading
to the confluent common food/salivary canal, that has long been believed
to harbor receptors of noncirculative viruses.[8,17−20] The existence of such receptors has definitely been evidenced with Cauliflower mosaic virus (CaMV), which was shown to bind
to cuticular proteins (CPs) accessible at the tip of maxillary stylets,
at the surface of a specific organ designated the acrostyle.[21] This organ is a distinct anatomical structure
restricted to a region of the common canal. Its surface properties
are different from the rest of the stylet cuticle.[22] Antibodies targeting CPs from the CPR family—the
largest CP family widespread among arthropods and containing the Rebers
and Riddiford (RR) consensus sequence—allowed the detection
of several peptides from RR-1 and RR-2 subgroups in the acrostyle.[22−26] More specifically, two highly homologous RR-1 proteins, Stylin-01
and Stylin-02, have been localized in the organ with a peptide corresponding
to their common C-terminus sequence directly accessible at the surface. In vitro competition assays and in vivo silencing approaches indicated that Stylin-01 was involved in CaMV
transmission.[26] However, its role in the
transmission of other noncirculative viruses could not be established,
and there is currently no evidence for a common receptor of all noncirculative
viruses. The acrostyle has a complex proteomic composition and contains
at least two RR-1 and several RR-2 proteins which could not be specifically
identified.[25] Many RR-2 have nearly identical
sequences that are not distinguishable by specific antibodies, highlighting
the limit of an immunolabeling approach to comprehensively determine
the proteome of aphid stylets, and the need for larger scale proteomic
characterization.Annotated CPs in the genome of the pea aphid Acyrthosiphon
pisum account for 150 proteins. They have been assigned to
six distinct CP families according to their conserved consensus motifs:
125 CPR (15 RR-1 and 110 RR-2), 11 CPAP1, 8 CPAP3, 3 TWDL, 2 CPF,
and 1 CPCFC proteins.[27] Up to now, mass-spectrometry
(MS)-based proteomics on stylet bundles was inconceivable considering
the size of these appendages. Moreover, studies of the proteomic composition
of larger cuticular structures have not been reported for aphids.
However, the recent advances in MS approaches allowing the extraction
of CPs from small quantities of insect tissues now make stylet proteomics
a realistic objective.[27]Our aim
in this proteomics study was to identify through a bottom-up
approach proteins and related peptides accessible at the surface of
aphid stylets that might play a key role in plant virus binding. To
draw-up a list of stylet-specific CPs, we first developed a comparative
analysis of the proteomes of four distinct pea aphid tissues partially
composed of cuticle: antennae, legs, wings, and stylets. In a second
phase, we produced antibodies targeting peptides from the stylet CPs
identified by proteomics to refine their localization within stylets
through in situ immunolabeling. As a result, we here
provide a short list of plant virus receptor candidates highly conserved
among aphid species and characterize their expression pattern in various
body parts and throughout aphid developmental stages. These candidate
receptors, named Stylins, have a patchy distribution within and along
aphid stylets but are emerging at the surface of the acrostyle. Most
have a RR-1 type chitin-binding domain, highlighting the remarkable
accessibility of CPs of this subgroup at the surface of the cuticle.
Experimental
Procedures
The experimental design and the organization of
the presentation
of the results are summarized in the workflow shown in Figure .
Figure 1
Experimental workflow
to characterize the proteomics composition
of the acrostyle and identify plant virus receptor candidates within
aphid stylets.
Experimental workflow
to characterize the proteomics composition
of the acrostyle and identify plant virus receptor candidates within
aphid stylets.
Chemical Reagents
For sample preparation,
all reagents
were purchased from Sigma-Aldrich (St. Louis, MO) except RapiGest SF surfactant purchased from Waters (Milford, MA). For LC-MS/MS
analysis, formic acid was obtained from Fluka (Sigma-Aldrich, St.
Louis, MO), Milli-Q water from Merck (Merck Millipore, Billerica,
MA), and all other chemicals were purchased from Carlo-Erba Reagents
(Val de Reuil, France). Modified sequencing-grade trypsin (Promega
Corporation, Madison, WI) was used for protein digestion.
Aphid Clones
A colony of Acyrthosiphon pisum (LL01) was maintained
on Vicia faba cv Robin Hood
in an environmental growth chamber at 23/18 °C (day/night) with
a photoperiod of 16/8 h (day/night). Myzus persicae Sulzer was maintained on Solanum melongena cv Barbentane
in the same temperature/photoperiod conditions.
Tissue Collection
for Liquid Chromatography Coupled to Tandem
Mass Spectrometry (LC-MS/MS) Analyses
Alate A. pisum adults collected from Vicia faba plants were starved
for 1 h and stored at −20 °C for several hours before
dissection. Collection of antennae, legs, and wings did not present
any technical difficulties. These appendages were collected in triplicate
for each structure, each replicate comprising antennae, wings, or
legs from 37 insects. Stylet bundles, hereafter named stylets, are
approximately 700 μm in length and 3 μm in diameter for A. pisum adults. They are anchored in aphid heads in the
glands from which they are secreted, the retort organs (described
by Ponsen and Guschinskaya and colleagues).[14,28] To extract exploitable information from MS analysis, hundreds of
stylets had to be pooled for each trial as free as possible of other
contaminating aphid tissues. In addition, we had to reduce electrostatic
charges that too often induced stylets fly off and loss. For these
reasons, a “clean-stylet” dissection protocol compatible
with MS standards had to be specifically designed (Figure A). Frozen aphids were first
glued on their back onto a microscope slide using double-sided bonding
tape (Figure Aa).
Stylets were then pulled out from the labium under a stereomicroscope
using tungsten insect pins (Figure Ab), separated from the head with microscissors (Figure Ac), and then transferred
onto a conductive glass slide (ITO, Bruker Daltonik, Germany), allowing
secured accumulation of stylets (Figure Ad). Dissecting tools were carefully washed
with water, followed by 70% ethanol, and finally dried on lint-free
paper after each stylet collection. Once 500 stylet bundles had been
accumulated onto the ITO-slide, they were carefully transferred with
a fine needle into a single glass microtube, avoiding static charge
effects, allowing control of the number of appendages deposited per
tube (Figure Ae).
This protocol was systematically used to collect three independent
batches of 500 stylets each.
Sample Preparation
All cuticular
structures were prepared
according to the same protocol. Samples were first washed for 15 min
with Milli-Q water and 50% acetonitrile (ACN), both acidified with
0.1% trifluoroacetic acid (TFA) (v/v, final concentration). Supernatants
were then removed after centrifugation, and the cuticular structures
dried out by centrifugation under vacuum (Labconco, Kansas City, USA).
All anatomical structures were then treated as described previously.[27] Briefly, proteins were extracted by successive
incubations in pure hexafluoroisopopanol (HFIP) and in 50 mM ammonium
bicarbonate supplemented with 0.1% RapiGest. After
reduction and alkylation, proteins were submitted to trypsin digestion.
Finally, digested samples were dried out and resuspended in 2% ACN/0.1%
TFA (v/v) prior to mass spectrometry (MS) analysis.
Nano Liquid
Chromatography Coupled to Tandem Mass Spectrometry
(nanoLC-MS/MS) Analysis
NanoLC-MS/MS was carried out using
an Ultimate 3000 nanoHPLC (Thermo Scientific, Germany) for the separation,
hyphenated to a Q-Exactive Orbitrap mass spectrometer (Thermo Scientific).
For chromatography, the digested samples were loaded, concentrated,
and washed at 10 μL/min for 6 min with 2% ACN and 0.05% TFA
on a microconcentrating column (300 μm × 5 mm PepMap 100,
C18, 5 μm, Thermo Scientific). The separation was
performed on a reversed-phase column (75 μm × 250 mm Acclaim
PepMap 100, C18, particle size 3 μm nanoviper column
from Thermo Scientific). The LC mobile phases for the separation were
water (A) and ACN (B), each supplemented with 0.1% formic acid (v/v).
Separation was achieved at a flow rate of 300 nL/min using a biphasic
linear gradient from 2% to 32% B in 100 min and from 32% to 65% B
in 5 min. MS analysis was carried out in positive ion and data-dependent
modes. The voltage applied to the nanotips (Nano Objective, USA) was
approximately 2.0 kV, and the header was at 300 °C. Full scan
(MS) spectra were obtained from 380 to 2,000 m/z (70,000 resolution, AGC target 3 × 106, maximum IT 200 ms), and for each full-scan the most intense ions
(Top 10) were fragmented in MS2 (17,500 resolution, AGC
target 2 × 105, maximum IT 100 ms, intensity threshold
4 × 104, excluding charge-unassigned ions, Normalized
Collision Energy selected at 27). Parent ions were then excluded from
MS/MS for the next 15 s. The softwares Chromeleon Xpress (Thermo Fisher
Scientific) was used to control the HPLC, and Xcalibur 2.2 (Thermo
Fisher Scientific) to control the mass spectrometer.
Database Searching
and Protein Identification
The Sequest
HT searching algorithm was run by Proteome Discoverer 1.4 (Thermo
Fisher Scientific) to match the acquired MS/MS spectra to a protein
database, with the following settings: trypsin digest with two maximum
missed cleavages, 6 and 144 amino acids as minimum and maximum peptide
length, respectively, a tolerance of 10 ppm/0.02 Da for precursors
and fragment ions, respectively. Cysteine carbamidomethylation was
set as a fixed modification; C-terminal protein amidation and methionine
and tryptophan oxidation were set as variable modifications.Searches were performed against combined proteome databases (Table S1) including: (i) the pea aphid proteome
(http://aphidbase.com, version
2.0 containing 33,291 protein-coding sequences); (ii) an eubacterial
reference proteome setup for putative symbionts and aphid microbiome-related
sequences (116,983 sequences); (iii) a set of sequences of entomopathogenic
fungi (28,431 sequences); (iv) a set of viral sequences including
aphid viruses and aphid-vectored broad-bean viruses were also selected
(6,614 sequences); (v) a mixed plant proteome assembled from available Fabacae proteomes (66,777 sequences); and (vi) a set of
common contaminants (116 sequences).A homemade A. pisum CP database comprising 150
annotated CPs was constructed using the Cuticular Protein Family Prediction
Tool CutProtFam-Pred (http://aias.biol.uoa.gr/CutProtFam-Pred/home.php) to accurately detect and classify putative CPs present in the pea
aphid proteome v2.0, thanks to known consensus and semiconsensus sequences
(Table S2).[29] Identification of CPs in cuticular structures was carried out with
this homemade database.
Data Presentation
To identify the
proteins that are
present in pea aphid cuticular anatomical structures, pools of antennae,
wings, legs, and stylet bundles were collected in triplicate on A. pisum adults. Proteins consistently identified in all
three biological replicates with at least a same peptide in each replicate,
regardless of being “Unique” (peptide present in only
one protein) or “Shared” (peptide present in multiple
different proteins), were considered as confidently identified and
constituted whole proteomes of the cuticular structures. In these
first data sets, proteins sharing a same identified peptide were grouped
under a single protein identifier (accession number corresponding
to the protein with the highest score and highest percentage of coverage
of the protein. These parameters can differ from one sample to another,
and a same “shared” peptide may be assigned to different
protein identifiers by the algorithm). Comparative analyses of whole
proteomes were carried out using these data sets.Minimal lists
of CPs were then retrieved from whole proteomes using homemade CP
database (Table S2). Minimal lists represent the smallest number of CPs that can explain all identified
MS peptides and will be used to design peptides for antibodies production
and immunolabeling of dissected stylets. However, CPs can share high
sequence similarity, particularly proteins of the CPR family.[26,30−33] Several proteins can be identified with a same tryptic fragment
(“shared” peptide). In this case, we cannot tell how
many of these proteins are actually incorporated into the cuticle
of the anatomical structures. To be fully exhaustive in our quest
for virus receptor candidates, lists of CPs potentially present in cuticular structures were extended to all proteins containing
these “shared” peptides.
Functional Classification
Gene Ontology (GO) terms
were assigned to AphidBase 2.0 (http://bipaa.genouest.org/is/aphidbase/acyrthosiphon_pisum/downloads/) via Blast2GO software version 5.1.12 using the InterProScan with
the Hidden Markov Models (HMMs) present in the PFAM protein family
database.[34] The number of proteins identified
in each cuticular structure was compared to all corresponding functionally
GO annotated terms in the entire pea aphid (A. pisum). GO term enrichment was determined via Fisher’s Exact test
at an FDR, p-value ≤0.05; the reduced enrichment was determined
at an FDR, p-value ≤0.01.
Antibodies
A total
of 26 antibodies were used to immunolabel
aphid stylets. Eleven antibodies were already available from previous
studies.[22,25,26] In addition,
15 peptides of 13–16 amino acids length with sequences originating
from peptides retrieved from MS analyses (either “Unique”
or “Shared” peptides), or originating from the full-length
sequences of the proteins identified in this study were synthesized
by Eurogentec (http://www.eurogentec.com). Eurogentec also carried out the production of antibodies in rabbit
and the affinity purification of the antisera. Alexa Fluor 488-conjugated
antirabbit IgG (A11070, Thermo Fisher Scientific,Waltham, MA) were
used as secondary antibodies.
Immunolabeling of Dissected
Stylets
A. pisum and M. persicae stylets were dissected and immunolabeled
according to Webster and colleagues, using primary antibodies at dilutions
of 1:200 and secondary Alexa fluor-conjugated antibody at a dilution
of 1:400.[25] Stylets were either untreated
or treated with 2 U/mL of Chitinase from Streptococcus griseus (Sigma-Aldrich, St. Louis, MO) before their incubation with primary
antibodies, as described by Uzest and colleagues, to eventually reveal
epitopes that could be localized under the surface layers of the stylet
cuticle.[21] When no labeling was observed
after 15 min of digestion, the Chitinase treatment was extended to
30 and 60 min. Two independent repeats were performed per condition
and per antibody.
Quantification of Stylin Transcripts in Various
Tissues and
at Different Developmental Stages
Transcripts of stylins
were quantified in the heads of all A. pisum nymphal
instars (N1 to N4) and adult stages and in different tissues. Insect
tissues (antennae, legs, wings, and antenna-free heads as a proxy
for stylet synthesizing glands) were collected from adults under a
stereomicroscope. Six pools of 10 aphids/tissues were tested per condition.
Total RNA was extracted from whole aphids or dissected tissues using
an RNeasy minikit (Qiagen, Hilden, Germany). A total of 80–100
ng of RNA was treated with RQ1 RNase-free DNase I (Promega Corporation,
Madison, WI). First-strand cDNA was synthesized using Moloney murine
leukemia virus (MMLV) reverse transcriptase (Promega Corporation)
according to the manufacturer’s instructions, with oligo(dT)
as primers. All RT-qPCRs were performed in duplicates on a LightCycler
480 instrument (Roche, Basel, Switzerland) using 1:4 dituted cDNAs
and a LightCycler 480 SYBR green I master mix (Roche, Penzberg, Germany)
according to the manufacturer’s recommendations with gene-specific
primers (Table S3). Two internal reference
genes encoding actin and elongation factor 1α (EF1α) from A. pisum were used for normalization. Amplification efficiencies
were analyzed with LinRegPCR free software (v. 2014.5). Relative expressions
were calculated using the threshold cycle (2–ΔΔCT) method.[35]
Alignments and Phylogenetic
Analyses
Sequences encoding
stylin proteins were retrieved from the seven genomes available on
AphidBase (http://bipaa.genouest.org/is/aphidbase/)—Myzus persicae (Mp; clone G006), Myzus cerasi (Mce), Rhopalosiphum padi (Rp), Diuraphis noxia (Dn), A. pisum (Ap), Daktulosphaira vitifoliae (Dv), Aphis glycines (Ag; Biotype 4)—using CutprotFam-Pred with standard settings.[29,36−39] Among these sequences, 17 were manually curated for the present
study.After removal of the predicted signal peptides using
the SignalP-5.0 server, CPR_RR-1 mature protein sequences were used
in phylogenetic analyses.[40] For CPAP3 proteins,
full sequences were used in phylogenetic analyses. Sequences were
aligned using MUSCLE (v3.8.31) configured for highest accuracy (MUSCLE
with default settings).[41] Ambiguous regions
(i.e., containing gaps and/or poorly aligned) were removed with Gblocks
(v0.91b) using the following parameters: minimum length of a block
after gap cleaning: 10, no gap positions were allowed in the final
alignment and all segments with contiguous nonconserved positions
bigger than eight were rejected, minimum number of sequences for a
flank position: 85%. Phylogenetic trees were reconstructed using the
maximum likelihood method implemented in the PhyML program (v3.1/3.0
aLRT). The WAG matrix, which works as empirical amino-acid substitution
model to simulate the biological sequence evolution with flexibility,
was selected assuming an estimated proportion of invariant sites (of
0.009) and four gamma-distributed rate categories to account for rate
heterogeneity across sites. The gamma shape parameter was estimated
directly from the data (gamma = 3.517). Reliability for internal branch
was assessed using the aLRT test (SH-Like). Graphical representation
and edition of the phylogenetic trees and alignments were performed
with TREEDYN (v. 198.3) and the T-Coffee software, respectively.[42−44]
Results
Comparative Analyses of the Proteome of Four
Cuticular Structures
of the Pea Aphid A. Pisum
To identify stylet-specific CPs,
we first characterized and compared the proteomes of four cuticular
structures: two muscular articulated segmented tissues, antennae and
legs; wings which are outgrowths of the exoskeleton; and stylet bundles
comprising two mandibular stylets harboring dendrites and two maxillary
stylets that are cell-free cuticular structures.6,574 peptides
were identified in antennae, 6,025 in legs, and 7,242 in wings, which
were specifically assigned to 971, 749, and 910 proteins. In stylets,
a smaller number of 1,118 peptides were identified and assigned to
141 proteins (Table , Tables S4–S7). Proteins identified
in legs, antennae, and wings represent 2.2–2.9% of the whole A. pisum proteome, while proteins identified in stylets
represent only 0.4%. Of the 1,273 proteins identified in this study,
90 were found in all four cuticular structures (7%) and constitute
what we hereafter refer to as the “core proteome” of
cuticular structures. 426 proteins were found to be structure-specific
(33%), with 162, 59, 165, and 40 proteins solely identified in antennae,
legs, wings, and stylets, respectively (Figure A, Table S8).
Table 1
Summary of Proteins Identified in Acyrthosiphon
pisum Cuticular Structures: (A) All Peptides
and Proteins Recovered from LC-MS/MS Analyses; (B) Peptides and Proteins
Assigned to A. pisum Proteome (A. pisum ID) Recovered from LC-MS/MS Analysis; (C) Cuticular Proteins (CPs)
Assigned to A. pisum Recovered from LC-MS/MS Analysisa
Antennae
Legs
Wings
Stylets
A
Total # of peptides identified
6590
6040
7260
1119
Total # of unique peptides identified
6304
5786
6942
1056
Total # of proteins identified
987
762
927
142
B
Total # of peptides
identified (A. pisum ID)
6574
6025
7242
1118
Total # of unique peptides identified (A. pisum ID)
Smallest
number of cuticular proteins
that can explain all observed peptides recovered from LC-MS/MS analysis.
All cuticular proteins recovered
from LC-MS/MS analysis identified with “Unique” peptides
and “Shared” peptides.
Figure 2
Core proteome of A. pisum cuticular structures.
(A) Venn diagram showing the proteins identified in the four cuticular
structures. 90 proteins found in all structures represent the “Core
Proteome”. (B) GO terms enriched in the Core Proteome set.
Histograms represent significantly enriched functional terms in Molecular
Function (green), Cellular Component (orange), and Biological Process
(blue). Bars represent the proportion of genes (%) enriched in the
corresponding functional groups. Numbers of genes enriched for each
functional group are displayed on the bars. *, p <
0,05; **, p < 0,01; ***,p <
0,001. See also Table S9.
#, number; CPs,
Cuticular Proteins;
ID, identifier.Smallest
number of cuticular proteins
that can explain all observed peptides recovered from LC-MS/MS analysis.All cuticular proteins recovered
from LC-MS/MS analysis identified with “Unique” peptides
and “Shared” peptides.Core proteome of A. pisum cuticular structures.
(A) Venn diagram showing the proteins identified in the four cuticular
structures. 90 proteins found in all structures represent the “Core
Proteome”. (B) GO terms enriched in the Core Proteome set.
Histograms represent significantly enriched functional terms in Molecular
Function (green), Cellular Component (orange), and Biological Process
(blue). Bars represent the proportion of genes (%) enriched in the
corresponding functional groups. Numbers of genes enriched for each
functional group are displayed on the bars. *, p <
0,05; **, p < 0,01; ***,p <
0,001. See also Table S9.As expected for cuticle-containing insect body parts, the
GO terms
overrepresented in the core proteome were related to the extracellular
region and were significantly enriched in cuticle-related functions
and the sclerotization process. Hence, categories such as structural
constituent of cuticle, chitin binding, carbohydrate metabolic process,
chitin metabolic process, oxidoreductase activity, and oxidation–reduction
process were overrepresented (Figure B, Table S9). Other categories
related to unfolded protein binding or small molecules binding, metabolic
processes, proton transmembrane transport, transferase or GTPase activities
were also identified.Proteomes of antennae, legs, and wings
were enriched in functions
related not only to the extracellular region but also to categories
associated with intracellular compartments such as ribosome, proteasome,
mitochondria, and vesicles (Figure A–C, Tables S10–S12). More categories were overrepresented in these three structures
compared to aphid stylets (31–41 categories vs 16 categories,
respectively; see Figure A–D). Notably, the chitin metabolic process category
is not specially enriched in stylets (Figure A–D), emphasizing the strictly extracellular
origin of stylet cuticle, while the other appendages contain the chitin-synthesizing
epithelia and its intracellular functions. Apart from cuticle-related
functions, antennae, legs, and wings contained proteins with commonly
enriched functions associated with protein synthesis, protein folding,
intracellular trafficking, and small molecules binding. In comparison,
categories enriched in stylets were mostly related to cuticle synthesis
or sclerotization pathways (Figure D, Table S13). Not surprisingly,
our data sets highlighted differences in proteomic composition between
nearly exclusively cuticle-based structures (i.e., stylets) and only
partly cuticle-based tissues also including cellular machineries (i.e.,
antennae, legs, and wings) (Figure E).
Figure 3
Comparative analysis of GO terms enriched by identified
proteins
in antennae, legs, wings, and stylets of A. pisum. (A–D) Comparison of identified proteins in the four cuticular
structures for Molecular Function (MF, green), Cellular Component
(CC, orange), and Biological Process (BP, blue). Histograms represent
significantly enriched functional terms in antennae (A), legs (B),
wings (C), and stylets (D). Bars represent the proportion of enriched
genes encoding the identified proteins compared to all genes involved
in the specified GO terms (%Gene/Term). Numbers of genes enriched
for each functional group are displayed on the bars. *, p < 0,05; **, p < 0,01; ***, p < 0,001. (E) WordClouds summarize the functional profiles of
each cuticular anatomical structure. See also Tables S10–S13.
Comparative analysis of GO terms enriched by identified
proteins
in antennae, legs, wings, and stylets of A. pisum. (A–D) Comparison of identified proteins in the four cuticular
structures for Molecular Function (MF, green), Cellular Component
(CC, orange), and Biological Process (BP, blue). Histograms represent
significantly enriched functional terms in antennae (A), legs (B),
wings (C), and stylets (D). Bars represent the proportion of enriched
genes encoding the identified proteins compared to all genes involved
in the specified GO terms (%Gene/Term). Numbers of genes enriched
for each functional group are displayed on the bars. *, p < 0,05; **, p < 0,01; ***, p < 0,001. (E) WordClouds summarize the functional profiles of
each cuticular anatomical structure. See also Tables S10–S13.
Stylet-Specific Proteins
Forty stylet-specific proteins
have been identified in our MS data sets (Figure A, Table S8),
most of which were not previously annotated. Although not possible
to ascertain, at least 13 of these stylet-specific proteins (32.5%)
likely came from the saliva remaining in the salivary canal after
dissection of stylet bundles. Consistent with this hypothesis, 12
out of these 13 proteins have already been reported in the pea aphid
sialome (Table S8), and one is a predicted
secreted peroxidase, an enzyme found in aphid saliva.[45−49] Two cuticle-related proteins were only present in the stylets data
sets, one trehalase (ACYPI007462) reported to play a role in several
processes, including chitin biosynthesis in other insects, and one
RR-2 protein (ACYPI006670).[50] Whereas the
saliva-related proteins and trehalase were identified with unique
peptides not detected in the three other cuticular structures, the
RR-2 protein was solely identified from two shared peptides (GSYSLLEADGSTR
and TVEYTADDYNGFNAVVK), also identified in antennae, legs, and wings.
Because these peptides have been assigned to ACYPI006670 in the stylets,
but to other RR-2 proteins in antennae, legs, and wings by the data
search program (Tables S14–S17),
we cannot strictly conclude that this protein (ACYPI006670) is stylet-specific.
Distribution of the Identified CPs in the Different Investigated
Cuticular Anatomical Structures
Minimal lists of CPs recovered
by MS analyses comprise 39 CPs in antennae, 38 CPs in legs, 41 CPs
in wings, and 15 CPs in stylets (Tables and 2, Tables S14–S17). No peptides from the
CPF and CPCFC families were identified in any cuticular structure.
The representation of distinct CPs families was similar in antennae,
legs, and wings (Figure A, Table ), with
proteins of the CPR (RR-1 and RR-2), CPAP1, CPAP3, and TWDL families.
In stylets, only two protein families were represented, CPR (RR-1
and RR-2) and CPAP3.
Table 2
Exhaustive List of
Cuticular Proteins
Identified in the Four Cuticular Structuresa
U
= unique peptide found in protein.
S = peptide shared among different proteins. The 15 CPs identified
by mass spectrometry in aphid stylets are indicated in red (see also Table S17). RR-1 proteins are highlighted in
green, RR-2 in grey, CPAP1 in yellow, CPAP3 in orange, and Tweedle
in blue. ID, identifier.
Figure 4
Comparative
CP profiling in A. pisum cuticular
structures. (A) Distribution of the different CPs from minimal lists
in antennae, legs, wings, and stylets. (B) Venn diagram indicates
the number of all cuticular proteins potentially present in antennae,
legs, wings, and stylets.
Table 3
Classification of Cuticular Proteins
of the Different Structuresa
CPAP1
CPAP3
CPR_RR1b
CPR_RR-2b
TWDLb
Totalb
Antennae
1
6
(8)-10
(22)-59
2
(39)-78
Legs
2
5
(6)-7
(22)-61
(1)-2
(36)-77
Wings
2
6
(8)-9
(24)-58
(1)-2
(41)-77
Stylets
0
3
(4)-5
(8)-30
0
(15)-38
Whole genome
11
8
15
110
3
147
Number of proteins
with peptides
identified by MS analyses assigned in the different CP families for
each cuticular structure. Total number of CPs identified in the structures
is indicated.
The smallest
numbers of CPs that
can explain all observed peptides recovered from MS analysis (minimal
list) are indicated in parentheses.
U
= unique peptide found in protein.
S = peptide shared among different proteins. The 15 CPs identified
by mass spectrometry in aphid stylets are indicated in red (see also Table S17). RR-1 proteins are highlighted in
green, RR-2 in grey, CPAP1 in yellow, CPAP3 in orange, and Tweedle
in blue. ID, identifier.Comparative
CP profiling in A. pisum cuticular
structures. (A) Distribution of the different CPs from minimal lists
in antennae, legs, wings, and stylets. (B) Venn diagram indicates
the number of all cuticular proteins potentially present in antennae,
legs, wings, and stylets.Number of proteins
with peptides
identified by MS analyses assigned in the different CP families for
each cuticular structure. Total number of CPs identified in the structures
is indicated.The smallest
numbers of CPs that
can explain all observed peptides recovered from MS analysis (minimal
list) are indicated in parentheses.RR-2 proteins were the most abundant in all structures
and were
often identified with shared peptides (Table ). RR-1 and TWDL proteins were identified
either with shared or with unique peptides. Finally, CPAP1 and CPAP3
were only identified with unique peptides (Table ). The number of CPs unequivocally identified
with unique peptides was 26, 26, 30, and 9 in antennae, legs, wings,
stylets, respectively. Comparative analysis of CPs potentially
present in cuticular structures indicated that only a few
proteins were restricted to a single structure: one CPAP3 in antennae
(ACYPI001579), one CPAP1 (ACYPI004632), and two RR-2 in legs (ACYPI006712
and ACYPI003698), one CPAP1 (ACYPI45536), one CPAP3 (ACYPI000583),
one RR-1 (ACYPI001775), three RR-2 in wings (ACYPI000670, ACYPI001681,
ACYPI006015), and none in the stylets (Figure B, Table , Table S18).
Localization
of RR-1 Proteins in the Stylets
To confirm
the presence of the 38 CPs identified by MS in aphid stylets and to
give information on their localization and accessibility, A. pisum stylets were dissected and immunolabeled with antibodies
targeting peptides from these proteins (Table ). All CPs located in the acrostyle with
peptides directly accessible at its surface will be renamed stylins.
Table 4
Correspondence between Peptides/Proteins
Identified in A. pisum Stylets by MS Analysis and
Peptides Used for Antibody Productiona
AA, amino acid;
CP, Cuticular protein;
ID, identifier; n.d., not done; N/A, not applicable; (+): labeling
detected; (−): no labeling observed under our experimental
conditions; Ac, epitope exposed at the surface and directly accessible;
PAc, epitope poorly accessible (labeling visible as dots without Chitinase
treatment); Em, epitope embedded, not accessible at the surface of
the stylets. CP identified with unique peptides are in red; peptides
not identified by LC-MS/MS analysis for which antibodies were already
available in our laboratory are in italics. Differences in the AA
sequence are indicated in blue. Antibody ID is followed by a reference
number when the corresponding antibody was described in a previous
study.
AA, amino acid;
CP, Cuticular protein;
ID, identifier; n.d., not done; N/A, not applicable; (+): labeling
detected; (−): no labeling observed under our experimental
conditions; Ac, epitope exposed at the surface and directly accessible;
PAc, epitope poorly accessible (labeling visible as dots without Chitinase
treatment); Em, epitope embedded, not accessible at the surface of
the stylets. CP identified with unique peptides are in red; peptides
not identified by LC-MS/MS analysis for which antibodies were already
available in our laboratory are in italics. Differences in the AA
sequence are indicated in blue. Antibody ID is followed by a reference
number when the corresponding antibody was described in a previous
study.Four to five RR-1
proteins were identified in our MS analyses (Table S17) including Stylin-01 (ACYPI009006)
and Stylin-02 (ACYPI003649). These two stylins have previously been
detected in A. pisum stylets using specific antibodies.[26]A third RR-1 protein, ACYPI001610, was
identified with a unique
peptide of 13 AA covering 10.8% of the mature protein (Table , Table S17). This peptide could not be detected with an anti-1-03
specific antibody in our experimental conditions despite extensive
Chitinase digestion treatment (Figure A). Three additional antibodies targeting distinct
peptides of ACYPI001610, anti-1-04, anti-1-15, and anti-1-17 were
then used to detect this protein in aphid stylets (Table ). Anti-1-04 and anti-1-17 antibodies
revealed their corresponding peptides at the tip of aphid maxillary
stylets in the common canal and on the lateral edge of the stylet.
Labeling was only observed after preincubation of stylets with Chitinase,
indicating that these two peptides were embedded in the chitin matrix
(Figure A). A stronger
labeling was observed when using anti-1-15 antibody, which labeled
evenly the acrostyle without the need for Chitinase treatment. Noticeably,
the targeted peptide VEGGYSYTAPDGTPI is part of the RR-1 chitin-binding
domain, and its labeling decreased upon Chitinase digestion (Figure A). The protein ACYPI001610
is thus accessible at the surface of the acrostyle and was renamed
Stylin-03.
Figure 5
Detection of cuticular proteins in A. pisum stylets
by immunolabeling. Immunolabeling of A. pisum maxillary
stylets with antibodies targeting peptides of cuticular proteins identified
by the LC-MS/MS analyses. Representative images of labeling observed
for antibodies targeting peptides from CPR-RR-1 proteins (A), CPR-RR2
proteins (B), and CPAP3 proteins (C) are shown for untreated stylets
and stylets treated with Chitinase prior to immunolabeling. Incubation
times with Chitinase ranging from 15 to 60 min are indicated on the
top right of images. Scale bars of 5 μm are included.
Detection of cuticular proteins in A. pisum stylets
by immunolabeling. Immunolabeling of A. pisum maxillary
stylets with antibodies targeting peptides of cuticular proteins identified
by the LC-MS/MS analyses. Representative images of labeling observed
for antibodies targeting peptides from CPR-RR-1 proteins (A), CPR-RR2
proteins (B), and CPAP3 proteins (C) are shown for untreated stylets
and stylets treated with Chitinase prior to immunolabeling. Incubation
times with Chitinase ranging from 15 to 60 min are indicated on the
top right of images. Scale bars of 5 μm are included.The fourth RR-1 protein, ACYPI002877, was identified
by MS analyses
with two shared peptides also found in ACYPI000308 (Table , Table S17). These two proteins are 96.5% identical and cannot be
distinguished by immunolabeling. Anti-1-14 antibody, which targets
the peptide GSYTFGYQSADGTQR, labeled the acrostyle and the lateral
edge tip of maxillary stylets of the stylet tip after a Chitinase
treatment. The other identified peptide detected in our MS data sets
(Table , Table S17) and present within the N-terminus
of the two proteins was directly accessible at the surface of the
upper part of the acrostyle in close vicinity of the food canal and
on a lateral edge by anti-1-16 antibody (Figure A). At least one of these two RR-1 proteins
is thus present in the acrostyle, and they have consequently been
renamed Stylin-04 and Stylin-04bis.
Localization of RR-2 Proteins
in the Stylets
Eight
to 30 RR-2 proteins have been identified in stylets, among which only
three were identified by unique peptides. Due to high sequence similarities,
even unique peptides were sometimes nearly identical to others present
in distinct RR-2 proteins and could not be distinguished by immunological
approaches.Solely two RR-2 proteins were unambiguously detected
in the acrostyle. ACYPI066095, identified with four peptides, was
detected with anti-2-16 specific antibody. The unique peptide targeted
by this antibody was found embedded in the acrostyle, only revealed
after Chitinase treatment (Figure B, Table ). The second RR-2 protein, ACYPI007858, was identified by MS with
a single peptide of 11 AA covering 2% of the mature protein and targeted
by anti-2-13 antibody. This peptide was also embedded in the acrostyle,
as well as in the lateral edge of the stylets (Figure B, Table ). Attempts to detect another unique peptide of this
CP with anti-2-14 antibody failed whatever the conditions used. A
third RR-2 protein, ACYPI001599 was detected by MS with a unique peptide,
but no specific antibody could be produced.Four peptides included
in the chitin-binding domain and common
to several identified RR-2 proteins have earlier been reported as
embedded within the acrostyle—pepL, pepS, Ap2-05, and Ap2-08.[22,25] We here extend the listing of RR-2 protein peptides detectable in
stylets with four additional antibodies. Anti-Ap2-04, anti-Ap2-06,
and anti-Ap2-07 are directed against peptides of the conserved RR-2
chitin-binding domain, whereas anti-2-10 targets one peptide present
in the C-terminus of a few proteins. All four antibodies labeled maxillary
stylets after extended Chitinase treatments. Strong fluorescent labeling
appeared as dots along maxillary stylets when using anti-2-06 and
anti-2-10 antibodies, indicating that RR-2 proteins are widely present
under the surface (Figure B, Table ).ACYPI066095 and ACYI007858 formally detected within the acrostyle
with antibodies targeting unique peptides lack several shared peptides
detected in the organ, among which pepL, pepS, and Ap2-08. Therefore,
another RR-2 protein containing these shared peptides may be present
in the organ. We can conclude that at least three RR-2 proteins are
detected in the acrostyle (Figure B).
Localization of CPAP3 Proteins in the Stylets
Three
CPAP3 have been identified in the stylet data sets. With the set of
antibodies produced and used against these proteins, the labeling
was generally weak and visible as dots at the tip of maxillary stylets
(Figure C, Table ). ACYPI007860 was
only barely revealed in the acrostyle after Chitinase treatment with
a single antibody, anti-3-02, targeting a peptide that was not detected
by MS. No labeling could be visualized when using anti-3–01
antibody targeting one unique MS-identified peptide of this protein.
Its location within aphid stylets could therefore not be definitely
stated. ACYPI006031 and ACYPI007911 were detected in the acrostyle,
each with two distinct antibodies. Two peptides of ACYPI006031, respectively
targeted by anti-3-06 and anti-3-07 were found embedded within the
cuticle. For ACYPI007911, the peptide targeted by anti-3-03 was detected
only after Chitinase treatment, whereas that targeted by anti-3-09
antibody was detected directly at the surface of the acrostyle (Figure C). This protein
was renamed Stylin-05.
Repertoire of Cuticular Proteins Present
in the Acrostyle
Of the 15 CP groups identified in stylet
bundles, nine proteins
were unambiguously detected in the acrostyle. These proteins belong
to two CP families, including seven CPR proteins (4 RR-1 and 3 RR-2)
and two CPAP3 (Figure ). Four peptides have been shown to be directly accessible at the
surface: three of them are found in RR-1 proteins and are either part
of the chitin-binding domain (Stylin-03) or present at the N-terminus
and C-terminus of the proteins (Stylin-04/-04bis and Stylin-01/-02,
respectively), and one belongs to a CPAP3 protein and is part of one
of its chitin-binding domain (Figure ). Noticeably, no RR-2 protein (the most numerous CP
class) was detected at the untreated surface of the acrostyle.
Figure 6
Repertoire
of cuticular proteins in the acrostyle. (A) Summary
of peptides and proteins detected in A. pisum maxillary
stylets at the surface of the acrostyle (1), embedded in the organ
(2), or distributed all along the stylets (3). (B) Schematic representation
of the distribution of the CPs identified in maxillary stylets. Scale
bars of 5 μm are included.
Figure 7
Domain
organization of stylins. Schematic representation of stylin
domains with predicted signal peptide, and RR-1 or type 2 chitin-binding
domains. Acrostyle-surface exposed domains are indicated in red.
Repertoire
of cuticular proteins in the acrostyle. (A) Summary
of peptides and proteins detected in A. pisum maxillary
stylets at the surface of the acrostyle (1), embedded in the organ
(2), or distributed all along the stylets (3). (B) Schematic representation
of the distribution of the CPs identified in maxillary stylets. Scale
bars of 5 μm are included.Domain
organization of stylins. Schematic representation of stylin
domains with predicted signal peptide, and RR-1 or type 2 chitin-binding
domains. Acrostyle-surface exposed domains are indicated in red.
Spatial and Temporal Expression of Stylins
in A. pisum
None of the five proteins detected
at the surface of the
acrostyle was found to be stylet-specific. They have been identified
in all four cuticular structures characterized in this study, except
for Stylin-04/-04bis which was/were absent from the wing’s
proteome (Table ).
To characterize possible difference in stylin genes
expression in different tissues, we compared their transcript levels
in antennaless-heads containing the stylet-synthesizing glands, and
in antennae, legs, and wings by real-time RT-qPCR analyses using stylin-specific
primers for Stylin-01, -02, -03, and -05 (Table S3). For Stylin-04/-04bis, we could not design primers able
to specifically amplify a single transcript. The observed patterns
are thus difficult to interpret and provided for information purposes
only.Consistent with the proteomic data, stylin-01, -02, -03, and -05 were expressed not only in the head but also in other body parts.
They exhibited different expression patterns, with stylin-01 and stylin-02 transcripts being more expressed
in the head, whereas stylin-03 was less expressed
and stylin-05 highly variably but equally expressed
in all four tissues (Figure A).
Figure 8
Stylin relative expression patterns in different A. pisum body parts and in different nymphal instars. (A) Stylin genes expression
quantified by qRT-PCR in head, antenna, leg, and wing relative to
stylins expression in Heads. (B) Stylin genes expression quantified
by qRT-PCR in first (1st), second (2nd), third (3rd), and fourth (4th)
nymphal stages and in adults (Ad) relative to stylins expression in
first instars.actin and EF1α genes were used for data normalization. Results are reported as
means ± SD for 3 independent biological replicates. Lowercase
letters indicate significant differences between samples (TukeyHSD, p < 0.005).
Stylin relative expression patterns in different A. pisum body parts and in different nymphal instars. (A) Stylin genes expression
quantified by qRT-PCR in head, antenna, leg, and wing relative to
stylins expression in Heads. (B) Stylin genes expression quantified
by qRT-PCR in first (1st), second (2nd), third (3rd), and fourth (4th)
nymphal stages and in adults (Ad) relative to stylins expression in
first instars.actin and EF1α genes were used for data normalization. Results are reported as
means ± SD for 3 independent biological replicates. Lowercase
letters indicate significant differences between samples (TukeyHSD, p < 0.005).Stylin expression profiles
were also analyzed during A.
pisum development at five different time-points: during the
four larval stages and in adults. rr-1 stylin genes
showed similar expression patterns with an increase in successive
larval stages peaking at the fourth instar (8 to 20 times higher than
the expression level in the first instar) and then decreasing in adults.
Noticeably, expression levels were highly variable in the fourth instar,
the developmental stage of longest duration in A. pisum.[51] Expression levels of stylin-05 gene were comparable in the four larval stages and were significantly
lower in adults (Figure B).
Stylins are Conserved among Aphid Species
We constructed
two phylogenetic trees for RR-1 and CPAP3 protein sequences identified
in the six genomes of aphids and in the phylloxera Daktulosphaira
vitifoliae, all available in AphidBase (Figure A,B). The phylloxera belongs
to Phylloxeroidea, a superfamily considered to be the nearest sister
taxon of the Aphidoidea, which probably diverged 250 My ago.[52] The homologues of each Stylin grouped in a separated
clade, except for homologues of Stylin-01 and Stylin-02 which were
already shown to group in a single clade.[26] Noticeably, only one protein of D. vitifoliae grouped
with Stylin-01 and Stylin-02 homologues, probably reflecting gene
duplication in aphids after their divergence with phylloxerids. Regarding
Stylin-04 and Stylin-04bis found in A. pisum genome,
phylogenetic analyses showed that only one of these two proteins was
present in other aphid species and in D. vitifoliae. A gene duplication of stylin-04 gene likely occurred
only in A. pisum, as previously described in other
conserved gene families in this species.[36]
Figure 9
Stylins
and acrostyle surface peptides are conserved in aphid vectors.
Phylogenetic relationships of (A) CPR_RR-1 proteins with trimmed signal
peptides and (B) full CPAP3 proteins of Acyrthosiphon pisum (Ap), Aphis glycines (Ag), Daktulosphaira
vitifoliae (Dv), Diuraphis noxia (Dn), Myzus cerasi (Mce), Myzus persicae (Mp), Rhopalosiphum padi (Rp). AphidBase accession numbers for
each gene are indicated on the right of the species abbreviation.
It is noteworthy that to retrieve M. persicae sequences
from AphidBase, numbers should be preceded by MYZPE13164_G006_v1.0_.
Branch support values are indicated at the node, and the scale bar
represents probabilities of change from one amino acid to another
in terms of a unit, which is an expected 1% change between two amino
acid sequences. Shaded portions represent stylin clusters and are
numbered from 01 to 05 according to corresponding stylin names. (C)
Conservation of exposed domains of the five stylins identified in M. persicae (Mp), A. pisum (Ap), M. cerasi (Mce), R. padi (Rp), A. glycines (Ag), D. noxia (Dn), and Daktulosphaira vitifolia (Dv). Alignments were performed
using T-Coffee software. Shading was done with BOXSHADE 3.21 software.
Identical residues are shaded in black; similar residues are shaded
in gray. The consensus sequence is shown at the bottom, with periods
indicating conserved substitutions and asterisks indicating identities.
Stylins
and acrostyle surface peptides are conserved in aphid vectors.
Phylogenetic relationships of (A) CPR_RR-1 proteins with trimmed signal
peptides and (B) full CPAP3 proteins of Acyrthosiphon pisum (Ap), Aphis glycines (Ag), Daktulosphaira
vitifoliae (Dv), Diuraphis noxia (Dn), Myzus cerasi (Mce), Myzus persicae (Mp), Rhopalosiphum padi (Rp). AphidBase accession numbers for
each gene are indicated on the right of the species abbreviation.
It is noteworthy that to retrieve M. persicae sequences
from AphidBase, numbers should be preceded by MYZPE13164_G006_v1.0_.
Branch support values are indicated at the node, and the scale bar
represents probabilities of change from one amino acid to another
in terms of a unit, which is an expected 1% change between two amino
acid sequences. Shaded portions represent stylin clusters and are
numbered from 01 to 05 according to corresponding stylin names. (C)
Conservation of exposed domains of the five stylins identified in M. persicae (Mp), A. pisum (Ap), M. cerasi (Mce), R. padi (Rp), A. glycines (Ag), D. noxia (Dn), and Daktulosphaira vitifolia (Dv). Alignments were performed
using T-Coffee software. Shading was done with BOXSHADE 3.21 software.
Identical residues are shaded in black; similar residues are shaded
in gray. The consensus sequence is shown at the bottom, with periods
indicating conserved substitutions and asterisks indicating identities.Stylins are highly conserved among aphid species.
Remarkably, the
domains exposed at the surface of the acrostyle are nearly identical
in the six aphid species (Figure C). The most divergent sequences were D. vitifoliae homologues of Stylin-04/-04 bis and Stylin-05. In addition, as previously
shown in A. pisum, and so suggesting a high conservation
of their function, all five stylins were detected at the tip of the
maxillary stylets of M. persicae, the most important
vector of plant viruses (Figure A).
Discussion
The aphid stylets are
composed of a biomaterial with unique surface
properties ensuring binding, retention, and release of plant viruses
during their journey from one host to another. This transport of viruses
is driven at least by cuticular proteins emerging at the surface of
the cuticle, in direct contact with endogenous and exogenous compounds
flowing in and out maxillary stylets. Although not characterized when
we started our study, we could speculate that various processes might
specifically functionalize the cuticular surface at the tip of maxillary
stylets in the region described as the acrostyle. These speculated
processes could be, for example, the local protein composition of
the cuticle or the degree of protein sclerotization. We thus initially
presumed that some cuticular proteins would be specific to the stylets,
even to the tip of the maxillary stylets, and absent from other anatomical
structures. Our comparative proteomics analysis was designed to identify
these stylet-specific CPs if there were any, because they represent
prime candidate receptors of plant viruses.
Common Set of CPs to Form
Aphid Cuticles
We have here
experimentally generated the first proteome of four cuticular structures
(antennae, legs, wings, and stylets) of the pea aphid. These four
proteomes contain multiple CPs of which a shared subset likely represents
the building blocks of all aphid cuticles, as suggested for the common
subset of CP genes identified in a comparative analysis of seven anatomical
structures of the mosquito Anopheles gambiae.[33,53] The pea aphid proteome profiles were globally similar in antennae,
legs, and wings, with higher complexity/depth than the stylets proteome.
However, a few CPs were specifically found in antennae, legs, or wings,
and whether or how they contribute to specific cuticular properties/functions
in these respective anatomic structures has not been investigated.
Fewer CPs Identified in Stylets
The repertoire of CPs
identified in aphid cuticles was half in stylets compared to in other
structures, reflecting either the absence of some CPs, their significantly
reduced accumulation, or extractability. It is notable that the cuticle
of antennae, legs, and wings originates from epithelial cells, while
that of the stylets is synthesized by the highly specialized retort
glands entirely dedicated to cuticle production.[13,14,54] This sole ontology difference may lead to
cuticles with different degrees of protein complexity. However, a
recent characterization of transcripts in the retort glands revealed
a more complex set of expressed CP genes than in other previously
characterized cuticular transcriptomes.[28] Our MS data show that over 50% of the stylet CPs were identified
by a single peptide, sometimes covering less than 2% of the mature
protein, suggesting that CPs are hardly extractable from this organ
and perhaps explaining the low complexity of the corresponding proteome.
CPs Are Not Evenly Distributed in Aphid Stylets
Our
labeling experiments showed a variable distribution of CPs within
maxillary stylets. Several RR-2 peptides were displayed under the
superficial layers of the cuticle all along maxillary stylets. Some
of them were also detected under the surface of the acrostyle. With
the antibodies used here and in previous studies, we could not evidence
their direct accessibility at the surface of the cuticle.[22,25] However, we cannot strictly rule out such a possibility. Actually,
repeat-rich sequences containing alanine, proline, tyrosine, and serine
residues commonly found in aphid RR-2 proteins and present in several
MS-detected peptides are poorly immunogenic and could not be considered
for peptide synthesis and antibody production.[25] Therefore, their accessibility in stylet cuticle could
not be assessed. By contrast, all RR-1 and CPAP3 have been exclusively
detected at the distal extremity of maxillary stylets (Figure ), with the strongest labeling
being observed for RR-1, that likely decorates the entire surface
of the acrostyle. Despite their apparent specific localization, we
must consider that RR-1 and CPAP3peptides may also be evenly distributed
all over the stylets because these protein families do not seem to
be overrepresented in the core stylet material when compared to other
aphid cuticular anatomical structures (Figure , Table ), and so are likely generally present in any cuticle.
In this hypothesis, what would appear specific would be their accessibility
at the surface of the acrostyle, and this could reveal a general property
of the cuticle that would be locally functionalized by surface modifications
allowing access to normally embedded CPs.Intriguingly, no labeling
was ever observed on mandibular stylets in our experimental conditions.
This striking observation suggests that proteins in this structure
may be tightly cross-linked and no longer detectable with our antibodies
or, as hypothesized above, that these stylets have surface properties
that do not give access to the embedded cuticular proteins. Both hypotheses
assume a comparable protein constitution for maxillary and mandibular
stylets, consistent with their undistinguishable transcript profiles
reported earlier.[28]
Receptor Candidates
As expected for structural components
of arthropod cuticle, stylins are highly conserved in all aphid species
where sequence data are available. Receptors of plant viruses may
also be conserved in vectors because the same viral species can sometimes
be transmitted by dozens of different aphid species.[55,56] Our results indicate that the protein domains accessible at the
surface of the acrostyle share high degrees of identity with their
homologues in other aphid species (Figure C). Interestingly, the most divergent protein
sequences were those of D. vitifoliae, the grapevine
pest species, for which transmission of plant viruses or of other
pathogens has not been reported so far.[57,58]RR-1
proteins are largely represented at the surface of the acrostyle.
Proteins from RR-1 subgroup are generally found in soft cuticle, but
a few of them have also been described in hard cuticle.[24,59−61] Their role in virus transmission has recently emerged
in the literature, and they are associated with both circulative and
noncirculative virus transmission.[26,62,63] CPR1, a RR-1 protein of the small brown planthopperLaodelphax striatellus, was first shown to interact with
pc3, a viral nucleocapsid protein of rice stripe virus (RSV) (62).
RSV is a tenuivirus transmitted in a circulative propagative manner.[64] By contrast with noncirculative viruses, these
viruses are not retained on their vector mouthparts. They are internalized
in insect body, transit through the hemolymph to different internal
tissues, and reach the salivary glands from which they can be inoculated
together with egestion of saliva into a new host plant.[4] Knockdown of CPR1 transcripts resulted in a decrease
in RSV transmission. The authors proposed that CPR1 could assist viral
movement within the insect body, a totally unexpected function for
cuticular protein.[62] For noncirculative
viruses, in vitro competition experiments and transmission
phenotypes associated with stylin genes knockdown
demonstrated the role of Stylin-01 in CaMV transmission by its aphid
vector. Decreasing Stylin-01 transcripts in M. persicae resulted in a reduced CaMV transmission capacity.[26] In an independent study, Stylin-01, named previously Mpcp4
in M. persicae, was shown to interact in yeast with
the coat protein of Cucumber mosaic virus (CMV),
another aphid-transmitted noncirculative virus.[65] However, its role in CMV transmission still lacks direct
evidence.[66] In addition to Stylin-01, Stylin-02,
-03, and -04/-04bis, distributed over the surface of the acrostyle,
now stand as prime candidate receptors of plant viruses.CPAP3
are Cuticular Proteins Analogous to Peritrophins, previously
known as “gasp” or “obstructor” family.[67,68] They have been described in all insect,
and possess three type 2 chitin-binding domains (ChtBD2).[69] They have been detected in different tissues.
However, they were missing from the list of CPs identified in the
proboscis of An. gambiae.[53] They have never been reported associated with virus transmission
so far and are described here for the first time in aphid stylets.
They are involved in cuticle formation and in structural integrity
of cuticles.[69−71] Antibodies produced in our study target less than
7% of their amino acid sequences and further effort will be required
to determine if some CPAP3 domains could be better exposed at the
surface of aphid stylets. Determining if they play a role in virus
transmission would also warrant further investigation. As they belong
to a distinct structural class of chitin-binding proteins relative
to CPR (harboring cysteine-bridged Chitin-Binding Domain vs cysteine-free
Chitin-Binding Domains for CPR), their functional properties might
be distinct from the canonical CPR proteins identified so far as stylins and as active players in plant virus transmission.The RR-2 proteins represent the great majority of CPs identified
in A. pisum stylets. These proteins are generally
found in hard and rigid cuticles, and some of them have been shown
to be essential for cuticle integrity, integument structure, insect
development and could be involved in cuticle formation.[24,72−77] RR-2 distribution within aphid stylets suggests that they likely
constitute the main building blocks of this type of cuticle. Evidence
of their accessibility at the surface of aphid stylets is still lacking.[22,25] So, although not prime candidates, the role of RR-2 proteins in
virus transmission cannot be discarded. They have been cited in several
studies as interacting molecules of both noncirculative and circulative
viruses.[78] For noncirculative viruses,
three RR-2 of M. persicae were shown to interact in vitro with HC-Pro, the ligand protein of a potyvirus Zucchini yellow mosaic virus.[65] One of these proteins is homologous to ACYPI006670 that contains
four peptides detected in maxillary stylets (Table ). Another one has been characterized recently
which reduction of transcripts correlated with a decrease in Potyvirus Y transmission.[79] However,
its putative ortholog in A. pisum was not in our
MS data set, and its presence within M. persicae stylets,
embedded in the chitin or at the surface of the cuticle, remains to
be confirmed.
Conclusions
We provide through this
study the first comparative proteomics
analysis of four aphid cuticular anatomical structures, namely antennae,
legs, wings, and stylets. Our data gives preliminary evidence that
a great number of CP proteins are common to antennae, legs, and wings,
while a few CP proteins seem specific to each appendage. The stylet,
which was of main interest to better understand the vector/virus interaction
was found to be distinct in composition compared to the other three
studied appendages. We determined the repertoire of CPs of aphid stylets
and precisely mapped their accessibility in maxillae at the surface
of the acrostyle. Further characterization for this short list of
proteins showed that they are highly conserved in aphid species and
thus all represent good candidate receptors of plant viruses. These
data contribute to a better characterization of aphid mouthparts,
a crucial insect feeding appendage, but also point out at the surface
specificities of the cuticle and at the distribution of cuticular
protein accessibility, which may be relevant for local functionalization
of this tissue. Beyond feeding appendage and virus receptor candidates,
our proteomic data sets may contribute to future investigations of
other important physiological functions in aphids such as chemoreception
and sensory system.
Authors: Warren Arinaitwe; Alex Guyon; Trisna D Tungadi; Nik J Cunniffe; Sun-Ju Rhee; Amjad Khalaf; Netsai M Mhlanga; Adrienne E Pate; Alex M Murphy; John P Carr Journal: Viruses Date: 2022-08-01 Impact factor: 5.818