Literature DB >> 28327966

De novo transcriptome assemblies of four xylem sap-feeding insects.

Erica E Tassone1, Charles C Cowden2, S J Castle2.   

Abstract

Background: Spittle bugs and sharpshooters are well-known xylem sap-feeding insects and vectors of the phytopathogenic bacterium Xylella fastidiosa (Wells), a causal agent of Pierce's disease of grapevines and other crop diseases. Specialized feeding on nutrient-deficient xylem sap is relatively rare among insect herbivores, and only limited genomic and transcriptomic information has been generated for xylem-sap feeders. To develop a more comprehensive understanding of biochemical adaptations and symbiotic relationships that support survival on a nutritionally austere dietary source, transcriptome assemblies for three sharpshooter species and one spittlebug species were produced. Findings: Trinity-based de novo transcriptome assemblies were generated for all four xylem-sap feeders using raw sequencing data originating from whole-insect preps. Total transcripts for each species ranged from 91 384 for Cuerna arida to 106 998 for Homalodisca liturata with transcript totals for Graphocephala atropunctata and the spittlebug Clastoptera arizonana falling in between. The percentage of transcripts comprising complete open reading frames ranged from 60% for H. liturata to 82% for C. arizonana. Bench-marking universal single-copy orthologs analyses for each dataset indicated quality assemblies and a high degree of completeness for all four species. Conclusions: These four transcriptomes represent a significant expansion of data for insect herbivores that feed exclusively on xylem sap, a nutritionally deficient dietary source relative to other plant tissues and fluids. Comparison of transcriptome data with insect herbivores that utilize other dietary sources may illuminate fundamental differences in the biochemistry of dietary specialization. Published by Oxford University Press on behalf of GIGSCI 2017. This work is written by (a) US Government employee(s) and is in the public domain in the US.

Entities:  

Keywords:  Diet specialization; Insect herbivory; Insect vector; RNA-seq; Transcriptome; Trinity

Mesh:

Year:  2017        PMID: 28327966      PMCID: PMC5467018          DOI: 10.1093/gigascience/giw007

Source DB:  PubMed          Journal:  Gigascience        ISSN: 2047-217X            Impact factor:   6.524


Data description

Background

Resource partitioning among herbivorous insects spans a continuum between specialists that feed on one or a few plant species to generalists that are able to utilize hundreds of species belonging to multiple plant families. A further element of plant partitioning involves the particular location on a plant or tissue type from which an insect feeds [1]. The diversity of plant feeding strategies has evolved along with specialized anatomical features such as mouthparts and digestive systems, unique enzyme complements for processing plant compounds, and partnerships with symbiotic microbiota that contribute to nutritional gain of the host insect. The transcriptome assemblies presented here include four species that feed exclusively on sap from xylem vessels, a relatively rare form of plant feeding from a source that among plant tissues is the most deficient in nitrogen and carbon content [2]. Three of the transcriptomes represent sharpshooter species that are members of the subfamily Cicadellinae (Cicadellidae) that belong to the superfamily Membracoidea (leafhoppers, sharpshooters, treehoppers). The fourth transcriptome represents a spittlebug (Clastopteridae) that belongs to the superfamily Cercopoidea (spittlebugs, froghoppers). All are members of the hemipteran suborder Auchenorrhyncha [3]. Their piercing-sucking mouthparts tap into xylem vessels from which sap is consumed in copious quantities to compensate for its low nutritional value. Sharpshooters are recognized for their efficient assimilation of limited nutrients in xylem sap [4], but putative biochemical mechanisms that enable specialization on xylem sap are unknown. Also unclear is whether the respective roles in host nutrition played by the dual primary endosymbionts are consistent among xylem feeders [5]. Comparison of transcriptomes of four xylem-feeding insects will provide additional knowledge and insight into the survival of ecological specialists on a nutritionally impoverished dietary source.

Samples

The spittlebug Clastoptera arizonana Doering was collected in 2014 from a wild population infesting grapevines in Yavapai County, AZ and established as a glasshouse colony for 8 months prior to sample collection in Maricopa, AZ. Samples of the sharpshooter Cuerna arida Oman and Beamer (tribe Proconiini) were collected in 2015 by sweep net from a wild population in mixed vegetation in Cochise County, AZ. The smoke-tree sharpshooter Homalodisca liturata Ball (Proconiini) was collected in 2015 from Euphorbia tirucalli L. plants in Phoenix, AZ. The blue-green sharpshooter Graphocephala atropunctata (Signoret) (Cicadellini) was collected in 2013 from a wild population in Orange County, CA and maintained as a glasshouse colony on basil (Osimium basilicum L.) until samples were collected in 2015. Live adults of unknown age from all four species were homogenized separately in RNAlater (Ambion/Life Technologies, Carlsbad, CA) and stored at −20°C. Total RNA extractions were performed using a Qiagen RNeasy mini kit followed by quality assessment on an Agilent 2100 bioanalyzer. Library generation yielding 2 × 100 bp paired-end reads (TruSeq RNA Sample Preparation Kit v2; Illumina Inc., San Diego, CA) and sequencing (Illumina HiSeq2000 or HiSeq2500) were performed at the University of Arizona Genomics Center in Tucson, AZ (http://uagc.arl.arizona.edu).

Data filtering

The total number of reads, data quantity, and short read archive numbers for each of the four xylem-feeding insects are shown in Table 1. For each data set, raw quality was assessed and filtered using both FastQC and Trimmomatic (v 0.32) using the parameters ILLUMINACLIP:TruSeq3-PE.fa:2:30:10 LEADING:10 TRAILING:20 SLIDINGWINDOW:4:25 MINLEN:36 to remove adaptor sequence and filter by quality score.
Table 1.

Accession numbers for sequence reads and assembled transcripts for four species of xylem-feeding insects

SampleReadsSize (Gb)Short Read ArchiveBioSampleBioProject
Homalodisca liturata 18 936 52018.9SRX1451710SAMN04293489PRJNA303151
SRX1451711SAMN04293490
SRX1451712SAMN04293491
Clastoptera arizonana 19 038 99817.8SRX1451715SAMN04293493PRJNA303152
SRX1451717SAMN04293494
SRX1451718SAMN04293495
Cuerna arida 14 667 04018.3SRX1451216SAMN04292971PRJNA303150
SRX1451218SAMN04292972
SRX1451467SAMN04292973
Graphocephala atropunctata 16 868 1348.2SRX1411425SAMN04208332PRJNA299492
SRX1411426SAMN04208333
SRX1411427SAMN04208334
Accession numbers for sequence reads and assembled transcripts for four species of xylem-feeding insects

Transcriptome assembly

All raw data for each insect transcriptome were run through the following pipeline. Prior to assembly, the three replicate samples were concatenated and read abundance was normalized to 50× coverage using the in silico normalization tool in Trinity v. 2.0.6 [6] to improve assembly time. Each of the datasets was assembled in Trinity using the default parameters, with the addition of the ‘–jaccard clip’ flag to reduce the generation of transcript fusions from non-strand specific data. Open reading frames were predicted using Transdecoder with all run parameters set to default [6]. The transcriptomes were filtered, sorted, and prepared for NCBI transcriptome shotgun assembly (TSA) submission as previously described [7]. To comply with NCBI TSA submission, all transcripts resulting from endosymbionts or bacteria were removed from the final assembly prior to submission.

Annotation

Functional annotation for each of the transcriptomes was performed at the peptide level using a custom pipeline [7] that defines protein products and assigns transcript names. Predicted proteins and peptides were analyzed using InterProScan 5 [8], using the ‘–iprlookup’ and ‘–goterms’ flags, to search all available databases, including Gene Ontology. Each transcriptome was annotated using BLASTp against the UniProt Swiss Prot database (downloaded 11 February 2015). Annie [9], a program that cross-references SwissProt BLAST and InterProScan5 results to extract qualified gene names and products, was used to generate the transcript annotation file. The resulting .gff3 and .tbl files were further annotated with functional descriptors in Transvestigator [10].

Transcriptome Quality and Comparisons

Assembled transcriptome metrics showed a high percentage of reads mapping back to each transcriptome (Table 2) indicating successful assemblies. TransRate [11] scores ranging from 0.16 to 0.42 were used for quality assessment, and bench-marking universal single-copy orthologs (BUSCO) v. 1.1.b1 results using the arthropod gene set (downloaded December 19, 2015) [12] indicated that the four transcriptomes have a moderate to high level of completeness. It should be noted that both the TransRate value (0.16) and BUSCO results for H. liturata suggest this transcriptome may contain more partial transcripts than the other three assemblies.
Table 2.

Transcriptome assembly statistics and results of BUSCO analysis for four xylem-feeding insects

H. liturata C. arizonana C. arida G. atropunctata
Assembly
Normalized reads9 468 2609 519 49910 714 37532 429 458
Total no. transcripts106 99893 84591 38497 830
Average transcript length and range954 (224–30 062)1232 (224–29 936)901 (224–20 095)962 (224–17 082)
Total assembled bases (all)102 317 189115 686 86879 785 47194 141 447
N50 (all)1650251015601692
% GC37313739
% mapping84918895
TransRate Score0.160.280.250.42
BUSCO
Complete (%)60826866
Duplicated (%)23422624
Fragmented (%)239.21719
Missing (%)1581413
Transcriptome assembly statistics and results of BUSCO analysis for four xylem-feeding insects Each of the assembled transcriptomes was used in a reciprocal tBLASTx search to identify similarities between the four species and their transcriptome assemblies. The final, filtered transcriptomes were made into nucleotide BLAST databases using NCBI Blast+ (v 2.2.30) makeblastdb tool, and all tBLASTx searches were performed using an e-value cutoff of 1e−3. The tBLASTx results (Table 3) indicate similarities between the four xylem-feeder transcriptomes, with the lowest (38%) occurring between members of the two superfamilies (spittlebug and all sharpshooters) and the highest (84%) between H. liturata and C. arida, members of the same subfamily [13].
Table 3.

Total percent matches from tBLASTx reciprocal searches. Transcriptome used as query on the left, and nucleotide database tBLASTx against which the query was performed is shown at the top

Nucleotide Database (% similarity)
H. liturata C. arizonana C. arida G. atropunctata
Query
Homalodisca liturata 42.9476.3156.20
Clastoptera arizonana 40.9038.3538.58
Cuerna arida 83.9043.4058.36
Graphocephala atropunctata 56.8640.3056.10

e-value ≤ 1E-3

Total percent matches from tBLASTx reciprocal searches. Transcriptome used as query on the left, and nucleotide database tBLASTx against which the query was performed is shown at the top e-value ≤ 1E-3

Availability of supporting data

The filtered and annotated transcriptomes have been deposited in GenBank as a TSA under the accessions and BioProject numbers found in Table 1. Datasets further supporting the results of this article are available in the GigaScience repository, GigaDB [13].

Abbreviations

BUSCO: bench-marking universal single-copy orthologs; TSA: transcriptome shotgun assembly; USDA-ARS: United States Department of Agriculture-Agricultural Research Service

Competing Interests

The authors declare that they have no competing interests.

Authors' Contribution

SJC and CCC conceived and performed the experiments; EET analyzed the data and evaluated the conclusions; EET, SJC, and CCC wrote the manuscript. All authors approved the final manuscript. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file. Click here for additional data file.
  8 in total

1.  Phylogeny of the major lineages of Membracoidea (Insecta: Hemiptera: Cicadomorpha) based on 28S rDNA sequences.

Authors:  C H Dietrich; R A Rakitov; J L Holmes; W C Black
Journal:  Mol Phylogenet Evol       Date:  2001-02       Impact factor: 4.286

2.  Symbiosis and insect diversification: an ancient symbiont of sap-feeding insects from the bacterial phylum Bacteroidetes.

Authors:  Nancy A Moran; Phat Tran; Nicole M Gerardo
Journal:  Appl Environ Microbiol       Date:  2005-12       Impact factor: 4.792

3.  BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs.

Authors:  Felipe A Simão; Robert M Waterhouse; Panagiotis Ioannidis; Evgenia V Kriventseva; Evgeny M Zdobnov
Journal:  Bioinformatics       Date:  2015-06-09       Impact factor: 6.937

4.  De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis.

Authors:  Brian J Haas; Alexie Papanicolaou; Moran Yassour; Manfred Grabherr; Philip D Blood; Joshua Bowden; Matthew Brian Couger; David Eccles; Bo Li; Matthias Lieber; Matthew D MacManes; Michael Ott; Joshua Orvis; Nathalie Pochet; Francesco Strozzi; Nathan Weeks; Rick Westerman; Thomas William; Colin N Dewey; Robert Henschel; Richard D LeDuc; Nir Friedman; Aviv Regev
Journal:  Nat Protoc       Date:  2013-07-11       Impact factor: 13.491

5.  Reconstructing a comprehensive transcriptome assembly of a white-pupal translocated strain of the pest fruit fly Bactrocera cucurbitae.

Authors:  Sheina B Sim; Bernarda Calla; Brian Hall; Theodore DeRego; Scott M Geib
Journal:  Gigascience       Date:  2015-03-31       Impact factor: 6.524

6.  InterProScan 5: genome-scale protein function classification.

Authors:  Philip Jones; David Binns; Hsin-Yu Chang; Matthew Fraser; Weizhong Li; Craig McAnulla; Hamish McWilliam; John Maslen; Alex Mitchell; Gift Nuka; Sebastien Pesseat; Antony F Quinn; Amaia Sangrador-Vegas; Maxim Scheremetjew; Siew-Yit Yong; Rodrigo Lopez; Sarah Hunter
Journal:  Bioinformatics       Date:  2014-01-21       Impact factor: 6.937

7.  TransRate: reference-free quality assessment of de novo transcriptome assemblies.

Authors:  Richard Smith-Unna; Chris Boursnell; Rob Patro; Julian M Hibberd; Steven Kelly
Journal:  Genome Res       Date:  2016-06-01       Impact factor: 9.043

8.  De novo transcriptome assemblies of four xylem sap-feeding insects.

Authors:  Erica E Tassone; Charles C Cowden; S J Castle
Journal:  Gigascience       Date:  2017-03-01       Impact factor: 6.524

  8 in total
  6 in total

1.  A deep transcriptomic resource for the copepod crustacean Labidocera madurae: A potential indicator species for assessing near shore ecosystem health.

Authors:  Vittoria Roncalli; Andrew E Christie; Stephanie A Sommer; Matthew C Cieslak; Daniel K Hartline; Petra H Lenz
Journal:  PLoS One       Date:  2017-10-24       Impact factor: 3.240

2.  De novo transcriptome assemblies of four xylem sap-feeding insects.

Authors:  Erica E Tassone; Charles C Cowden; S J Castle
Journal:  Gigascience       Date:  2017-03-01       Impact factor: 6.524

3.  Transcriptome Analysis Reveals Functional Diversity in Salivary Glands of Plant Virus Vector, Graminella nigrifrons.

Authors:  Swapna Priya Rajarapu; Raman Bansal; Priyanka Mittapelly; Andrew Michel
Journal:  Genes (Basel)       Date:  2020-10-29       Impact factor: 4.096

4.  Improved draft reference genome for the Glassy-winged Sharpshooter (Homalodisca vitripennis), a vector for Pierce's disease.

Authors:  Cassandra L Ettinger; Frank J Byrne; Matthew A Collin; Derreck Carter-House; Linda L Walling; Peter W Atkinson; Rick A Redak; Jason E Stajich
Journal:  G3 (Bethesda)       Date:  2021-09-27       Impact factor: 3.154

5.  Characterizing Hox genes in mayflies (Ephemeroptera), with Hexagenia limbata as a new mayfly model.

Authors:  Christopher J Gonzalez; Tobias R Hildebrandt; Brigid O'Donnell
Journal:  Evodevo       Date:  2022-07-27       Impact factor: 3.569

6.  A new lineage of segmented RNA viruses infecting animals.

Authors:  Darren J Obbard; Mang Shi; Katherine E Roberts; Ben Longdon; Alice B Dennis
Journal:  Virus Evol       Date:  2020-01-17
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.