Literature DB >> 15841239

Aphid biology: expressed genes from alate Toxoptera citricida, the brown citrus aphid.

W B Hunter1, P M Dang, M G Bausher, J X Chaparro, W McKendree, R G Shatters, C L McKenzie, X H Sinisterra.   

Abstract

The brown citrus aphid, Toxoptera citricida (Kirkaldy), is considered the primary vector of citrus tristeza virus, a severe pathogen which causes losses to citrus industries worldwide. The alate (winged) form of this aphid can readily fly long distances with the wind, thus spreading citrus tristeza virus in citrus growing regions. To better understand the biology of the brown citrus aphid and the emergence of genes expressed during wing development, we undertook a large-scale 5' end sequencing project of cDNA clones from alate aphids. Similar large-scale expressed sequence tag (EST) sequencing projects from other insects have provided a vehicle for answering biological questions relating to development and physiology. Although there is a growing database in GenBank of ESTs from insects, most are from Drosophila melanogaster and Anopheles gambiae, with relatively few specifically derived from aphids. However, important morphogenetic processes are exclusively associated with piercing-sucking insect development and sap feeding insect metabolism. In this paper, we describe the first public data set of ESTs from the brown citrus aphid, T. citricida. The cDNA library was derived from alate adults due to their significance in spreading viruses (e.g., citrus tristeza virus). Over 5180 cDNA clones were sequenced, resulting in 4263 high-quality ESTs. Contig alignment of these ESTs resulted in 2124 total assembled sequences, including both contiguous sequences and singlets. Approximately 33% of the ESTs currently have no significant match in either the non-redundant protein or nucleic acid databases. Sequences returning matches with an E-value of < or = -10 using BLASTX, BLASTN, or TBLASTX were annotated based on their putative molecular function and biological process using the Gene Ontology classification system. These data will aid research efforts in the identification of important genes within insects, specifically aphids and other sap feeding insects within the Order Hemiptera.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 15841239      PMCID: PMC524662          DOI: 10.1093/jis/3.1.23

Source DB:  PubMed          Journal:  J Insect Sci        ISSN: 1536-2442            Impact factor:   1.857


Introduction

The brown citrus aphid, Toxoptera citricida (Kirkaldy), is one of the most devastating pests of citrus, causing extensive crop losses worldwide. Feeding by this aphid alone can cause severe damage to citrus. However, it poses an even greater threat to citrus because of its efficient transmission of citrus tristeza closterovirus (Fasulo and Halbert, 1993). Since the brown citrus aphid genomic sequence is not available, expressed sequence tags (ESTs) derived from single-pass sequencing of cDNA clones prepared from the brown citrus aphid provide an invaluable resource for the identification of genes associated with the biology of the alate adult life stage. In the past, cloning of genes encoding enzymes of specific biochemical pathways by single-pass sequencing of cDNA clones has been a very successful strategy, particularly when the cDNA libraries have been prepared from tissues with high activity for the respective enzymes (Coyle-Thompson and Banerjee 1993; Newman ; Blaxter ; Cooke ; Rounsley ). This enables investigators to isolate genes derived from specific tissues and/or life stages for more detailed study, which may include developing efficient biocontrol methods. Additionally, ESTs and their accompanying cDNAs, provide the means to construct glass or nylon based arrays that can be used for transcript profiling on a genome-wide scale (DeRisi ; Ruan ; Egger ). A careful bioinformatic analysis identifying life stage-specific ESTs is a prerequisite in order to obtain a comprehensive and representative set of cDNAs for gene expression studies by arrays (Loftus ). Given that there are only a small number of insect ESTs in public databases it was essential to build a life-stage specific library derived from aphids so that analysis of metabolism and development on a genome-wide scale could be accomplished. Even without subsequent array analysis, a relatively large number of ESTs from a specific life stage can provide clues toward the expression of specific genes important to the functions expressly connected with that life stage (Rafalski ; Arbeitman ). In most cases and within statistical limitations, the abundance of a specific cDNA in the EST collection is a measure of gene expression (Audic and Claverie, 1997). This technique, referred to as a “digital or electronic northern”, has been utilized in several similar studies to gauge relative gene expression in various tissues. The data sets are available at GenBank, dbEST under the following accession numbers.: CB814527-CB814982, CB832665-CB833296, CB854878-CB855147, CB909714-CB910020, CB936196-CB936346, CD449954-CD450759.

Materials and Methods

Aphid rearing and collection

Alate brown citrus aphids, Toxoptera citricida, were obtained from a healthy colony maintained by WB Hunter at the USDA, ARS, U.S. Horticultural Research Laboratory, Ft. Pierce, FL. The founders were collected from a single collection site in Orlando, Florida. The colony was reared under continuous asexual reproduction for a period of 3 years on sweet orange, Madam vinous, seedlings in screen cages contained in an insectary, and held at 25°C, 16 L: 8 D. Plants free of insecticide and bearing new flush were cycled into cages on a weekly basis. Aphids and their host plants were surveyed biweekly for any incidence of contaminating insect species (e.g., mites, parasitoids, fungus gnats, shore flies, etc.). High-density aphid populations produced alate aphids that were collected by aspiration within two days of emergence. All alates were collected from the top of the cage so as to avoid sample contamination with other developmental forms or host plant tissue. Upon collection, alates were immediately submerged into liquid nitrogen prior to total RNA isolation. Approximately 50–100 alates were placed into 95% ethanol and stored at −80°C to be used as voucher specimens.

cDNA library construction

Approximately 4500 1–2 day old alate aphids were used in the construction of an expression library. Whole aphids were ground in liquid nitrogen and total RNA extracted using guanidinium salt-phenol-chloroform procedure as described by Strommer . Poly(A)+ RNA was purified using two rounds of selection on oligo dT magnetic beads according to the manufacturer's instructions (Dynal, www.dynal.no). A directional cDNA library was constructed in Lambda Uni-ZAP® XR Vector using Stratagene's ZAP-cDNA Synthesis Kit (Stratagene, www.stratagene.com). The resulting DNA was packaged into lambda particles using Gigapack® III Gold Packaging Extract (Stratagene). An amplified library was generated with a titer of 1.0 × 109 plaque-forming units per mL. Mass excision of the amplified library was carried out using Ex-Assist® helper phage (Stratagene). An aliquot of the excised, amplified library was used for infecting XL1-Blue MRF' cells and subsequently plated on LB agar containing 100 µg/mL ampicillin. Bacterial clones containing excised pBluescript SK(+) phagemids were recovered by random colony selection.

Sequencing of clones

pBluescript SK(+) phagemids were grown overnight at 37°C and 240 rpm in 96-deep well culture plates containing 1.7 mL of LB broth, supplemented with 100 µg/mL ampicillin. Archived stocks were prepared from the cell cultures using 75 µl of a LB-amp, glycerol mixture and 75 µl of cells. These archived stocks are held at the Horticultural Research Laboratory where they are kept in an ultra low temperature freezer set at −80°C. Plasmid DNA was extracted using the Qiagen 9600 liquid handling robot and the QIAprep 96 Turbo miniprep kit according to the recommended protocol (QIAGEN, www.quigen.com). Sequencing reactions were performed using the ABI PRISM® BigDye™ Primer Cycle Sequencing Kit (Applied Biosystems, home.appliedbiosystems.com) along with a universal T3 primer. Reactions were prepared in 96-well format using the Biomek2000™ liquid handling robot (Beckman Coulter, www.beckman.com). Sequencing reaction products were precipitated with 70% isopropanol, resuspended in 15 µL sterile water and loaded onto an ABI 3700 DNA Analyzer (Applied Biosystems).

Computer analysis

Base confidence scores were designated using TraceTuner® (Paracel, www.paracel.com). Low-quality bases (confidence score <20) were trimmed from both ends of sequences. Quality trimming, vector trimming and sequence fragment alignments were executed using Sequencher® software (Gene Codes, www.genecodes.com). Contaminating sequences such as rRNA and mitochondrial DNA were identified using BLASTN and were excluded from analysis along with sequences less than 100 nucleotides in length after both vector and quality trimming. Additional ESTs that corresponded to vector contaminants were removed from the dataset. To estimate the number of genes represented in the library and the redundancy of specific genes, ESTs were assembled into contigs using Sequencher®. Contig assembly parameters that were set using a minimum overlap of 50 bases and 95% identity match.

Functional annotation of ESTs

Putative sequence identity was determined based on BLAST similarity searches using the National Center for Biotechnology Information (NCBI) BLAST server (http://www.ncbi.nlm.nih.gov) with comparisons made to both non redundant nucleic acid (BLASTN) and protein (BLASTX) databases. ESTs that had no significant similarity to any publicly available sequences using BLASTN and BLASTX were then screened individually using TBLASTX. The top 5 hits for each assembled sequence were then formatted using an in-house parsing program that allowed for direct import into a Microsoft Excel® spreadsheet for further analysis. Sequence matches with E-value scores ≤ −10 were considered significant and were categorized according to the Gene Ontology (GO) classification system based on annotation of the 5 ‘best hit’ matches in BLASTX searches. All D. melanogaster matches were cataloged using FlyBase (www.flybase.org). Those sequences without a D. melanogaster hit were annotated using AmiGO (www.geneontology.org).

Results and Discussion

Generation and assembly of adult alate ESTs

An initial 5180 clones were sequenced from the 5′ end. These sequences were trimmed of vector and low-quality sequence and filtered for minimum length (100 bp), producing 4267 high-quality ESTs of 481 bp average length. These ESTs were analyzed with the Sequencher® assembly program to identify those that represent redundant transcripts. ESTs were assembled into 468 contiguous sequences (contigs) with 1656 ESTs remaining as singlets, suggesting a 61% redundancy. Thus, the combined set of contigs and singlets included 2124 sequences (hereafter referred to as ‘assembled sequences’), putatively representing different transcripts. Only 22 contig sequences contained more than 10 ESTs.

EST quality analysis and sequence survey

Of the 2124 assembled sequences analyzed, 993 (representing 2132 ESTs) were similar to known protein sequences in the non-redundant protein database (BLASTX; E ≤ −10). Seven of these assembled sequences, representing 13 ESTs, were identified by BLASTX as contaminating vector sequences and were removed from the dataset. Because some genes encode RNAs rather than proteins, it was necessary to run BLASTN against our dataset. Eight assembled sequences were identified as ribosomal and 2 were identified as mitochondrial DNA, representing 582 and 65 ESTs respectively, and were removed from the dataset. Although the number of ribosomal sequences appears inflated, it has been shown that several non-coding RNAs, such as rRNA, have mRNA-like modifications, such as polyadenylation and splicing. Because this EST dataset was derived from a cDNA library that was enriched for poly(A+) RNA, it is reasonable to assume that some non-coding RNAs should be present (MacIntosh ). An additional 76 ESTs were identified as either rRNA or mitochondrial using TBLASTX, leaving 2031 assembled sequences used in subsequent functional analyses. Of the initial 2124 assembled sequences (representing 4267 ESTs), 1045 (representing 1412 ESTs) showed no significant similarity (E>−10) to any publicly available sequence using BLASTX, BLASTN, or TBLASTX. This result suggests that a large percentage (∼33%) of the ESTs sequenced here are novel. However, this estimation of potential unique sequences within the cDNA library is most likely to be an overestimation due to several factors, such as computer alignment parameters and low quality internal sequences (White ). Moreover, assembled sequences may have lacked an open reading frame because they were too short causing ESTs to consist mostly or entirely of a noncoding region (e.g., 3′ untranslated region) (Whitfield ). Each Toxoptera citricida assembled sequence was tentatively assigned Gene Ontology classification based on annotation of the top 5 “best hit” matches (E ≤ −10) using BLASTX. Nearly all of these were characterized with respect to the functionally annotated genes in D. melanogaster using FlyBase. Of the 993 sequences demonstrating similarity to known protein sequences, 332 (33%) of these were of unknown molecular function and 685 (69%) were of unknown biological process. Tables 1 and 2 summarize assignments of Toxoptera sequences to major molecular functions and biological processes, respectively.
Table 1.

Molecular Function

Table 2.

Biological Process

Genes of interest within the EST dataset

The BLASTX results provide useful information regarding the homology of proteins that may be critical for insect cellular communication and development. Table 3 lists sequences of the brown citrus aphid that match to D. melanogaster genes implicated in signal transduction, cell differentiation, cell fate commitment, embryonic and larval development, morphogenesis, reproduction and cuticle biosynthesis. Typically, genes involved in early development would not be present in cDNA libraries derived from adult tissues. However, many aphid species are composed entirely of viviparous parthenogenetic females. These insects telescopic generations as embryogenesis occurs in un-born daughters, producing up to three generations developing within an adult individual (Sabater ). Therefore, genes involved in the development of several life stages may be represented simultaneously in this analysis.
Table 3.

Genes of interest in the Alate BrCA EST dataset

For the purposes of this paper, brown citrus aphid sequences were grouped into distinct gene ontology classifications. However, it is important to recognize that many of these gene products act in concert with one another to control cell fate determination which, in turn, drives morphological changes such as eye, leg, and wing development (e.g., the Notch pathway) (Coyle-Thompson and Banerjee, 1993; Baonza ).

Conclusions

We have provided a large data set of ESTs from the alate brown citrus aphid and have begun to analyze this valuable resource. The analysis of this data set is continually evolving and some of the conclusions may have to be revised as more advanced bioinformatic tools become available. Being the first EST data set for the brown citrus aphid, its preliminary examination clearly shows that it is substantially different from the aphid EST data set currently available to the public. For the most part, there is considerable congruence between conventional biochemistry regarding insect metabolism and the number of ESTs encoding metabolic enzymes. This data set provides the first experimental access to these genes and the basis for more in-depth molecular and genomic analysis. Moreover, it identifies genes that are critical in the physiology, reproduction, development, and wing morphogenesis of aphids. Genetic information is crucial to advancing our understanding of aphid biology, and will play a major role in the development of future non-chemical, gene-based control strategies against these insect pests.
  16 in total

1.  Informatic selection of a neural crest-melanocyte cDNA set for microarray analysis.

Authors:  S K Loftus; Y Chen; G Gooden; J F Ryan; G Birznieks; M Hilliard; A D Baxevanis; M Bittner; P Meltzer; J Trent; W Pavan
Journal:  Proc Natl Acad Sci U S A       Date:  1999-08-03       Impact factor: 11.205

2.  Identification and analysis of Arabidopsis expressed sequence tags characteristic of non-coding RNAs.

Authors:  G C MacIntosh; C Wilkerson; P J Green
Journal:  Plant Physiol       Date:  2001-11       Impact factor: 8.340

3.  Genes expressed in Brugia malayi infective third stage larvae.

Authors:  M L Blaxter; N Raghavan; I Ghosh; D Guiliano; W Lu; S A Williams; B Slatko; A L Scott
Journal:  Mol Biochem Parasitol       Date:  1996-04       Impact factor: 1.759

4.  Genes galore: a summary of methods for accessing results from large-scale partial sequencing of anonymous Arabidopsis cDNA clones.

Authors:  T Newman; F J de Bruijn; P Green; K Keegstra; H Kende; L McIntosh; J Ohlrogge; N Raikhel; S Somerville; M Thomashow
Journal:  Plant Physiol       Date:  1994-12       Impact factor: 8.340

5.  Annotated expressed sequence tags and cDNA microarrays for studies of brain and behavior in the honey bee.

Authors:  Charles W Whitfield; Mark R Band; Maria F Bonaldo; Charu G Kumar; Lei Liu; Jose R Pardinas; Hugh M Robertson; M Bento Soares; Gene E Robinson
Journal:  Genome Res       Date:  2002-04       Impact factor: 9.043

6.  Further progress towards a catalogue of all Arabidopsis genes: analysis of a set of 5000 non-redundant ESTs.

Authors:  R Cooke; M Raynal; M Laudié; F Grellet; M Delseny; P C Morris; D Guerrier; J Giraudat; F Quigley; G Clabault; Y F Li; R Mache; M Krivitzky; I J Gy; M Kreis; A Lecharny; Y Parmentier; J Marbach; J Fleck; B Clément; G Philipps; C Hervé; C Bardet; D Tremousaygue; H Höfte
Journal:  Plant J       Date:  1996-01       Impact factor: 6.417

7.  Relationships between extramacrochaetae and Notch signalling in Drosophila wing development.

Authors:  A Baonza; J F de Celis; A García-Bellido
Journal:  Development       Date:  2000-06       Impact factor: 6.868

8.  Gliogenesis in Drosophila: genome-wide analysis of downstream genes of glial cells missing in the embryonic nervous system.

Authors:  Boris Egger; Ronny Leemans; Thomas Loop; Lars Kammermeier; Yun Fan; Tanja Radimerski; Martin C Strahm; Ulrich Certa; Heinrich Reichert
Journal:  Development       Date:  2002-07       Impact factor: 6.868

9.  The strawberry notch gene functions with Notch in common developmental pathways.

Authors:  C A Coyle-Thompson; U Banerjee
Journal:  Development       Date:  1993-10       Impact factor: 6.868

10.  Gene expression during the life cycle of Drosophila melanogaster.

Authors:  Michelle N Arbeitman; Eileen E M Furlong; Farhad Imam; Eric Johnson; Brian H Null; Bruce S Baker; Mark A Krasnow; Matthew P Scott; Ronald W Davis; Kevin P White
Journal:  Science       Date:  2002-09-27       Impact factor: 47.728

View more
  15 in total

1.  Characterization of a new bunyavirus and its derived small RNAs in the brown citrus aphid, Aphis citricidus.

Authors:  Wei Zhang; Tengfei Wu; Mengmeng Guo; Tengyu Chang; Li Yang; Yang Tan; Chao Ye; Jinzhi Niu; Jin-Jun Wang
Journal:  Virus Genes       Date:  2019-05-11       Impact factor: 2.332

2.  Comparative analysis of nuclear tRNA genes of Nasonia vitripennis and other arthropods, and relationships to codon usage bias.

Authors:  S K Behura; M Stanke; C A Desjardins; J H Werren; D W Severson
Journal:  Insect Mol Biol       Date:  2010-02       Impact factor: 3.585

3.  Combining next-generation sequencing strategies for rapid molecular resource development from an invasive aphid species, Aphis glycines.

Authors:  Xiaodong Bai; Wei Zhang; Lucia Orantes; Tae-Hwan Jun; Omprakash Mittapalli; M A Rouf Mian; Andrew P Michel
Journal:  PLoS One       Date:  2010-06-29       Impact factor: 3.240

4.  Whitefly (Bemisia tabaci) genome project: analysis of sequenced clones from egg, instar, and adult (viruliferous and non-viruliferous) cDNA libraries.

Authors:  Dena Leshkowitz; Shirley Gazit; Eli Reuveni; Murad Ghanim; Henryk Czosnek; Cindy McKenzie; Robert L Shatters; Judith K Brown
Journal:  BMC Genomics       Date:  2006-04-11       Impact factor: 3.969

5.  Aphids: a model for polyphenism and epigenetics.

Authors:  Dayalan G Srinivasan; Jennifer A Brisson
Journal:  Genet Res Int       Date:  2012-03-21

6.  Symbiont-mediated functions in insect hosts.

Authors:  Qi Su; Xiaomao Zhou; Youjun Zhang
Journal:  Commun Integr Biol       Date:  2013-04-09

7.  A dual-genome microarray for the pea aphid, Acyrthosiphon pisum, and its obligate bacterial symbiont, Buchnera aphidicola.

Authors:  Alex C C Wilson; Helen E Dunbar; Gregory K Davis; Wayne B Hunter; David L Stern; Nancy A Moran
Journal:  BMC Genomics       Date:  2006-03-14       Impact factor: 3.969

8.  Large-scale gene discovery in the pea aphid Acyrthosiphon pisum (Hemiptera).

Authors:  Beatriz Sabater-Muñoz; Fabrice Legeai; Claude Rispe; Joël Bonhomme; Peter Dearden; Carole Dossat; Aymeric Duclert; Jean-Pierre Gauthier; Danièle Giblot Ducray; Wayne Hunter; Phat Dang; Srini Kambhampati; David Martinez-Torres; Teresa Cortes; Andrès Moya; Atsushi Nakabachi; Cathy Philippe; Nathalie Prunier-Leterme; Yvan Rahbé; Jean-Christophe Simon; David L Stern; Patrick Wincker; Denis Tagu
Journal:  Genome Biol       Date:  2006-03-10       Impact factor: 13.583

9.  Annotated ESTs from various tissues of the brown planthopper Nilaparvata lugens: a genomic resource for studying agricultural pests.

Authors:  Hiroaki Noda; Sawako Kawai; Yoko Koizumi; Kageaki Matsui; Qiang Zhang; Shigetoyo Furukawa; Michihiko Shimomura; Kazuei Mita
Journal:  BMC Genomics       Date:  2008-03-03       Impact factor: 3.969

10.  Genomic resources for Myzus persicae: EST sequencing, SNP identification, and microarray design.

Authors:  John S Ramsey; Alex C C Wilson; Martin de Vos; Qi Sun; Cecilia Tamborindeguy; Agnese Winfield; Gaynor Malloch; Dawn M Smith; Brian Fenton; Stewart M Gray; Georg Jander
Journal:  BMC Genomics       Date:  2007-11-16       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.