Literature DB >> 12620096

Beenomes to Bombyx: future directions in applied insect genomics.

Jay D Evans1, Dawn Gundersen-Rindal.   

Abstract

The recent sequencing of the Anopheles gambiae genome showcases the genetic breadth of insects and a trend towards sequencing organisms directly involved with human welfare. We describe traits in other insect species that make them important candidates for genomics projects, and review several recent workshops aimed at uniting researchers working with insect species to efficiently address problems in medicine, biotechnology, and agriculture.

Entities:  

Mesh:

Year:  2003        PMID: 12620096      PMCID: PMC153451          DOI: 10.1186/gb-2003-4-3-107

Source DB:  PubMed          Journal:  Genome Biol        ISSN: 1474-7596            Impact factor:   13.583


The recent sequencing of the Anopheles gambiae genome [1] is a watershed event in genomics for two reasons. First, this species is of sufficient phylogenetic distance from the previously sequenced Drosophila melanogaster to provide the best view to date of changes in genome organization and composition across the insects. The 250 million-year spread between these species, abetted by a high rate of sequence evolution, allows genomic comparisons over an evolutionary time-scale equal to that between humans and fish [2], larger by one-third than that between humans and chickens. Although this is a fraction of the distance covered by insects as a whole, it allows new tests of inferences drawn from Drosophila about gene function in insects in general. The second reason that the Anopheles gambiae genome is a landmark is that Anopheles is the first animal to be sequenced, other than ourselves, whose actions have a strong direct impact on human lives. In the near future such 'applied' genomic projects will probably become the norm, as agencies involved with human health and agriculture develop plans to sequence key pests and beneficial species. This trend is particularly evident in insect genomics. The next two species in the insect genome queue, the honey bee (Apis mellifera) and silkworm moth (Bombyx mori), were selected in part because of their longstanding use in agriculture. Other insect candidates, including another mosquito (Aedes aegypti), the medfly (Ceratitis capitata), and flour beetle (Tribolium castaneum), also have longstanding histories of research driven by their impacts on humans. In this article, we discuss criteria that might be used to evaluate the candidacy of various insect taxa for whole-genome sequencing. Specifically, we compare and contrast genome size, current genetic knowledge, species diversity, and the human impact of insects from 11 different insect orders and suggest how scientists and funders could use these criteria to help justify and prioritize future sequencing efforts. In addition, we briefly summarize recent scientific workshops aimed at integrating scientists and research programs focused on questions concerning basic and applied genomics in non-traditional insect species.

Genome sequencing criteria in insects

Given limited time and funding, robust criteria must be developed by which to weigh insect species as new sequencing candidates. One obvious goal is taxonomic breadth, and the eventual completion of full genome sequences from members representing the three major insect clades (Figure 1[3]) will be an essential contribution to comparative genomics. Taxonomic breadth by itself is not a sufficient criterion for comparing sequencing candidates, however. Full genome sequences from multiple species of Drosophila, for example, can complement each other by clarifying gene function and organization in this well-studied genus, and by extrapolation in insects in general. Furthermore, multiple candidates within the same insect order may warrant sequencing on the basis of other criteria: for example, within the order Diptera are mosquitoes, Drosophila and the economically important tephritid fly Ceratitis capitata.
Figure 1

Phylogenetic tree of insect orders, after Wheeler et al. [3]. Light gray, Archaeognatha (primitive wingless insects); Dark gray, Paleoptera (primitive winged insects); Black, Neoptera (higher insects). Crustacea are shown as an arthropod outgroup. Thysanura include silverfish; Odonata, dragonflies; Orthoptera, grasshoppers and crickets; Phasmida, stick insects; Blattaria, roaches; Heteroptera, true bugs; Homoptera, aphids, scales and tree hoppers; Coleoptera, beetles; Hymenoptera, ants, bees and wasps; Lepidoptera, moths and butterflies; and Diptera, flies.

Here, we use four criteria, from among the many possible, to compare the merits of insects from the 11 different insect orders shown in Figure 1. First, we use genome size based on estimates in the Animal Genome Size Database [4] as a predictor of direct sequencing costs. We estimate a mean genome size for each order, weighted at the level of family (for example, the numerous estimates of genome size in the fly family Drosophilidae were averaged and used as a single data point). This correlates well with estimates for the smallest genome in each order, and arguably is a more relevant estimate of the genome size of potential candidate species. We assume that sequencing costs increase linearly with genome size, given that any economy of scale achieved in sequencing larger genomes is likely to be mitigated by a need for higher sequence redundancy prior to assembly of large genomes. Next, we evaluate the current level of genetic research for different organisms, because this is both an aid to sequence assembly and annotation and reflects the number of scientists who would be likely to benefit from a complete sequence. Our surrogate marker for the level of genetic research is the number of protein sequences present in GenBank [5] as of 15 December 2002 (excluding protein entries for species whose genome sequences are known in full, namely Anopheles gambiae and Drosophila melanogaster). We then compare species diversity across orders [6], because the strongest inferences in terms of gene function and synteny are likely to occur within insect orders, and orders with greater diversity are likely to have greater ecological and economic importance as well as a larger community of researchers. Finally, we estimate the direct human impact of specific insect orders. For this, we counted the number of papers about each insect order referenced in the CAB Abstracts Database [7], an international abstract service for agricultural and applied sciences, from 1993-2002. We then derive a composite score for each insect order by ranking GenBank records, species diversity, and relative human impact, and dividing this value by the mean genome size (Table 1).
Table 1

Insect orders evaluated for sequencing priority

Insect orderGenome sizeGenBank recordsSpecies diversityHuman impactComposite score
Thysanura
Odonata
Orthoptera
Phasmida
Blattaria
Heteroptera
Homoptera
Coleoptera
Hymenoptera
Lepidoptera
Diptera

Insect orders are listed according to the phylogenetic tree shown in Figure 1 and show relative genome size (weighted mean, with size corresponding to the area of the circle). Also shown are the number of protein records in GenBank, number of worldwide species, and human impact estimate (see text for further details). In each case the proportion of the circle that is black indicates the number relative to the order with the highest value for each measure (the highest ranking order being shown with a filled circle). The last column shows a composite ranking of orders assuming that equal weight is given to each of these criteria.

Our results indicate strengths in all criteria for the holometabolous insect orders (those with complete metamorphosis) - Hymenoptera, Diptera, Lepidoptera and Coleoptera - as predicted previously [8] (see Figure 1 legend for the common names of insects in these orders). Coleoptera are the most speciose worldwide, but have slightly lower economic impact than Lepidoptera, Hymenoptera, and Diptera. Beyond the holometabolous insects, the order Homoptera stands out for having species with generally small genomes and great economic and agricultural importance. The representatives from the primitive insect orders Thysanura and Odonata, while valuable from the standpoint of phylogenetic breadth, fare poorly compared to other orders using our criteria. The emphasis in these criteria on insects with recognized human impact and ecological importance is not meant to negate the value of model insect species as sequencing candidates. Model insect genomes can provide general insights into biological mechanisms, gene structure and function, and the conserved evolutionary processes that select for certain genetic traits. Thus model organisms, as illustrated by species of Drosophila, yield invaluable insights for all insect genomes. And as a final caveat, we should emphasize that although we present several ways to compare the merits of different insect groups, we do not mean to infer that these are the only criteria useful for such decisions. (Our views are our own and need not reflect the opinions of our agency or the US government.)

Recent insect genome collaborations, and progress

Several recent workshops have been held with a specific focus on insect genomics and its applications. The Comparative Insect Genomics Workshop (Washington DC, USA; October 2001; sponsored by the US Department of Agriculture) was the first international meeting of scientists from academia, private industry, and government with the purpose of addressing and promoting the broad field of insect genomics. Discussions at this meeting focused on current approaches for analyzing and comparing genomes, the evaluation of candidate insects for genome sequencing, and ways to coordinate genomic efforts and ensure public access to materials and datasets. Leaders from the fruitfly, nematode, plant, and microbial genomics communities discussed the evolution of their own genome initiatives, and offered critiques of impending projects in insects. Because Drosophila-associated projects have served as models for all insect genomicists, there was substantial discussion of how new insect projects might benefit, and might benefit from, studies involving Drosophila. FlyBase [9], a key database for Drosophila genetics, forms one venue for comparative analyses in insects that is already widely used by those working on other insect species. Similar resources available through the US National Institutes of Health [10] and the Gene Ontology Consortium [11] were also identified as being key to generating testable inferences for new genome sequences. Recognizing the success of the completed and ongoing dipteran genome projects, several working groups have formed to develop and promote genome projects in new insect groups. Within the Hymenoptera, an international genomics effort has been emerging for several years around the honey bee, arguably the best studied and economically most important member of this group. Propelled in part by the Comparative Insect Genomics Workshop, a successful funding white paper was submitted to the US National Human Genome Research Institute for honey bee genome sequencing (now nearing completion) at the Baylor College of Medicine Genome Center. A more recent Honey Bee Biotechnology Workshop (Sapporo, Japan; July 2002) focused both on details of this genome project and on independent genomics efforts. New applications of functional genomic techniques described there foretell the many ways researchers will use genomic data to answer basic and applied questions in this species. As one example, two lab groups discussed the successful application of RNA interference methods in honey bee embryos and brains. Within the Lepidoptera, an international genomics effort has centered on the economically important silkworm moth [12], for which a completed genome sequence is expected in 2004. The recent International Workshop of Lepidopteran Genomics (Tsukuba, Japan; September 2002) focused on key aspects of this genome project, most notably the integration of large-insert libraries, expressed sequence tags (ESTs), and applications of transgenic technologies. The International Lepidopteran Genome Project [13] has been charged with applying new technologies to compare the genomes of a growing list of agriculturally important moths and butterflies. Among these, the crop-feeding heliothine moths have long been appreciated as significant genome candidates. One privately funded genome project in this group, involving the tobacco budworm Heliothis virescens, is apparently complete but remains inaccessible to the public. By contrast the Bombyx mori project [12] and projects involving additional heliothine species are expected to be carried out with full public access. Although no formal gatherings have been held to date, working groups representing additional insect orders (for example within the Coleoptera and Homoptera) continue to develop within the insect genomics research community. Well-defined and concerted research efforts, combined with advancing technologies and access to post-genomic tools and data, will speed advances in these taxa. As one example, functional studies using RNA interference and related methods are now feasible for all insect species, using orthologs identified through matches with current genome projects. Additionally, newly available large-insert libraries, for example those available through [14], can be used to begin testing for synteny and structure in diverse insect genomes. Finally, comparative genomics databases from flies, moths, and bees will undoubtedly be used to inform other genomics projects. In conclusion, the field of insect genomics is experiencing an exceptional year that should invigorate insect genetic studies. The outbreak of genome sequences is also likely to impact genetic studies more broadly. New estimates suggest that 61% and 66% of protein coding sequences from Drosophila and Anopheles, respectively, have known orthologs in non-insect genomes (human, mouse, Arabidopsis, worm, yeast, zebrafish, rat and rice [15,16]). This upward trend (only 20-30% of Drosophila genes were identified as having non-insect matches two years ago [17]) is certain to continue with incoming genome data for bees, moths, and their relatives. Researchers studying insect genomes can look forward to using these shared traits to better address general problems in medicine, biotechnology, agriculture, and evolutionary biology.
  5 in total

1.  Immunity-related genes and gene families in Anopheles gambiae.

Authors:  George K Christophides; Evgeny Zdobnov; Carolina Barillas-Mury; Ewan Birney; Stephanie Blandin; Claudia Blass; Paul T Brey; Frank H Collins; Alberto Danielli; George Dimopoulos; Charles Hetru; Ngo T Hoa; Jules A Hoffmann; Stefan M Kanzok; Ivica Letunic; Elena A Levashina; Thanasis G Loukeris; Gareth Lycett; Stephan Meister; Kristin Michel; Luis F Moita; Hans-Michael Müller; Mike A Osta; Susan M Paskewitz; Jean-Marc Reichhart; Andrey Rzhetsky; Laurent Troxler; Kenneth D Vernick; Dina Vlachou; Jennifer Volz; Christian von Mering; Jiannong Xu; Liangbiao Zheng; Peer Bork; Fotis C Kafatos
Journal:  Science       Date:  2002-10-04       Impact factor: 47.728

2.  euGenes: a eukaryote genome information system.

Authors:  Donald G Gilbert
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

3.  Comparative genomics of the eukaryotes.

Authors:  G M Rubin; M D Yandell; J R Wortman; G L Gabor Miklos; C R Nelson; I K Hariharan; M E Fortini; P W Li; R Apweiler; W Fleischmann; J M Cherry; S Henikoff; M P Skupski; S Misra; M Ashburner; E Birney; M S Boguski; T Brody; P Brokstein; S E Celniker; S A Chervitz; D Coates; A Cravchik; A Gabrielian; R F Galle; W M Gelbart; R A George; L S Goldstein; F Gong; P Guan; N L Harris; B A Hay; R A Hoskins; J Li; Z Li; R O Hynes; S J Jones; P M Kuehl; B Lemaitre; J T Littleton; D K Morrison; C Mungall; P H O'Farrell; O K Pickeral; C Shue; L B Vosshall; J Zhang; Q Zhao; X H Zheng; S Lewis
Journal:  Science       Date:  2000-03-24       Impact factor: 47.728

4.  The genome sequence of the malaria mosquito Anopheles gambiae.

Authors:  Robert A Holt; G Mani Subramanian; Aaron Halpern; Granger G Sutton; Rosane Charlab; Deborah R Nusskern; Patrick Wincker; Andrew G Clark; José M C Ribeiro; Ron Wides; Steven L Salzberg; Brendan Loftus; Mark Yandell; William H Majoros; Douglas B Rusch; Zhongwu Lai; Cheryl L Kraft; Josep F Abril; Veronique Anthouard; Peter Arensburger; Peter W Atkinson; Holly Baden; Veronique de Berardinis; Danita Baldwin; Vladimir Benes; Jim Biedler; Claudia Blass; Randall Bolanos; Didier Boscus; Mary Barnstead; Shuang Cai; Angela Center; Kabir Chaturverdi; George K Christophides; Mathew A Chrystal; Michele Clamp; Anibal Cravchik; Val Curwen; Ali Dana; Art Delcher; Ian Dew; Cheryl A Evans; Michael Flanigan; Anne Grundschober-Freimoser; Lisa Friedli; Zhiping Gu; Ping Guan; Roderic Guigo; Maureen E Hillenmeyer; Susanne L Hladun; James R Hogan; Young S Hong; Jeffrey Hoover; Olivier Jaillon; Zhaoxi Ke; Chinnappa Kodira; Elena Kokoza; Anastasios Koutsos; Ivica Letunic; Alex Levitsky; Yong Liang; Jhy-Jhu Lin; Neil F Lobo; John R Lopez; Joel A Malek; Tina C McIntosh; Stephan Meister; Jason Miller; Clark Mobarry; Emmanuel Mongin; Sean D Murphy; David A O'Brochta; Cynthia Pfannkoch; Rong Qi; Megan A Regier; Karin Remington; Hongguang Shao; Maria V Sharakhova; Cynthia D Sitter; Jyoti Shetty; Thomas J Smith; Renee Strong; Jingtao Sun; Dana Thomasova; Lucas Q Ton; Pantelis Topalis; Zhijian Tu; Maria F Unger; Brian Walenz; Aihui Wang; Jian Wang; Mei Wang; Xuelan Wang; Kerry J Woodford; Jennifer R Wortman; Martin Wu; Alison Yao; Evgeny M Zdobnov; Hongyu Zhang; Qi Zhao; Shaying Zhao; Shiaoping C Zhu; Igor Zhimulev; Mario Coluzzi; Alessandra della Torre; Charles W Roth; Christos Louis; Francis Kalush; Richard J Mural; Eugene W Myers; Mark D Adams; Hamilton O Smith; Samuel Broder; Malcolm J Gardner; Claire M Fraser; Ewan Birney; Peer Bork; Paul T Brey; J Craig Venter; Jean Weissenbach; Fotis C Kafatos; Frank H Collins; Stephen L Hoffman
Journal:  Science       Date:  2002-10-04       Impact factor: 47.728

5.  The Anopheles genome and comparative insect genomics.

Authors:  Thomas C Kaufman; David W Severson; Gene E Robinson
Journal:  Science       Date:  2002-10-04       Impact factor: 47.728

  5 in total
  4 in total

1.  The effect of local nucleotides on synonymous codon usage in the honeybee (Apis mellifera L.) genome.

Authors:  Robert W Cutler; Panuwan Chantawannakul
Journal:  J Mol Evol       Date:  2007-05-29       Impact factor: 2.395

2.  The use of Open Reading frame ESTs (ORESTES) for analysis of the honey bee transcriptome.

Authors:  Francis M F Nunes; Valeria Valente; Josane F Sousa; Marco A V Cunha; Daniel G Pinheiro; Rafaela M Maia; Daniela D Araujo; Maria C R Costa; Waleska K Martins; Alex F Carvalho; Nadia Monesi; Adriana M Nascimento; Pablo M V Peixoto; Maria F R Silva; Ricardo G P Ramos; Luis F L Reis; Emmanuel Dias-Neto; Sandro J Souza; Andrew J G Simpson; Marco A Zago; Ademilson E E Soares; Marcia M G Bitondi; Enilza M Espreafico; Foued S Espindola; Maria L Paco-Larson; Zila L P Simoes; Klaus Hartfelder; Wilson A Silva
Journal:  BMC Genomics       Date:  2004-11-03       Impact factor: 3.969

3.  A cricket Gene Index: a genomic resource for studying neurobiology, speciation, and molecular evolution.

Authors:  Patrick D Danley; Sean P Mullen; Fenglong Liu; Vishvanath Nene; John Quackenbush; Kerry L Shaw
Journal:  BMC Genomics       Date:  2007-04-25       Impact factor: 3.969

4.  The evolution of genome size in ants.

Authors:  Neil D Tsutsui; Andrew V Suarez; Joseph C Spagna; J Spencer Johnston
Journal:  BMC Evol Biol       Date:  2008-02-26       Impact factor: 3.260

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.