Literature DB >> 27242037

ArthropodaCyc: a CycADS powered collection of BioCyc databases to analyse and compare metabolism of arthropods.

Patrice Baa-Puyoulet1, Nicolas Parisot1, Gérard Febvay1, Jaime Huerta-Cepas2, Augusto F Vellozo3, Toni Gabaldón4, Federica Calevro1, Hubert Charles1, Stefano Colella5.   

Abstract

Arthropods interact with humans at different levels with highly beneficial roles (e.g. as pollinators), as well as with a negative impact for example as vectors of human or animal diseases, or as agricultural pests. Several arthropod genomes are available at present and many others will be sequenced in the near future in the context of the i5K initiative, offering opportunities for reconstructing, modelling and comparing their metabolic networks. In-depth analysis of these genomic data through metabolism reconstruction is expected to contribute to a better understanding of the biology of arthropods, thereby allowing the development of new strategies to control harmful species. In this context, we present here ArthropodaCyc, a dedicated BioCyc collection of databases using the Cyc annotation database system (CycADS), allowing researchers to perform reliable metabolism comparisons of fully sequenced arthropods genomes. Since the annotation quality is a key factor when performing such global genome comparisons, all proteins from the genomes included in the ArthropodaCyc database were re-annotated using several annotation tools and orthology information. All functional/domain annotation results and their sources were integrated in the databases for user access. Currently, ArthropodaCyc offers a centralized repository of metabolic pathways, protein sequence domains, Gene Ontology annotations as well as evolutionary information for 28 arthropod species. Such database collection allows metabolism analysis both with integrated tools and through extraction of data in formats suitable for systems biology studies.Database URL: http://arthropodacyc.cycadsys.org/.
© The Author(s) 2016. Published by Oxford University Press.

Entities:  

Mesh:

Year:  2016        PMID: 27242037      PMCID: PMC5630938          DOI: 10.1093/database/baw081

Source DB:  PubMed          Journal:  Database (Oxford)        ISSN: 1758-0463            Impact factor:   3.451


Introduction

More than 10 years have passed since the publication of the initial sequencing and analysis of the human genome (1, 2), which has had a great impact on the way we investigate biological processes, notably with the development of novel technologies enabling comprehensive genomic analyses (3). The genomes of several other organisms were sequenced before and after the human genome, starting with the animal models: Drosophila melanogaster (4), Mus musculus (5) and Rattus norvegicus (6). These data are driving the development of genomics-based research approaches to study the biology of many living organisms beyond humans and established model organisms. Recently, a large community of researchers have launched the Genome 10K project to obtain the full genome sequences of 10 000 vertebrate species (7). A similar initiative has been launched by the Arthropod Genomics consortium: the i5K initiative, which aims at sequencing the genomes of 5000 arthropod species (http://arthropodgenomes.org/wiki/i5K) (8, 9). The availability of the full genome sequence of an organism allows researchers to have a complete view of its metabolism. The BioCyc collection of Pathway/Genome DataBases (PGDBs) (10) constitutes a key resource for studying the metabolism of multiple organisms as it enables comparative studies. The first database of the collection, EcoCyc (11), is at present a comprehensive resource to study Escherichia coli biology (12). The quality of these databases is strongly linked to the annotation used to generate them and, in the most recent release (June 24, 2015—version 19.1), only seven databases are intensively manually curated and frequently updated (BioCyc Tier 1 PGDBs): EcoCyc (12), MetaCyc (13), HumanCyc (14), AraCyc (15), YeastCyc (16), LeishCyc (17) and TrypanoCyc (18). Such expert driven annotation is only possible for large communities of scientists working on the same model and, consequently, the majority of the 5711 BioCyc PGDBs available in this release are computationally derived: 39 are subject to moderate manual curation (BioCyc Tier 2 PGDBs) and 5455 to no manual curation at all (BioCyc Tier 3 PGDBs) (see http://biocyc.org/biocyc-pgdb-list.shtml for an updated listing). The upcoming deluge of fully sequenced genomes, driven by NGS technology, demands the development of a novel genomic infrastructure (19). To contribute to the need of standardized automated annotation, we developed a Cyc Annotation Database System (CycADS) (20). CycADS was successfully used to generate AcypiCyc (http://acypicyc.cycadsys.org), a database dedicated to the pea aphid Acyrthosiphon pisum metabolism that was developed during the annotation phase of the genome of this insect (21). As many other arthropod genomes are available, and many more will be in the future in the context of the i5K initiative, we decided to develop a collection of BioCyc Tier 3 pathway/genome databases for arthropods using the uniform and enriched automated functional annotations provided by the CycADS system.

Implementation

CycADS annotation management system

ArthropodaCyc is a collection of BioCyc PGDBs that contains, at the time of writing (March 2016), the metabolic network of 28 arthropods with sequenced genomes (4, 21–45), including 25 insects, two arachnids and a branchiopod [Figure 1, note that phylogenetic relationships between species are displayed using a cladogram based on available data (46–50)]. All databases in ArthropodaCyc were generated using CycADS (20), an annotation management system programmed in Java (Model-View-Controller structure) and SQL that was originally developed for the annotation of the pea aphid genome (21). CycADS facilitates the collection and management of information obtained from both genomic data and different protein annotation methods in a SQL database. A pipeline to filter for bacterial contaminants was developed and integrated in the protein functional annotation system, which involves multiple methods (see below for a detailed description). All data collected in CycADS were then extracted and formatted to generate, for each organism, an ad hoc input file (a BioCyc ‘Path-o-logic file’ format) used by the Pathway Tools software (51) to produce BioCyc-like enriched metabolic database (20) (Supplementary Figure S1). Genomes are included in ArthropodaCyc if they fulfil the following criteria: (i) the genome sequence is published and (ii) the sequence data can be downloaded in appropriate formats (comprehensive GFF or Genbank file with compatible gene/mRNA/protein features). Each organism database summary page contains updated information relative to the genome data release used to generate it.
Figure 1.

ArthropodaCyc databases list and summary. This table shows the distribution of reactions in the Cyc databases across the six top-level categories identified by the Enzyme Commission (E.C.). Included in this table are all reactions in each database which have been assigned either full or partial E.C. numbers, and for which an enzyme has been identified (these statistics do not include pathway holes). Phylogenetic relationships between species are displayed using a cladogram based on available data (46–50).

ArthropodaCyc databases list and summary. This table shows the distribution of reactions in the Cyc databases across the six top-level categories identified by the Enzyme Commission (E.C.). Included in this table are all reactions in each database which have been assigned either full or partial E.C. numbers, and for which an enzyme has been identified (these statistics do not include pathway holes). Phylogenetic relationships between species are displayed using a cladogram based on available data (46–50).

Filtering possible bacterial contaminations

Organisms across all kingdoms of life are associated with microbial partners, with interactions ranging from parasitism to mutualism. Arthropods are no exception and can harbour microorganisms at their external surface, or internally as endosymbionts, gut microbiota, parasites or pathogens. Despite the use of specific DNA extraction protocols, massive sequencing of arthropod genomes may generate sequences contaminated by prokaryotic DNA. Since the ArthropodaCyc databases aims at collecting high-quality functional annotations, we decided to implement a pipeline for the detection of putative contaminant bacterial sequences, to be used before the reconstruction step of the arthropod metabolic pathways. First, genomic sequences, annotation files (GFF/GBK) and protein sets of each arthropod genome project were retrieved from public repositories (52–62). Genomic sequences smaller than 50 kbp were compared to the NCBI’s RefSeq prokaryotic genome sequences database (63) using BLASTN (64). Contaminant genomic sequences were identified using a 90% identity threshold over at least 90% of the query length. A BLASTP against the NCBI RefSeq prokaryotic protein sequences database was thus performed to check for bacterial contaminations within the remaining proteins. Protein sequences were filtered based on BLAST results using three different criteria: (i) at least 90% amino acid identity over at least 90% of the query length, (ii) at least 90% amino acid identity over at least 50% of both the query and the hit lengths or (iii) at least 95% identity over a sliding window of 100 amino acids. To reduce the risk of removing arthropod sequences (false-positives), we included a last step performing a BLASTP (≥80% amino acid identity over at least 80% of the query length) of the putative contaminant protein list against an invertebrate subset of the reference UniProtKB/Swiss-Prot protein sequence database (65). Proteins with positive hits were reintegrated in the annotation process. Lists of putative bacterial contaminants detected within the 28 arthropod genomes of ArthropodaCyc are provided in Supplementary Table S1. The proteins identified as putative bacterial contaminants were flagged upon extraction and they were not used in the Pathway Tools reactions inference (see below). However, they still appear in the database with a gene/protein page and the information about their status of contaminants.

The functional annotation pipeline

We used multiple methods to perform a functional annotation: the online KAAS-KEGG annotation pipeline (66) and the PRIAM (67), Blast2GO (68, 69) and InterProScan (70) pipelines with a local installation for faster data generation (summary of results in Supplementary Table S2). These methods generated functional information (EC number, KEGG Orthology and Gene Ontology) related to the protein sequences and all annotations were collected in the database using flexible annotation loaders (annotation collector module) available in CycADS. Default parameters were used for software configurations and the BLAST alignments (prior to the Blast2GO analysis) were performed against the reference UniProtKB/Swiss-Prot protein sequences database (65). All annotation data were extracted from the CycADS SQL database for each analysed genome and collected in a Pathologic file that was used in Pathway Tools (51) to generate the corresponding BioCyc PGDB. For several arthropods, a genome wide phylogenetic analysis performed using the Phylome pipeline and collected in PhylomeDB (71) was available. In those cases, Gene Ontology annotations were transferred using orthology relationships to Drosophila melanogaster and integrated in the ArthropodaCyc databases using CycADS as previously described (20). Furthermore, for the arthropods with a Phylome analysis, the orthology predictions generated in MetaPhOrs (June 2015 release) (72) using a combination of phylogenetic information derived from different databases were included into the corresponding BioCyc databases using the orthology functionality of Pathway Tools. Using CycADS, enriched gene records were automatically generated in the Pathologic file format imported by Pathway Tools. Each Pathologic file record contained the gene and the gene product names, synonyms, sequence structural information, as well as the annotations including Enzyme evidences (E.C. numbers), KEGG Orthology, Gene Ontology and MetaPhOrs orthology that were integrated in the databases. In the note section for each gene/protein page, the information relative to the annotation results are recorded to allow the researchers to evaluate the confidence for each putative function assigned to a protein (20). Useful external cross-links, such as to NCBI’s RefSeq or genomic databases of arthropod communities (i.e. AphidBase, VectorBase and Hymenoptera Genome Database), are also integrated thanks to the CycADS pipeline. Moreover, as InterProScan (70) analysis identifies functional domains, we included links to the InterPro external database identifiers when appropriate (see Figure 2 example page).
Figure 2.

Screenshot of an ArthropodaCyc enzyme page. The page provides several information such as: (1) gene name, accession numbers and synonym names; (2) a summary of metabolism annotation evidences from KAAS-KEGG, PRIAM, InterProScan, PhylomeDB and BLAST2GO; (3) genome position with an additional link to the corresponding genome browser, and information on gene and protein length and polypeptide molecular weight; (4) external cross-links to specific genomic databases, enzyme annotation and InterProScan domains information and to phylogeny in PhylomeDB; (5) schematics representing the reaction(s) carried out by the enzyme; (6) Gene ontology terms associated with the enzyme functions; (7) additional information on the reaction(s) carried out by the enzyme, including the pathway(s) (if any) where this reaction may occur; (8) gene local context, including neighbouring genes; (9) gene structure in terms of (added) exons/introns organization and (10) an “Operations box” offering several options for comparative analyses. Filled circles, (2) and (4), represent ArthropodaCyc specific features.

Screenshot of an ArthropodaCyc enzyme page. The page provides several information such as: (1) gene name, accession numbers and synonym names; (2) a summary of metabolism annotation evidences from KAAS-KEGG, PRIAM, InterProScan, PhylomeDB and BLAST2GO; (3) genome position with an additional link to the corresponding genome browser, and information on gene and protein length and polypeptide molecular weight; (4) external cross-links to specific genomic databases, enzyme annotation and InterProScan domains information and to phylogeny in PhylomeDB; (5) schematics representing the reaction(s) carried out by the enzyme; (6) Gene ontology terms associated with the enzyme functions; (7) additional information on the reaction(s) carried out by the enzyme, including the pathway(s) (if any) where this reaction may occur; (8) gene local context, including neighbouring genes; (9) gene structure in terms of (added) exons/introns organization and (10) an “Operations box” offering several options for comparative analyses. Filled circles, (2) and (4), represent ArthropodaCyc specific features. It is important to underline that, as data formats in genomics can be very disparate depending on the source file format used (i.e. GFF, Gbk), data on each arthropod genome were manually checked to ensure that a unique identifier for genes and products was present and that clear relations could be established among the different features. The flexible CycADS parsers were parameterized using its detailed configuration file (20). Finally, all automatic tasks of Pathway Tools consistency checker were run before databases saving and publishing. With the perspective to make available an up-to-date database representative of the fast evolving field of arthropod genomics, ArthropodaCyc will be updated annually.

Discussion on features and usage

The ArthropodaCyc collection of enriched BioCyc databases for arthropods whose genome has been fully sequenced and assembled is a key resource for all members of the Arthropod Genomics Consortium (http://arthropodgenomes.org/wiki/Main_Page). Our collection takes advantage of CycADS (20): a powerful annotation management system allowing to manage multiple genomes and to generate a set of BioCyc database where each organism has been annotated using identical tools and automatized procedures. Furthermore, our collection is enriched by phylogeny-based orthology predictions available in PhylomeDB/MetaPhOrs (71, 72) and customized hyperlinks to organism specific genome browsers. The ArthropodaCyc collection of databases takes also full advantage of the rich BioCyc interface and tools for metabolism data analysis (73). Several analyses can be performed using the BioCyc online interface that includes advanced query tools (74), and powerful web-based genomic data viewers (75). Moreover, ArthropodaCyc offers the possibility to download the data in formats suitable for data analysis: either using other tools, such as for example Cytoscape (76) and MetExplore (77), or for use in personally developed analysis software and pipelines. We are already using ArthropodaCyc to contribute to the analysis of the metabolism in genome annotation projects currently ongoing on different insect species: the green peach aphid Myzus persicae [manuscript in preparation], the milkweed bug Oncopeltus fasciatus [manuscript in preparation](78–80) and the rice weevil Sitophilus oryzae. Beyond the single organism analysis, the BioCyc interface provides the user with tools for comparative analyses (51) to identify interesting features of a given organism metabolism that could shed light on its biology. The interest in comparative analyses will greatly grow as more arthropods are sequenced. Even if a full comparative analysis in arthropods is beyond the purpose of this article, we provide here a few examples of its usage. ArthropodaCyc can be used for the identification of enzymes and/or pathways unique to a given organism or group of organisms. As an example, we used the present version of the ArthropodaCyc database to verify the lack of the tyrosine degradation pathway that we had originally described in the pea aphid genome by using the AcypiCyc database (20, 21) and comparing this genome to the few insect genomes available at that time. We could confirm that A. pisum is the only insect lacking this pathway among the 25 available in the database at present (Figure 3A). We interpreted this loss as an explanation of the pea aphid lifestyle and we linked it to the high demand for tyrosine by pea aphids in connection with their unbalanced plant phloem sap diet. Even though the nutrition of these insects is complemented by their primary symbiont, Buchnera aphidicola, this bacterium provides only precursors for tyrosine biosynthesis. The enzymes catalysing the last two steps of tyrosine synthesis are in fact encoded in the aphid genome. In particular, multiple genes coding the aspartate transaminase (E.C. 2.6.1.1), the enzyme involved in the synthesis of phenylalanine from phenylpyruvate are present in the pea aphid genome with one of them (ACYPI004243) specifically regulated during embryo development in aphids reproducing by parthenogenesis (81). We thus checked whether this aspartate transaminase gene expansion was also present in the genomes of the other 27 arthropods present in the ArthropodaCyc database. Remarkably, only six out of the other 27 arthropods show a gene expansion comparable to the pea aphid (5 or 4 genes), and all the other 21 arthropods present only 2 or 3 genes encoding for this enzyme (Supplementary Table S3). Even if further phylogenetic analyses would be needed to better understand the origin of the differences in this enzyme-coding gene, this is an example of another possible use of ArthropodaCyc to easily explore the number of genes and their structures for specific enzymes (an interesting genome variation beyond the relatively rare presence/absence case of enzymatic reactions).
Figure 3.

Two examples of insect pathway differences identified using ArthropodaCyc. (A) Pathway of tyrosine degradation, comparison between A. pisum and the other insects of ArthropodaCyc; (B) Pathway of lysine degradation, comparison between the five species of Diptera and the other insects of ArthropodaCyc. In each pathway, green coloured enzymes are present, while grey enzymes and reactions are absent. Enzymes: 1.13.11.5 = homogentisate 1,2-dioxygenase; 1.13.11.27 = 4-hydroxyphenylpyruvate dioxygenase; 1.2.1.31 = L-aminoadipate-semialdehyde dehydrogenase; 1.2.4.2 = oxoglutarate dehydrogenase (succinyl-transferring); 1.5.1.8 = saccharopine dehydrogenase (NADP+, L-lysine-forming); 1.5.1.9 = saccharopine dehydrogenase (NAD+, L-glutamate-forming); 2.3.1.61 = dihydrolipoyllysine-residue succinyltransferase; 2.6.1.5 = tyrosine transaminase; 2.6.1.39 = 2-aminoadipate transaminase; 2.6.1.57 = aromatic-amino-acid transaminase; 3.7.1.2 = fumarylacetoacetase; 5.2.1.2 = maleylacetoacetate isomerase.

Two examples of insect pathway differences identified using ArthropodaCyc. (A) Pathway of tyrosine degradation, comparison between A. pisum and the other insects of ArthropodaCyc; (B) Pathway of lysine degradation, comparison between the five species of Diptera and the other insects of ArthropodaCyc. In each pathway, green coloured enzymes are present, while grey enzymes and reactions are absent. Enzymes: 1.13.11.5 = homogentisate 1,2-dioxygenase; 1.13.11.27 = 4-hydroxyphenylpyruvate dioxygenase; 1.2.1.31 = L-aminoadipate-semialdehyde dehydrogenase; 1.2.4.2 = oxoglutarate dehydrogenase (succinyl-transferring); 1.5.1.8 = saccharopine dehydrogenase (NADP+, L-lysine-forming); 1.5.1.9 = saccharopine dehydrogenase (NAD+, L-glutamate-forming); 2.3.1.61 = dihydrolipoyllysine-residue succinyltransferase; 2.6.1.5 = tyrosine transaminase; 2.6.1.39 = 2-aminoadipate transaminase; 2.6.1.57 = aromatic-amino-acid transaminase; 3.7.1.2 = fumarylacetoacetase; 5.2.1.2 = maleylacetoacetate isomerase. As another example application, we also decided to explore the database to search for pathways that would be characteristic of a specific group of insects and we identified the 2-aminoadipate transaminase (E.C. 2.6.1.39) in the lysine degradation pathway as uniquely missing in the genomes of the five dipteran species available in ArthropodaCyc (A. aegypti, A. gambiae, C. quinquefasciatus, D. melanogaster and G. morsitans) (Figure 3B). These examples provided here show the power of ArthropodaCyc in finding differences between specific organisms that might be linked with their biology, even though for the lysine degradation it is difficult to speculate on possible reasons for this apparent loss of a complete pathway as these five dipteran species live in multiple habitats and feed on very diversified diets.

Conclusions

We present here ArthropodaCyc, the most comprehensive collection of BioCyc databases for arthropods, which we expect to be of great interest for a broad community of scientists. Several genomes of arthropods are being sequenced and many more will be sequenced in the future as part of the i5K initiative. The CycADS pipeline empowers both the development and the update of the PGDB in ArthropodaCyc. Our databases are an arthropod research resource that is also linked, whenever possible, to single organism community based genomic databases, thus offering to the researchers an integrated access to different sources of annotations. Click here for additional data file.
  78 in total

1.  Pathway Tools version 13.0: integrated software for pathway/genome informatics and systems biology.

Authors:  Peter D Karp; Suzanne M Paley; Markus Krummenacker; Mario Latendresse; Joseph M Dale; Thomas J Lee; Pallavi Kaipa; Fred Gilham; Aaron Spaulding; Liviu Popescu; Tomer Altman; Ian Paulsen; Ingrid M Keseler; Ron Caspi
Journal:  Brief Bioinform       Date:  2009-12-02       Impact factor: 11.622

2.  Genome sequence of the Brown Norway rat yields insights into mammalian evolution.

Authors:  Richard A Gibbs; George M Weinstock; Michael L Metzker; Donna M Muzny; Erica J Sodergren; Steven Scherer; Graham Scott; David Steffen; Kim C Worley; Paula E Burch; Geoffrey Okwuonu; Sandra Hines; Lora Lewis; Christine DeRamo; Oliver Delgado; Shannon Dugan-Rocha; George Miner; Margaret Morgan; Alicia Hawes; Rachel Gill; Robert A Holt; Mark D Adams; Peter G Amanatides; Holly Baden-Tillson; Mary Barnstead; Soo Chin; Cheryl A Evans; Steve Ferriera; Carl Fosler; Anna Glodek; Zhiping Gu; Don Jennings; Cheryl L Kraft; Trixie Nguyen; Cynthia M Pfannkoch; Cynthia Sitter; Granger G Sutton; J Craig Venter; Trevor Woodage; Douglas Smith; Hong-Mei Lee; Erik Gustafson; Patrick Cahill; Arnold Kana; Lynn Doucette-Stamm; Keith Weinstock; Kim Fechtel; Robert B Weiss; Diane M Dunn; Eric D Green; Robert W Blakesley; Gerard G Bouffard; Pieter J De Jong; Kazutoyo Osoegawa; Baoli Zhu; Marco Marra; Jacqueline Schein; Ian Bosdet; Chris Fjell; Steven Jones; Martin Krzywinski; Carrie Mathewson; Asim Siddiqui; Natasja Wye; John McPherson; Shaying Zhao; Claire M Fraser; Jyoti Shetty; Sofiya Shatsman; Keita Geer; Yixin Chen; Sofyia Abramzon; William C Nierman; Paul H Havlak; Rui Chen; K James Durbin; Amy Egan; Yanru Ren; Xing-Zhi Song; Bingshan Li; Yue Liu; Xiang Qin; Simon Cawley; Kim C Worley; A J Cooney; Lisa M D'Souza; Kirt Martin; Jia Qian Wu; Manuel L Gonzalez-Garay; Andrew R Jackson; Kenneth J Kalafus; Michael P McLeod; Aleksandar Milosavljevic; Davinder Virk; Andrei Volkov; David A Wheeler; Zhengdong Zhang; Jeffrey A Bailey; Evan E Eichler; Eray Tuzun; Ewan Birney; Emmanuel Mongin; Abel Ureta-Vidal; Cara Woodwark; Evgeny Zdobnov; Peer Bork; Mikita Suyama; David Torrents; Marina Alexandersson; Barbara J Trask; Janet M Young; Hui Huang; Huajun Wang; Heming Xing; Sue Daniels; Darryl Gietzen; Jeanette Schmidt; Kristian Stevens; Ursula Vitt; Jim Wingrove; Francisco Camara; M Mar Albà; Josep F Abril; Roderic Guigo; Arian Smit; Inna Dubchak; Edward M Rubin; Olivier Couronne; Alexander Poliakov; Norbert Hübner; Detlev Ganten; Claudia Goesele; Oliver Hummel; Thomas Kreitler; Young-Ae Lee; Jan Monti; Herbert Schulz; Heike Zimdahl; Heinz Himmelbauer; Hans Lehrach; Howard J Jacob; Susan Bromberg; Jo Gullings-Handley; Michael I Jensen-Seaman; Anne E Kwitek; Jozef Lazar; Dean Pasko; Peter J Tonellato; Simon Twigger; Chris P Ponting; Jose M Duarte; Stephen Rice; Leo Goodstadt; Scott A Beatson; Richard D Emes; Eitan E Winter; Caleb Webber; Petra Brandt; Gerald Nyakatura; Margaret Adetobi; Francesca Chiaromonte; Laura Elnitski; Pallavi Eswara; Ross C Hardison; Minmei Hou; Diana Kolbe; Kateryna Makova; Webb Miller; Anton Nekrutenko; Cathy Riemer; Scott Schwartz; James Taylor; Shan Yang; Yi Zhang; Klaus Lindpaintner; T Dan Andrews; Mario Caccamo; Michele Clamp; Laura Clarke; Valerie Curwen; Richard Durbin; Eduardo Eyras; Stephen M Searle; Gregory M Cooper; Serafim Batzoglou; Michael Brudno; Arend Sidow; Eric A Stone; J Craig Venter; Bret A Payseur; Guillaume Bourque; Carlos López-Otín; Xose S Puente; Kushal Chakrabarti; Sourav Chatterji; Colin Dewey; Lior Pachter; Nicolas Bray; Von Bing Yap; Anat Caspi; Glenn Tesler; Pavel A Pevzner; David Haussler; Krishna M Roskin; Robert Baertsch; Hiram Clawson; Terrence S Furey; Angie S Hinrichs; Donna Karolchik; William J Kent; Kate R Rosenbloom; Heather Trumbower; Matt Weirauch; David N Cooper; Peter D Stenson; Bin Ma; Michael Brent; Manimozhiyan Arumugam; David Shteynberg; Richard R Copley; Martin S Taylor; Harold Riethman; Uma Mudunuri; Jane Peterson; Mark Guyer; Adam Felsenfeld; Susan Old; Stephen Mockrin; Francis Collins
Journal:  Nature       Date:  2004-04-01       Impact factor: 49.962

3.  Genome sequence of the tsetse fly (Glossina morsitans): vector of African trypanosomiasis.

Authors: 
Journal:  Science       Date:  2014-04-25       Impact factor: 47.728

4.  FlyBase 101--the basics of navigating FlyBase.

Authors:  Peter McQuilton; Susan E St Pierre; Jim Thurmond
Journal:  Nucleic Acids Res       Date:  2011-11-29       Impact factor: 16.971

5.  Obligate mutualism within a host drives the extreme specialization of a fig wasp genome.

Authors:  Jin-Hua Xiao; Zhen Yue; Ling-Yi Jia; Xin-Hua Yang; Li-Hua Niu; Zhuo Wang; Peng Zhang; Bao-Fa Sun; Shun-Min He; Zi Li; Tuan-Lin Xiong; Wen Xin; Hai-Feng Gu; Bo Wang; John H Werren; Robert W Murphy; David Wheeler; Li-Ming Niu; Guang-Chang Ma; Ting Tang; Sheng-Nan Bian; Ning-Xin Wang; Chun-Yan Yang; Nan Wang; Yue-Guan Fu; Wen-Zhu Li; Soojin V Yi; Xing-Yu Yang; Qing Zhou; Chang-Xin Lu; Chun-Yan Xu; Li-Juan He; Li-Li Yu; Ming Chen; Yuan Zheng; Shao-Wei Wang; Shuang Zhao; Yan-Hong Li; Yang-Yang Yu; Xiao-Ju Qian; Yue Cai; Lian-Le Bian; Shu Zhang; Jun-Yi Wang; Ye Yin; Hui Xiao; Guan-Hong Wang; Hui Yu; Wen-Shan Wu; James M Cook; Jun Wang; Da-Wei Huang
Journal:  Genome Biol       Date:  2013-12-20       Impact factor: 13.583

6.  The draft genome of a socially polymorphic halictid bee, Lasioglossum albipes.

Authors:  Sarah D Kocher; Cai Li; Wei Yang; Hao Tan; Soojin V Yi; Xingyu Yang; Hopi E Hoekstra; Guojie Zhang; Naomi E Pierce; Douglas W Yu
Journal:  Genome Biol       Date:  2013-12-20       Impact factor: 13.583

7.  VectorBase: an updated bioinformatics resource for invertebrate vectors and other organisms related with human diseases.

Authors:  Gloria I Giraldo-Calderón; Scott J Emrich; Robert M MacCallum; Gareth Maslen; Emmanuel Dialynas; Pantelis Topalis; Nicholas Ho; Sandra Gesing; Gregory Madey; Frank H Collins; Daniel Lawson
Journal:  Nucleic Acids Res       Date:  2014-12-15       Impact factor: 16.971

8.  PhylomeDB v4: zooming into the plurality of evolutionary histories of a genome.

Authors:  Jaime Huerta-Cepas; Salvador Capella-Gutiérrez; Leszek P Pryszcz; Marina Marcet-Houben; Toni Gabaldón
Journal:  Nucleic Acids Res       Date:  2013-11-25       Impact factor: 16.971

9.  The evolutionary dynamics of major regulators for sexual development among Hymenoptera species.

Authors:  Matthias Biewer; Francisca Schlesinger; Martin Hasselmann
Journal:  Front Genet       Date:  2015-04-10       Impact factor: 4.599

10.  TrypanoCyc: a community-led biochemical pathways database for Trypanosoma brucei.

Authors:  Sanu Shameer; Flora J Logan-Klumpler; Florence Vinson; Ludovic Cottret; Benjamin Merlet; Fiona Achcar; Michael Boshart; Matthew Berriman; Rainer Breitling; Frédéric Bringaud; Peter Bütikofer; Amy M Cattanach; Bridget Bannerman-Chukualim; Darren J Creek; Kathryn Crouch; Harry P de Koning; Hubert Denise; Charles Ebikeme; Alan H Fairlamb; Michael A J Ferguson; Michael L Ginger; Christiane Hertz-Fowler; Eduard J Kerkhoven; Pascal Mäser; Paul A M Michels; Archana Nayak; David W Nes; Derek P Nolan; Christian Olsen; Fatima Silva-Franco; Terry K Smith; Martin C Taylor; Aloysius G M Tielens; Michael D Urbaniak; Jaap J van Hellemond; Isabel M Vincent; Shane R Wilkinson; Susan Wyllie; Fred R Opperdoes; Michael P Barrett; Fabien Jourdan
Journal:  Nucleic Acids Res       Date:  2014-10-09       Impact factor: 16.971

View more
  7 in total

1.  Drosophila-associated bacteria differentially shape the nutritional requirements of their host during juvenile growth.

Authors:  Jessika Consuegra; Théodore Grenier; Patrice Baa-Puyoulet; Isabelle Rahioui; Houssam Akherraz; Hugo Gervais; Nicolas Parisot; Pedro da Silva; Hubert Charles; Federica Calevro; François Leulier
Journal:  PLoS Biol       Date:  2020-03-20       Impact factor: 8.029

2.  CiliateGEM: an open-project and a tool for predictions of ciliate metabolic variations and experimental condition design.

Authors:  Alessio Mancini; Filmon Eyassu; Maxwell Conway; Annalisa Occhipinti; Pietro Liò; Claudio Angione; Sandra Pucciarelli
Journal:  BMC Bioinformatics       Date:  2018-11-30       Impact factor: 3.169

3.  Bacteriocyte Reprogramming to Cope With Nutritional Stress in a Phloem Sap Feeding Hemipteran, the Pea Aphid Acyrthosiphon pisum.

Authors:  Stefano Colella; Nicolas Parisot; Pierre Simonet; Karen Gaget; Gabrielle Duport; Patrice Baa-Puyoulet; Yvan Rahbé; Hubert Charles; Gérard Febvay; Patrick Callaerts; Federica Calevro
Journal:  Front Physiol       Date:  2018-10-25       Impact factor: 4.566

4.  Molecular evolutionary trends and feeding ecology diversification in the Hemiptera, anchored by the milkweed bug genome.

Authors:  Kristen A Panfilio; Iris M Vargas Jentzsch; Joshua B Benoit; Deniz Erezyilmaz; Yuichiro Suzuki; Stefano Colella; Hugh M Robertson; Monica F Poelchau; Robert M Waterhouse; Panagiotis Ioannidis; Matthew T Weirauch; Daniel S T Hughes; Shwetha C Murali; John H Werren; Chris G C Jacobs; Elizabeth J Duncan; David Armisén; Barbara M I Vreede; Patrice Baa-Puyoulet; Chloé S Berger; Chun-Che Chang; Hsu Chao; Mei-Ju M Chen; Yen-Ta Chen; Christopher P Childers; Ariel D Chipman; Andrew G Cridge; Antonin J J Crumière; Peter K Dearden; Elise M Didion; Huyen Dinh; Harsha Vardhan Doddapaneni; Amanda Dolan; Shannon Dugan; Cassandra G Extavour; Gérard Febvay; Markus Friedrich; Neta Ginzburg; Yi Han; Peter Heger; Christopher J Holmes; Thorsten Horn; Yi-Min Hsiao; Emily C Jennings; J Spencer Johnston; Tamsin E Jones; Jeffery W Jones; Abderrahman Khila; Stefan Koelzer; Viera Kovacova; Megan Leask; Sandra L Lee; Chien-Yueh Lee; Mackenzie R Lovegrove; Hsiao-Ling Lu; Yong Lu; Patricia J Moore; Monica C Munoz-Torres; Donna M Muzny; Subba R Palli; Nicolas Parisot; Leslie Pick; Megan L Porter; Jiaxin Qu; Peter N Refki; Rose Richter; Rolando Rivera-Pomar; Andrew J Rosendale; Siegfried Roth; Lena Sachs; M Emília Santos; Jan Seibert; Essia Sghaier; Jayendra N Shukla; Richard J Stancliffe; Olivia Tidswell; Lucila Traverso; Maurijn van der Zee; Séverine Viala; Kim C Worley; Evgeny M Zdobnov; Richard A Gibbs; Stephen Richards
Journal:  Genome Biol       Date:  2019-04-02       Impact factor: 13.583

5.  The transposable element-rich genome of the cereal pest Sitophilus oryzae.

Authors:  Nicolas Parisot; Carlos Vargas-Chávez; Clément Goubert; Patrice Baa-Puyoulet; Séverine Balmand; Louis Beranger; Caroline Blanc; Aymeric Bonnamour; Matthieu Boulesteix; Nelly Burlet; Federica Calevro; Patrick Callaerts; Théo Chancy; Hubert Charles; Stefano Colella; André Da Silva Barbosa; Elisa Dell'Aglio; Alex Di Genova; Gérard Febvay; Toni Gabaldón; Mariana Galvão Ferrarini; Alexandra Gerber; Benjamin Gillet; Robert Hubley; Sandrine Hughes; Emmanuelle Jacquin-Joly; Justin Maire; Marina Marcet-Houben; Florent Masson; Camille Meslin; Nicolas Montagné; Andrés Moya; Ana Tereza Ribeiro de Vasconcelos; Gautier Richard; Jeb Rosen; Marie-France Sagot; Arian F A Smit; Jessica M Storer; Carole Vincent-Monegat; Agnès Vallier; Aurélien Vigneron; Anna Zaidman-Rémy; Waël Zamoum; Cristina Vieira; Rita Rebollo; Amparo Latorre; Abdelaziz Heddi
Journal:  BMC Biol       Date:  2021-11-09       Impact factor: 7.431

6.  Rapid transcriptional plasticity of duplicated gene clusters enables a clonally reproducing aphid to colonise diverse plant species.

Authors:  Thomas C Mathers; Yazhou Chen; Gemy Kaithakottil; Fabrice Legeai; Sam T Mugford; Patrice Baa-Puyoulet; Anthony Bretaudeau; Bernardo Clavijo; Stefano Colella; Olivier Collin; Tamas Dalmay; Thomas Derrien; Honglin Feng; Toni Gabaldón; Anna Jordan; Irene Julca; Graeme J Kettles; Krissana Kowitwanich; Dominique Lavenier; Paolo Lenzi; Sara Lopez-Gomollon; Damian Loska; Daniel Mapleson; Florian Maumus; Simon Moxon; Daniel R G Price; Akiko Sugio; Manuella van Munster; Marilyne Uzest; Darren Waite; Georg Jander; Denis Tagu; Alex C C Wilson; Cock van Oosterhout; David Swarbreck; Saskia A Hogenhout
Journal:  Genome Biol       Date:  2017-02-13       Impact factor: 13.583

7.  The genome sequence of the grape phylloxera provides insights into the evolution, adaptation, and invasion routes of an iconic pest.

Authors:  Claude Rispe; Fabrice Legeai; Paul D Nabity; Rosa Fernández; Arinder K Arora; Patrice Baa-Puyoulet; Celeste R Banfill; Leticia Bao; Miquel Barberà; Maryem Bouallègue; Anthony Bretaudeau; Jennifer A Brisson; Federica Calevro; Pierre Capy; Olivier Catrice; Thomas Chertemps; Carole Couture; Laurent Delière; Angela E Douglas; Keith Dufault-Thompson; Paula Escuer; Honglin Feng; Astrid Forneck; Toni Gabaldón; Roderic Guigó; Frédérique Hilliou; Silvia Hinojosa-Alvarez; Yi-Min Hsiao; Sylvie Hudaverdian; Emmanuelle Jacquin-Joly; Edward B James; Spencer Johnston; Benjamin Joubard; Gaëlle Le Goff; Gaël Le Trionnaire; Pablo Librado; Shanlin Liu; Eric Lombaert; Hsiao-Ling Lu; Martine Maïbèche; Mohamed Makni; Marina Marcet-Houben; David Martínez-Torres; Camille Meslin; Nicolas Montagné; Nancy A Moran; Daciana Papura; Nicolas Parisot; Yvan Rahbé; Mélanie Ribeiro Lopes; Aida Ripoll-Cladellas; Stéphanie Robin; Céline Roques; Pascale Roux; Julio Rozas; Alejandro Sánchez-Gracia; Jose F Sánchez-Herrero; Didac Santesmasses; Iris Scatoni; Rémy-Félix Serre; Ming Tang; Wenhua Tian; Paul A Umina; Manuella van Munster; Carole Vincent-Monégat; Joshua Wemmer; Alex C C Wilson; Ying Zhang; Chaoyang Zhao; Jing Zhao; Serena Zhao; Xin Zhou; François Delmotte; Denis Tagu
Journal:  BMC Biol       Date:  2020-07-23       Impact factor: 7.431

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.