Nadav Rappoport1, Michal Linial2. 1. School of Computer Science and Engineering, The Rachel and Selim Benin School of Computer Science and Engineering, The Hebrew University, Jerusalem, Israel. nadavrap@cs.huji.ac.il. 2. Department of Biological Chemistry, The Alexander Silberman Institute of Life Sciences, The Hebrew University of Jerusalem, Edmond J. Safra Campus, Givat Ram, Jerusalem, 91904, Israel. michall@cc.huji.ac.il.
Abstract
BACKGROUND: Insects belong to a class that accounts for the majority of animals on earth. With over one million identified species, insects display a huge diversity and occupy extreme environments. At present, there are dozens of fully sequenced insect genomes that cover a range of habitats, social behavior and morphologies. In view of such diverse collection of genomes, revealing evolutionary trends and charting functional relationships of proteins remain challenging. RESULTS: We analyzed the relatedness of 17 complete proteomes representative of proteomes from insects including louse, bee, beetle, ants, flies and mosquitoes, as well as an out-group from the crustaceans. The analyzed proteomes mostly represented the orders of Hymenoptera and Diptera. The 287,405 protein sequences from the 18 proteomes were automatically clustered into 20,933 families, including 799 singletons. A comprehensive analysis based on statistical considerations identified the families that were significantly expanded or reduced in any of the studied organisms. Among all the tested species, ants are characterized by an exceptionally high rate of family gain and loss. By assigning annotations to hundreds of species-specific families, the functional diversity among species and between the major clades (Diptera and Hymenoptera) is revealed. We found that many species-specific families are associated with receptor signaling, stress-related functions and proteases. The highest variability among insects associates with the function of transposition and nucleic acids processes (collectively coined TNAP). Specifically, the wasp and ants have an order of magnitude more TNAP families and proteins relative to species that belong to Diptera (mosquitoes and flies). CONCLUSIONS: An unsupervised clustering methodology combined with a comparative functional analysis unveiled proteomic signatures in the major clades of winged insects. We propose that the expansion of TNAP families in Hymenoptera potentially contributes to the accelerated genome dynamics that characterize the wasp and ants.
BACKGROUND: Insects belong to a class that accounts for the majority of animals on earth. With over one million identified species, insects display a huge diversity and occupy extreme environments. At present, there are dozens of fully sequenced insect genomes that cover a range of habitats, social behavior and morphologies. In view of such diverse collection of genomes, revealing evolutionary trends and charting functional relationships of proteins remain challenging. RESULTS: We analyzed the relatedness of 17 complete proteomes representative of proteomes from insects including louse, bee, beetle, ants, flies and mosquitoes, as well as an out-group from the crustaceans. The analyzed proteomes mostly represented the orders of Hymenoptera and Diptera. The 287,405 protein sequences from the 18 proteomes were automatically clustered into 20,933 families, including 799 singletons. A comprehensive analysis based on statistical considerations identified the families that were significantly expanded or reduced in any of the studied organisms. Among all the tested species, ants are characterized by an exceptionally high rate of family gain and loss. By assigning annotations to hundreds of species-specific families, the functional diversity among species and between the major clades (Diptera and Hymenoptera) is revealed. We found that many species-specific families are associated with receptor signaling, stress-related functions and proteases. The highest variability among insects associates with the function of transposition and nucleic acids processes (collectively coined TNAP). Specifically, the wasp and ants have an order of magnitude more TNAP families and proteins relative to species that belong to Diptera (mosquitoes and flies). CONCLUSIONS: An unsupervised clustering methodology combined with a comparative functional analysis unveiled proteomic signatures in the major clades of winged insects. We propose that the expansion of TNAP families in Hymenoptera potentially contributes to the accelerated genome dynamics that characterize the wasp and ants.
Authors: G M Rubin; M D Yandell; J R Wortman; G L Gabor Miklos; C R Nelson; I K Hariharan; M E Fortini; P W Li; R Apweiler; W Fleischmann; J M Cherry; S Henikoff; M P Skupski; S Misra; M Ashburner; E Birney; M S Boguski; T Brody; P Brokstein; S E Celniker; S A Chervitz; D Coates; A Cravchik; A Gabrielian; R F Galle; W M Gelbart; R A George; L S Goldstein; F Gong; P Guan; N L Harris; B A Hay; R A Hoskins; J Li; Z Li; R O Hynes; S J Jones; P M Kuehl; B Lemaitre; J T Littleton; D K Morrison; C Mungall; P H O'Farrell; O K Pickeral; C Shue; L B Vosshall; J Zhang; Q Zhao; X H Zheng; S Lewis Journal: Science Date: 2000-03-24 Impact factor: 47.728
Authors: Robert D Finn; Jaina Mistry; John Tate; Penny Coggill; Andreas Heger; Joanne E Pollington; O Luke Gavin; Prasad Gunasekaran; Goran Ceric; Kristoffer Forslund; Liisa Holm; Erik L L Sonnhammer; Sean R Eddy; Alex Bateman Journal: Nucleic Acids Res Date: 2009-11-17 Impact factor: 16.971
Authors: John H Werren; Stephen Richards; Christopher A Desjardins; Oliver Niehuis; Jürgen Gadau; John K Colbourne; John H Werren; Stephen Richards; Christopher A Desjardins; Oliver Niehuis; Jürgen Gadau; John K Colbourne; Leo W Beukeboom; Claude Desplan; Christine G Elsik; Cornelis J P Grimmelikhuijzen; Paul Kitts; Jeremy A Lynch; Terence Murphy; Deodoro C S G Oliveira; Christopher D Smith; Louis van de Zande; Kim C Worley; Evgeny M Zdobnov; Maarten Aerts; Stefan Albert; Victor H Anaya; Juan M Anzola; Angel R Barchuk; Susanta K Behura; Agata N Bera; May R Berenbaum; Rinaldo C Bertossa; Márcia M G Bitondi; Seth R Bordenstein; Peer Bork; Erich Bornberg-Bauer; Marleen Brunain; Giuseppe Cazzamali; Lesley Chaboub; Joseph Chacko; Dean Chavez; Christopher P Childers; Jeong-Hyeon Choi; Michael E Clark; Charles Claudianos; Rochelle A Clinton; Andrew G Cree; Alexandre S Cristino; Phat M Dang; Alistair C Darby; Dirk C de Graaf; Bart Devreese; Huyen H Dinh; Rachel Edwards; Navin Elango; Eran Elhaik; Olga Ermolaeva; Jay D Evans; Sylvain Foret; Gerald R Fowler; Daniel Gerlach; Joshua D Gibson; Donald G Gilbert; Dan Graur; Stefan Gründer; Darren E Hagen; Yi Han; Frank Hauser; Da Hultmark; Henry C Hunter; Gregory D D Hurst; Shalini N Jhangian; Huaiyang Jiang; Reed M Johnson; Andrew K Jones; Thomas Junier; Tatsuhiko Kadowaki; Albert Kamping; Yuri Kapustin; Bobak Kechavarzi; Jaebum Kim; Jay Kim; Boris Kiryutin; Tosca Koevoets; Christie L Kovar; Evgenia V Kriventseva; Robert Kucharski; Heewook Lee; Sandra L Lee; Kristin Lees; Lora R Lewis; David W Loehlin; John M Logsdon; Jacqueline A Lopez; Ryan J Lozado; Donna Maglott; Ryszard Maleszka; Anoop Mayampurath; Danielle J Mazur; Marcella A McClure; Andrew D Moore; Margaret B Morgan; Jean Muller; Monica C Munoz-Torres; Donna M Muzny; Lynne V Nazareth; Susanne Neupert; Ngoc B Nguyen; Francis M F Nunes; John G Oakeshott; Geoffrey O Okwuonu; Bart A Pannebakker; Vikas R Pejaver; Zuogang Peng; Stephen C Pratt; Reinhard Predel; Ling-Ling Pu; Hilary Ranson; Rhitoban Raychoudhury; Andreas Rechtsteiner; Justin T Reese; Jeffrey G Reid; Megan Riddle; Hugh M Robertson; Jeanne Romero-Severson; Miriam Rosenberg; Timothy B Sackton; David B Sattelle; Helge Schlüns; Thomas Schmitt; Martina Schneider; Andreas Schüler; Andrew M Schurko; David M Shuker; Zilá L P Simões; Saurabh Sinha; Zachary Smith; Victor Solovyev; Alexandre Souvorov; Andreas Springauf; Elisabeth Stafflinger; Deborah E Stage; Mario Stanke; Yoshiaki Tanaka; Arndt Telschow; Carol Trent; Selina Vattathil; Eveline C Verhulst; Lumi Viljakainen; Kevin W Wanner; Robert M Waterhouse; James B Whitfield; Timothy E Wilkes; Michael Williamson; Judith H Willis; Florian Wolschin; Stefan Wyder; Takuji Yamada; Soojin V Yi; Courtney N Zecher; Lan Zhang; Richard A Gibbs Journal: Science Date: 2010-01-15 Impact factor: 47.728
Authors: Lothar Wissler; Jürgen Gadau; Daniel F Simola; Martin Helmkampf; Erich Bornberg-Bauer Journal: Genome Biol Evol Date: 2013 Impact factor: 3.416
Authors: Jan Philip Oeyen; Patrice Baa-Puyoulet; Joshua B Benoit; Leo W Beukeboom; Erich Bornberg-Bauer; Anja Buttstedt; Federica Calevro; Elizabeth I Cash; Hsu Chao; Hubert Charles; Mei-Ju May Chen; Christopher Childers; Andrew G Cridge; Peter Dearden; Huyen Dinh; Harsha Vardhan Doddapaneni; Amanda Dolan; Alexander Donath; Daniel Dowling; Shannon Dugan; Elizabeth Duncan; Elena N Elpidina; Markus Friedrich; Elzemiek Geuverink; Joshua D Gibson; Sonja Grath; Cornelis J P Grimmelikhuijzen; Ewald Große-Wilde; Cameron Gudobba; Yi Han; Bill S Hansson; Frank Hauser; Daniel S T Hughes; Panagiotis Ioannidis; Emmanuelle Jacquin-Joly; Emily C Jennings; Jeffery W Jones; Steffen Klasberg; Sandra L Lee; Peter Lesný; Mackenzie Lovegrove; Sebastian Martin; Alexander G Martynov; Christoph Mayer; Nicolas Montagné; Victoria C Moris; Monica Munoz-Torres; Shwetha Canchi Murali; Donna M Muzny; Brenda Oppert; Nicolas Parisot; Thomas Pauli; Ralph S Peters; Malte Petersen; Christian Pick; Emma Persyn; Lars Podsiadlowski; Monica F Poelchau; Panagiotis Provataris; Jiaxin Qu; Maarten J M F Reijnders; Björn Marcus von Reumont; Andrew J Rosendale; Felipe A Simao; John Skelly; Alexandros G Sotiropoulos; Aaron L Stahl; Megumi Sumitani; Elise M Szuter; Olivia Tidswell; Evangelos Tsitlakidis; Lucia Vedder; Robert M Waterhouse; John H Werren; Jeanne Wilbrandt; Kim C Worley; Daisuke S Yamamoto; Louis van de Zande; Evgeny M Zdobnov; Tanja Ziesmann; Richard A Gibbs; Stephen Richards; Masatsugu Hatakeyama; Bernhard Misof; Oliver Niehuis Journal: Genome Biol Evol Date: 2020-07-01 Impact factor: 3.416