| Literature DB >> 31969194 |
Gregg W C Thomas1, Elias Dohmen2,3,4, Daniel S T Hughes5,6, Shwetha C Murali5,7, Monica Poelchau8, Karl Glastad9,10, Clare A Anstead11, Nadia A Ayoub12, Phillip Batterham13, Michelle Bellair5,14, Greta J Binford15, Hsu Chao5, Yolanda H Chen16, Christopher Childers8, Huyen Dinh5, Harsha Vardhan Doddapaneni5, Jian J Duan17, Shannon Dugan5, Lauren A Esposito18, Markus Friedrich19, Jessica Garb20, Robin B Gasser11, Michael A D Goodisman9, Dawn E Gundersen-Rindal21, Yi Han5, Alfred M Handler22, Masatsugu Hatakeyama23, Lars Hering24, Wayne B Hunter25, Panagiotis Ioannidis26,27, Joy C Jayaseelan5, Divya Kalra5, Abderrahman Khila28, Pasi K Korhonen11, Carol Eunmi Lee29, Sandra L Lee5, Yiyuan Li30, Amelia R I Lindsey31,32, Georg Mayer24, Alistair P McGregor33, Duane D McKenna34, Bernhard Misof35, Mala Munidasa5, Monica Munoz-Torres36,37, Donna M Muzny5, Oliver Niehuis38, Nkechinyere Osuji-Lacy5, Subba R Palli39, Kristen A Panfilio40, Matthias Pechmann41, Trent Perry13, Ralph S Peters42, Helen C Poynton43, Nikola-Michael Prpic44,45, Jiaxin Qu5, Dorith Rotenberg46, Coby Schal47, Sean D Schoville48, Erin D Scully49, Evette Skinner5, Daniel B Sloan50, Richard Stouthamer31, Michael R Strand51, Nikolaus U Szucsich52, Asela Wijeratne34,53, Neil D Young11, Eduardo E Zattara54, Joshua B Benoit55, Evgeny M Zdobnov26, Michael E Pfrender30, Kevin J Hackett56, John H Werren57, Kim C Worley5, Richard A Gibbs5, Ariel D Chipman58, Robert M Waterhouse59, Erich Bornberg-Bauer2,3,60, Matthew W Hahn1, Stephen Richards61,62.
Abstract
BACKGROUND: Arthropods comprise the largest and most diverse phylum on Earth and play vital roles in nearly every ecosystem. Their diversity stems in part from variations on a conserved body plan, resulting from and recorded in adaptive changes in the genome. Dissection of the genomic record of sequence change enables broad questions regarding genome evolution to be addressed, even across hyper-diverse taxa within arthropods.Entities:
Keywords: Arthropods; DNA methylation; Evolution; Gene content; Genome assembly; Genomics; Protein domains
Mesh:
Year: 2020 PMID: 31969194 PMCID: PMC6977273 DOI: 10.1186/s13059-019-1925-7
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1OrthoDB orthology delineation for the i5K pilot species. The bars show Metazoa-level orthologs for the 76 selected arthropods and three outgroup species (of 13 outgroup species used for orthology analysis) partitioned according to their presence and copy number, sorted from the largest total gene counts to the smallest. The 28 i5K species generated in this study with a total of 533,636 gene models are indicated in bold green font. A total of 38,195 orthologous protein groups were annotated among the total 76 genomes
Fig. 2Arthropod phylogeny inferred from 569 to 4097 single-copy protein-coding genes among the six multi-species orders, crustaceans, and non-spider chelicerates (Additional file 1: Table S13) and 150 single-copy genes for the orders represented by a single species and the deeper nodes. Divergence times estimated with non-parametric rate smoothing and fossil calibrations at 22 nodes (Additional file 1: Table S14). Species in bold are those sequenced within the framework of the i5K pilot project. All nodes, except those indicated with red shapes, have bootstrap support of 100 inferred by ASTRAL. Nodes of particular interest are labeled in orange and referred to in the text. Larger fonts indicate multi-species orders enabling CAFE 3.0 likelihood analyses (see “Methods”). Nodes leading to major taxonomic groups have been labeled with their node number and the number of genes inferred at that point. See Additional file 2: Figure S16 and Additional file 1: Table S12 for full node labels
Fig. 3Summary of major results from gene family, protein domain, and methylation analyses. a We identify 147 gene families emerging during the evolution of insects, including several which may play an important role in insect development and adaptation. b Contrastingly, we find only ten emergent gene families during the evolution of holometabolous insects, indicating many gene families were already present during this transition. c Among all lineage nodes, we find that the node leading to Lepidoptera has the most emergent gene families. d We find that rates of gene gain and loss are highly correlated with rates of protein domain rearrangement. Leafcutter ants have experienced high rates of both types of change. e Blattella germanica has experienced the highest number of rapid gene family changes, possibly indicating its ability to rapidly adapt to new environments. f We observe signals of CpG methylation in all Araneae (spiders) genomes investigated (species shown: the brown recluse spider, Loxosceles reclusa) and the genome of the bark scorpion, Centruroides exilicauda. The two peaks show different CG counts in different gene features, with depletion of CG sequences in the left peak due to methylated C’s mutating to T. This suggests epigenetic control of a significant number of spider genes. Additional plots for all species in this study are shown in Additional file 2: Figure S5
Fig. 4Rate of genomic change along the arthropod phylogeny: a frequency of amino acid substitutions per site, b gene gains/losses, and c domain changes. All rates are averaged per My and color-indicated as branches of the phylogenetic tree. Species names are shown on the right; specific subclades are highlighted by colors according to the taxonomic groups noted in Fig. 2