| Literature DB >> 34279216 |
Bernard Y Kim1, Jeremy R Wang2, Daniel R Matute3, Dmitri A Petrov1, Danny E Miller4, Olga Barmina5, Emily Delaney5, Ammon Thompson5, Aaron A Comeault6, David Peede3, Emmanuel R R D'Agostino3, Julianne Pelaez7, Jessica M Aguilar7, Diler Haji7, Teruyuki Matsunaga7, Ellie E Armstrong1, Molly Zych8, Yoshitaka Ogawa9, Marina Stamenković-Radak10, Mihailo Jelić10, Marija Savić Veselinović10, Marija Tanasković11, Pavle Erić11, Jian-Jun Gao12, Takehiro K Katoh12, Masanori J Toda13, Hideaki Watabe14, Masayoshi Watada15, Jeremy S Davis16, Leonie C Moyle17, Giulia Manoli18, Enrico Bertolini18, Vladimír Košťál19, R Scott Hawley20, Aya Takahashi9, Corbin D Jones3, Donald K Price21, Noah Whiteman7, Artyom Kopp5.
Abstract
Over 100 years of studies in Drosophila melanogaster and related species in the genus Drosophila have facilitated key discoveries in genetics, genomics, and evolution. While high-quality genome assemblies exist for several species in this group, they only encompass a small fraction of the genus. Recent advances in long-read sequencing allow high-quality genome assemblies for tens or even hundreds of species to be efficiently generated. Here, we utilize Oxford Nanopore sequencing to build an open community resource of genome assemblies for 101 lines of 93 drosophilid species encompassing 14 species groups and 35 sub-groups. The genomes are highly contiguous and complete, with an average contig N50 of 10.5 Mb and greater than 97% BUSCO completeness in 97/101 assemblies. We show that Nanopore-based assemblies are highly accurate in coding regions, particularly with respect to coding insertions and deletions. These assemblies, along with a detailed laboratory protocol and assembly pipelines, are released as a public resource and will serve as a starting point for addressing broad questions of genetics, ecology, and evolution at the scale of hundreds of species.Entities:
Keywords: D. melanogaster; Drosophila; Drosophilidae; comparative genomics; evolutionary biology; genetics; genome assembly; genomics; long reads; nanopore
Mesh:
Year: 2021 PMID: 34279216 PMCID: PMC8337076 DOI: 10.7554/eLife.66405
Source DB: PubMed Journal: Elife ISSN: 2050-084X Impact factor: 8.713