Maria de la Paz Celorio-Mancera, Pasi Rastas, Rachel A Steward, Soren Nylin, Christopher W Wheat.
Abstract
The comma butterfly (Polygonia c-album, Nymphalidae, Lepidoptera) is a model insect species, most notably in the study of phenotypic plasticity and plant-insect coevolutionary interactions. To facilitate the integration of genomic tools with a diverse body of ecological and evolutionary research, we assembled the genome of a Swedish comma using 10X sequencing, scaffolding with mate-pair data, genome polishing, and assignment to linkage groups using a high-density linkage map. The resulting genome is 373 Mb in size, with a scaffold N50 of 11.7 Mb and a contig N50 of 11.2 Mb. The genome contained 90.1% of single-copy Lepidopteran orthologs in a BUSCO analysis of 5,286 genes. A total of 21,004 gene models were annotated on the genome using RNA-Seq data from larval and adult tissue in combination with proteins from the Arthropoda database, resulting in a high-quality annotation for which functional annotations were generated. We further documented the quality of the chromosomal assembly via synteny assessment with Melitaea cinxia. The resulting annotated, chromosome-level genome will provide an important resource for investigating coevolutionary dynamics and comparative analyses in Lepidoptera.
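The N50 values quoted above summarize assembly contiguity: with sequences sorted from longest to shortest, N50 is the length of the sequence at which the running total first reaches half of the total assembly length. The following is a minimal sketch of that standard definition, not the tool the authors used; the function name `n50` and the toy lengths are illustrative assumptions and are not taken from the P. c-album assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths.

    N50 is the length L such that sequences of length >= L
    together cover at least half of the total assembly length.
    """
    if not lengths:
        raise ValueError("lengths must be non-empty")
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical scaffold lengths (arbitrary units), for illustration only.
toy_lengths = [80, 70, 50, 40, 30, 20]
print(n50(toy_lengths))
```

Because N50 depends only on the length distribution, it says nothing about correctness of the assembled sequence, which is why it is reported alongside completeness and synteny checks.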
Keywords: Polygonia c-album; butterfly genome; comparative genomics; linkage map; quantitative annotation assessment
Year: 2021; PMID: 33749729; PMCID: PMC8140205; DOI: 10.1093/gbe/evab054
Source DB: PubMed; Journal: Genome Biol Evol; ISSN: 1759-6653; Impact factor: 3.416
Fig. 1. Pipeline for genome assembly and linkage map construction of the P. c-album genome. Details of the results from each step are indicated within each box.
Fig. 2. Genome validation and genetic diversity. (A) Ortholog homology ratio improved with the combination of RNA-Seq and protein data. Homology with B. mori proteins was greater for proteins predicted using the annotation trained on both RNA-Seq and protein data (blue) than for those predicted using the RNA-Seq-trained annotation (red) or the protein-trained annotation (yellow) alone. (B) Synteny between the M. cinxia genome (colored chromosomes) and the P. c-album genome (noncolored linkage groups).