| Literature DB >> 35022701 |
Olli-Pekka Smolander1,2, Daniel Blande3, Virpi Ahola3,4, Pasi Rastas1, Jaakko Tanskanen5, Juhana I Kammonen1, Vicencio Oostra3,6, Lorenzo Pellegrini1, Suvi Ikonen3, Tad Dallas7, Michelle F DiLeo3, Anne Duplouy3,8, Ilhan Cem Duru1, Pauliina Halimaa9, Aapo Kahilainen3, Suyog S Kuwar10,11, Sirpa O Kärenlampi9, Elvira Lafuente12, Shiqi Luo13, Jenny Makkonen9, Abhilash Nair3, Maria de la Paz Celorio-Mancera14, Ville Pennanen15, Annukka Ruokolainen3, Tarja Sundell1, Arja I Tervahauta9, Victoria Twort8, Erik van Bergen3, Janina Österman-Udd3, Lars Paulin1, Mikko J Frilander1, Petri Auvinen1, Marjo Saastamoinen3,16.
Abstract
BACKGROUND: The Glanville fritillary (Melitaea cinxia) butterfly is a model system for metapopulation dynamics research in fragmented landscapes. Here, we provide a chromosome-level assembly of the butterfly's genome produced from Pacific Biosciences sequencing of a pool of males, combined with a linkage map from population crosses.Entities:
Keywords: zzm321990 Melitaea cinxiazzm321990 ; Glanville fritillary; genome; spatial ecology
Mesh:
Year: 2022 PMID: 35022701 PMCID: PMC8756199 DOI: 10.1093/gigascience/giab097
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Assembly statistics
| Statistic |
|
|
| |
|---|---|---|---|---|
| Version 2 | Version 1 Scaffolds | |||
| Length (bp) | 484,462,241 | 389,907,520 | 460,334,017 | 349,759,982 |
| N (%) | <0.01 | 7.42 | 0.10 | 22.47 |
| Scaffold count | 31 | 8,261 | 696 | 2,969 |
| Longest scaffold (bp) | 22,190,643 | 668,473 | 21,465,692 | 15,427,984 |
| Scaffold N50 length (bp) | 17,331,753 | 119,328 | 16,796,068 | 12,597,868 |
| Scaffold N50 count (L50) | 13 | 970 | 13 | 13 |
| Contig count | 529 | 48,180 | 726 | 53,510 |
| Contig N50 length (bp) | 1,831,849 | 14,057 | 12,201,325 | 10,538 |
| Contig N50 count (L50) | 79 | 7,366 | 16 | 6,914 |
Assembly statistics were calculated for the M. cinxia Version 2 genome, M. cinxia Version 1 scaffolds, and B. mori using the assembly-stats program v 17.02 [30]. Statistics for H. melpomene v2.5 and P. napi v1.1 were obtained from LepBase [31].
Figure 1:An overview of the assembly and annotation process of the improved Glanville fritillary genome.
Figure 2:A dot-plot structural comparison of the H. melpomene genome against the M. cinxia v2 genome. The alignment was created using D-GENIES (1.2.0) [48]. The diagonal lines indicate the collinearity between the 2 species. The lack of collinearity in sex chromosomes is visible in the upper left corner between Mcnxia_v2 chr 01 and Hmel2.5 chr 21. The visible vertical lines show repeats that are resolved in Mcinxia_v2 but are present in all chromosomes in Hmel2.5_chr.
Evidence tracks that were used during the manual annotation of 1,232 M. cinxia genes
| Evidence track | Type | Description |
|---|---|---|
| Maker 1 | Gene prediction | Initial maker gene predictions based on EST alignments |
| Maker 2 | Gene prediction | Second round of gene predictions from EST alignments, protein alignments, and gene predictors trained on maker 1. |
| RNA-seq abdomen pool | RNA-seq alignment | RNA-seq reads aligned to the genome with STAR [ |
| RNA-seq mixed-tissue pool | RNA-seq alignment | |
|
| Protein alignment | Proteins sequences aligned to the genome with AAT |
|
| Protein alignment | |
|
| Protein alignment | |
|
| Protein alignment | |
| RNA-seq female larvae family 80 | RNA-seq alignment | RNA-seq reads aligned to the genome with STAR [ |
| RNA-seq female larvae family 70 | RNA-seq alignment | |
| RNA-seq female larvae family 119 | RNA-seq alignment | |
| RNA-seq female larvae family 120 | RNA-seq alignment | |
| RNA-seq male larvae family 80 | RNA-seq alignment | |
| RNA-seq male larvae family 119 | RNA-seq alignment |
Figure 3:A circos plot showing the orthologs between M. cinxia and H. melpomene. Orthologs between M. cinxia and H. melpomene were identified using OrthoFinder and filtered for 1-to-1 orthologs. The internal links in the circos plot indicate the orthologs between M. cinxia and H. melpomene. The links are coloured according to the M. cinxia chromosome.
BUSCO completeness estimates of the v2 genome based on the eukaryota, arthropoda, and metazoa gene sets
| Lineage | BUSCO Category, No. (%) | ||||
|---|---|---|---|---|---|
| Complete | Single-copy | Duplicated | Fragmented | Missing | |
| Eukaryota | 237 (93.0) | 234 (91.8) | 3 (1.2) | 9 (3.5) | 9 (3.5) |
| Arthropoda | 960 (94.8) | 946 (93.4) | 14 (1.4) | 16 (1.6) | 37 (3.6) |
| Metazoa | 905 (94.9) | 891 (93.4) | 14 (1.5) | 16 (1.7) | 33 (3.4) |
Figure 4:Relative amounts of different repeat classes in M. cinxia genome. Repeat classes and coverage of the M. cinxia genome v2: DNA: Class II; LINE: long interspersed nuclear elements; LTR: long terminal repeats; LOW_COMPLEXITY: low-complexity repeated DNA; RC: rolling circle elements (e.g., helitrons); SINE: short interspersed nuclear elements; Satellite: satellite DNA; SIMPLE_REPEAT: simple repeated motifs; EXON: exonic regions; UNCOVERED: rest of the chromosomes.