| Literature DB >> 34115123 |
Svitlana Lukicheva1, Jean-François Flot1, Patrick Mardulyn1.
Abstract
Coleoptera is the most species-rich insect order, yet is currently underrepresented in genomic databases. An assembly was generated for ca. 1.7-Gb genome of the leaf beetle Gonioctena quinquepunctata by first assembling long-sequence reads (Oxford Nanopore; ± 27-fold coverage) and subsequently polishing the resulting assembly with short sequence reads (Illumina; ± 85-fold coverage). The unusually large size (most Coleoptera species are associated with a reported size below 1 Gb) was at least partially attributed to the presence of a large fraction of repeated elements (73.8%). The final assembly was characterized by an N50 length of 432 kb and a BUSCO score of 95.5%. The heterozygosity rate was ±0.6%. Automated genome annotation informed by RNA-Seq resulted in 40,568 predicted proteins, which is much larger than the typical range 17,000-23,000 predicted for other Coleoptera. However, no evidence of a genome duplication was detected. This new reference genome will contribute to our understanding of genetic variation in the Coleoptera. Among others, it will also allow exploring reproductive barriers between species, investigating introgression in the nuclear genome, and identifying genes involved in resistance to extreme climate conditions.Entities:
Keywords: Chrysomelidae; de novo assembly; genome annotation; whole-genome sequence
Mesh:
Year: 2021 PMID: 34115123 PMCID: PMC8290105 DOI: 10.1093/gbe/evab134
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Summary of Assembly Statistics
| Assembly | Size (Mb) | 1,732 |
| Number of contigs | 10,033 | |
| Number of contigs >50 k | 5,755 | |
| Longest contig (Mb) | 3.03 | |
| Contig N50 | 4,32,124 | |
|
| 0 | |
| GC (%) | 34.61 | |
| BUSCO | Complete (%) | 95.5 |
| Complete duplicated (%) | 2 | |
| Fragmented (%) | 2.2 | |
| Missing (%) | 2.3 | |
| Repetitive elements | Total (%) | 66.09 |
| SINEs (%) | 0 | |
| LINEs (%) | 13.76 | |
| LTR (%) | 4.9 | |
| DNA transposons (%) | 11.98 | |
| Unclassified (%) | 42.22 | |
| Annotation | Predicted genes | 38,493 |
| Predicted proteins | 40,568 | |
| Functionally annotated | 19,357 | |
| Mean gene length | 15,141 | |
| Mean exon length | 267 | |
| Mean intron length | 6,479 | |
| Exons per gene | 3.53 | |
| Introns per gene | 2.53 |
Comparison of the genome characteristics of G. quinquepunctata with those of four other species of chrysomelid beetles (in bold), of nine other beetle species and one outgroup (Bombyx mori). A maximum-likelihood phylogenetic tree was estimated for these species from an amino-acid alignment of the 52 single-copy proteins found in all 15 genomes. 1a: Assembly lengths, in Mb. 1b: Total number of predicted proteins (in green), number of predicted single-copy proteins (in yellow) and number of predicted species-specific proteins (in red). 1c: ML tree; bootstrap support values indicated along interior branches.