| Literature DB >> 26227816 |
Martijn F L Derks1, Sandra Smit2, Lucia Salis3, Elio Schijlen4, Alex Bossers5, Christa Mateman6, Agata S Pijl6, Dick de Ridder7, Martien A M Groenen8, Marcel E Visser9, Hendrik-Jan Megens8.
Abstract
The winter moth (Operophtera brumata) belongs to one of the most species-rich families in Lepidoptera, the Geometridae (approximately 23,000 species). This family is of great economic importance as most species are herbivorous and capable of defoliating trees. Genome assembly of the winter moth allows the study of genes and gene families, such as the cytochrome P450 gene family, which is known to be vital in plant secondary metabolite detoxification and host-plant selection. It also enables exploration of the genomic basis for female brachyptery (wing reduction), a feature of sexual dimorphism in winter moth, and for seasonal timing, a trait extensively studied in this species. Here we present a reference genome for the winter moth, the first geometrid and largest sequenced Lepidopteran genome to date (638 Mb) including a set of 16,912 predicted protein-coding genes. This allowed us to assess the dynamics of evolution on a genome-wide scale using the P450 gene family. We also identified an expanded gene family potentially linked to female brachyptery, and annotated the genes involved in the circadian clock mechanism as main candidates for involvement in seasonal timing. The genome will contribute to Lepidopteran genomic resources and comparative genomics. In addition, the genome enhances our ability to understand the genetic and molecular basis of insect seasonal timing and thereby provides a reference for future evolutionary and population studies on the winter moth.Entities:
Keywords: Lepidoptera; circadian clock; cytochrome P450; phenology; sexual dimorphism; winter moth
Mesh:
Substances:
Year: 2015 PMID: 26227816 PMCID: PMC4558862 DOI: 10.1093/gbe/evv145
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
F—Male (left) and female (right) winter moth. The pictures show female brachyptery in winter moth.
Sequence Data: Raw and Preprocessed
| Library | Insert Size | Raw | Preprocessed | ||||||
|---|---|---|---|---|---|---|---|---|---|
| # Reads (Million) | # Bases (Gb) | Read Length | Coverage | # Reads (Million) | # Bases(Gb) | Read Length | Coverage | ||
| PE400_1 | −145 bp | 58.8 | 14.5 | 246.5 | 22.7 | 56.1 | 13.8 | 246.4 | 21.6 |
| PE400_2 | −193 bp | 31.2 | 7.8 | 248.2 | 12.1 | 29.7 | 7.4 | 248.2 | 11.5 |
| MP1-2 kb | 1–2 kb | 22.9 | 2.3 | 101 | 3.6 | 14.7 | 1.3 | 86.4 | 2.0 |
| MP3kb | 3 kb | 15.1 | 1.5 | 101 | 2.4 | 4.0 | 0.3 | 85.9 | 0.53 |
| MP4-5 kb | 4–5 kb | 10.4 | 1.1 | 101 | 1.6 | 1.7 | 0.1 | 82.3 | 0.21 |
| Total | 138.5 | 27.1 | 42.4 | 106.1 | 22.9 | 35.8 | |||
Lepidopteran Genome Properties and Validation
| Species | ||||||
|---|---|---|---|---|---|---|
| Genome assembly | ||||||
| Assembly size (Mb) | 638.2 | 431.7 | 248.6 | 269.7 | 394.0 | 393.3 |
| Number of scaffolds | 25,801 | 43,622 | 5,397 | 3,807 | 1,819 | 6,299 |
| N50 scaffold size (kb) | 65.6 | 3,717.0 | 715.6 | 277.0 | 737.2 | 258.3 |
| Min scaffold length (bp) | 502 | 11 | 300 | 124 | 1,842 | 1,512 |
| Max scaffold length (Mb) | 0.50 | 16.12 | 6.24 | 1.45 | 3.49 | 0.68 |
| Gene annotation | ||||||
| Protein-coding | 16,912 | 14,623 | 15,130 | 12,669 | 18,071 | 16,667 |
| InterPro domain | 11,793 | 9,892 | 10,034 | 9,803 | 12,877 | 8,529 |
| Genomic features | ||||||
| Repeat (%) | 53.5 | 43.6 | 10.2 | 24.9 | 34.0 | 27.9 |
| GC (%) | 38.6 | 38.8 | 31.6 | 32.8 | 38.0 | 32.6 |
| Coding (%) | 2.9 | 4.1 | 8.4 | 6.5 | 6.4 | 4.0 |
| Intron (%) | 17.7 | 16.3 | 28.1 | 25.8 | 30.8 | 30.5 |
| Gene length (bp) | 7,752 | 6,029 | 6,001 | 6,779 | 8,083 | 8,129 |
| tRNAs | 579 | 441 | 379 | 2,373 | 521 | 2,478 |
| Quality control | ||||||
| CEGMA partial (248) | 234 | 240 | 238 | 234 | 227 | 208 |
| Core | 46.6 | 47.7 | 54.0 | 51.2 | 48.8 | 41.6 |
| | 13.0 | 13.4 | 14.4 | 13.4 | 13.0 | 11.2 |
| | 17.9 | 17.8 | 18.3 | 17.8 | 17.7 | 15.4 |
aPercentage of genes of which more than 90% of the sequence is found on a single scaffold.
F—Lepidoptera P450 gene family tree. The tree shows species-specific P450 gene family expansions in six Lepidoptera, with large winter moth expansions in CYP3 and CYP4. Branch colors represent the Cytochrome P450 clades: CYP2 (blue), CYP3 (yellow), CYP4 (red), CYP-Mito (green).
F—Phylogenetic tree of the four rdx(-like) orthology groups. Orthology group OG318 (yellow) contains 25 copies of an O. brumata specific rdx-like gene. The Drosophila ortholog is in group OG2662. All genes have a BZB-POZ domain to interact with CUL3 except for OG6733. Only proteins in group OG2662 also contain the MATH domain.
F—Circadian clock mechanism in Lepidoptera. (A) List of clock genes, identified in the winter moth genome. (B) The mechanism comprises the core transcriptional/translational feedback loop (left) and the modulatory feedback loop (right). Clock (Clk) and Cycle (Cyc) form a heterodimer and drive the transcription of period (Per), timeless (Tim), and cryptrochrome 2 (Cry2) by binding their upstream E-box elements. Cry2 is responsible for the inhibition of Clk:Cyc transcription, that is, its own activators. Per and Tim bind through a PAS domain. Casein kinase II (CkII), discs overgrown (Dbt), and the protein phosphatase 2A (Pp2A) are involved in the posttranslational modifications and activation of Per and Tim. Jetleg (Jet) is responsible for Tim degradation, modulated by Cry1, activated by light. Slimb signals Per degradation. In insects, Per and Tim are transcribed mostly during the night. Vri (negative) and Pdp1 (positive) regulate the expression of Clk in the secondary feedback loop.