| Literature DB >> 32161089 |
Kumar Saurabh Singh1, David J Hosken2, Nina Wedell2, Richard Ffrench-Constant2, Chris Bass2, Simon Baxter3, Konrad Paszkiewicz4, Manmohan D Sharma1.
Abstract
Meadow brown butterflies (Maniola jurtina) on the Isles of Scilly represent an ideal model in which to dissect the links between genotype, phenotype and long-term patterns of selection in the wild - a largely unfulfilled but fundamental aim of modern biology. To meet this aim, a clear description of genotype is required. Here we present the draft genome sequence of M. jurtina to serve as a founding genetic resource for this species. Seven libraries were constructed using pooled DNA from five wild caught spotted females and sequenced using Illumina, PacBio RSII and MinION technology. A novel hybrid assembly approach was employed to generate a final assembly with an N50 of 214 kb (longest scaffold 2.9 Mb). The sequence assembly described here predicts a gene count of 36,294 and includes variants and gene duplicates from five genotypes. Core BUSCO (Benchmarking Universal Single-Copy Orthologs) gene sets of Arthropoda and Insecta recovered 90.5% and 88.7% complete and single-copy genes respectively. Comparisons with 17 other Lepidopteran species placed 86.5% of the assembled genes in orthogroups. Our results provide the first high-quality draft genome and annotation of the butterfly M. jurtina.Entities:
Keywords: Genome assembly; Lepidoptera; Maniola jurtina; comparative genomics; meadow brown
Mesh:
Year: 2020 PMID: 32161089 PMCID: PMC7202024 DOI: 10.1534/g3.120.401071
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1Female Maniola jurtina (picture credit: Richard ffrench-Constant).
Figure 2Schematic overview of the workflow used for sequencing, genome size estimation, assembly and annotation of the M. jurtina genome. 1. An artist’s impression of a female M. jurtina (samples collected from multiple fields and processed for DNA extraction); 2. Multiple sequencing approaches adopted along with genome characterization using genome scope; 3. Genome assembly using a hybrid approach; 4. Genome completeness assessment; 5. De novo genes prediction and repeat detection; 6. Functional annotation; 7. Comparative analysis. Note that transcriptome data (orange segment) were obtained from publicly available sources at NCBI and only used for genome annotation.
jurtina genome properties
| Properties | Genome |
|---|---|
| # scaffolds (> 1000 bp) | 10,860 |
| Total length (>= 1000 bp) | 618,415,580 |
| Largest scaffold | 2,944,739 |
| Total length | 618,415,580 |
| GC (%) | 36.90 |
| N50 | 214,423 |
| N75 | 78,459 |
| L50 | 658 |
| L75 | 1,875 |
| # N’s per 100 kbp | 8,864.86 |
Candidate wing spot patterning genes obtained from a literature search are listed in column 1. Column 2 has the number of annotated orthologs across Lepidoptera in our NCBI protein database search using the full gene name as listed and alternate names (comma separated). Column 3 presents the number of proteins per gene that matched in our Exonerate workflow
| Candidate gene | NCBI Protein Matches within Lepidoptera | protein2genome matches against the | |
|---|---|---|---|
| 1 | 1 | 0 | |
| 2 | 18 | 4 | |
| 3 | 25 | 4 | |
| 4 | 5 | 0 | |
| 5 | 32 | 7 | |
| 6 | 205 | 191 | |
| 7 | 26 | 0 | |
| 8 | 145 | 86 | |
| 9 | 47 | 1 | |
| 10 | 204 | 101 | |
| 11 | 4 | 2 | |
| 12 | 101 | 33 | |
| 13 | 19 | 0 | |
| 14 | 26 | 10 | |
| 15 | 39 | 6 | |
| 16 | 15 | 0 | |
| 17 | 16 | 3 | |
| 18 | 32 | 22 | |
| 19 | 117 | 72 | |
| 20 | 40 | 0 | |
| 21 | 5 | 3 | |
| 22 | 58 | 0 | |
| 23 | 2 | 0 | |
| 24 | 65 | 2 | |
| 25 | 1 | 1 | |
| 26 | 74 | 23 | |
| 27 | 2 | 0 | |
| 28 | 0 | — | |
| 29 | 1 | 1 | |
| 30 | 23 | 1 |
Different assembly versions, data, software used and summary statistics
| Version | Data | Assembler | N50 | #Sequences | Total length |
|---|---|---|---|---|---|
| 1 | Short-read PE | Spades | 48,073 | 53,043 | 319,930,151 |
| 2 | Long-reads (PacBio + MinION) | Canu | 32,954 | 10,463 | 296,564,618 |
| 3 | Version 1 + Long-read (PacBio + Minion) | DBG2OLC | 60,269 | 46,361 | 317,966,984 |
| 4 | Version 2 + 3 | QuickMerge | 92,579 | 30,457 | 762,970,634 |
| 5 | Version 4 + PE + MP + PacBio + MinION | Redundans | 213,669 | 10,863 | 616,464,047 |
Figure 3Evolutionary and comparative genomic analysis. (A) Ortholog analysis of M. jurtina with 16 other arthropod species. SC indicates common orthologs with the same number of copies in different species, MC indicates common orthologs with different copy numbers in different species, UP indicates species specific paralogs, UC indicates all genes which were not assigned to a gene family, MS indicates moths specific genes and BS indicates butterfly specific genes. (B) Species phylogenetic tree and gene family evolution. Numbers on the node indicate counts of the gene families that are expanding (green), contracting (red) and rapidly evolving (blue).
Figure 4Genome comparisons Comparison of the Maniola jurtina genome with Heliconious melpomene and Bicyclus anynana. The dot plots were generated using Mummer. The plots show relatedness of M. jurtina with (A) H. melpomene and (B) B. anynana. Both of these genomes were taken as references (x-axis) and queried using M. jurtina (y-axis) genome. In both plots, blue, green and orange colored dots represent the unique forward, unique reverse and repetitive alignments respectively. Plot B shows more consistent and contiguous alignments than plot A. The dot plots were generated using https://dnanexus.github.io/dot/.