| Literature DB >> 32428241 |
Sarah Bouchemousse1, Laurent Falquet1,2, Heinz Müller-Schärer1.
Abstract
Rapid evolution of weed biological control agents (BCAs) to new biotic and abiotic conditions is poorly understood and so far only little considered both in pre-release and post-release studies, despite potential major negative or positive implications for risks of nontargeted attacks or for colonizing yet unsuitable habitats, respectively. Provision of genetic resources, such as assembled and annotated genomes, is essential to assess potential adaptive processes by identifying underlying genetic mechanisms. Here, we provide the first sequenced genome of a phytophagous insect used as a BCA, that is, the leaf beetle Ophraella communa, a promising BCA of common ragweed, recently and accidentally introduced into Europe. A total 33.98 Gb of raw DNA sequences, representing ∼43-fold coverage, were obtained using the PacBio SMRT-Cell sequencing approach. Among the five different assemblers tested, the SMARTdenovo assembly displaying the best scores was then corrected with Illumina short reads. A final genome of 774 Mb containing 7,003 scaffolds was obtained. The reliability of the final assembly was then assessed by benchmarking universal single-copy orthologous genes (>96.0% of the 1,658 expected insect genes) and by remapping tests of Illumina short reads (average of 98.6 ± 0.7% without filtering). The number of protein-coding genes of 75,642, representing 82% of the published antennal transcriptome, and the phylogenetic analyses based on 825 orthologous genes placing O. communa in the monophyletic group of Chrysomelidae, confirm the relevance of our genome assembly. Overall, the genome provides a valuable resource for studying potential risks and benefits of this BCA facing environmental novelties.Entities:
Keywords: zzm321990 Ophraella communazzm321990 ; Chrysomelidae; SMRT-cell sequencing; biological control agent; de novo assembly; whole-genome sequence
Mesh:
Year: 2020 PMID: 32428241 PMCID: PMC7486951 DOI: 10.1093/gbe/evaa102
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Summary of Global Statistics of the Ophraella communa genome assembly and the three other Chrysomelid Genome Assemblies Available in Online Databases (i.e., GenBank-NCBI, ENA, i5k Workspace, and InsectBase)
| Species | Total Sequence Length | Sequencing Technology | Coverage | Number of Scaffolds | Scaffold N50 (bp) | Number of Contigs | Contig N50 (bp) | GC% | Protein Count | % of Transposable Elements | Accession Number | References |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| 774,411,302 | PacBio Sequel II | 43.3× | 7,003 | 195,463 | – | – | 31.9% | 75,642 | 58.2 | GCA_902651945 | Present study |
|
| 641,992,784 | Illumina HiSeq | 52.3× | 26,908 | 139,046 | 45,556 | 46,596 | 35.6% | 19,038 | 16.9 | GCA_000500325.2 |
|
|
| 2,418,073,815 | Illumina HiSeq | 50.0× | 87,712 | 489,108 | 585,680 | 6,238 | 36.5% | 28,061 | – | GCA_003013835.2 | Unpublished |
|
| 1,007,816,681 | PacBio Sequel I or II | 32.0× | 15,778 | 212,245 | – | – | 37.7% | 21,264 | 63.7 | GCA_900659725.1 |
|
Fig. 1.Summary of quality assessments and analysis of the Ophraella communa genome assembly: (A) Completeness of the present genome assembly and the three Chrysomelid genomes, assessed by the recovery of 1,658 insect benchmarking genes using BUSCO; (B) Remapping tests performed on paired-ends (PE) reads after trimming, to assess the reliability of the present genome assembly for population genomic studies of O. communa, on all the samples and for each population; (C) Taxonomic distribution of the 75,642 genes identified in the proteome of O. communa using OrthoDB; (D) Maximum-likelihood tree (–ln L = –6,269,795.57) built on 825 universal single-copy orthologous proteins among 12 Polyphaga beetles and one lepidoptera (Bombyx mori) used as an outgroup, numbers next to nodes indicate internode certainty scores based on 100 bootstrap replicates. The color of dotes at nodes corresponds to the taxonomic color codes in the bar plot (C).