| Literature DB >> 27289092 |
Jan M Meyer1, Gabriel V Markov2, Praveen Baskaran1, Matthias Herrmann1, Ralf J Sommer1, Christian Rödelsperger3.
Abstract
Beetles represent the largest insect order and they display extreme morphological, ecological and behavioral diversity, which makes them ideal models for evolutionary studies. Here, we present the draft genome of the scarab beetle Oryctes borbonicus, which has a more basal phylogenetic position than the two previously sequenced pest species Tribolium castaneum and Dendroctonus ponderosae providing the potential for sequence polarization. Oryctes borbonicus is endemic to La Réunion, an island located in the Indian Ocean, and is the host of the nematode Pristionchus pacificus, a well-established model organism for integrative evolutionary biology. At 518 Mb, the O. borbonicus genome is substantially larger and encodes more genes than T. castaneum and D. ponderosae We found that only 25% of the predicted genes of O. borbonicus are conserved as single copy genes across the nine investigated insect genomes, suggesting substantial gene turnover within insects. Even within beetles, up to 21% of genes are restricted to only one species, whereas most other genes have undergone lineage-specific duplications and losses. We illustrate lineage-specific duplications using detailed phylogenetic analysis of two gene families. This study serves as a reference point for insect/coleopteran genomics, although its original motivation was to find evidence for potential horizontal gene transfer (HGT) between O. borbonicus and P. pacificus The latter was previously shown to be the recipient of multiple horizontally transferred genes including some genes from insect donors. However, our study failed to provide any clear evidence for additional HGTs between the two species.Entities:
Keywords: Coleoptera; Scarabaeoidea; cytochrome P450; glutathione S-transferase
Mesh:
Year: 2016 PMID: 27289092 PMCID: PMC4987105 DOI: 10.1093/gbe/evw133
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Fig. 1.—Conserved and lineage-specific patterns of gene content evolution. (A) Schematic phylogeny of investigated insect genomes (Hunt et al. 2007; Trautwein 2012) and distribution of genes in different orthology classes. (B) Amount of coding sequence in different orthology classes. (C–E) Protein domain (PFAM) count comparison between all three beetle genomes. Large protein domain families that show the most extreme differences in gene counts are labeled in each comparison.
Genome statistics
| Genome Assembly | De Novo Transcriptome | Evidence-Based Annotation | Gene Prediction | ||
|---|---|---|---|---|---|
| Scaffolds | Contigs | Trinity | MAKER2 | AUGUSTUS | |
| Total size (Mb) | 517.9 | 426.3 | 27.2 | 16.6 | 25.2 |
| N sequences | 150,243 | 30,471 | 18,177 | 20,504 | 23,278 |
| Largest (kb) | 1,101 | 457 | 16 | 32.4 | 41 |
| Smallest (kb) | 0.1 | 0.1 | 0.5 | <0.1 | <0.1 |
| N50 (kb) | 104.8 | 33.1 | 2.0 | 1.5 | 1.8 |
| N sequences (length>N50) | 1,365 | 3,590 | 3,915 | 3,168 | 3,712 |
Note.—Assembly features such as size and number of sequences were collected for the raw Contig assembly, scaffolded genome, and three types of gene annotations. Please note that arbitrary minimum cutoffs were used by the different programs.
Fig. 2.—A maximum-likelihood tree of beetle cytosolic GSTs. The tree is rooted with sequences from Drosophila and Apis, and was calculated under the LG+G model. A linear version is available in supplementary figure S8, Supplementary Material online.
Fig. 3.—A maximum-likelihood tree of insect CYPs. The tree was calculated under the LG+G model, and is rooted by CYP51 sequences. A linear version of the tree is available in supplementary figure S9, Supplementary Material online.