Literature DB >> 33365580

New mitochondrial genomes of 39 soil dwelling Coleoptera from metagenome sequencing.

Carmelo Andújar1,2,3, Paula Arribas1,2,3, Michal Motyka4, Mathew Bocek4, Ladislav Bocak4, Benjamin Linard2,5, Alfried P Vogler2,3.   

Abstract

High-throughput DNA methods hold great promise for the study of the hyperdiverse arthropod fauna of the soil. We used the mitochondrial metagenomic approach to generate 39 mitochondrial genomes from adult and larval specimens of Coleoptera collected from soil samples. The mitogenomes correspond to species from the families Carabidae (6), Chrysomelidae (1), Curculionidae (9), Dermestidae (1), Elateridae (1), Latridiidae (1), Scarabaeidae (3), Silvanidae (1), Staphylinidae (12), and Tenebrionidae (4). All the mitogenomes followed the putative ancestral gene order for Coleoptera. We provide the first available mitogenome for 30 genera of Coleoptera, including endogean representatives of the genera Torneuma, Coiffaitiella, Otiorhynchus, Oligotyphlopsis, and Typhlocharis.
© 2019 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.

Entities:  

Keywords:  Coleoptera; endogean; mitochondrial metagenomics; next-generation sequencing; soil

Year:  2019        PMID: 33365580      PMCID: PMC7687447          DOI: 10.1080/23802359.2019.1637289

Source DB:  PubMed          Journal:  Mitochondrial DNA B Resour        ISSN: 2380-2359            Impact factor:   0.658


The mitochondrial metagenomics approach (MMG) provides a cost-effective method for sequencing mitochondrial genomes from numerous species (Andújar et al. 2015; Crampton-Platt et al. 2015). Total genomic DNA from multiple specimens, either extracted individually or in bulk, is shotgun sequenced in a metagenomics mixture, followed by assembly with standard genomic assemblers, from which whole mitochondrial genomes emerge preferentially due to their high copy number relative to most of the nuclear genome. This ‘genome skimming’ approach was used to sequence the mitogenomes of beetle specimens collected from soil samples of the southern Iberian Peninsula at Sierra de Grazalema (36.7N, −5.4W), Sierra de Cabra (37.4N, −4.3W) and Sierra Madrona (38.4N, −4.3W) (see Andújar et al. 2015) by following the ‘Floatation-Berlese-Floatation’ (FBF) protocol (Arribas et al. 2016). Briefly, aliquots of the DNA extracts from 1494 specimens (vouchered at the Natural History Museum London) were pooled to generate 3 pools with roughly equimolar DNA concentration per specimen, after the dsDNA concentration of extracts was measured (Qubit 2.0 Fluorometer, Life Technologies Corp., Carlsbad, CA). Further, TruSeq DNA libraries were constructed and sequenced in the Illumina MiSeq platform (Illumina Inc., San Diego, CA) (2 × 300 bp; 800–950 bp insert size). The output was processed and assembled in three assemblers as indicated in Andújar et al. (2017). The resulting contigs were subjected to super-assembly in Geneious 7.1.9 (http://www.geneious.com) using the de novo assembly function and showed wide overlap. The procedure resulted in more than 200 mitogenomes longer than 5000 bp, of which 39 were selected for further annotation and identification to species or genus level. Thirty-four of these include the full set of protein-coding, rRNA and tRNA genes (>15,000 bp), of which 17 were complete circular mitogenomes. The remaining 17 were not circularized due to difficulties with the assembly of the control region. Five additional mitogenomes were incomplete by the lack of one or two genes (sequence length between 12,221 and 14,453 bp). The mitogenomes were annotated using gene predictions with MITOS (Bernt et al. 2013) and manually refined in Geneious. All mitogenomes were structured following the putatively ancestral gene order for the Coleoptera. Mitogenomes assembled from the shotgun mixture were linked to particular specimens using the cox1 barcode sequences obtained from the same specimens with PCR-Sanger sequencing. For those cases where Sanger sequencing failed (5/39), validation was performed by unambiguous match to the species level on BOLD Public Data Portal (Ratnasingham and Hebert 2007, accessed on 20th March 2019). The mitogenomes correspond to 39 different species from the families Carabidae (6), Chrysomelidae (1), Curculionidae (9), Dermestidae (1), Elateridae (1), Latridiidae (1), Scarabaeidae (3), Silvanidae (1), Staphylinidae (12) and Tenebrionidae (4), and include representatives from 37 genera. For 30 of these genera, we provide the first available mitogenome and only two species (Oryzaephilus surinamensis and Hypera postica) have an available mitogenome. The new mitogenomes include endogean representatives of the genera Torneuma, Coiffaitiella, Otiorhynchus, Oligotyphlopsis, and Typhlocharis. For further details on specimens and mitogenomes see Figure 1, Tables 1 and 2, and GenBank Accession Numbers.
Figure 1.

Phylogenetic tree from maximum-likelihood analysis showing the relationships of the 39 newly generated mitogenomes. Circles in branch tips indicate the locality where each specimen was collected (Sierra de Grazalema: black; Sierra de Cabra: grey and Sierra Madrona: white). Shaded frames according with beetle families. GenBank accession numbers are in brackets.

Table 1.

Additional data for the 39 mitogenomes of Coleoptera in this study.

GB accessionVoucher code*FamilySpeciesFG**FSP***Life stageIdentification
MK692552BMNH 1041149CarabidaeMicrolestes mauritanicusxxAdultJ.L. Lencina det.
MK692553BMNH 1042258TenebrionidaeOochrotus unicolorxxAdultJ.L. Lencina det.
MK692554BMNH 1041892CurculionidaeTorneuma sp.xxAdultC. Hernando det.
MK692556BMNH 1044019StaphylinidaeAchenium seditiosumxxAdultV. Assing det.
MK692557BMNH 1041971CarabidaeTyphlocharis sp.xxAdultC. Andújar det.
MK692559BMNH 1041157CarabidaeMicrolestes reitterixxAdultJ.L. Lencina det.
MK692560BMNH 1042672StaphylinidaeOthius myrmecophilus xLarvaBOLD match > 99%
MK692567BMNH 1042021StaphylinidaeOligotyphlopsis sp.xxAdultC. Hernando det.
MK692568BMNH 1042062CurculionidaeTychius pusillusxxAdultBOLD match > 99%
MK692574BMNH 1042482CarabidaeTrechus ObtususxxLarvaBOLD match > 99%
MK692579BMNH 1041943StaphylinidaeTachyporus nitidulusxxAdultV. Assing det.
MK692585BMNH 1041967ElateridaeCardiophorus signatusxxAdultJ.L. Lencina det.
MK692586BMNH 1041911CurculionidaeCoiffaitiella sp.xxAdultC. Hernando det.
MK692587BMNH 1043732CurculionidaeElliptacalles longusxxAdultBOLD match > 99%
MK692591BMNH 1041150ScarabaeidaeAmmoecius elevatusxxAdultJ.L. Lencina det.
MK692592BMNH 1041990CurculionidaeTorneuma sp.xxAdultC. Hernando det.
MK692593BMNH 1042238TenebrionidaeScaurus uncinusxxAdultJ.L. Lencina det.
MK692597BMNH 1043977StaphylinidaeAtheta sp.  AdultV. Assing det.
MK692599NAStaphylinidaeOcypus aethiops xNABOLD match > 99%
MK692601BMNH 1042249StaphylinidaeMedon sp.  AdultV. Assing det.
MK692603BMNH 1042190StaphylinidaeMicrillus testaceusxxAdultV. Assing det.
MK692605BMNH 1042074CurculionidaeHypera postica  AdultBOLD match > 99%
MK692606BMNH 1042031TenebrionidaeCnemeplatia atroposxxAdultJ.L. Lencina det.
MK692607BMNH 1042209ScarabaeidaePleurophorus caesusxxAdultJ.L. Lencina det.
MK692609BMNH 1041982CurculionidaeProtapion trifoliixxAdultBOLD match > 99%
MK692616BMNH 1041162StaphylinidaeGeostiba sp.xxAdultV. Assing det.
MK692625NAChrysomelidaeCryptophagus pilosusxxNABOLD match > 99%
MK692626BMNH 1042569CarabidaeCalathus granatensis xLarvaBOLD match > 99%
MK692638NAStaphylinidaeLomechusa pubicollisxxNABOLD match > 99%
MK692642BMNH 1041893SilvanidaeOryzaephilus surinamensis  AdultJ.L. Lencina det.
MK692645NACurculionidaeEchinodera andalusiensisxxNABOLD match > 99%
MK692646BMNH 1042067CurculionidaeOtiorhynchus sp.xxAdultC. Hernando det.
MK692648BMNH 1042068ScarabaeidaeEsymus pusillusxxAdultJ.L. Lencina det.
MK692661BMNH 1042524StaphylinidaeAnotylus inustusxxLarvaBOLD match > 99%
MK692677BMNH 1041924LatridiidaeCorticaria sp.xxAdultJ.L. Lencina det.
MK692678BMNH 1042175DermestidaeThorictus sp.xxAdultJ.L. Lencina det.
MK692681NACarabidaeSyntomus foveatusxxNABOLD match > 99%
MK692702BMNH 1042255StaphylinidaeMocyta fungixxAdultBOLD match > 99%
MK692707BMNH 1042182TenebrionidaeCentorus elongatusxxAdultJ.L. Lencina det.

Genbank Accession Numbers, voucher codes, taxonomic identification, life stage, and information on whether the provided mitogenomes are the first available for the genus (FG column) and for the species (FSP column).

All mitogenomes were obtained by bulk sequencing of a mix of specimens. Voucher code refers to the specimen with PCR-Sanger sequencing that matches (100% similarity) the obtained mitogenomes. Vouchers are not available for mitogenomes if PCR-Sanger sequencing failed for a particular specimen.

FG: Marked with ‘x’ if the mitogenome is the first available for the genus.

FSP: Marked with ‘x’ if the mitogenome is the first available for the species.

Table 2.

Sampling localities for the 39 mitogenomes of Coleoptera in this study.

GB accessionLocality*Latitude (N)Longitude (W)Altitude (Meters)Habitat
MK692552La Dehesilla, Benaocaz, Cádiz, Spain36.7074−5.4570480Olea europaea field
MK692553N-420 km 105, Fuencaliente, Ciudad Real, Spain38.4445−4.3247730Grassland-riverside
MK692554Arroyo del Espino, El Bosque, Cádiz, Spain36.7613−5.5069275Riverside
MK692556Nava de Cabra, Cabra, Córdoba, Spain37.4856−4.3634995Grassland
MK692557La Dehesilla, Benaocaz, Cádiz, Spain36.7074−5.4567470Grassland
MK692559La Dehesilla, Benaocaz, Cádiz, Spain36.7074−5.4570480Olea europaea field
MK692560Robledo de las Hoyas, Fuencaliente, Ciudad Real, Spain38.4371−4.3413950Quercus faginea forest
MK692567Arroyo del Bosque, El Bosque, Cádiz, Spain36.7667−5.5011290Riverside
MK692568Llanos del Republicano, Villaluenga del Rosario, Cádiz, Spain36.6817−5.3574810Grassland
MK692574Llanos del Republicano, Villaluenga del Rosario, Cádiz, Spain36.6907−5.3639925Quercus suber forest
MK692579Huerta Hedionda, Tavizna, Benaocaz, Cádiz, Spain36.7192−5.4850360Olea europaea field
MK692585Colada de la Breña, Benaocaz, Cádiz, Spain36.7070−5.4704430Quercus suber forest
MK692586El Pinsapar, Puerto del Pinar, Grazalema, Cádiz, Spain36.7726−5.42401115Abies pinsapo forest
MK692587Puerto del Boyar, Grazalema, Cádiz, Spain36.7536−5.39391120Grassland
MK692591La Dehesilla, Benaocaz, Cádiz, Spain36.7074−5.4570480Olea europaea field
MK692592Arroyo del Bosque, El Bosque, Cádiz, Spain36.7667−5.5011290Riverside
MK692593Cortijo del Navazuelo, Carcabuey, Córdoba, Spain37.4852−4.34121035Grassland
MK692597Ermita Nta. Sra. de la Sierra, Cabra, Córdoba, Spain37.4905−4.38131145Pinus halepensis
MK692599Sierra de Cabra, Córdoba, SpainNANANANA
MK692601Ladera de la Casa de Cipriano, Fuencaliente, Ciudad Real, Spain38.4190−4.3138765Quercus suber forest
MK692603Nava de Cabra, Cortijo de los Benítez, Cabra, Córdoba, Spain37.4856−4.3634995Grassland
MK692605Casa de la Viñuela, Cabra, Córdoba, Spain37.4852−4.38611020Quercus faginea forest
MK692606Llanos del Republicano, Villaluenga del Rosario, Cádiz, Spain36.6857−5.3648910Quercus ilex forest
MK692607Arroyo del Palancar, Carcabuey, Córdoba, Spain37.4628−4.2676525Riverside
MK692609Arroyo del Bosque, El Bosque, Cádiz, Spain36.7667−5.5011290Riverside
MK692616Llanos del Campo, Benamahoma, Cádiz, Spain36.7556−5.4556642Quercus ilex forest
MK692625Sierra de Grazalema, Cádiz, SpainNANANANA
MK692626Llanos del Republicano, Villaluenga del Rosario, Cádiz, Spain36.6907−5.3639925Quercus suber forest
MK692638Sierra Madrona, Ciudad Real, SpainN.A.N.A.N.A.N.A.
MK692642El Boyar, Cortijo del Santo, Grazalema, Cádiz, Spain36.7549−5.4194920Quercus ilex forest
MK692645Sierra de Grazalema, Cádiz, SpainNANANANA
MK692646Ermita Nta. Sra. de la Sierra, Cabra, Córdoba, Spain37.4905−4.38131145Pinus halepensis
MK692648Nava de Cabra, Cabra, Córdoba, Spain37.5067−4.3671968Quercus ilex forest
MK692661Camino Viejo a la Ermita, Cabra, Córdoba, Spain37.4811−4.3885970Grassland
MK692677Arroyo del Espino, El Bosque, Cádiz, Spain36.7613−5.5069275Riverside
MK692678Cortijo del Navazuelo, Carcabuey, Córdoba, Spain37.4852−4.34121035Grassland
MK692681Sierra de Grazalema, Cádiz, SpainNANANANA
MK692702Collado del Navazuelo, Carcabuey, Córdoba, Spain37.4801−4.3347995Olea europaea field
MK692707Nava de Cabra, Cortijo de los Benítez, Cabra, Córdoba, Spain37.4856−4.3634995Grassland

All mitogenomes were obtained by bulk sequencing of a mix of specimens, and voucher codes to particular specimens assigned by the PCR-Sanger sequencing that matches (100% similarity) the obtained mitogenomes (see Table 1). Metagenomes not linked to a particular vouchered specimen are assigned to a locality but lack detailed information (precise coordinates, altitude, and habitat).

Phylogenetic tree from maximum-likelihood analysis showing the relationships of the 39 newly generated mitogenomes. Circles in branch tips indicate the locality where each specimen was collected (Sierra de Grazalema: black; Sierra de Cabra: grey and Sierra Madrona: white). Shaded frames according with beetle families. GenBank accession numbers are in brackets. Additional data for the 39 mitogenomes of Coleoptera in this study. Genbank Accession Numbers, voucher codes, taxonomic identification, life stage, and information on whether the provided mitogenomes are the first available for the genus (FG column) and for the species (FSP column). All mitogenomes were obtained by bulk sequencing of a mix of specimens. Voucher code refers to the specimen with PCR-Sanger sequencing that matches (100% similarity) the obtained mitogenomes. Vouchers are not available for mitogenomes if PCR-Sanger sequencing failed for a particular specimen. FG: Marked with ‘x’ if the mitogenome is the first available for the genus. FSP: Marked with ‘x’ if the mitogenome is the first available for the species. Sampling localities for the 39 mitogenomes of Coleoptera in this study. All mitogenomes were obtained by bulk sequencing of a mix of specimens, and voucher codes to particular specimens assigned by the PCR-Sanger sequencing that matches (100% similarity) the obtained mitogenomes (see Table 1). Metagenomes not linked to a particular vouchered specimen are assigned to a locality but lack detailed information (precise coordinates, altitude, and habitat). For the 39 newly generated mitogenomes, the 13 protein-coding genes (PCGs) were extracted using Geneious and individually aligned using the FFT-NS-i-x2 algorithm of MAFFT (Katoh et al. 2002). Individual gene alignments were trimmed and concatenated to get a final dataset of 39 taxa and 12,940 bp. This alignment was used for phylogenetic inference using Maximum-likelihood in IQ-TREE (Nguyen et al. 2015), performed through the IQ-TREE web server (Trifinopoulos et al. 2016) without data partitioning, allowing the software to determine the best-fit substitution model and estimating an ultrafast bootstrap approximation with 10,000 replicates. The obtained tree showed the expected relationships among the families within Coleoptera, including the monophyly of the suborders Adephaga and Polyphaga and the monophyly of all families (with more than one mitogenome) (Figure 1).
  8 in total

1.  MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform.

Authors:  Kazutaka Katoh; Kazuharu Misawa; Kei-ichi Kuma; Takashi Miyata
Journal:  Nucleic Acids Res       Date:  2002-07-15       Impact factor: 16.971

2.  Phylogenetic community ecology of soil biodiversity using mitochondrial metagenomics.

Authors:  Carmelo Andújar; Paula Arribas; Filip Ruzicka; Alex Crampton-Platt; Martijn J T N Timmermans; Alfried P Vogler
Journal:  Mol Ecol       Date:  2015-05-14       Impact factor: 6.185

3.  W-IQ-TREE: a fast online phylogenetic tool for maximum likelihood analysis.

Authors:  Jana Trifinopoulos; Lam-Tung Nguyen; Arndt von Haeseler; Bui Quang Minh
Journal:  Nucleic Acids Res       Date:  2016-04-15       Impact factor: 16.971

4.  The mitochondrial genome of Iberobaenia (Coleoptera: Iberobaeniidae): first rearrangement of protein-coding genes in the beetles.

Authors:  Carmelo Andujar; Paula Arribas; Benjamin Linard; Robin Kundrata; Ladislav Bocak; Alfried P Vogler
Journal:  Mitochondrial DNA A DNA Mapp Seq Anal       Date:  2016-05-23       Impact factor: 1.514

5.  MITOS: improved de novo metazoan mitochondrial genome annotation.

Authors:  Matthias Bernt; Alexander Donath; Frank Jühling; Fabian Externbrink; Catherine Florentz; Guido Fritzsch; Joern Pütz; Martin Middendorf; Peter F Stadler
Journal:  Mol Phylogenet Evol       Date:  2012-09-07       Impact factor: 4.286

6.  IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies.

Authors:  Lam-Tung Nguyen; Heiko A Schmidt; Arndt von Haeseler; Bui Quang Minh
Journal:  Mol Biol Evol       Date:  2014-11-03       Impact factor: 16.240

7.  bold: The Barcode of Life Data System (http://www.barcodinglife.org).

Authors:  Sujeevan Ratnasingham; Paul D N Hebert
Journal:  Mol Ecol Notes       Date:  2007-05-01

8.  Soup to Tree: The Phylogeny of Beetles Inferred by Mitochondrial Metagenomics of a Bornean Rainforest Sample.

Authors:  Alex Crampton-Platt; Martijn J T N Timmermans; Matthew L Gimmel; Sujatha Narayanan Kutty; Timothy D Cockerill; Chey Vun Khen; Alfried P Vogler
Journal:  Mol Biol Evol       Date:  2015-05-08       Impact factor: 16.240

  8 in total
  1 in total

1.  Mitogenomics of the Olive Seed Weevil, Anchonocranus oleae Marshall and Implications for Its Phylogenetic Position in Curculionidae.

Authors:  Samuel J Smit; Elleunorah Allsopp; Zwannda Nethavhani; Virgilio Caleca; Rolf G Oberprieler; Barbara van Asch
Journal:  Insects       Date:  2022-07-06       Impact factor: 3.139

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.