Literature DB >> 35187268

The genome sequence of the bishop's mitre shieldbug, Aelia acuminata (Linnaeus, 1758).

Liam Crowley1, Maxwell V L Barclay2.   

Abstract

We present a genome assembly from an individual male Aelia acuminata (the bishop's mitre shieldbug; Arthropoda; Insecta; Hemiptera; Pentatomidae). The genome sequence is 1,170 megabases in span. The majority of the assembly (99.78%) is scaffolded into 8 chromosomal pseudomolecules, with the X and Y sex chromosome assembled. Copyright:
© 2021 Crowley L et al.

Entities:  

Keywords:  Aelia acuminata; Hemiptera; bishop’s mitre shieldbug; chromosomal; genome sequence

Year:  2021        PMID: 35187268      PMCID: PMC8817068          DOI: 10.12688/wellcomeopenres.17400.1

Source DB:  PubMed          Journal:  Wellcome Open Res        ISSN: 2398-502X


Species taxonomy

Eukaryota; Metazoa; Ecdysozoa; Arthropoda; Hexapoda; Insecta; Pterygota; Neoptera; Paraneoptera; Hemiptera; Heteroptera; Panheteroptera; Pentatomomorpha; Pentatomoidea; Pentatomidae; Pentatominae; Aelia; Aelia acuminata (Linnaeus, 1758) (NCBI:txid1511221).

Background

The Bishop's mitre shieldbug, Aelia acuminata, is a common mid-sized (8-9 mm) shieldbug. The vernacular name derives from the characteristically pointed head and pronotum forming a shape reminiscent of its namesake. It is common in grassland habitats across Europe, North Africa, the Middle East and Northern Asia. In the UK it is common and widespread across the south, and can be found in most dry grassland habitats. It feeds on the seeds of a range of grasses in the Poaceae family and may be a minor pest in cereal fields ( Vaccino ). It is univoltine, with adults mating and laying eggs in the spring and early summer. Nymphs develop through five instars throughout the summer months and reach adulthood by August. Aelia acuminata overwinters as an imago, with the increased photoperiod a key factor in termination of diapause in the spring ( Hodek, 1971).

Genome sequence report

The genome was sequenced from one male A. acuminata ( Figure 1) collected from Wytham Woods, Oxfordshire (biological vice-county: Berkshire), UK (latitude 51.771, longitude -1.338). A total of 31-fold coverage in Pacific Biosciences single-molecule long reads and 24-fold coverage in 10X Genomics read clouds were generated. Primary assembly contigs were scaffolded with chromosome conformation Hi-C data. Manual assembly curation corrected 241 missing/misjoins and removed 25 haplotypic duplications, reducing the assembly size by 0.73% and the scaffold number by 67.21% and increasing the scaffold N50 by 132.84%.
Figure 1.

Image of the Aelia acuminata specimen (ihAelAcum1) used for genome sequencing, taken during preservation and processing.

The specimen is shown alongside a FluidX sample tube 43.9 mm in length.

Image of the Aelia acuminata specimen (ihAelAcum1) used for genome sequencing, taken during preservation and processing.

The specimen is shown alongside a FluidX sample tube 43.9 mm in length. The final assembly has a total length of 1,170 Mb in 81 sequence scaffolds with a scaffold N50 of 172 Mb ( Table 1). The majority of the assembly sequence (99.78%) was assigned to 8 chromosomal-level scaffolds, representing 6 autosomes (numbered by sequence length), and the X and Y sex chromosome ( Figure 2– Figure 5; Table 2). The assembly has a BUSCO v5.1.2 ( Manni ) completeness of 99.3% (single 97.6%, duplicated 1.7%) using the hemiptera_odb10 reference set. While not fully phased, the assembly deposited is of one haplotype. Contigs corresponding to the second haplotype have also been deposited.
Table 1.

Genome data for Aelia acuminata, ihAelAcum1.1.

Project accession data
Assembly identifierihAelAcum1.1
Species Aelia acuminata
SpecimenihAelAcum1 (genome assembly, Hi-C); ihAelAcum5 (Hi-C)
NCBI taxonomy IDNCBI:txid1511221
BioProjectPRJEB45203
BioSample IDSAMEA7520372
Isolate informationMale, whole organism (ihAelAcum1); unknown sex, head/thorax (ihAelAcum5);
Raw data accessions
PacificBiosciences SEQUEL IIERR6406217, ERR6544656
10X Genomics IlluminaERR6054995-ERR6054998
Hi-C IlluminaERR6054999, ERR6055000
Genome assembly
Assembly accessionGCA_911387785.1
Accession of alternate haplotypeGCA_911387705.1
Span (Mb)1,170
Number of contigs691
Contig N50 length (Mb)4.7
Number of scaffolds81
Scaffold N50 length (Mb)172.2
Longest scaffold (Mb)235.2
BUSCO * genome scoreC:99.3%[S:97.6%,D:1.7%],F:0. 2%,M:0.5%,n:2510

*BUSCO scores based on the hemiptera_odb10 BUSCO set using v5.1.2. C= complete [S= single copy, D=duplicated], F=fragmented, M=missing, n=number of orthologues in comparison. A full set of BUSCO scores is available at https://blobtoolkit.genomehubs.org/view/ihAelAcum1.1/dataset/CAJVQU01/busco.

Figure 2.

Genome assembly of Aelia acuminata, ihAelAcum1.1: metrics.

The BlobToolKit Snailplot shows N50 metrics and BUSCO gene completeness. The main plot is divided into 1,000 size-ordered bins around the circumference with each bin representing 0.1% of the 1,170,046,398 bp assembly. The distribution of scaffold lengths is shown in dark grey with the plot radius scaled to the longest scaffold present in the assembly (235,202,771 bp, shown in red). Orange and pale-orange arcs show the N50 and N90 scaffold lengths (172,186,306 and 93,278,539 bp), respectively. The pale grey spiral shows the cumulative scaffold count on a log scale with white scale lines showing successive orders of magnitude. The blue and pale-blue area around the outside of the plot shows the distribution of GC, AT and N percentages in the same bins as the inner plot. A summary of complete, fragmented, duplicated and missing BUSCO genes in the hemiptera_odb10 set is shown in the top right. An interactive version of this figure is available at https://blobtoolkit.genomehubs.org/view/ihAelAcum1.1/dataset/CAJVQU01/snail.

Figure 5.

Genome assembly of Aelia acuminata, ihAelAcum1.1: Hi-C contact map.

Hi-C contact map of the ihAelAcum1.1 assembly, visualised in HiGlass.

Table 2.

Chromosomal pseudomolecules in the genome assembly of Aelia acuminata, ihAelAcum1.1.

INSDC accessionChromosomeSize (Mb)GC%
OU426978.11235.2033.3
OU426979.12187.8833.3
OU426981.13158.1132.8
OU426982.14150.2432.9
OU426983.15132.5332.8
OU426984.1693.2833
OU426980.1X172.1933.1
OU426985.1Y14.8233.9
OU426986.1MT0.0226.3
-Unplaced25.7833.7
*BUSCO scores based on the hemiptera_odb10 BUSCO set using v5.1.2. C= complete [S= single copy, D=duplicated], F=fragmented, M=missing, n=number of orthologues in comparison. A full set of BUSCO scores is available at https://blobtoolkit.genomehubs.org/view/ihAelAcum1.1/dataset/CAJVQU01/busco.

Genome assembly of Aelia acuminata, ihAelAcum1.1: metrics.

The BlobToolKit Snailplot shows N50 metrics and BUSCO gene completeness. The main plot is divided into 1,000 size-ordered bins around the circumference with each bin representing 0.1% of the 1,170,046,398 bp assembly. The distribution of scaffold lengths is shown in dark grey with the plot radius scaled to the longest scaffold present in the assembly (235,202,771 bp, shown in red). Orange and pale-orange arcs show the N50 and N90 scaffold lengths (172,186,306 and 93,278,539 bp), respectively. The pale grey spiral shows the cumulative scaffold count on a log scale with white scale lines showing successive orders of magnitude. The blue and pale-blue area around the outside of the plot shows the distribution of GC, AT and N percentages in the same bins as the inner plot. A summary of complete, fragmented, duplicated and missing BUSCO genes in the hemiptera_odb10 set is shown in the top right. An interactive version of this figure is available at https://blobtoolkit.genomehubs.org/view/ihAelAcum1.1/dataset/CAJVQU01/snail.

Genome assembly of Aelia acuminata, ihAelAcum1.1: GC coverage.

BlobToolKit GC-coverage plot. Scaffolds are coloured by phylum. Circles are sized in proportion to scaffold length. Histograms show the distribution of scaffold length sum along each axis. An interactive version of this figure is available at https://blobtoolkit.genomehubs.org/view/ihAelAcum1.1/dataset/CAJVQU01/blob.

Genome assembly of Aelia acuminata, ihAelAcum1.1: cumulative sequence.

BlobToolKit cumulative sequence plot. The grey line shows cumulative length for all scaffolds. Coloured lines show cumulative lengths of scaffolds assigned to each phylum using the buscogenes taxrule. An interactive version of this figure is available at https://blobtoolkit.genomehubs.org/view/ihAelAcum1.1/dataset/CAJVQU01/cumulative.

Genome assembly of Aelia acuminata, ihAelAcum1.1: Hi-C contact map.

Hi-C contact map of the ihAelAcum1.1 assembly, visualised in HiGlass.

Methods

Sample acquisition, DNA extraction and sequencing

A single male A. acuminata (ihAelAcum1) was collected from Wytham Woods, Oxfordshire (biological vice-county: Berkshire), UK (latitude 51.771, longitude -1.338) by Liam Crowley, University of Oxford, using a sweep net. The sample was identified by the same individual, and preserved on dry ice. A second specimen of unknown sex (ihAelAcum5) was collected from Hever Castle, Hever, Kent, UK (latitude 51.188367, longitude 0.119781) by Maxwell Barclay, Natural History Museum, by hand. The sample was identified by the same individual and was snap-frozen in liquid nitrogen. DNA was extracted from the whole organism of ihAelAcum1 at the Wellcome Sanger Institute Scientific Operations core from the whole organism using the Qiagen MagAttract HMW DNA kit, according to the manufacturer’s instructions. Pacific Biosciences HiFi circular consensus and 10X Genomics read cloud DNA sequencing libraries were constructed according to the manufacturers’ instructions. Sequencing was performed by the Scientific Operations core at the Wellcome Sanger Institute on Pacific Biosciences SEQUEL II and Illumina HiSeq X instruments. Hi-C data were generated from the head/thorax tissue of ihAelAcum5 and whole organism tissue of ihAelAcum1 using the Arima Hi-C+ kit and sequenced on an Illumina NovaSeq 6000 instrument.

Genome assembly

Assembly was carried out with Hifiasm ( Cheng ); haplotypic duplication was identified and removed with purge_dups ( Guan ). One round of polishing was performed by aligning 10X Genomics read data to the assembly with longranger align, calling variants with freebayes ( Garrison & Marth, 2012). The assembly was then scaffolded with Hi-C data ( Rao ) using SALSA2 ( Ghurye ). The assembly was checked for contamination and corrected using the gEVAL system ( Chow ) as described previously ( Howe ). Manual curation ( Howe ) was performed using gEVAL, HiGlass ( Kerpedjiev ) and Pretext. The mitochondrial genome was assembled using MitoHiFi ( Uliano-Silva ). The genome was analysed and BUSCO scores generated within the BlobToolKit environment ( Challis ). Table 3 contains a list of all software tool versions used, where appropriate.
Table 3.

Software tools used.

Software toolVersionSource
Hifiasm0.15.1-r329 Cheng et al., 2021
purge_dups1.2.3 Guan et al., 2020
SALSA22.2 Ghurye et al., 2019
longranger align2.2.2 https://support.10xgenomics.com/genome-exome/software/pipelines/latest/advanced/other-pipelines
freebayes1.3.1-17-gaa2ace8 Garrison & Marth, 2012
MitoHiFi2.0 singularity Uliano-Silva et al., 2021
gEVALN/A Chow et al., 2016
HiGlass1.11.6 Kerpedjiev et al., 2018
PretextView0.2.x https://github.com/wtsi-hpag/PretextView
BlobToolKit2.6.2 Challis et al., 2020

Ethics/compliance issues

The materials that have contributed to this genome note have been supplied by a Darwin Tree of Life Partner. The submission of materials by a Darwin Tree of Life Partner is subject to the Darwin Tree of Life Project Sampling Code of Practice. By agreeing with and signing up to the Sampling Code of Practice, the Darwin Tree of Life Partner agrees they will meet the legal and ethical requirements and standards set out within this document in respect of all samples acquired for, and supplied to, the Darwin Tree of Life Project. Each transfer of samples is further undertaken according to a Research Collaboration Agreement or Material Transfer Agreement entered into by the Darwin Tree of Life Partner, Genome Research Limited (operating as the Wellcome Sanger Institute), and in some circumstances other Darwin Tree of Life collaborators.

Data availability

European Nucleotide Archive: Aelia acuminata (Bishop's mitre shieldbug). Accession number PRJEB45203; https://identifiers.org/ena.embl/PRJEB45203. The genome sequence is released openly for reuse. The A. acuminata genome sequencing initiative is part of the Darwin Tree of Life (DToL) project. All raw sequence data and the assembly have been deposited in INSDC databases. The genome will be annotated and presented through the Ensembl pipeline at the European Bioinformatics Institute. Raw data and assembly accession identifiers are reported in Table 1. Genome assembly of Bishop’s Mitre Shieldbug,  Aelia acuminata by Crowley et al. This is one of the best chromosome-scale genome assemblies available to date for a pentatomid stink bug species. Authors present the workflow clearly and I believe this genome will be an excellent reference for other pentatomid genome assemblies. I encourage authors to re-examine chromosome assignments by comparing the genome sequence to other published hemipteran genomes rather than sorting by chromosome size. This will help comparative studies down the road as more hemipteran/pentatomid genomes are sequenced and published. In addition, publishing the parameters used for various operations as a supplementary data file would be helpful to the scientific community. Are sufficient details of methods and materials provided to allow replication by others? Yes Is the rationale for creating the dataset(s) clearly described? Yes Are the datasets clearly presented in a useable and accessible format? Yes Are the protocols appropriate and is the work technically sound? Yes Reviewer Expertise: Genetics I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. The authors have presented the genome report of the  Aelia acuminata sized 1,170 megabases with 8 chromosomal pseudomolecules using long read genome assembly. The methods for sample collection and the data analyses are technically and scientifically sound as they have utilized recent tools. The genome of A. acuminata can provide good addition to the Hemiptera order and baseline for the phylogenetic studies. Major: I believe CAJVQU010000004.1, and CAJVQU010000005.1 are the X and Y chromosomes. Authors have not pointed out which is X and Y chromosomes. Please provide the reason for using only One round of polishing. Other missing part in the methodology is the annotation of the genome. I would like authors to add an annotation report in the main article with information on genes and pseudogenes information. Minor: I was wondering if they can show the BUSCO plot in traditional figure and add ANI identity to the close related species' genomes and I would recommend authors to show them in the phylogenetic tree using single copy genes. Further, the codes used in the analysis should be deposited in the dyrad or figshare. Are sufficient details of methods and materials provided to allow replication by others? Yes Is the rationale for creating the dataset(s) clearly described? Yes Are the datasets clearly presented in a useable and accessible format? Yes Are the protocols appropriate and is the work technically sound? Yes Reviewer Expertise: Bioinformatics, living organism Genomics I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard. Crowley, Barclay et al., report a genome of a hemipteran - bishop’s mitre shieldbug. The assembly is on a chromosomal scale, including identified sex chromosomes. The methods were described clearly, with an exception of the assignment of X and Y chromosomes, which seems to be left out of the methods section. I suppose they showed a clear signature of monoploid coverage, however, I was less sure how authors decided on which of the two monoploid chromosomes was X and which Y. The final assembly spans 1,170 Mb in 81 scaffolds. 99.78% of nucleotides were assigned to chromosomes, but I wondered how many scaffolds were anchored to chromosomes? Does the coverage not anchored scaffolds indicate linkage to one of the two sex chromosomes, automates or is it impossible to tell? Are sufficient details of methods and materials provided to allow replication by others? Partly Is the rationale for creating the dataset(s) clearly described? Yes Are the datasets clearly presented in a useable and accessible format? Yes Are the protocols appropriate and is the work technically sound? Yes Reviewer Expertise: genome evolution, non-mendelian inheritance, bioinformatics I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard.
  10 in total

1.  Sensitivity to photoperiod in Aelia acuminata (L.) after adult diapause.

Authors:  I Hodek
Journal:  Oecologia       Date:  1971-01       Impact factor: 3.225

2.  Integrating Hi-C links with assembly graphs for chromosome-scale assembly.

Authors:  Jay Ghurye; Arang Rhie; Brian P Walenz; Anthony Schmitt; Siddarth Selvaraj; Mihai Pop; Adam M Phillippy; Sergey Koren
Journal:  PLoS Comput Biol       Date:  2019-08-21       Impact factor: 4.475

3.  A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping.

Authors:  Suhas S P Rao; Miriam H Huntley; Neva C Durand; Elena K Stamenova; Ivan D Bochkov; James T Robinson; Adrian L Sanborn; Ido Machol; Arina D Omer; Eric S Lander; Erez Lieberman Aiden
Journal:  Cell       Date:  2014-12-11       Impact factor: 41.582

4.  Haplotype-resolved de novo assembly using phased assembly graphs with hifiasm.

Authors:  Haoyu Cheng; Gregory T Concepcion; Xiaowen Feng; Haowen Zhang; Heng Li
Journal:  Nat Methods       Date:  2021-02-01       Impact factor: 28.547

5.  gEVAL - a web-based browser for evaluating genome assemblies.

Authors:  William Chow; Kim Brugger; Mario Caccamo; Ian Sealy; James Torrance; Kerstin Howe
Journal:  Bioinformatics       Date:  2016-04-07       Impact factor: 6.937

6.  BlobToolKit - Interactive Quality Assessment of Genome Assemblies.

Authors:  Richard Challis; Edward Richards; Jeena Rajan; Guy Cochrane; Mark Blaxter
Journal:  G3 (Bethesda)       Date:  2020-04-09       Impact factor: 3.154

7.  Significantly improving the quality of genome assemblies through curation.

Authors:  Kerstin Howe; William Chow; Joanna Collins; Sarah Pelan; Damon-Lee Pointon; Ying Sims; James Torrance; Alan Tracey; Jonathan Wood
Journal:  Gigascience       Date:  2021-01-09       Impact factor: 6.524

8.  Identifying and removing haplotypic duplication in primary genome assemblies.

Authors:  Dengfeng Guan; Shane A McCarthy; Jonathan Wood; Kerstin Howe; Yadong Wang; Richard Durbin
Journal:  Bioinformatics       Date:  2020-05-01       Impact factor: 6.937

9.  HiGlass: web-based visual exploration and analysis of genome interaction maps.

Authors:  Peter Kerpedjiev; Nezar Abdennur; Fritz Lekschas; Chuck McCallum; Kasper Dinkla; Hendrik Strobelt; Jacob M Luber; Scott B Ouellette; Alaleh Azhir; Nikhil Kumar; Jeewon Hwang; Soohyun Lee; Burak H Alver; Hanspeter Pfister; Leonid A Mirny; Peter J Park; Nils Gehlenborg
Journal:  Genome Biol       Date:  2018-08-24       Impact factor: 13.583

10.  BUSCO Update: Novel and Streamlined Workflows along with Broader and Deeper Phylogenetic Coverage for Scoring of Eukaryotic, Prokaryotic, and Viral Genomes.

Authors:  Mosè Manni; Matthew R Berkeley; Mathieu Seppey; Felipe A Simão; Evgeny M Zdobnov
Journal:  Mol Biol Evol       Date:  2021-09-27       Impact factor: 16.240

  10 in total
  2 in total

Review 1.  Gene Editing and Genetic Control of Hemipteran Pests: Progress, Challenges and Perspectives.

Authors:  Inaiara D Pacheco; Linda L Walling; Peter W Atkinson
Journal:  Front Bioeng Biotechnol       Date:  2022-06-07

2.  Chromosome length genome assembly of the redbanded stink bug, Piezodorus guildinii (Westwood).

Authors:  Surya Saha; K Clint Allen; Lukas A Mueller; Gadi V P Reddy; Omaththage P Perera
Journal:  BMC Res Notes       Date:  2022-03-22
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.