| Literature DB >> 35419492 |
Abstract
We present a genome assembly from an individual male Gymnosoma rotundatum (Arthropoda; Insecta; Diptera; Tachinidae). The genome sequence is 779 megabases in span. The majority of the assembly (97.07%) is scaffolded into six chromosomal pseudomolecules, with the X sex chromosome assembled. Copyright:Entities:
Keywords: Diptera; Gymnosoma rotundatum; chromosomal; genome sequence
Year: 2022 PMID: 35419492 PMCID: PMC8987346 DOI: 10.12688/wellcomeopenres.17782.1
Source DB: PubMed Journal: Wellcome Open Res ISSN: 2398-502X
Figure 1. Images of the Gymnosoma rotundatum specimen, taken during preservation and processing.
Left, lateral view; right, dorsal view.
Genome data for Gymnosoma rotundatum, idGymRotn1.2.
|
| |
|---|---|
| Assembly identifier | idGymRotn1.2 |
| Species |
|
| Specimen | idGymRotn1 |
| NCBI taxonomy ID | 569046 |
| BioProject | PRJEB46301 |
| BioSample ID | SAMEA7849381 |
| Isolate information | Male, thorax (genome assembly), head (Hi-C) |
|
| |
| PacificBiosciences SEQUEL II | ERR6939227 |
| 10X Genomics Illumina | ERR6688431-ERR6688434 |
| Hi-C Illumina | ERR6688430 |
|
| |
| Assembly accession | GCA_916610165.2 |
|
| GCA_916610175.2 |
| Span (Mb) | 779 |
| Number of contigs | 623 |
| Contig N50 length (Mb) | 9.4 |
| Number of scaffolds | 392 |
| Scaffold N50 length (Mb) | 137.8 |
| Longest scaffold (Mb) | 182.0 |
| BUSCO
| C:98.8%[S:98.3%,D:0.4%],F:0.5%,M:0.7%,n:3285 |
*BUSCO scores based on the diptera_odb10 BUSCO set using v5.2.2. C= complete [S= single copy, D=duplicated], F=fragmented, M=missing, n=number of orthologues in comparison. A full set of BUSCO scores is available at https://blobtoolkit.genomehubs.org/view/idGymRotn1.2/dataset/CAKAJB02/busco.
Figure 2. Genome assembly of Gymnosoma rotundatum, idGymRotn1.2: metrics.
The BlobToolKit Snailplot shows N50 metrics and BUSCO gene completeness. The main plot is divided into 1,000 size-ordered bins around the circumference with each bin representing 0.1% of the 779,146,119 bp assembly. The distribution of scaffold lengths is shown in dark grey with the plot radius scaled to the longest scaffold present in the assembly (182,003,241 bp, shown in red). Orange and pale-orange arcs show the N50 and N90 scaffold lengths (137,798,182 and 132,556,942 bp), respectively. The pale grey spiral shows the cumulative scaffold count on a log scale with white scale lines showing successive orders of magnitude. The blue and pale-blue area around the outside of the plot shows the distribution of GC, AT and N percentages in the same bins as the inner plot. A summary of complete, fragmented, duplicated and missing BUSCO genes in the diptera_odb10 set is shown in the top right. An interactive version of this figure is available at https://blobtoolkit.genomehubs.org/view/idGymRotn1.2/dataset/CAKAJB02/snail.
Figure 5. Genome assembly of Gymnosoma rotundatum, idGymRotn1.2: Hi-C contact map.
Hi-C contact map of the idGymRotn1.2 assembly, visualised in HiGlass. Chromosomes are presented in order of size from left to right and top to bottom. An interactive version of this figure is available here.
Chromosomal pseudomolecules in the genome assembly of Gymnosoma rotundatum, idGymRotn1.2.
| INSDC accession | Chromosome | Size (Mb) | GC% |
|---|---|---|---|
| OU744336.1 | 1 | 182.00 | 30.1 |
| OU744337.1 | 2 | 152.51 | 30.3 |
| OU744338.1 | 3 | 137.80 | 30.1 |
| OU744339.1 | 4 | 134.05 | 30.3 |
| OU744340.1 | 5 | 132.56 | 30.1 |
| OU744341.1 | X | 17.34 | 32.5 |
| OU744342.1 | MT | 0.02 | 18.9 |
| - | Unplaced | 22.87 | 32.2 |
Software tools used.
| Software tool | Version | Source |
|---|---|---|
| Hifiasm | 0.15.1 |
|
| purge_dups | 1.2.3 |
|
| SALSA2 | 2.2 |
|
| longranger align | 2.2.2 |
|
| freebayes | 1.3.1-17-gaa2ace8 |
|
| MitoHiFi | 2.0 |
|
| HiGlass | 1.11.6 |
|
| PretextView | 0.2.x |
|
| BlobToolKit | 3.0.5 |
|