| Literature DB >> 33111957 |
Bingjie Chen1, Zhicheng Sun1, Fangrui Lou1, Tian-Xiang Gao2, Na Song1.
Abstract
Acanthogobius ommaturus is one of the suitable species to study the genetic mechanism of adaptive evolution, but there are few reports on its genetics. In the present study, the genomic survey sequencing method was used to analyze the genome characters of A. ommaturus. A total of 50.50 G high-quality sequence data were obtained in the present study. From the 19-mer distribution frequency, the estimated genome size was 928.01 Mb. The calculated sequence repeat rate was about 38.31%, the heterozygosity was approximately 0.17%, and the GC% content was approximately 40.88%. Moreover, 475,724 simple sequence repeats (SSRs) were identified. Among them, dinucleotide repeats were the most (53.70% of the total SSRs), followed by tri- (35.36%), hexa- (4.59%), tetra- (4.57%) and penta- (1.77%) nucleotide repeats type. This is the first genome-wide feature of this species to be reported.Entities:
Keywords: Acanthogobius ommaturus; GC content; Genome assembly; Genome size; SSR
Mesh:
Year: 2020 PMID: 33111957 PMCID: PMC7670571 DOI: 10.1042/BSR20201295
Source DB: PubMed Journal: Biosci Rep ISSN: 0144-8463 Impact factor: 3.840
Figure 1K-mer (K = 19) analysis for estimation of the genome size of A. ommaturus
The X-axis was depth and the Y-axis was the proportion that represented the frequency at that depth.
Statistics of A. ommaturus assembled genome sequences
| Contigs | Scaffolds | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Total length (bp) | Total number | Max length (bp) | N50 (bp) | N90 (bp) | Total length (bp) | Total number | Max length (bp) | N50 (bp) | N90 (bp) | |
| 45 | 705,413,224 | 1,522,798 | 35,678 | 1120 | 203 | 720,996,115 | 598,674 | 51,912 | 2458 | 436 |
| 55 | 765,231,759 | 1,421,279 | 43,980 | 1237 | 216 | 776,582,763 | 717,328 | 60,036 | 2528 | 343 |
| 65 | 819,722,014 | 1,395,163 | 40,568 | 1267 | 227 | 827,753,551 | 848,081 | 66,118 | 2443 | 301 |
| 75 | 861,431,452 | 1,387,872 | 54260 | 1282 | 236 | 867,370,160 | 944,567 | 67,355 | 2343 | 283 |
A assembly evaluation summary by BUSCO
| Term | Results |
|---|---|
| Complete (C) | 1300 (38.76%) |
| Complete (C) and single-copy (S) | 1285 (38.31%) |
| Complete (C) and duplicated (D) | 15 (0.45%) |
| Fragmented (F) | 1150 (34.29%) |
| Missing (M) | 904 (26.95%) |
| Total | 3354 |
Simple sequence repeat (SSR) distribution statistics for A. ommaturus
| Statistics | Di- | Tri- | Tetra- | Penta- | Hexa- |
|---|---|---|---|---|---|
| SSR number | 187,501 | 123,452 | 15,969 | 6,182 | 16,034 |
| Percentage | 53.70% | 35.36% | 4.57% | 1.77% | 4.59% |
Figure 2The distribution and frequency of microsatellite motifs
(A) Frequency of different dinucleotide microsatellite motifs. (B) Frequency of different trinucleotide microsatellite motifs. (C) Frequency of different tetranucleotide microsatellite motifs. (D) Frequency of different pentabase microsatellite motifs. (E) Frequency of different hexanucleotide microsatellite motifs.