| Literature DB >> 34698106 |
Tianyan Yang1, Xinxin Huang1, Zijun Ning1, Tianxiang Gao1.
Abstract
Harpadon nehereus forms one of the most important commercial fisheries along the Bay of Bengal and the southeast coast of China. In this study, the genome-wide survey dataset first produced using next-generation sequencing (NGS) was used to provide general information on the genome size, heterozygosity and repeat sequence ratio of H. nehereus. About 68.74 GB of high-quality sequence data were obtained in total and the genome size was estimated to be 1315 Mb with the 17-mer frequency distribution. The sequence repeat ratio and heterozygosity were calculated to be 52.49% and 0.67%, respectively. A total of 1,027,651 microsatellite motifs were identified and dinucleotide repeat was the most dominant simple sequence repeat (SSR) motif with a frequency of 54.35%. As a by-product of whole genome sequencing, the mitochondrial genome is a powerful tool to investigate the evolutionary relationships between H. nehereus and its relatives. The maximum likelihood (ML) phylogenetic tree was constructed according to the concatenated matrix of amino acids translated from the 13 protein-coding genes (PCGs). Monophyly of two species of the genus Harpadon was revealed in the present study and they formed a monophyletic clade with Saurida with a high bootstrap value of 100%. The results would help to push back the frontiers of genomics and open the doors of molecular diversity as well as conservation genetics studies on this species.Entities:
Keywords: Harpadon nehereus; genome survey; microsatellite identification; phylogenetics
Mesh:
Year: 2021 PMID: 34698106 PMCID: PMC8928995 DOI: 10.3390/cimb43030091
Source DB: PubMed Journal: Curr Issues Mol Biol ISSN: 1467-3037 Impact factor: 2.976
Figure 1The base distribution and K-mer (K = 17) analysis of H. nehereus. (a) On the x-axis, the read-1 base contents are on the left side of the dotted line and the read-2 base contents are on the right. Different colors represent different base types. The y-axis represents the sequencing depth. (b) The x-axis means K-mer depth and the y-axis represents the frequency for the corresponding depth.
Assembly statistics for stitched contigs and scaffolds of H. nehereus.
| Sample | Total Length (bp) | Total | Total Number (≥2 kb) | Max Length (bp) | N50 (bp) | N90 (bp) | |
|---|---|---|---|---|---|---|---|
|
| contigs | 1,352,668,147 | 3,766,078 | 73,281 | 30,284 | 596 | 138 |
| scaffolds | 1,363,443,545 | 2,539,084 | 133,089 | 92,118 | 1568 | 162 |
The statistics of SSRs in the H. nehereus genome based on repeat types.
| SSR Type | Number | Percent (%) | Sequence Number | Total SSR Length (bp) |
|---|---|---|---|---|
| p1 | 266,761 | 25.96 | 208,219 | 3,369,363 |
| p2 | 558,543 | 54.35 | 400,985 | 17,745,246 |
| p3 | 66,270 | 6.45 | 62,046 | 1,638,864 |
| p4 | 105,426 | 10.26 | 97,016 | 3,527,488 |
| p5 | 25,182 | 2.45 | 23,958 | 883,000 |
| p6 | 5469 | 0.53 | 5374 | 191,406 |
| Total | 1,027,651 | 100 | 797,598 | 27,355,367 |
Figure 2The distribution and frequency of microsatellite motifs in H. nehereus. (a–f) present frequency of the mono-, di-, tri-, tetra-, penta- and hexanucleotide microsatellite motifs.
Figure 3The complete mitogenome structure of H. nehereus.
Figure 4The phylogenetic tree inferred from mitochondrial genomes of related species.