Yuanqing Huang, Dongneng Jiang, Ming Li, Umar Farouk Mustapha, Changxu Tian, Huapu Chen, Yang Huang, Siping Deng, Tianli Wu, Chunhua Zhu, Guangli Li.
Abstract
The spotted scat, Scatophagus argus, is a fish species that is widely propagated in the Chinese aquaculture industry and therefore has significant economic value. Despite this, studies of its genome are severely lacking. In the present study, a genomic survey of S. argus was conducted using next-generation sequencing (NGS). In total, 55.699 GB (female) and 51.047 GB (male) of high-quality sequence data were obtained. Genome sizes were estimated to be 598.73 Mbp (female) and 597.60 Mbp (male). The sequence repeat ratios were calculated to be 27.06% (female) and 26.99% (male). Heterozygosity ratios were 0.37% for the female and 0.38% for the male. Reads were assembled into 444,961 (female) and 453,459 (male) contigs with N50 lengths of 5,747 bp and 5,745 bp, respectively. The average guanine-cytosine (GC) content was 41.78% for the female genome and 41.82% for the male genome. A total of 42,869 (female) and 43,283 (male) genes were annotated to the non-redundant (NR) and SwissProt databases. The female and male genomes contained 66.6% and 67.8% of BUSCO core genes, respectively. Dinucleotide repeats were the dominant form of simple sequence repeats (SSRs) observed in the female (68.69%) and the male (68.56%). Additionally, gene fragments of Dmrt1 were observed only in the male genome. This is the first report of a genome-wide characterization of S. argus.
Keywords: Dmrt1; aquaculture; genome size; Illumina sequencing; sex control; sex determining gene
Year: 2019 PMID: 31835725 PMCID: PMC6940847 DOI: 10.3390/ani9121117
Source DB: PubMed Journal: Animals (Basel) ISSN: 2076-2615 Impact factor: 2.752
Statistics of sequencing data of female and male S. argus.
| Library | Insert Size (bp) | Raw Base (bp) | Effective Rate (%) | Clean Base (bp) | Error Rate (%) | Q20 (%) | Q30 (%) | GC Content (%) |
|---|---|---|---|---|---|---|---|---|
| female | 350 | 55,808,601,300 | 99.80 | 55,699,379,400 | 0.03 | 96.60 | 91.94 | 41.50 |
| male | 350 | 51,153,870,900 | 99.79 | 51,047,381,700 | 0.03 | 96.63 | 92.03 | 41.52 |
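The Effective Rate column appears to be the share of raw bases retained as clean bases. That definition is an assumption here (the record does not state it), but it matches the tabulated values. A minimal sketch to check it, with a hypothetical function name:

```python
def effective_rate(raw_bases: int, clean_bases: int) -> float:
    """Percentage of raw bases kept after filtering (assumed definition)."""
    return 100.0 * clean_bases / raw_bases


# (raw bases, clean bases) copied from the sequencing statistics table.
libraries = {
    "female": (55_808_601_300, 55_699_379_400),
    "male": (51_153_870_900, 51_047_381_700),
}

for name, (raw, clean) in libraries.items():
    print(f"{name}: {effective_rate(raw, clean):.2f}%")
```

Rounded to two decimals, both libraries agree with the Effective Rate values in the table.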
Figure 1. Distribution of 17-mer depth and frequency of female and male S. argus. The x-axis indicates depth; the y-axis indicates the proportion representing the frequency at that depth divided by the total frequency of all depths.
Data statistics and analysis of 17-mer.
| Identity | K-Mer | K-Mer Depth | K-Mer Number | Genome Size (Mbp) | Revised Genome Size (Mbp) | Heterozygous Ratio (%) | Repeat (%) |
|---|---|---|---|---|---|---|---|
| female | 17 | 74 | 45,374,105,016 | 613.16 | 598.73 | 0.37 | 27.06 |
| male | 17 | 68 | 41,637,691,628 | 612.32 | 597.60 | 0.38 | 26.99 |
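The unrevised genome sizes in this table are consistent with the usual k-mer estimate, genome size ≈ total k-mer number / peak k-mer depth. The sketch below only checks that arithmetic against the tabulated values; the function name is hypothetical, and the correction that yields the revised sizes (598.73 and 597.60 Mbp) cannot be derived from these columns alone, so it is not attempted.

```python
def kmer_genome_size_mbp(kmer_number: int, kmer_depth: int) -> float:
    """Estimate genome size in Mbp as total k-mer count / peak k-mer depth."""
    return kmer_number / kmer_depth / 1e6


# (k-mer number, k-mer depth) copied from the 17-mer statistics table.
samples = {
    "female": (45_374_105_016, 74),
    "male": (41_637_691_628, 68),
}

for name, (number, depth) in samples.items():
    print(f"{name}: {kmer_genome_size_mbp(number, depth):.2f} Mbp")
```

Rounded to two decimals, the results match the Genome Size (Mbp) column for both samples.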
Statistics of the assembled S. argus genome sequences.
| Assembly | Sex | Total Length (bp) | Total Number | Max Length (bp) | N50 Length (bp) | N90 Length (bp) |
|---|---|---|---|---|---|---|
| contig | female | 580,837,740 | 444,961 | 123,323 | 5,747 | 590 |
| contig | male | 582,143,644 | 453,459 | 110,347 | 5,745 | 576 |
| scaffold | female | 585,986,615 | 335,162 | 231,008 | 13,556 | 821 |
| scaffold | male | 588,188,524 | 340,134 | 196,230 | 13,591 | 824 |
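For readers unfamiliar with the N50 and N90 columns, the sketch below uses the common definition: sort sequences from longest to shortest and report the length at which the running total first reaches the given fraction of the assembly. The function name and the toy lengths are illustrative assumptions; the record does not say which tool produced the reported values.

```python
from typing import Iterable


def nx_length(lengths: Iterable[int], fraction: float) -> int:
    """Return the Nx length (fraction=0.5 gives N50, 0.9 gives N90)."""
    ordered = sorted(lengths, reverse=True)
    if not ordered:
        raise ValueError("no sequence lengths given")
    threshold = fraction * sum(ordered)
    running = 0
    for length in ordered:
        running += length
        # The first sequence that pushes the running total to the
        # threshold defines the Nx length.
        if running >= threshold:
            return length
    return ordered[-1]


# Toy contig lengths for illustration only, not data from the study.
toy_contigs = [2, 3, 4, 5, 6, 10]
print("N50:", nx_length(toy_contigs, 0.5))
print("N90:", nx_length(toy_contigs, 0.9))
```

Because N90 requires covering more of the assembly, it is never larger than N50, which is the pattern seen in every row of the table.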
Figure 2. GC content and depth correlation analysis of female and male S. argus. The x-axis is the percentage GC content and the y-axis represents sequencing depth. The distribution of sequencing depth is on the right side, while the distribution of GC content is at the top.
Gene function annotation statistics of S. argus.
| Database | Number (Female/Male) | Percent (Female/Male) |
|---|---|---|
| NR | 42,825/43,238 | 45.14%/45.38% |
| SwissProt | 33,093/33,359 | 34.89%/35.01% |
| KEGG | 40,854/41,245 | 43.07%/43.29% |
| KOG | 26,420/26,680 | 27.85%/28.00% |
| Annotated | 42,869/43,283 | 45.19%/45.43% |
| Unannotated | 51,993/51,990 | 54.81%/54.57% |
| Total | 94,862/95,273 | 100%/100% |
Figure 3. Structure of the Dmrt1 and Dmrt1b genes. Dmrt1 and Dmrt1b are located on the male and female sex chromosomes, respectively. Numbers indicate base pairs (loci) of exon and intron sequences. Percentages indicate the similarity of Dmrt1 and Dmrt1b. Arrows indicate the start and stop codons. Different colored rectangles represent different exons.
SSR (simple sequence repeat) types detected in female and male S. argus.
| SSR Mining | Total (Female/Male) |
|---|---|
| Total number of sequences examined | 335,162/340,134 |
| Total number of identified SSRs | 299,574/299,893 |
| Number of SSR containing sequences | 78,202/77,788 |
| Number of sequences containing more than 1 SSR | 39,136/39,104 |
| Number of SSRs present in compound formation | 48,384/48,510 |
Figure 4. Frequency of SSR types in the genomic survey of female and male S. argus.