| Literature DB >> 34146395 |
Yuanqing Huang1, Umar Farouk Mustapha1, Yang Huang1, Changxu Tian1, Wei Yang1,2, Huapu Chen1, Siping Deng1, Chunhua Zhu1,3, Dongneng Jiang1, Guangli Li1.
Abstract
The spotted scat, Scatophagus argus is a member of the family Scatophagidae found in Indo-Pacific coastal waters. It is an emerging commercial aquaculture species, particularly in East and Southeast Asia. In this study, the first chromosome-level genome of S. argus was constructed using PacBio and Hi-C sequencing technologies. The genome is 572.42 Mb, with a scaffold N50 of 24.67 Mb. Using Hi-C data, 563.28 Mb (98.67% of the genome) sequences were anchored and oriented in 24 chromosomes, ranging from 12.57 Mb to 30.38 Mb. The assembly is of high integrity, containing 94.26% conserved single-copy orthologues, based on BUSCO analysis. A total of 24,256 protein-coding genes were predicted in the genome, and 96.30% of the predicted genes were functionally annotated. Evolutionary analysis showed that S. argus diverged from the common ancestor of Japanese puffer (Takifugu rubripes) approximately 114.8 Ma. The chromosomes of S. argus showed significant correlation to T. rubripes chromosomes. A comparative genomic analysis identified 49 unique and 90 expanded gene families. These genomic resources provide a solid foundation for functional genomics studies to decipher the economic traits of this species.Entities:
Keywords: Hi-C proximity mapping; PacBio sequencing; chromosomal assembly; genomics; spotted scat
Mesh:
Year: 2021 PMID: 34146395 PMCID: PMC8214404 DOI: 10.1093/gbe/evab092
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Fig. 1.Characteristics of spotted scat genome assembly. (A) A spotted scat (Scatophagus argus). (B) Spotted scat genome contig contact matrix using Hi-C data. LGs 1–24 are the abbreviations of Lachesis group 1–24, representing the 24 chromosomes. The color bar illuminates the logarithm of the contact density from red (high) to white (low) in the plot. Only sequences anchored on chromosomes are shown. (C) Features of spotted scat genome. (a) Chromosome length; (b) GC content; (c) gene density; (d) repeat sequence; (e) long terminal repeated (LTE); (f) long interspersed nuclear elements (LINE); and (g) simple sequence repeat (SSR). (D) Phylogenetic tree of 10 teleost species genomes, which was constructed using 3,473 single-copy orthologous genes. Divergence times of spotted gar and yellow crocker, zebrafish and Japanese puffer, Japanese medaka and three-spined stickleback, yellow crocker and Japanese puffer from the TimeTree database were used for calibration. The numbers on the branches indicate the estimated diverge times in millions of years ago. (E) Genome comparison between spotted scat and Japanese puffer. Each colored arc represents the best match between the two species. LG1–24 represents chromosomes 1–24 of the spotted scat genome, and roman numerals represent chromosomes 1–22 of the Japanese puffer genome.
Summary of the Spotted Scat Scatophagus argus Genome Assembly and Annotation
| Chromosome-Level Genome Assembly | |
|---|---|
| Genome Assembly and Chromosomes Construction | |
| Contig N50 size (bp) | 21,048,838 |
| Contig N90 size (bp) | 4,427,241 |
| Maximum contig size (bp) | 30,132,598 |
| Scaffold N50 (bp) | 24,670,690 |
| Scaffold N90 (bp) | 19,600,000 |
| Maximum scaffold size (bp) | 30,379,288 |
| Number of chromosomes | 24 |
| Total length of chromosomes (bp) | 572,536,915 |
| Genome Quality Evaluation | |
| Proportion of CEG orthologs (%) | 98.91 |
| Proportion of highly conserved CEG orthologs (%) | 99.19 |
| Proportion of complete BUSCO orthologs (%) | 96.97 |
| Proportion of complete and single-copy BUSCO orthologs (%) | 94.26 |
| Proportion of complete and duplicated BUSCO orthologs (%) | 2.71 |
| Proportion of fragmented BUSCO orthologs (%) | 0.98 |
| Proportion of missing BUSCO orthologs (%) | 2.05 |
| Gene Annotation | |
| Number of GO annotation | 12,515 |
| Number of KEGG annotation | 14,651 |
| Number of KOG annotation | 16,017 |
| Number of TrEMBL annotation | 23,176 |
| Number of NR annotation | 23,335 |
| Number of all annotated | 23,359 |