| Literature DB >> 34946822 |
Yang Yang1, Lina Wu1, Zhuoying Weng1, Xi Wu1, Xi Wang1, Junhong Xia1,2, Zining Meng1,2, Xiaochun Liu1,2.
Abstract
The humpback grouper (Cromileptes altivelis), an Epinephelidae species, is patchily distributed in the reef habitats of Western Pacific water. This grouper possesses a remarkably different body shape and notably low growth rate compared with closely related grouper species. For promoting further research of the grouper, in the present study, a high-quality chromosome-level genome of humpback grouper was assembled using PacBio sequencing and high-throughput chromatin conformation capture (Hi-C) technology. The assembled genome was 1.013 Gb in size with 283 contigs, of which, a total of 143 contigs with 1.011 Gb in size were correctly anchored into 24 chromosomes. Moreover, a total of 26,037 protein-coding genes were predicted, of them, 25,243 (96.95%) genes could be functionally annotated. The high-quality chromosome-level genome assembly will provide pivotal genomic information for future research of the speciation, evolution and molecular-assisted breeding in humpback groupers. In addition, phylogenetic analysis based on shared single-copy orthologues of the grouper species showed that the humpback grouper is included in the Epinephelus genus and clustered with the giant grouper in one clade with a divergence time of 9.86 Myr. In addition, based on the results of collinearity analysis, a gap in chromosome 6 of the humpback grouper was detected; the missed genes were mainly associated with immunity, substance metabolism and the MAPK signal pathway. The loss of the parts of genes involved in these biological processes might affect the disease resistance, stress tolerance and growth traits in humpback groupers. The present research will provide new insight into the evolution and origin of the humpback grouper.Entities:
Keywords: Cromileptes altivelis; collinearity; evolution; genome; humpback grouper
Mesh:
Year: 2021 PMID: 34946822 PMCID: PMC8701792 DOI: 10.3390/genes12121873
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.096
Figure 1The characteristic of humpback grouper (Cromileptes altivelis).
The statistics of sequencing and assembled information of humpback grouper genome.
| Raw Data | Reads Number | Reads Base (bp) | N50 (bp) | Max Length (bp) | GC Content (%) |
|---|---|---|---|---|---|
| Illumina data for annotation | 39,031,897 | 11,658,953,034 | 150 | 150 | 49.7 |
| Illumina data for survey | 1,307,931,088 | 196,189,663,200 | 150 | 150 | 41.2 |
| PacBio data | 6,672,321 | 119,331,383,944 | 27,957 | 224,636 | 41.0 |
| HiC data | 336,775,366 | 100,882,400,830 | 150 | 150 | 42.6 |
| Assembled data | Contig or Scaffold number | Genome size (bp) | |||
| Survey | - | ~1,070,000,000 | - | - | 41.2 |
| Contig assembled using PacBio | 470 | 1,044,397,337 | 18,092,086 | 49,150,803 | 41.3 |
| Contig assembled using PacBio+Hi-C | 283 | 1,013,358,489 | 18,269,829 | 49,150,803 | 41.2 |
| Scaffold | 164 | 1,013,370,389 | 52,436,080 | 41.2 | |
| Chromosome | 24 | 1,010,598,072 | 43,466,351 | 52,436,080 | 41.2 |
Figure 2Genome assembly of humpback grouper. (A) Estimation of genome size, repeat content and heterozygous by survey using 19-mers in humpback grouper; (B) Verification of genome integrality based on the Core Eukaryotic Genes Mapping Approach (CEGMA) database; red parts and blue parts indicated missed and presented core eukaryotic genes (CEG) in humpback grouper genome, respectively; (C) The genome contig contact matrix; the blocks indicated the contacts between linkage groups; color depth indicated the degree of contacts.
Figure 3Analysis of divergence time of humpback grouper and other grouper species. Large yellow croaker (L. crocea) was set as outgroup. (A) The orthogroups statistics of the eight species; (B) phylogenetic tree of humpback grouper and other grouper species, red number indicated the bootstrap values; (C) Analysis of divergence time between humpback grouper and other grouper species using MCMCtree in PAML, the bold word indicated fish species used in the present study; two calibration times were obtained from TimeTree database; black number indicated the mean value of divergence time, blue number indicated the 95% confidence intervals of divergence times; Cr.—Cretaceous, Pa.—Paleogene, N. —Neogene.
Figure 4Collinearity analysis between humpback grouper and giant grouper. (A) Scatterplot of collinearity between humpback grouper and giant grouper, red circle showed the missing fragment in chromosome 6 of humpback grouper genome; (B) The distribution of genes in chromosome 6 of giant grouper and humpback grouper, multiple genes located in located in 29,818,799~43,791,507 bp of chromosome 6 were missed. (C) The KEGG enrichment analysis of missed genes.