| Literature DB >> 34654820 |
Dongdong Xu1, Wanchang Zhang2, Ruiyi Chen3, Hongbin Song3, Lu Tian3, Peng Tan3, Ligai Wang3, Qihui Zhu3, Bin Wu4, Bao Lou5, Jiumeng Min4, Juhong Zhou4.
Abstract
The yellow drum (Nibea albiflora) is an economically important sciaenid fish in East Asian countries. In this study, we sequenced and assembled a near-complete gynogenetic yellow drum genome. We generated 45.63 Gb of Illumina short-reads and 80.27 Gb of PacBio long-reads and assembled them into a 628.01-Mb genome with a contig N50 of 4.42 Mb. Twenty-four chromosomes with a scaffold N50 of 26.73 Mb were obtained using the Hi-C analysis. We predicted a set of 27,069 protein-coding genes, of which 1,581 and 2,583 were expanded and contracted gene families, respectively. The most expanded genes were categorised into the protein binding, zinc-ion binding and ATP binding functional pathways. We built a high-density genetic linkage map that spanned 4,300.2 cM with 24 linkage groups and a resolution of 0.69 cM. The high-quality reference genome and annotated profiles that we produced will not only increase our understanding of the genetic architecture of economic traits in the yellow drum, but also help us explore the evolution and unique biological characteristics of sciaenid fishes.Entities:
Mesh:
Year: 2021 PMID: 34654820 PMCID: PMC8521588 DOI: 10.1038/s41597-021-01045-z
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Summary of the assemblies and annotations of several Sciaenid fish genomes.
| Genome feature | Larimichthys crocea[ | Larimichthys crocea[ | Miichthys miiuy[ | Collichthys lucidus[ | Nibea albiflora[ | Nibea albiflora* |
|---|---|---|---|---|---|---|
| Assembled sequences (bp) | 643,981,144 | 678,964,076 | 619,300,777 | 877,428,965 | 565,299,463 | 628,130,494 |
| Contig N50 size (bp) | 25,711 | 63,110 | 73,323 | 1,098,566 | 50,300 | 4,421,865 |
| Contig N90 size (bp) | 6,585 | 14,160 | 13,596 | 152,174 | 215,633 | |
| Scaffold N50 size (bp) | 498,737 | 1,034,540 | 1,145,539 | 35,900,000 | 2,254,189 | 26,729,459 |
| Scaffold N90 size (bp) | 100,943 | 285,329 | 82,882 | 20,155,850 | ||
| GC content (%) | 40.96 | 42 | 41.8 | 42.61 | ||
| Avg. coding DNA sequence length (bp) | 1491 | 1,766 | 1,789 | 1,844 | 1,561 | |
| Benchmarking Universal Single-Copy Ortholog (%) | 97.6 | 97.7 | 92.3 |
*The present study.
Fig. 1A yellow drum, Nibea albiflora.
Statistical description of the yellow drum genome and annotation.
| Genome assembly | Statistics |
|---|---|
| Contig N50 (Mb) | 4.42 |
| Scaffold N50 (Mb) | 26.73 |
| Scaffold N90 (Mb) | 20.16 |
| Estimated genome size (Mb) | 650.42 |
| Assembled genome size (Mb) | 628.01 |
| Number of Chromosome (N) | 24 |
| Longest chromosome length (Mb) | 47.73 |
| Average chromosome length (Mb) | 25.13 |
| Gap length (kb) | 118.80 |
Fig. 2Number of orthologous genes in N. albiflora and 11 other species.
Annotation of protein-coding genes in the yellow drum.
| Genome assembly | Number | Percent (%) | |
|---|---|---|---|
| Total | 27,069 | 100 | |
| Annotated | InterPro | 22,383 | 82.69% |
| GO | 17,156 | 63.38% | |
| KEGG | 21,892 | 80.87% | |
| Swissport | 22,656 | 83.70% | |
| TrEMBL | 25,344 | 93.63% | |
| Unannotated | 1653 | 6.11% |
Fig. 3Associated statistics for the annotated gene models, including: total number of genes, CDS, and exon and intron length for the yellow drum and other related teleost species. Total gene number, CDS, and exon and intron length were compared with those of related species, including D. rerio, G. aculeatus, O. latipes, and X. maculatus.
Fig. 4Phylogenic analysis and divergence time of the yellow drum and 11 other teleosts.
Fig. 5Circos plot of the synteny analysis between the yellow drum and the large yellow croaker genome.
Whole genome alignment results between the genome of the yellow drum (Nibea albiflora) and the large yellow croaker (Larimichthys crocea).
| Chromosome of the yellow drum | Total length (bp) | Best match in the large yellow croaker chromosomes | Alignment length (bp) | Coverage (%) |
|---|---|---|---|---|
| 1 | 28,993,770 | VII | 15,505,145 | 53.48% |
| 2 | 16,663,570 | II | 9,240,440 | 55.45% |
| 3 | 27,493,751 | XIX | 21,149,698 | 76.93% |
| 4 | 27,960,023 | III | 11,826,427 | 42.30% |
| 5 | 29,955,022 | XI | 16,528,515 | 55.18% |
| 6 | 27,071,202 | XX | 14,480,688 | 53.49% |
| 7 | 23,659,324 | XII | 18,757,016 | 79.28% |
| 8 | 26,729,459 | XIV | 14,257,898 | 53.34% |
| 9 | 47,731,074 | V | 21,189,745 | 44.39% |
| 10 | 29,213,361 | XXI | 11,093,766 | 37.97% |
| 11 | 26,531,260 | XV | 16,961,513 | 63.93% |
| 12 | 24,620,309 | XVI | 12,696,102 | 51.57% |
| 13 | 23,222,488 | XVII | 10,433,022 | 44.93% |
| 14 | 21,758,424 | XXIV | 7,981,200 | 36.68% |
| 15 | 28,077,900 | IV | 14,196,944 | 50.56% |
| 16 | 16,255,354 | XVIII | 6,170,456 | 37.96% |
| 17 | 19,564,411 | VI | 18,122,787 | 92.63% |
| 18 | 36,185,908 | I | 16,040,007 | 44.33% |
| 19 | 25,823,092 | VIII | 17,027,123 | 65.94% |
| 20 | 21,377,824 | IX | 16,360,606 | 76.53% |
| 21 | 20,155,850 | XXIII | 8,262,898 | 41.00% |
| 22 | 6,948,392 | XXII | 5,105,376 | 73.48% |
| 23 | 26,849,991 | 9,557,319 | 35.60% | |
| 24 | 22,182,248 | X | 13,346,593 | 60.17% |
| Un | 23,106,487 | XVIII | 11,825,562 | 51.18% |
| Total | 628,130,494 | 338,116,846 | 53.83% |
| Measurement(s) | DNA • genome • sequence_assembly • sequence feature annotation |
| Technology Type(s) | DNA sequencing • sequence assembly process • sequence annotation |
| Sample Characteristic - Organism | Nibea albiflora |