| Literature DB >> 30239684 |
Changwei Shao1,2, Chang Li3,4,5, Na Wang1,2, Yating Qin4,5, Wenteng Xu1, Qun Liu4, Qian Zhou1,2, Yong Zhao4, Xihong Li1, Shanshan Liu4,5, Xiaowu Chen6, Shahid Mahboob7,8, Xin Liu4,5, Songlin Chen1,2.
Abstract
Background: The spotted sea bass (Lateolabrax maculatus) is a valuable commercial fish that is widely cultured in China. While analyses using molecular markers and population genetics have been conducted, genomic resources are lacking. Findings: Here, we report a chromosome-scale assembly of the spotted sea bass genome by high-depth genome sequencing, assembly, and annotation. The genome scale was 0.67 Gb with contig and scaffold N50 length of 31 Kb and 1,040 Kb, respectively. Hi-C scaffolding of the genome resulted in 24 pseudochromosomes containing 77.68% of the total assembled sequences. A total of 132.38 Mb repeat sequences were detected, accounting for 20.73% of the assembled genome. A total of 22, 015 protein-coding genes were predicted, of which 96.52% were homologous to proteins in various databases. In addition, we constructed a phylogenetic tree using 1,586 single-copy gene families and identified 125 unique gene families in the spotted sea bass genome. Conclusions: We assembled a spotted sea bass genome that will be a valuable genomic resource to understanding the biology of the spotted sea bass and will also lead to the development of molecular breeding techniques to generate spotted sea bass with better economic traits.Entities:
Mesh:
Year: 2018 PMID: 30239684 PMCID: PMC6240815 DOI: 10.1093/gigascience/giy114
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1.Example of a spotted sea bass (L. maculatus) (Image from Jilun Hou)
Whole genome alignment results between the spotted sea bass (L. maculatus) and European sea bass (D. labrax) genomes
| Pseudochromosomes of spotted sea bass | Length (bp) | The best-match results in | Coverage, % | The second-best-match results in | Coverage, % |
|---|---|---|---|---|---|
| Lma_HiC_1 | 22,914,103 | Dla_LG2 | 96.21 | Dla_LG11 | 0.26 |
| Lma_HiC_2 | 22,535,790 | Dla_LG7 | 93.28 | Dla_LG8 | 0.65 |
| Lma_HiC_3 | 23,764,490 | Dla_LG15 | 95.20 | Dla_LG24 | 0.66 |
| Lma_HiC_4 | 19,156,603 | Dla_LG18–21 | 94.39 | Dla_LG15 | 0.48 |
| Lma_HiC_5 | 21,471,159 | Dla_LG14 | 94.70 | Dla_LG13 | 0.50 |
| Lma_HiC_6 | 27,060,119 | Dla_LG6 | 92.85 | Dla_LG11 | 2.44 |
| Lma_HiC_7 | 17,749,143 | Dla_LG11 | 95.87 | Dla_LG7 | 0.37 |
| Lma_HiC_8 | 21,392,500 | Dla_LG9 | 93.69 | Dla_LG1A | 1.20 |
| Lma_HiC_9 | 20,127,546 | Dla_LG19 | 94.41 | Dla_LG20 | 0.81 |
| Lma_HiC_10 | 17,765,475 | Dla_LG3 | 86.09 | Dla_LG14 | 8.92 |
| Lma_HiC_11 | 12,827,312 | Dla_LG24 | 93.01 | Dla_LG5 | 0.45 |
| Lma_HiC_12 | 23,523,986 | Dla_LG8 | 92.79 | Dla_LG7 | 0.90 |
| Lma_HiC_13 | 21,871,954 | Dla_LG12 | 95.07 | Dla_lg17 | 0.36 |
| Lma_HiC_14 | 20,194,087 | Dla_LG1B | 90.35 | Dla_LG20 | 2.49 |
| Lma_HiC_15 | 23,659,279 | Dla_LG20 | 94.65 | Dla_LG19 | 0.53 |
| Lma_HiC_16 | 22,793,363 | Dla_LG10 | 95.07 | Dla_LG5 | 0.56 |
| Lma_HiC_17 | 22,884,195 | Dla_LG4 | 96.46 | Dla_LG10 | 0.34 |
| Lma_HiC_18 | 24,927,748 | Dla_LG22–25 | 95.53 | Dla_LG1A | 0.92 |
| Lma_HiC_19 | 22,343,975 | Dla_LG1A | 94.42 | Dla_LG8 | 0.37 |
| Lma_HiC_20 | 21,152,183 | Dla_LG16 | 95.69 | Dla_LG13 | 0.48 |
| Lma_HiC_21 | 19,085,413 | Dla_LG17 | 95.02 | Dla_LG12 | 0.47 |
| Lma_HiC_22 | 21,943,731 | Dla_LG13 | 94.82 | Dla_LG14 | 0.77 |
| Lma_HiC_23 | 28,603,024 | Dla_LG5 | 95.13 | Dla_LG6 | 0.74 |
| Lma_HiC_24 | 19,492,233 | Dla_LGx | 94.63 | Dla_LG6 | 0.45 |
| Average | 21,634,975 | / | 94.14 | / | 1.09 |
The collinear analysis results were generated by LASTZ.
Figure 2.Collinear blocks between the spotted sea bass (L. maculatus) and European sea bass (D. labrax) genomes. Each colored arc represents a best match between the two species. Lma_HiC1–24 represents pseudochromosomes 1–24 of the spotted sea bass genome and Dla_LG1–24 represents chromosomes 1–24 of the European sea bass genome.
Figure 3.Phylogenetic tree constructed with orthologous genes. Phylogenetic tree was constructed using 1,586 single-copy orthologous gene families from nine teleost species. Divergence times from human—D. rerio (438∼455 Mya), D. rerio—O. latipes (258∼307 Mya), O. latipes—O. niloticus (87∼151 Mya), and T. nigroviridis—T. rubripes (42∼59 Mya) from the TimeTree database were used as the calibration times. The blue numbers on the branches indicate the estimated diverge times in millions of years ago (Mya), and red circles indicate the calibration time.
Figure 4.Venn diagram of orthologous gene families. Nine teleost species (D. rerio, D. labrax, G. aculeatus, L. calcarifer, L. maculatus, O. niloticus, O. latipes, T. nigroviridisand, and T. rubripes) were used to generate the Venn diagram based on the gene family cluster analysis.