| Literature DB >> 30010754 |
Guangyi Fan1,2,3,4, Judy Chan1, Kailong Ma3,4, Binrui Yang1, He Zhang2,3,4, Xianwei Yang2,3,4, Chengcheng Shi2,3,4, Henry Chun-Hin Law1, Zhitao Ren1, Qiwu Xu2,3,4, Qun Liu2,3,4, Jiahao Wang2,3,4, Wenbin Chen3,4, Libin Shao2,3,4, David Gonçalves5, Andreia Ramos5, Sara D Cardoso6, Min Guo1, Jing Cai1, Xun Xu2,3,4, Jian Wang3,7, Huanming Yang3,7, Xin Liu2,3,4, Yitao Wang1.
Abstract
Background: Siamese fighting fish Betta splendens are notorious for their aggressiveness and accordingly have been widely used to study aggression. However, the lack of a reference genome has, to date, limited the understanding of the genetic basis of aggression in this species. Here, we present the first reference genome assembly of the Siamese fighting fish. Findings: Frist, we sequenced and de novo assembled a 465.24-Mb genome for the B. splendens variety Giant, with a weighted average (N50) scaffold size of 949.03 Kb and an N50 contig size of 19.01 Kb, covering 99.93% of the estimated genome size. To obtain a chromosome-level genome assembly, we constructed one Hi-C library and sequenced 75.24 Gb reads using the BGISEQ-500 platform. We anchored approximately 93% of the scaffold sequences into 21 chromosomes and evaluated the quality of our assembly using the high-contact frequency heat map and Benchmarking Universal Single-Copy Orthologs. We also performed comparative chromosome analyses between Oryzias latipes and B. splendens, revealing a chromosome conservation evolution in B. splendens. We predicted 23,981 genes assisted by RNA-sequencing data generated from brain, liver, muscle, and heart tissues of Giant and annotated 15% repetitive sequences in the genome. Additionally, we resequenced five other B. splendens varieties and detected ∼3.4 M single-nucleotide variations and 27,305 insertions and deletions. Conclusions: We provide the first chromosome-level genome for the Siamese fighting fish. The genome will lay a valuable foundation for future research on aggression in B. splendens.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30010754 PMCID: PMC6251983 DOI: 10.1093/gigascience/giy087
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Statistics of the assembly using SOAPdenovo and Hi-C data
| Type | Scaffold original | Contig original | Scaffold (Hi-C) | Contig (Hi-C) |
|---|---|---|---|---|
| Total number | 92,886 | 138,929 | 91,819 | 139,323 |
| Total length (bp) | 465,240,853 | 421,527,246 | 465,132,837 | 421,527,246 |
| Average length (bp) | 5,008.73 | 3,034.12 | 5,066 | 3,026 |
| N50 (bp) | 949,032 | 19,014 | 19,754,490 | 18,890 |
| N90 (bp) | 59,769 | 3,504 | 13,781,534 | 3,470 |
Figure 1:Hi-C interaction heat map for B. Splendens reference genome showing interactions between the 21 chromosomes.
Evaluation results of the genome and gene set using BUSCO
| Genome | Genes | |||
|---|---|---|---|---|
| Number | Percentage (%) | Number | Percentage (%) | |
| Complete | 4,375 | 95.4 | 4,128 | 90.1 |
| Single-copy complete | 4,232 | 92.3 | 3,937 | 85.9 |
| Duplicated complete | 142 | 3.1 | 191 | 4.2 |
| Fragmented | 128 | 2.8 | 338 | 7.4 |
| Missing | 82 | 1.8 | 118 | 2.5 |
| Total | 4,584 | - | 4,584 | - |
Figure 2:Collinear relationship between B. splendens and O. latipes. Green represents the chromosomes of B. splendens and the other multicolor represent the chromosomes of O. latipes.
Figure 3:Five gene families with prominent expansion in B. splendens when compared with 109 other species including NLRP3 (K12800), TRIM47 (K12023), CLCN7 (K05016), AVPR2 (K04228), and MGAM (K12047) using the KEGG database.
Figure 4:The gene phylogenetic tree of NLRP3 gene family (KO: K12800) using the genes of B. splendens and other species. Clade 1 and clade 2 show two prominent expansion subfamilies of B. splendens.