| Literature DB >> 31330852 |
Weiwei Chen1,2, Chao Bian2, Xinxin You2, Jia Li2, Lizhen Ye2, Zhengyong Wen1, Yunyun Lv1,2, Xinhui Zhang2, Junmin Xu2,3, Shaosen Yang2, Ruobo Gu3, Xueqiang Lin2, Qiong Shi4,5,6.
Abstract
Limbs originated from paired fish fins are an important innovation in Gnathostomata. Many studies have focused on limb development-related genes, of which the T-box transcription factor 4 gene (tbx4) has been considered as one of the most essential factors in the regulation of the hindlimb development. We previously confirmed pelvic fin loss in tbx4-knockout zebrafish. Here, we report a high-quality genome assembly of the Japanese eel (Anguilla japonica), which is an economically important fish without pelvic fins. The assembled genome is 1.13 Gb in size, with a scaffold N50 of 1.03 Mb. In addition, we collected 24 tbx4 sequences from 22 teleost fishes to explore the correlation between tbx4 and pelvic fin evolution. However, we observed complete exon structures of tbx4 in several pelvic-fin-loss species such as Ocean sunfish (Mola mola) and ricefield eel (Monopterus albus). More interestingly, an inversion of a special tbx4 gene cluster (brip1-tbx4-tbx2b- bcas3) occurred twice independently, which coincides with the presence of fin spines. A nonsynonymous mutation (M82L) was identified in the nuclear localization sequence (NLS) of the Japanese eel tbx4. We also examined variation and loss of hindlimb enhancer B (HLEB), which may account for pelvic fin loss in Tetraodontidae and Diodontidae. In summary, we generated a genome assembly of the Japanese eel, which provides a valuable genomic resource to study the evolution of fish tbx4 and helps elucidate the mechanism of pelvic fin loss in teleost fishes. Our comparative genomic studies, revealed for the first time a potential correlation between the tbx4 gene cluster and the evolutionary development of toxic fin spines. Because fin spines in teleosts are usually venoms, this tbx4 gene cluster may facilitate the genetic engineering of toxin-related marine drugs.Entities:
Keywords: Japanese eel (Anguilla japonica); fin spine; genome sequencing and assembly; pelvic fin; tbx4; tbx4 gene cluster; teleost fish
Mesh:
Substances:
Year: 2019 PMID: 31330852 PMCID: PMC6669545 DOI: 10.3390/md17070426
Source DB: PubMed Journal: Mar Drugs ISSN: 1660-3397 Impact factor: 5.118
Figure 1A 17-mer distribution of the Japanese eel genome sequencing. Only the sequencing data from short-insert libraries (500 and 800 bp) were used for the k-mer analysis. The x-axis is the sequencing depth of each unique 17-mer, and the y-axis is the percentage of these unique 17-mers. The peak depth (K_depth) is 37, and the corresponding k-mer number (N) is 37,982,773,125. We therefore calculated the genome size (G) to be ~1.03 Gb based on the following formula [12]: G=N/K_depth.
Summary of the assembling results.
| Step | Software | Contig N50 (bp) | Scaffold N50 (bp) | Contig number | Scaffold number | Total length |
|---|---|---|---|---|---|---|
| Primary assembling | SOAPdenovo | 1,999 | 383,798 | 1,227,464 | 462,272 | 1,167,219,893 |
| Gap filling | krskgf | 3,868 | 375,823 | 850,121 | 462,272 | 1,150,479,312 |
| Gapclose1.12 | 5,372 | 376,296 | 761,523 | 462,272 | 1,154,146,689 | |
| Gapclose1.10 | 10,215 | 376,491 | 624,151 | 462,272 | 1,154,798,407 | |
| Scaffold extending | SSPACE | 10,236 | 858,288 | 608,352 | 351,879 | 1,228,736,536 |
A BUSCO assessment of our Japanese eel assembly.
| Parameter | Number | Percentage (%) |
|---|---|---|
| Complete BUSCOs (C) | 3847 | 83.9 |
| Complete and single-copy BUSCOs (S) | 3346 | 73.0 |
| Complete and duplicated BUSCOs (D) | 501 | 10.9 |
| Fragmented BUSCOs (F) | 380 | 8.3 |
| Missing BUSCOs (M) | 357 | 7.8 |
| Total BUSCO groups searched (n) | 4584 | --- |
Figure 2Structures of the tbx4 genes in various vertebrate species. Green boxes and lines represent exons and introns, respectively. Numbers inside the boxes are the exact amino acid numbers, indicating their similarity among various species.
Figure 3Similarity of the T-box domain in the tbx4 genes of various vertebrate species. The blue color represents conserved sites. The yellow color represents the similarity >80% and white shows less conserved sites. The colored dots beneath the sequence alignment indicate conservation track, ranging from blue (non-conserved) to red (the most conserved). The red box highlights the NLS region of the tbx4 genes. A red arrow indicates a nonsynonymous mutation in the Japanese eel tbx4 gene.
Figure 4Visualization of the tbx4 NLS regions in different vertebrate species using Bioedit (Tom Hall Ibis Therapeutics, Carlsbad, CA, USA). Red characters represent the codons and amino acids of nonsynonymous mutations.
Figure 5Alignment of the TBX4 protein sequences of Japanese eel against zebrafish. The red box highlights the NLS regions (same as Figure 4).
Figure 6Phylogenetic and synteny comparisons of the tbx4 genes in vertebrates. The figure in the left is a Bayesian tree. Numbers on the branches are bootstrap supports (black) obtained from the phyML-3.1 reconstruction. Spotted gar was used as the outgroup. The figure in the right represents the synteny of tbx4. Distances between genes and the gene length are not drawn to scale. The red branches and the two black boxes highlight two remarkable inversions of the brip1-tbx4-tbx2b-bcas3 cluster in teleost species. The five-point star represents the determined pelvic fin loss in the examined species.
Figure 7The brip1-tbx4-tbx2b-bcas3 cluster. Colored boxes and lines represent genes and intergenic regions, respectively. The distance between two adjacent genes is indicated underneath the lines, while the length of exons is drawn to scale. Genes in the same orientation as tbx4 are marked above the horizontal lines; however, genes in the opposite orientation are placed below the lines.
Figure 8A VISTA plot to compare the HLEB sequences of Acanthopterygii fishes against the reported 873-bp HLEB from three-spined stickleback (Gasterosteus aculeatus). Sequence identity along the y-axis, ranging from 50% to 100%, is shown in 100-bp sliding windows across the examined region (x-axis, bp). The pink shadows stand for regions with <100-bp continuous bases at ≥70% identity.
Accession numbers for the 27 T-box family members and seven adjacent genes of tbx4.
| Gene | Species Name | Accession Number |
|---|---|---|
|
|
| AAH67719.1 |
|
|
| NP_001077044.1 |
|
|
| XP_021324416.1 |
|
|
| ADA61227.1 |
|
|
| Q07998.1 |
|
|
| XP_001343633.3 |
|
|
| XP_693121.1 |
|
|
| AAG48249.1 |
|
|
| AAM54074.1 |
|
|
| AAI65213.1 |
|
|
| AAI63460.1 |
|
|
| XP_003198807.1 |
|
|
| XP_017206601.2 |
|
|
| Q8AXX2.1 |
|
|
| AAF64322.1 |
|
|
| NP_001164070.1 |
|
|
| ACU00296.1 |
|
|
| AAH68364.1 |
|
|
| Q7ZTU9.4 |
|
|
| NP_001095140.2 |
|
|
| XP_002662050.2 |
|
|
| AAI62554.1 |
|
|
| Q9IAK8.2 |
|
|
| ADX53331.1 |
|
|
| P79742.1 |
|
|
| Q8JIS6.2 |
|
|
| JAQ45978.1 |
|
|
| Q90476.1 |
|
|
| SBP21433.1 |
|
|
| SBP60348.1 |
|
|
| SBP60348.1 |
|
|
| SBP60348.1 |
|
|
| SBP60348.1 |