| Literature DB >> 30572844 |
Ning Li1, Lisui Bao1, Tao Zhou1, Zihao Yuan1, Shikai Liu1, Rex Dunham2, Yuanning Li3, Kun Wang4, Xiaoyan Xu1, Yulin Jin1, Qifan Zeng1, Sen Gao1, Qiang Fu1, Yang Liu1, Yujia Yang1, Qi Li5, Axel Meyer6, Dongya Gao1, Zhanjiang Liu7.
Abstract
BACKGROUND: Walking catfish (Clarias batrachus) is a freshwater fish capable of air-breathing and locomotion on land. It usually inhabits various low-oxygen habitats, burrows inside the mudflat, and sometimes "walks" to search for suitable environments during summer. It has evolved accessory air-breathing organs for respiring air and corresponding mechanisms to survive in such challenging environments. Thereby, it serves as a great model for understanding adaptations to terrestrial life.Entities:
Keywords: Adaptation; Air-breathing organ; Duplication; Evolution; Fish; Genome
Mesh:
Year: 2018 PMID: 30572844 PMCID: PMC6302426 DOI: 10.1186/s12864-018-5355-9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary statistics for walking catfish (Clarias batrachus) genome sequencing, assembly and annotation
| Genome sequencing | ||||
| Library | # of reads | Read length | Trimmed data | Genome coverage |
| Paired-end 180 bp | 426 M | 100 bp | 39.2 Gb | 46X |
| Mate-pair 3 kb | 483 M | 100 bp | 32.1 Gb | 38X |
| Genome assembly | ||||
| Total | N50 | Longest | Assembled size | |
| Contigs | 87,962 | 19.0 kb | 194 kb | 747.7 Mb |
| Scaffolds | 10,041 | 361.2 kb | 2843 kb | 821.8 Mb |
| Genome annotation | ||||
| Number of genes | Repetitive elements content | |||
| 22,914 | 30.3% | |||
Fig. 1Annotation of the walking catfish genome. The distribution of repetitive elements and their contents are shown for Clarias batrachus (a) and Ictalurus punctatus (b). The average number of substitutions per site for each fragment was estimated using the Jukes-Cantor formula. SINE, short interspersed elements; LTR, long terminal repeats; LINE, long interspersed elements. c Venn diagram showing the number of homologues of the predicted genes from the C. batrachus genome in various databases: NR (non-redundant protein database), TrEMBL, and SwissProt
Fig. 2Comparisons of genomic features of Clarias batrachus with those non-air-breathing fish species. a Schematic presentation of comparative genomic analyses of C. batrachus against non-air-breathing teleost fishes. In the catfish lineage, the protein sequences of the C. batrachus and I. punctatus genomes were comparatively subtracted, resulting in the identification of 1854 genes specific to the C. batrachus genome; similarly, 10 additional non-air-breathing fishes were added to compare with C. batrachus genome, resulting in the identification of 110 genes that were only present in the C. batrachus genome. The names of these 110 C. batrachus specific genes were searched with all existing sequences from non-air-breathing fish species in the NCBI database, resulting in no genes specific to the C. batrachus genome. b Comparison of the values of dN/dS ratio among various fish species against the ancestor estimated from 150 randomly picked single-copy genes with 10,000 bootstrap replicates. The red line represents the average dN/dS value in C. batrachus, noting that it is the second most rapidly evolving genome. c The number of gene families exhibiting expansion (red) / contraction (blue). C. batrachus is marked with red solid circle, showing that it has the largest number of expanded gene families. D. rerio, Danio rerio; G. aculeatus, Gasterosteus aculeatus; T. nigroviridis, Tetraodon nigroviridis; T. rubripes, Takifugu rubripes; O. latipes, Oryzias latipes; G. morhua, Gadus morhua; A. mexicanus, Astyanax mexicanus; O. niloticus, Oreochromis niloticus; X. maculatus, Xiphophorus maculatus; P. formosa, Poecilia formosa; I. punctatus, Ictalurus punctatus
Fig. 3Maximum likelihood phylogenetic trees of expanded gene families in the Clarias batrachus genome. a Phylogenetic tree of myoglobin (mb) in vertebrates showing gene expansion of mb in the C. batrachus genome. The red solid circles represent the genes in the C. batrachus genome. Bootstrap support values (1000 replications) are indicated on the nodes. b Phylogenetic tree of the olfactory receptor related to class A G protein-coupled receptor (ora) gene family in vertebrates showing the expansion of ora1 in the C. batrachus genome with taste receptor of type 2 member 1 (T2R1) as an outgroup. The three clades (ora1-ora2, ora3-ora4 and ora5-ora6) formed from six members are indicated on the right of the figure. The dark green solid triangle represents the expansion of vomeronasal type 1 receptors (V1Rs) in mammals. The red solid triangle represents the 15 copies in the C. batrachus genome. The term “teleosts” here is used to indicate the non-air-breathing fish species discussed in this study. The detailed phylogenetic tree with species names and sequence names is displayed in the Additional file 1: Figure S2. c Phylogenetic tree of sulfotransferase 6b1 (sult6b1) in vertebrates showing gene expansion of sult6b1 in the C. batrachus genome. The red solid circles represent the genes in the C. batrachus genome. Bootstrap support values (1000 replications) are indicated on the nodes. D. rerio, Danio rerio; G. aculeatus, Gasterosteus aculeatus; T. nigroviridis, Tetraodon nigroviridis; T. rubripes, Takifugu rubripes; O. latipes, Oryzias latipes; G. morhua, Gadus morhua; A. mexicanus, Astyanax mexicanus; O. niloticus, Oreochromis niloticus; X. maculatus, Xiphophorus maculatus; P. formosa, Poecilia formosa; I. punctatus, Ictalurus punctatus; C. carpio, Cyprinus carpio; P. annectens, Protopterus annectens; M. musculus, Mus musculus; H. sapiens, Homo sapiens; X. tropicalis, Xenopus tropicalis; L. chalumnae, Latimeria chalumnae; A. carolinensis, Anolis carolinensis
Expression of myoglobin and sulfotransferase 6b1 genes (FPKM) in the air-breathing organ, gill, brain, head kidney and liver of Clarias batrachus as determined by analysis of RNA-Seq datasets. RNA-Seq datasets from air-breathing organ and gill were generated from this project; those from brain (SRR2057993), head kidney (SRR2057995), and liver (SRR2052655) were downloaded from NCBI. Each continuous bold and italic gene cluster indicated tandemly duplicated genes. FPKM, Fragments Per Kilobase per Million mapped fragments
| Gene name | Gene ID | Air-breathing organ | Gill | Brain | Head kidney | Liver |
|---|---|---|---|---|---|---|
| Myoglobin |
| 0 | 0 | 0 | 2.91 | 2.84 |
|
| 0 | 0 | 8.48 | 0.85 | 4.14 | |
|
| 0.25 | 0.18 | 3.14 | 1.57 | 1.53 | |
|
| 0.59 | 0.05 | 1.45 | 1.55 | 0 | |
|
| 0.09 | 0 | 4.48 | 9.88 | 2.92 | |
| g21774.t1 | 4.52 | 5.07 | 0 | 1.02 | 0 | |
| g23112.t1 | 0.46 | 0.76 | 0 | 0.58 | 0 | |
|
| 0 | 0 | 26.84 | 25.61 | 5.11 | |
|
| 0 | 0 | 15.71 | 12.23 | 3.31 | |
| g7580.t1 | 0 | 0 | 0 | 2.32 | 0 | |
|
| 0 | 0 | 17.4 | 23.41 | 3.31 | |
|
| 0 | 0 | 0 | 0.59 | 0 | |
|
| 0 | 0.56 | 16.23 | 13.52 | 3.27 | |
|
| 0 | 0.16 | 0 | 0 | 0 | |
|
| 6.18 | 139.88 | 838.66 | 643.35 | 145 | |
| Sulfotransferase 6b1 | g16408.t1 | 0.2 | 1.16 | 0 | 0 | 0 |
|
| 2.78 | 1.22 | 3.96 | 1.94 | 1.95 | |
|
| 23.9 | 28.5 | 13.36 | 18.93 | 2.09 | |
|
| 3.42 | 3.79 | 6.16 | 3.34 | 1.28 | |
|
| 2.91 | 2.42 | 1.57 | 1.62 | 1.28 | |
|
| 0 | 0.58 | 0 | 0 | 0 | |
|
| 0.54 | 0.53 | 0.57 | 1.43 | 1.75 | |
|
| 9.37 | 16.57 | 5.6 | 4.94 | 0.26 | |
|
| 24.61 | 14.81 | 89.49 | 44.81 | 0 | |
| g24152.t1 | 0.13 | 0.53 | 0 | 0 | 0 | |
| g24341.t1 | 0 | 0.06 | 0 | 0 | 0 | |
| g24380.t1 | 0 | 0.31 | 0 | 0 | 15.45 |
A list of genes differentially expressed between the gill and the air-breathing organ in Clarias batrachus
| Category name | Gene name | Gene ID | FDR |
|---|---|---|---|
| Acid-base balance | Solute carrier family 4 member 1 | g12078.t1 | 0 |
| Carbonic anhydrase | g8816.t1 | 0 | |
| Carbonic anhydrase 2 | g3019.t1 | 1.60E-07 | |
| Carbonic anhydrase 4 | g7074.t1 | 7.21E-07 | |
| Carbonic anhydrase 9 | g411.t1 | 5.37E-05 | |
| Carbonic anhydrase 6 | g456.t1 | 7.39E-04 | |
| Ion homeostasis | Sodium/potassium-transporting ATPase subunit alpha-1 | g15272.t1 | 0 |
| Sarcoplasmic/endoplasmic reticulum calcium ATPase 1 | g16367.t1 | 0 | |
| Sarcoplasmic/endoplasmic reticulum calcium ATPase 1 | g5688.t1 | 0 | |
| Sarcoplasmic/endoplasmic reticulum calcium ATPase 2 | g9634.t1 | 0 | |
| Sodium potassium-transporting ATPase subunit beta-233 | g9311.t1 | 0 | |
| Adenosylhomocysteinase 3 | g1970.t1 | 2.72E-10 | |
| Adenosylhomocysteinase 3 | g1011.t1 | 1.19E-08 | |
| Calsequestrin-1 | g3495.t1 | 5.23E-04 | |
| Calsequestrin-2 | g9400.t1 | 3.67E-02 | |
| Pendrin | g2970.t1 | 0 | |
| Sodium/hydrogen exchanger 3 | g3908.t1 | 0 | |
| Elastic fiber formation | Transforming growth factor beta-2 | g2430.t1 | 8.22E-03 |
| Fibulin-1 | g4468.t1 | 9.66E-08 | |
| Transforming growth factor beta-3 | g14668.t1 | 2.68E-02 | |
| Fibulin-2 | g20129.t1 | 5.73E-03 | |
| Bone morphogenetic protein 4 | g189.t1 | 8.79E-06 | |
| Fibronectin | g6260.t1 | 0 | |
| Fibronectin | g12205.t1 | 9.01E-03 | |
| Latent-transforming growth factor beta-binding protein 3 | g14988.t1 | 8.22E-03 | |
| Oxygen binding and transport | Hemoglobin subunit alpha | g20835.t1 | 0 |
| Hemoglobin subunit beta | g20836.t1 | 3.55E-11 | |
| Hemoglobin subunit alpha | g20837.t1 | 0 | |
| Hemoglobin subunit beta | g20838.t1 | 0 | |
| Hemoglobin subunit beta | g21168.t1 | 0 | |
| Hemoglobin subunit alpha | g21169.t1 | 0 | |
| Hemoglobin subunit beta | g21170.t1 | 0 | |
| Hemoglobin subunit alpha | g21171.t1 | 0 | |
| Angiogenesis | Sphingosine 1-phosphate receptor 1 | g5232.t1 | 7.89E-03 |
| Fibronectin | g6260.t1 | 8.79E-06 | |
| Semaphorin-3E | g5331.t1 | 6.86E-05 | |
| C-X-C chemokine receptor type 4 | g23603.t1 | 0 | |
| Neuropilin-1a | g3757.t1 | 5.00E-02 | |
| Cadherin-5 | g21848.t1 | 7.36E-03 | |
| BMP-binding endothelial regulator protein | g7031.t1 | 2.71E-02 | |
| Bone morphogenetic protein 4 | g189.t1 | 5.73E-03 | |
| Bone morphogenetic protein 5 | g11279.t1 | 2.76E-03 | |
| Bone morphogenetic protein 8A | g14806.t1 | 9.26E-03 | |
| Bone morphogenetic protein receptor type-1A | g14843.t1 | 9.45E-05 | |
| Transforming growth factor beta-2 | g2430.t1 | 8.22E-03 | |
| Transforming growth factor beta-3 | g14668.t1 | 2.68E-02 | |
| Rho-related GTP-binding protein RhoB | g5763.t1 | 1.23E-09 | |
| Thrombospondin-1 | g5480.t1 | 3.33E-02 | |
| Mothers against decapentaplegic homolog 6 | g4737.t1 | 6.93E-13 | |
| Mothers against decapentaplegic homolog 6 | g4739.t1 | 3.88E-11 | |
| Mothers against decapentaplegic homolog 3 | g3356.t1 | 4.28E-03 |