| Literature DB >> 28327946 |
Jian Xu1, Chao Bian2,3,4, Kunci Chen5, Guiming Liu6, Yanliang Jiang1, Qing Luo5, Xinxin You2,3, Wenzhu Peng1,7, Jia Li3, Yu Huang3, Yunhai Yi3, Chuanju Dong1,8, Hua Deng9, Songhao Zhang1, Hanyuan Zhang1, Qiong Shi2,3,10, Peng Xu1,7.
Abstract
The Northern snakehead (Channa argus), a member of the Channidae family of the Perciformes, is an economically important freshwater fish native to East Asia. In North America, it has become notorious as an intentionally released invasive species. Its ability to breathe air with gills and migrate short distances over land makes it a good model for bimodal breath research. Therefore, recent research has focused on the identification of relevant candidate genes. Here, we performed whole genome sequencing of C. argus to construct its draft genome, aiming to offer useful information for further functional studies and identification of target genes related to its unusual facultative air breathing. Findings: We assembled the C. argus genome with a total of 140.3 Gb of raw reads, which were sequenced using the Illumina HiSeq2000 platform. The final draft genome assembly was approximately 615.3 Mb, with a contig N50 of 81.4 kb and scaffold N50 of 4.5 Mb. The identified repeat sequences account for 18.9% of the whole genome. The 19 877 protein-coding genes were predicted from the genome assembly, with an average of 10.5 exons per gene.Entities:
Keywords: Channa argus; annotation; gene prediction; genome assembly
Mesh:
Year: 2017 PMID: 28327946 PMCID: PMC5530311 DOI: 10.1093/gigascience/gix011
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure: 1:the Northern snakehead fish, Channa argus.
summary of the Channa argus genome assembly and annotation
| Genome assembly | |
|---|---|
| Contig N50 size (kb) | 81 |
| Contig number (>100 bp) | 29 146 |
| Scaffold N50 size (Mb) | 4.5 |
| Scaffold number (>100 bp) | 5297 |
| Total length (Mb) | 615.3 |
| Genome coverage (X) | 224.6 |
| The longest scaffold (bp) | 18 736 006 |
| Genome annotation | |
| Protein-coding gene number | 19 877 |
| Mean transcript length (kb) | 16.5 |
| Mean exons per gene | 10.5 |
| Mean exon length (bp) | 175.0 |
| Mean intron length (bp) | 1537.3 |
the detailed classification of repeat sequences of Channa argus
|
|
|
|
| |||||
|---|---|---|---|---|---|---|---|---|
|
|
|
|
|
|
|
|
|
|
| DNA | 17 984 515 | 2.92 | 6 784 728 | 1.10 | 25 663 752 | 4.17 | 35 435 946 | 5.76 |
| LINE | 16 799 343 | 2.73 | 17 563 763 | 2.85 | 54 890 557 | 8.92 | 60 651 866 | 9.86 |
| SINE | 4 512 385 | 0.73 | 0 | 0 | 6 672 552 | 1.08 | 9 026 285 | 1.47 |
| LTR | 4 421 728 | 0.72 | 3 031 607 | 0.49 | 24 144 657 | 3.92 | 26 983 318 | 4.39 |
| Other | 8125 | 0.001 | 0 | 0 | 0 | 0 | 8125 | 0.001 |
| Unknown | 0 | 0 | 0 | 0 | 9 413 375 | 1.53 | 9 413 375 | 1.53 |
| Total | 41 585 442 | 6.76 | 27 363 267 | 4.45 | 103 162 115 | 16.77 | 116 545 270 | 18.94 |
Figure: 2:genome evolution. (a) Orthologous gene families across five fish genomes (Snakehead fish, Zebrafish, Asian seabass, Mudskipper, and Arowana). (b) Phylogeny of ray-finned fishes (the arowana as the outgroup species).