| Literature DB >> 31681425 |
Liandong Yang1, Ying Wang2, Tai Wang3, Shengchang Duan4, Yang Dong4, Yanping Zhang3, Shunping He1,5.
Abstract
Cobitoidea is one of the two superfamilies in Cypriniformes; however, few genomes have been sequenced for Cobitoidea fishes. Here, we obtained a total of 252.90 Gb of short Illumina reads and 31.60 Gb of long PacBio Sequel reads, representing approximate genome coverage of 256× and 50×, respectively. The final assembled genome is about 583.47 Mb with contig N50 sizes of 2.87 Mb, which accounts for 91.44% of the estimated genome size of 638.07 Mb. Using Hi-C-based chromatin contact maps, 99.31% of the genome assembly was placed into 25 chromosomes, and the N50 is 22.3 Mb. The gene annotation completeness was evaluated by BUSCO, and 2,470 of the 2,586 conserved genes (95.5%) could be found in our assembly. Repetitive elements were calculated to reach 33.08% of the whole genome. Moreover, we identified 25,406 protein-coding genes, of which 92.59% have been functionally annotated. This genome assembly will be a valuable genomic resource to understand the biology of the Tibetan loaches and will also set a stage for comparative analysis of the classification, diversification, and adaptation of fishes in Cobitoidea.Entities:
Keywords: PacBio sequencing; Triplophysa siluroides; adaptation; evolution; genome assembly
Year: 2019 PMID: 31681425 PMCID: PMC6807559 DOI: 10.3389/fgene.2019.00991
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Figure 1Photograph of the Tibetan loach, Triplophysa siluroides.
Summary of genome assembly of T. siluroides.
| Terms | Size (bp) | Number |
|---|---|---|
| N90 | 452,639 | 251 |
| N80 | 881,594 | 163 |
| N70 | 1,446,248 | 112 |
| N60 | 2,005,727 | 77 |
| N50 | 2,872,994 | 53 |
| Max length | 14,649,642 | — |
| Total length | 583,471,586 | — |
| Total number | — | 1,004 |
| Total number (≥1 kb) | — | 1,004 |
| Total number (≥5 kb) | 1,001 |
Figure 2Hi-C interaction heat map showing the clustering of the primary hybrid contigs into 25 sets of chromosome-scale superscaffolds.
Chromosome metrics before and after Hi-C scaffolding.
| Terms | Contig (original) | Scaffold (Hi-C) |
|---|---|---|
| Number | 1,004 | 856 |
| N50 | 2,872,994 | 22,312,937 |
| Total length | 582,350,959 | 578,738,912 |
Figure 3Genome evolution. (A) A Venn diagram of the orthologous gene families shared among five ray-finned fish genomes (Danio rerio, Hippocampus comes, Takifugu rubripes, Triplophysa siluroides, Xiphophorus maculatus). (B) Phylogeny of the 14 ray-finned fishes with the spotted gar as the outgroup. The bootstrap support value for the topology is 100.