| Literature DB >> 30564274 |
Wansheng Jiang1, Ying Qiu2,3, Xiaofu Pan1, Yuanwei Zhang1, Xiaoai Wang1, Yunyun Lv2,3, Chao Bian2,3, Jia Li3, Xinxin You2,3, Jieming Chen2,3, Kunfeng Yang1, Jinlong Yang4, Chao Sun1, Qian Liu1, Le Cheng4, Junxing Yang1, Qiong Shi2,3.
Abstract
A Yunnan-Guizhou Plateau fish, the Kanglang white minnow (Anabarilius grahami), is a typical "3E" (Endangered, Endemic, and Economic) species in China. Its distribution is limited to Fuxian Lake, the nation's second deepest lake, with a significant local economic value but a drastically declining wild population. This species has been evaluated as VU (Vulnerable) in the China Species Red List. As one of the "Four Famous Fish" in Yunnan province, the artificial breeding has been achieved since 2003. It has not only re-established its wild natural populations by reintroduction of the artificial breeding stocks, but also brought a wide and popular utilization of this species to the local fish farms. A. grahami has become one of the main native aquaculture species in Yunnan province, and the artificial production has been emerging in steady growth each year. To promote the conservation and sustainable utilization of this fish, we initiated its whole genome sequencing project using an Illumina Hiseq2500 platform. The assembled genome size of A. grahami is 1.006 Gb, accounting for 98.63% of the estimated genome size (1.020 Gb), with contig N50 and scaffold N50 values of 26.4 kb and 4.41 Mb, respectively. Approximately about 50.38% of the genome was repetitive. A total of 25,520 protein-coding genes were subsequently predicted. A phylogenetic tree based on 4,580 single-copy genes from A. grahami and 18 other cyprinids revealed three well-supported subclades within the Cyprinidae. This is the first inter-subfamily relationship of cyprinids at genome level, providing a simple yet useful framework for understanding the traditional but popular subfamily classification systems. Interestingly, a further population demography of A. grahami uncovered a historical relationship between this fish and Fuxian Lake, suggesting that range expansion or shrinkage of the habitat has had a remarkable impact on the population size of endemic plateau fishes. Additionally, a total of 33,836 simple sequence repeats (SSR) markers were identified, and 11 loci were evaluated for a preliminary genetic diversity analysis in this study, thus providing another useful genetic resource for studying this "3E" species.Entities:
Keywords: Cyprinidae; SSR; genome sequencing; plateau fish; population history
Year: 2018 PMID: 30564274 PMCID: PMC6288284 DOI: 10.3389/fgene.2018.00614
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Fish species selected for the phylogenetic analysis of Cyprinidae in the present study.
| No. | Scientific name | Subfamily classification∗ | Data type | Accession No.# |
|---|---|---|---|---|
| 1 | Hypophthalmichthyinae | Transcriptome | SRR342398 | |
| 2 | Hypophthalmichthyinae | Transcriptome | SRR3036336 | |
| 3 | Gobioninae | Transcriptome | SRR1185341 | |
| 4 | Gobioninae | Transcriptome | SRR1660441 | |
| 5 | Xenocyprinae | Transcriptome | SRR5351748 | |
| 6 | Cultrinae | Transcriptome | SRR959086 | |
| 7 | Cultrinae | Genome | ||
| 8 | Leuciscinae | Genome | PRJEB5920 | |
| 9 | Leuciscinae | Transcriptome | SRR5997852 | |
| 10 | Danioninae | Genome | PRJNA11776 | |
| 11 | Danioninae | Transcriptome | SRR5451065 | |
| 12 | Schizothoracinae | Transcriptome | SRR1583887 | |
| 13 | Schizothoracinae | Transcriptome | SRR1552917 | |
| 14 | Cyprininae | Genome | PRJNA202478 | |
| 15 | Cyprininae | Transcriptome | SRR1038441 | |
| 16 | Barbinae | Genome | PRJNA274017 | |
| 17 | Barbinae | Genome | PRJNA274017 | |
| 18 | Labeoninae | Transcriptome | SRP012989 | |
| 19 | Acheilognathinae | Transcriptome | SRR2043486 | |
FIGURE 1Genome-size estimation and genome annotation of A. grahami. (A) 17-mer frequency distribution of sequenced reads. (B) The number of predicted genes in A. grahami reciprocally homologous to other five representative fishes. (C) 22,406 predicted protein-coding genes with matching entries in the four popular public databases.
Summary of the genome assembly and annotation for A. grahami.
| Genome assembly | Parameter | Genome annotation | Parameter |
|---|---|---|---|
| Contig N50 size (kb) | 26.37 | Protein-coding gene | 25,520 |
| Scaffold N50 size (Mb) | 4.41 | Annotated functional gene | 22,406 (87.80%) |
| Estimated genome size (Gb) | 1.020 | Unannotated functional gene | 3,114 (12.20%) |
| Assembled genome size (Gb) | 1.006 | Repeat content | 50.38% |
| Genome coverage (×) | 188.88 | Average gene length (bp) | 9,152 |
| Longest scaffold (bp) | 18,552,664 | Average exon length (bp) | 197 |
FIGURE 2Inter-subfamily phylogenetic relationships within the Cyprinidae. The analysis was based on the 4,580 single-copy genes of two datasets (dataset I and II) using the ML and BI methods. Supporting values are presented as ML-I/ML-II/BI-I/BI-II at each node, where asterisks (∗) denote bootstrap value (ML) or posterior probability (BI) of 100%. The position of A. grahami is marked in bold, and a photo of a live specimen of this species is shown on the top left.
FIGURE 3Estimated population demography of A. grahami using the PSMC model. The bold red line represents the estimated effective population size changes of A. grahami, and the thin pink lines represent 100 bootstrap estimations. The demarcated blue blocks denote three main periods during the development of Fuxian Lake, including (I) lacus formation period (3–0.1 Ma), (II) large lake period (0.1–0.012 Ma), and (III) deepening lake period (0.012 Ma to present).
FIGURE 4A flowchart for the process of SSR loci identification, evaluation, and application in this study. The corresponding numbers of loci retained after each step are presented in brackets.
The average genetic parameters at 11 SSR loci of A. grahami in four different populations (n = 30 per population).
| Populations | EFCC1 | EFCC2 | Huoyanshan | Luchong | ||||
|---|---|---|---|---|---|---|---|---|
| Mean | SD | Mean | SD | Mean | SD | Mean | SD | |
| 3.273 | 1.348 | 3.091 | 1.221 | 3.000 | 0.894 | 2.818 | 0.982 | |
| 0.467 | 0.182 | 0.451 | 0.260 | 0.449 | 0.182 | 0.391 | 0.187 | |
| 0.411 | 0.150 | 0.362 | 0.178 | 0.390 | 0.128 | 0.354 | 0.146 | |
| 0.348 | 0.129 | 0.308 | 0.153 | 0.334 | 0.113 | 0.298 | 0.116 | |
| -0.132 | 0.123 | -0.195 | 0.194 | -0.137 | 0.151 | -0.090 | 0.199 | |
| 0.112 | 0.006∗ | 0.834 | 0.689 | |||||