| Literature DB >> 35539357 |
Hua Ye1,2, Zhengshi Zhang1,2, Chaowei Zhou1,2, Chengke Zhu1,2, Yuejing Yang1,2, Mengbin Xiang1,2, Xinghua Zhou1,2, Jian Zhou3, Hui Luo1,2.
Abstract
Schizothorax waltoni (S. waltoni) is one kind of the subfamily Schizothoracinae and an indigenous economic tetraploid fish to Tibet in China. It is rated as a vulnerable species in the Red List of China's Vertebrates, owing to overexploitation and biological invasion. S. waltoni plays an important role in ecology and local fishery economy, but little information is known about genetic diversity, local adaptation, immune system and so on. Functional gene identification and molecular marker development are the first and essential step for the following biological function and genetics studies. For this purpose, the transcriptome from pooled tissues of three adult S. waltoni was sequenced and analyzed. Using paired-end reads from the Illumina Hiseq4000 platform, 83 103 transcripts with an N50 length of 2337 bp were assembled, which could be further clustered into 66 975 unigenes with an N50 length of 2087 bp. The majority of the unigenes (58 934, 87.99%) were successfully annotated by 7 public databases, and 15 KEGG pathways of immune-related genes were identified for the following functional research. Furthermore, 19 497 putative simple sequence repeats (SSRs) of 1-6 bp unit length were detected from 14 690 unigenes (21.93%) with an average distribution density of 1 : 3.28 kb. We identified 3590 unigenes (5.36%) containing more than one SSR, providing abundant potential polymorphic markers in functional genes. This is the first reported high-throughput transcriptome analysis of S. waltoni, and it would provide valuable genetic resources for the functional genes involved in multiple biological processes, including the immune system, genetic conservation, and molecular marker-assisted breeding of S. waltoni. This journal is © The Royal Society of Chemistry.Entities:
Year: 2018 PMID: 35539357 PMCID: PMC9079874 DOI: 10.1039/c8ra00619a
Source DB: PubMed Journal: RSC Adv ISSN: 2046-2069 Impact factor: 4.036
Assembled transcripts and unigenes obtained from transcriptome analysis
| Terms | Transcripts | Unigenes |
|---|---|---|
| Total number | 83 103 | 66 975 |
| Shortest length (bp) | 224 | 224 |
| Longest length (bp) | 29 395 | 29 395 |
| Total length (bp) | 94 640 129 | 64 031 094 |
| Average length (bp) | 1139 | 956 |
| N50 length (bp) | 2337 | 2087 |
Fig. 1Length distribution of assembled unigenes. Assembled unigene numbers (y-axis) were plot against length interval (x-axis).
Transcripts annotation by various databases
| Database | Hit number | Percentage (%) |
|---|---|---|
| Nr | 35 510 | 53.02% |
| Nt | 57 301 | 85.56% |
| Swiss-prot | 28 877 | 43.12% |
| GO | 16 278 | 24.30% |
| KEGG | 25 742 | 38.44% |
| COG | 11 334 | 16.92% |
| Pfam | 21 692 | 32.39% |
| Total | 58 934 | 87.99% |
Fig. 2The length distribution of coding sequences (CDS). (A) E-value distribution, (B) similarity distribution, (C) species distribution.
Fig. 3COG functional classification of putative protein for Schizothorax waltoni transcriptome. Out of 66 975 unigenes, 11 334 unigenes were grouped into 25 COG categories. The letters on the x-axis represent different COG assortments.
Fig. 4Gene ontology (GO) annotation (2nd level GO terms) of Schizothorax waltoni transcriptome. Unigenes were annotated by gene ontology (GO) terms which belong to three main categories: biological process, cellular component, or molecular function.
Fig. 5Identified immune-related KEGG pathways of assembled unigenes of Schizothorax waltoni.
Repeat numbers and unit length distribution of putative SSR markers in the transcriptome
| Repeat numbers | Motif length | Total | Percent (%) | |||||
|---|---|---|---|---|---|---|---|---|
| Mono | Di | Tri | Tetra | Penta | Hexa | |||
| 4 | 0 | 0 | 0 | 0 | 150 | 82 | 232 | 1.19 |
| 5 | 0 | 0 | 1480 | 214 | 27 | 4 | 1725 | 8.85 |
| 6 | 0 | 2331 | 635 | 127 | 6 | 6 | 3105 | 15.93 |
| 7 | 0 | 1274 | 385 | 14 | 4 | 4 | 1681 | 8.62 |
| 8 | 0 | 785 | 221 | 13 | 4 | 2 | 1025 | 5.26 |
| 9 | 0 | 604 | 37 | 7 | 0 | 2 | 650 | 3.33 |
| 10 | 0 | 503 | 44 | 7 | 2 | 0 | 556 | 2.85 |
| 11 | 0 | 801 | 30 | 3 | 0 | 1 | 835 | 4.28 |
| 12 | 1213 | 421 | 14 | 4 | 3 | 2 | 1657 | 8.50 |
| 13 | 846 | 192 | 12 | 3 | 3 | 0 | 1056 | 5.42 |
| 14 | 668 | 191 | 12 | 1 | 1 | 0 | 873 | 4.48 |
| 15 | 477 | 200 | 7 | 5 | 2 | 0 | 691 | 3.54 |
| 16 | 374 | 159 | 6 | 8 | 2 | 0 | 549 | 2.82 |
| 17 | 287 | 161 | 1 | 1 | 1 | 1 | 452 | 2.32 |
| 18 | 238 | 114 | 0 | 3 | 0 | 0 | 355 | 1.82 |
| 19 | 162 | 97 | 1 | 6 | 1 | 0 | 267 | 1.37 |
| ≥20 | 2813 | 943 | 10 | 18 | 2 | 2 | 3788 | 19.43 |
| Total | 7078 | 8776 | 2895 | 434 | 208 | 106 | 19 497 | 100 |
| Percent (%) | 36.30 | 45.01 | 14.85 | 2.23 | 1.07 | 0.54 | 100 | |
Fig. 6Frequency distribution of the top twenty most abundant SSRs based on repeated nucleotide types. Each histogram represented one detected SSR type in transcriptome of Schizothorax waltoni.