| Literature DB >> 26925843 |
Deyou Ma1,2,3, Aijun Ma1,2, Zhihui Huang1,2, Guangning Wang1,2, Ting Wang1,2, Dandan Xia1,2, Benhe Ma1,2.
Abstract
BACKGROUND: Turbot Scophthalmus maximus is an economically important species extensively aquacultured in China. The genetic selection program is necessary and urgent for the sustainable development of this industry, requiring more and more genome background knowledge. Transcriptome sequencing is an excellent alternative way to identify transcripts involved in specific biological processes and exploit a considerable quantity of molecular makers when no genome sequences are available. In this study, a comprehensive transcript dataset for major tissues of S. maximus was produced on basis of an Illumina platform.Entities:
Mesh:
Substances:
Year: 2016 PMID: 26925843 PMCID: PMC4771204 DOI: 10.1371/journal.pone.0149414
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of Illumina transcriptome, assembly and annotation for Scophthalmus maximus.
| Raw results (after trimming) | Assembly results | Annotation results | |||
|---|---|---|---|---|---|
| Clean bases (G) | 8.844 | Contigs | 106,643 | Nr and Swiss-Prot annotations | 24052 |
| Read pairs | 44,219,773 | Unigenes (after eliminating redundancy) | 71,107 | COG hits | 37058 |
| Read length (bp) | 100 | Min-Max length of unigenes | 201–17,407 | GO mapped | 16540 |
| Average length of unigenes | 892 | KEGG hits | 11938 | ||
Fig 1E-value distribution of S. maximus transcriptome unigenes matched to Nr database.
Fig 2Top 20 hit species distribution based on BLASTp.
Fig 3COG classification of putative proteins for S. maximus transcriptome.
Fig 4Distribution of Gene Ontology (GO) functional categories (level 2) of transcripts for S. maximus.
(A) Cellular component; (B) Molecular function; (C) Biological process. Each annotated sequence is assigned at least one GO term. Numbers refer to percentage of assigned unigenes in each category.
Fig 5KEGG classification of non-redundant unigenes for S. maximus transcriptome.
Identified genes involved in sex determination and development in the present turbot transcriptome.
| Gene Name | Annotation | Query Name | Length | Expression quantity (♂/♀) | Q-value | Trend |
|---|---|---|---|---|---|---|
| Doublesex and mab-3-related transcription factor 1 | comp30078_c2_seq1 | 566 | 6.43 | 1.36E-10 | up | |
| Follicle stimulating hormone receptor II | comp34552_c0_seq1 | 3389 | 2.39 | 2.73E-70 | up | |
| SRY-box containing protein 9 | comp36634_c7_seq1 | 1879 | -0.20 | 0.47 | ||
| SRY-box containing protein 6-like | comp21968_c1_seq1 | 560 | 2.53 | 0.023 | ||
| Androgen receptor | comp34574_c1_seq1 | 411 | -0.51 | 0.69 | ||
| gonadal soma derived factor 1 | comp38118_c0_seq1 | 2047 | 1.33 | 1.4E-160 | up | |
| meiotic nuclear division protein 1 homolog | comp34969_c0_seq1 | 1598 | -0.75 | 2.7E-07 | ||
| Müllerian inihibiting substance | comp15802_c0_seq1 | 3092 | 0.22 | 0.477 | ||
| Cytosolic heat shock protein 90 alpha | comp18173_c0_seq1 | 294 | 0.07 | 1.00 | ||
| Axonemal dynein light intermediate polypeptide 1 | comp21140_c0_seq1 | 1007 | 8.30 | 5.61E-32 | up | |
| Ropporin-1-like protein-like isoform X2 | comp28799_c0_seq1 | 1720 | 3.26 | 3.45E-19 | up | |
| P450 aromatase | comp20558_c0_seq1 | 2026 | -7.64 | 5.1E-45 | down | |
| Zona pellucida glycoprotein 5 | comp20390_c0_seq1 | 1801 | -12.65 | 0 | down | |
| Zygote arrest protein 1-like | comp20043_c0_seq1 | 1446 | -12.54 | 0 | down | |
| Gonadotropin alpha | comp28377_c1_seq1 | 830 | -11.45 | 0 | down | |
| Growth differentiation factor 9 | comp30868_c0_seq1 | 3680 | -5.76 | 0 | down | |
| Steroidogenic acute regulatory protein | comp23962_c0_seq1 | 967 | 0.32 | 1 | ||
| StAR-related lipid transfer protein 5-like | comp35171_c2_seq1 | 5165 | -1.50 | 0 | down | |
| StAR-related lipid transfer protein 7-like | comp37065_c0_seq1 | 3481 | -2.00 | 0 | down | |
| SRY-box containing protein 17 | comp26556_c1_seq1 | 266 | 1.07 | 0.48 | ||
| SRY-box containing protein 19 | comp34381_c0_seq1 | 2961 | -2.04 | 0 | down | |
| Forkhead transcription factor L2 | comp34087_c3_seq1 | 1401 | -5.00 | 2.68E-46 | down | |
| Vitellogenin receptor isoform 1 | comp36664_c17_seq1 | 3808 | -3.35 | 0 | down | |
| Zona pellucida sperm-binding protein 4-like | comp30721_c0_seq1 | 2238 | -9.38 | 3.1E-122 | down | |
| Zona pellucida sperm-binding protein 3-like | comp29565_c0_seq1 | 1247 | -13.66 | 0 | down | |
| Zona pellucida sperm-binding protein | comp28111_c0_seq1 | 1390 | -12.61 | 0 | down | |
| Protein fem-1 homolog C-like | comp35674_c0_seq1 | 4180 | -2.02 | 1.2E-151 | down | |
| Frizzled-3-like | comp36698_c1_seq1 | 3514 | -1.84 | 5.9E-209 | down | |
| P43 5S RNA-binding protein-like | comp20349_c2_seq1 | 1515 | -8.56 | 0 | down | |
| Transcription factor IIIA-like | comp17728_c0_seq1 | 1282 | -14.12 | 0 | down | |
| Wee1-like protein kinase 2-like isoform X1 | comp36348_c0_seq1 | 1904 | -7.57 | 0 | down | |
| Ovarian cancer-associated gene 2 protein homolog | comp31026_c0_seq1 | 1363 | -5.28 | 2.2E-289 | down | |
| Cathepsin S-like | comp38095_c0_seq1 | 2038 | -8.92 | 0 | down | |
| Vasa | comp36885_c2_seq1 | 2658 | -0.21 | 0 | ||
| Doublesex and mab-3-related transcription factor 2 | comp30078_c0_seq1 | 1113 | -4.68 | 9.19E-07 | down | |
| Doublesex and mab-3-related transcription factor 3 | comp29037_c2_seq1 | 1392 | -1.55 | 0.00073 | down | |
| Doublesex and mab-3-related transcription factor 5 | comp556746_c0_seq1 | 291 | -1.51 | 0.519288 | ||
| SRY-box containing protein 8a | comp36634_c2_seq1 | 2437 | -1.54 | 2.35E-13 | down | |
| Sex hormone binding globulin | comp17530_c0_seq1 | 2212 | -1.33 | 4.1E-62 | down | |
| Steroid 11-beta-hydroxylase | comp248709_c0_seq1 | 422 | 1.49 | 0.23 |
Genes with Italics bold are novel.
Genes of interest for growth and muscle development in Scophthalmus maximus.
| Gene Name | Annotation | Query Name | Length | Expression quantity (♂/♀) | Q-value | Trend | |
|---|---|---|---|---|---|---|---|
| Growth hormone precursor | comp17564_c0_seq1 | 775 | -10.66 | 4.01E-123 | down | ||
| Growth hormone receptor | comp34224_c0_seq1 | 1177 | -0.78 | 0.04 | |||
| Growth hormone releasing hormone | comp374142_c0_seq1 | 325 | 2.66 | 0.10 | |||
| Growth hormone inhibiting hormone-like | comp23996_c0_seq1 | 1004 | 0.32 | 0.59 | |||
| Somatostatin receptor | comp24799_c1_seq1 | 1202 | -1.13 | 0.0039 | |||
| Insulin-like growth factors | comp38041_c8_seq1 | 3754 | -0.88 | 1.85E-45 | |||
| IGF receptor | comp33325_c0_seq1 | 3008 | -0.44 | 0.040 | |||
| IGF binding protein | comp32674_c4_seq1 | 1454 | -1.43 | 4.03E-51 | down | ||
| Myostatin | comp20346_c1_seq1 | 2996 | 0.16 | 0.60 | |||
| Myogenic regulatory factor 5 | comp37586_c17_seq1 | 1393 | -2.65 | 1.21E-34 | down | ||
| Fatty acid-binding protein | comp38300_c0_seq1 | 606 | -4.13 | 0 | down | ||
| 5-Hydroxytryptamine receptor | comp26484_c1_seq1 | 1741 | 1.66 | 0.48 | |||
| Prolactin | comp38254_c0_seq1 | 747 | -15.12 | 0 | down | ||
| Prolactin receptor | comp37125_c1_seq1 | 6331 | -2.36 | 2.99E-188 | down |
The intriguing immune-related genes identified in the turbot transcriptome.
| Gene Name | Annotation | Query Name | |
|---|---|---|---|
| Toll-like receptor 1 | comp32484_c3_seq1 | ||
| Toll-interacting protein | comp21400_c0_seq1;comp23286_c0_seq1;comp23286_c1_seq1 | ||
| PREDICTED: TNF receptor-associated factor 3-like | comp22885_c1_seq1;comp23509_c0_seq1 | ||
| Toll-like receptor 21 | comp32484_c3_seq1 | ||
| Toll-like receptor 22 | comp36502_c11_seq1 | ||
| Interleukin-34 | comp30033_c0_seq1 | ||
| Interleukin-18 receptor 1 | comp3046_c0_seq1;comp22532_c0_seq1 | ||
| Interleukin-1 receptor 1 | comp28977_c1_seq1 | ||
| PREDICTED: TNF receptor-associated factor 2-like | comp26413_c0_seq1;comp26467_c3_seq1;comp26467_c5_seq1 | ||
| PREDICTED: Complement component C9-like | comp37672_c0_seq1 | ||
| Selectin P | comp35827_c0_seq1 | ||
| Collectin 10/11/12 | comp31521_c1_seq1;comp31124_c0_seq1;comp30265_c0_seq1;comp31642_c0_seq1 |
Fig 6Frequency distribution of SSRs by motif length found in Scophthalmus maximus.
Fig 7Polyacrylamide gel electrophoresis for one SSR marker (comp15993_c0_seq1) in the 30 individuals.
Characterization of 17 polymorphic SSR loci in a turbot family of 90 individuals.
| Transcripts ID | Repeat motif | Primer sequences (5’-3’) | Length | |||||
|---|---|---|---|---|---|---|---|---|
| (GACCTG)3 | 129 | 58 | 5 | 0.75 | 0.79 | 0.65 | ||
| (CCTGGA)3 | 125 | 58 | 2 | 0.5 | 0.39 | 0.27 | ||
| (CTC)6 | 132 | 57 | 2 | 0.5 | 0.41 | 0.27 | ||
| (GAC)5 | 233 | 57 | 2 | 0.5 | 058 | 0.33 | ||
| (GGA)5 | 120 | 57 | 2 | 0.5 | 0.35 | 0.37 | ||
| (GCT)6 | 120 | 58 | 2 | 0.5 | 0.43 | 0.28 | ||
| (CGC)6 | 213 | 58 | 2 | 0. 5 | 0.53 | 0.31 | ||
| (GCTA)5 | 148 | 57 | 2 | 0.5 | 0.48 | 0.37 | ||
| (AGGTTC)6 | 135 | 59 | 2 | 0.5 | 0.42 | 0.28 | ||
| (TGC)6 | 143 | 58 | 2 | 0.5 | 0.46 | 0.29 | ||
| (AAT)7 | 178 | 58 | 2 | 0.5 | 0.35 | 0.25 | ||
| (CGTG)5 | 187 | 59 | 2 | 0.5 | 0.53 | 0.31 | ||
| (CGTG)6 | 207 | 58 | 2 | 0.75 | 0.64 | 0.5 | ||
| (TGG)7 | 126 | 58 | 2 | 0.5 | 0.49 | 0.3 | ||
| (AG)9 | 123 | 58 | 2 | 0.5 | 0.51 | 0.3 | ||
| (GAG)6 | 174 | 58 | 2 | 0.5 | 0.29 | 0.22 | ||
| (GCTC)5 | 182 | 58 | 4 | 1 | 1 | 0.7 | ||
Ta annealing temperature, He expected heterozygosity, Ho observed heterozygosity, Na observed number of alleles, PIC polymorphic information content.
Fig 8Distribution of putative single nucleotide polymorphisms (SNPs) containing in Scophthalmus maximus ESTs.
Fig 9Genotyping result using HRM with small amplicon.
(A) S10 genotyping; (B) S11 genotyping; (C) S14 genotyping.