| Literature DB >> 28158238 |
Yan-He Li1,2, Han-Ping Wang1, Hong Yao1, Paul O'Bryant1, Dean Rapp1, Liang Guo1, Eman A Waly1.
Abstract
Transcriptome sequencing could facilitate discovery of sex-biased genes, biological pathways and molecular markers, which could help clarify the molecular mechanism of sex determination and sexual dimorphism, and assist with selective breeding in aquaculture. Yellow perch has unique gonad system and sexual dimorphism and is an alternative model to study mechanism of sex determination, sexual dimorphism and sexual selection. In this study, we performed the de novo assembly of yellow perch gonads and muscle transcriptomes by high throughput Illumina sequencing. A total of 212,180 contigs were obtained, ranging from 127 to 64,876 bp, and N50 of 1,066 bp. The assembly RNA-Seq contigs (≥200bp) were then used for subsequent analyses, including annotation, pathway analysis, and microsatellites discovery. No female- and pseudo-male-biased genes were involved in any pathways while male-biased genes were involved in 29 pathways, and neuroactive ligand receptor interaction and enzyme of trypsin (enzyme code, EC: 3.4.21.4) was highly involved. Pyruvate kinase (enzyme code, EC: 2.7.1.40), which plays important roles in cell proliferation, was highly expressed in muscles. In addition, a total of 183,939 SNPs, 11,286 InDels and 41,479 microsatellites were identified. This study is the first report on transcriptome information in Percids, and provides rich resources for conducting further studies on understanding the molecular basis of sex determinations, sexual dimorphism, and sexual selection in fish, and for population studies and marker-assisted selection in Percids.Entities:
Mesh:
Year: 2017 PMID: 28158238 PMCID: PMC5291366 DOI: 10.1371/journal.pone.0171187
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Statistics of yellow perch transcriptome sequences assembly.
| Reads count | 548,699,676 |
| Matched reads count | 486,557,128 |
| Not matched reads count | 62,142,548 |
| Number of contigs | 212,180 |
| Reads in pairs | 378,170,740 |
| Broken paired reads | 95,485,390 |
| Average length of reads (bp) | 97,94 |
| N75(bp) | 419 |
| N50(bp) | 1,066 |
| N25(bp) | 2,916 |
| Minimum length (bp) | 127 |
| Maximum length (bp) | 64,876 |
| Average length (bp) | 695 |
| Count | 212,180 |
| Total length | 147,493,106 |
| N75(bp) | 392 |
| N50(bp) | 908 |
| N25(bp) | 2,371 |
| Minimum length (bp) | 74 |
| Maximum length (bp) | 54,724 |
| Average length (bp) | 636 |
| Count | 231,356 |
| Total length | 147,146,312 |
Fig 1Length distribution of contigs assembled.
Fig 2The Venn diagram of genes distributions in the six fish (zebrafish, tilapia, cichlid, guppy, medaka and amazon molly).
Fig 3Functional classification of yellow perch unigenes based on three main Gene Ontology (GO) categories: biological process, molecular function and cellular component.
KEGG biomedical mapping for yellow perch.
| KEGG categories represented | Number of unigene sequences | Number of mapped KO | |
|---|---|---|---|
| Metabolism | 4,046 | 2,984 | |
| Global and overiew maps | 1,547 | 1,239 | |
| Carbohydrate metabolism | 479 | 312 | |
| Energy metabolism | 195 | 130 | |
| Lipid metabolism | 439 | 281 | |
| Nucleotide metabolism | 240 | 174 | |
| Amino acid metabolism | 379 | 290 | |
| Metabolism of other amino acids | 109 | 67 | |
| Glycan biosynthesis and metabolism | 306 | 227 | |
| Metabolism of cofactors and vitamins | 165 | 130 | |
| Metabolism of terpenoids and polyketides | 36 | 32 | |
| Biosynthesis of other secondary metabolites | 46 | 32 | |
| Xenobiotics biodegradation and metabolism | 105 | 70 | |
| Genetic information processing | 1,287 | 1,054 | |
| Transcription | 189 | 158 | |
| Translation | 441 | 355 | |
| Folding, sorting and degradation | 441 | 346 | |
| Replication and repair | 216 | 195 | |
| Enviromental information processing | 3,096 | 2,030 | |
| Membrane transport | 28 | 28 | |
| Signal transduction | 2,635 | 1,679 | |
| Signaling moleculars and interaction | 433 | 323 | |
| Cellular processes | 1,689 | 1,149 | |
| Transport and catabolism | 491 | 342 | |
| Cell motility | 191 | 118 | |
| Cell growth and death | 422 | 325 | |
| Cellular community | 585 | 364 | |
| Organismal systems | 4,159 | 2,594 | |
| Immune system | 983 | 630 | |
| Endocrine system | 1,039 | 651 | |
| Circulatory system | 271 | 163 | |
| Digestive system | 400 | 244 | |
| Excretory system | 158 | 88 | |
| Nervous system | 787 | 499 | |
| Sensory system | 140 | 82 | |
| Development | 240 | 149 | |
| Enviromental adaptation | 141 | 88 | |
| Total | 14,277 | 9,811 | |
Fig 4Neuroactive ligand-receptor interaction KEGG map for male yellow perch.
The genes identified being involved in pathways were shown in green boxes.
Fig 5Neuroactive ligand-receptor interaction KEGG map for gonads of yellow perch.
The genes identified being involved in pathways were shown in green boxes.
Statistics of microsatellites identification in yellow perch.
| Microsatellites identified | |
|---|---|
| Transcriptome sequences | 41,479 |
| Di-nucleotide repeats | 31,996 |
| Tri-nucleotide repeats | 7,226 |
| Tetra-nucleotide repeats | 2,088 |
| Penta-nucleotide repeats | 112 |
| Hexa-nucleotide repeats | 57 |
| Number of microsatellites containing sequences | 18,210 |
| NF specifically expressed contigs | 0 |
| any nucleotide repeats | 0 |
| NM specifically expressed contigs | 133 |
| Di-nucleotide repeats | 110 |
| Tri-nucleotide repeats | 15 |
| Tetra-nucleotide repeats | 8 |
| Number of microsatellites containing sequences | 110 |
| PM specifically expressed contigs | 4 |
| Di-nucleotide repeats | 4 |
| Number of microsatellites containing sequences | 4 |
| Number of microsatellites with sufficient flanking sequence (≥ 50 bp) for PCR primer design | |
| Muscles specifically expressed contigs | 93 |
| Di-nucleotide repeats | 76 |
| Tri-nucleotide repeats | 12 |
| Tetra-nucleotide repeats | 3 |
| Penta-nucleotide repeats | 1 |
| Hexa-nucleotide repeats | 1 |
| Number of microsatellites containing sequences | 74 |
| Gonads specifically expressed contigs | 1,036 |
| Di-nucleotide repeats | 824 |
| Tri-nucleotide repeats | 159 |
| Tetra-nucleotide repeats | 50 |
| Penta-nucleotide repeats | 3 |
| Number of microsatellites containing sequences | 847 |
The primer sequences and their polymorphism assessment results.
| Locus name | Primer sequences (5′−3′) | Ta(°C) | Observed no. of alleles | Size range(bp) | N | HObs | HExp | PIC |
|---|---|---|---|---|---|---|---|---|
| YPM 6 | 55 | 4 | 187–208 | 32 | 0.156 | 0.205 | 0.195 | |
| YPM27 | 55 | 8 | 108–205 | 25 | 0.92 | 0.876 | 0.842 | |
| YPM41 | 55 | 4 | 210–231 | 31 | 0.387 | 0.342 | 0.32 | |
| YPM30 | 55 | 11 | 174–210 | 31 | 1 | 0.705 | 0.645 | |
| YPM20 | 55 | 6 | 246–276 | 31 | 0.548 | 0.499 | 0.464 | |
| YPM34 | 55 | 2 | 190–206 | 32 | 0.406 | 0.329 | 0.271 | |
| YPM46 | 55 | 14 | 173–218 | 28 | 0.929 | 0.89 | 0.861 | |
| YPM17 | 55 | na | na | na | na | na | na | |