| Literature DB >> 24651578 |
Jun Cui1, Hongdi Wang1, Shikai Liu2, Lifu Zhu3, Xuemei Qiu1, Zhiqiang Jiang1, Xiuli Wang1, Zhanjiang Liu2.
Abstract
Single nucleotide polymorphisms (SNPs) have become the marker of choice for genome-wide association studies in many species. High-throughput sequencing of RNA was developed primarily to analyze global gene expression, while it is an efficient way to discover SNPs from the expressed genes. In this study, we conducted transcriptome sequencing of the swimbladder of Takifugu rubripes using Illumina HiSeq2000 platform to identify gene-associated SNPs in the swimbladder. A total of 30,312,181 unique-mapped-reads were obtained from 44,736,850 raw reads. A total of 62,270 putative SNPs were discovered, which were located in 11,306 expressed genes and 2,246 scaffolds. The average minor allele frequency (MAF) of the SNPs was 0.26. GO and KEGG pathway analysis were conducted to analyze the genes containing SNPs. Validation of selected SNPs revealed that 54% of SNPs (26/48) were true SNPs. The results suggest that RNA-Seq is an efficient and cost-effective approach to discover gene-associated SNPs. In this study, a large number of SNPs were identified and these data will be useful resources for population genetic study, evolution analysis, resource assessment, genetic linkage analysis and genome-wide association studies.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24651578 PMCID: PMC3961390 DOI: 10.1371/journal.pone.0092502
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
The genome distribution of the mapped reads.
| Read distribution | Number of reads | Percentage |
| Exonic region | 18,120,867 | 59.78% |
| Intergenic region | 9,776,865 | 32.25% |
| Intronic region | 1,435,258 | 4.73% |
| Exon-intron junction | 176,167 | 0.58% |
Summary of SNP types identified from the T. rubirpes swimbladder.
| SNP type | Number |
| Homozygote | |
| A | 1,887 |
| C | 2,914 |
| G | 2,887 |
| T | 1,830 |
| Heterozygote | |
| G/T | 4,730 |
| A/C | 4,725 |
| A/G | 16,972 |
| G/C | 4,665 |
| A/T | 4,586 |
| C/T | 17,074 |
| Total | 62,270 |
Classification of putative SNPs.
| SNP classification | Number of putative SNPs |
| Inter-genic | 12,903 |
| Down_stream(+1 k) | 12,303 |
| Exon | 24,525 |
| Intron | 4,210 |
| Up_stream(−1 k) | 8,329 |
| Total | 62,270 |
Inter-genic SNPs were identified from regions between genes, while Down_stream(+1 k) and Up_stream(−1 k) represents SNPs identified from regions of 1 kb downstream and upstream of the genes.
Figure 1Distribution of minor allele frequencies (MAFs) of SNPs identified from the T. rubirpes swimbladder.
The X-axis represents the SNP minor allele frequency in percentage, while the Y-axis represents the number of SNPs with given minor allele frequency
Figure 2SNP distribution among genes.
The X-axis represents gene size (number of SNPs per gene)
Figure 3SNP distribution among scaffolds.
The X-axis represents scaffold size (number of SNPs per scaffold)
Figure 4Gene Ontology of genes containing putative SNPs.
KEGG biochemical mappings for genes containing SNPs.
| KEGG categories | Number of genes |
|
| |
| Amino Acid Metabolism | 344 |
| Biosynthesis of Polyketides and Nonribosomal Peptides | 4 |
| Biosynthesis of Secondary Metabolites | 69 |
| Carbohydrate Metabolism | 422 |
| Energy Metabolism | 162 |
| Glycan Biosynthesis and Metabolism | 150 |
| Lipid Metabolism | 331 |
| Metabolism of Cofactors and Vitamins | 130 |
| Metabolism of Other Amino Acids | 103 |
| Nucleotide Metabolism | 239 |
| Xenobiotics Biodegradation and Metabolism | 118 |
|
| |
| Folding, Sorting and Degradation | 266 |
| Replication and Repair | 161 |
| Transcription | 196 |
| Translation | 116 |
|
| |
| Membrane Transport | 39 |
| Signal Transduction | 1142 |
| Signaling Molecules and Interaction | 386 |
|
| |
| Behavior | 28 |
| Cell Communication | 522 |
| Cell Growth and Death | 365 |
| Cell Motility | 200 |
| Transport and Catabolism | 316 |
|
| |
| Circulatory System | 138 |
| Development | 187 |
| Endocrine System | 544 |
| Immune System | 1116 |
| Nervous System | 246 |
| Sensory System | 35 |
Identification of expressed Claudin genes containing SNPs.
| Ensembl Gene ID | Gene name | RPKM value | Number of SNPs |
| ENSTRUG00000018609 |
| 234.65 | 5 |
| ENSTRUG00000016497 |
| 75.65 | 5 |
| ENSTRUG00000007521 |
| 43.24 | 6 |
| ENSTRUG00000010140 |
| 14.33 | 3 |
| ENSTRUG00000004991 |
| 8.98 | 6 |
| ENSTRUG00000011829 |
| 6.15 | 4 |
| ENSTRUG00000015308 |
| 3.81 | 1 |
| ENSTRUG00000003031 |
| 2.89 | 2 |
| ENSTRUG00000010901 |
| 1.83 | 4 |
| ENSTRUG00000001287 |
| 1.63 | 4 |
| ENSTRUG00000013204 |
| 1.21 | 2 |
| ENSTRUG00000007366 |
| 0.78 | 3 |
| ENSTRUG00000011741 |
| 0.76 | 2 |
| ENSTRUG00000009832 |
| 0.31 | 1 |
| ENSTRUG00000016459 |
| 0.21 | 2 |
| ENSTRUG00000010378 |
| 0.14 | 1 |
Identification of expressed Wnt genes containing SNPs.
| Ensembl Gene ID | Gene name | RPKM value | Number of SNPs |
| ENSTRUG00000016453 |
| 48.59 | 1 |
| ENSTRUG00000001530 |
| 29.03 | 8 |
| ENSTRUG00000008614 |
| 28.52 | 7 |
| ENSTRUG00000000172 |
| 16.51 | 4 |
| ENSTRUG00000003640 |
| 11.5 | 14 |
| ENSTRUG00000016522 |
| 1.71 | 2 |
| ENSTRUG00000014284 |
| 1.38 | 3 |
| ENSTRUG00000012568 |
| 0.33 | 1 |
Summary of SNP validation.
| Ensembl Gene ID | Gene Name | Number of SNPs tested | Number of SNPs validated |
| ENSTRUG00000011255 | Translocase of outer mitochondrial membrane 20 homolog | 5 | 2 |
| ENSTRUG00000008698 | RAB9A, member RAS oncogene family | 7 | 4 |
| ENSTRUG00000014751 | Family with sequence similarity 46, member A | 5 | 3 |
| ENSTRUG00000009192 | Coiled-coil domain containing 47 | 5 | 2 |
| ENSTRUG00000006299 | Mitochondrial ribosomal protein L21 | 8 | 5 |
| ENSTRUG00000006704 | Calpain small subunit 1 | 5 | 3 |
| ENSTRUG00000004026 | C-type lectin domain family 11, member A | 6 | 3 |
| ENSTRUG00000014304 | Proliferating cell nuclear antigen | 7 | 4 |