| Literature DB >> 31610627 |
Young-Sup Lee1, Kyung-Hye Won1, Jae-Don Oh1, Donghyun Shin1.
Abstract
We sought the novel concept, transcript capacity (TC) and analyzed TC. Our approach to estimate TC was through an in silico method. TC refers to the capacity that a transcript exerts in a cell as enzyme or protein function after translation. We used the genome-wide association study (GWAS) beta effect and transcription level in RNA-sequencing to estimate TC. The trait was body fat percent and the transcript reads were obtained from the human protein atlas. The assumption was that the GWAS beta effect is the gene's effect and TC was related to the corresponding gene effect and transcript reads. Further, we surveyed gene ontology (GO) in the highest TC and the lowest TC genes. The most frequent GOs with the highest TC were neuronal-related and cell projection organization related. The most frequent GOs with the lowest TC were wound-healing related and embryo development related. We expect that our analysis contributes to estimating TC in the diverse species and playing a benevolent role to the new bioinformatic analysis.Entities:
Keywords: RNA-seq; fat; genome-wide association study; in silico method; transcript capacity
Year: 2019 PMID: 31610627 PMCID: PMC6808637 DOI: 10.5808/GI.2019.17.3.e31
Source DB: PubMed Journal: Genomics Inform ISSN: 1598-866X
Fig. 1.The flow chart of the analysis. The transcript capacity was calculated in silico using genome-wide association study (GWAS) and RNA-sequencing (RNA-seq). SNP, single nucleotide polymorphism; TA, transcript activity (synonym: transcript capacity); TPM, transcript per million reads.
Fig. 2.Transcript reads per million (TPM) versus genome-wide association study (GWAS) beta effect. Larger beta effect genes had a smaller TPM across the board.
Fig. 3.The Manhattan plot of–log10(p-value) including total analyzed single nucleotide polymorphisms (SNPs) (A) and transcript capacity (TC) (B) under p-value < 0.05 across chromosomes. The P-value was obtained from the genome-wide association study (GWAS) of the SNPs and those encompassing genes. TC was calculated using the GWAS beta effect and transcript per million reads.
Summary statistics of beta effect, TPM, and TC in our analysis
| Min | Max | Average | SD | |
|---|---|---|---|---|
| Beta | –0.9378 | 1.326 | 0.004 | 0.387 |
| TPM | 0.1 | 1864.5 | 19.855 | 70.551 |
| TC | 0.000146 | 7.483 | 0.371 | 0.857 |
Beta, GWAS beta effect; TPM, transcript per million reads; TC, transcript capacity.
Genes with the lowest p-values (cutoff 0.0001) and their TC estimation
| CHROM | SNP | Position | Beta | p-value | Gene | TPM | TC |
|---|---|---|---|---|---|---|---|
| 1 | SNP_A-2261105 | 56496439 | –0.49 | 0.00000569 | 147.9 | 0.003 | |
| 1 | SNP_A-2137301 | 77704683 | 0.39 | 0.00000908 | 45 | 0.009 | |
| 4 | SNP_A-4203089 | 134200024 | 0.4 | 0.0000107 | 1.6 | 0.25 | |
| 5 | SNP_A-1821378 | 75406064 | 0.43 | 0.0000138 | 37.7 | 0.01 | |
| 1 | SNP_A-1848974 | 56496993 | –0.46 | 0.0000169 | 147.9 | 0.003 | |
| 1 | SNP_A-2302611 | 56520770 | –0.38 | 0.0000176 | 147.9 | 0.003 | |
| 1 | SNP_A-2276931 | 235264645 | 0.38 | 0.0000191 | 12.7 | 0.03 | |
| 5 | SNP_A-4258112 | 75370070 | 0.43 | 0.0000201 | 37.7 | 0.01 | |
| 5 | SNP_A-4193701 | 75413307 | 0.42 | 0.0000233 | 37.7 | 0.01 | |
| 14 | SNP_A-2203004 | 32144721 | 0.54 | 0.0000247 | 26.9 | 0.02 | |
| 5 | SNP_A-2111116 | 171218237 | –0.42 | 0.0000335 | 0.2 | 2.12 | |
| 22 | SNP_A-2212209 | 42624987 | 0.7 | 0.0000391 | 337.7 | 0.002 | |
| 11 | SNP_A-1946877 | 114585252 | 0.36 | 0.000041 | 0.1 | 3.6 | |
| 14 | SNP_A-2203008 | 32145585 | 0.41 | 0.0000533 | 26.9 | 0.015 | |
| 10 | SNP_A-4304897 | 77131018 | –0.46 | 0.0000558 | 11.1 | 0.04 | |
| 16 | SNP_A-2161617 | 78907824 | –0.45 | 0.0000603 | 10.5 | 0.04 | |
| 6 | SNP_A-4211857 | 96568889 | 0.58 | 0.0000811 | 41.6 | 0.014 | |
| 2 | SNP_A-2266417 | 75681594 | –0.35 | 0.0000813 | 35 | 0.01 | |
| 2 | SNP_A-2266417 | 75681594 | –0.35 | 0.0000813 | 10.1 | 0.03 | |
| 2 | SNP_A-4257172 | 46029907 | 0.38 | 0.000082 | 5 | 0.08 | |
| 5 | SNP_A-1943727 | 75381858 | 0.4 | 0.0000879 | 37.7 | 0.01 | |
| 5 | SNP_A-2283204 | 75433640 | 0.39 | 0.0000927 | 37.7 | 0.01 | |
| 1 | SNP_A-1827525 | 216928377 | –0.34 | 0.0000937 | 0.4 | 0.85 |
The estimation of TC and TPM belongs to the corresponding gene.
TC, transcript capacity; CHROM, chromosome name; SNP, single nucleotide polymorphism; TPM, transcript reads per million.
GO for the highest (as TC values) top 1% of the genes
| Term | Count | p-value | Genes | Fold enrichment |
|---|---|---|---|---|
| GO:0007399~nervous system development | 29 | 0.000000124 | 2.89 | |
| GO:0048858~cell projection morphogenesis | 16 | 0.00000257 | 4.34 | |
| GO:0032990~cell part morphogenesis | 16 | 0.00000349 | 4.23 | |
| GO:0030030~cell projection organization | 20 | 0.00000418 | 3.32 | |
| GO:0032989~cellular component morphogenesis | 18 | 0.000075 | 2.95 | |
| GO:0035082~axoneme assembly | 5 | 0.0000991 | 20.4 | |
| GO:0048666~neuron development | 15 | 0.000104 | 3.36 | |
| GO:0000902~cell morphogenesis | 17 | 0.000123 | 2.96 | |
| GO:0022008~neurogenesis | 18 | 0.000227 | 2.69 | |
| GO:0048699~generation of neurons | 17 | 0.000346 | 2.71 | |
| GO:0042384~cilium assembly | 7 | 0.000355 | 7.35 | |
| GO:0030182~neuron differentiation | 16 | 0.00037 | 2.82 | |
| GO:0010927~cellular component assembly involved in morphogenesis | 8 | 0.000385 | 5.88 | |
| GO:0048468~cell development | 21 | 0.000399 | 2.31 | |
| GO:0007268~chemical synaptic transmission | 11 | 0.000405 | 3.94 | |
| GO:0098916~anterograde trans-synaptic signaling | 11 | 0.000405 | 3.94 | |
| GO:0099536~synaptic signaling | 11 | 0.000405 | 3.94 | |
| GO:0099537~trans-synaptic signaling | 11 | 0.000405 | 3.94 | |
| GO:0001578~microtubule bundle formation | 5 | 0.000443 | 13.86 | |
| GO:0044782~cilium organization | 7 | 0.000593 | 6.67 | |
| GO:0030031~cell projection assembly | 9 | 0.000636 | 4.64 | |
| GO:0060271~cilium morphogenesis | 7 | 0.000922 | 6.13 |
The major gene ontology (GO) terms were neuronal and cell projection organization-related.
TC, transcript capacity.
GO for the lowest (as TC values) bottom 1% of the genes
| Term | Count | p-value | Gene | Fold enrichment |
|---|---|---|---|---|
| GO:0007369~gastrulation | 6 | 0.0000374 | 15.22 | |
| GO:0042060~wound healing | 7 | 0.0000543 | 9.98 | |
| GO:0009611~response to wounding | 7 | 0.000136 | 8.45 | |
| GO:0009790~embryo development | 10 | 0.000165 | 4.69 | |
| GO:0035987~endodermal cell differentiation | 4 | 0.000206 | 33.52 | |
| GO:0001944~vasculature development | 8 | 0.000217 | 6.2 | |
| GO:0031589~cell-substrate adhesion | 6 | 0.000273 | 9.99 | |
| GO:0001706~endoderm formation | 4 | 0.000366 | 27.65 | |
| GO:0007492~endoderm development | 4 | 0.000751 | 21.69 |
The major gene ontology (GO) terms were wound-healing and embryo development.
TC, transcript capacity.