| Literature DB >> 25075616 |
Shangwei Ning1, Zuxianglan Zhao1, Jingrun Ye1, Peng Wang1, Hui Zhi1, Ronghong Li1, Tingting Wang1, Jianjian Wang2, Lihua Wang2, Xia Li1.
Abstract
Large intergenic non-coding RNAs (lincRNAs) are a new class of functional transcripts, and aberrant expression of lincRNAs was associated with several human diseases. The genetic variants in lincRNA transcription factor binding sites (TFBSs) can change lincRNA expression, thereby affecting the susceptibility to human diseases. To identify and annotate these functional candidates, we have developed a database SNP@lincTFBS, which is devoted to the exploration and annotation of single nucleotide polymorphisms (SNPs) in potential TFBSs of human lincRNAs. We identified 6,665 SNPs in 6,614 conserved TFBSs of 2,423 human lincRNAs. In addition, with ChIPSeq dataset, we identified 139,576 SNPs in 304,517 transcription factor peaks of 4,813 lincRNAs. We also performed comprehensive annotation for these SNPs using 1000 Genomes Project datasets across 11 populations. Moreover, one of the distinctive features of SNP@lincTFBS is the collection of disease-associated SNPs in the lincRNA TFBSs and SNPs in the TFBSs of disease-associated lincRNAs. The web interface enables both flexible data searches and downloads. Quick search can be query of lincRNA name, SNP identifier, or transcription factor name. SNP@lincTFBS provides significant advances in identification of disease-associated lincRNA variants and improved convenience to interpret the discrepant expression of lincRNAs. The SNP@lincTFBS database is available at http://bioinfo.hrbmu.edu.cn/SNP_lincTFBS.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25075616 PMCID: PMC4116217 DOI: 10.1371/journal.pone.0103851
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Architecture of SNP@lincTFBS.
Figure 2SNPs in human lincRNA TFBSs.
(A) The number distribution of lincRNAs classified as chromosomes. Blue bars represent all lincRNAs. Red bars represent lincRNAs have TFBSs in their promoter regions. Green bars represent lincRNAs have SNPs in their TFBSs. (B) Statistics of lincRNA TFBSs with SNPs for each transcription factor. The quantity of lincRNA TFBSs for each transcription factor (left). The quantity of lincRNA TFBSs with SNPs for each transcription factor (middle). Density of lincRNA TFBSs with SNPs for each transcription factor (right). (C) Distribution of SNPs in lincRNA TFBSs with respect to distance to the lincRNAs. The x-axis displays the 1 kb window within 5 kb upstream to 1 kb downstream region of the start site of lincRNA and the y-axis displays the fraction of SNPs in lincRNA TFBSs located within this window.
Figure 3The homepage and an example of SNP@lincTFBS database.
Screenshot of the main search page and corresponding result page, search as lincRNA ENSG00000177640.
Disease-associated SNPs in lincRNA TFBSs.
| Disease or phenotype | lincRNA | SNP | PubMed ID |
| Multiple complex diseases | ENSG00000204092 | rs16868911 | 17554300 |
| Celiac disease | ENSG00000224099 | rs1542865 | 17558408 |
| Multiple complex diseases | ENSG00000226029 | rs11543230 | 17554300 |
| Suicide attempts in bipolar disorder | ENSG00000227336 | rs4886217 | 21423239 |
| Obesity (extreme) | ENSG00000228153 | rs17413714 | 21935397 |
| Multiple complex diseases | ENSG00000228590 | rs9309325 | 17554300 |
| Suicide attempts in bipolar disorder | ENSG00000228909 | rs7587562 | 21423239 |
| Multiple continuous traits in DGI samples | ENSG00000230812 | rs2716133 | 17463246 |
| Type II Diabetes Mellitus | ENSG00000233081 | rs12683158 | 17463248 |
| Multiple continuous traits in DGI samples | ENSG00000241884 | rs9856163 | 17463246 |
| Progressive supranuclear palsy | ENSG00000251009 | rs1545606 | 21685912 |
| Response to statin therapy | ENSG00000253111 | rs2001844 | 20339536 |
| Coronary heart disease | ENSG00000253111 | rs6982502 | 22319020 |
| Response to statin therapy | ENSG00000253111 | rs6982502 | 20339536 |
| Urinary metabolites | ENSG00000253184 | rs822249 | 21572414 |
| Urinary metabolites | ENSG00000253248 | rs1031282 | 21572414 |
| Alzheimer's disease | ENSG00000253583 | rs6472116 | 22005930 |
| Type II Diabetes Mellitus | ENSG00000254822 | rs12793795 | 17463248 |
| Major depressive disorder (broad) | ENSG00000259284 | rs8028149 | 20038947 |
| Childhood asthma | ENSG00000264968 | rs4065275 | 17611496 |
| Multiple complex diseases | ENSG00000267416 | rs12951337 | 17554300 |
| Serum urate | ENSG00000269290 | rs493573 | 21768215 |