| Literature DB >> 32523949 |
Hecun Zou1, Lan-Xiang Wu1, Lihong Tan2,3, Fei-Fei Shang1, Hong-Hao Zhou1,3.
Abstract
Single-nucleotide variants (SNVs) are the most common genetic variants and universally present in the human genome. Genome-wide association studies (GWASs) have identified a great number of disease or trait-associated variants, many of which are located in non-coding regions. Long intergenic non-protein coding RNAs (lincRNAs) are the major subtype of long non-coding RNAs; lincRNAs play crucial roles in various disorders and cellular models via multiple mechanisms. With rapid growth in the number of the identified lincRNAs and genetic variants, there is great demand for an investigation of SNVs in lincRNAs. Hence, in this article, we mainly summarize the significant role of SNVs within human lincRNA regions. Some pivotal variants may serve as risk factors for the development of various disorders, especially cancer. They may also act as important regulatory signatures involved in the modulation of lincRNAs in a tissue- or disorder-specific manner. An increasing number of researches indicate that lincRNA variants would potentially provide additional options for genetic testing and disease risk assessment in the personalized medicine era.Entities:
Keywords: biological function; disease susceptibility; long/large intergenic non-protein coding RNA; single-nucleotide variant; transcription
Year: 2020 PMID: 32523949 PMCID: PMC7261909 DOI: 10.3389/fcell.2020.00347
Source DB: PubMed Journal: Front Cell Dev Biol ISSN: 2296-634X
FIGURE 1(A) Genomic distribution of single-nucleotide variants in cancers; a majority of cancer-associated single-nucleotide variants (SNVs) [single-nucleotide polymorphisms (SNPs)] are found in the intergenic or intronic regions, and only small numbers are located in protein-coding regions of the human genome. (B) Classification of long non-coding RNA (lncRNA) transcripts; long intergenic non-protein coding RNA (lincRNA) is a major subtype of lncRNA.
Overviews of trait-associated variants on the chr8q24 locus.
| LincRNA | Trait-associated variants | Diseases | Position | References |
| rs378854 | Adiposity | Intron | ||
| rs10505477 | Colorectal, gastric, and lung cancers | Intron | ||
| rs138042437 | Prostate cancer | Intron | ||
| rs6983267 | Colorectal cancer, endometrial carcinoma | Enhancer | ||
| rs6983267 | Prostate, breast, colon, and colorectal cancers; myeloid malignancies | Exon | ||
| rs7463708 | Prostate cancer | Enhancer | ||
| rs10086908 | Prostate cancer | Promoter | ||
| rs1456315, rs7463708 | Prostate cancer | Exon | ||
| rs13252298, rs1456315 | Colorectal cancer | Exon | ||
| rs183373024 | Prostate cancer | Exon | ||
| rs13281615 | Breast cancer | Promoter | ||
| rs2720709, rs2648875 | End-stage renal disease (ESRD) | Intron, intron | ||
| rs378854 | Prostate cancer | Promoter | ||
| rs13255292, rs4733601 | Diffuse large B cell lymphoma | Intron, downstream |
Overviews of other lincRNAs encompassing trait-associated variants.
| LincRNA | Trait-associated variants | Diseases | Position | References |
| rs3803662 | Breast cancer, lung cancer | Exon | ||
| rs6939340 | Neuroblastoma | Intron | ||
| rs145204276 | Hepatocellular carcinoma (HCC), colorectal, and gastric cancers | Promoter | ||
| rs217727 | Coronary artery disease, type 2 diabetes | Exon | ||
| rs2067051 | Pneumoconiosis, coronary artery disease | Exon | ||
| rs2107425 | Ovarian and breast cancers, hypertrophic cardiomyopathy | Intron | ||
| rs2839698 | HCC, bladder, colorectal, and gastric cancer | Exon | ||
| rs7763881, rs1041279 | HCC | Intron | ||
| rs11655237 | Pancreatic cancer | Exon | ||
| rs11752942 | Esophageal squamous cell carcinoma (ESCC) | Exon | ||
| rs1875147 | Leprosy | Intron | ||
| rs619586 | Pulmonary arterial hypertension (PAH), coronary atherosclerotic and congenital heart disease (CAD/CHD), breast cancer | Exon | ||
| rs1194338 | Colorectal cancer | Promoter | ||
| rs4102217 | HCC | Promoter | ||
| rs941576, rs34552516 | Type 1 diabetes (T1D) | Intron | ||
| rs2331291 | Myocardial infarction | Intron | ||
| rs1894720 | Paranoid schizophrenia | Exon | ||
| rs6434568, rs16834898 | Prostate cancer | Intron | ||
| rs11672691 | Prostate cancer | Promoter | ||
| rs965513 | Papillary thyroid carcinoma (PTC) | Intron | ||
| rs944289 | PTC, large-vessel ischemic stroke | Promoter | ||
| rs8506 | ESCC, gastric cancer | Exon | ||
| rs2288947, rs8105637 | Colorectal cancer, gastric cancer | Exon, intron |
FIGURE 2Graphical representations of the driving effect of variants [single-nucleotide variants (SNVs)] on long intergenic non-protein coding RNA (lincRNA) regions. (A) Genetic variants located in promoter may affect the binding of transcription factors and regulate the transcription of lincRNAs. (B) Genetic variants on enhancer affect the binding of transcription co-regulators and regulate lincRNA expression. (C) Genetic variants on intron may affect the process of splicing and stability of lincRNA conformation. (D) Genetic variants located in exons affect the lincRNA secondary structure, lincRNA stability, and interactive function. (E) Genetic variants on exons may affect the sponging of microRNAs (miRNAs).
Some databases that may be used for driver identification.
| Tools | Functional annotation | Link | References |
| LincSNP 2.0 | Store and annotate disease-associated SNVs in human lncRNAs and their transcription factor binding sites (TFBSs) | ||
| lncRNASNP2 | Comprehensive information of SNVs and mutations in lncRNAs, as well as their impacts on lncRNA structure and function | ||
| LncVar | A database of genetic variation associated with long non-coding genes in six species | ||
| The 1000 Genomes Project | The largest public catalog of human variation and genotype data | ||
| dbSNP | A public-domain archive for a broad collection of simple genetic polymorphisms | ||
| GWAS Catalog | A catalog that has provided data from published genome-wide association studies | ||
| GWAS4D | A web server that systematically evaluates GWAS signals and identifies context-specific regulatory variants |