Literature DB >> 17099235

PolymiRTS Database: linking polymorphisms in microRNA target sites with complex traits.

Lei Bao1, Mi Zhou, Ligang Wu, Lu Lu, Dan Goldowitz, Robert W Williams, Yan Cui.   

Abstract

Polymorphism in microRNA Target Site (PolymiRTS) database is a collection of naturally occurring DNA variations in putative microRNA target sites. PolymiRTSs may affect gene expression and cause variations in complex phenotypes. The database integrates sequence polymorphism, phenotype and expression microarray data, and characterizes PolymiRTSs as potential candidates responsible for the quantitative trait locus (QTL) effects. It is a resource for studying PolymiRTSs and their implications in phenotypic variations. PolymiRTS database can be accessed at http://compbio.utmem.edu/miRSNP/.

Entities:  

Mesh:

Substances:

Year:  2006        PMID: 17099235      PMCID: PMC1669716          DOI: 10.1093/nar/gkl797

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Identification of causal genetic variants underlying complex traits is a major goal in genetic studies. Linkage analysis has long been used to discover chromosomal intervals harboring sequence variants that cause variations in quantitative traits. A typical quantitative trait locus (QTL) interval usually contains many genes ranging from several dozens to several hundreds, hence it is critical to be able to focus on genetic variants in the interval that are most likely to have functional impacts. Among them, nonsynonymous single nucleotide polymorphisms (SNPs) that alter protein sequences and regulatory polymorphisms that affect gene expression are natural high-priority candidates. Although regulatory polymorphisms are much more challenging to be identified and characterized, experimental and analytical tools are being actively developed for this purpose (1–4). Polymorphisms in miRNA target sites (PolymiRTS) represent a specific class of regulatory polymorphisms that may regulate posttranscriptional gene expression. A recent work reports that PolymiRTS can underlie the effects of physiological QTLs (pQTLs) that control classic higher order traits (5). MicroRNAs (miRNAs) are a family of small RNAs that pair to the transcripts of protein-coding genes and cause translational repression or mRNA destabilization (6,7). Hundreds of miRNAs have been identified in humans and mice and many of them have been shown to regulate their target genes that control diverse biological processes such as differentiation, proliferation, apoptosis and morphogenesis (7). PolymiRTS may affect the base-pairing process, hence affect the miRNA-mediated gene repression which in turn cause phenotypic variations. It has been found that miRNA-mediated target mRNA destabilization is widespread in mammals (8–10). Thus, for miRNAs acting by this mechanism, the PolymiRTS may lead to heritable variations in gene expression. Variations in gene expression across a population can be assessed by a newly developed genetical genomics approach (11–13). The genetical genomics approach treats gene expression level as quantitative trait. Linkage mapping is then used to discover the genetic loci regulating gene expression traits (eQTLs). PolymiRTS may induce a cis-acting eQTL that coincides with the gene's physical location. We proposed a simple conceptual model (Figure 1) that represents information flow from PolymiRTS to complex trait via cis-acting eQTL. Based on this model scheme, we create a database integrating SNP, phenotype and expression microarray data of human and mouse.
Figure 1

Conceptual QTL model. A PolymiRTS (triangle) may cause the gene expression variation (diamond) in segregating population and a cis-acting eQTL is observed. The variation in gene expression in turn may cause phenotype variation (rectangle) and a pQTL is observed.

Conceptual QTL model. A PolymiRTS (triangle) may cause the gene expression variation (diamond) in segregating population and a cis-acting eQTL is observed. The variation in gene expression in turn may cause phenotype variation (rectangle) and a pQTL is observed.

DATA SOURCES AND PROCESSING

Identifying and annotating PolymiRTS

The method of identifying and annotating PolymiRTS is shown in Figure 2. SNPs that are located in the 3′-untranslated regions (3′-UTRs) of all known genes by UCSC genome annotation (mouse: mm7 and human: hg18) (14) were extracted from dbSNP build 126 (15). Genomic locations of these SNPs were mapped onto mRNAs. For each SNP, we assessed whether its two alleles lead to different miRNA target sites. To be conservative, we only consider the 3′-UTR SNPs that affect the match to the seed region of the miRNA. Mature miRNA sequences were downloaded from the miRBase (16). We used the criteria of TargetScanS (17) in the prediction of miRNA sites. Basically, besides requiring a perfect Watson–Crick match to the seed 2–7 nt of miRNA, we further require that there is either a perfect match to the 8th nt of miRNA, or an anchor adenosine immediately downstream the 2–7 seed in the target. We assigned the PolymiRTS to one of the four classes: ‘D’ (an allele disrupts a conserved miRNA site), ‘N’ (a derived allele disrupts a nonconserved miRNA site), ‘C’ (a derived allele creates a new miRNA site) and ‘O’ (other cases when the ancestral allele can not be determined unambiguously). PolymiRTS of class ‘C’ may cause abnormal gene repression and PolymiRTS of class ‘D’ may cause loss of normal repression control. These two classes of PolymiRTS are most likely to have functional impacts. We used the pre-calculated 17-way Multiz alignments of vertebrate genomes to derive the annotations. For a miRNA site to be conserved, we require that it is present in at least two other vertebrate genomes in addition to the query genome. For mouse SNPs, their ancestral alleles were determined by mouse versus rat (rn3) genome alignment. For human SNPs, their ancestral alleles were determined by human versus chimpanzee (panTro1) genome alignment. Additionally, we categorized PolymiRTS with A/G alleles because they are supposed to be less deleterious with their ability to form G:U wobble base-pairs with miRNAs.
Figure 2

Methods of identifying and annotating PolymiRTS.

Methods of identifying and annotating PolymiRTS.

PolymiRTSs within cis-acting expression QTL (eQTL) intervals

The genes with both cis-acting eQTL and PolymiRTS are featured in the database. First, gene expression levels in cerebellum, hippocampus, striatum, eye, whole brain and hematopoietic cell (18) were assessed in recombinant inbred mouse strains (BXD) derived from two parental strains C57BL/6J and DBA/2J. Gene expression levels were treated as quantitative traits and were mapped onto genomic regions (eQTL) using standard marker regression. A gene is said to have a suggestive (significant) cis-acting eQTL, if the LOD (log of odds) peak location is within 10 MB from the gene's physical location and the LOD >2.8 (>4.3) (19). Second, gene expression levels in lymphoblastoid cells of 194 human individuals from 14 CEPH families (20) were downloaded from the GEO database (21) and the raw data were processed by using the RMA protocol (22). Genotypes for 1628 autosomal SNP markers were downloaded from The SNP Consortium database (23). We used Merlin (24) to remove genotype errors and perform family-based linkage analysis. A gene is said to have a cis-acting eQTL, if the LOD peak location is within 10 MB from the gene's physical location and the P-value is <0.05.

PolymiRTSs in physiological QTL (pQTL) intervals

We first mapped the QTLs (with a LOD >2.8) for more than 800 published BXD phenotypes (physiological/behavioral traits) (18). For each QTL, we linked it with genes that are physically located in the QTL interval and have at least one PolymiRTS. These genes are candidate causal genes underlying the pQTL.

DATABASE CONTENT AND ACCESS

Table 1 shows the major data fields for a typical PolymiRTS record. The users can access the database by several options. First, a web interface is implemented for browsing the entries. Second, a text search interface is designed for query by SNP ID, miRNA ID, GenBank accession, HUGO gene identifier and gene description. Third, a chromosome location search is offered so that the users can specify a genomic interval of interest and search all the PolymiRTSs within the interval. For mouse, we also provide the inbred strain comparison option. By combining strain comparison with the range search, the user can retrieve all the PolymiRTSs that are candidate variants underlying the pQTL identified by them. Finally, we provide flat file downloads for all the data.
Table 1

Field description

FieldDescription
LocationSNP location in the mRNA transcript
SNPIDLink to dbSNP
Wobble base-pairWhether the SNP can form a G:U wobble basepair with the miRNA. Y: Yes; N: No
Ancestral alleleIf applicable, ancestral allele is denoted
AlleleTwo alleles of the SNP in the mRNA transcript
StrainGenotypes of two mouse inbred strains to be compared
miRIDLink to miRBase
SupportOccurrence of miRNA site in other vertebrate genomes
Function ClassC: derived allele creates a new miRNA sites
N: derived allele disrupts a nonconserved miRNA site
D: allele disrupts conserved a miRNA site
O: other cases when the ancestral allele cannot be determined unambiguously
miRSiteSequence context of the miRNA site. Seed region are in capital letters and SNPs are highlighted in red
Field description

DISCUSSIONS AND FUTURE WORK

miRNAs regulate posttranscriptional gene expression through mRNA destabilization or translational repression. Destabilization of mRNA may cause variations in the transcript levels while translational repression does not (25–27). The effects of miRNAs that act through translational repression cannot be detected by expression microarrays. Therefore, in the PolymiRTS database, the sequence variations in the target sites of miRNAs that act through translational repression may be linked to pQTLs but not to eQTLs. A recent study shows that there are two broad categories of miRNA target sites: (i) 5′-dominant sites with sufficient 5′-pairing, and (ii) 3′-compensatory sites with insufficient 5′-pairing which require strong 3′-pairing (28). By using the TargetScanS algorithm alone, we are likely to miss many target sites from the second category. Because some mismatches in the 5′-pairing region of 3′-compensatory target sites are tolerated which is difficult to estimate the influence of SNPs on the function of 3′-compensatory target sites. Hence, at this stage we want to be conservative and focus on the major and well-studied category of 5′-dominant sites. Currently, we did not take miRNA gene expression pattern into consideration due to lack of large-scale miRNA expression data. However, in many cases, such information would be very valuable. For example, it is known that a set of genes evolutionarily avoid having miRNA target site in their 3′-UTRs (8). They are called miRNA antitargets. Obviously, creation of a new miRNA site in an antitarget is most likely to have severe consequences. Since antitargets tend to be highly and specifically expressed in the tissue where the miRNA is expressed (8), we can use this information to pick PolymiRTS in the putative antitargets and further prioritize strong functional candidates. With the anticipated accumulation of such miRNA profiling data, we would expect a much better annotation of PolymiRTS in the near future.
  29 in total

1.  dbSNP: the NCBI database of genetic variation.

Authors:  S T Sherry; M H Ward; M Kholodov; J Baker; L Phan; E M Smigielski; K Sirotkin
Journal:  Nucleic Acids Res       Date:  2001-01-01       Impact factor: 16.971

2.  Genetical genomics: the added value from segregation.

Authors:  R C Jansen; J P Nap
Journal:  Trends Genet       Date:  2001-07       Impact factor: 11.639

3.  Merlin--rapid analysis of dense genetic maps using sparse gene flow trees.

Authors:  Gonçalo R Abecasis; Stacey S Cherny; William O Cookson; Lon R Cardon
Journal:  Nat Genet       Date:  2001-12-03       Impact factor: 38.330

4.  The human genome browser at UCSC.

Authors:  W James Kent; Charles W Sugnet; Terrence S Furey; Krishna M Roskin; Tom H Pringle; Alan M Zahler; David Haussler
Journal:  Genome Res       Date:  2002-06       Impact factor: 9.043

5.  In vivo characterization of regulatory polymorphisms by allele-specific quantification of RNA polymerase loading.

Authors:  Julian C Knight; Brendan J Keating; Kirk A Rockett; Dominic P Kwiatkowski
Journal:  Nat Genet       Date:  2003-03-10       Impact factor: 38.330

6.  Detection of regulatory variation in mouse genes.

Authors:  Christopher R Cowles; Joel N Hirschhorn; David Altshuler; Eric S Lander
Journal:  Nat Genet       Date:  2002-10-15       Impact factor: 38.330

7.  A comparison of normalization methods for high density oligonucleotide array data based on variance and bias.

Authors:  B M Bolstad; R A Irizarry; M Astrand; T P Speed
Journal:  Bioinformatics       Date:  2003-01-22       Impact factor: 6.937

8.  The SNP Consortium website: past, present and future.

Authors:  Gudmundur A Thorisson; Lincoln D Stein
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

9.  A mutation creating a potential illegitimate microRNA target site in the myostatin gene affects muscularity in sheep.

Authors:  Alex Clop; Fabienne Marcq; Haruko Takeda; Dimitri Pirottin; Xavier Tordoir; Bernard Bibé; Jacques Bouix; Florian Caiment; Jean-Michel Elsen; Francis Eychenne; Catherine Larzul; Elisabeth Laville; Françoise Meish; Dragan Milenkovic; James Tobin; Carole Charlier; Michel Georges
Journal:  Nat Genet       Date:  2006-06-04       Impact factor: 38.330

10.  Relief of microRNA-mediated translational repression in human cells subjected to stress.

Authors:  Suvendra N Bhattacharyya; Regula Habermacher; Ursula Martine; Ellen I Closs; Witold Filipowicz
Journal:  Cell       Date:  2006-06-16       Impact factor: 41.582

View more
  77 in total

1.  Knowledge-based analysis of functional impacts of mutations in microRNA seed regions.

Authors:  Anindya Bhattacharya; Yan Cui
Journal:  J Biosci       Date:  2015-10       Impact factor: 1.826

Review 2.  Advanced technologies for genomic analysis in farm animals and its application for QTL mapping.

Authors:  Xiaoxiang Hu; Yu Gao; Chungang Feng; Qiuyue Liu; Xiaobo Wang; Zhuo Du; Qingsong Wang; Ning Li
Journal:  Genetica       Date:  2008-12-18       Impact factor: 1.082

3.  Associations between NBS1 polymorphisms, haplotypes and smoking-related cancers.

Authors:  Sungshim L Park; Delara Bastani; Binh Y Goldstein; Shen-Chih Chang; Wendy Cozen; Lin Cai; Carlos Cordon-Cardo; Baoguo Ding; Sander Greenland; Na He; Shehnaz K Hussain; Qingwu Jiang; Yuan-Chin A Lee; Simin Liu; Ming-Lan Lu; Thomas M Mack; Jenny T Mao; Hal Morgenstern; Li-Na Mu; Sam S Oh; Allan Pantuck; Jeanette C Papp; Jianyu Rao; Victor E Reuter; Donald P Tashkin; Hua Wang; Nai-Chieh Y You; Shun-Zhang Yu; Jin-Kou Zhao; Zuo-Feng Zhang
Journal:  Carcinogenesis       Date:  2010-05-17       Impact factor: 4.944

Review 4.  Functional genetic variation of human miRNAs and phenotypic consequences.

Authors:  Christelle Borel; Stylianos E Antonarakis
Journal:  Mamm Genome       Date:  2008-09-12       Impact factor: 2.957

Review 5.  MicroRNA-206: the skeletal muscle-specific myomiR.

Authors:  John J McCarthy
Journal:  Biochim Biophys Acta       Date:  2008-03-12

Review 6.  Emerging roles of microRNAs as molecular switches in the integrated circuit of the cancer cell.

Authors:  Georgia Sotiropoulou; Georgios Pampalakis; Evi Lianidou; Zissimos Mourelatos
Journal:  RNA       Date:  2009-06-26       Impact factor: 4.942

7.  Lumbar disc degeneration is linked to a carbohydrate sulfotransferase 3 variant.

Authors:  You-Qiang Song; Tatsuki Karasugi; Kenneth M C Cheung; Kazuhiro Chiba; Daniel W H Ho; Atsushi Miyake; Patrick Y P Kao; Kit Ling Sze; Anita Yee; Atsushi Takahashi; Yoshiharu Kawaguchi; Yasuo Mikami; Morio Matsumoto; Daisuke Togawa; Masahiro Kanayama; Dongquan Shi; Jin Dai; Qing Jiang; Chengai Wu; Wei Tian; Na Wang; John C Y Leong; Keith D K Luk; Shea-ping Yip; Stacey S Cherny; Junwen Wang; Stefan Mundlos; Anthi Kelempisioti; Pasi J Eskola; Minna Männikkö; Pirkka Mäkelä; Jaro Karppinen; Marjo-Riitta Järvelin; Paul F O'Reilly; Michiaki Kubo; Tomoatsu Kimura; Toshikazu Kubo; Yoshiaki Toyama; Hiroshi Mizuta; Kathryn S E Cheah; Tatsuhiko Tsunoda; Pak-Chung Sham; Shiro Ikegawa; Danny Chan
Journal:  J Clin Invest       Date:  2013-11       Impact factor: 14.808

8.  Identification of a functional SNP in the 3'UTR of CXCR2 that is associated with reduced risk of lung cancer.

Authors:  Bríd M Ryan; Ana I Robles; Andrew C McClary; Majda Haznadar; Elise D Bowman; Sharon R Pine; Derek Brown; Mohammed Khan; Kouya Shiraishi; Takashi Kohno; Hirokazu Okayama; Ramakrishna Modali; Jun Yokota; Curtis C Harris
Journal:  Cancer Res       Date:  2014-12-05       Impact factor: 12.701

Review 9.  MicroRNA polymorphisms: the future of pharmacogenomics, molecular epidemiology and individualized medicine.

Authors:  Prasun J Mishra; Joseph R Bertino
Journal:  Pharmacogenomics       Date:  2009-03       Impact factor: 2.533

10.  Gene expression in the mouse eye: an online resource for genetics using 103 strains of mice.

Authors:  Eldon E Geisert; Lu Lu; Natalie E Freeman-Anderson; Justin P Templeton; Mohamed Nassr; Xusheng Wang; Weikuan Gu; Yan Jiao; Robert W Williams
Journal:  Mol Vis       Date:  2009-08-31       Impact factor: 2.367

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.