| Literature DB >> 26384425 |
Sophia Cammaerts1, Mojca Strazisar1, Jenne Dierckx2, Jurgen Del Favero3, Peter De Rijk1.
Abstract
Genetic variants in or near miRNA genes can have profound effects on miRNA expression and targeting. As user-friendly software for the impact prediction of miRNA variants on a large scale is still lacking, we created a tool called miRVaS. miRVaS automates this prediction by annotating the location of the variant relative to functional regions within the miRNA hairpin (seed, mature, loop, hairpin arm, flanks) and by annotating all predicted structural changes within the miRNA due to the variant. In addition, the tool defines the most important region that is predicted to have structural changes and calculates a conservation score that is indicative of the reliability of the structure prediction. The output is presented in a tab-separated file, which enables fast screening, and in an html file, which allows visual comparison between wild-type and variant structures. All separate images are provided for downstream use. Finally, we tested two different approaches on a small test set of published functionally validated genetic variants for their capacity to predict the impact of variants on miRNA expression.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26384425 PMCID: PMC4756848 DOI: 10.1093/nar/gkv921
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Visual output of miRVaS. The left figure in each panel represents the wild-type miRNA, the right figure represents the same miRNA with the variant. Orange regions: seed sequence, magenta: mature miRNA, dark blue: terminal loop, cyan: hairpin. The variant is highlighted in red in the wild-type and the variant miRNA, bases with structural changes are highlighted in black. (A) Structure prediction for a SNP within the seed sequence. Up- and downstream flanks used for the prediction: 20 nt/20 nt. The SNP enlarges an interior loop. (B) Structure prediction for a deletion of one base within the flanking region. Up- and downstream flanks used for the prediction: 30nt/40nt. The deletion does not change the secondary structure drastically. (C) Structure prediction for an insertion of two bases in the flanking region. Up- and downstream flanks used for the prediction: 75 nt/75 nt. This insertion causes structural changes in the lower hairpin arms and in the flanking regions.
Figure 2.Annotation scheme for variant location and structural changes. Annotations are made in the format: region (reference ± start:stop). A detailed description of the annotation can be found in the Results section. (A) Schematic overview of the miRNA hairpin including flanks. Functional regions used for annotation are highlighted according to the colour scheme used by miRVaS. The length of the flanking region is determined by the user. (B) A schematic view of structural and sequence changes in a miRNA gene with two mature sequences (5p and 3p). Black bars depict regions with structural change, sequence changes are highlighted in red. Regions are denoted in the upper part of the figure and correspond to regions in Panel A. Reference regions are shown below the hairpin (a: arm, m: mature, l: loop). (C) A schematic view of structural and sequence changes in a miRNA gene with one mature sequence (5p). (D) Annotation of examples of sequence and structural changes that are depicted in Panels B and C: (i) a structural change in the 5p flanking region covering bases 50–40 upstream of the hairpin arm; (ii) a sequence change in the lower 5p hairpin arm, 10 bases upstream of the 5p mature, (iii) a structural change in the upper 5p hairpin arm, 4 bases downstream of the mature 5p; (iv) a structural change in the loop region, 6 bases downstream of the upper 5p arm; (v) a structural change in the upper 3p hairpin arm, 5 bases upstream of the mature 3p; (vi) a structural change spanning the end of the mature 3p (bases 19–22 downstream of the upper 3p arm, base 22 is the last base of the 3p mature sequence) and the beginning of the lower 3p arm (1st–3rd base downstream of the mature 3p); (vii) a sequence change outside the given flank region, 88 bases downstream of the lower 3p hairpin arm; (viii) a structural change within the seed sequence, 3rd base downstream of the lower 5p hairpin arm; (ix) a sequence change within the 5p mature sequence, 16 bases downstream of the lower 5p hairpin arm; (x) a structural change within the 3p hairpin arm, 15 bases downstream of the calculated loop region.
miRVaS tabular output: description of the different fields
| chromosome | chr19 | Genomic location of the variant in zero based half open coordinates, type of variant, reference and alternative allele. Type can be snp (SNP), ins (insertion) or del (deletion). |
| begin | 52196527 | |
| end | 52196528 | |
| type | snp | |
| ref | G | |
| alt | T | |
| mir_location | mature5p(a+8) | Location of the variant relative to the functional regions within the tested miRNA. |
| Example: the variant is located in the 5p mature sequence, 8 nt downstream of the lower 5p hairpin arm. | ||
| mir_name | hsa-mir-125a | Tested miRNA. |
| MFE_highest_impact* | seed | Highest region in which a structural change is present for the MFE structure. Order: seed > mature > arm > loop > flank. |
| Example: the highest region with structural changes is the seed region. | ||
| MFE_impact* | mature5p(a+2,5,8:9)seed&mature3p(a+14:15,19:20) | Describes all structural changes in the variant structure compared to the reference structure for the MFE structure. |
| Example: structural changes are within the 5p mature and seed sequence (positions 2, 5 and 8–9, counting from the lower 5p hairpin arm) and within the 3p mature sequence (positions 14–15 and 19–20, counting from the upper 3p hairpin arm). | ||
| ref_mfefreq | 0.000285697 | Frequency of the MFE structure for the reference allele. |
| var_mfefreq | 0.0000634247 | Frequency of the MFE structure for the variant allele. |
| MFE_conservation* | conserved | Whether the hairpin structure is conserved with given flank sizes compared to when using no flanks. When not conserved, the number of bases with changed structure is given. |
| Example: the structure of the hairpin is conserved. | ||
| ref_MFE_deltaG* | −91.2 | Free energy of the MFE structure prediction for the reference allele (kcal/mol). |
| var_MFE_deltaG* | −85.5 | Free energy of the MFE structure prediction for the variant allele (kcal/mol). |
As an example, the output for a variant within MIR125 is described (for up- and downstream flanks of 100 nt). Rows and columns are transposed for clarity. *Columns are also calculated for centroid and MEA predictions (not shown).
Performance of miRVaS determined by the test set for the ΔΔG approach with two different ΔΔG strategies (|ΔΔG| > 0 or density interval)
| |ΔΔG| > 0 | Density interval | |
|---|---|---|
| TP | 25 | 11 |
| FP | 11 | 0 |
| TN | 3 | 14 |
| FN | 3 | 17 |
| Sensitivity (%) | 89 | 39 |
| Specificity (%) | 21 | 100 |
| 0.3825 | 0.0075 |
P-value: two-tailed P-value of Fisher's exact test.
Performance of miRVaS determined by the test set for the hairpin approach
| CEN | MEA | MFE | |
|---|---|---|---|
| TP | 19 | 19 | 20 |
| FP | 2 | 2 | 2 |
| TN | 12 | 12 | 12 |
| FN | 9 | 9 | 8 |
| Sensitivity (%) | 68 | 68 | 71 |
| Specificity (%) | 86 | 86 | 86 |
| 0.0025 | 0.0025 | 0.0008 |
CEN: centroid, MEA: maximal expected accuracy, MFE: minimal free energy.