| Literature DB >> 30999843 |
Giovanni Perconti1, Patrizia Rubino1, Flavia Contino2, Serena Bivona2,3, Giorgio Bertolazzi4, Michele Tumminello4, Salvatore Feo2,3, Agata Giallongo5, Claudia Coronnello6,7.
Abstract
BACKGROUND: MicroRNAs (miRNAs) are small non-coding RNA molecules mediating the translational repression and degradation of target mRNAs in the cell. Mature miRNAs are used as a template by the RNA-induced silencing complex (RISC) to recognize the complementary mRNAs to be regulated. To discern further RISC functions, we analyzed the activities of two RISC proteins, AGO2 and GW182, in the MCF-7 human breast cancer cell line.Entities:
Keywords: RIP-Chip data analysis; RISC proteins AGO2 and GW182; microRNA regulatory activity; microRNA target prediction
Mesh:
Substances:
Year: 2019 PMID: 30999843 PMCID: PMC6471694 DOI: 10.1186/s12859-019-2683-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1RIP-Chip experiments overview. a and c Western Blot analysis of proteins immunoprecipitated and co-immunoprecipitated with anti-AGO2 or anti-GW182 antibody (IP). IgGs in a) are the negative controls. IN and FT made up 1% of the cytoplasmic lysate used for each IP sample. GW182 was specifically co-immunoprecipitated with AGO2 (c, left panel), and AGO2 was specifically co-immunoprecipitated with GW182 (c, right panel). b Enrichment analysis of seven highly expressed miRNAs in anti-AGO2 and anti-GW182 IP compared to IgG-IP controls. d Average Linkage Cluster analysis of mRNA and miRNA expression profiles of IP, IN and FT samples from three independent experiments; distance is computed as 1- Correlation (Pearson). AGO2-IP and GW182-IP mRNA expression profiles are highlighted in blue and green, respectively. In mRNA expression clustering, we considered all the 16,323 genes with a detected expression level in the samples considered. In miRNA expression clustering, we considered 508 miRNAs with a detected expression level in at least one sample
Definition of variables used to model miRNA activity
| Variable name | Formula |
|
|---|---|---|
| F1 | ∑ | number in 3’UTR |
| F2 | ∑ | number in 3’UTR |
| F3 | ∑ | number in 3’UTR |
| F4 | ∑ | number in 3’UTR |
| F1d | ∑ | density in 3’UTR |
| F2d | ∑ | density in 3’UTR |
| F3d | ∑ | density in 3’UTR |
| F4d | ∑ | density in 3’UTR |
| F5 | ∑ | number in coding region |
| F6 | ∑ | number in coding region |
| F7 | ∑ | number in coding region |
| F8 | ∑ | number in coding region |
| F5d | ∑ | density in coding region |
| F6d | ∑ | density in coding region |
| F7d | ∑ | density in coding region |
| F8d | ∑ | density in coding region |
| F9 | Not applicable | |
| L1 | length of 3’UTR | Not applicable |
| L2 | length of coding region | Not applicable |
The column provides details about the miRNA predicted binding sites used to compute BS (the binding sites matrix). For each variable, the Formula defines the values associated to each mRNA
Fig. 2Behavior overview of variables listed in Table 1. a Heatmap representation of the correlation block matrix of the variables computed with AGO2_IN1 miRNA and mRNA expression profiles. The reported numbers are the correlation values, expressed in the range [− 100:100]. b ROC-AUC values obtained by classifying enriched/underrepresented genes associated with the variables computed with each IN expression profile. c Wilcoxon test p-values (log10) obtained by comparing the variable values associated with the enriched/underrepresented gene sets. In both b) and c), the variables computed with the three AGO2 IN profiles were used to distinguish enriched and underrepresented genes in AGO2-IP vs FT. The variables computed with the three GW182 IN profiles were used to distinguish enriched and underrepresented genes in GW182-IP vs FT.
Fig. 3Graphic representation of selected features values associated to enriched and underrepresented genes. Empirical cumulative distribution function (ECDF) of F6, F4d, F8, L1 and L2 variables computed for enriched (UP) and underrepresented (LOW) genes in AGO2 IP vs FT and GW182 IP vs FT analyses. The reported p-values were obtained by performing a Wilcoxon-test comparing the values assumed by the selected set of genes with the values assumed by all the genes (16,363, green lines)
Fig. 4Graphic representation of the effect of miRNA expression profile shuffling. Each boxplot represents the AUC values obtained with 1000 simulated miRNA expression profiles. The percentage on the right of each boxplot refers to the number of times an AUC value was greater than the AUC obtained with the original miRNA expression profile (red vertical line). a Performance of simulated F6 variables in distinguishing AGO2 enriched/underrepresented genes. b Performance of simulated F6 variables in distinguishing GW182 enriched/underrepresented genes. c Performance of simulated F4d variables in distinguishing AGO2 enriched/underrepresented genes. d Performance of simulated SVM models (F6&F4d variables) in distinguishing AGO2 enriched/underrepresented genes. e Performance of simulated F8 variables in distinguishing AGO2 enriched/underrepresented genes. f Performance of simulated F8 variables in distinguishing GW182enriched/underrepresented genes
Fig. 5Support Vector Machine models performance summary. AUC values of SVM models trained with any pair of variables defined in Table 1, used to classify enriched/underrepresented genes in AGO2-IP vs FT comparison. Variables were computed by using the AGO2_IN1 expression profiles. Values are in the range [0:100]. Values in the diagonal refer to single variable performance. The ROC plot at bottom left represents the results obtained with the best-performing SVM model (F6&F4d, black line) and with the two single variables, F6 (red line) and F4d (green line)