Xibo Sun1, Leiming Cheng2, Jinyang Liu3,4, Cuinan Xie3,4, Jiasheng Yang5, Fu Li6.
Abstract
Long non-coding RNAs (lncRNAs) have attracted wide attention because of their close associations with many key biological activities. Although the precise functions of most lncRNAs are unknown, studies show that lncRNAs usually exert their biological functions by interacting with the corresponding proteins. Experimental validation of interactions between lncRNAs and proteins is costly and time-consuming. In this study, we developed a weighted graph-regularized matrix factorization (LPI-WGRMF) method to find unobserved lncRNA-protein interactions (LPIs) based on an lncRNA similarity matrix, a protein similarity matrix, and known LPIs. We compared the proposed LPI-WGRMF method with five classical LPI prediction methods: LPBNI, LPI-IBNRA, LPIHN, RWR, and collaborative filtering (CF). The results show that LPI-WGRMF performs well, obtaining an AUC of 0.9012 and an AUPR of 0.7324. The case study showed that SFPQ, SNHG3, and PRPF31 may associate with Q9NUL5, Q9NUL5, and Q9UKV8, respectively, with the highest linking probabilities; these predictions need further experimental validation.
Keywords: PRPF31; SFPQ; SNHG3; lncRNA similarity; lncRNA–protein interaction; protein similarity; weighted graph-regularized matrix factorization
Year: 2021 PMID: 34335693 PMCID: PMC8322775 DOI: 10.3389/fgene.2021.690096
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
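The abstract names the method but does not give its objective function, so the following is only a minimal sketch of what a weighted graph-regularized matrix factorization could look like, not the authors' implementation. It assumes a loss of the form 0.5·‖√W ⊙ (Y − UVᵀ)‖² + 0.5·λ(‖U‖² + ‖V‖²) + 0.5·α·tr(UᵀL_l U) + 0.5·β·tr(VᵀL_p V), minimized by plain gradient descent. The function names, the weight matrix `W`, the hyperparameters, and the cosine similarities used as toy input are all assumptions introduced for illustration.

```python
import numpy as np

def cosine_similarity(A):
    """Row-wise cosine similarity (a toy stand-in for real similarity data)."""
    norms = np.linalg.norm(A, axis=1, keepdims=True)
    return (A @ A.T) / (norms @ norms.T)

def laplacian(S):
    """Unnormalized graph Laplacian L = D - S of a symmetric similarity matrix."""
    return np.diag(S.sum(axis=1)) - S

def wgrmf_sketch(Y, S_l, S_p, W=None, k=2, lam=0.1, alpha=0.1, beta=0.1,
                 lr=0.01, n_iter=2000, seed=0):
    """Hypothetical weighted graph-regularized factorization Y ~ U @ V.T.

    Y   : (n_lncRNA, n_protein) known interaction matrix (1 = known LPI)
    S_l : (n_lncRNA, n_lncRNA) lncRNA similarity matrix
    S_p : (n_protein, n_protein) protein similarity matrix
    W   : optional per-entry weights (assumed; defaults to all ones)
    """
    rng = np.random.default_rng(seed)
    n, m = Y.shape
    W = np.ones((n, m)) if W is None else W
    U = 0.1 * rng.standard_normal((n, k))
    V = 0.1 * rng.standard_normal((m, k))
    L_l, L_p = laplacian(S_l), laplacian(S_p)
    for _ in range(n_iter):
        R = W * (U @ V.T - Y)                        # weighted residual
        grad_U = R @ V + lam * U + alpha * (L_l @ U)
        grad_V = R.T @ U + lam * V + beta * (L_p @ V)
        U -= lr * grad_U
        V -= lr * grad_V
    return U @ V.T                                   # predicted interaction scores

# Toy data: 4 lncRNAs x 3 proteins (made up, not from the paper).
Y = np.array([[1, 0, 0],
              [1, 1, 0],
              [0, 0, 1],
              [0, 1, 1]], dtype=float)
S_l = cosine_similarity(Y)      # assumed lncRNA similarity
S_p = cosine_similarity(Y.T)    # assumed protein similarity
scores = wgrmf_sketch(Y, S_l, S_p)
print(np.round(scores, 2))
```

In such a setup, entries of `scores` at positions where `Y` is 0 would be read as linking probabilities for unobserved lncRNA-protein pairs; the graph terms pull similar lncRNAs (and similar proteins) toward similar latent vectors.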
The performance of six LPI prediction methods.
| Methods | Precision | Recall | Accuracy | F1-score | AUC | AUPR |
| LPBNI | 0.3794 | 0.4037 | 0.9573 | 0.3876 | 0.8569 | 0.3302 |
| LPI-IBNRA | 0.5093 | 0.4165 | | 0.4521 | 0.8718 | 0.4351 |
| LPIHN | 0.4122 | 0.2800 | 0.9412 | 0.3324 | 0.8451 | 0.2299 |
| RWR | 0.3617 | 0.3521 | 0.9531 | 0.3543 | 0.8134 | 0.2827 |
| CF | 0.3033 | 0.2949 | 0.9488 | 0.2965 | 0.7686 | 0.2357 |
| LPI-WGRMF | | | 0.8906 | | 0.9012 | 0.7324 |
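For readers who want to see how the six reported metrics relate to each other, here is a small sketch using their standard definitions. It is an illustration only: the labels and scores are made up, the 0.5 threshold is an assumption, and average precision is used as one common estimator of AUPR (the paper may compute the area differently).

```python
import numpy as np

def confusion_metrics(y_true, y_pred):
    """Precision, recall, accuracy, and F1 from binary labels and predictions."""
    y_true = np.asarray(y_true, dtype=bool)
    y_pred = np.asarray(y_pred, dtype=bool)
    tp = np.sum(y_true & y_pred)
    fp = np.sum(~y_true & y_pred)
    fn = np.sum(y_true & ~y_pred)
    tn = np.sum(~y_true & ~y_pred)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    accuracy = (tp + tn) / y_true.size
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, accuracy, f1

def auc_score(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count half."""
    y_true = np.asarray(y_true, dtype=bool)
    scores = np.asarray(scores, dtype=float)
    diff = scores[y_true][:, None] - scores[~y_true][None, :]
    return (np.sum(diff > 0) + 0.5 * np.sum(diff == 0)) / diff.size

def average_precision(y_true, scores):
    """Mean of precision@k over the ranks k at which a positive appears."""
    y_true = np.asarray(y_true, dtype=bool)
    order = np.argsort(-np.asarray(scores, dtype=float), kind="stable")
    hits = y_true[order]
    precision_at_k = np.cumsum(hits) / np.arange(1, hits.size + 1)
    return precision_at_k[hits].mean()

# Made-up example: 3 positives among 8 candidate pairs.
y_true = np.array([1, 0, 1, 0, 0, 0, 1, 0])
y_score = np.array([0.9, 0.8, 0.7, 0.3, 0.2, 0.1, 0.4, 0.05])
y_pred = y_score >= 0.5          # assumed decision threshold

precision, recall, accuracy, f1 = confusion_metrics(y_true, y_pred)
auc = auc_score(y_true, y_score)
aupr = average_precision(y_true, y_score)
print(precision, recall, accuracy, f1, auc, aupr)
```

Accuracy counts true negatives, so it can stay high on data where negatives dominate even when precision and recall are modest; AUC and AUPR are threshold-free and depend only on the ranking of the scores.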
FIGURE 1. The AUC values of six LPI prediction methods.
FIGURE 2. The AUPR values of six LPI prediction methods.
The top five proteins associated with the four lncRNAs.
| lncRNAs | Proteins | Confirmed | LPI-WGRMF | LPBNI | LPI-IBNRA | LPIHN | RWR | CF |
| MTND2P28 | Q9NUL5 | NO | 1 | 1 | 4 | 2 | 7 | 2 |
| | O00425 | YES | 2 | 2 | 2 | 1 | 1 | 1 |
| | P26599 | YES | 3 | 8 | 10 | 11 | 4 | 11 |
| | Q07955 | YES | 4 | 16 | 17 | 18 | 5 | 15 |
| | Q9Y6M1 | YES | 5 | 3 | 1 | 3 | 2 | 3 |
| RPI001_1001892 | Q9NUL5 | YES | 1 | 1 | 1 | 1 | 1 | 1 |
| | Q07955 | YES | 2 | 9 | 13 | 15 | 8 | 13 |
| | P35637 | YES | 3 | 5 | 5 | 5 | 4 | 5 |
| | P26599 | YES | 4 | 15 | 17 | 16 | 9 | 16 |
| | Q9NZI8 | YES | 5 | 4 | 4 | 3 | 5 | 3 |
| RPI001_1002045 | Q9NUL5 | YES | 1 | 1 | 1 | 1 | 1 | 1 |
| | P35637 | YES | 2 | 4 | 2 | 5 | 4 | 5 |
| | Q01844 | YES | 3 | 6 | 6 | 6 | 6 | 6 |
| | P31483 | YES | 4 | 9 | 10 | 8 | 7 | 9 |
| | Q9Y6M1 | YES | 5 | 3 | 4 | 3 | 3 | 3 |
| RP11-169K16.7 | Q9UKV8 | YES | 1 | 1 | 1 | 1 | 2 | 1 |
| | Q9H9G7 | YES | 2 | 2 | 4 | 2 | 1 | 7 |
| | Q9UL18 | YES | 3 | 7 | 3 | 4 | 4 | 10 |
| | Q9HCK5 | YES | 4 | 6 | 2 | 3 | 3 | 9 |
| | Q9NUL5 | YES | 5 | 5 | 5 | 6 | 5 | 2 |
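The method columns in this table hold ranks: for a given lncRNA, each method orders all candidate proteins by its predicted score, and the table lists the position of each protein in that ordering. A minimal sketch of how such ranks could be derived is shown below; the protein labels and scores are placeholders invented for illustration, not values from the study, and the helper names are hypothetical.

```python
import numpy as np

def top_k_proteins(scores, protein_ids, k=5):
    """Return [(rank, protein_id), ...] for the k highest-scoring proteins."""
    order = np.argsort(-np.asarray(scores, dtype=float), kind="stable")
    return [(rank, protein_ids[i]) for rank, i in enumerate(order[:k], start=1)]

def rank_of(scores, protein_ids, target):
    """Rank (1 = highest score) that one method assigns to a target protein."""
    order = np.argsort(-np.asarray(scores, dtype=float), kind="stable")
    ranked = [protein_ids[i] for i in order]
    return ranked.index(target) + 1

# Placeholder proteins and scores for a single lncRNA under two methods.
protein_ids = ["prot_A", "prot_B", "prot_C", "prot_D", "prot_E", "prot_F"]
method_1 = [0.12, 0.87, 0.45, 0.91, 0.30, 0.66]
method_2 = [0.50, 0.20, 0.90, 0.40, 0.10, 0.30]

top5 = top_k_proteins(method_1, protein_ids)
print(top5)

# Compare where the second method places the first method's top five.
for rank, pid in top5:
    print(pid, rank, rank_of(method_2, protein_ids, pid))
```

Each row of the table corresponds to one iteration of the final loop: the LPI-WGRMF column is the reference ranking (1 to 5) and the remaining columns give the same protein's rank under the other methods.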