Literature DB >> 31057602

LPI-IBNRA: Long Non-coding RNA-Protein Interaction Prediction Based on Improved Bipartite Network Recommender Algorithm.

Guobo Xie1, Cuiming Wu1, Yuping Sun1, Zhiliang Fan1, Jianghui Liu2.   

Abstract

According to the latest research, lncRNAs (long non-coding RNAs) play a broad and important role in various biological processes by interacting with proteins. However, identifying whether proteins interact with a specific l<span class="Gene">ncRNA through biological experimental methods is difficult, costly, and time-consuming. Thus, many bioinformatics computational methods have been proposed to predict lncRNA-protein interactions. In this paper, we proposed a novel approach called Long non-coding RNA-Protein Interaction Prediction based on Improved Bipartite Network Recommender Algorithm (LPI-IBNRA). In the proposed method, we implemented a two-round resource allocation and eliminated the second-order correlations appropriately on the bipartite network. Experimental results illustrate that LPI-IBNRA outperforms five previous methods, with the AUC values of 0.8932 in leave-one-out cross validation (LOOCV) and 0.8819 ± 0.0052 in 10-fold cross validation, respectively. In addition, case studies on four lncRNAs were carried out to show the predictive power of LPI-IBNRA.

Entities:  

Keywords:  bipartite network; interaction prediction; lncRNA; protein; second-order correlation elimination

Year:  2019        PMID: 31057602      PMCID: PMC6482170          DOI: 10.3389/fgene.2019.00343

Source DB:  PubMed          Journal:  Front Genet        ISSN: 1664-8021            Impact factor:   4.599


1. Introduction

L<span class="Gene">ncRNA, a class of <span class="Gene">ncRNAs (non-coding RNAs) of more than 200 nucleotides, that do not encode proteins, has gained increasing scientific interest in recent years (Jorge et al., 2012; Hajjari et al., 2016). Only 2% of RNAs in the human transcriptome can encode proteins while others, called ncRNAs, cannot. Note that most ncRNAs are lncRNAs. Compared to other ncRNAs, lncRNAs are longer and have complex secondary or higher-order structures (Bonasio and Shiekhattar, 2014), and their genes often have independent regulatory elements such as promoters and enhancers (Ulitsky and Bartel, 2013). There is increasing evidence that lncRNAs are related to the regulation of gene expression levels such as epigenetic regulation, transcriptional regulation, and multiple levels of post-transcriptional regulation (Sarah and Jeff, 2013), but only a few functions and mechanisms of lncRNA have been studied (Maarabouni et al., 2008; Lee et al., 2016). Moreover, interactions of lncRNAs with other molecules also have become hot spots in oncology research over the past years. The studies found that an important way for lncRNAs to function is by interacting with proteins (Khalil and Rinn, 2011). LncRNAs play a broad and important regulatory role in various processes such as tumorigenesis, cancer progression and metastasis by interacting with proteins. Thus, the prediction and identification of lncRNA-protein interactions can further reveal lncRNA-related functions and is beneficial for the study on the pathogenesis of complex diseases at the molecular level (Faghihi et al., 2008; Chen and Yan, 2013; Cui et al., 2013; Li et al., 2013; Chen et al., 2016a,c, 2017b, 2018c). Numerous biological experimental methods were exploited to confirm protein-related RNAs (Ule et al., 2005; Galgano and Gerber, 2011; Zambelli and Pavesi, 2015). However, such experimental methods are laborious, time-consuming, and costly (Huang et al., 2012). Recently, various computational methods have been proposed to address the challenges in bioinformatics (He et al., 2018a,b; Zou et al., 2018), such as l<span class="Gene">ncRNA-protein (Hu et al., 2017; Shen et al., 2019a,b), <span class="Disease">miRNA-disease (Chen and Huang, 2017; Chen et al., 2018a,b,d,e,f; Jiang et al., 2018a,b; Xie et al., 2018), drug-target (Chen et al., 2016b; Wang et al., 2017; Wu et al., 2018) and microbe-disease associations predictions (Chen et al., 2017a; Peng et al., 2018). The methods for inferring lncRNA-protein associations can roughly be classified into two types: the machine learning methods and the network-based methods. The so-called machine learning methods usually use the biological features of lncRNAs and proteins, and then employ a supervised classifier to identify whether proteins have potential interactions with a specific lncRNA (Zhan et al., 2018). For example, Bellucci et al. (2011) proposed to utilize secondary structure, hydrogen bonding and van der Waals contributions for feature integration, which has a beneficial effect for inferring the binding propensity of protein and ncRNA. Protein and lncRNA sequence information is utilized in Muppirala et al. (2011), with the employment of a support vector machine (SVM) and random forest (RF). Suresh et al. (2015) proposed an SVM-based method named RPI-Pred, which uses high-order 3D structural features and sequences of the lncRNA and protein. Hu et al. (2018) developed a method called HLPI-Ensembl, adopting the ensemble strategy based on extreme gradient boosting (XGB), SVM and RF. However, the main drawback of these methods is the insufficiency of negative samples of lncRNA-protein interactions. The lack of negative samples may cause the unstable performance of the supervised classifier. Moreover, selecting appropriate features to predict lncRNA-protein interactions is not an easy task. Apart from the aforementioned methods, there are other approaches for potential l<span class="Gene">ncRNA-protein interaction prediction, with the employment of network analysis algorithms. For instance, Li et al. (2015) presented a method called <span class="Chemical">LPIHN, which constructs a heterogeneous network, and implements a random walk with restart on the heterogeneous network. In order to improve prediction performance, some recent network-based methods use recommender algorithms to infer lncRNA-protein interactions. For example, Ge et al. (2016) proposed a method called LPBNI, which only uses known lncRNA-protein interactions and implements the two-step propagation on a bipartite network. Zhao et al. (2018b) introduced an approach based on the bipartite network called LPI-BNPRA, which infers lncRNA-protein interactions by constructing bias ratings for lncRNAs and proteins, using agglomerative hierarchical clustering. By implementing two-round resource allocation on bipartite networks, these approaches achieved impressive results. But predictive validity of these investigations remains insufficient due to the existence of high-order correlations, which might have a negative effect on the lncRNA-protein interaction prediction. For example, the proteins directly correlated by the same lncRNA, could also be indirectly correlated by other media proteins, resulting in correlation redundancy. Properly eliminating the redundancy induced by the second-order correlation might further enhance the accuracy of the prediction. This inspired us to develop an effective network-based recommender algorithm for lncRNA-protein interaction prediction. Motivated by the effectiveness of high-order correlation elimination in the study of Qiu et al. (2014), we propose a novel method named <span class="Chemical">LPI-IBNRA for inferring new l<span class="Gene">ncRNA-protein interactions. LPI-IBNRA uses known lncRNA-protein and protein-protein interactions, and lncRNA expression similarity, and then eliminates second-order correlations on the bipartite network appropriately to enhance the prediction accuracy. Compared with previous machine learning methods, our method does not require negative samples. Compared with many existing network-based methods (Ge et al., 2016; Zhao et al., 2018b), our method yields comparable or even better results due to second-order correlation elimination. Both 10-fold cross validation and LOOCV were carried out to assess the prediction ability of the proposed method. Experimental results illustrated that LPI-IBNRA outperformed five other methods by achieving higher AUC values. In addition, case studies on four lncRNAs further demonstrated the predictive power of LPI-IBNRA. Therefore, we conclude that LPI-IBNRA is feasible and effective for inferring potential lncRNA-protein interactions.

2. Materials and Methods

2.1. Human LncRNA-Protein Interactions

The known <span class="Gene">ncRNA-protein interaction dataset was downloaded from the NPInter v2.0 database (Yuan et al., 2014). We limited the organism to “<span class="Species">Homo sapiens” and the type of ncRNAs to “NONCODE”, in order to filter ncRNAs and their interacting proteins. The lncRNAs were further filtered from these ncRNAs, through a human lncRNA dataset from the NONCODE 4.0 database (Xie et al., 2014). We deleted duplicate interactions. Considering the sample requirement of LOOCV, we removed the lncRNAs and proteins that have only one interaction. We then obtained 4796 distinct experimentally confirmed lncRNA-protein interactions, containing 26 proteins and 1105 lncRNAs. We denoted np as the number of known proteins, nl as the number of known lncRNAs, and matrix I ∈ ℝ as the adjacency matrix of protein-lncRNA interactions. The interaction between protein p and lncRNA l could be denoted as follows:

2.2. Protein-Protein Interaction Score Matrix and Similarity Matrix

Protein-protein interactions (PPI) were obtained from the STRING 9.1 database (Franceschini et al., 2013), which included weighted protein-protein interactions through co-expression data, genomic context predictions, automated text mining, and high-throughput lab experiments. We then deleted the redundant PPI data, and obtained 214 PPI data, and the corresponding interaction scores based on the known l<span class="Gene">ncRNA-protein dataset. The symmetric matrix AP was denoted as an interaction score matrix based on PPI data, where AP(p, p) is the interaction score between proteins p and p. AP could then be standardized as follows: where R(p) is the sum of the elements in i-row of AP. Considering the hypothesis that similar proteins tend to exhibit a similar interaction and non-interaction pattern with l<span class="Gene">ncRNAs (Zheng et al., 2017), we calculated the protein similarity with the utilization of Gaussian kernel interaction profiles. We denoted X(p) as the ith row vector of matrix I, in which the nonzero values occur at the indices where the corresponding l<span class="Gene">ncRNA have one interaction with a protein p. Then the similarity between proteins p and p based on Gaussian kernel interaction profiles could be calculated as follows: where the adjustment coefficient β for the kernel bandwidth is defined as follows:

2.3. LncRNA-LncRNA Similarity Matrix

L<span class="Gene">ncRNA expression profiles were downloaded from the NONCODE 4.0 database (Xie et al., 2014). After removing the superfluous data, we obtained the expression profiles of 1,105 l<span class="Gene">ncRNAs in 24 human tissues or cell types. Then the Pearson correlation coefficient (PPC) was applied for the calculation of lncRNA expression similarity between each pair of lncRNA expression profiles (Wang et al., 2010; Ganegoda et al., 2013; Tang et al., 2014). We denoted E(i) = {e, e, …, e} and E(j) = {e, e, …, e} as the expression profiles of l and l. The expression similarity AL(l, l) between lncRNAs l and l was calculated as follows: where AL(l, l) denotes the absolute value of PCC between l and l, cov(E(i), E(j)) is the covariance between E(i) and E(j), σ and σ are standard deviations of E(i) and E(j), respectively. We denoted X(p) as the ith column vector of matrix I, in which the nonzero values occur at the indices where the corresponding protein has one interaction with the l<span class="Gene">ncRNA l. Similar to the aforementioned protein case, the Gaussian interaction profile kernel similarity for l<span class="Gene">ncRNAs could be computed as follows: where

2.4. Integrated Similarity Matrix for Proteins and LncRNAs

Note that the Gaussian interaction profile kernel similarity is an association information-based measurement, which can be utilized to complement protein-protein interactions and l<span class="Gene">ncRNA expression similarity. Motivated by the study of Chen (2015), we constructed the integrated protein similarity matrix Sim and integrated the l<span class="Gene">ncRNA similarity matrix Sim as follows:

2.5. LPI-IBNRA

The flow chart of <span class="Chemical">LPI-IBNRA is shown in Figure 1. At first, we denoted S ∈ ℝ as the resource score matrix based on protein similarity, S ∈ ℝ as the one based on l<span class="Gene">ncRNA similarity. These two matrices were computed as follows:
Figure 1

The flowchart of LPI-IBNRA method.

where represents the score between protein p and l<span class="Gene">ncRNA l based on protein similarity, and represents the score between protein p and l<span class="Gene">ncRNA l based on lncRNA similarity. The flowchart of <span class="Chemical">LPI-IBNRA method. Then the integrated resource score matrix was initialized as the weighted sum of S and S as follows: where parameter γ ∈ [0, 1] is a scalar controlling the relative contributions of protein similarity and l<span class="Gene">ncRNA similarity in S. Following the general setting, we set the parameter γ = 0.5 in this paper, making S and S equally weighted. The final score matrix can be obtained by updating the S column by column. In other words, the calculation process can be partitioned into nl runs, each of which corresponds to a specific l<span class="Gene">ncRNA. Thus, at the beginning of the kth run, the score for protein p interacting with the given l<span class="Gene">ncRNA l can be initialized as follows: Then the 1st-round of our allocation model is to allocate the score of the l<span class="Gene">ncRNA l from the protein p, which can be calculated as follows: where is obtained by a summing operation over all initial scores from l<span class="Gene">ncRNAs interacting with protein p. The score of l<span class="Gene">ncRNA l can be obtained by summing scores over all proteins connected with l: In the 2nd-round, resource scores were allocated in a similar way as the first round. The score allocated from the l<span class="Gene">ncRNA l to the protein p was calculated as follows: where is the sum of initial scores from all proteins interacting with l<span class="Gene">ncRNA l. The score of protein p was allocated from all l<span class="Gene">ncRNAs that interacted with p as follows: As described from Equation (13) to (17), we first initialized the score of protein p from the given l<span class="Gene">ncRNA l and then updated it by a two-round resource allocation. An example is given in Figure 2. We defined as the final resource score matrix, which can be represented as follows:
Figure 2

The basic idea of LPI-IBNRA. First, two resource score matrices which are computed based on protein similarity and lncRNA similarity, respectively, are combined to construct the initial integrated resource score network. Secondly, each protein gains its initial score from a specific lncRNA. Next, in two-round resource allocation, the score is allocated from proteins to lncRNAs, and then propagated back to proteins. Finally, the weight matrix is optimized by second-order correlation eliminations to obtain the final scores of proteins.

The basic idea of <span class="Chemical">LPI-IBNRA. First, two resource score matrices which are computed based on protein similarity and l<span class="Gene">ncRNA similarity, respectively, are combined to construct the initial integrated resource score network. Secondly, each protein gains its initial score from a specific lncRNA. Next, in two-round resource allocation, the score is allocated from proteins to lncRNAs, and then propagated back to proteins. Finally, the weight matrix is optimized by second-order correlation eliminations to obtain the final scores of proteins. S can also be computed in a vectorized form as: where is a column vector of S, is a column vector of S, and W ∈ ℝ is the weight matrix. Then Equation (17) can also be represented as: where In the l<span class="Gene">ncRNA-protein interaction network, the proteins interacting with the same l<span class="Gene">ncRNA are considered to be directly correlated, i.e., having the low-order correlation, while higher-order correlations between these proteins might also arise from indirect associations. Such high-order correlations might have a negative effect on the lncRNA-protein interaction prediction. Based on the studies of Zhou et al. (2009) and Liu et al. (2010), we eliminated second-order correlations in an appropriate way to further enhance the accuracy of the prediction: where the parameter α ∈ (−1, 0). The final score matrix for inferring potential l<span class="Gene">ncRNA-protein interactions can then be calculated as follows: After the calculations, we can recommend proteins to the given l<span class="Gene">ncRNA l in descending order by the kth column of .

2.6. Performance Evaluation

We evaluated the classification performance of the proposed <span class="Chemical">LPI-IBNRA method by applying two types of classification schemes, i.e., LOOCV and 10-fold cross validation. The performance of <span class="Chemical">LPI-IBNRA was evaluated in terms of several widely-used indicators, including precision (PRE), sensitivity (SEN), accuracy (ACC), F1 score, and Matthews correlation coefficient (MCC), expressed as follows: where TP, TN, FP, and FN count the number of true positives, true negatives, false positives, and false negatives, respectively. As a popular method for performance evaluation, the receiver operating characteristic (ROC) curve was also utilized in our experiments. The area under the ROC curve (AUC) = 1 indicates perfect performance, while AUC = 0.5 indicates random performance. The precision-recall curve (PR curve) and the area under the PR curve (AUPR) are also used to reduce the negative influence of false positive data on the method performance. The larger the AUC and AUPR is, the better performance the evaluated method has.

3. Results

3.1. Comparison With Other Methods

We used the aforementioned 4,796 known <span class="Species">human l<span class="Gene">ncRNA-protein interactions to carry out the above-mentioned two cross validation schemes. In each LOOCV trial, each known lncRNA-protein interaction was used as a test sample while the rest were used as training samples. To analyze the influence of parameter α on the performance of LPI-IBNRA, we applied LOOCV for the selection of parameter α. As shown in Figure 3A, the performance of LPI-IBNRA drops a lot when α is smaller than –0.70. When α is larger than –0.70, the performance of LPI-IBNRA decreases slightly. Thus, the parameter α is set to –0.70 due to the optimal performance.
Figure 3

(A)The AUC values of LPI-IBNRA method with different values of α. (B) ROC curves of lncRNA-protein interaction predictions by all compared methods in LOOCV. (C) Precision-recall curves of all compared methods. (D) ROC curves of lncRNA-protein interaction predictions by all compared methods in 10-fold cross validation.

(A)The AUC values of <span class="Chemical">LPI-IBNRA method with different values of α. (B) ROC curves of l<span class="Gene">ncRNA-protein interaction predictions by all compared methods in LOOCV. (C) Precision-recall curves of all compared methods. (D) ROC curves of lncRNA-protein interaction predictions by all compared methods in 10-fold cross validation. Five previous approaches were used for comparison in the experiments, including collaborative filtering (CF), random walk with restart (RWR), <span class="Chemical">LPBNI, <span class="Chemical">LPIHN, and LPI-BNPRA. LPBNI, LPIHN, and LPI-BNPRA are network-based methods that infer potential lncRNA-protein interactions, while CF and RWR have been used as benchmark methods in Ge et al. (2016) and Wen et al. (2017). RWR is often utilized as a powerful tool for network-based methods to forecast association (Zhao et al., 2018a,c; Zhu et al., 2018), while CF is a well-known recommender algorithm which can infer the information from similar neighborhoods (Fu et al., 2014; Zeng et al., 2017). In our experiments, RWR was implemented to make predictions based on the protein-protein similarity network, while a simple version of the CF algorithm was adopted to calculate the prediction scores between lncRNAs and proteins. Here, we reproduced these methods on the same dataset by ourselves. See Figures 3B,C and Table 1 for the results of LOOCV. We can see from Figure 3B that our proposed method achieved an AUC of 0.8932, which exhibited a considerable improvement over the five previous methods (i.e., 12.81% for CF, 10.71% for RWR, 1.56% for <span class="Chemical">LPBNI, 2.00% for <span class="Chemical">LPIHN and 3.39% for LPI-BNPRA). In addition, the comparison of these methods, in terms of precision vs. recall, is presented in Figure 3C. It can be seen that LPI-IBNRA almost achieved a higher precision than the other methods at every recall value. Moreover, LPI-IBNRA outperformed the other methods in terms of AUPR, PRE, SEN, ACC, F1 score and MCC, which is presented in Table 1. As shown in Figure 3D, in 10-fold cross validation, LPI-IBNRA achieved an AUC of 0.8819 ± 0.0052 and was superior to the comparison methods, including CF (0.7655 ± 0.0069), RWR (0.7800 ± 0.0076), LPBNI (0.8695 ± 0.0047), LPIHN (0.8591 ± 0.0044), and LPI-BNPRA (0.8413 ± 0.0351).
Table 1

Performance evaluation of all compared methods in LOOCV in terms of several widely-used indicators.

MethodsAUCAUPRPRESENACCF1-scoreMCC
LPI-IBNRA0.89320.70980.87780.35800.88450.52730.5152
CF0.76510.52490.79410.19220.85680.32250.3452
RWR0.78610.55000.83160.24600.86580.39490.4063
LPBNI0.87760.66240.85260.28820.87290.44740.4496
LPIHN0.87320.58920.75180.15220.85010.26420.2933
LPI-BNPRA0.85930.61480.82000.22710.86270.37010.3855
Performance evaluation of all compared methods in LOOCV in terms of several widely-used indicators. The aforementioned results indicate that in both LOOCV and 10-fold cross evaluation, <span class="Chemical">LPI-IBNRA outperforms other methods in terms of the AUC values. The outstanding performance of <span class="Chemical">LPI-IBNRA demonstrates its stable and satisfying abilities in inferring potential lncRNA-protein interactions. The superior performance of the proposed method could be attributed to second-order correlation elimination, which is more suitable for our task and can lead to better prediction performance.

3.2. Case Studies

In addition, four case studies have been carried out to further evaluate the effectiveness of <span class="Chemical">LPI-IBNRA. The interactions in our benchmark dataset were obtained in NPInter v2.0 which was established in 2013. NPInter was then upgraded to NPInter v3.0 in 2016 (Hao et al., 2016), which includes newly discovered l<span class="Gene">ncRNA-protein interactions. Thus, we predicted novel lncRNA-protein interactions based on known interactions in the benchmark dataset, then confirmed our predictions in NPInter v3.0. For each lncRNA, the proteins ranked within the top 5 were considered as potential proteins that interact with the given lncRNA. Case studies were carried out on four lncRNAs, including lncRNA DLEU2, CRHR1-1T1, LRRC75A-AS1 and SNHG5. Table 2 shows the prediction results and whether there were confirmations for these l<span class="Gene">ncRNAs. It indicates that five (<span class="Gene">DLEU2), five (CRHR1-1T1), five (LRRC75A-AS1), and four (SNHG5) out of the top five predicted lncRNA-interacted proteins, were confirmed by NPInter v3.0. The rankings of these lncRNA-protein interactions in other benchmark method predictions are also listed in Table 2. It can be observed that several novel interactions did not have high rankings in the predictions of other methods, and these interactions are likely to be ignored by these methods. Therefore, LPI-IBNRA has great potential to predict new lncRNA-protein interactions.
Table 2

The top five ranked proteins for lncRNA DLEU2, CRHR1-1T1, LRRC75A-AS1, and SNHG5.

LncRNAProteinConfirmed?RankCFRWRLPBNILPIHNLPI-BNPRA
DLEU2ENSP00000290341Confirmed171131
DLEU2ENSP00000258729Confirmed2172216
DLEU2ENSP00000381031Confirmed316357
DLEU2ENSP00000371634Confirmed4163628
DLEU2ENSP00000254108Confirmed5104442
CRHR1-IT1ENSP00000254108Confirmed1114322
CRHR1-IT1ENSP00000240185Confirmed2292117
CRHR1-IT1ENSP00000350028Confirmed311081210
CRHR1-IT1ENSP00000381031Confirmed41211958
CRHR1-IT1ENSP00000220592Confirmed5514106
LRRC75A-AS1ENSP00000385269Confirmed161111
LRRC75A-AS1ENSP00000254108Confirmed272222
LRRC75A-AS1ENSP00000381031Confirmed384363
LRRC75A-AS1ENSP00000350028Confirmed4967129
LRRC75A-AS1ENSP00000258962Confirmed51105165
SNHG5ENSP00000290341Confirmed111141
SNHG5ENSP00000350028Confirmed2210463
SNHG5ENSP00000240185Confirmed37113134
SNHG5ENSP00000254108432212
SNHG5ENSP00000381031Confirmed5485106
The top five ranked proteins for l<span class="Gene">ncRNA <span class="Gene">DLEU2, CRHR1-1T1, LRRC75A-AS1, and SNHG5.

4. Discussion and Conclusion

In this article, we proposed a novel method <span class="Chemical">LPI-IBNRA for predicting l<span class="Gene">ncRNA-protein interactions, based on the known lncRNA-protein interactions, lncRNA expression similarity and protein-protein interactions. We integrated the known interactions and similarity as the initial resource scores for a two-round resource allocation of a bipartite network recommendation. Furthermore, we optimized the weight matrix by eliminating second-order correlations appropriately, to obtain the final result of lncRNA-protein interaction prediction. We finally acquired gratifying and reliable prediction performance in LOOCV, 10-fold cross evaluation and case studies. Thus, we believe that LPI-IBNRA can make reliable predictions and might guide future experimental studies on lncRNA-protein interactions. <span class="Chemical">LPI-IBNRA has the following improvements over several previous methods in predicting l<span class="Gene">ncRNA-protein interactions. First, with the employment of the bipartite network recommender algorithm, we utilized the known lncRNA-protein interactions to construct a bipartite network between lncRNAs and proteins, and then allocated the resource scores via interaction edges between lncRNA nodes and protein nodes. Therefore, the negative sample set is not required in our methods. Second, we assigned weights to each edge on the bipartite network, which is distinct from most former bipartite network methods. Thus, the resource scores would not be evenly distributed during the resource allocation process. Finally, we eliminated second-order correlations on the bipartite network appropriately, to enhance prediction accuracy. Although impressive results have been achieved, there is still much room for improvement in our method. At first, though known l<span class="Gene">ncRNA-protein interactions have been more than before, it is still very difficult for the proposed method to obtain adequate results based on the prediction. Moreover, as the resource allocation of the bipartite network recommendation algorithm is based on known l<span class="Gene">ncRNA-protein interactions, LPI-IBNRA is not suitable to predict interactions of lncRNAs without any known interacted protein.

Author Contributions

GX designed the experiments. CW and ZF performed the experiments. GX, CW, YS, ZF, and JL conceived the project and analyzed the data. CW and YS wrote the manuscript and all authors contributed to the writing.

Conflict of Interest Statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
  59 in total

1.  CLIP: a method for identifying protein-RNA interaction sites in living cells.

Authors:  Jernej Ule; Kirk Jensen; Aldo Mele; Robert B Darnell
Journal:  Methods       Date:  2005-12       Impact factor: 3.608

2.  Predicting protein associations with long noncoding RNAs.

Authors:  Matteo Bellucci; Federico Agostini; Marianela Masin; Gian Gaetano Tartaglia
Journal:  Nat Methods       Date:  2011-06       Impact factor: 28.547

3.  RNA-binding protein immunopurification-microarray (RIP-Chip) analysis to profile localized RNAs.

Authors:  Alessia Galgano; André P Gerber
Journal:  Methods Mol Biol       Date:  2011

4.  The prostate cancer-up-regulated long noncoding RNA PlncRNA-1 modulates apoptosis and proliferation through reciprocal regulation of androgen receptor.

Authors:  Zilian Cui; Shancheng Ren; Ji Lu; Fubo Wang; Weidong Xu; Yi Sun; Min Wei; Junyi Chen; Xu Gao; Chuanliang Xu; Jian-Hua Mao; Yinghao Sun
Journal:  Urol Oncol       Date:  2012-01-20       Impact factor: 3.498

Review 5.  RNA-protein interactions in human health and disease.

Authors:  Ahmad M Khalil; John L Rinn
Journal:  Semin Cell Dev Biol       Date:  2011-02-17       Impact factor: 7.727

6.  Correlation and quantitation of microRNA aberrant expression in tissues and sera from patients with breast tumor.

Authors:  Fengjun Wang; Zhiguo Zheng; Jiangfeng Guo; Xianfeng Ding
Journal:  Gynecol Oncol       Date:  2010-12       Impact factor: 5.482

7.  Expression of a noncoding RNA is elevated in Alzheimer's disease and drives rapid feed-forward regulation of beta-secretase.

Authors:  Mohammad Ali Faghihi; Farzaneh Modarresi; Ahmad M Khalil; Douglas E Wood; Barbara G Sahagan; Todd E Morgan; Caleb E Finch; Georges St Laurent; Paul J Kenny; Claes Wahlestedt
Journal:  Nat Med       Date:  2008-06-29       Impact factor: 53.440

8.  Predicting RNA-protein interactions using only sequence information.

Authors:  Usha K Muppirala; Vasant G Honavar; Drena Dobbs
Journal:  BMC Bioinformatics       Date:  2011-12-22       Impact factor: 3.169

9.  GAS5, a non-protein-coding RNA, controls apoptosis and is downregulated in breast cancer.

Authors:  M Mourtada-Maarabouni; M R Pickard; V L Hedge; F Farzaneh; G T Williams
Journal:  Oncogene       Date:  2008-10-06       Impact factor: 9.867

10.  STRING v9.1: protein-protein interaction networks, with increased coverage and integration.

Authors:  Andrea Franceschini; Damian Szklarczyk; Sune Frankild; Michael Kuhn; Milan Simonovic; Alexander Roth; Jianyi Lin; Pablo Minguez; Peer Bork; Christian von Mering; Lars J Jensen
Journal:  Nucleic Acids Res       Date:  2012-11-29       Impact factor: 16.971

View more
  6 in total

1.  AptaNet as a deep learning approach for aptamer-protein interaction prediction.

Authors:  Neda Emami; Reza Ferdousi
Journal:  Sci Rep       Date:  2021-03-16       Impact factor: 4.379

2.  DeepLPI: a multimodal deep learning method for predicting the interactions between lncRNAs and protein isoforms.

Authors:  Dipan Shaw; Hao Chen; Minzhu Xie; Tao Jiang
Journal:  BMC Bioinformatics       Date:  2021-01-18       Impact factor: 3.169

3.  LPI-SKF: Predicting lncRNA-Protein Interactions Using Similarity Kernel Fusions.

Authors:  Yuan-Ke Zhou; Jie Hu; Zi-Ang Shen; Wen-Ya Zhang; Pu-Feng Du
Journal:  Front Genet       Date:  2020-12-09       Impact factor: 4.599

4.  A novel lncRNA-protein interaction prediction method based on deep forest with cascade forest structure.

Authors:  Xiongfei Tian; Ling Shen; Zhenwu Wang; Liqian Zhou; Lihong Peng
Journal:  Sci Rep       Date:  2021-09-23       Impact factor: 4.379

Review 5.  Probing lncRNA-Protein Interactions: Data Repositories, Models, and Algorithms.

Authors:  Lihong Peng; Fuxing Liu; Jialiang Yang; Xiaojun Liu; Yajie Meng; Xiaojun Deng; Cheng Peng; Geng Tian; Liqian Zhou
Journal:  Front Genet       Date:  2020-01-31       Impact factor: 4.599

6.  Predicting lncRNA-Protein Interaction With Weighted Graph-Regularized Matrix Factorization.

Authors:  Xibo Sun; Leiming Cheng; Jinyang Liu; Cuinan Xie; Jiasheng Yang; Fu Li
Journal:  Front Genet       Date:  2021-07-16       Impact factor: 4.599

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.