Literature DB >> 32476993

LipoSVM: Prediction of Lysine Lipoylation in Proteins based on the Support Vector Machine.

Meiqi Wu1, Pengchao Lu2, Yingxi Yang3, Liwen Liu1, Hui Wang4, Yan Xu1, Jixun Chu1.   

Abstract

BACKGROUND: Lysine lipoylation which is a rare and highly conserved post-translational modification of proteins has been considered as one of the most important processes in the biological field. To obtain a comprehensive understanding of regulatory mechanism of lysine lipoylation, the key is to identify lysine lipoylated sites. The experimental methods are expensive and laborious. Due to the high cost and complexity of experimental methods, it is urgent to develop computational ways to predict lipoylation sites.
METHODOLOGY: In this work, a predictor named LipoSVM is developed to accurately predict lipoylation sites. To overcome the problem of an unbalanced sample, synthetic minority over-sampling technique (SMOTE) is utilized to balance negative and positive samples. Furthermore, different ratios of positive and negative samples are chosen as training sets.
RESULTS: By comparing five different encoding schemes and five classification algorithms, LipoSVM is constructed finally by using a training set with positive and negative sample ratio of 1:1, combining with position-specific scoring matrix and support vector machine. The best performance achieves an accuracy of 99.98% and AUC 0.9996 in 10-fold cross-validation. The AUC of independent test set reaches 0.9997, which demonstrates the robustness of LipoSVM. The analysis between lysine lipoylation and non-lipoylation fragments shows significant statistical differences.
CONCLUSION: A good predictor for lysine lipoylation is built based on position-specific scoring matrix and support vector machine. Meanwhile, an online webserver LipoSVM can be freely downloaded from https://github.com/stars20180811/LipoSVM.
© 2019 Bentham Science Publishers.

Entities:  

Keywords:  Lysine lipoylation; amino acids; post-translational modifications; prediction; scoring matrix; support vector machine

Year:  2019        PMID: 32476993      PMCID: PMC7235397          DOI: 10.2174/1389202919666191014092843

Source DB:  PubMed          Journal:  Curr Genomics        ISSN: 1389-2029            Impact factor:   2.236


INTRODUCTION

Protein post-translational modifications (PTMs) refer to the chemical modifications of proteins after translation. Studies have shown that the production of PTMs mainly through the splicing of the peptide chain backbone, adds new groups to specific amino acid side chains, or chemically modifying existing groups [1]. PTMs play key roles in regulating various biological functions, such as protein activity, stability and interaction profiles [2]. Lysine is not only the most modified amino acid but it is also the amino acid that is affected by a wide range of PTMs among the 20 standard amino acids [3]. Common lysine post-translational modifications include acetylation [4], methylation [5], ubiquitination [6], sumoylation [7] and phosphorylation [8]. Lipoylation is one of the rare PTMs that involves the covalent attachment of lipoamide to a lysine residue via an amide bond [9-12]. Different from other post-translational modifications that rely on local amino acid motifs, lipoylated substrate is not significantly affected by conservative amino acid mutations on both sides of modified lysine [13]. So far, only four lipoylated multimeric metabolic enzymes have been found in mammals and one in bacteria [14, 15]. Despite the rare occurrence, it plays an important role in many key metabolic processes and protein interactions. For example, AoDH which is the only lipoylated protein complex found in bacteria plays roles in the catabolism of the acetoin energy storage molecule in acetyl-CoA and acetaldehyde [16]. And the lipoylated enzyme KDH regulates the binding of a surrogate carbon source to glucose in the TCA cycle, catalyzing the removal of alpha-ketoglutaric acid to form succinyl-CoA [17]. In addition, many studies have demonstrated that lipoylated complexes are inextricably linked to disease, including Warburg effect [18-20], HIV infection [21, 22] and herpesvirus [23]. Based on the various studies mentioned above, it is evident that deciphering the biological function of lipoylation may help in revealing the underlying molecular causes of these diseases. In addition, lipoylation with high evolutionary conservation and lipoylated enzymes are critically linked to the development of disease and the maintenance of health [14, 15]. Based on the above findings, it is evident that deciphering the biological functions of lipoylation may help in revealing the underlying molecular causes of these diseases. And understanding the mechanism of lipoylated complexes is critical, which helps in the diagnosis and treatment of diseases. The first condition for understanding the mechanism of lipoylation is to identify the lipoylation sites. Traditional molecular biology and biochemistry techniques, such as nuclear magnetic resonance spectroscopy [24], protein purification [25], and western blotting using antibody against lipoic acid [14] provide valuable insights into the function of lipoylated proteins. Moreover, mass spectrometry provides a means of studying the lipoylation status of specific lysine residues in different cell types, tissues and biological environments [26]. All of these methods have the drawbacks of low throughput, extensive time-consumption and high-cost. Hence, it is necessary to predict the lipoylation sites through computational approaches that are convenient and high throughput. In this work, a widely used algorithm SVM is implemented to construct predictors. To reduce the negative impact of unbalanced data on classifier performance, the positive samples were oversampled by synthetic minority over-sampling (SMOTE) [27]. Subsequently, different ratios of positive and negative samples are selected as training sets, respectively. And five different encoding schemes including bi-profile Bayes (BPB), AAindex, position-specific scoring matrix (PSSM), BLOSUM62 matrix and binary are implemented. In addition, comparisons with other algorithms K-Nearest Neighbor (KNN), Decision Tree, Logistic Regression (LR) and Naive Bayes show the effectiveness of Support Vector Machine (SVM) in predicting lipoylation sites in proteins. A comparison with existing tools has been implemented to demonstrate the effectiveness of LipoSVM. A flowchart of the LipoSVM is given in Fig. ().

MATERIALS AND METHODS

Benchmark Dataset

575 proteins with 593 experimentally annotated lysine lipoylation sites were retrieved from UniProt (http://www.uniprot.org/) by searching the keywords “lipoylation” and “lipoylated protein”. These proteins are scanned by a sliding window whose center is lysine (K). The missing amino acids are filled with pseudo amino acid “X”. In this work, the optimal window length is 17. As a result, 593 lipoylated fragments and 2183 non-lipoylated fragments are obtained. A fragment was assigned with experimentally validated lysine lipoylation site in positive dataset S + or in negative dataset S. In general, the training set with high homology could cause over-fitting which impairs the generalization of a predictor. Therefore, if there are more than 40% of residues if the two compared fragments are same, only one of them should be retained. After removing the redundant fragments, 53 positive and 1028 negative fragments were obtained (Supplementary Table ).

Feature Constructions

As existing machine-learning algorithms cannot process sequence samples directly, therefore, to represent the biological sequence samples with an effective mathematical expression is an essential step [28]. In this work, bi-profile Bayes (BPB), AAindex, position-specific scoring matrix (PSSM), BLOSUM62 matrix and binary are utilized to convert protein fragments into vectors with different dimensions.

Bi-profile bayes (BPB)

BPB is also known as the bilateral Bayes algorithm which was proposed by Shao et al. [29]. It encodes positive and negative samples with the position information of amino acids. First, according to the known positive and negative samples, a frequency matrix FP of each amino acid at each position in the positive samples and a frequency matrix FN at each position of the negative samples are obtained. FP is calculated as follows: where P is the frequency of i-th amino acid in j-th position for a given positive dataset. L is the length of a protein fragment. FN can be obtained in the same way.

Physicochemical and Biochemical Properties

AAindex is a database of numerical indices representing various physicochemical and biochemical properties of amino acids and the pairs of amino acids [30]. There are 566 entries in amino acid index database (http://www.genome.jp/dbget-bin/www_bfind?aaindex). In some instances, the values are not reported for all amino acids [30]. Thus, 14 common physicochemical properties (Supplementary Table ) from Amino Acid Index Database are selected for the characterization of amino acids.

Position-specific Scoring Matrix (PSSM)

To obtain information about sequential evolution, the position-specific scoring matrix [31] can be utilized. By combining matrix V obtained via two-sample t-test [32] with position weight matrixes F and F, the following PSSM matrix which is used for encoding can be constructed (the detailed process is shown in Supplementary ). where M>0, can be calculated as follows: If M>0, the probability that the i-th amino acid in the i-th position appears in the positive fragments is greater. Otherwise, it is more likely to be in the negative fragments.

BLOSUM62 Matrix

BLOSUM matrices have belonged to the most common substitution matrix series for protein homology search and sequence alignments [33]. The essential characteristics of protein evolution can be learned the from analysis of aligned protein sequences. Thus, a row of BLOSUM62 matrix is applied to represent an amino acid.

Binary

The small range of amino acids around the lipoylation site is the main sequence feature of lysine lipoylated fragment and has been shown to be useful for predicting lipoylation sites [34]. These amino acids can be represented by binary encoding. Therefore, each of the 21 amino acids (20 amino acids plus the pseudo amino acid “X”) are encoded as a 21-dimensional vector containing only 0 and 1.

Imbalance Data Processing

The imbalance of positive and negative samples in the training set has a massive impact on predictor performance. In the process of data preprocessing, over-sampling and under-sampling are the common means to deal with the unbalanced issues. Since only 53 positive samples are obtained, therefore, the oversampling method is preferred. SMOTE is a powerful oversampling method that has achieved great success in solving class imbalance [35]. The pseudo-code of the SMOTE algorithm is shown in Supplementary . The number of positive samples reaches 212 after SMOTE. Then, 50 positive and 50 negative samples are randomly selected as the independent test set.

Algorithm

Support Vector Machine (SVM) is a universal classification algorithm and it is widely used in the field of biological computing [36, 37]. The main idea of SVM is to find a hyperplane that maximizes the distance between classification boundary points. For a given training dataset where n is the number of training set, yi represents sample label. Then the optimal hyperplane, where represents the weight vector, b denotes that the bias is constructed in division samples. The kernel function such as linear kernel function, polynomial function, radial basis function (RBF), and sigmoid kernel function [38] are needed to map data into high-dimension space. LIBSVM is utilized to construct the predictor. C-support vector classification (C-SVC) is chosen as a formulation, and RBF is chosen as the kernel function. The built predictor for lysine lipoylation with SVM is called LipoSVM.

Model Evaluation

In general, performance evaluations of predictors in statistical prediction are K-fold cross-validation test, jackknife test, and independent dataset test [37]. 10-fold cross-validation and an independent dataset test are chosen to validate these models. To obtain a reliable estimation, the 10-fold cross-validation is repeated 10 times. Accuracy (Acc), specificity (Sp), sensitivity (Sn), area under the ROC curve (AUC) and Matthews Correlation Coefficient (MCC) are widely-accepted measurements [39]. In the following formula, accuracy indicates that the percentage of the test set should be correctly predicted. The specificity (also called the true negative rate) represents the proportion of negatives that are correctly predicted. The sensitivity (also called the true positive rate or the recall) measures the proportion of positives that are correctly predicted. The MCC is considered a balanced measure and can be used even if the size of the class is very different. where TP denotes the number of true positive samples, TN denotes the number of true negative samples, FP denotes the number of falsepositive samples, FN denotes the number of false-negative samples.

RESULTS AND DISCUSSION

Performance of LipoSVM

To obtain an optimal predictor, different parameters including window size, proportion of positive and negative samples, penalty factor and kernel parameter have been adjusted. The results show that when the window length is 17 (determined by the highest MCC value), the performance is optimal in 10-fold cross-validation (Table ). Since the ratio of positive samples to negative samples in the training set is about 1:6, positive and negative samples from 1:1 to 1:6 as training sets were randomly selected. The results show that the variance of MCC between different encoding schemes is the smallest when the ratio is 1:5. However, when the ratio is 1:1, model with PSSM encoding scheme has better performance than others (Table , Fig. ) which further indicates the necessity to mitigate the impact of category imbalance. Furthermore, the AUC value of independent test set reaches to 0.9997 which demonstrates the generalization performance of LipoSVM (Fig. ).

The Comparison of Different Features

In this work, five encoding schemes which contain evolutionary information, sequence location information, amino acid composition information, and physicochemical properties are applied to encode protein fragments. BPB is utilized to obtain a 34-dimensional feature vector, a 238-dimensional feature vector through 14 physicochemical properties from AAindex. Along with a 17-dimensional feature vector by PSSM and a 357-dimensional feature vector by binary or BLOSUM62 matrix encoding scheme. As shown in Table and Fig. (, the contribution of different encoding schemes to classifier performance is discrepant. Although the model is optimal under PSSM and 1:1 ratio, the variance of MCC between different ratios is the largest in this encoding method. In contrast, the variance of MCC of BLOSUM62 matrix is the smallest, followed by BPB, Binary and AAindex. The results show that it is pivotal to express the biological sequences with mathematical expressions that truly reflect their intrinsic correlation with prediction targets.
Fig. (2)

The values of MCC with different ratio data sets and encoding schemes. The X-axis represents different encoding schemes, the Y-axis has average values of MCC and the black bars represent standard error.

Analysis between Lysine Lipoylation and Non-Lipoylation Fragments

To intuitively understand the difference between positive and negative samples, the composition of various amino acids in lipoylated and non-lipoylated fragments is calculated (Fig. ). Besides, Two Sample Logo [32] is used to analyze the occurrence of amino acide around lysine lipoylation and non-lipoylation (Fig. ) sites. From Fig. (, it can be observed that there is a certain difference in the percentage of the amino acids between the lipoylated and non-lipoylated fragments. Among the lipoylated protein fragments, valine (V) has the highest proportion, followed by glutamic (E) and Serine (S), while non-lipoylated protein fragments have the highest percentage of lysine (K), followed alanine (A) and glutamic (E). It is clear that valine (V) and lysine (K) ratios are significantly different in positive and negative samples, which are the key amino acids to distinguish positive and negative samples. From Fig. (, it further illustrates that the compositional and positional information of lipoylated and non-lipoylated fragments show significant statistical difference.
Fig. (4)

The proportion of different amino acids between lysine lipoylation and non-lipoylation fragments. The X-axis represents different amino acids, and the Y-axis is the percentage of different amino acids.

Fig. (5)

Two Sample Logo (p<0.05) of compositional bias around the lysine lipoylation and non-lipoylation sites.

Comparison of Different Algorithms and the Existing Predictor LipoPred

To verify the effectiveness of the SVM algorithm, it was compared with other algorithms including K-Nearest Neighbor (KNN), Decision Tree, Logistic Regression (LR) and Naive Bayes. It can be seen from Table that the model obtained by SVM is superior to the model obtained by other algorithms. The models trained by KNN are the worst because KNN only relies on several points in the nearest neighbor to classify. Essentially, there is no training process. In addition, this predictor is superior to the existing predictor LipoPred [40] which with ACC 0.9994 and MCC 0.9930.

CONCLUSION

Protein lysine lipoylation is a key post-transcriptional modification in cell regulation. To fully understand the molecular mechanisms of biological processes associated with lipoylation, a preliminary but critical step is to identify lipoylated substrate and corresponding lipoylation sites. It is desirable and necessary to achieve large-scale identification of lipoylated proteins through computational ways. To overcome this challenge, SMOTE is first implemented to balance positive and negative datasets. Subsequently, the different ratios of positive and negative samples are selected as training sets. By comparing different encoding schemes and ratios, the optimal predictor LipoSVM is obtained. The comparison with other classification algorithms and the existing predictor LipoPred for lysine lipoylation proves the effectiveness of LipoSVM. The results show that machine learning can replace redundant experimental methods to identify acetylation sites with high accuracy and throughput, which contributes to the research of lipoylation proteins.
Table 1

Performance of various window lengths in a 10-fold cross-validation.

- ACC (%) Sp (%) Sn (%) MCC AUC
999.6499.6199.940.98890.9979
1199.8599.8498.770.99310.9989
1399.7299.7998.890.98190.9992
1599.8699.8499.960.99590.9995
1799.9699.98100.000.99900.9997
1999.9299.9799.400.99650.9964
2199.9399.8999.440.99680.9978
Table 2

Performance of models with different ratios and encoding schemes.

Ratio Encoding Schemes ACC (%) Sp (%) Sn (%) MCC AUC
1:1BPB99.84±0.2199.88±0.2599.81±0.280.9969±0.00410.9989±0.0031
AAIndex99.91±0.1499.88±0.2399.94±0.180.9981±0.00280.9992±0.0008
PSSM99.98±0.1899.96±0.0999.99±0.240.9992±0.00130.9996±0.0027
BLOSUM6299.84±0.2599.91±0.1899.69±0.500.9969±0.00500.9979±0.0012
Binary99.81±0.3199.89±0.2299.63±0.630.9963±0.00620.9989±0.0011
1:2BPB99.96±0.1299.97±0.0999.94±0.190.9991±0.00280.9997±0.0012
AAIndex99.75±0.3999.99±0.0299.26±1.160.9945±0.00870.9986±0.0018
PSSM99.67±0.1999.51±0.2899.98±0.110.9927±0.00420.9958±0.0013
BLOSUM6299.94±0.1399.99±0.0399.81±0.400.9986±0.00290.9979±0.0014
Binary99.96±0.1299.97±0.0999.94±0.190.9991±0.00280.9995±0.0020
1:3BPB99.92±0.0899.96±0.0899.81±0.280.9979±0.00200.9983±0.0017
AAIndex99.83±0.1199.98±0.0699.38±0.390.9955±0.00290.9979±0.0032
PSSM99.95±0.1299.93±0.1499.99±0.040.9986±0.00270.9999±0.0021
BLOSUM6299.91±0.2399.99±0.0499.63±0.920.9975±0.00620.9979±0.0012
Binary99.95±0.0799.99±0.0599.81±0.280.9988±0.00190.9994±0.0032
1:4BPB99.94±0.0899.98±0.0499.75±0.300.9981±0.00260.9987±0.0025
AAIndex99.92±0.1199.99±0.0299.63±0.560.9977±0.00350.9985±0.0013
PSSM99.97±0.0599.97±0.0699.99±0.090.9992±0.00150.9994±0.0011
BLOSUM6299.92±0.1199.99±0.0499.63±0.560.9977±0.00350.9978±0.0015
Binary99.88±0.1699.99±0.0699.38±0.830.9961±0.00520.9972±0.0026
1:5BPB99.96±0.0799.99±0.0499.81±0.280.9985±0.00240.9992±0.0015
AAIndex99.96±0.0899.99±0.0599.75±0.490.9985±0.00300.9982±0.0036
PSSM99.94±0.0599.92±0.0699.99±0.040.9978±0.00180.9979±0.0012
BLOSUM6299.93±0.1099.99±0.1099.57±0.620.9974±0.00370.9977±0.0035
Binary99.96±0.0799.99±0.0899.75±0.410.9985±0.00240.9987±0.0026
1:6BPB99.95±0.0499.99±0.1299.63±0.300.9978±0.00180.9979±0.0019
AAIndex99.89±0.1499.98±0.0499.38±0.870.9956±0.00530.9967±0.0024
PSSM99.91±0.0099.90±0.0099.99±0.040.9964±0.00000.9991±0.0012
BLOSUM6299.95±0.1099.99±0.0999.63±0.740.9978±0.00430.9978±0.0033
Binary99.97±0.0499.99±0.0699.79±0.290.9988±0.00170.9983±0.0019
Table 3

Performance of models with different algorithms and encoding schemes.

Algorithms Encoding Schemes ACC (%) Sp (%) Sn (%) MCC AUC
SVMBPB99.84±0.2199.88±0.2599.81±0.280.9969±0.00410.9989±0.0031
AAIndex99.91±0.1499.88±0.2399.94±0.180.9981±0.00280.9992±0.0008
PSSM99.98±0.1899.96±0.0999.99±0.240.9992±0.00130.9996±0.0027
BLOSUM6299.84±0.2599.91±0.1899.69±0.500.9969±0.00500.9979±0.0012
Binary99.81±0.3199.89±0.2299.63±0.630.9963±0.00620.9989±0.0011
KNNBPB98.84±0.2199.69±0.4199.99±0.140.9969±0.00410.9996±0.0001
AAIndex84.26±1.6169.01±3.3499.51±0.770.7198±0.02580.9603±0.0109
PSSM99.13±0.3898.27±0.7799.98±0.290.9829±0.00760.9959±0.0014
BLOSUM6290.99±1.3181.97±2.6299.99±0.360.8336±0.02240.9818±0.0049
Binary83.46±1.6466.91±3.2899.98±0.630.7092±0.02610.9648±0.0122
DecisionBPB97.99±0.6297.59±0.8098.39±0.880.9600±0.01240.9804±0.0078
TreeAAIndex96.60±1.2693.64±2.2199.57±0.620.9339±0.02420.9674±0.0128
PSSM97.31±0.6396.23±0.9798.39±1.000.9466±0.01270.9743±0.0085
BLOSUM6297.72±0.9695.86±1.8999.57±0.390.9551±0.01840.9791±0.0087
Binary96.60±0.6293.52±1.0899.69±0.410.9339±0.01190.9684±0.0079
LogisticBPB99.91±0.2099.88±0.2599.94±0.180.9981±0.00390.9993±0.0017
RegressionAAIndex99.51±0.2599.01±0.4999.98±0.830.9902±0.00490.9989±0.0102
PSSM99.54±0.2199.07±0.4199.97±0.480.9908±0.00410.9991±0.0076
BLOSUM6299.72±0.2999.44±0.5899.98±0.620.9945±0.00580.9994±0.0076
Binary99.94±0.1299.88±0.2599.99±0.070.9988±0.00250.9996±0.0108
NaïveBPB99.54±0.3299.57±0.6299.51±0.250.9908±0.00630.9995±0.0004
BayesAAIndex98.73±0.5999.81±0.2897.65±1.160.9750±0.01160.9964±0.0043
PSSM99.96±0.7299.99±0.0499.99±0.350.9991±0.01400.9994±0.0091
BLOSUM6297.78±0.9499.13±0.6396.42±1.430.9560±0.01870.9949±0.0033
Binary98.86±0.5999.99±0.1497.72±1.170.9775±0.01150.9883±0.0061
  40 in total

Review 1.  Swinging arms and swinging domains in multifunctional enzymes: catalytic machines for multistep reactions.

Authors:  R N Perham
Journal:  Annu Rev Biochem       Date:  2000       Impact factor: 23.643

2.  Isolation of a polypeptide that has lymphocyte-differentiating properties and is probably represented universally in living cells.

Authors:  G Goldstein; M Scheid; U Hammerling; D H Schlesinger; H D Niall; E A Boyse
Journal:  Proc Natl Acad Sci U S A       Date:  1975-01       Impact factor: 11.205

3.  Discovery of a putative acetoin dehydrogenase complex in the hyperthermophilic archaeon Sulfolobus solfataricus.

Authors:  Karl A P Payne; David W Hough; Michael J Danson
Journal:  FEBS Lett       Date:  2010-02-18       Impact factor: 4.124

4.  iMulti-HumPhos: a multi-label classifier for identifying human phosphorylated proteins using multiple kernel learning based support vector machines.

Authors:  Md Al Mehedi Hasan; Shamim Ahmad; Md Khademul Islam Molla
Journal:  Mol Biosyst       Date:  2017-07-25

5.  Pyruvate dehydrogenase and pyruvate dehydrogenase kinase expression in non small cell lung cancer and tumor-associated stroma.

Authors:  Michael I Koukourakis; Alexandra Giatromanolaki; Efthimios Sivridis; Kevin C Gatter; Adrian L Harris
Journal:  Neoplasia       Date:  2005-01       Impact factor: 5.715

6.  Tyr-301 phosphorylation inhibits pyruvate dehydrogenase by blocking substrate binding and promotes the Warburg effect.

Authors:  Jun Fan; Hee-Bum Kang; Changliang Shan; Shannon Elf; Ruiting Lin; Jianxin Xie; Ting-Lei Gu; Mike Aguiar; Scott Lonning; Tae-Wook Chung; Martha Arellano; Hanna J Khoury; Dong M Shin; Fadlo R Khuri; Titus J Boggon; Sumin Kang; Jing Chen
Journal:  J Biol Chem       Date:  2014-08-07       Impact factor: 5.157

7.  iNitro-Tyr: prediction of nitrotyrosine sites in proteins with general pseudo amino acid composition.

Authors:  Yan Xu; Xin Wen; Li-Shu Wen; Ling-Yun Wu; Nai-Yang Deng; Kuo-Chen Chou
Journal:  PLoS One       Date:  2014-08-14       Impact factor: 3.240

8.  A deep learning method to more accurately recall known lysine acetylation sites.

Authors:  Meiqi Wu; Yingxi Yang; Hui Wang; Yan Xu
Journal:  BMC Bioinformatics       Date:  2019-01-23       Impact factor: 3.169

9.  Computational identification of protein methylation sites through bi-profile Bayes feature extraction.

Authors:  Jianlin Shao; Dong Xu; Sau-Na Tsai; Yifei Wang; Sai-Ming Ngai
Journal:  PLoS One       Date:  2009-03-17       Impact factor: 3.240

10.  Hybrid Method Based on Information Gain and Support Vector Machine for Gene Selection in Cancer Classification.

Authors:  Lingyun Gao; Mingquan Ye; Xiaojie Lu; Daobin Huang
Journal:  Genomics Proteomics Bioinformatics       Date:  2017-12-12       Impact factor: 7.691

View more
  2 in total

1.  Mal-Light: Enhancing Lysine Malonylation Sites Prediction Problem Using Evolutionary-based Features.

Authors:  Wakil Ahmad; Easin Arafat; Ghazaleh Taherzadeh; Alok Sharma; Shubhashis Roy Dipta; Abdollah Dehzangi; Swakkhar Shatabda
Journal:  IEEE Access       Date:  2020-04-22       Impact factor: 3.367

2.  Investigation of immune complexes formed by mitochondrial antigens containing a new lipoylated site in sera of primary biliary cholangitis patients.

Authors:  N Aibara; K Ohyama; M Nakamura; H Nakamura; M Tamai; N Kishikawa; A Kawakami; K Tsukamoto; M Nakashima; N Kuroda
Journal:  Clin Exp Immunol       Date:  2021-03-22       Impact factor: 5.732

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.