| Literature DB >> 27579307 |
Zhijun Liao1, Yong Huang2, Xiaodong Yue3, Huijuan Lu4, Ping Xuan5, Ying Ju6.
Abstract
Gamma-aminobutyric acid type-A receptors (GABAARs) belong to multisubunit membrane spanning ligand-gated ion channels (LGICs) which act as the principal mediators of rapid inhibitory synaptic transmission in the human brain. Therefore, the category prediction of GABAARs just from the protein amino acid sequence would be very helpful for the recognition and research of novel receptors. Based on the proteins' physicochemical properties, amino acids composition and position, a GABAAR classifier was first constructed using a 188-dimensional (188D) algorithm at 90% cd-hit identity and compared with pseudo-amino acid composition (PseAAC) and ProtrWeb web-based algorithms for human GABAAR proteins. Then, four classifiers including gradient boosting decision tree (GBDT), random forest (RF), a library for support vector machine (libSVM), and k-nearest neighbor (k-NN) were compared on the dataset at cd-hit 40% low identity. This work obtained the highest correctly classified rate at 96.8% and the highest specificity at 99.29%. But the values of sensitivity, accuracy, and Matthew's correlation coefficient were a little lower than those of PseAAC and ProtrWeb; GBDT and libSVM can make a little better performance than RF and k-NN at the second dataset. In conclusion, a GABAAR classifier was successfully constructed using only the protein sequence information.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27579307 PMCID: PMC4992803 DOI: 10.1155/2016/2375268
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1GABA conformation.
Figure 2Model of direct GABA production.
Figure 3GABAAR modulation patterns of transmembrane domain, a homology model of the transmembrane domains of a GABAAR showing the five-M2-helix domains forming the chloride ion channel (blue) and M1, M3, and M4 helices for single α1 (grey) or β3 (green) subunit. The helices may embed into the postsynaptic membrane in mammalian CNS.
Pfam accession numbers (83 entries) for GABAARs as positive group.
| PF00008, PF00012, PF00018, PF00022, PF00028, PF00053, PF00055, PF00057, PF00059, PF00060, PF00069 |
| PF00078, PF00084, PF00087, PF00090, PF00100, PF00130, PF00147, PF00163, PF00168, PF00169, PF00209 |
| PF00226, PF00240, PF00270, PF00271, PF00335, PF00387, PF00388, PF00397, PF00400, PF00454, PF00520 |
| PF00564, PF00621, PF00627, PF00643, PF00651, PF00665, PF00754, PF00850, PF00892, PF01082, PF01352 |
| PF01436, PF01479, PF01498, PF01529, PF02072, PF02140, PF02214, PF02259, PF02260, PF02460, PF02891 |
| PF02931, PF02932, PF02991, PF03144, PF03416, PF03521, PF04849, PF06220, PF07645, PF07690, PF07707 |
| PF08007, PF08266, PF08377, PF08625, PF08771, PF09279, PF09497, PF11865, PF11938, PF12248, PF12448 |
| PF12662, PF13499, PF15311, PF15974, PF16457, PF16492 |
Confusion matrix classifier (RF) from three kinds of feature vector extraction algorithms.
| PseAAC | ProtrWeb | This work | ||||
|---|---|---|---|---|---|---|
| Human GABAARs | Human non-GABAARs | Human GABAARs | Human non-GABAARs | GABAAR proteins | Non-GABAAR proteins | |
| Positive cases | 55 | 2 | 55 | 3 | 2007 | 76 |
| Negative cases | 3 | 56 | 3 | 55 | 346 | 10576 |
Figure 4Sn, Sp, Acc, and MCC values listed from PseAAC, ProtrWeb, and our work. Note: PseAAC and ProtrWeb only include human 58 GABAARs and 58 non-GABAARs because of the web amount limitation; our method contains all the GABAARs and non-GABAARs (2353 versus 10652).
Classification results for four classifiers based on 360 GABAARs and 9598 non-GABAARs.
| Classifier | Sensitivity (%) | Specificity (%) | Accuracy (%) | MCC | Correctly classified rate |
|---|---|---|---|---|---|
| GDBT | 51.39 | 99.66 | 75.52 | 0.5828 | 0.9791 |
| RF | 41.39 | 99.86 | 70.63 | 0.5085 | 0.9775 |
| libSVM | 58.89 | 97.76 | 78.32 | 0.6148 | 0.9635 |
|
| 51.94 | 98.17 | 75.06 | 0.5651 | 0.9650 |
Figure 5Motifs of human GABAARs found by the MEME system (for details see Table 3). (a) Locations of the nine discovered motifs (showing the top 32 sequences). (b) Nine motif logos found by MEME.
Human conserved motifs of GABAARs found by MEME system (in regular expression).
| Motif | Width |
| Best possible match |
|---|---|---|---|
| 1 | 50 | 1.6 | L[KRS]R[KNR][IMV]GYF[IV][IL]QTY[IL]P[CS][IT][LM][TI][VT][IV]LS |
|
| |||
| 2 | 50 | 8.6 | T[TV]PN[KR][LM][LI]R[IL]F[PD][DN]GT[VLI]LYT[LM]R[LI]T[ITV]TA |
|
| |||
| 3 | 32 | 1.9 | P[KR][VI][SA]Y[VAI][TK]A[MI]DW[FY][IL]AVC[FY][AV]FVF[SL]AL |
|
| |||
| 4 | 29 | 2.4 | L[TR]L[ND]N[LR][ML][AV]SK[IL]W[TV]PDT[FY]F[HRV]N[GS]KKS[FIV] |
|
| |||
| 5 | 50 | 3.7 | GYDNRLRP[GN][FL]G[GE][PR][PI][TV][EQ][VI]XT[DN]I[YD][VI][TA] |
|
| |||
| 6 | 29 | 1.7 | [DS][VI]S[KA]ID[KR][YW]SRI[VFL]FPV[AL]FG[LF]FN[LV]VYW[AVL] |
|
| |||
| 7 | 29 | 3.5 | Q[FY][DS][LFI][VL]G[QL][TR][VN][GST][TS]E[TI][VI]K[STF]STG[ED] |
|
| |||
| 8 | 29 | 2.4 | [AFG][RS][LQS][VMY][LGP][AQT][NPS][IS][QLV][EKQ]DE[ALT][KN] |
|
| |||
| 9 | 41 | 5.8 | [TY]W[LK]RGN[DE]S[VL][RK][GT][LD]E[HK][LI][RS]L[AS]Q[YF][TL] |