Literature DB >> 18545655

MemBrain: improving the accuracy of predicting transmembrane helices.

Abstract

Prediction of transmembrane helices (TMH) in alpha helical membrane proteins provides valuable information about the protein topology when the high resolution structures are not available. Many predictors have been developed based on either amino acid hydrophobicity scale or pure statistical approaches. While these predictors perform reasonably well in identifying the number of TMHs in a protein, they are generally inaccurate in predicting the ends of TMHs, or TMHs of unusual length. To improve the accuracy of TMH detection, we developed a machine-learning based predictor, MemBrain, which integrates a number of modern bioinformatics approaches including sequence representation by multiple sequence alignment matrix, the optimized evidence-theoretic K-nearest neighbor prediction algorithm, fusion of multiple prediction window sizes, and classification by dynamic threshold. MemBrain demonstrates an overall improvement of about 20% in prediction accuracy, particularly, in predicting the ends of TMHs and TMHs that are shorter than 15 residues. It also has the capability to detect N-terminal signal peptides. The MemBrain predictor is a useful sequence-based analysis tool for functional and structural characterization of helical membrane proteins; it is freely available at http://chou.med.harvard.edu/bioinf/MemBrain/.

Entities: Chemical Gene Species

Mesh：

Substances：
Membrane Proteins

Year: 2008 PMID： 18545655 PMCID： PMC2396505 DOI： 10.1371/journal.pone.0002399

Source DB: PubMed Journal: PLoS One ISSN： 1932-6203 Impact factor: 3.240

Introduction

Motivation for a more accurate TMH predictor

Membrane-embedded α helical, polytopic proteins constitute the majority of ion channels, transporters, and receptors in living organisms. This class of proteins, which account for ∼40% of all membrane proteins, are difficult targets for high resolution structural studies. Although experimentally determined structures of integral membrane proteins have been increasing at a fast rate in recent years, they only sum to less than 1% of the structures in the Protein Data Bank (PDB). Probably the first analysis that researchers perform when studying a helical membrane protein, whether it is for functional or structural characterization, is prediction of TMHs from the protein amino acid sequence. Knowledge of TMHs is very useful in initial elucidation of the overall topology of the protein, as well as in the rational design of protein constructs for structural studies. Computational tools for TMH prediction are widely available. In this paper and in previous papers on TMH prediction, TMH is defined as a segment of helix that is embedded in the membrane. Hence, TMH sequence ends when the transmembrane region ends, although the helix can continue beyond the membrane. In general, residues of TMHs are mostly hydrophobic. Hence, earlier TMH prediction programs, such as TOP-PRED [1], compute sequence hydrophobicity from amino acid hydrophobicity scales assigned by biophysical and chemical measurements [2]–[4], and predict TMH propensity based on the average hydrophobicity score of a sliding prediction window of N successive residues along the sequence. Later predictors use more statistics-based, machine learning techniques. For example, PHDhtm [5] is based on neural networks, and TMHMM [6] and Phobius [7] are based on the hidden Markov model. The available TMH predictors are used routinely in membrane protein characterization and, in most cases, are sufficiently reliable in providing descriptive information about TMHs [8]. However, as more high resolution structures of helical membrane proteins become available, we learn that TMH has a wide length distribution. About 5% of the TMHs in the known structures are very short (<15 residues) and only span the membrane partially. These helices are known as the ‘half TMHs’ (see an example in the structure of the glycerol-conducting channel [9]). Very long TMHs (>40 residues) have also been found in the membrane proteins, e.g., the metalloenzyme particulate methane monooxygenase protein [10]. None of the existing TMH predictors perform satisfactorily in detecting TMHs of irregular lengths. For example, TOP-PRED [1] predicts all the TMHs to be 21 residues long, TMHMM [6] cannot predict TMHs shorter than 16 residues or longer than 35 residues, and SOSUI [11] cannot predict TMHs longer than 25 residues. We developed a TMH prediction method, named MemBrain, which aims to improve the accuracy of TMH prediction. MemBrain was trained using the standard training dataset that was used by many other predictors, yet performed ∼20% better than others when tested with a benchmark testing dataset. The improvement came mainly from the capability of MemBrain to predict accurately the ends of TMHs and therefore to detect TMHs of irregular lengths. Such capability was realized by applying the powerful optimized evidence-theoretic K-nearest neighbor (OET-KNN) prediction algorithm [12]–[14] to protein sequence representations that include sequence evolution information, and by merging results from prediction sequence windows of different sizes. Our results show that, with the fast expanding database of experimental membrane protein structures, there is still much room for improving the accuracy of TMH prediction using a pure statistics-based protocol.

Results

The algorithm

A flowchart of the MemBrain predictor is shown in Figure 1. We represented a protein sequence of N residues by the position-specific scoring matrix (PSSM) (N rows and 20 columns), generated using the PSI-BLAST program [15] (see Methods section). The PSSM contains sequence evolution information from multiple sequence alignment against the SWISS-PROT protein database, and therefore provides a more complete description of the characteristics of a protein sequence. The propensity of a residue at positions i for being a part of a TMH was predicted based on a sequence segment of length L centered on i, where L is an odd number that represents the prediction window size. The prediction window size has a profound effect on the prediction outcome. Large window size, e.g., L = 17 (used in the PHDhtm predictor [5]), is more effective for predicting residues in the middle of a long TMH due to higher content of neighborhood information. However, it performs poorly for residues near the ends of TMHs, and is incapable of predicting half TMHs shorter than 15 residues. On the other hand, if L is too small, the prediction accuracy generally suffers as a result of losing the neighborhood sequence information. In the MemBrain predictor, we combined two window sizes to minimize the bias caused by the use of only one window size. We found that the fusion of two window sizes, 13 and 15, gave the best prediction results.

Figure 1

A flowchart diagram of the MemBrain protocol.

For TMH prediction, we used the standard training dataset which was used by most other TMH predictors, including TMHMM [6], Phobius [7], THUMBU [16] and SVMtm [17]. This dataset includes 50 helical membrane proteins of known TMH regions (see Supplementary Table S1). For each of the 50 proteins, the PSSM was generated using the PSI-BLAST program. From the PSSM, the matrix elements (L×20) for various sequence segments of L = 13 or 15 were extracted and stored in the training vectors or , respectively (see Methods section for details of constructing these vectors). These training vectors were labeled as ‘TMH’ if the residue j at the middle of the sequence segment is a part of a TMH, and were otherwise labeled as ‘NOT TMH’. From the 50 PSSMs, we built a training set of 14,531 vectors of L = 13 and 14,431 vectors of L = 15. These vectors were used as statistical rulers for making predictions on the target protein. Given a query protein, the PSSM was constructed and the query vector for sequence segment centered on residue i () was defined. To predict the TMH propensity of residue i, denoted here as E , we applied the OET-KNN algorithm for which the inputs are the query vector and all in the training set with the same dimension. The OET-KNN algorithm is a classification tool which has proven to be powerful in pattern recognition [12], [14] as well as in the prediction of sub-cellular locations of proteins [13], [18]. In the OET-KNN calculation (described in details in the Methods section), the Euclidean distances between and all were calculated, and the 50 closest matches were used to calculate E , which ranges from 0 to 1, where 0 and 1 are zero and unity probability of TMH, respectively. The TMH propensity obtained for L = 13, , was merged with that obtained for L = 15, , by simple averaging. Thus the combined TMH propensity for residue i is , ranging from 0 to 1. The procedure was repeated to cover all residues, (L-1)/2≤i≤N – (L-1)/2, in the query protein. For a query protein, the E vs. i plot gives an overview of the residue-specific TMH propensity. We used the median filter technique [19] to smooth the TMH propensity profile, and at the same time, to reduce noise. The final step is to determine the TMHs based on the smoothened propensity profile. In most other predictors, a fixed threshold is used to segment the scores, i.e., residues having scores larger than the threshold are assigned as TMH [11], [17], [20]. However, the optimal threshold for defining two TMHs separated by long loops is very different from the threshold required for identifying TMHs separated by short loops or tight turns. High-resolution structures show that two consecutive TMHs are often connected by very short loops or turns. In these cases, since the loop residues only represent a small region of the prediction window, the TMH propensity calculated for the short loops are higher than those of long loops. To solve this problem, we used a dynamic threshold method in which a base threshold propensity of 0.4 was first used to define TMH fragments. Then we raised the threshold according to the shape of the local propensity profile for identifying short loops or helical breaks in these fragments (see Methods section for details). Finally, in some membrane proteins, the first N-terminal TMH is a N-terminal signal peptide. We included an extra module in the MemBrain program to detect potential N-terminal signal peptide in a membrane protein using methods described in ref. [21].

Performance

To test the MemBrain predictor and compare its performance with the existing TMH predictors, we constructed a testing dataset consisting of 70 helical membrane proteins of known high resolution structures which do not overlap with the training dataset (see Supplementary Table S2). There are a total of 378 TMHs in the testing dataset. The performances of the TMH predictors were evaluated with four different scores. The TMH prediction success rate (V. VTMH is simply the fraction of TMHs in the testing set that are correctly predicted [22]; it is defined aswhere a TMH is considered predicted correctly if it has an overlap of at least 9 residues with the prediction. However, we note that such definition is not robust, and in some other studies, different lengths of residue overlap were used [22], [23]. The protein prediction success rate (V. VP is the fraction of helical proteins in the testing set that are correctly predicted [22]; it is defined aswhere a protein is considered predicted correctly if all the TMHs in this protein are correctly predicted (as defined in VTMH above) and the number of predicted TMHs is equal to the observed number of TMHs in the protein. The N and C scores. These two scores evaluate the accuracy of predicting the ends of TMHs [22]. N and C scores are the number of N- and C-terminal residues that do not match when aligning the predicted and observed TMHs. In the best case, if the predicted and observed TMHs are completely matched, the N and C scores equal to 0. The normalized RMSD. Finally, we calculated the normalized distance between the predicted and known TMH representation vectors, denoted by p = [p 1, p 2,…, p], in which p is assigned to 1 if residue i is a part of a TMH and is otherwise assigned to 0. The normalized distance, or RMSD, is defined aswhere p and p 0 are the predicted and known TMH representation vectors of a protein, respectively. The normalized RMSD is less subjective than the definition of VTMH and VP above. Table 1 compares the performances of MemBrain and other TMH predictors as judged by the four different scorings described above. MemBrain performs significantly better than other predictors in all four scoring categories. The VTMH and VP scores have been widely used in evaluation of TMH predictors. MemBrain VTMH and VP scores are 97.9% and 87.1%, respectively, which are about 6–16% better than Phobius (the best performer in this scoring category among the published predictors). MemBrain also has an improved capability to predict correctly the ends of TMHs as shown by the mean N and C scores of 3.2 and 3.1, which are about 20% better than the best published predictor for this scoring category. Finally the MemBrain mean normalized rmsd is 0.35, also about 20% better than the second-best performing predictor Phobius. The observed and predicted TMHs for the 70 membrane proteins in the testing dataset are given in Supplementary Data S1.

Table 1

Performance comparison of various TMH predictorsa.

Predictor	V_TMH	V_P	N-score	C-score	RMSD_N
THUMBU[16] b	85.5%	47.1%	6.9±4.9	6.7±4.9	0.58±0.19
SOSUI[11] c	89.1%	57.1%	5.0±4.1	5.0±4.2	0.44±0.21
DAS-TMfilter[20] d	90.7%	64.3%	6.5±5.0	5.5±5.3	0.58±0.16
TOP-PRED[1] e	92.6%	60.0%	4.5±3.8	4.6±3.9	0.45±0.15
TMHMM[6] f	91.0%	65.7%	4.5±3.8	4.5±3.9	0.44±0.15
Phobius[7] g	91.8%	71.4%	4.6±4.0	4.4±4.1	0.44±0.19
MemBrain h	97.9%	87.1%	3.2±3.0	3.1±2.8	0.35±0.14

The testing dataset consists of 378 TMH segments from 70 proteins (see Supplementary Table S2).

http://sparks.informatics.iupui.edu/Softwares-Services_files/thumbup.htm [16].

http://bp.nuap.nagoya-u.ac.jp/sosui/ [11].

http://mendel.imp.ac.at/sat/DAS/DAS.html [20].

http://bioweb.pasteur.fr/seqanal/interfaces/toppred.html [1].

http://www.cbs.dtu.dk/services/TMHMM/ [6].

http://phobius.cgb.ki.se/ [7].

http://chou.med.harvard.edu/bioinf/MemBrain/.

The testing dataset consists of 378 TMH segments from 70 proteins (see Supplementary Table S2). http://sparks.informatics.iupui.edu/Softwares-Services_files/thumbup.htm [16]. http://bp.nuap.nagoya-u.ac.jp/sosui/ [11]. http://mendel.imp.ac.at/sat/DAS/DAS.html [20]. http://bioweb.pasteur.fr/seqanal/interfaces/toppred.html [1]. http://www.cbs.dtu.dk/services/TMHMM/ [6]. http://phobius.cgb.ki.se/ [7]. http://chou.med.harvard.edu/bioinf/MemBrain/.

Discussion

The above prediction scores obtained from a fairly complete testing dataset show that MemBrain is the best TMH predictor to date. Probably the most attractive feature of MemBrain is the improved ability in correctly identifying the ends of TMHs. This capability is important because there is a wide distribution of TMH length amongst the 70 helical polytopic membrane proteins in the testing dataset (Fig. 2a), e.g., TMH can be as short as 10 residues. Most TMH predictors cannot detect TMHs shorter than 15 residues (e.g., Figures 2b&c show that the shortest TMH predicted by TMHMM and Phobius, the predictors which gave the second best N and C scores in Table 1, is 17 residues). However the length distribution of TMHs predicted by MemBrain matches most closely to that of the observed dataset (Fig. 2d). We also noticed that MemBrain shows similar improvements in prediction when considering only TMHs that are longer than 15 residues (see Supplementary Table S3).

Figure 2

TMH length distribution in (a) 70 known membrane protein structures in the testing dataset, (b) TMHs predicted by TMHMM [6], (c) TMHs predicted by Phobius [7], and (d) TMHs predicted by MemBrain.

The improvement came from a combination of the steps used in our protocol shown in Figure 1. First, the PSSM representation contains sequence evolution information, which provides more complete sampling for statistical prediction methods. The advantage of a pure statistical approach over hydrophobicity-based prediction methods is that the prediction outcome does not depend on our interpretation of amino acid sequence in TMH formation, which could introduce bias. Second, the OET-KNN algorithm is a powerful classification method that can combine many different evidences and deal with the uncertainty to reach the optimal decision. Third, the fusion of two prediction window sizes provides more flexibility in accounting for length variation of TMHs, and thus reduces the bias towards a fixed TMH length introduced by using only one window size (as treated in all the previous predictors). Finally, assignment of TMHs using the dynamic threshold method further refines the prediction by detecting short loops and turns that separate TMHs. A somewhat unsatisfying aspect of the TMH-only prediction is the complete absence of amphipathic, extramembrane helices that are common in helical membrane protein structures. In both the training and testing datasets, the TMH sequences are defined to end when the transmembrane regions end. However, according to many high resolution structures, a considerable portion of transmembrane helices extend well beyond the lipid bilayer and become hydrophilic. Therefore, TMH predictors cannot predict the extramembrane portions of helices. Our future direction is to develop methods to predict both transmembrane and extramembrane helical segments in helical polytopic membrane proteins.

Methods

Construction of query and training vectors

The PSSM matrix of a protein P of N residues, which contains sequence evolution information, is defined aswhere a denotes the probability of residue i of the protein being changed to amino acid type j as determined from multiple sequence alignments [15]. The matrix elements in Eq. 4 were generated using the PSI-BLAST [15], which searches the SWISS-PROT database (version 52.0 released on 6-March-2007) against the sequence of the protein. For prediction studies, a residue at position i of the protein can be represented by a query vector, , composed of the PSSM matrix elements of the query protein corresponding to a sequence segment of length L centered on i, e.g.,where L is an odd number. Eq. 5 is also used to construct training vectors, , from their corresponding PSSM matrices of proteins in the training dataset.

Calculation of TMH propensity

Consider the problem of predicting the propensity of residue i of the query protein belonging to a structural pattern, denoted by φ, whereWe represent the residue by a query vector (see Eq. 5 above), constructed for prediction window size L. The knowledge basis used for the prediction is given by the training dataset, T, e.g.,where vectors were also constructed as in Eq. 5 for window size L, and their corresponding patterns φ's are known. Let SK be a set of vectors consisting of K in T that have the shortest Euclidean distances to , referred to here as the K nearest neighbors of . For any , the knowledge that has a pattern φ is a piece of evidence which increases our belief that also has the pattern φ. This evidence is quantified, as in refs. [24], [25], by an evidence function where is the Euclidean distance between and , and the parameter is associated with a particular pattern φ; the delta function in Eq. 8 isIn OET-KNN, is optimized by maximizing the prediction accuracy of every sample in T. Using the detailed optimization protocol described in ref. [14], we found the following values of : , , , and . Combining the knowledge of the K nearest neighbors in SK, the evidence of belonging to the pattern φ isThe final evidences are then normalized as into satisfy the normalization condition . Finally, after merging the prediction results obtained using two different window sizes, L = 13 and 15, the propensity of residue i belonging to TMH is

Dynamic threshold segmentation

To assign TMH fragments based on the propensity profile, we used a dynamic threshold segmentation approach. First, residues with propensity greater than or equal to 0.4 were considered as TMH. The base threshold, λ = 0.4, was selected by optimizing the self-consistency test performance as was done in refs. [11], [17], [20]. A TMH is initially assigned when λ intersects the propensity profile at two consecutive points. For example, given λ = 0.4, the N-terminal residue of a TMH is residue n0 if En0−1<λ and En0>λ. Moving along the sequence, the next encounter of Ec0>λ and Ec0+1<λ defines the C-terminal residue of the TMH to be residue c0. Hence, the initial assignment of TMH is from residues n0 to c0. The value of λ was then increased by increment of 0.05 until λ intersects the profile within the initial TMH at four points. In this case, the original TMH was split into two TMH segments. The first TMH is from residues n0 to c1, where Ec1>λ and Ec1+1<λ, and the second TMH is from residues n1 to c0, where En1−1<λ and En1>λ. A TMH shorter than 5 residues was not segmented out and remained as a part of the original TMH. Figure 3 shows an example of dynamic threshold assignment of TMHs in the protein lactose permease of Escherichia coli (PDB code: 1PV7) [26]. Note that the short loops between the 3rd and 4th TMHs, and between the 9th and 10th TMHs were successfully detected using this method.

Figure 3

The residue-specific TMH propensity of lactose permease of Escherichia coli (PDB code: 1PV7) [26], illustrating the method of assignment of TMHs by dynamic threshold segmentation.

The observed TMHs, assigned in ref. [26], are shown as the gray boxes.

The residue-specific TMH propensity of lactose permease of Escherichia coli (PDB code: 1PV7) [26], illustrating the method of assignment of TMHs by dynamic threshold segmentation.

The observed TMHs, assigned in ref. [26], are shown as the gray boxes. All algorithms used in MemBrain were implemented in the C programming language and executed in the Linux operating system. (0.02 MB DOC) Click here for additional data file. (0.02 MB DOC) Click here for additional data file. (0.05 MB DOC) Click here for additional data file. (0.10 MB DOC) Click here for additional data file.

23 in total

Review 1. Improving the accuracy of PSI-BLAST protein database searches with composition-based statistics and other refinements.

Authors: A A Schäffer; L Aravind; T L Madden; S Shavirin; J L Spouge; Y I Wolf; E V Koonin; S F Altschul
Journal: Nucleic Acids Res Date: 2001-07-15 Impact factor: 16.971

2. TM or not TM: transmembrane protein prediction with low false positive rate using DAS-TMfilter.

Authors: Miklos Cserzo; Frank Eisenhaber; Birgit Eisenhaber; Istvan Simon
Journal: Bioinformatics Date: 2004-01-01 Impact factor: 6.937

3. Predicting the topology of transmembrane helical proteins using mean burial propensity and a hidden-Markov-model-based method.

Authors: Hongyi Zhou; Yaoqi Zhou
Journal: Protein Sci Date: 2003-07 Impact factor: 6.725

4. Structure and mechanism of the lactose permease of Escherichia coli.

Authors: Jeff Abramson; Irina Smirnova; Vladimir Kasho; Gillian Verner; H Ronald Kaback; So Iwata
Journal: Science Date: 2003-08-01 Impact factor: 47.728

5. Snorkeling preferences foster an amino acid composition bias in transmembrane helices.

Authors: Aaron K Chamberlain; Yohan Lee; Sanguk Kim; James U Bowie
Journal: J Mol Biol Date: 2004-05-28 Impact factor: 5.469

6. The progress of membrane protein structure determination.

Authors: Stephen H White
Journal: Protein Sci Date: 2004-07 Impact factor: 6.725

7. A combined transmembrane topology and signal peptide prediction method.

Authors: Lukas Käll; Anders Krogh; Erik L L Sonnhammer
Journal: J Mol Biol Date: 2004-05-14 Impact factor: 5.469

8. SVMtm: support vector machines to predict transmembrane segments.

Authors: Zheng Yuan; John S Mattick; Rohan D Teasdale
Journal: J Comput Chem Date: 2004-04-15 Impact factor: 3.376

9. Signal-3L: A 3-layer approach for predicting signal peptides.

Authors: Hong-Bin Shen; Kuo-Chen Chou
Journal: Biochem Biophys Res Commun Date: 2007-08-31 Impact factor: 3.575

10. Structure of a glycerol-conducting channel and the basis for its selectivity.

Authors: D Fu; A Libson; L J Miercke; C Weitzman; P Nollert; J Krucinski; R M Stroud
Journal: Science Date: 2000-10-20 Impact factor: 47.728

38 in total

1. Acyl chain specificity of ceramide synthases is determined within a region of 150 residues in the Tram-Lag-CLN8 (TLC) domain.

Authors: Rotem Tidhar; Shifra Ben-Dor; Elaine Wang; Samuel Kelly; Alfred H Merrill; Anthony H Futerman
Journal: J Biol Chem Date: 2011-12-05 Impact factor: 5.157

2. ANOs 3-7 in the anoctamin/Tmem16 Cl- channel family are intracellular proteins.

Authors: Charity Duran; Zhiqiang Qu; Adeboye O Osunkoya; Yuanyuan Cui; H Criss Hartzell
Journal: Am J Physiol Cell Physiol Date: 2011-11-09 Impact factor: 4.249

3. High-accuracy prediction of transmembrane inter-helix contacts and application to GPCR 3D structure modeling.

Authors: Jing Yang; Richard Jang; Yang Zhang; Hong-Bin Shen
Journal: Bioinformatics Date: 2013-08-14 Impact factor: 6.937

4. Genetic analysis of MraY inhibition by the phiX174 protein E.

Authors: Yi Zheng; Douglas K Struck; Thomas G Bernhardt; Ry Young
Journal: Genetics Date: 2008-09-14 Impact factor: 4.562

Review 5. In silico studies on DARC.

Authors: Alexandre G de Brevern; Ludovic Autin; Yves Colin; Olivier Bertrand; Catherine Etchebest
Journal: Infect Disord Drug Targets Date: 2009-06

6. A transmembrane domain and GxxxG motifs within L2 are essential for papillomavirus infection.

Authors: Matthew P Bronnimann; Janice A Chapman; Chad K Park; Samuel K Campos
Journal: J Virol Date: 2012-10-24 Impact factor: 5.103

7. Architectural organization of the metabolic regulatory enzyme ghrelin O-acyltransferase.

Authors: Martin S Taylor; Travis R Ruch; Po-Yuan Hsiao; Yousang Hwang; Pingfeng Zhang; Lixin Dai; Cheng Ran Lisa Huang; Christopher E Berndsen; Min-Sik Kim; Akhilesh Pandey; Cynthia Wolberger; Ronen Marmorstein; Carolyn Machamer; Jef D Boeke; Philip A Cole
Journal: J Biol Chem Date: 2013-09-17 Impact factor: 5.157

8. Membrane topology of hedgehog acyltransferase.

Authors: Armine Matevossian; Marilyn D Resh
Journal: J Biol Chem Date: 2014-12-08 Impact factor: 5.157

9. qFit-ligand Reveals Widespread Conformational Heterogeneity of Drug-Like Molecules in X-Ray Electron Density Maps.

Authors: Gydo C P van Zundert; Brandi M Hudson; Saulo H P de Oliveira; Daniel A Keedy; Rasmus Fonseca; Amelie Heliou; Pooja Suresh; Kenneth Borrelli; Tyler Day; James S Fraser; Henry van den Bedem
Journal: J Med Chem Date: 2018-12-06 Impact factor: 7.446

10. Transmembrane topology and signal peptide prediction using dynamic bayesian networks.

Authors: Sheila M Reynolds; Lukas Käll; Michael E Riffle; Jeff A Bilmes; William Stafford Noble
Journal: PLoS Comput Biol Date: 2008-11-07 Impact factor: 4.475