
pKNOT: the protein KNOT web server.

Yan-Long Lai, Shih-Chung Yen, Sung-Huan Yu, Jenn-Kang Hwang.

Abstract

Knotted proteins have been observed more commonly in recent years owing to the rapidly growing number of structures in the Protein Data Bank (PDB). Studies show that the knot regions contribute to both ligand binding and enzyme activity in proteins such as the chromophore-binding domain of phytochrome, ketol-acid reductoisomerase or SpoU methyltransferase. However, there are still many misidentified knots published in the literature because no convenient web tool has been available to general biologists. Here, we present the first web server that detects knots in proteins and provides information on knotted proteins in the PDB: the protein KNOT (pKNOT) web server. In pKNOT, users can either input a PDB ID or upload protein coordinates in the PDB format. The pKNOT web server detects the knots in the protein using Taylor's smoothing algorithm. All the detected knots can be visually inspected using a Java-based 3D graphics viewer. We believe that the pKNOT web server will be useful to both biologists in general and structural biologists in particular.

Year:  2007        PMID: 17526524      PMCID: PMC1933195          DOI: 10.1093/nar/gkm304

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


INTRODUCTION

Knotted proteins have become more common in recent years (1–14) owing to the rapidly growing number of structures deposited in the Protein Data Bank (PDB). The knots in proteins are more than just topological novelties. The knotted regions have been shown to be important in both ligand binding and enzyme activity. For example, the unique knot topology in bacterial phytochrome (6) is common to all red/far-red photochromic phytochromes and is important in stabilizing the chromophore-binding region. The knot regions in TrmD tRNA methyltransferase (MTase) have been shown to be important for S-adenosyl-L-methionine (AdoMet) binding and catalytic activity (3). The deep trefoil knot region in N-acetylornithine transcarbamylase forms part of the active site (10). The figure-eight knot in the mainly α-helical domain of ketol–acid reductoisomerase (KARI) forms most of the keto–acid substrate-binding site (11). In addition, knots in proteins present a challenge in the study of protein folding, for it is hard to imagine a peptide chain threading through a hoop to form a knot in a reproducible way (15). Interestingly, a recent study (16) showed that YibK (4), a SpoU MTase containing a deep trefoil knot, is able to fold efficiently and behaves remarkably similarly to other proteins.
Though the identification of a general knot is a topologically difficult problem, it is relatively easy to identify knots in proteins. However, there are still many cases of misidentified knots in proteins (17,18) because no convenient tool has been available to general biologists. Misidentification may be caused by the presence of mobile loops, missing residues or simply visual error in tracing out the entangled protein chains. For example, the SET domain was originally identified to have a knot, but later it was pointed out that part of the loop relevant to the formation of the knot is in fact connected through hydrogen bonds (17). As a result, the knot in the SET domain turns out not to be an authentic one. Other examples of misidentified knots are the trefoil knot in clathrin D6 coat protein (19), the left-handed trefoil knot in ubiquitin hydrolase (15) and the figure-eight knot in histone K79 methyltransferase (19). These knots are in fact caused by breaks in the chain and are therefore not authentic knots. A more recent example is the misidentified trefoil knot in the chromophore-binding domain of phytochrome (6), which in fact contains a figure-eight knot.

METHOD AND IMPLEMENTATION

The pKNOT web server detects the knot in a protein by smoothing the protein chain using Taylor's algorithm (15). The algorithm first fixes both the N and C termini in space, then repeatedly smooths and straightens the protein chain. The chain is reduced in such a way that, with the details of the chain eliminated, the knot can be easily detected. If the protein does not contain a knot, the chain will simply shrink into a straight line. Taylor's algorithm formally goes as follows. Let the protein chain of length $N$ be described by $(r_1, r_2, \ldots, r_N)$, where $r_i$ is the coordinate of the $i$-th Cα atom. A new coordinate is taken to be $r'_i = (r_{i-1} + r_i + r_{i+1})/3$, where $2 \le i \le N-1$. The termini remain fixed, i.e. $r'_1 = r_1$ and $r'_N = r_N$. The procedure is iterated to smooth the chain progressively. The key constraint is that chain segments must not pass through each other. This is enforced by checking that the triangles defined by $\{r'_{i-1}, r_i, r'_i\}$ and $\{r_i, r'_i, r_{i+1}\}$ do not intersect any line segments defined by $\{r'_{j-1}, r'_j\}$ for $j < i$ and $\{r_j, r_{j+1}\}$ for $j > i$. In practice, most protein chains reduce either to a straight line defined by the two termini or to an obvious knot in fewer than 50 iterations. However, some cases take 500 or more iterations to converge. Figure 1 shows a typical example of the chain-smoothing procedure, from the original structure of the chromophore-binding domain of bacterial phytochrome (1ZTU) to the final smoothed chain, which can be easily identified as containing a figure-eight knot.
Figure 1.

An example of the chain smoothing process. The protein is the chromophore-binding domain of bacterial phytochrome (1ZTU). The X-ray structure of 1ZTU (left) is shown in the cartoon representation and two progressively smoothed chains are shown (center and right). The color is ramped by residue from blue at the N-terminus (labeled N) to red at the C-terminus (labeled C). The crossover points are numbered sequentially from the N-terminus. The figure-eight knot is characterized by four crossover points, alternately under and over. The structural pictures were produced using PyMOL (DeLano Scientific, San Carlos, http://pymol.sourceforge.net/).
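The smoothing procedure described above can be sketched in a few lines of Python. This is a minimal illustration and not the pKNOT implementation: the function names `segment_hits_triangle` and `smooth_chain`, the convergence tolerance `tol`, and the choice to update coordinates in place (so that residues before i are already at their new positions when residue i is moved) are all assumptions. The triangle test is a standard Möller–Trumbore segment/triangle intersection; the 0.5 Å collision threshold offered by the server is not modeled here.

```python
import math

def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def segment_hits_triangle(p, q, a, b, c, eps=1e-12):
    """True if the segment p-q crosses the triangle a-b-c (Moller-Trumbore)."""
    d = _sub(q, p)
    e1, e2 = _sub(b, a), _sub(c, a)
    h = _cross(d, e2)
    det = _dot(e1, h)
    if abs(det) < eps:  # parallel segment or degenerate triangle: no crossing
        return False
    s = _sub(p, a)
    u = _dot(s, h) / det
    if u < 0.0 or u > 1.0:
        return False
    qv = _cross(s, e1)
    v = _dot(d, qv) / det
    if v < 0.0 or u + v > 1.0:
        return False
    t = _dot(e2, qv) / det
    return 0.0 <= t <= 1.0  # crossing point must lie within the segment

def smooth_chain(coords, max_iter=500, tol=1e-6):
    """Smooth a C-alpha trace with fixed termini; blocked moves are skipped."""
    r = [tuple(float(x) for x in point) for point in coords]
    n = len(r)
    for _ in range(max_iter):
        largest_move = 0.0
        for i in range(1, n - 1):
            # Average of the residue and its two neighbours.
            new = tuple((r[i - 1][k] + r[i][k] + r[i + 1][k]) / 3.0 for k in range(3))
            blocked = False
            for j in range(n - 1):
                if i - 2 <= j <= i + 1:  # segments that share a vertex with the triangles
                    continue
                if (segment_hits_triangle(r[j], r[j + 1], r[i - 1], r[i], new)
                        or segment_hits_triangle(r[j], r[j + 1], r[i], new, r[i + 1])):
                    blocked = True
                    break
            if not blocked:
                largest_move = max(largest_move, math.dist(new, r[i]))
                r[i] = new
        if largest_move < tol:  # nothing moved appreciably: converged
            break
    return r
```

With this sketch, an unknotted trace collapses onto the straight line between its termini, whereas a knotted trace stops moving once every further move would force one segment through another.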


Data set and pre-computed knots

To speed up the web server, we pre-computed the knots for all proteins in the PDB as of January 12, 2007: 41 013 proteins comprising 34 971 X-ray structures and 6042 NMR structures. The crystal structures of homologous protein chains (even those with identical sequences), as well as the solution structures of the same protein, were checked for the presence of knots. Chains with breaks or discontinuities were visually checked for their relevance to knot formation. If a protein had a gap so large that it would be improper to simply connect the two ends of the missing fragment to complete the chain, the identified knot was disregarded. All final smoothed chains that appeared to form a knot, i.e. not a simple straight line, were visually examined to decide whether the knots were authentic knots, slipknots or artificial knots caused by large breaks in the chains. The knots in proteins are simple enough to be identified visually, and no sophisticated analysis [such as the Jones polynomial or others (20)] is required. In summary, pKNOT provides information about all knotted proteins, such as their protein classes, their knot types and the cores and depths of the knotted regions. The core is the smallest region that remains knotted when residues are successively deleted from both ends (15), and the depth is the product of the numbers of residues that must be deleted from each end in order to free the knot (15). Users can also upload protein structure coordinates in the PDB format; the pKNOT server will then progressively smooth the chain on the fly and present the final smoothed chain, together with the original chain, in the Java-based 3D graphics viewer AstexViewer (21) for inspection.
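As a small worked illustration of the depth definition above, the sketch below computes the product of the two tail lengths once the knot core is known. The function name `knot_depth` and the example numbers are hypothetical, and the sketch assumes that the number of residues to delete from each end is simply the number of residues flanking the core on that side; it does not locate the core itself.

```python
def knot_depth(chain_length, core_start, core_end):
    """Depth as the product of the residue counts flanking the knot core.

    Residues are numbered 1..chain_length; the core spans
    core_start..core_end inclusive (assumed already determined).
    """
    if not (1 <= core_start <= core_end <= chain_length):
        raise ValueError("core must lie within the chain")
    n_tail = core_start - 1           # residues before the core
    c_tail = chain_length - core_end  # residues after the core
    return n_tail * c_tail


# Hypothetical chain of 160 residues with a core spanning residues 70-120.
print(knot_depth(160, 70, 120))  # 69 * 40 = 2760
```

A knot whose core reaches either terminus has depth zero under this measure, which matches the intuition that such a knot is freed by removing almost nothing from that end.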

Input format

The web page of the pKNOT web server is shown in Figure 2. Users can either type in a PDB ID or upload a structure file in the PDB format. In the latter case, the default number of iterations is 500 and the default collision threshold is 0.5 Å. The user can choose to either ignore or preserve the breaks in the chain during smoothing. The former option closes each break with the shortest line segment connecting its two ends, while the latter option preserves the breaks and smooths each segment individually, keeping the endpoints of each segment fixed. The default is to ignore the breaks in the chain. Users can also choose the number of smoothing iterations from a pull-down menu. The collision threshold is the distance threshold used to determine whether a line segment intersects a triangle during the smoothing procedure.
Figure 2.

The web page of the pKNOT web server. The users can either type in the PDB ID or upload a structural file in the PDB format. When submitting the structural file, the user can choose to either ignore or preserve the breaks in the chain. The default iteration number is set to 500 and the collision threshold, to 0.5 Å.
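The two break-handling options can be illustrated with a short sketch that reads the Cα trace of one chain from PDB-format text and splits it wherever the chain is discontinuous. This is an assumption-laden illustration rather than the server's code: the function names, the rule for calling a break (non-consecutive residue numbers or a Cα–Cα distance above a hypothetical 4.2 Å cutoff), and the handling of alternate locations are all choices made for this example, and insertion codes are ignored for simplicity.

```python
import math

def read_ca_trace(pdb_text, chain_id):
    """Return [(residue_number, (x, y, z)), ...] for the C-alpha atoms of one chain."""
    trace = []
    for line in pdb_text.splitlines():
        if line.startswith("ENDMDL"):  # keep only the first model of an NMR entry
            break
        if not line.startswith("ATOM") or len(line) < 54:
            continue
        if line[12:16].strip() != "CA" or line[21] != chain_id:
            continue
        if line[16] not in (" ", "A"):  # skip secondary alternate locations
            continue
        res_seq = int(line[22:26])
        xyz = (float(line[30:38]), float(line[38:46]), float(line[46:54]))
        trace.append((res_seq, xyz))
    return trace

def split_at_breaks(trace, max_ca_distance=4.2):
    """'Preserve breaks': split the trace into continuous segments."""
    segments = []
    for res_seq, xyz in trace:
        if segments:
            prev_seq, prev_xyz = segments[-1][-1]
            if res_seq - prev_seq == 1 and math.dist(xyz, prev_xyz) <= max_ca_distance:
                segments[-1].append((res_seq, xyz))
                continue
        segments.append([(res_seq, xyz)])
    return segments

def gap_lengths(segments):
    """Length of the straight line that would close each break ('ignore breaks')."""
    return [math.dist(a[-1][1], b[0][1]) for a, b in zip(segments, segments[1:])]
```

Under the "ignore" option the full trace is smoothed as a single chain, so each break is implicitly bridged by a straight segment whose length is what `gap_lengths` reports; under the "preserve" option each element of `split_at_breaks` would be smoothed separately with its own endpoints fixed. Reporting the gap lengths also makes it easy to spot bridges that are too long to be trusted.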


Output format and visualization of chains and knots

Upon query, pKNOT returns a table of the CHAIN, LENGTH, KNOT TYPE and DISPLAY STRUCTURE (Figure 3). Clicking on an entry in the KNOT TYPE column returns a list of all the proteins of the given knot type. pKNOT also provides the molecular viewer AstexViewer so that users can visualize and manipulate the protein structure and its knot in real time. Both the original structure and the knot are shown in the same graphics window, and the user can toggle either one on or off for easy inspection.
Figure 3.

Upon query, the pKNOT server returns a table of the CHAIN, LENGTH, KNOT TYPE, CORE, DEPTH and DISPLAY STRUCTURE (upper center). Clicking on an entry in the KNOT TYPE column returns a list of all the proteins of the given knot type in the pKNOT database (lower left). pKNOT also provides the Java-based 3D molecular viewer AstexViewer, with which users can visualize and manipulate the protein structure and its knot in real time (lower right).


RESULTS

The knotted proteins come from the following protein classes: (1) methyltransferase, (2) transcarbamylase, (3) carbonic anhydrase, (4) ketol–acid reductoisomerase, (5) ubiquitin hydrolase, (6) methionine adenosyltransferase, (7) the chromophore-binding domain of bacterial phytochrome and (8) the inner core shell component protein of bluetongue virus. In addition, we identified two knotted NMR structures: 1POQ and 1J2O. However, it is not clear whether these knots are authentic or due to incorrect structural refinement, since only one knotted model is identified among all NMR models for each protein (model 7 in 1POQ and model 14 in 1J2O).

The knot types in proteins

There are three types of knot (up to mirror image) identified in the PDB: the trefoil knot, the figure-eight knot and the knot with five crossings (15,19).

The trefoil knot

The trefoil knot (also called the threefoil or overhand knot) is the simplest knot of all and is characterized by three crossings. It is mathematically denoted as the $3_1$ knot. The proteins with a trefoil knot are (1) methyltransferase, (2) transcarbamylase, (3) methionine adenosyltransferase, (4) carbonic anhydrase and (5) YMPa superantigen (NMR).

The figure-eight knot

The figure-eight knot is characterized by four crossover points, alternately under and over. It is the only prime knot with four crossings and is denoted as the $4_1$ knot. The proteins with a $4_1$ knot are (1) the chromophore-binding domain of bacterial phytochrome, (2) the core protein of bluetongue virus, (3) ketol–acid reductoisomerase and (4) a LIM-ldbl-LID chimeric protein (NMR).

The $5_2$ knot

There are two types of knot with five crossings: the $5_1$ and $5_2$ knots. Only the $5_2$ knot has been identified in protein structures and, as of writing, no proteins with six or more crossings have been identified in the PDB. The only protein family with a $5_2$ knot is ubiquitin C-terminal hydrolase (1).

Comparison with other work

It is interesting to compare our results with those of the recent work by Lua and Grosberg (19). They identified 19 knotted proteins using the RANDOM method from the PDB-REPRDB data set (22), which comprises 4716 representative proteins. However, 5 of the 19 identified knotted proteins (1T0H:B, 1GKU:B, 1U2Z:C, 1M72:B and 1XI4:C) are questionable, since all of them have very large gaps in their structures due to missing residues. These knots arise either from the artificial virtual bonds that are used to connect the gaps or from the nonstandard PDB format. For example, 1T0H:B (23) has missing residues 414–424; a knot forms only if a virtual bond of length 32 Å connects the structural gap. Likewise, 1U2Z:C has missing residues 570–573 and 575, and the total distance of the structural gaps is around 52 Å. If these chain breaks were connected by virtual bonds, there would be a $4_1$ knot. However, we note that another chain in the complex (1U2Z:A), which has a sequence identical to that of 1U2Z:C, does not have a knot even if its structural gaps are connected by virtual bonds.

CONCLUSION

Here we have presented the first web server to detect knots in proteins. With an increasing number of proteins with knots deposited in PDB, we believe that the pKNOT web server will be useful to both biologists in general and structural biologists in particular.
  22 in total

1.  A deeply knotted protein structure and how it might fold.

Authors:  W R Taylor
Journal:  Nature       Date:  2000-08-24       Impact factor: 49.962

2.  Structure of the YibK methyltransferase from Haemophilus influenzae (HI0766): a cofactor bound at a site formed by a knot.

Authors:  Kap Lim; Hong Zhang; Aleksandra Tempczyk; Wojciech Krajewski; Nicklas Bonander; John Toedt; Andrew Howard; Edward Eisenstein; Osnat Herzberg
Journal:  Proteins       Date:  2003-04-01

3.  PDB-REPRDB: a database of representative protein chains from the Protein Data Bank (PDB) in 2003.

Authors:  Tamotsu Noguchi; Yutaka Akiyama
Journal:  Nucleic Acids Res       Date:  2003-01-01       Impact factor: 16.971

4.  AstexViewer: a visualisation aid for structure-based drug design.

Authors:  Michael J Hartshorn
Journal:  J Comput Aided Mol Des       Date:  2002-12       Impact factor: 3.686

5.  A knot or not a knot? SETting the record 'straight' on proteins.

Authors:  William R Taylor; Bing Xiao; Steven J Gamblin; Kuang Lin
Journal:  Comput Biol Chem       Date:  2003-02       Impact factor: 2.877

6.  The active site of the SET domain is constructed on a knot.

Authors:  Steven A Jacobs; Joel M Harp; Srikripa Devarakonda; Youngchang Kim; Fraydoon Rastinejad; Sepideh Khorasanizadeh
Journal:  Nat Struct Biol       Date:  2002-11

7.  Structure and function of the antibiotic resistance-mediating methyltransferase AviRb from Streptomyces viridochromogenes.

Authors:  Tanja G Mosbacher; Andreas Bechthold; Georg E Schulz
Journal:  J Mol Biol       Date:  2005-01-21       Impact factor: 5.469

8.  Crystal structure of tRNA(m1G37)methyltransferase: insights into tRNA recognition.

Authors:  Hyung Jun Ahn; Hyeon-Woo Kim; Hye-Jin Yoon; Byung Il Lee; Se Won Suh; Jin Kuk Yang
Journal:  EMBO J       Date:  2003-06-02       Impact factor: 11.598

9.  Crystal structure of class I acetohydroxy acid isomeroreductase from Pseudomonas aeruginosa.

Authors:  Hyung Jun Ahn; Su Jung Eom; Hye-Jin Yoon; Byung Il Lee; Hyeongjin Cho; Se Won Suh
Journal:  J Mol Biol       Date:  2003-04-25       Impact factor: 5.469

10.  Statistics of knots, geometry of conformations, and evolution of proteins.

Authors:  Rhonald C Lua; Alexander Y Grosberg
Journal:  PLoS Comput Biol       Date:  2006-05-19       Impact factor: 4.475

