| Literature DB >> 30988258 |
Bart van Beusekom1, Natasja Wezel1, Maarten L Hekkelman1, Anastassis Perrakis1, Paul Emsley2, Robbie P Joosten1.
Abstract
N-Glycosylation is one of the most common post-translational modifications and is implicated in, for example, protein folding and interaction with ligands and receptors. N-Glycosylation trees are complex structures of linked carbohydrate residues attached to asparagine residues. While carbohydrates are typically modeled in protein structures, they are often incomplete or have the wrong chemistry. Here, new tools are presented to automatically rebuild existing glycosylation trees, to extend them where possible, and to add new glycosylation trees if they are missing from the model. The method has been incorporated in the PDB-REDO pipeline and has been applied to build or rebuild 16 452 carbohydrate residues in 11 651 glycosylation trees in 4498 structure models, and is also available from the PDB-REDO web server. With better modeling of N-glycosylation, the biological function of this important modification can be better and more easily understood. open access.Entities:
Keywords: Coot; N-glycans; PDB-REDO; carbohydrates; crystallography; model building; validation
Mesh:
Substances:
Year: 2019 PMID: 30988258 PMCID: PMC6465985 DOI: 10.1107/S2059798319003875
Source DB: PubMed Journal: Acta Crystallogr D Struct Biol ISSN: 2059-7983 Impact factor: 7.652
Figure 1The percentage of PDB entries, per year of deposition, that contain carbohydrates or are glycosylated for the past 20 years. The percentage of carbohydrate-containing entries has been growing steadily over the last ten years.
Number of carbohydrate residues built in PDB-REDO entries available as of 31 October 2018
| Process | No. of residues built | No. of entries modified |
|---|---|---|
| Rebuilding | 6364 | 1961 |
| Tree extension | 4031 | 1721 |
| Whole-tree addition | 6057 | 2372 |
|
|
|
|
This also includes carbohydrate residues that were built when a rebuilt tree could be further extended.
Figure 2(a) The number of carbohydrate residues built by Carbivore by year of deposition in total and per 100 sequons (with sequence motif Asn-X-S/T). (b) The number of carbohydrate residues per glycosylation tree in the PDB versus the PDB-REDO databank. The length of most glycosylation trees remains unchanged, a considerable number of trees become longer and only a small portion of trees are shortened. The number of asparagines that are glycosylated in neither the PDB nor the PDB-REDO databank is not determined because it is not relevant here. (c) The distribution of the RSCC for each carbohydrate residue that was built in by Carbivore in current PDB-REDO entries. The sharp increases at 0.60 and 0.70 are caused by the RSCC filters (see Section 2).
Figure 3Carbohydrate remodeling: a comparison between PDB (left) and PDB-REDO (right). Top: new glycosylation-tree modeling at AsnA24 in PDB entry 2aaa (Boel et al., 1990 ▸). Clear difference density is visible at this asparagine, which follows the glycosylation sequence motif. After flipping the side chain of AsnA24, four carbohydrate residues can be built at this position; there is also partial density for a fifth mannose, but this was not built. Middle: glycosylation-tree rebuilding at AsnA529 in PDB entry 3d12 (Xu et al., 2008 ▸). The seven carbohydrate moieties in the PDB entry (and indicated in the figure) are carbohydrate residues that are not commonly found in N-glycosylation, which can now be replaced automatically with the correct residues. It may be possible that the wrong residue names have arisen as an unwanted side effect from PDB remediation efforts (Henrick et al., 2008 ▸). The residues for which the abbreviations have not been defined (LXZ, NGA and GL0) are similar to NAG, NAG and GAL, respectively, but with one or more inverted chiral centers. Bottom: glycosylation-tree extension at AsnC81 in PDB entry 6g46 (Hussein et al., 2018 ▸). Three residues could be added at this position, which was enabled partly because of improved refinement in PDB-REDO (R free decreased from 23.1% to 21.5%). Ten water molecules were deleted. In all cases, amino acids are shown in blue and carbohydrate residues in gold. For sake of clarity, the 2mF o − DF c map is contoured at 1.2σ (top), 1.5σ (middle) and 1.0σ (bottom). The mF o − DF c map is shown at 3.0σ in all cases. CCP4mg (McNicholas et al., 2011 ▸) was used to generate this figure.
Figure 4Glycosylation tree at AsnG165 in PDB entry 1mql. The LINK records between the different carbohydrate residues are missing, causing the carbohydrate residues to be pushed apart owing to van der Waals restraints. This is exacerbated by the leaving ‘O1’ atoms that were not removed when the carbohydrate tree was built. The 2mF o − DF c map and the mF o − DF c map are contoured at 1.5σ and 3.0σ, respectively. CCP4mg (McNicholas et al., 2011 ▸) was used to generate this figure.