| Literature DB >> 36048148 |
Eugene Krissinel1, Andrey A Lebedev1, Ville Uski1, Charles B Ballard1, Ronan M Keegan1, Oleg Kovalevskiy1, Robert A Nicholls2, Navraj S Pannu3, Pavol Skubák3, John Berrisford4, Maria Fando5, Bernhard Lohkamp6, Marcin Wojdyr7, Adam J Simpkin8, Jens M H Thomas8, Christopher Oliver9, Clemens Vonrhein7, Grzegorz Chojnowski10, Arnaud Basle11, Andrew Purkiss12, Michail N Isupov13, Stuart McNicholas14, Edward Lowe15, Josep Triviño16, Kevin Cowtan14, Jon Agirre14, Daniel J Rigden8, Isabel Uson16, Victor Lamzin10, Ivo Tews17, Gerard Bricogne7, Andrew G W Leslie2, David G Brown18.
Abstract
Nowadays, progress in the determination of three-dimensional macromolecular structures from diffraction images is achieved partly at the cost of increasing data volumes. This is due to the deployment of modern high-speed, high-resolution detectors, the increased complexity and variety of crystallographic software, the use of extensive databases and high-performance computing. This limits what can be accomplished with personal, offline, computing equipment in terms of both productivity and maintainability. There is also an issue of long-term data maintenance and availability of structure-solution projects as the links between experimental observations and the final results deposited in the PDB. In this article, CCP4 Cloud, a new front-end of the CCP4 software suite, is presented which mitigates these effects by providing an online, cloud-based environment for crystallographic computation. CCP4 Cloud was developed for the efficient delivery of computing power, database services and seamless integration with web resources. It provides a rich graphical user interface that allows project sharing and long-term storage for structure-solution projects, and can be linked to data-producing facilities. The system is distributed with the CCP4 software suite version 7.1 and higher, and an online publicly available instance of CCP4 Cloud is provided by CCP4. open access.Entities:
Keywords: CCP4; CCP4 Cloud; data management; distributed computing; macromolecular crystallography; project management
Mesh:
Substances:
Year: 2022 PMID: 36048148 PMCID: PMC9435598 DOI: 10.1107/S2059798322007987
Source DB: PubMed Journal: Acta Crystallogr D Struct Biol ISSN: 2059-7983 Impact factor: 5.699
Figure 1Examples of CCP4 Cloud configurations. FE, Front End; NC, Number Cruncher; UI, User Interface with optional CC (client-side NC); Data, access to collected X-ray diffraction data sets and other data used for MX structure determination. Arrows denote http(s) connections. (a) Single-host configuration, suitable for an individual working without an internet connection. (b) A multi-user setup using a central host machine, suitable for small to medium-sized laboratories. (c) Fully distributed, multi-component setup with single point of access, suitable for large facilities and research centres; it can optionally allow data acquisition from external sources. (d) Fully distributed setup with multiple access points. Configurations (a) and (c) work out of the box using the CCP4 software suite installed on the user’s machine [using the CCP4 Cloud setup at RAL Harwell in case (c)]. All configurations may be tailored to specific conditions and requirements and can be built on top of the CCP4 Cloud components that are included in the CCP4 software suite. See the discussion in the text.
Data objects used in CCP4 Cloud
See the discussion in Section 3.
| Data object | ID | File equivalent |
|---|---|---|
| Unmerged reflection data | UNMERGED | MTZ or HKL file with unmerged reflection data |
| Merged reflection data | HKL | Relevant columns of MTZ file with merged reflection data ( |
| Macromolecular sequence | SEQUENCE | Sequence files in FASTA or PIR format |
| Macromolecular coordinates | XYZ | PDB or mmCIF coordinate file |
| Phases | PHASES | Relevant columns of MTZ file ( |
| Ligand description | LIGAND | CIF file with ligand description |
| Ligand library | LIB | CIF or LIB file with multiple ligand descriptions |
| MR model | MODEL | Combination of sequence and single-model coordinate files |
| MR ensemble | ENSEMBLE | Combination of sequence and multi-model coordinate files |
| Structure | STRUCTURE | Combination of related XYZ and PHASES objects |
| Structure revision | REVISION | Combination of HKL, SEQUENCE and STRUCTURE objects |
Used for automatic naming of data objects in CCP4 Cloud.
Figure 2Naming conventions for data objects (a) and structure revisions (b) in CCP4 Cloud. All fields are generated automatically, and only ‘name’ can be overridden by the user. Suffixation is used when a job produces multiple outputs, for example for the original and inverted hands in experimental phasing.
Figure 3Schematic of CCP4 Cloud task interfaces. (a) input panel, (b) output panel (Job report).
Figure 4Screenshot of a CCP4 Cloud project. An indication of the principal structure solution stages is displayed on the left. See the discussion in the text.