| Literature DB >> 35199148 |
Jan Ludwiczak1, Aleksander Winski1, Stanislaw Dunin-Horkawicz1.
Abstract
MOTIVATION: The wealth of protein structures collected in the Protein Data Bank enabled large-scale studies of their function and evolution. Such studies, however, require the generation of customized data sets combining the structural data with miscellaneous accessory resources providing functional, taxonomic, and other annotations. Unfortunately, the functionality of currently available tools for the creation of such data sets is limited and their usage frequently requires laborious surveying of various data sources and resolving inconsistencies between their versions.Entities:
Year: 2022 PMID: 35199148 PMCID: PMC9048648 DOI: 10.1093/bioinformatics/btac121
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.931
Fig. 1.General overview of the features and functionalities of the localpdb package. At its core, localpdb syncs the raw PDB data and entries (in PDB and mmCIF formats) and makes them available to the user through the DataFrame objects. With the weekly releases of new data, the local files can be updated, however, the possibility to access the previous versions is retained through the tracking mechanism. The functionalities of the localpdb can be further extended with the configurable plugin system that allows to fetch and track the updates from the additional data sources. localpdb also provides access to the RCSB search API that can be used for complex queries based on multiple criteria. Finally, each version of the localpdb can be independently recreated on a different machine or by other users by exporting a small configuration file