Literature DB >> 34301175

PDBeCIF: an open-source mmCIF/CIF parsing and processing package.

Glen van Ginkel1, Lukáš Pravda1, José M Dana1, Mihaly Varadi1, Peter Keller2, Stephen Anyango1, Sameer Velankar3.   

Abstract

BACKGROUND: Biomacromolecular structural data outgrew the legacy Protein Data Bank (PDB) format which the scientific community relied on for decades, yet the use of its successor PDBx/Macromolecular Crystallographic Information File format (PDBx/mmCIF) is still not widespread. Perhaps one of the reasons is the availability of easy to use tools that only support the legacy format, but also the inherent difficulties of processing mmCIF files correctly, given the number of edge cases that make efficient parsing problematic. Nevertheless, to fully exploit macromolecular structure data and their associated annotations such as multiscale structures from integrative/hybrid methods or large macromolecular complexes determined using traditional methods, it is necessary to fully adopt the new format as soon as possible.
RESULTS: To this end, we developed PDBeCIF, an open-source Python project for manipulating mmCIF and CIF files. It is part of the official list of mmCIF parsers recorded by the wwPDB and is heavily employed in the processes of the Protein Data Bank in Europe. The package is freely available both from the PyPI repository ( http://pypi.org/project/pdbecif ) and from GitHub ( https://github.com/pdbeurope/pdbecif ) along with rich documentation and many ready-to-use examples.
CONCLUSIONS: PDBeCIF is an efficient and lightweight Python 2.6+/3+ package with no external dependencies. It can be readily integrated with 3rd party libraries as well as adopted for broad scientific analyses.
© 2021. The Author(s).

Entities:  

Keywords:  CCD; PDB; PDBx/mmCIF; Parser; Protein structure; Small molecule; Software

Year:  2021        PMID: 34301175     DOI: 10.1186/s12859-021-04271-9

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.169


  16 in total

1.  The HADDOCK web server for data-driven biomolecular docking.

Authors:  Sjoerd J de Vries; Marc van Dijk; Alexandre M J J Bonvin
Journal:  Nat Protoc       Date:  2010-04-15       Impact factor: 13.491

2.  PIIMS Server: A Web Server for Mutation Hotspot Scanning at the Protein-Protein Interface.

Authors:  Feng-Xu Wu; Jing-Fang Yang; Long-Can Mei; Fan Wang; Ge-Fei Hao; Guang-Fu Yang
Journal:  J Chem Inf Model       Date:  2021-01-05       Impact factor: 4.956

3.  SHIFTX2: significantly improved protein chemical shift prediction.

Authors:  Beomsoo Han; Yifeng Liu; Simon W Ginzinger; David S Wishart
Journal:  J Biomol NMR       Date:  2011-03-30       Impact factor: 2.835

4.  The brain of Mitsukurina owstoni.

Authors:  H Masai; Y Sato; M Aoki
Journal:  J Hirnforsch       Date:  1973

5.  The Protein Data Bank archive as an open data resource.

Authors:  Helen M Berman; Gerard J Kleywegt; Haruki Nakamura; John L Markley
Journal:  J Comput Aided Mol Des       Date:  2014-07-26       Impact factor: 3.686

6.  Dali server update.

Authors:  Liisa Holm; Laura M Laakso
Journal:  Nucleic Acids Res       Date:  2016-04-29       Impact factor: 16.971

7.  New tools and functions in data-out activities at Protein Data Bank Japan (PDBj).

Authors:  Akira R Kinjo; Gert-Jan Bekker; Hiroshi Wako; Shigeru Endo; Yuko Tsuchiya; Hiromu Sato; Hafumi Nishi; Kengo Kinoshita; Hirofumi Suzuki; Takeshi Kawabata; Masashi Yokochi; Takeshi Iwata; Naohiro Kobayashi; Toshimichi Fujiwara; Genji Kurisu; Haruki Nakamura
Journal:  Protein Sci       Date:  2017-09-18       Impact factor: 6.725

8.  PDBe: improved findability of macromolecular structure data in the PDB.

Authors:  David R Armstrong; John M Berrisford; Matthew J Conroy; Aleksandras Gutmanas; Stephen Anyango; Preeti Choudhary; Alice R Clark; Jose M Dana; Mandar Deshpande; Roisin Dunlop; Paul Gane; Romana Gáborová; Deepti Gupta; Pauline Haslam; Jaroslav Koča; Lora Mak; Saqib Mir; Abhik Mukhopadhyay; Nurul Nadzirin; Sreenath Nair; Typhaine Paysan-Lafosse; Lukas Pravda; David Sehnal; Osman Salih; Oliver Smart; James Tolchard; Mihaly Varadi; Radka Svobodova-Vařeková; Hossam Zaki; Gerard J Kleywegt; Sameer Velankar
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

9.  H++ 3.0: automating pK prediction and the preparation of biomolecular structures for atomistic molecular modeling and simulations.

Authors:  Ramu Anandakrishnan; Boris Aguilar; Alexey V Onufriev
Journal:  Nucleic Acids Res       Date:  2012-05-08       Impact factor: 16.971

10.  RCSB Protein Data Bank: powerful new tools for exploring 3D structures of biological macromolecules for basic and applied research and education in fundamental biology, biomedicine, biotechnology, bioengineering and energy sciences.

Authors:  Stephen K Burley; Charmi Bhikadiya; Chunxiao Bi; Sebastian Bittrich; Li Chen; Gregg V Crichlow; Cole H Christie; Kenneth Dalenberg; Luigi Di Costanzo; Jose M Duarte; Shuchismita Dutta; Zukang Feng; Sai Ganesan; David S Goodsell; Sutapa Ghosh; Rachel Kramer Green; Vladimir Guranović; Dmytro Guzenko; Brian P Hudson; Catherine L Lawson; Yuhe Liang; Robert Lowe; Harry Namkoong; Ezra Peisach; Irina Persikova; Chris Randle; Alexander Rose; Yana Rose; Andrej Sali; Joan Segura; Monica Sekharan; Chenghua Shao; Yi-Ping Tao; Maria Voigt; John D Westbrook; Jasmine Y Young; Christine Zardecki; Marina Zhuravleva
Journal:  Nucleic Acids Res       Date:  2021-01-08       Impact factor: 16.971

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.