Premium
CIF Applications. VIII. pdb2cif : translating PDB entries into mmCIF format
Author(s) -
Bernstein H. J.,
Bernstein F. C.,
Bourne P. E.
Publication year - 1998
Publication title -
journal of applied crystallography
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.429
H-Index - 162
ISSN - 1600-5767
DOI - 10.1107/s0021889897007309
Subject(s) - protein data bank (rcsb pdb) , computer science , computer graphics (images) , programming language , crystallography , information retrieval , chemistry , stereochemistry
pdb2cif is a new version of an awk script originally written by P. E. Bourne in 1993 to translate from the 1992 Protein Data Bank (PDB) format to the then‐emerging macromolecular Crystallographic Information File (mmCIF) definition. This new version of pdb2cif translates from all current PDB formats, including the 1992 PDB format and the 1996 PDB Atomic Coordinate Entry Format, Version 2.0, to the 1997 mmCIF format as defined in the mmCIF dictionary 1.0.00. The program is provided as an m4 script from which both perl and awk versions can be produced. The program identifies mmCIF entities implicitly by sequence homology among PDB SEQRES records. With minor additions to the dictionary, the resultant mmCIF data‐sets are substantially compliant with the mmCIF 1.0.00 dictionary.