Hamming distance geometry of a protein conformational space: Application to the clustering of a 4‐ns molecular dynamics trajectory of the HIV‐1 integrase catalytic core
Author(s) -
Laboulais Cyril,
Ouali Mohammed,
Le Bret Marc,
Gabarro‐Arpa Jacques
Publication year - 2002
Publication title -
Proteins: Structure, Function, and Bioinformatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 1.699
H-Index - 191
eISSN - 1097-0134
pISSN - 0887-3585
DOI - 10.1002/prot.10081
Subject(s) - hamming distance, hamming space, partition (number theory), sequence (biology), space (punctuation), hamming code, cluster analysis, combinatorics, mathematics, physics, algorithm, computer science, biology, genetics, block code, statistics, decoding methods, operating system
Protein structures can be encoded into binary sequences (Gabarro‐Arpa et al., Comput Chem 2000;24:693–698); these sequences are used to define a Hamming distance in conformational space: the distance between two molecular conformations is the number of bits that differ between their sequences. Each bit in the sequence arises from a partition of conformational space into two halves. Thus, the information encoded in the binary sequences is also used to characterize the regions of conformational space visited by the system. We apply this distance and its associated geometric structures to the clustering and analysis of conformations sampled during a 4‐ns molecular dynamics simulation of the HIV‐1 integrase catalytic core. The cluster analysis of the simulation shows a division of the trajectory into two qualitatively different segments, 2.6 and 1.4 ns long: the data indicate that equilibration is reached only at the end of the first segment. The Hamming distance is also compared with the r.m.s. deviation measure. The analysis of the cases studied so far shows that, under the same conditions, the two measures behave quite differently and that the Hamming distance appears to be more robust than the r.m.s. deviation. Proteins 2002;47:169–179. © 2002 Wiley‐Liss, Inc.
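The distance defined in the abstract, the number of bit positions at which two encoded conformations differ, can be illustrated with a minimal Python sketch. The bit strings below are invented toy data, not output of the encoding in the cited paper, and the function names `hamming_distance` and `pairwise_hamming` are hypothetical helpers introduced only for this illustration. The abstract does not specify how the bits are derived or which clustering algorithm was used, so neither is modeled here.

```python
from itertools import combinations


def hamming_distance(a: str, b: str) -> int:
    """Count the positions at which two equal-length bit strings differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))


def pairwise_hamming(seqs: list[str]) -> list[list[int]]:
    """Build the symmetric matrix of Hamming distances between all sequences."""
    n = len(seqs)
    matrix = [[0] * n for _ in range(n)]
    for i, j in combinations(range(n), 2):
        matrix[i][j] = matrix[j][i] = hamming_distance(seqs[i], seqs[j])
    return matrix


# Toy data (assumption): three trajectory frames, each already encoded as 8 bits.
frames = ["10110010", "10110110", "01001101"]
matrix = pairwise_hamming(frames)

for row in matrix:
    print(row)
# [0, 1, 8]
# [1, 0, 7]
# [8, 7, 0]
```

A distance matrix of this kind is the usual input to a clustering step; in this toy case the first two frames would group together, while the third lies far from both.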