Premium
A Similarity‐based Data‐fusion Approach to the Visual Characterization and Comparison of Compound Databases
Author(s) -
MedinaFranco José L.,
Maggiora Gerald M.,
Giulianotti Marc A.,
Pinilla Clemencia,
Houghten Richard A.
Publication year - 2007
Publication title -
chemical biology and drug design
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.59
H-Index - 77
eISSN - 1747-0285
pISSN - 1747-0277
DOI - 10.1111/j.1747-0285.2007.00579.x
Subject(s) - similarity (geometry) , set (abstract data type) , virtual screening , chemical space , computer science , data mining , information retrieval , fusion , test set , data set , database , sensor fusion , construct (python library) , artificial intelligence , drug discovery , bioinformatics , biology , image (mathematics) , programming language , linguistics , philosophy
A low‐dimensional method, based on the use of multiple fusion‐based similarity measures, is described for graphically depicting and characterizing relationships among molecules in compound databases. The measures are used to construct multi‐fusion similarity maps that characterize the relationship of a set of ‘test’ molecules to a set of ‘reference’ molecules. The reference set is very general and can be made of molecules from, for example, the set of test molecules itself (the self‐referencing case), from a small library or large compound collection, or from actives in a given assay or group of assays. The test set is any collection of compounds to be analyzed with respect to the specified reference set. Multiple fusion similarity measures tend to provide more information than single fusion‐based measures, including information on the nature of the chemical‐space neighborhoods surrounding reference‐set molecules. A general discussion is presented on how to interpret multi‐fusion similarity maps, and several examples are given that illustrate how these maps can be used to compare compound libraries or collections, to select compounds for screening or acquisition, and to identify new active molecules using ligand‐based virtual screening.