z-logo
Premium
Fragment Prioritization on a Large Mutagenicity Dataset
Author(s) -
Floris Matteo,
Raitano Giuseppa,
Medda Ricardo,
Benfenati Emilio
Publication year - 2017
Publication title -
molecular informatics
Language(s) - English
Resource type - Journals
SCImago Journal Rank - 0.481
H-Index - 68
eISSN - 1868-1751
pISSN - 1868-1743
DOI - 10.1002/minf.201600133
Subject(s) - identification (biology) , computer science , prioritization , context (archaeology) , data mining , fragment (logic) , set (abstract data type) , algorithm , biology , engineering , paleontology , botany , management science , programming language
The identification of structural alerts is one of the simplest tools used for the identification of potentially toxic chemical compounds. Structural alerts have served as an aid to quickly identify chemicals that should be either prioritized for testing or for elimination from further consideration and use. In the recent years, the availability of larger datasets, often growing in the context of collaborative efforts and competitions, created the raw material needed to identify new and more accurate structural alerts. This work applied a method to efficiently mine large toxicological dataset for structural alert showing a strong statistical association with mutagenicity. In details, we processed a large Ames mutagenicity dataset comprising 14,015 unique molecules obtained by joining different data sources. After correction for multiple testing, we were able to assign a probability value to each fragment. A total of 51 rules were identified, with p‐value < 0.05. Using the same method, we also confirmed the statistical significance of several mutagenicity rules already present and largely recognized in the literature. In addition, we have extended the application of our method by predicting the mutagenicity of an external data set.

This content is not available in your region!

Continue researching here.

Having issues? You can contact us here