Computational Toxicity Prediction Protocol

To predict the toxicity of an input compound, a 2D similarity search is performed against an updated version of the in-house toxicity database SuperToxic (17), and the compounds most similar to the input molecule are considered. The prediction set consists of approximately 38 000 unique compounds with known oral LD₅₀ values measured in rodents. The data were gathered from public sources and the literature and standardized using Instant JChem 6.2.0 (January 2014), ChemAxon (http://www.chemaxon.com).
InChI keys were calculated from the standardized molecular structures and used to remove duplicates from the dataset. Where multiple LD₅₀ values had been measured for one compound, the lowest dose was kept to represent the worst-case toxicity of that compound. Six toxicity classes were defined based on the GHS classification scheme, using LD₅₀ thresholds of 5, 50, 300, 2000 and 5000 mg/kg body weight. Each compound in the dataset was represented by a concatenated fingerprint consisting of the ‘FP2’ and ‘FP4’ fingerprints of Mychem (http://mychem.sourceforge.net/) and the ECFP4 fingerprint (18). The FP2 and FP4 fingerprints were calculated using Open Babel (19), and the ECFP4 fingerprint using JChem 6.1.3 (November 2013), ChemAxon (http://www.chemaxon.com). The similarity between two compounds was calculated using the Tanimoto index.
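The data preparation and similarity steps described above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the authors' implementation: the function names (`ghs_class`, `deduplicate_worst_case`, `tanimoto`, `most_similar`) are hypothetical, fingerprints are modelled as sets of on-bit positions rather than real FP2/FP4/ECFP4 output, and the class boundaries are assumed to be inclusive on the upper side (for example, an LD₅₀ of exactly 5 mg/kg falls in class I), which the text does not state.

```python
from bisect import bisect_left

# Upper LD50 bounds (mg/kg body weight) for classes I-V; above 5000 is class VI.
GHS_THRESHOLDS = [5, 50, 300, 2000, 5000]


def ghs_class(ld50_mg_per_kg):
    """Return the toxicity class as an integer (1 = most toxic, 6 = least toxic).

    Assumption: each threshold is an inclusive upper bound of its class.
    """
    return bisect_left(GHS_THRESHOLDS, ld50_mg_per_kg) + 1


def deduplicate_worst_case(records):
    """Collapse (inchi_key, ld50) pairs to one entry per InChI key.

    The lowest LD50 is kept, representing the worst-case toxicity.
    """
    lowest = {}
    for inchi_key, ld50 in records:
        if inchi_key not in lowest or ld50 < lowest[inchi_key]:
            lowest[inchi_key] = ld50
    return lowest


def tanimoto(bits_a, bits_b):
    """Tanimoto index of two fingerprints given as sets of on-bit positions."""
    union = len(bits_a | bits_b)
    if union == 0:
        return 0.0  # two empty fingerprints: treated as dissimilar by convention here
    return len(bits_a & bits_b) / union


def most_similar(query_bits, database, k=3):
    """Return the k database entries most similar to the query.

    database maps a compound identifier to its set of on-bit positions.
    The result is a list of (identifier, similarity), highest similarity first.
    """
    scored = [(name, tanimoto(query_bits, bits)) for name, bits in database.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:k]
```

In this sketch, a concatenated fingerprint would simply be the union of the on-bit positions of the individual fingerprints after offsetting each one by the combined length of those before it, so that bit positions from different fingerprints do not collide.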
In addition to the similarity search, the prediction method takes into account the presence of toxic fragments. All compounds in the database were fragmented using RECAP (20) as well as the in-house method ROTBONDS (21). The occurrence of each distinct fragment in the molecules of the prediction dataset was tested by a substructure search on its SMILES string, which was computed with JChem 6.1.3 (November 2013); the search was implemented using Open Babel's (19) fast search. To identify fragments over-represented in the most toxic classes, a propensity analysis (22) was performed, in which propensity scores (PS) were calculated for every fragment and toxicity class. Toxic fragments were defined as those with a PS above 3 in classes I, II or III and a PS below 1 in classes IV–VI. Under these conditions, a total of 1591 and 1580 fragments specific to toxicity classes I–III, generated with the ROTBONDS and RECAP fragmentation methods, respectively, were considered for prediction.
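The fragment selection rule can be illustrated with a short Python sketch. The text does not give the propensity score formula, so the definition below is an assumption: the share of a fragment's occurrences that fall in a class, divided by the share of all compounds that fall in that class. The reading of the rule is also an interpretation: PS above 3 in at least one of classes I–III, and PS below 1 in every one of classes IV–VI. The function names are hypothetical.

```python
TOXIC_CLASSES = (1, 2, 3)      # classes I-III
NONTOXIC_CLASSES = (4, 5, 6)   # classes IV-VI


def propensity_scores(fragment_hits, class_sizes):
    """Compute one propensity score per toxicity class for a single fragment.

    fragment_hits: class -> number of compounds in that class containing the fragment
    class_sizes:   class -> total number of compounds in that class

    Assumed definition (not stated in the text):
        PS(class) = (hits_in_class / total_hits) / (class_size / dataset_size)
    """
    total_hits = sum(fragment_hits.values())
    dataset_size = sum(class_sizes.values())
    scores = {}
    for cls, size in class_sizes.items():
        if total_hits == 0 or size == 0:
            scores[cls] = 0.0
            continue
        observed = fragment_hits.get(cls, 0) / total_hits
        expected = size / dataset_size
        scores[cls] = observed / expected
    return scores


def is_toxic_fragment(scores, high=3.0, low=1.0):
    """Apply the thresholds from the text under the interpretation stated above."""
    enriched = any(scores.get(c, 0.0) > high for c in TOXIC_CLASSES)
    depleted = all(scores.get(c, 0.0) < low for c in NONTOXIC_CLASSES)
    return enriched and depleted
```

A score of 1 under this definition means the fragment occurs in a class exactly as often as expected by chance, so the thresholds of 3 and 1 select fragments that are at least threefold enriched in a toxic class and under-represented in all of the less toxic ones.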


Drwal M.N., Banerjee P., Dunkel M., Wettig M.R., & Preissner R. (2014). ProTox: a web server for the in silico prediction of rodent oral toxicity. Nucleic Acids Research, 42(Web Server issue), W53-W58.

Publication 2014

Body weight Molecule structures Rodents


Other organizations : Charité - Universitätsmedizin Berlin, Humboldt-Universität zu Berlin, German Cancer Research Center


Variable analysis

independent variables

2D similarity search performed on an updated version of the in-house toxicity database SuperToxic
Presence of toxic fragments

dependent variables

Toxicity prediction of the input compound

control variables

Standardization of molecule structures using Instant JChem 6.2.0 (January 2014), ChemAxon
Removal of duplicate compounds using InChI keys
Keeping the lowest LD₅₀ value to represent the worst-case toxicity of a compound
Representation of compounds using concatenated fingerprints (FP2, FP4, ECFP4)
Calculation of similarity between compounds using the Tanimoto Index
Fragmentation of compounds using RECAP and ROTBONDS methods
Substructure search to test fragment occurrence, followed by propensity analysis to determine fragments over-represented in the most toxic classes

