We screen a large scale chemical compound dataset (about 10 million compounds) targeting 8 representative protein targets taken from the DUD.E diverse data set in order to examine the efficiency and effectiveness of the DFCNN method. For each target, the corresponding dataset contains some active compounds (between 40 and 536) in the DUD.E dataset and 10,402, 895 drug-like compounds from ZINC database. The effectiveness is measured by the prediction-random ratio (Ratio0.9), defined as TPR0.9/Random0.9, where TPR0.9 indicates the ratio (N0.9/Active_num) between the number of active compounds with a DFCNN score larger than 0.9 (N0.9) and the active number of compounds (Active_num). The total number of the compounds (Total_num) with score above 0.9 is defined as NN. The random selection rate (Random0.9) is defined as NN/Total_num. Using cutoff score of 0.9, the prediction-random ratio measures the ratio of predicted TPR and random selection TPR.
Deep Learning-based Protein-Drug Binding Prediction
We screen a large scale chemical compound dataset (about 10 million compounds) targeting 8 representative protein targets taken from the DUD.E diverse data set in order to examine the efficiency and effectiveness of the DFCNN method. For each target, the corresponding dataset contains some active compounds (between 40 and 536) in the DUD.E dataset and 10,402, 895 drug-like compounds from ZINC database. The effectiveness is measured by the prediction-random ratio (Ratio0.9), defined as TPR0.9/Random0.9, where TPR0.9 indicates the ratio (N0.9/Active_num) between the number of active compounds with a DFCNN score larger than 0.9 (N0.9) and the active number of compounds (Active_num). The total number of the compounds (Total_num) with score above 0.9 is defined as NN. The random selection rate (Random0.9) is defined as NN/Total_num. Using cutoff score of 0.9, the prediction-random ratio measures the ratio of predicted TPR and random selection TPR.
Corresponding Organization : Shenzhen University
Protocol cited in 4 other protocols
Variable analysis
- DFCNN model
- Protein-drug binding probability
- Prediction-random ratio (Ratio_0.9)
- Number of active compounds with DFCNN score larger than 0.9 (N_0.9)
- Total number of compounds with score above 0.9 (NN)
- Random selection rate (Random_0.9)
- Molecular vector of protein pocket and ligand generated by Mol2vec
- Dataset extracted from PDBbind database
- Negative data samples generated by cross-combination of proteins and ligands from PDBbind database
- Positive data samples taken from protein-ligand pairs in experimental structure
- 8 representative protein targets taken from the DUD.E diverse data set
- 10,402,895 drug-like compounds from ZINC database
- Active compounds (between 40 and 536) in the DUD.E dataset
Annotations
Based on most similar protocols
As authors may omit details in methods from publication, our AI will look for missing critical information across the 5 most similar protocols.
About PubCompare
Our mission is to provide scientists with the largest repository of trustworthy protocols and intelligent analytical tools, thereby offering them extensive information to design robust protocols aimed at minimizing the risk of failures.
We believe that the most crucial aspect is to grant scientists access to a wide range of reliable sources and new useful tools that surpass human capabilities.
However, we trust in allowing scientists to determine how to construct their own protocols based on this information, as they are the experts in their field.
Ready to get started?
Sign up for free.
Registration takes 20 seconds.
Available from any computer
No download required
Revolutionizing how scientists
search and build protocols!