AI for drug discovery
Ho Leung Ng
Manhattan, Kansas
- 0 Collaborators
We are applying machine learning to computational drug discovery, focusing on cancer, immunology, and malaria. ...learn more
Project status: Under Development
Groups
DeepLearning,
Computaional Biology
Intel Technologies
Intel Opt ML/DL Framework,
Intel Python
Overview / Usage
Active topics are 1) predicting drug binding sites in target proteins, 2) developing ML-based scoring functions to identify bioactive molecules in virtual screening, and 3) using generative adversarial network methods to discover bioactive molecules with novel chemotypes. We are focusing on drugs for cancer, immunology, and malaria.
We recently placed 4th in the Drug Design Data Resource Grand Challenge (https://drugdesigndata.org/) to predict the binding activity of a set of molecules to the Alzheimer's Disease drug target, BACE1. Our strategy used a neural network trained on chemical data from closely related BACE1 inhibitors.
Methodology / Approach
We are focusing on several different approaches in our projects.
-
Training ML-based scoring functions on target-specific data for docking and virtual screening. We treat docking (predicting the correct binding pose) as a separate problem from predicting binding affinity. Scoring functions must be fast to compute to be useful for virtual screening.
-
Training ML-based scoring functions for fragment screening. Fragments tend to be smaller and more soluble than traditional drug-like molecules. We are developing scoring functions specific for their chemical properties.
-
Using structure conservation data, docking, and fragment screening to predict alternative drug binding sites in protein targets. This approach can be useful to obtain better target specificity, synergistic effects with other drug treatment, and identification of drug molecules with novel chemical scaffolds.
-
Developing GANs to generate novel drug-like molecules. By including new cost functions, one can in principle train GANs to focus or avoid certain chemical scaffolds, propose molecules with lower toxicity, and explore new chemical space.
Technologies Used
Intel python, numpy, scipy, scikit-learn, pytorch, tensorflow, keras, intel math kernel library, intel compilers, intel CPUs