Due to relatively high costs and labor required for experimental profiling of the full target space of chemical compounds, various machine learning models have been proposed as cost-effective means to advance this process in terms of predicting the most potent compound-target interactions for subsequent verification. However, most of the model predictions lack direct experimental validation in the laboratory, making their practical benefits for drug discovery or repurposing applications largely unknown. Here, we therefore introduce and carefully test a systematic computational-experimental framework for the prediction and pre-clinical verification of drug-target interactions using a well-established kernel-based regression algorithm as the prediction model. To evaluate its performance, we first predicted unmeasured binding affinities in a large-scale kinase inhibitor profiling study, and then experimentally tested 100 compound-kinase pairs. The relatively high correlation of 0.77 (p < 0.0001) between the predicted and measured bioactivities supports the potential of the model for filling the experimental gaps in existing compound-target interaction maps. Further, we subjected the model to a more challenging task of predicting target interactions for such a new candidate drug compound that lacks prior binding profile information. As a specific case study, we used tivozanib, an investigational VEGF receptor inhibitor with currently unknown off-target profile. Among 7 kinases with high predicted affinity, we experimentally validated 4 new off-targets of tivozanib, namely the Src-family kinases FRK and FYN A, the non-receptor tyrosine kinase ABL1, and the serine/threonine kinase SLK. Our sub-sequent experimental validation protocol effectively avoids any possible information leakage between the training and validation data, and therefore enables rigorous model validation for practical applications. These results demonstrate that the kernel-based modeling approach offers practical benefits for probing novel insights into the mode of action of investigational compounds, and for the identification of new target selectivities for drug repurposing applications.
Significant efforts have been devoted in recent years to the development of machine learning models to support different stages of drug development process. Given the enormous size of the chemical universe, such models could offer a complementary and cost-effective means to experimental determination of drug-target interactions, toward prioritization of the most potent ones for further verification in the laboratory. In order to demonstrate the benefits of the prediction models in practical application cases, we carefully evaluated the predictive power of a well-established machine learning model in filling the gaps in existing profiling studies and prediction of target interactions for a new drug candidate. As a specific case study, we focused on kinase inhibitors, which form the largest class of new drugs approved for cancer treatment, but are also known to have wide multi-target activities contributing both to their therapeutic and toxic responses. The high agreement observed between the predicted and experimentally-measured drug-target bioactivities under the implemented rigorous validation setup demonstrates the potential of the machine learning approach, not only for filling the gaps in existing drug-target interaction maps, but also toward off-target interaction prediction for investigational drugs, and finding potential new uses for already approved drugs (drug repurposing).