Predicting clinically promising therapeutic hypotheses using tensor factorization

Citation DatabioRxiv, ISSN: 2692-8205

Publication Year2018

2
Citations
0
Usage
0
Captures
0
Mentions
0
Social Media

Metric Options: Counts1 Year3 Year

Metrics Details

Citations
2
- Citation Indexes
  2

Article Description

Determining which target to pursue is a challenging and error-prone first step in developing a therapeutic treatment for a disease, where missteps are potentially very costly given the long-time frames and high expenses of drug development. We identified examples of successes and failures of target-indication pairs in clinical trials across 875 targets and 574 disease indications to build a gold-standard data set of 6,140 known clinical outcomes. We used information from Open Targets and others databases that covered 17 different sources of evidence for target-indication association and represented the data as a matrix of 21,437×2,211×17 with over two million non-null values. We designed and executed three benchmarking strategies to examine the performance of multiple machine learning models: Logistic Regression, Elasticnet, Random Forest, Tensor Factorization and Gradient Boosting Machine. With ten-fold cross validation, tensor factorization achieved AUROC=0.82±0.02 and AUPRC=0.71±0.03. Across multiple validation schemes, this was comparable or better than other methods. Tensor factorization is a general form of matrix factorization that has been successfully exploited in recommendation systems that suggest items to users based on their existing preference on a small number of items. Our application, using Bayesian probabilistic modelling, extends the capacity of matrix factorization to model multiple relationships between and among targets and indications. We use the model to show that our predicted probabilities of success correlate with clinical phases, and within clinical phase we can predict which trials are most likely to succeed.

Bibliographic Details

DOI10.1101/272740

URL IDhttp://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=85095636639&origin=inward; http://dx.doi.org/10.1101/272740; https://dx.doi.org/10.1101/272740; https://www.biorxiv.org/content/10.1101/272740v1

AUTHOR(S)

Jin Yao; Mark R. Hurle; Pankaj Agarwal; Matthew R. Nelson

PUBLISHER(S)

Cold Spring Harbor Laboratory

TAG(S)

Biochemistry, Genetics and Molecular Biology; Agricultural and Biological Sciences; Immunology and Microbiology; Neuroscience; Pharmacology, Toxicology and Pharmaceutics

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know