PlumX Metrics
Embed PlumX Metrics

Chemical XAI to Discover Probable Compounds’ Spaces Based on Mixture of Multiple Mutated Exemplars and Bioassay Existence Ratio

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), ISSN: 1611-3349, Vol: 12402 LNCS, Page: 177-189
2020
  • 3
    Citations
  • 0
    Usage
  • 4
    Captures
  • 0
    Mentions
  • 0
    Social Media
Metric Options:   Counts1 Year3 Year

Metrics Details

Conference Paper Description

Chemical industry pays much cost and long time to develop a new compound having aimed biological activity. On average, 10,000 candidates are prepared for each successful compound. The developers need to efficiently discover initial candidates before actual synthesis, optimization and evaluation. We developed a similarity-based chemical XAI system to discover probable compounds’ spaces based on mixture of multiple mutated exemplars and bioassay existence ratio. Our system piles up 4.6k exemplars and 100M public DB compounds into vectors including 41 features. Users input two biologically active sets of exemplars customized with differentiated features. Our XAI extracts compounds’ spaces simultaneously similar to multiple customized exemplars using vectors’ distances and predicts their biological activity and target with the probability shown as existence ratio of bioassay that is the information of biological activity and target obtained from public DB or literature including related specific text string. The basis of prediction is explainable by showing biological activity and target of similar compounds included in the extracted spaces. The mixture of multiple mutated exemplars and bioassay existence ratio shown as probability with the basis of prediction can help the developers extract probable compounds’ spaces having biological activity from unknown space. The response time to extract the spaces between two sets of 128 exemplars and 100M public DB compounds was 9 min using single GPU with HDD read and 1.5 min on memory. The bioassay existence ratio of extracted spaces was 2–14 times higher than the average of public ones. The correlation coefficient and R2 between predicted and actual pIC50 of biological activity were 0.85 and 0.73 using randomly selected 64 compounds. Our XAI discovered probable compounds’ spaces from large space at high speed and probability.

Provide Feedback

Have ideas for a new metric? Would you like to see something else here?Let us know