en

Optimize Development Process of New
Drug Candidates

The Dataiku Molecular Property Prediction Solution helps perform molecular virtual screening on target protein responses by querying small molecules with known bioactivity via ChEMBL and PubChem databases.

 

Query Public
Chemical Databases

Explore previously studied and novel molecular structures (SMILES) for drug candidates. Easily query public chemical databases with ChEMBL and PubChem APIs. Build the solution flow by selecting a preferred chemical database, then query by target protein and specify feature generation and modeling options for bioactivity prediction.

EXPLORE THE PROJECT
 

Reuse and Extend
Coding Functions

Power recipes in the flow using Python with open source chemoinformatics libraries. Create molecular descriptors and fingerprint features from SMILES chemical structures with RDKit and pre-trained transformer language models like ChemBERTA with HuggingFace.

LEARN MORE ABOUT EXTENSIBILITY WITH DATAIKU
 

Cluster and Predict Bioactivity

Gain deeper understanding of the molecular structure space with dimension reduction methods like t-SNE, PCA, and clustering, then train ML models to predict the bioactivity of small molecules for a given protein target.

SEE THE SOLUTION IN ACTION
 

Deeply Explore
Molecular Properties

Use dynamic, interactive dashboards to understand previously studied and novel molecules. Understand chemical properties and molecular descriptors with interactive visualizations, analyses, and tables. Filter to review individual molecule summaries, and screen leading candidates in-silico through the Dataiku App.

GO TO THE CHEMICAL SPACE ANALYSIS DASHBOARD
 

Visualize Development Candidates

Assess novel small molecules and gain valuable insights by computing and visualizing molecular descriptors and fingerprints. Use these insights and the trained bioactivity prediction model to score new molecules, and  find similar molecules in previous experiments.

EXPLORE THE MOLECULAR SIMILARITY DASHBOARD
 

Accelerate Molecular Prediction

Gain full flexibility by adjusting to specific research fields with a composable and extendable solution. Adapt and expand research to gain new insights and make progressively better molecular predictions.

EXPLORE THE DATAIKU FLOW
 

Answer Key Molecular Research Questions

The Dataiku Solution for Molecular Property Prediction helps answer a broad range of questions like:

  • How can I reduce cost and time in the drug discovery and development process?
  • How can I use AI and ML to research molecules and their bioactivity for a given protein target?
  • Can I use ML to help prioritize hit-to-lead development and experimentation?
READ THE DOCUMENTATION ON MOLECULAR PROPERTY PREDICTION
 

Optimize Molecular Screening Before Experimental Work

Improve the success and efficiency of the drug development cycle by prioritizing lead development candidate experimentation. Accelerate the process of predicting target protein bioactivity in novel small molecules. Leverage AI to build a pipeline of potential drug solutions to help bring a stable compound quickly to preclinical and clinical testing.

DISCOVER THE FULL CAPABILITIES OF DATAIKU

The Total Economic Impact™️
Of Dataiku

A composite organization in the commissioned study conducted by Forrester Consulting on behalf of Dataiku saw the following benefits:

70 %+

reduction in time spent on data analysis, extraction, and preparation.

42 %

reduction in time spent on model lifecycle activities (training, deployment, and monitoring).

413 %

return on investment

$ 23 .5M

net present value over three years.

See Dataiku in Action

Watch the end-to-end Dataiku demo to understand how the Universal AI Platform can fit your needs for one use case … and beyond.