Try Dataiku
Turn Raw Data Into AI-Enabled Services
Get Started
or, Download Community Edition
- Installed on your infrastructure
- Files or open source databases
- Up to 3 users
Discover
For Small Teams
Everything in Free, plus:
- Up to 5 users
- 20+ database connectors
- Process in-memory or in-database using Spark
- Limited automation
- Standard support
Business
For Mid-Sized Teams
Everything in Discover, plus:
- Up to 20 users
- Unlimited & elastic computations with Kubernetes
- Full automation, limited deployment
- Advanced security
- Silver customer service
Enterprise
Scalable Automation and Governance
Everything in Business, plus:
- All database connectors
- Full deployment capabilities
- Isolation framework
- Unlimited instances and resource governance
- Gold customer service
Need to Upgrade?
More Details
Compare editions | Free | Discover | Business | Enterprise |
---|---|---|---|---|
| Community | For Small Teams | For Mid-Sized Teams | Scalable Automation and Governance |
Basics | ||||
---|---|---|---|---|
Visual interactive data preparation and data transformation | ||||
Visual machine learning and automated features preprocessing | ||||
Built-in charts and dashboards | ||||
Code notebooks and recipes | ||||
Custom web applications and plugins | ||||
Collaboration |
Data Connectors | ||||
---|---|---|---|---|
Smart incremental rebuild | ||||
Concurrent jobs | Up to 3 | Unlimited | Unlimited | Unlimited |
Filesystem, FTP, HTTP, SSH, SFTP and cloud storage | ||||
PostgreSQL, MySQL | ||||
Editable datasets | ||||
Hadoop (HDFS) | ||||
Enterprise SQL (Oracle, MS SQL Server) | ||||
Analytic SQL (Vertica, Greenplum, Redshift, BigQuery, Snowflake) | ||||
NoSQL (MongoDB, Cassandra, Elasticsearch) | ||||
Teradata, Netezza, Exadata, HANA |
Visual Transformation & Exploration | ||||
---|---|---|---|---|
Visual interactive data preparation (80+ processors) | ||||
Visual transformations (Group, join, union, split, sampling, etc.) | ||||
15 built-in chart types | ||||
Visual interactive statistics | ||||
Dashboards | ||||
Custom web applications | ||||
In-database charts engine | MySQL, PostgreSQL | |||
Distributed charts engine (Impala & Athena) |
Core Capabilities | ||||
---|---|---|---|---|
Python, R, SQL, Shell | ||||
Notebook & IDE | ||||
Code Env & Git integration | ||||
Webapp, R Shiny, Bokeh | ||||
Spark code (PySpark, SparkR, SparkSQL) | ||||
Hadoop code (Hive, Impala) |
Computing Infrastructure | ||||
---|---|---|---|---|
In-memory processing | ||||
In-database processing | MySQL, PostgreSQL | Limited to allowed SQL databases | Limited to allowed SQL databases | |
In-cluster processing | Single Hadoop/Spark cluster (on-prem or cloud-managed) | Resources from multiple Kubernetes clusters for Spark, Hadoop, or in-memory computation | Resources from multiple Kubernetes clusters for Spark, Hadoop, or in-memory computation | |
Automatic Kubernetes cluster creation | AWS, GCP, or Azure | AWS, GCP, or Azure |
Machine Learning | ||||
---|---|---|---|---|
Visual ML: regression, classification, clustering | ||||
Automated features preprocessing | ||||
Custom Python algorithms | ||||
In-memory engines | ||||
Distributed engine: Spark |
Production & Automation | ||||
---|---|---|---|---|
ML models versioning | ||||
Batch scoring | In-memory only | |||
Custom user interfaces with hosted webapps | ||||
Pipeline scheduling | Up to 2 triggers | |||
Monitoring, notifications | ||||
Partitioning management | ||||
Real-time prediction API | 2 API nodes only | Serverless & unlimited |
Team & Collaboration | ||||
---|---|---|---|---|
Multi-users | Up to 3 | Up to 5 | Up to 20 | Unlimited |
Discussion and wikis | ||||
Change management | ||||
Role-based security | ||||
LDAP support | ||||
SSO support |
More | ||||
---|---|---|---|---|
Plugins (use and contribute) | ||||
Public REST API | ||||
Support | Community | Standard | Silver | Gold |

Dataiku DSS scores an overall rating of 4.9 out of 5,
based on 79 total ratings in the last 12 months for the DSMLP market, as of November 13, 2020.
The Gartner Peer Insights Logo is a trademark and service mark of Gartner, Inc., and/or its affiliates, and is used herein with permission. All rights reserved. Gartner Peer Insights reviews constitute the subjective opinions of individual end users based on their own experiences and do not represent the views of Gartner or its affiliates.