en

Data Preparation with Dataiku

Connect, cleanse, and prepare data for analytics and machine learning projects at scale.

 

Visual Flow

The Dataiku flow provides a visual representation of a project’s data pipeline and is the central space where coders and non-coders view and analyze data, add recipes to join and transform datasets, and build predictive models.

The visual flow also contains code-based and plugin elements for added customization and extensibility.

 

Connect to Leading Data Sources

Dataiku provides pre-built connectors to dozens of leading data sources both on-premises and in the cloud, including Amazon S3, Azure Blob Storage, Google Cloud Storage, Snowflake, SQL databases, NoSQL databases, HDFS, and more.

 

Data Preparation and Enrichment

Dataiku provides easy-to-use visual interfaces to join datasets, group and aggregate, clean, transform, and enrich data, all with a few clicks. Best of all, Dataiku automatically documents all steps in a recipe as part of the visual flow.

If you’d rather code than click, create code recipes using familiar languages such as Python, R, and SQL, developed and edited in your favorite IDE.

 

100 Built-in Data Transformers

The powerful prepare recipe includes 100 built-in data transformers for common data manipulations like binning, concatenation and strings manipulation, currency and date conversions, geo-enrichment, and reshaping.

Dataiku even suggests relevant functions for you based on the data’s type and values.

For custom transformations, write formulas using a spreadsheet-like expression language or Python code for ultimate flexibility.

 

Specialized Data Preparation

Dataiku offers a wide variety of functions and tools to parse and enrich specialized data types such as geospatial data, time series, images, and text with additional metadata and structure.

Examples include geo joins and geocoding, time series resampling, image annotation, text vectorization, and much more.

Go Further

Discover how Dataiku Enables Business Experts

Beyond data preparation, explore how Dataiku helps analysts and business experts.

Discover

Get a Demo

See what Dataiku has to offer for business analysts.

Watch Now

Dig Deeper With a Sample Project

Explore this Dataiku project on how to use visual data preparation to clean a dataset.

Sample Project

Data Prep Tips & Tricks

Download an e-book containing practical solutions to common data preparation mistakes.

Get the ebook

Get Started with Dataiku

Start Your Dataiku 14-Day Free Trial
or Install the Free Edition of Dataiku

Get Started