Connect to All Your Data Sources

As Dataiku connects to existing infrastructure, there is no need to move data for processing. Format and schema detection allows instant access to data.

Connect to all your data sources

Connect to more than 25 data storage systems

  •  Analytical MPP databases (Teradata, Greenplum, Vertica)
  •  Cloud databases (Amazon Redshift, Google BigQuery, Snowflake, Azure SQL)
  •  Operational databases (Oracle, MS SQL Server, PostgreSQL, MySQL)
  •   NoSQL stores (MongoDB, Cassandra, Elasticsearch)
  •  Hadoop (HDFS)
  •  Cloud object storage (Amazon S3, Google Cloud Storage, Azure Blob Storage)
  •   Remote data sources (API, HTTP, FTP, SCP, SFTP)
  •  And more!
Connect to more than 25 data storage systems

Extend existing connectivity

  •  Connect to nearly any data available out there thanks to DSS Plugins.
  •  Use R or Python to create custom connectors for any APIs, databases, or file-based formats and share them with your team or the community.
  •  Leverage existing Dataiku Plugins and connectors implemented by the user community.
Extend existing connectivity

Automatically detect dataset format and schema

  •  Dataiku automatically infers both the format and the schema of your data.
  •  With instant access to data, no need to write fastidious formatting settings before reading a dataset anymore.
  •  In just a few clicks, even non-technical team members can access data and interact with data, whatever the format or type.
Automatically detect dataset format and schema

Get Started with Dataiku

Start an online hosted trial, download the free edition,
or compare the features of the Lite, Team, and Enterprise editions.

Get Started