Connect to all your data sources from Dataiku

READ DATASHEET
  • Connect to existing
    infrastructure
  • Extend
    existing connectivity
  • Format and schema
    detection
  • Connect to existing
    infrastructure
  • Extend
    existing connectivity
  • Format and schema
    detection
Product demonstration of connection to data storage systems

Connect to more than 25 data storage systems

  • Analytical MPP databases (Teradata, Greenplum, Vertica)
  • Cloud databases (Amazon Redshift, Google BigQuery, Snowflake, Azure SQL)
  • Operational databases (Oracle, MS SQL Server, PostgreSQL, MySQL)
  • NoSQL stores (MongoDB, Cassandra, Elasticsearch)
  • Hadoop (HDFS)
  • Cloud object storage (Amazon S3, Google Cloud Storage, Azure Blob Storage)
  • Remote data sources (API, HTTP, FTP, SCP, SFTP)
  • And more!
Website screenshot: DSS plugins let you extend the power of DSS

Extend existing connectivity

  • Connect to nearly any data available out there thanks to DSS Plugins.
  • Use R or Python to create custom connectors for any APIs, databases, or file-based formats and share them with your team or the community.
  • Leverage existing Dataiku Plugins and connectors implemented by the user community.
Computer showing the format and schema detection feature

Automatically detect dataset format and schema

  • Dataiku automatically infers both the format and the schema of your data.
  • With instant access to data, no need to write fastidious formatting settings before reading a dataset anymore.
  • In just a few clicks, even non-technical team members can access data and interact with data, whatever the format or type.