How Does Dataiku Connect to Your Data Sources?

READ DATASHEET
  • Connect to existing
    infrastructure
  • Format and schema
    detection
  • No need to move
    data for processing
  • Connect to existing
    infrastructure
  • Format and schema
    detection
  • No need to move
    data for processing
Product demonstration of connection to data storage systems

Connect to more than 25 data storage systems

  • File uploads (Filesystem, FTP, HTTP, SSH, SFTP)
  • PostgreSQL, MySQL
  • Enterprise SQL (Oracle, MS SQL Server)
  • Analytic SQL (Vertica, Greenplum, Redshift, Teradata, Exadata)
  • Other SQL databases
  • NoSQL (MongoDB, Cassandra, Elasticsearch)
  • Hadoop (HDFS)
  • Cloud (S3)
  • And more!
Website screenshot: DSS plugins let you extend the power of DSS

Extend with plugins

  • Connect to nearly any data available out there thanks to a custom API connector.
  • Package your custom Python projects and connectors to share them with your team or the community.
  • Use Dataiku DSS plugins and connectors implemented by the user community.
Computer showing the format and schema detection feature

Format and schema detection

  • DSS automatically infers both the format and the schema of your data.
  • With instant access to data, no need to write fastidious formatting settings before reading a dataset anymore.
  • In just a few clicks, even non-technical team members can access data whatever the format or type.
Data servers illustrating existing infrastructure

No need to move data for processing

DSS pushes computation in your existing SQL, Hadoop, or Spark infrastructure.