Snowflake Plugin

Snowflake is a data warehouse built for the Cloud, offering the following characteristics:

  • Performance: Snowflake easily scales to multiple petabytes and performs up to 200x faster than other systems.
  • Concurrency: multiple groups can access the same data at the same time without impacting performance.
  • Simplicity: a fully managed, pay-as-you-go solution that stores, integrates and analyzes all your data.

Snowflake is built on top of the Amazon Web Services (AWS) cloud.

This DSS Plugin offers the ability to quickly load data stored in S3 into Snowflake.

[Screenshot: a Dataiku DSS Flow using Snowflake. Process your DSS Datasets with Snowflake.]

Plugin information

Version: 0.1.1
Author: Dataiku
Released: 2018-05-22
Last updated: 2018-05-22
License: Apache Software License
Source code: Github repository

Dataiku and Snowflake

Dataiku and Snowflake are two complementary solutions. Snowflake is a high-performance, scalable data warehouse optimized for analytics workloads, and it can be used by DSS as a "backend" computation engine for machine learning workflows.

A typical ML project lifecycle could be:

  • users have raw data stored in Amazon S3 (the "data lake")
  • they process this data and push it to Snowflake using Dataiku DSS
  • they perform initial data exploration (visualization, descriptive statistics...) using in-database processing (Dataiku DSS pushes the calculations down to Snowflake) over entire Datasets, with no sampling required (see the sketch after this list)
  • they perform feature engineering (i.e. aggregating and reshaping the raw data to make it usable by ML algorithms) using either DSS "Visual" or "Coding" recipes, which again are fully pushed down to Snowflake as SQL statements
  • they train and deploy an ML model, and the results (for example, the scores of a predictive model) are stored in Snowflake (or back in S3) in an automated way
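
To give a concrete flavor of this pushdown, the sketch below shows how an aggregation can be delegated to Snowflake from a DSS Python notebook or recipe through the DSS SQL executor. The connection name, table name and columns are placeholders, and this snippet is an illustration of the idea rather than part of the Plugin.

from dataiku import SQLExecutor2

# "snowflake" is a hypothetical DSS connection name pointing to a Snowflake database
executor = SQLExecutor2(connection="snowflake")

# The aggregation runs entirely inside Snowflake; only the (small) result
# comes back to DSS as a pandas DataFrame
df = executor.query_to_df("""
    SELECT customer_id, COUNT(*) AS nb_orders, SUM(amount) AS total_amount
    FROM "ORDERS"
    GROUP BY customer_id
""")
print(df.head())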

Dataiku already provides a Snowflake connector. This connector uses JDBC and lets the user easily read data stored in Snowflake and run Visual or Coding Recipes. One caveat is that, in its current form, the connector may suffer from performance issues when writing data back into Snowflake, as it uses regular "INSERT INTO" statements.

This Plugin emulates a DSS "Sync" Recipe but leverages Snowflake's built-in mechanism for fast bulk loading ("COPY INTO") of data stored in Amazon S3 (using an S3 DSS Dataset).
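
To give an idea of what happens under the hood, the sketch below (not the Plugin's actual code) shows how such a COPY INTO statement can be issued from Python with the Snowflake connector; all connection parameters, table and bucket names are placeholders.

import snowflake.connector

# Placeholder credentials and object names, for illustration only
conn = snowflake.connector.connect(
    user="dss_user",
    password="********",
    account="your_account",
    warehouse="your_warehouse",
    database="your_database",
    schema="PUBLIC",
)

# Bulk-load CSV files from an S3 prefix into a Snowflake table
copy_statement = """
    COPY INTO "MY_OUTPUT_TABLE"
    FROM 's3://my-bucket/path/to/dataset/'
    CREDENTIALS = (AWS_KEY_ID='your-aws-access-key' AWS_SECRET_KEY='your-aws-secret-key')
    FILE_FORMAT = (TYPE = CSV FIELD_DELIMITER = ',' SKIP_HEADER = 1)
"""

try:
    conn.cursor().execute(copy_statement)
finally:
    conn.close()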

Prerequisites

To use this Plugin, you will need:

  • a Snowflake connection configured in DSS
  • an Amazon S3 connection (and an S3 Dataset) holding the data to load
  • the AWS Access Key and Secret Key of the corresponding S3 bucket

This Plugin comes with a dedicated Python code environment that manages the dependency on the Snowflake Python connector. Note that this connector may require the following Linux libraries to be present on the host machine:

  • libssl-dev
  • libffi-dev

Finally, the Plugin has been tested with Python 3.6 and requires a valid Python 3.6 installation on the machine (the Plugin code environment is restricted to Python 3.6).

How to use

The Plugin contains a custom Python Recipe emulating a DSS Sync Recipe: it takes an S3 Dataset as input and bulk-loads it into a Snowflake connection using a COPY INTO statement.

  • Install the Plugin
  • Create an "S3" Dataset (i.e. a Dataset that has been written into an S3 connection)
  • Add a custom Snowflake recipe taking the S3 Dataset as input, and outputting a DSS Dataset in your Snowflake connection
    [Screenshot: Snowflake Plugin input/output settings in the UI]

  • Enter your AWS credentials (in the Plugin UI or in DSS Variables)
    [Screenshot: Snowflake Plugin direct settings in the UI]

  • Run the recipe

The Plugin requires two input parameters: the AWS Access Key and the AWS Secret Key of the S3 bucket used with the Plugin. These parameters must be either:

  • Set in the Plugin UI (see screenshot above)
  • Defined as DSS Project or Global Variables

When the credentials are stored in DSS Variables, the Plugin expects them to be in the following format:

{
  "snowflake": {
    "aws_access_key": "your-aws-access-key",
    "aws_secret_key": "your-aws-secret-key"
  }
}

If the credentials are left blank in the Plugin UI, DSS will look first in the Project Variables, then in the Global Variables.
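
For illustration, here is a minimal, hypothetical sketch of that lookup order in Python; the real Plugin reads its configuration and the DSS Variables through the DSS APIs, and the function and argument names below are made up for the example.

def resolve_aws_credentials(plugin_config, project_variables, global_variables):
    """Return (access_key, secret_key), trying the Plugin UI first, then Project, then Global Variables."""
    candidates = (
        plugin_config,                           # values typed in the Plugin UI
        project_variables.get("snowflake", {}),  # "snowflake" block in the Project Variables
        global_variables.get("snowflake", {}),   # "snowflake" block in the Global Variables
    )
    for source in candidates:
        access_key = source.get("aws_access_key")
        secret_key = source.get("aws_secret_key")
        if access_key and secret_key:
            return access_key, secret_key
    raise ValueError("AWS credentials not found in the Plugin UI, Project Variables or Global Variables")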

Once the Recipe has run, a new Dataset should have been created with its data stored in Snowflake (this can be checked by looking at the Dataset settings, using a SQL Notebook, or directly in the Snowflake UI).

[Screenshot: Snowflake Dataset settings]
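
In addition to the checks above, a quick sanity check can also be run from a DSS Python notebook; the Dataset name below is a placeholder, and for very large tables an in-database count in a SQL Notebook is preferable to loading everything into pandas.

import dataiku

# Hypothetical name of the output Dataset created by the Plugin
output = dataiku.Dataset("orders_snowflake")

# Read the Dataset back into a pandas DataFrame (DSS fetches it from Snowflake)
df = output.get_dataframe()
print(df.shape)
print(df.head())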

Additional instructions are available in our Github repository.