When you work on a project, you will most probably start by drafting your work, wether you perform an analysis or preliminary data exploration. This must be done in the Lab environment which contains :
Once your draft is well worked out, you will assemble it as pieces of the Flow together with all other proper works it already contains. To understand how to deploy the Lab work in the flow of the project, please refer to the From the lab to the flow page.
You can prepare your data in Dataiku DSS by using either a visual preparation recipe, or a visual analysis of the Lab environment. Learn when to choose the Lab over the prepare recipe.
When working on a prepare recipe or in an analysis, you get live visual feedback for all the preparation steps that you add. This couldn’t happen if you were previewing whole datasets (big data just doesn’t fit in DSS’s memory), so you are simply looking at a sample. To understand how to change the sampling or work on the complete data, head to the Sampled vs Complete data page.
When you edit an existing recipe or when your data is updated, you will need to rebuild your downstream datasets to update their contents. Make sure you understand the tools to accelerate this task by reading our Rebuilding Data page.
Last but not least, when your development work is done, your flow should be now steady. It is time to deploy your work in production. Dataiku DSS greatly facilitates all automation and monitoring tasks, so make sure you understand all production related features by reading our portal on automation.
Dataiku DSS is all about making teams more efficient, so don’t forget to promote your work to others and learn to collaborate efficiently!