Workflows, Automation, And Monitoring
Dataiku lets you deploy workflows easily with safe versioning and rollback. Then automate your deployments as part of a larger production strategy.
Learn MoreAt a very high level, AutoML is about using machine learning techniques to automatically do machine learning. Or in other words, it means automating the process of applying machine learning. Early on, AutoML was almost exclusively used for the automatic selection of the best-performing algorithms for a given task and for tuning the hyperparameters of said algorithms.
Yet AutoML can have a broader scope with later versions of auto-sklearn and tpot (and has). Its development has spurred the application of automation to the whole data-to-insights pipeline, from cleaning the data to tuning algorithms through feature selection and feature creation, even operationalization. At this larger scale, it’s no longer AutoML, but augmented analytics. Today, automated analytics can add efficiency to large swaths of the data pipeline, with the potential to impact the entire process and influence the structure of data teams long term.
“By 2025, 50% of data scientist activities will be automated by AI, easing the acute talent shortage.”
– Gartner, How Augmented Machine Learning Is Democratizing Data Science; Jim Hare, Carlie Idoine, Peter Krensky, 29 August 2019 (Report available to Gartner subscribers)
Since the early days of the product (all the way back to 2014), Dataiku has offered a visual AutoML suite that guides the user through all of the machine learning steps (train-test split, feature handling, metrics to optimize, and different templates of pre-set algorithms).
The interface offers a one-button option, simply called “Train” – this will automatically infer the feature handling, pre-select a collection of algorithms, and returns the best performing one. But of course, it’s still up to the user to tune those parameters and select the best possible settings based on their experience.
More specifically, Dataiku offers AutoML features for:
Data scientists are a critical piece of the AI puzzle, and they know what tools and technologies work best for any given task.
Read moreDataiku lets you deploy workflows easily with safe versioning and rollback. Then automate your deployments as part of a larger production strategy.
Learn MoreDeploy your ML models in production in one click. High performance and scalable scoring, deployable on the cloud with Kubernetes.
Learn MoreDataiku makes it easy to leverage machine learning technologies and get instant visual and statistical feedback on model performance.
Learn MoreGet instant insights from your data whatever their size or format, and share with your teams.
Learn More