Dataiku
Learn
Samples
Predict the risk of customer churn

Predict the risk of customer churn

View this sample project to learn how to segment your customer base and predict the risk of churn in Dataiku

We’re a major telecom operator. Just like pretty much any company in the world, we are concerned with keeping our customers happy, so they won’t leave us. In other words, we want to reduce churn. To do this, we set up a task force of data analysts and people from our business teams who came up with several business goals to reduce churn.

Business Goal

Get to know our customers better, by accessing the data about their plans and usage, and getting in touch with interesting profiles
Target clients with more effective advertising based on their usage profiles
Retrieve customers with very high likeliness of churn so we could get in touch and offer them special deals before they even thought of leaving

How Did We Do This ?

We had our data science team collect historic data from users on their phone usage, and work on creating features from very large log files. They specified which clients had churned.

They also built the same features for our current clients, so we could deploy the model and predict who would churn.

Because we wanted to do more than just answer the yes no question of “will they churn,” we decided to build two models instead of one:

A first model that segments our customers into relevant groups (by using clustering algorithms), for targeting.
A second model that uses these segments (clusters) to predict the churn likeliness of each unlabeled customer (by using classification algorithms), so that business units can then check scores on a daily basis and target these customers.

Dashboard

Look at the visual insights we built to monitor churn and understand our customers behavior. We updated these as we went along by adding graph steps to our preparation scripts. What could be a predictive business intelligence.

EXPLORE !

Flow

Check out the few steps of data preparation and machine learning that are needed for this advanced analytics operation. You'll notice cleaning recipes (in yellow), and the 2 models in green.

EXPLORE !

Visual Preparation

Look at the data preparation script to clean the customer data and create new features.

EXPLORE !

Clustering model

Read here how we created our first model to segment our customers, and then deployed on our current customers' data AND on our historical data.

EXPLORE !

Churn model

Understand how we then worked on our second algorithm to predict churn behavior.

EXPLORE !