It is easy to create a variety of useful charts in DSS, as you have already seen in Tutorial: Basics.
Let’s say that the Haiku T-Shirts company wants to understand more about their typical order size. They know from experience that most customers order a single shirt, but they do occasionally get larger orders. What they don’t know is whether these larger orders constitute a significant portion of their business, and whether certain categories of shirts are more likely to be ordered in larger quantities.
Working with the haiku_shirt_sales data, to create a chart:
The resulting chart shows us that 10 equal-width bins lose a lot of information, because all orders of 1-5 shirts are clumped together…
So let’s break the display of nb_tshirts down into raw values:
- Click on the nb_tshirts label and select None, use raw values.
- Create a filter to remove hoodies from the chart.
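To see why the switch from bins to raw values matters, here is a minimal sketch in plain Python, run outside DSS. The order data and category labels are invented for illustration and are not taken from haiku_shirt_sales, and the binning function is a simple stand-in, not the logic DSS itself uses.

```python
from collections import Counter

# Hypothetical orders as (category, nb_tshirts); invented for illustration.
orders = (
    [("Black T-Shirt F", 1)] * 50
    + [("White T-Shirt M", 1)] * 30
    + [("Hoodie", 1)] * 15
    + [("White T-Shirt M", 2)] * 10
    + [("Black T-Shirt F", 3)] * 5
    + [("Black T-Shirt F", 5)] * 3
    + [("White T-Shirt M", 12), ("Black T-Shirt F", 38)]
)

# Equivalent of the chart filter: drop hoodies before counting.
sizes = [n for category, n in orders if category != "Hoodie"]


def equal_width_bins(values, n_bins):
    """Count values in n_bins equal-width bins spanning min(values)..max(values)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for v in values:
        # Clamp so the maximum value lands in the last bin.
        idx = min(int((v - lo) / width), n_bins - 1)
        counts[idx] += 1
    return counts


binned = equal_width_bins(sizes, 10)  # what a 10-bin chart would plot
raw = Counter(sizes)                  # what "use raw values" would plot

print("10 equal-width bins:", binned)
print("raw values:", sorted(raw.items()))
```

With these made-up numbers, almost every order falls into the first bin, so orders of 1, 2 and 3 shirts cannot be told apart. The raw-value counts keep each order size separate.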
The vast majority of orders were for 1 shirt; from the perspective of number of orders, larger orders are not a significant portion of Haiku T-Shirts’ business. From the scale of the X axis, we can see that at least one person made an order of close to 40 t-shirts, but the count of such orders is too small to see on the chart, relative to the number of orders for 1 shirt.
To get a better view of the categories by order size:
- Click on the chart type selector and choose Stacked 100%.
- Add total to the Tooltip area. On the total dropdown, select Sum. This adds summary statistics to your tooltips.
Now we can easily see that the proportion of sales by category appears to differ by order size. By hovering over bars in the chart, we can see, for example, that while women’s black T-shirts account for a greater and greater proportion of sales as the order size increases from 1 to 5 shirts, the total value of the orders decreases.
Whether these visual differences represent a statistically significant pattern that the Haiku T-Shirts company can exploit is a question we’ll leave for further analysis, because there is always a next step in data science!
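The arithmetic behind the Stacked 100% view and the Sum tooltip can be sketched in plain Python. This is only an illustration: the rows and category labels are hypothetical, and it assumes each bar counts orders per category, which may not match how your chart is configured.

```python
from collections import defaultdict

# Hypothetical rows as (nb_tshirts, category, total order value).
rows = [
    (1, "Black T-Shirt F", 20.0),
    (1, "Black T-Shirt F", 20.0),
    (1, "White T-Shirt M", 20.0),
    (1, "White T-Shirt M", 20.0),
    (1, "White T-Shirt M", 20.0),
    (1, "White T-Shirt M", 20.0),
    (2, "Black T-Shirt F", 38.0),
    (2, "Black T-Shirt F", 38.0),
    (2, "White T-Shirt M", 38.0),
    (5, "Black T-Shirt F", 90.0),
]

counts = defaultdict(lambda: defaultdict(int))  # order size -> category -> number of orders
totals = defaultdict(float)                     # order size -> sum of total (the tooltip value)

for nb, category, total in rows:
    counts[nb][category] += 1
    totals[nb] += total


def stacked_100(counts_by_bar):
    """Rescale each bar so that its category segments add up to 100%."""
    shares = {}
    for bar, by_category in counts_by_bar.items():
        bar_total = sum(by_category.values())
        shares[bar] = {cat: 100.0 * n / bar_total for cat, n in by_category.items()}
    return shares


shares = stacked_100(counts)
for nb in sorted(shares):
    print(nb, shares[nb], "sum of total:", totals[nb])
```

Because every bar is rescaled to 100%, a category's share can grow from one order size to the next even while the summed order value for that size shrinks. That is why the tooltip total is worth checking alongside the proportions.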
DSS charts are portable. You can download them as an image (PNG) or an Excel document.
There are two places where you can create charts in DSS: in an Analysis and on a Dataset.
Both Analyses and Datasets give you control over which data your chart is created with – sampled or complete.
We strongly recommend that, unless you have a relatively small dataset, you use a sample for building interactive charts in Analyses. This is because an Analysis is intended for exploration and quick visual feedback, and thus always uses the in-memory DSS engine.
When building charts on a Dataset, however, you can also use an in-database or in-cluster engine, depending on the location of the original data. Look at the following page for additional information on sampling and engines for charts.
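As a rough illustration of the sampled-versus-complete choice, the sketch below draws two kinds of sample from an in-memory list in plain Python. The helper names `first_records` and `random_sample` are hypothetical, and this is not how DSS implements sampling. It only shows the general idea of charting a subset of the rows.

```python
import random


def first_records(rows, n):
    """Keep the first n rows: cheap, but biased if the data is ordered."""
    return rows[:n]


def random_sample(rows, n, seed=0):
    """Draw n rows without replacement; a fixed seed keeps the result repeatable."""
    rng = random.Random(seed)
    return rng.sample(rows, min(n, len(rows)))


# Hypothetical stand-in for a large dataset: row ids 0..99999.
rows = list(range(100_000))

head = first_records(rows, 10_000)
sample = random_sample(rows, 10_000)

print(len(head), head[:3])
print(len(sample), len(set(sample)))
```

A first-records sample is quick to build but only reflects the start of the data. A random sample is more representative but requires a pass over all rows.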