The Data Literacy Cheat Sheet: Charts: Which One Should You Use?
The Data Literacy Cheat Sheet: Charts: Which One Should You Use?
The Data Literacy Cheat Sheet: Charts: Which One Should You Use?
Comparing multiple values Displaying the composition of a whole Showing the relationship between
sets
Circular
Column Bar area Area Waterfall Pie
Line Scatter
Bullet Stacked
Line Scatter Stacked
bar column
REMEMBER! Averages:
The five characteristics Which one should you use?
of good data:
An average, also referred to as a “Measure of central tendency”, is a value that attempts to
identify the central position within a set of data. Mean, Median and Mode are types of
Credible
average.
Actionable
MEAN Does your data have a continuous distribution that’s relatively symmetrical? Use MEAN
Unbiased
(often referred to as just ‘average’).
Statistically relevant
MEDIAN Does your data contain significant outliers? Use MEDIAN - it's less influenced by this.
Easy to interpret
MODE represents the most common value in a dataset. If you’re dealing with Nominal data
(non-numeric categories like “industry vertical”), MODE is the only appropriate average to use.
CHARTING TIPS
Weighted average
A weighted average is a type of MEAN, where some values in the data set are given more influence than others. Each value to
be averaged is given a weight, representing the importance of that value in the average.
Weighted averages are important when you are dealing with frequencies or distributions, or when working with data that’s
unequal in some way.
CHARTING TIPS
Data correlation
Correlation is a statistical relationship between two data sets. Correlation can have a numeric value on a scale from -1 to 1.
A POSITIVE correlation is present when both values increase together, whereas in a NEGATIVE correlation, one value decreases
as the other increases.
e
ativ osit
ive
neg ct p on
fect n tion e CHARTING TIPS
Per relatio rela rf
Pe relati
Name cor
cor No cor Correlation is not causation!
-1 -0.5 0 0.5 1 A strong correlation between two data sets
Value Negative correlation Positive correlation does not necessarily mean that one thing
causes the other (causation). There could
be other reasons the data has a strong
Typical correlation.
Chart
Glossary
QUALITATIVE DATA is descriptive - it describes something, e.g. Reason for customer cancellation.
QUANTITATIVE DATA is always numerical (involves numbers), e.g. Revenue lost from customer cancellations.
DISCRETE DATA can only take certain values (like whole numbers), e.g. Number of customers churned.
CONTINUOUS DATA can take any value, within a given range, e.g. Customer churn rate.
CATEGORICAL DATA can be sorted, according to defined groups or categories, e.g. Industry vertical.
STATISTICAL SIGNIFICANCE is when the observed outcome of an experiment is unlikely to have occurred due to chance.
This is an important factor when running multi-variant (A/B) tests on your product or website.
Determining statistical significance can be complex. We recommend using a free tool such as AB Testguide: https://abtestguide.com/calc
More resources
The Data Viz Checklist by Stephanie Evergreen: http://stephanieevergreen.com/wp-content/uploads/2016/10/DataVizChecklist_May2016.pdf
Graph Choice Chart by Tuva Labs: https://tuvalabs.com/static/documents/Graph_Choice_Chart.pdf
Try the World's Best Subscription Analytics App for FREE at chartmogul.com