A data visualisation consists of data symbols (and guides) and labels.
We create a data symbol by mapping data values to the visual features of a shape.
Which visual feature we choose depends on:
We have so far focused on a single mapping from data values to a visual feature (and back).
All data visualisations employ multiple visual features, but we have so far assumed that those features operate independently.
We have also largely assumed that we are only interested in mapping back to the raw data.
In this section we will begin to consider more complex mappings between data values and visual features.
Describing Mappings
Visual Summaries
Redundant Mappings
Independent Visual Features
Visual Features that Interact
Other Types of Interaction
Describing Mappings
The RWCperGame
data frame contains measures of
performance at the Rugby World Cup of 2023 for different countries, plus
the hemisphere
that each country is from.
## # A tibble: 6 × 11
## country hemisphere yellowcards redcards cleanbreaks
## <chr> <fct> <dbl> <dbl> <dbl>
## 1 Namibia South 1 0.5 2.5
## 2 Romania North 1.25 0 2.75
## 3 Chile South 1.25 0 3.75
## 4 Samoa South 1.25 0.25 3.75
## 5 Australia South 0.5 0 5.25
## 6 Georgia North 0.5 0 5.25
## # ℹ 6 more variables: tackles <dbl>, points <dbl>,
## # conversions <dbl>, offloads <dbl>, tries <dbl>,
## # runs <dbl>
The simplest case involves mapping one data value from one variable to one visual feature of one data symbol.
The number of cleanbreaks
for one
country maps to the position of one
point.
cleanbreaks
for one
country maps to the position of one
point.The simplest case involves mapping one data value from one variable (a) to one visual feature (x) of one data symbol.
The simplest case involves mapping back from the visual feature to the raw data.
The position of each point allows us to compare
the cleanbreaks
of different countries.
cleanbreaks
per game.cleanbreaks
.cleanbreaks
compared to others.Visual Summaries
We can also perceive visual summaries from a visual feature:
cleanbreaks
.cleanbreaks
.cleanbreaks
.We can map back from a visual feature to the raw data.
We can also map back from visual summaries to data statistics.
An effective data visualisation will produce visual summaries of the data.
An effective visual summary relies on rapid parallel processing of multiple basic data symbols and features.
An effective visual summary does not require conscious attention to each individual data symbol.
Visual summaries provide less precision, but more detail than a typical mathematical summary.
Redundant Mappings
We can map one data value from one variable to two visual features of one data symbol.
The number of cleanbreaks
for one
country maps to the position and
colour of one point.
cleanbreaks
for one
country maps to the position and
colour of one point.We can map one data value from one variable (a) to two visual features (x and y) of one data symbol.
A redundant mapping provides multiple paths from the visual features back to the raw data.
It can be effective to map a data value to multiple visual features.
Independent Visual Features
We can map data values from two variables to two visual features of a data symbol.
The cleanbreaks
and the hemisphere
from
one country maps to the position and
colour of one point.
cleanbreaks
and the hemisphere
from
one country maps to the position and
colour of one point.We can map data values from two variables (a and b) to two visual features (x and y) of one data symbol.
Position and colour do not interact.
We can perceive the positions of points independently of the colour of the points.
An effective data visualisation allows each individual visual feature to map back to the corresponding data values.
Visual Features that Interact
cleanbreaks
and the tries
from one
country map to the horizontal and vertical
position of one point.cleanbreaks
and the tries
from one
country map to the horizontal and vertical
position of one point.Position in space provides visual summaries.
cleanbreaks
and tries
.Some combinations of visual features can be effective by producing additional visual features.
crimeEthnicity
data frame contains the crime
rate
and the pop
ulation for different
ethnicity
s over multiple year
s.## ethnicity year count prop yearDate
## 1 Māori 2011 5957 46.451965 2011-06-30
## 2 Pasifika 2011 1092 8.515284 2011-06-30
## 3 Asian 2011 243 1.894885 2011-06-30
## 4 MELAA 2011 99 0.771990 2011-06-30
## 5 Other 2011 58 0.452277 2011-06-30
## 6 European 2011 5375 41.913600 2011-06-30
pop
and rate
to the
horizontal and vertical length (width and height) of
rectangles.pop
and rate
to the
horizontal and vertical length (width and height) of
rectangles.count
.Some combinations of visual features can be effective by producing additional visual features.
Additional visual features map back to data statistics.
We may not be able to map back to the raw data.
Other Types of Interaction
country
and
cleanbreaks
?An effective data visualisation combines visual features that generate additional visual features only on purpose.
We want to combine visual features that interact when the interaction is useful.
We want to combine visual features that are independent when interaction is not useful.
Gestalt proximity says that items that are close together are seen as a group.
Discretised data values can create columns or rows of data symbols.
Summary
We can perceive visual summaries that map back to data statistics.
When we combine visual features, the features may act independently, or the features may interact.
Interactions between features can produce additional visual features.
We want to select visual features that interact only when we want to produce the additional visual effects.
The interaction between visual features can be sensitive to context.
Exercises
The RWCperGame
data frame contains measures of
performance at the Rugby World Cup of 2023 for different countries, plus
the hemisphere
that each country is from.
We can also calculate the conversionRate
for each
country.
conversionRate
country
with another.tries
?## country tries conversionRate
## 11 Namibia 0.75 0.6666667
## 14 Romania 1.00 0.7500000
## 3 Chile 1.00 0.5000000
## 15 Samoa 2.75 0.7272727
## 2 Australia 2.75 0.6363636
## 7 Georgia 1.75 0.5714286
## [1] 0.3820937
Comparing one country
with another.
Is there a correlation with the number of tries
?