
VC: Everything about Scatter Plots

Jeheonpark
6 min read · Sep 2, 2020


Scatter plots are among the most popular visualization techniques. Their purpose is to reveal clusters and correlations between ‘pairs’ of variables. There are many variations of scatter plots, and we will look at some of them.

Strip Plots

Strip Plots, Seaborn Documentation

Scatter plots in which one attribute is categorical are called ‘strip plots’. If we plot the points of each category on a single line, they overlap and become hard to see, so we spread them slightly along the categorical axis (jitter), as in the figure above. We can also split the points within each category based on a given label.
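To make the idea of spreading the points concrete, here is a minimal sketch of jitter using only the Python standard library. The function name `jitter_positions`, the `width` parameter, and the sample `days` list are made up for illustration; they are not taken from Seaborn or from the figure above.

```python
import random

def jitter_positions(categories, width=0.2, seed=0):
    """Give each category an integer slot and each point a small random offset.

    Hypothetical helper for illustration only. `width` is the maximum
    horizontal offset on either side of the category's slot.
    """
    rng = random.Random(seed)  # seeded so the layout is reproducible
    slots = {}
    for label in categories:
        # First-seen order decides the slot: 0, 1, 2, ...
        slots.setdefault(label, len(slots))
    # Each point sits at its category slot plus a uniform offset in [-width, width]
    xs = [slots[label] + rng.uniform(-width, width) for label in categories]
    return xs, slots

# Made-up categorical column
days = ["Thu", "Fri", "Thu", "Sat", "Fri", "Thu"]
xs, slots = jitter_positions(days)
for day, x in zip(days, xs):
    print(f"{day}: x = {x:.3f}")
```

The resulting `xs` can be used as the horizontal coordinates in any plotting library. In Seaborn, `stripplot` exposes this behavior through its `jitter` parameter, and the split by label through `hue` together with `dodge`.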

Scatterplot Matrices (SPLOM)

Scatterplot Matrices

A SPLOM produces a scatterplot for every pair of variables and arranges them in a matrix. For p variables, there are (p²-p)/2 unique scatterplots. The diagonal is usually filled with a KDE or a histogram of each variable. As you can see, the scatterplots appear in a particular order. Does the order matter? It cannot change the values, of course, but it can affect how people perceive the data.
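The count (p²-p)/2 can be checked with a short sketch that enumerates the unordered pairs of variables. The variable names below are placeholders chosen for illustration, and `splom_pairs` is a hypothetical helper, not part of any plotting library.

```python
from itertools import combinations

def splom_pairs(variables):
    """Return every unordered pair of variables, one per unique scatterplot."""
    return list(combinations(variables, 2))

# Placeholder variable names (assumed for this example)
variables = ["sepal_length", "sepal_width", "petal_length", "petal_width"]
pairs = splom_pairs(variables)

p = len(variables)
# The number of unique off-diagonal panels matches (p^2 - p) / 2
assert len(pairs) == (p**2 - p) // 2

for x_var, y_var in pairs:
    print(x_var, "vs", y_var)
print(len(pairs))  # 6 for p = 4
```

The order of `variables` changes only the order in which the pairs are listed, not their number, which matches the point that ordering affects perception rather than values.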

Ordering matters. Image taken from [Peng et al. 2004]

Written by Jeheonpark

Jeheon Park, Software Engineer at Kakao in South Korea
