Grouped Box Plot
What is a grouped box plot?
Grouped box plots are used to display clustered data using five key sample statistics. The "box" itself represents the interquartile range, with a central line indicating the median (Q2), and its boundaries defining the lower first quartile (Q1) and upper third quartile (Q3). The "whiskers" which extend outwards, commonly represent either the full data range, Tukey's fences (calculated as 1.5 * IQR), or specific percentiles. By clustering the data in this way, grouped box plots enable clear side-by-side comparisons both between different groups and across their respective subcategories.
To dot plot or NOT to dot plot?
Box plots offer valuable insights into a dataset's underlying distribution, revealing symmetry, data concentration, skewness, and potential outliers. However, discerning the precise shape of the distribution from a box plot is often less straightforward than with a histogram, nor does it provide any information about the total number of observations.
This is why Graphmatik overlays a distribution dot plot on top of the generated box plot by default. You can of course toggle off these individual data points if you'd like.
toggle off
points, outliers will still be shown, as is standard with box-and-whisker plotsChart properties
Prop | Default | Description |
---|---|---|
central tendency | median | median The middle most value of a sorted set of numbers. |
whiskers | range | range The difference between the highest and lowest values within a set. 1.5 * Interquartile range (1.5*IQR) A range representing Q1 - 1.5 * IQR and Q3 + 1.5 * IQR. 2.5 percentile - 97.5 percentile (2.5-97.5 %tile) The difference between the 2.5 percentile and the 97.5 percentile, representing the middle 95% of a set. |
sort | none | none The clusters are arranged in insertion order. ascending Clusters are arranged from smallest to largest. descending Clusters are arranged from largest to smallest. |
group by | factor | factor Bars will be grouped by the selected factor, with the other factor defining the subgroups. |