Line Chart
What is a line chart?
A line chart is designed to display information as a series of data points connected by straight line segments, making it excellent at showcasing trends in data over time. Typically, the x-axis represents the independent variable (often time), while the y-axis or dependent variable represents the values being measured. The individual data points are plotted on the graph, and the lines between them illustrate the progression or relationship between the variables.
Line charts are versatile and can reveal trends with just a glance. This makes them one of the most popular and adaptable forms of data visualization.
First-class support for error visualization
It's important to provide an accurate image of the data by displaying any uncertainty. That's why Graphmatik line charts automatically calculate necessary summary statistics and support error visualization by default.
Graphmatik lets you customize your error visuals allowing you to plot area fills or error bars.
Y-axis: To cut or NOT to cut?
Cutting the y-axis, means setting the lower bound of the y-axis above zero. To say this is "SPICY drama" in the data vizualization community would be an understatement.
The argument against, rightly points out, that cutting the axis can exaggerate differences and potentially mislead viewers into perceiving larger changes than actually exist. While the argument for suggest time-series (line charts) are specifically designed to show trends and variations relative to a baseline and NOT absolute changes with respect to zero.
The concensus can best be summarized by Edward Tufte, an expert in data visualization.
“In general, in a time-series, use a baseline that shows the data not the zero point..." - Edward Tufte
Graphmatik's default behaviour is to cut the axis. Whether starting the y-axis at zero is appropriate for your data depends on the specific context of the data.
Tips for creating beautiful line and area charts
Chart properties
Prop | Default | Description |
---|---|---|
central tendency | mean | mean The sum of a set of values divided by the number of values in the set. median The middle most value of a sorted set of numbers. |
error | SEM | standard error of the mean (SEM) How much the sample means vary from the population mean. mean standard deviation (SD) A measure of the variation of a set of values around their mean. mean 95% confidence interval (95% CI) 95% probability that the population parameter lies within this range. mean or median range The difference between the highest and lowest values within a set. mean or median Interquartile range (IQR) The middle 50% of a set of values (i.e. 3rd quartile - 1st quartile).median |
error format | areas | areas Error shown as filled areas between dashed bounds. bars Visualize error as a set of upper and lower error bars. |
points | all | all Show individual data points as dots. leading Show ONLY the leading data point as a dot. false Hide all individual data points. |
line | linear | linear Data points are connected by straight line segments. curved Data points are connected by curved line segments. step before Staircase-like line segments. The y-value changes before the x-value. step after Staircase-like line segments. The y-value changes after the x-value. |