Skip to main content
  1. Grade 11 Mathematics/
  2. Statistics/

Variance, Standard Deviation & Data Representation

Table of Contents

The Logic of Dispersion
#

In Grade 10 you learned to summarise data with the mean, median, and mode — measures of central tendency (where the data clusters). But knowing the centre isn’t enough. Two classes can both average 60% on a test, yet one class has everyone between 55% and 65% while the other has marks scattered from 20% to 100%.

Dispersion measures tell you how spread out the data is. Grade 11 introduces two precise measures: the variance and the standard deviation.


1. Why Not Just Use the Range?
#

The range ($\text{max} - \text{min}$) is a simple measure of spread, but it only uses two data points. One extreme value changes the range dramatically while telling you nothing about what’s happening in between.

We need a measure that uses every data point. The natural idea: measure how far each point is from the mean, then average those distances.

The Problem with Simple Averages of Deviations
#

If you calculate $x - \bar{x}$ for every data point and add them up, you always get zero — the positive deviations cancel the negative ones perfectly. (This is a mathematical property of the mean.)

$x$$x - \bar{x}$
4$-4$
7$-1$
8$0$
10$+2$
11$+3$
Sum0

The fix: Square every deviation first. Squaring makes everything positive and also penalises large deviations more heavily (which is what we want — a point 10 units away is more concerning than two points each 5 units away).


2. Variance ($\sigma^2$)
#

The variance is the mean of the squared deviations:

$$\boxed{\sigma^2 = \frac{\sum (x_i - \bar{x})^2}{n}}$$
SymbolMeaning
$x_i$Each individual data value
$\bar{x}$The mean of all data values
$n$The number of data values
$\sigma^2$The variance

The variance tells you the average squared distance from the mean. Its unit is the square of the original unit (e.g., if data is in cm, variance is in cm²).


3. Standard Deviation ($\sigma$)
#

To get back to the original units, take the square root of the variance:

$$\boxed{\sigma = \sqrt{\frac{\sum (x_i - \bar{x})^2}{n}}}$$

What it means in plain English: On average, each data point sits about $\sigma$ units away from the mean.

Interpreting Standard Deviation
#

$\sigma$ valueWhat it means
Small $\sigma$Data is tightly clustered around the mean — consistent
Large $\sigma$Data is widely spread — variable, less predictable
$\sigma = 0$Every data value is identical (no spread at all)

The 68% Rule: In roughly symmetric data, about 68% of the data falls within one standard deviation of the mean: $[\bar{x} - \sigma;\; \bar{x} + \sigma]$.


4. Worked Example — Standard Deviation by Hand
#

Data: $4;\; 7;\; 8;\; 10;\; 11$ ($n = 5$)

Step 1 — Calculate the mean:

$$\bar{x} = \frac{4 + 7 + 8 + 10 + 11}{5} = \frac{40}{5} = 8$$

Step 2 — Build the deviation table:

$x_i$$x_i - \bar{x}$$(x_i - \bar{x})^2$
4$-4$16
7$-1$1
8$0$0
10$+2$4
11$+3$9
Sum030

Check: The deviations column must sum to 0. If it doesn’t, your mean is wrong.

Step 3 — Variance:

$$\sigma^2 = \frac{30}{5} = 6$$

Step 4 — Standard deviation:

$$\sigma = \sqrt{6} \approx 2.45$$

Interpretation: On average, each data point is about 2.45 units away from the mean of 8.


5. Standard Deviation from a Frequency Table
#

When data is given with frequencies, the formula adjusts to account for repeated values:

$$\boxed{\sigma = \sqrt{\frac{\sum f_i(x_i - \bar{x})^2}{\sum f_i}}}$$

The mean is: $\bar{x} = \frac{\sum f_i x_i}{\sum f_i}$

Worked Example — Frequency Table
#

Test scores for 20 learners:

Score ($x_i$)Frequency ($f_i$)$f_i \cdot x_i$
326
5525
7856
9436
10110
Total20133

Step 1 — Mean: $\bar{x} = \frac{133}{20} = 6.65$

Step 2 — Squared deviations × frequency:

$x_i$$f_i$$x_i - \bar{x}$$(x_i - \bar{x})^2$$f_i(x_i - \bar{x})^2$
32$-3.65$$13.3225$$26.645$
55$-1.65$$2.7225$$13.6125$
78$0.35$$0.1225$$0.98$
94$2.35$$5.5225$$22.09$
101$3.35$$11.2225$$11.2225$
Total2074.55

Step 3 — Variance: $\sigma^2 = \frac{74.55}{20} = 3.7275$

Step 4 — Standard deviation: $\sigma = \sqrt{3.7275} \approx 1.93$


6. Grouped Data (Class Intervals)
#

When data is given in intervals (e.g., 10–20, 20–30, …), you don’t know the exact values. Use the midpoint of each class as $x_i$:

$$\text{midpoint} = \frac{\text{lower boundary} + \text{upper boundary}}{2}$$

Worked Example — Grouped Data
#

Heights of 50 Grade 11 learners:

Height (cm)Midpoint ($x_i$)Frequency ($f_i$)$f_i \cdot x_i$
140–1501454580
150–160155121860
160–170165203300
170–180175101750
180–1901854740
Total508230

Mean: $\bar{x} = \frac{8230}{50} = 164.6$ cm

Now calculate $\sum f_i(x_i - \bar{x})^2$:

$x_i$$f_i$$(x_i - 164.6)^2$$f_i(x_i - 164.6)^2$
1454$384.16$$1536.64$
15512$92.16$$1105.92$
16520$0.16$$3.20$
17510$108.16$$1081.60$
1854$416.16$$1664.64$
Total505392.00
$$\sigma = \sqrt{\frac{5392}{50}} = \sqrt{107.84} \approx 10.38 \text{ cm}$$

Interpretation: The heights are spread about 10.4 cm either side of the mean on average.

⚠️ Grouped data gives an approximation — we assumed all values in each class are at the midpoint. The true $\sigma$ may differ slightly.


7. Histograms, Frequency Polygons & Ogives
#

Histograms
#

  • Class intervals on the x-axis, frequency on the y-axis
  • Bars touch (no gaps) — the data is continuous
  • The tallest bar = the modal class (the interval with the highest frequency)
  • Bar width equals the class width

Frequency Polygons
#

  • Plot the midpoint of each class against its frequency
  • Connect the points with straight lines
  • Extend to the x-axis one class interval before and after the data (so the polygon closes)

Ogives (Cumulative Frequency Curves)
#

An ogive shows the running total of frequencies. It answers the question: “How many data values are less than or equal to this boundary?”

How to draw an ogive:

  1. Create a cumulative frequency column (running total)
  2. Plot cumulative frequency against the upper boundary of each class
  3. Start with the point (lower boundary of first class, 0)
  4. Connect with a smooth S-shaped curve

Worked Example — Reading an Ogive
#

Using the height data from Section 6:

Upper boundaryCumulative frequency
1400
1504
16016
17036
18046
19050

Reading values from the ogive:

  • Median ($Q_2$): at $\frac{50}{2} = 25$ → read across from 25 on the y-axis to the curve → read down to get approximately 164 cm
  • $Q_1$: at $\frac{50}{4} = 12.5$ → approximately 158 cm
  • $Q_3$: at $\frac{3 \times 50}{4} = 37.5$ → approximately 172 cm
  • IQR = $172 - 158 = 14$ cm

Exam tip: When a question says “use the ogive to estimate the median,” draw horizontal and vertical dashed lines on the graph and label the coordinates. Show your working on the graph — marks are awarded for this.


8. Comparing Two Datasets
#

A very common exam question: “Compare the performance of Class A and Class B.”

The Strategy
#

Compare…Using…
Centre (who did better overall?)Mean or median
Spread (who was more consistent?)Standard deviation or IQR

Worked Example — Comparing Classes
#

Class A: $\bar{x} = 62$, $\sigma = 8$

Class B: $\bar{x} = 62$, $\sigma = 15$

Centre: Both classes have the same mean (62%), so their overall performance was the same.

Spread: Class A has a much lower standard deviation ($\sigma = 8$ vs $\sigma = 15$), so Class A’s marks were more consistent (clustered closer to the mean). Class B had a wider range of marks — some very high, some very low.

Another Example — Different Means
#

Team X: $\bar{x} = 45$, $\sigma = 5$

Team Y: $\bar{x} = 52$, $\sigma = 12$

Centre: Team Y scored higher on average (52 vs 45).

Spread: Team X was more consistent ($\sigma = 5$ vs $\sigma = 12$). Team Y had higher scores overall but with much more variation between players.

Exam phrasing: “The data for [X] is more consistent/spread out than [Y] because the standard deviation is lower/higher.”


9. Symmetric vs Skewed Data
#

The shape of the data distribution tells you about the relationship between mean and median.

ShapeMean vs MedianBox plot clueHistogram clue
SymmetricMean ≈ MedianMedian centred in box, whiskers roughly equalBell-shaped, mirror image
Positively skewed (right)Mean > MedianMedian closer to $Q_1$, long right whiskerTail stretches to the right
Negatively skewed (left)Mean < MedianMedian closer to $Q_3$, long left whiskerTail stretches to the left

Why the mean shifts: The mean is pulled toward extreme values (outliers/tail), while the median stays in the middle of the ranked data. So in positively skewed data, a few very high values drag the mean above the median.

Which Measure to Use?
#

Data shapeBest centre measureBest spread measure
SymmetricMeanStandard deviation
SkewedMedianIQR

10. Outliers
#

An outlier is a data value that is unusually far from the rest. The formal test:

$$\boxed{x < Q_1 - 1.5 \times IQR \quad \text{or} \quad x > Q_3 + 1.5 \times IQR}$$

Where $IQR = Q_3 - Q_1$.

Worked Example — Detecting Outliers
#

Data: $2;\; 15;\; 18;\; 20;\; 22;\; 23;\; 25;\; 27;\; 28;\; 30;\; 65$

$Q_1 = 18$, $Q_3 = 28$, $IQR = 10$

Lower fence: $18 - 1.5(10) = 3$

Upper fence: $28 + 1.5(10) = 43$

Any value below 3 or above 43 is an outlier.

  • $x = 2$: below 3 → outlier
  • $x = 65$: above 43 → outlier

Impact of Outliers on Statistics
#

MeasureAffected by outliers?
MeanYes — dragged toward the outlier
MedianNo — only depends on the middle value(s)
Standard deviationYes — outliers inflate squared deviations
IQRNo — only depends on $Q_1$ and $Q_3$
RangeYes — completely determined by extremes

Exam tip: If a question asks “which measure of central tendency is more appropriate?”, check for skewness or outliers. If present, the median is better than the mean.


🚨 Common Mistakes
#

MistakeWhy it’s wrongFix
Using $s_x$ instead of $\sigma_x$ on the calculator$s_x$ is the sample standard deviation (divides by $n-1$); school maths uses $\sigma_x$ (divides by $n$)Always select $\sigma_x$ in STAT mode
Forgetting to multiply by frequencyIn a frequency table, each $(x - \bar{x})^2$ must be weighted by its frequencyAlways include the $f_i$ column in your table
Forgetting the square root$\sigma^2$ is the variance; $\sigma$ is the standard deviationThe last step is always $\sigma = \sqrt{\sigma^2}$
Using midpoints as exact valuesGrouped data uses midpoints as estimates, not exact valuesAcknowledge “estimated” in your answer
Reading ogive at the wrong axisThe median is at cumulative frequency $\frac{n}{2}$, not at $x = \frac{n}{2}$Start on the y-axis, go across to the curve, then down to the x-axis
Confusing skewness direction“Positively skewed” means the tail goes to the right (positive direction)Remember: the name matches the direction of the tail, not the peak
Stating “data is consistent” without referencing $\sigma$You must mention the standard deviation value to justify a consistency claim“The data is more consistent because $\sigma = 5$ is lower than $\sigma = 12$”

💡 Pro Tips for Exams
#

1. Calculator Method
#

Use STAT mode: enter all data → select 1-VAR stats → read $\bar{x}$ and $\sigma_x$ directly. The by-hand method is for understanding and for “show your working” questions; the calculator is for speed.

2. The “Deviation Sum = 0” Check
#

After calculating all your deviations ($x_i - \bar{x}$), add them up. If the sum isn’t 0, your mean is wrong. Fix it before continuing.

3. Comparing Datasets — The Two-Sentence Template
#

When asked to compare two datasets, always make two statements:

  1. Centre: “Class A performed better/worse on average because [mean/median] = …”
  2. Spread: “Class A was more/less consistent because [σ/IQR] = …”

This guarantees full marks on comparison questions.

4. Reading Ogives Accurately
#

Always draw dashed lines on the ogive when reading values. Start from the y-axis (cumulative frequency), draw a horizontal line to the curve, then a vertical line down to the x-axis. Label both coordinates.

🔗 Related Grade 11 topics:


🏠 Back to Statistics

Related

The Exponential Graph (Grade 11)

Master exponential growth and decay, the horizontal asymptote, sketching, finding the equation, and understanding why the graph behaves the way it does — with full worked examples.

The Hyperbola (Grade 11)

Master the shifted hyperbola — understand WHY asymptotes exist, how every parameter affects the graph, how to sketch from the equation, find the equation from a graph, determine lines of symmetry, and solve intersection problems — with full worked examples.