This article is a guide on sample standard deviation, including concepts, a step-by-step process to calculate it, and a list of examples.
In This Article
Standard deviation measures the spread of data relative to its mean. You can think of standard deviation as the average distance between a single data point and the mean. A large standard deviation shows that data is more widely dispersed about the mean, and a small standard deviation shows that data are tightly clustered around the mean. You can calculate standard deviation by taking the square root of the variance.
In statistics, a population refers to the entire set of objects or events being studied, and a sample is a subset of the population. As an example, imagine you are studying national elections in the United States. Your population of interest consists of every single eligible voter across the 50 states. Because you canβt collect data for the entire population, you draw random samples (subsets) of voters and make inferences about the population from these samples. Sometimes statisticians have data for an entire population. Most of the time, however, they have to work with samples.
When you are dealing with population data and want to calculate a standard deviation, use the population standard deviation formula given above. A population standard deviation is denoted by the lowercase Greek letter sigma, π.
When you are dealing with sample data and want to calculate a standard deviation, use the sample standard deviation formula given above. A sample standard deviation is denoted by the lowercase letter s.
We calculate standard deviation as the square root of the variance. This may cause you to wonder: why do we use standard deviation when we already have variance? Great question!
To answer this question, first notice that in both the equation for variance and the equation for standard deviation, you take the squared deviation (the squared distances) between each data point and the sample mean ( x i β x Λ ) 2 (x_i-\bar)^2 ( x i β β x Λ ) 2 . You do this so that the negative distances between the mean and the data points below the mean do not cancel out the positive distances between the mean and data points above it.
By squaring deviations, we convert each deviation into a positive number, and from there, you can measure average dispersion. There is a downside to doing this, though. By squaring the deviations, you square the units of the data as well. Imagine your data is measured in ounces. By taking squared deviations, you are now measuring the dispersion in your data in terms of ounces squared β a unit of measurement that is difficult to interpret.
Standard deviation is intended to be a standardized measure of dispersion. By taking the square root of the variance, we convert the statistic back into the same units as the data. Ounces, not ounces squared!
Say you have the following sample data for temperatures measured in degrees Fahrenheit. For simplicity, this sample just has four observations (n=4).
Sample Data |
Using the formula for sample standard deviation, letβs go through a step-by-step example of how to find the standard deviation for this sample.
s = β ( x i β x Λ ) 2 n β 1 s = \sqrt^<>(x_i-\bar)^2>> s = n β 1 β β ( x i β β x Λ ) 2 β\fracCalculate the sample mean xΜ .
x Λ = 51 + 58 + 61 + 62 4 = 58 Β° F \barCalculate the deviation between each data point and the sample mean ( x i β x Λ ) (x_i-\bar) ( x i β β x Λ ) .
51 β 58 = β 7 51-58 = - 7 51 β 58 = β 7 58 β 58 = 0 58-58 = 0 58 β 58 = 0 61 β 58 = 3 Β° F 61-58 = 3Β°F 61 β 58 = 3Β° F 62 β 58 = 4 Β° F 62-58 = 4Β°F 62 β 58 = 4Β° FNotice that there are both positive and negative deviations.
Square the deviations, ( x i β x Λ ) 2 (x_i-\bar)^2 ( x i β β x Λ ) 2 .
( β 7 ) 2 = 49 \left(-7\right)^2=49 ( β 7 ) 2 = 49 0 2 = 0 0^2 = 0 0 2 = 0 3 2 = 9 3^2 = 9 3 2 = 9 4 2 = 16 4^2 =16 4 2 = 16By squaring the deviations, we turn all of the deviations positive.
Sum the squared distances, β ( x i β x Λ ) 2 \sum(x_i-\bar)^2 β ( x i β β x Λ ) 2 .
49 + 0 + 9 + 16 = 74 49+0+9+16 = 74 49 + 0 + 9 + 16 = 74Divide the sum of the squares by the number of data points minus one, β ( x i β x Λ ) 2 ( n β 1 ) \dfrac<\sum(x_i-\bar
Notice that this is the variance, s 2 s^2 s 2 , and it is measured in degrees Fahrenheit squared!
Take the square root of the variance.
s = β ( x i β x Λ ) 2 n β 1 = 24.6 6 Λ = 4.97 s = \sqrt^<>(x_i-\bar)^2>> = \sqrt> = 4.97 s = n β 1 β β ( x i β β x Λ ) 2 β\frac
And there is your sample standard deviation. 4.97 Β°F.
Outlier (from the co-founder of MasterClass) has brought together some of the world's best instructors, game designers, and filmmakers to create the future of online college.
Check out these related courses:
How data describes our world.
How data describes our world.
Why small choices have big impact.
Why small choices have big impact.
How money moves our world.
How money moves our world.
The science of the mind.
The science of the mind.