How is Standard Deviation UsedMore Properties of Sigma in Statistics
The standard deviation function is widely used in statistics. The alternative formula for sigma is described, along with how to combine two or more standard deviations.
Standard deviation may be calculated for a sample without calculating the sample mean first. This is useful when calculating the standard deviation of a large sample. The difference between the sample standard deviation and the population standard deviation, and how to combine two or more standard deviation values are described. Alternative Standard Deviation CalculationThe article What is Standard Deviation describes how to calculate the standard deviation of the numbers 1, 2, 3, 4, and 5 using the conventional equation. The standard deviation was calculated as 1.414 There is an alternative method that gives exactly the same result: 1. Square each number: 1 4 9 16 25 2. Sum the squares: 1 + 4 + 9 + 16 + 25 = 55. This is the sum of squares. 3. Sum the original numbers 1 + 2 + 3 + 4 + 5 = 15. This is the sum 4. Square the sum: 15 x 15 = 225. This is the square of the sum. 5. Divide the square of the sum by the number of data points: 225 / 5 = 45. This is the average square. 6. Subtract the average square from the sum of squares: 55 - 45 = 10. 7. Divide this number by the number of samples: 10 / 5 = 2 8. Take the square root: square root of 2 = 1.414, as before. Sample vs Population Standard DeviationIn the example shown, the entire population was only five numbers. When a sample is taken from a much larger population, and when the sample size is small, the population standard deviation is underestimated. The formula in step 7 then changes slightly. Instead of dividing by the number of the sample, the sum is divided by the number in the sample minus one: 7. Divide this number by the number in the sample minus 1: 10 / ( 5 - 1) = 2.5 The standard deviation is now the square root of 2.5 = 1.581, and it is called the sample standard deviation. Pooled Standard DeviationSometimes it is necessary to combine two standard deviations. For example, if the standard deviation of the height of women in Minnesota is 5 cm, and the standard deviation of the height of men in Minnesota is 6 cm, what is the overall standard deviation of the population of Minnesota? The key to this is that standard deviations should not be added directly. Variances are added, where variance is the standard deviation squared: Overall variance = Variance for men + Variance for women So, overall variance = SD(men) x SD (men) + SD(women) + SD(women) = 6 x 6 + 5 x 5 = 36 + 25 = 61 So the overall standard deviation is the square root of 61 = 7.8 cm Combining two standard deviations in this way is valid when the mean of the two samples is roughly the same. To be strictly accurate, the pooled variance is the variance between the samples added to the variance within the samples. Summary of Standard DeviationThere is an alternative, easy to use formula for standard deviation. The standard deviation of samples underestimates the standard deviation of the population, and needs to be adjusted. Standard deviations are combined by adding the variances. References for Basic StatisticsThe article What is Standard Deviation describes how to calculate the basic standard deviation. There are many good references available, but Statistics For Dummies is a good starting point for beginners.
The copyright of the article How is Standard Deviation Used in Engineering is owned by Martin Bell. Permission to republish How is Standard Deviation Used in print or online must be granted by the author in writing.
CommentsMar 17, 2009 10:20 PM
Guest :
Mar 18, 2009 2:59 AM
Martin Bell :
2 Comments
Related Articles
Related Topics
Reference
More in Technology
|