Know how many numbers are in your sample. In the case of the sample of palm trees, there are 5 in this sample. Know what the numbers represent. In our example, these numbers represent measurements of trees. Look at the variation in the numbers. Does the data vary across a large range, or a small range?
The mean is the average of all the numbers in your sample. To calculate this you will add all the numbers in your sample together, then divide by the sample size. In mathematical notation, n represents the sample size. In the case of our sample of tree heights, n = 5 since there are 5 numbers in this sample.
For example, using the sample of 5 palm trees, our sample consists of 7, 8, 8, 7. 5, and 9. 7 + 8 + 8 + 7. 5 + 9 = 39. 5. This is the sum of all the numbers in your sample. Check your answer to make sure you did your addition correctly.
For example, use our sample of tree heights: 7, 8, 8, 7. 5, and 9. There are 5 number in our sample so n = 5. The sum of tree heights in our sample was 39. 5. You would then divide this figure by 5 to figure out the mean. 39. 5/5 = 7. 9. The mean tree height is 7. 9 feet. The population mean is often represented by the symbol μ, therefore μ = 7. 9
This calculation will give you an idea about how far your data is spread out. Samples with low variance have data that is clustered closely about the mean. Samples with high variance have data that is spread far from the mean. Variance is often used to compare the distributions between two data sets or samples.
In our sample of tree heights (7, 8, 8, 7. 5, and 9 feet) the mean was 7. 9. 7 - 7. 9 = -0. 9, 8 - 7. 9 = 0. 1, 8 - 7. 9 = 0. 1, 7. 5 - 7. 9 = -0. 4, and 9 - 7. 9 = 1. 1. Do these calculations again to check your math. It is extremely important that you have the right figures for this step.
Remember, in our sample we subtracted the mean of 7. 9 from each of our data points (7, 8, 8, 7. 5, and 9) and came up with the following: -0. 9, 0. 1, 0. 1, -0. 4, and 1. 1. Square all of these figures: (-0. 9)^2 = 0. 81, (0. 1)^2 = 0. 01, (0. 1)^2 = 0. 01, (-0. 4)^2 = 0. 16, and (1. 1)^2 = 1. 21. The squares from this calculation are: 0. 81, 0. 01, 0. 01, 0. 16, and 1. 21. Check your answers before proceeding to the next step.
In our sample of tree heights, the squares were as follows: 0. 81, 0. 01, 0. 01, 0. 16, and 1. 21. 0. 81 + 0. 01 + 0. 01 + 0. 16 + 1. 21 = 2. 2 For our example of tree heights, the sum of squares is 2. 2. Check your addition to make sure that you have the right figure before moving on.
In our sample of tree heights (7, 8, 8, 7. 5, and 9 feet), the sum of squares was 2. 2. There are 5 numbers in this sample. Therefore n = 5. n - 1 = 4 Remember the sum of squares is 2. 2. To find the variance, calculate the following: 2. 2 / 4. 2. 2 / 4 = 0. 55 Therefore the variance for this sample of tree heights is 0. 55.
Variance is how spread out your data is from the mean or mathematical average. Standard deviation is a figure that represents how spread out your data is in your sample. In our sample of tree heights, the variance was 0. 55.
In our sample of tree heights, the variance was 0. 55. √0. 55 = 0. 741619848709566. You will often get a very large decimal figure when you calculate this step. It is ok to round to the second or third decimal place for your standard deviation figure. In this case, you could use 0. 74. Using a rounded figure, the standard deviation in our sample of tree heights is 0. 74
Write down all the steps you took when you did your calculations. This will allow you to see where you made a mistake, if any. If you come up with different figures for mean, variance, and standard deviation during your check, repeat the calculations looking at your process carefully.
Remember, a z-score is a measure of how many standard deviations a data point is away from the mean. In the formula X represents the figure you want to examine. For example, if you wanted to find out how many standard deviations 7. 5 was from the mean in our example of tree heights, you would plug in 7. 5 for X in the equation. In the formula, μ stands for the mean. In our sample of tree heights the mean was 7. 9. In the formula, σ stands for the standard deviation. In our sample of tree heights the standard deviation was 0. 74.
For example, in our sample of tree heights we want to find out how many standard deviations 7. 5 is from the mean of 7. 9. Therefore, you would perform the following: 7. 5 - 7. 9. 7. 5 - 7. 9 = -0. 4. Double check that you have the correct mean and subtraction figure before you proceed.
In our sample of tree heights, we want the z-score for the data point 7. 5. We already subtracted the mean from 7. 5, and came up with a figure of -0. 4. Remember, the standard deviation from our sample of tree heights was 0. 74.
- 0. 4 / 0. 74 = - 0. 54 Therefore the z-score in this case is -0. 54. This z-score means that 7. 5 is -0. 54 standard deviations away from the mean in our sample of tree heights. Z-scores can be both positive and negative numbers. A negative z-score indicates that the data point is less than the mean, and a positive z-score indicates the data point in question is larger than the mean.