What is the distribution of the sample means?
- For any given population it can often be difficult or impractical to find the true value of the population mean, µ
- The population could be too large to collect data using a census or
- Collecting the data could compromise the individual data values and therefore taking a census could destroy the population
- Instead, the population mean can be estimated by taking the mean from a sample from within the population
- If a sample of size is taken from a population, , and the mean of the sample, is calculated then the distribution of the sample means, , is the distribution of all values that the sample mean could take
- If the population, , has a normal distribution with mean, µ , and variance, σ2 , then the mean expected value of the distribution of the sample means, would still be µ but the variance would be reduced
- Taking a mean of a sample will reduce the effect of any extreme values
- The greater the sample size, the less varied the distribution of the sample means would be
- The distribution of the means of the samples of size taken from the population, will have a normal distribution with:
- Mean, = µ
- Standard deviation
- For a random variable the distribution of the sample mean would be
- The standard deviation of the distribution of the sample means depends on the sample size,
- It is inversely proportional to the square root of the sample size
- This means that the greater the sample size, the smaller the value of the standard deviation and the narrower the distribution of the sample means
A random sample of 10 observations is taken from the population of the random variable and the sample mean is calculated as . Write down the distribution of the sample mean, .
- Look carefully at the distribution given to determine whether the variance or the standard deviation has been given.
Does the distribution of the sample means always follow a normal distribution?
- If the variable for the population follows a normal distribution then the sample mean distribution also follows a normal distribution
- If the variable for the population does not follow a normal distribution then the sample mean distribution does not necessarily follow a normal distribution
- If the sample size is small then can not be assumed to follow a normal distribution
- If the sample size is big enough then we can use the Central Limit Theorem to approximate using a normal distribution
What is the central limit theorem?
- If a random sample of size n is taken from a population with mean µ and variance then the Central Limit Theorem states that can be approximated by the normal distribution provided n is large enough
- Notice the variance is still divided by the size of the sample
- We usually say n is large enough if it is at least 30
- This is a powerful theorem as it allows us to use the normal distribution for even when the population itself does not follow a normal distribution
- If the population follows a normal distribution then the Central Limit Theorem is not needed as will be normal automatically
- This is important as you might be asked whether the Central Limit Theorem was needed
The integers 1 to 29 are written on counters and placed in a bag. The expected value when one is picked at random is 15 and the variance is 70. Susie randomly picks 40 integers, returning the counter after each selection.
Find the probability that the mean of Susie’s 40 numbers is less than 13. Explain whether it was necessary to use the Central Limit Theorem in your calculation.
- If asked to explain whether it was necessary to use the Central Limit Theorem, check whether the population follows a normal distribution, if it does not then check the size of the sample. If your answer is yes comment on both of these things.