
Chebyshev’s Theorem and The Empirical Rule


Suppose we ask 1000 people their age. If the sample is representative, there will be very few people aged 1 or 2, just as there will not be many 95-year-olds. Most will be somewhere in their 30s or 40s. A list of the number of people of each age may look like this:

Age     Number of people
0       1
1       2
2       3
3       8
..      ..
30      45
31      48
..      ..
60      32
61      30
..      ..
80      6
81      3

Next, we can turn this list into a scatter diagram with age on the horizontal axis and the number of people of a certain age on the vertical axis.

From a statistical point of view, a scatter diagram like this may have one of two shapes.

It may be shaped like, or at least look approximately like, a 'bell curve'.

A 'bell curve' is perfectly symmetrical with respect to a vertical line through its peak and is sometimes called a "Gauss curve" or a "normal curve".


... years, which simplifies to a range of 22 to 58 years.

At least 93.75% of all the ages will lie in the range of mean ± 4 standard deviations. In our case this means that at least 93.75% of the people will have an age in the range of 40 ± 4 × 6 years, which simplifies to a range of 16 to 64 years.

At least 96% of all the ages will lie in the range of mean ± 5 standard deviations. In our case this means that at least 96% of the people will have an age in the range of 40 ± 5 × 6 years, which simplifies to a range of 10 to 70 years.

At least 97.2% of all the ages will lie in the range of mean ± 6 standard deviations. In our case this means that at least 97.2% of the people will have an age in the range of 40 ± 6 × 6 years, which simplifies to a range of 4 to 76 years.
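The ranges quoted above are all centred on 40 and widen in steps of 6 years, which is consistent with a mean age of 40 and a standard deviation of 6. Those two values are inferred from the ranges rather than stated in this extract, so treat them as assumptions. Under that reading, a minimal Python sketch (the function name is illustrative) reproduces the ranges:

```python
def chebyshev_interval(mean, sd, k):
    """Return the interval (mean - k*sd, mean + k*sd)."""
    return mean - k * sd, mean + k * sd


MEAN_AGE = 40  # assumption: midpoint of every range quoted in the text
SD_AGE = 6     # assumption: inferred from how the ranges widen

for k in (3, 4, 5, 6):
    low, high = chebyshev_interval(MEAN_AGE, SD_AGE, k)
    print(f"mean ± {k} standard deviations: {low} to {high} years")
```

With these assumed values, k = 3, 4, 5 and 6 give 22 to 58, 16 to 64, 10 to 70 and 4 to 76 years respectively.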

How can we calculate these percentages? To calculate the 75%, the 88.9%, the 93.75%, etc., we look at the number of standard deviations in the respective intervals. The 75% goes together with 'mean ± 2 standard deviations', the 88.9% with 'mean ± 3 standard deviations', the 93...
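Chebyshev's Theorem says that, for any k greater than 1, at least 1 − 1/k² of the data values lie within k standard deviations of the mean. As a rough sketch (the function name is only illustrative), the percentages quoted in the text can be checked like this:

```python
def chebyshev_bound(k):
    """Minimum fraction of values within k standard deviations of the mean."""
    if k <= 1:
        raise ValueError("Chebyshev's bound is only informative for k > 1")
    return 1 - 1 / k**2


for k in range(2, 7):
    print(f"mean ± {k} standard deviations: at least {chebyshev_bound(k):.2%}")
```

For k = 2, 3, 4, 5 and 6 this gives 75%, about 88.9%, 93.75%, 96% and about 97.2%, matching the figures used for the age example.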


[...] The same is true for the difference between the mean and the lower limit of this interval. According to the table above this coincides with 96%.
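The reasoning here runs in reverse: start from an interval, express the distance from the mean to each limit in standard deviations, and then look up the matching percentage. The numbers of the original example are not available in this extract, so the sketch below instead reuses the 10 to 70 year age range, with a mean of 40 and a standard deviation of 6 taken as assumptions:

```python
def k_from_interval(mean, sd, low, high):
    """Number of standard deviations from the mean to each interval limit."""
    k_upper = (high - mean) / sd
    k_lower = (mean - low) / sd
    if abs(k_upper - k_lower) > 1e-9:
        raise ValueError("interval is not symmetric about the mean")
    return k_upper


# Assumed values: mean age 40, standard deviation 6, interval 10 to 70 years.
k = k_from_interval(mean=40, sd=6, low=10, high=70)
bound = 1 - 1 / k**2  # Chebyshev's lower bound for this k
print(f"k = {k}, at least {bound:.0%} of the values lie in the interval")
```

Both limits are 5 standard deviations from the mean, which corresponds to at least 96%.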

The Empirical Rule

When the data values appear to be normally distributed, or approximately so, we can use a rule that is much easier to apply than Chebyshev’s Theorem and that gives sharper estimates.

The "empirical rule" states that in cases where the distribution is normal, the following statements are true:

  • Approximately 68% of the data values will fall within 1 standard deviation of the mean.
  • Approximately 95% of the data values will fall within 2 standard deviations of the mean.
  • Approximately 99.7% of the data values will fall within 3 standard deviations of the mean.
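These three percentages can be checked against the normal distribution itself. For a normal variable, the probability of falling within k standard deviations of the mean is erf(k/√2). A short sketch using only Python's standard library (the function name is illustrative):

```python
import math


def normal_within(k):
    """Probability that a normal value lies within k standard deviations of its mean."""
    return math.erf(k / math.sqrt(2))


for k in (1, 2, 3):
    print(f"within {k} standard deviation(s): {normal_within(k):.2%}")
```

The results are roughly 68.27%, 95.45% and 99.73%, which the empirical rule rounds to 68%, 95% and 99.7%.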

Example 3:

The average salary for graduates entering the actuarial field is $60,000. If the salaries are normally distributed with a standard deviation of $5,000, what percentage of the graduates will have a salary between $50,000 and $70,000?

Solution:

Both $50,000 and $70,000 are $10,000 away from the mean of $60,000. That is two standard deviations ($10,000 / $5,000 = 2) on either side of the mean, so approximately 95% of the graduates will have a salary in this interval.
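The same solution can be sketched in Python by converting each limit into a number of standard deviations from the mean (a z-score) and then, as a cross-check, computing the share of a normal distribution between those limits. The variable and function names are illustrative only:

```python
import math

mean_salary = 60_000
sd_salary = 5_000

# Distance of each limit from the mean, in standard deviations.
z_low = (50_000 - mean_salary) / sd_salary
z_high = (70_000 - mean_salary) / sd_salary


def normal_cdf(z):
    """Cumulative probability of the standard normal distribution at z."""
    return 0.5 * (1 + math.erf(z / math.sqrt(2)))


share = normal_cdf(z_high) - normal_cdf(z_low)
print(z_low, z_high, f"{share:.2%}")
```

The limits sit at −2 and +2 standard deviations, and the computed share is a little over 95%, in line with the empirical rule. Without the normality assumption, Chebyshev's Theorem would only guarantee at least 75% for the same interval.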

© 2008 UMUC – European Division

Ron Souverein and Nada Wray
