Chebyshev’s Theorem and The Empirical Rule


Suppose we ask 1000 people their age. If this is a representative sample, there will be very few people aged 1 or 2, just as there will not be many 95-year-olds. Most will have an age somewhere in their 30s or 40s. A list of the number of people of each age may look like this:

Age    Number of people
0      1
1      2
2      3
3      8
..     ..
..     ..
30     45
31     48
..     ..
..     ..
60     32
61     30
..     ..
..     ..
80     6
81     3

Next, we can turn this list into a scatter diagram with age on the horizontal axis and the number of people of a certain age on the vertical axis.

From the statistical point of view a scatter diagram may have two shapes.  

It may be shaped like, or at least look approximately like, a 'bell curve'.

A 'bell curve' is perfectly symmetrical with respect to a vertical line through its peak and is sometimes called a "Gauss curve" or a "normal curve".

...read more.

years which simplifies to a range of 22 to 58 years.

At least 93.75% of all the ages will lie in the range of mean ± 4 standard deviations. In our case this means that at least 93.75% of the people will have an age in the range of 40 ± 4 × 6 years, which simplifies to a range of 16 to 64 years.

At least 96% of all the ages will lie in the range of mean ± 5 standard deviations. In our case this means that at least 96% of the people will have an age in the range of 40 ± 5 × 6 years, which simplifies to a range of 10 to 70 years.

At least 97.2% of all the ages will lie in the range of mean ± 6 standard deviations. In our case this means that at least 97.2% of the people will have an age in the range of 40 ± 6 × 6 years, which simplifies to a range of 4 to 76 years.
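The percentages and ranges above follow from Chebyshev's bound: at least 1 − 1/k² of the values lie within k standard deviations of the mean, for any k greater than 1. The sketch below reproduces the figures under the assumption that the mean age is 40 and the standard deviation is 6. These two values are not stated in this extract; they are inferred from the quoted ranges (for example, 22 to 58 is 40 ± 3 × 6). The function names are illustrative only.

```python
def chebyshev_lower_bound(k: float) -> float:
    """Minimum fraction of values within k standard deviations of the mean."""
    if k <= 1:
        raise ValueError("Chebyshev's bound is only informative for k > 1")
    return 1 - 1 / k**2


def chebyshev_interval(mean: float, sd: float, k: float) -> tuple[float, float]:
    """Return the interval from mean - k*sd to mean + k*sd."""
    return mean - k * sd, mean + k * sd


# Assumed values, inferred from the ranges quoted in the text.
MEAN_AGE = 40
SD_AGE = 6

for k in range(2, 7):
    low, high = chebyshev_interval(MEAN_AGE, SD_AGE, k)
    bound = chebyshev_lower_bound(k)
    print(f"k = {k}: at least {bound:.2%} of the ages lie between {low} and {high}")
```

Note that the bound holds for any distribution, whatever its shape, which is why the guaranteed percentages are fairly conservative.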

How can we calculate these percentages? To calculate the 75%, the 88.9%, the 93.75%, etc., we look at the number of standard deviations in the respective intervals. The 75% goes together with 'mean ± 2 standard deviations', the 88.9% with 'mean ± 3 standard deviations', the 93.

...read more.

[formula image not recoverable]. The same is true for the difference between the mean and the lower limit of this interval. According to the table above, this coincides with 96%.
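The reasoning here runs in the opposite direction: given an interval that is symmetric about the mean, divide the distance from the mean to either limit by the standard deviation to obtain k, then apply 1 − 1/k². The sketch below illustrates this with hypothetical inputs. The mean of 40, standard deviation of 6, and interval of 10 to 70 are assumptions borrowed from the ages example, since the worked example this passage belongs to is not included in the extract.

```python
import math


def chebyshev_bound_for_interval(mean: float, sd: float,
                                 lower: float, upper: float) -> tuple[float, float]:
    """Return (k, minimum fraction) for an interval symmetric about the mean."""
    k_upper = (upper - mean) / sd   # distance from mean to upper limit, in sd units
    k_lower = (mean - lower) / sd   # distance from lower limit to mean, in sd units
    if not math.isclose(k_upper, k_lower):
        raise ValueError("the interval must be symmetric about the mean")
    if k_upper <= 1:
        raise ValueError("Chebyshev's bound is only informative for k > 1")
    return k_upper, 1 - 1 / k_upper**2


# Hypothetical inputs taken from the ages example (assumed mean 40, sd 6).
k, bound = chebyshev_bound_for_interval(mean=40, sd=6, lower=10, upper=70)
print(f"k = {k:g}, so at least {bound:.0%} of the values lie in the interval")
```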

The Empirical Rule

When the data values seem to have a normal distribution, or approximately so, we can use a rule that is much easier to apply than Chebyshev’s theorem.

The "empirical rule" states that in cases where the distribution is normal, the following statements are true:

  • Approximately 68% of the data values will fall within 1 standard deviation of the mean.
  • Approximately 95% of the data values will fall within 2 standard deviations of the mean.
  • Approximately 99.7% of the data values will fall within 3 standard deviations of the mean.
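The three percentages in the rule are rounded values of the normal distribution. As a minimal sketch, the exact fraction within k standard deviations of the mean can be computed from the error function as erf(k / √2). The function name `fraction_within` is illustrative only.

```python
import math


def fraction_within(k: float) -> float:
    """Fraction of a normal distribution within k standard deviations of the mean."""
    return math.erf(k / math.sqrt(2))


for k in (1, 2, 3):
    print(f"within {k} standard deviation(s): {fraction_within(k):.2%}")
```

For comparison, Chebyshev's theorem guarantees only 75% within 2 standard deviations and about 88.9% within 3, because it makes no assumption about the shape of the distribution.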

Example 3:

The average salary for graduates entering the actuarial field is $60,000. If the salaries are normally distributed with a standard deviation of $5,000, what percentage of the graduates will have a salary between $50,000 and $70,000?

Solution:

Both $50,000 and $70,000 are $10,000 away from the mean of $60,000. This is two standard deviations away from the mean, so approximately 95% of the graduates will have a salary in this interval.
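The solution can be checked numerically. The sketch below converts both salary limits to a number of standard deviations from the mean and then uses `NormalDist` from Python's standard `statistics` module to compute the share of a normal distribution between them. The variable and function names are illustrative only.

```python
from statistics import NormalDist

MEAN_SALARY = 60_000
SD_SALARY = 5_000

salaries = NormalDist(mu=MEAN_SALARY, sigma=SD_SALARY)


def z_score(salary: float) -> float:
    """Number of standard deviations a salary lies from the mean."""
    return (salary - MEAN_SALARY) / SD_SALARY


z_low = z_score(50_000)
z_high = z_score(70_000)

# Share of graduates between the two limits under the normal model.
share = salaries.cdf(70_000) - salaries.cdf(50_000)

print(f"z-scores: {z_low:+.1f} and {z_high:+.1f}")
print(f"share between the limits: {share:.2%}")
```

The computed share is slightly above 95%, which is why the empirical rule is stated with "approximately".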

© 2008 UMUC – European Division

Ron Souverein and Nada Wray
