• Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

Chebyshevs Theorem and The Empirical Rule

Extracts from this document...

Introduction

Chebyshev’s Theorem and The Empirical Rule

Suppose we ask 1000 people what their age is. If this is a representative sample then there will be very few people of 1-2 years old just as there will not be many 95 year olds. Most will have an age somewhere in their 30’s or 40’s. A list of the number of people of a certain age may look like this:

Age

Number of people

0

1

1

2

2

3

3

8

..

..

..

..

30

45

31

48

..

..

..

..

60

32

61

30

..

..

..

..

80

6

81

3

Next, we can turn this list into a scatter diagram with age on the horizontal axis and the number of people of a certain age on the vertical axis.

From the statistical point of view a scatter diagram may have two shapes.  

It may be shaped or at least looks approximately like a 'bell curve', which looks like this:

A 'bell curve' is perfectly symmetrical with respect to a vertical line through its peak and is sometimes called a "Gauss curve" or a "normal curve".

...read more.

Middle

 years which simplifies to a range of 22 to 58 years.At least 93.75% of all the ages will lie in the range of image20.png.
In our case this means that at least 93.75% of the people will have an age in the range of
image21.png years which simplifies to a range of 16 to 64 years.At least 96% of all the ages will lie in the range of image22.png.
In our case this means that at least 96% of the people will have an age in the range of
image23.png years which simplifies to a range of 10 to 70 years.At least 97.2% of all the ages will lie in the range of image12.png.
In our case this means that at least 97.2% of the people will have an age in the range of
image01.png years which simplifies to a range of 4 to 76 years.

How can we calculate these percentages? To calculate the 75%, the 88.9%, the 93.75%, etc, we look at the number of standard deviations in the respective intervals. The 75% goes together with 'mean ± 1 standard deviation', the 88.9% with 'mean ± 2 standard deviations', the 93.

...read more.

Conclusion

image17.png. The same is true for the difference between the mean and the lower limit of this interval. According to the table above this coincides with 96%.

The Empirical Rule

When the data values seem to have a normal distribution, or approximately so, we can use a much easier theorem than Chebyshev’s.

The "empirical rule" states that in cases where the distribution is normal, the following statements are true:

  • Approximately 68% of the data values will fall within 1 standard deviation of the mean.
  • Approximately 95% of the data values will fall within 2 standard deviations of the mean.
  • Approximately 99.7% of the data values will fall within 3 standard deviations of the mean.

Example 3:

The average salary for graduates entering the actuarial field is $60,000. If the salaries are normally distributed with a standard deviation of $5000, then what percentage of the graduates will have a salary between $50,000 and $70,000?

Solution:

Both $50,000 and $70,000 are $10,000 away from the mean of $60,000. This is two standard deviations away from the mean, so 95% of the graduates will have a salary in this interval.

© 2008 UMUC – European Division

Ron Souverein and Nada Wray

...read more.

This student written piece of work is one of many that can be found in our AS and A Level Probability & Statistics section.

Found what you're looking for?

  • Start learning 29% faster today
  • 150,000+ documents available
  • Just £6.99 a month

Not the one? Search for your essay title...
  • Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

See related essaysSee related essays

Related AS and A Level Probability & Statistics essays

  1. "The lengths of lines are easier to guess than angles. Also, that year 11's ...

    -2 4 4 45 8 27.8 -19.8 392.04 5 35 22.5 10 12.5 156.25 4.5 40 14 16.5 -2.5 6.25 5 45 22.5 27.8 -5.3 28.09 6 40 30.5 16.5 14 196 TOTALS -19.8 392.04 Now, to find out the correlation I will substitute the values for the year 11

  2. Estimating the length of a line and the size of an angle.

    The total sample size for the number of males in year 11 is 145/227 * 30 = 19 Quarters Males Number of pupils in the sample 1st quarter 35 35/145 * 19 = 5 2nd quarter 32 32/145 * 19 = 4 3rd quarter 32 32/145 * 19 = 4

  1. find out if there is a connection between people's IQ and their average KS2 ...

    I will stratify by the school years and genders; this will give me how many pieces of data I need to collect from each year and from each gender. Each of the groups (years and genders) needs to be fairly represented in my sample if I am going to avoid

  2. I want to find out if there is a connection between people's IQ and ...

    First of all I calculated the mean, I did this on Excel using a simple equation. =AVERAGE(L12:L61) and =AVERAGE(M12:M61) I can see where the mean result and IQ would be on my scatter graph. Quite a few students are around that point and it looks as if there are an equal amount of students on either side of the mean.

  1. The case is about the Monetta Financial Services Company, an investment house.

    * Similarly, minimum and maximum price appreciation for the IPOs allocated to directors was 12.5% and 68.8% respectively. While minimum and maximum price for IPOs allocated to fund clients were 0% and 69.4% that represents that the range is much wider for IPOs allocated to fund clients.

  2. "Males in the 11-18years age range will guess the angles and lengths better than ...

    Gender Age Angle est (deg) Length est (cm) 70 f 61 30 4 74 f 56 40 3 75 f 55 40 5.1 77 f 54 37 5 79 f 51 30 3.5 80 f 50 40 3 83 f 49 45 5 84 f 49 45 3.5 85 f 48 43 3.3 87 f

  1. Identifying Relationships -Introduction to Statistical Inference.

    Are the RV PACKAGES and the factor TYPE related in any way? Are they dependent? Investigating relationships If there was no relationship between the RV and the factor i.e. acceptance of package customers was completely independent of type of hotel, you would expect the percentage of respondent's in each package

  2. The average pupil.

    I will therefore create a problems/solutions table in order to help me during any difficult moments within the investigation. Problem Possible cause of problem Solution I have a large amount of data, which consists of a mass of numbers (i.e.

  • Over 160,000 pieces
    of student written work
  • Annotated by
    experienced teachers
  • Ideas and feedback to
    improve your own work