• Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month
Page
  1. 1
    1
  2. 2
    2
  3. 3
    3
  4. 4
    4
  5. 5
    5
  6. 6
    6
  7. 7
    7
  8. 8
    8
  9. 9
    9
  10. 10
    10
  11. 11
    11
  12. 12
    12
  13. 13
    13

My aim is that within the limits of a small-scale survey I will collect sample data of a population, and by using estimation techniques I will determine the population's parameters (such as the mean and the variance).

Extracts from this document...

Introduction

Mathematics Coursework – Statistics.

S1 Task A: Measurements.

Aim.

My aim is that within the limits of a small-scale survey I will collect sample data of a population, and by using estimation techniques I will determine the population’s parameters (such as the mean and the variance). My population is smarties, and in this investigation I am looking at the individual weight of random smarties, which will be my sample. I decided to stick with weight, as it is a property that will vary a lot, I think, and so I hope will prove an interesting investigation. An important factor to help me decide on how large my sample should be is that the size of the sample must be quite small, because it is stated so in my aim. However, to make accurate estimates of population parameters the sample must be large enough.

Therefore to help me decide on the size of my sample, I have accordingly looked at the Central Limit Theorem, which states that:

  • If the sample size is large enough, the distribution of the sample mean is approximately Normal.
  • The variance of the distribution of the sample mean is equal to the variance of the sample mean divided by the sample size.

The

...read more.

Middle

1.00

49

0.99

448

0.98

2

0.97

2

0.96

4

0.95

02344555779

0.94

19

0.93

24499

0.92

12

0.91

35

0.90

16

0.89

38

0.88

0.87

0.86

7

Although not necessary, I thought it would be somewhat useful to depict my sample data onto a stem and leaf diagram. Other information about the sample includes the lowest value, which is 0.867g, the highest is 1.110g, and the range is 0.243g.


Sample Parameters.

image00.png

Mean.

Using the total sum of the fifty smarties and dividing it by fifty to obtain the mean.

image01.png

Variance.

The formula for variance states that you take the ‘Mean of the squares minus the square of the mean’.

image12.png

Standard Deviation.

The standard deviation is found by finding the square root of the variance.

image13.png


Population Parameters.

Estimate of the Mean of the population of smarties.

The mean is an unbiased estimator, that is, the mean of its distribution is equal to the mean of the parent population. For this reason it can be used as an estimator for the mean of the population of smarties. As the mean of my sample is 0.976, then an estimate of the mean of the population of smarties is therefore:

image14.png

Estimate of the Variance of the population of smarties.

The variance of the sample is a biased estimator. A biased estimator is one for which the mean of its distribution is not equal to the population value it is estimating. Therefore it must be converted to an unbiased estimator, by multiplying the sample variance by the number of smarties.

image15.png

...read more.

Conclusion

Possible Extension.

A statistical analysis of entire tubes of smarties could be carried out. The actual weight of the smarties could be compared to the price on the tube to determine whether the manufacturers are lying about how much smartie there is in their packets. Also similar investigations looking at how many smarties per packet, average weights of packets, etc.

Weighing smarties of different colours could also be done to find if there are any differences between them. Or even counting how many smarties of different colours you get in different packets. But yet again an investigation like this would be harder to carry out, as you would need at least fifty packets of smarties to carry out a ‘small scale’ investigation…

Also, a larger sample size could be taken to determine the mean and variance more accurately, a lot more accurately in fact.

Lastly, I could have extended my confidence interval calculations; I could have included a 99% confidence of the mean varying only ± 0.001g, which would have shown I would have needed a massive sample, possibly over 20,000 to get that much confidence in such a small interval.

Charles Mallah         Mathematics Coursework (Statistics)        Deadline 18-03-02

...read more.

This student written piece of work is one of many that can be found in our AS and A Level Probability & Statistics section.

Found what you're looking for?

  • Start learning 29% faster today
  • 150,000+ documents available
  • Just £6.99 a month

Not the one? Search for your essay title...
  • Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

See related essaysSee related essays

Related AS and A Level Probability & Statistics essays

  1. Aim: in this task, you will investigate the different functions that best model the ...

    Although the population is unlikely to have the same number of babies born each year, it is a better model to use as it doesn't change suddenly - a sudden population change would only occur in times of war etc., which no model would be able to predict.

  2. Standard addition was used to accurately quantify for quinine in an unknown urine sample ...

    As can be seen from the structure of quinine (fig.1), quinine is a polycyclic aromatic compound and as such, quinine can be estimated by fluorescence spectroscopy at levels as low as 0.1�g/cm3 Changes in the system pH, if it has an effect on the charge status of the chromaphore, may influence pH.

  1. GCSE Mathematics Coursework: Statistics Project

    Fig 3 is a scatter graph showing the relationship between the average amount of TV that boys watch and their weight. There is a very weak correlation, as the correlation coefficient, r, is only -0.04. Unlike my hypothesis which predicted that the more hours of television watched, the bigger the

  2. The aim of this investigation was to look at the reliability and validity of ...

    be soft, round, relaxed and extrovert, Mesomorphs who were athletic and had aggressive tendencies and finally Ectomorphs who were thin, frail and introverted. Due to the complexity of the personality, psychologists moved away from this kind of theory to concentrate on trait theories.

  1. Statistics coursework

    - IQ of girls in year 7 (Table 3) IQ Frequency Cumulative Frequency Percentage of total 60<IQ<70 0 0 0 70<IQ<80 0 0 0 80<IQ<90 3 3 5.17 90<IQ<100 10 13 22.41 100<IQ<110 33 46 79.31 110<IQ<120 9 55 94.83 120<IQ<130 2 57 98.28 130<IQ<140 1 58 100 - IQ of boys in year 7 (Table 4)

  2. Anthropometric Data

    Has it is not possible for a child who is 2 years and 3 months to have a foot length of 154 (mm) and 69 (mm) foot breadth. Has research shows that a normal 2 year old should be a size 7 in socks.

  1. Chebyshevs Theorem and The Empirical Rule

    For instance, for k = 2.5 we get the result that in the interval years Example 1: Students Who Care is a student volunteer program in which college students donate work time in community centers for homeless people. Professor Gill is the faculty sponsor for this student volunteer program.

  2. &amp;quot;The lengths of lines are easier to guess than angles. Also, that year 11's ...

    of the line was, had the highest frequency density, but was not the most densely populated. The year 11 data shows that not many people guessed in the correct group as it is not very dense. Cumulative frequency tables group the data so you can see how much the data has gone up from group to group.

  • Over 160,000 pieces
    of student written work
  • Annotated by
    experienced teachers
  • Ideas and feedback to
    improve your own work