• Join over 1.2 million students every month
• Accelerate your learning by 29%
• Unlimited access from just £6.99 per month
Page
1. 1
1
2. 2
2
3. 3
3
4. 4
4
5. 5
5
6. 6
6
7. 7
7
8. 8
8
9. 9
9
10. 10
10
11. 11
11
12. 12
12
13. 13
13

# My aim is that within the limits of a small-scale survey I will collect sample data of a population, and by using estimation techniques I will determine the population's parameters (such as the mean and the variance).

Extracts from this document...

Introduction

Mathematics Coursework – Statistics.

S1 Task A: Measurements.

Aim.

My aim is that within the limits of a small-scale survey I will collect sample data of a population, and by using estimation techniques I will determine the population’s parameters (such as the mean and the variance). My population is smarties, and in this investigation I am looking at the individual weight of random smarties, which will be my sample. I decided to stick with weight, as it is a property that will vary a lot, I think, and so I hope will prove an interesting investigation. An important factor to help me decide on how large my sample should be is that the size of the sample must be quite small, because it is stated so in my aim. However, to make accurate estimates of population parameters the sample must be large enough.

Therefore to help me decide on the size of my sample, I have accordingly looked at the Central Limit Theorem, which states that:

• If the sample size is large enough, the distribution of the sample mean is approximately Normal.
• The variance of the distribution of the sample mean is equal to the variance of the sample mean divided by the sample size.

The

Middle

1.00

49

0.99

448

0.98

2

0.97

2

0.96

4

0.95

02344555779

0.94

19

0.93

24499

0.92

12

0.91

35

0.90

16

0.89

38

0.88

0.87

0.86

7

Although not necessary, I thought it would be somewhat useful to depict my sample data onto a stem and leaf diagram. Other information about the sample includes the lowest value, which is 0.867g, the highest is 1.110g, and the range is 0.243g.

Sample Parameters.

Mean.

Using the total sum of the fifty smarties and dividing it by fifty to obtain the mean.

Variance.

The formula for variance states that you take the ‘Mean of the squares minus the square of the mean’.

Standard Deviation.

The standard deviation is found by finding the square root of the variance.

Population Parameters.

Estimate of the Mean of the population of smarties.

The mean is an unbiased estimator, that is, the mean of its distribution is equal to the mean of the parent population. For this reason it can be used as an estimator for the mean of the population of smarties. As the mean of my sample is 0.976, then an estimate of the mean of the population of smarties is therefore:

Estimate of the Variance of the population of smarties.

The variance of the sample is a biased estimator. A biased estimator is one for which the mean of its distribution is not equal to the population value it is estimating. Therefore it must be converted to an unbiased estimator, by multiplying the sample variance by the number of smarties.

Conclusion

Possible Extension.

A statistical analysis of entire tubes of smarties could be carried out. The actual weight of the smarties could be compared to the price on the tube to determine whether the manufacturers are lying about how much smartie there is in their packets. Also similar investigations looking at how many smarties per packet, average weights of packets, etc.

Weighing smarties of different colours could also be done to find if there are any differences between them. Or even counting how many smarties of different colours you get in different packets. But yet again an investigation like this would be harder to carry out, as you would need at least fifty packets of smarties to carry out a ‘small scale’ investigation…

Also, a larger sample size could be taken to determine the mean and variance more accurately, a lot more accurately in fact.

Lastly, I could have extended my confidence interval calculations; I could have included a 99% confidence of the mean varying only ± 0.001g, which would have shown I would have needed a massive sample, possibly over 20,000 to get that much confidence in such a small interval.

Charles Mallah         Mathematics Coursework (Statistics)        Deadline 18-03-02

This student written piece of work is one of many that can be found in our AS and A Level Probability & Statistics section.

## Found what you're looking for?

• Start learning 29% faster today
• 150,000+ documents available
• Just £6.99 a month

Not the one? Search for your essay title...
• Join over 1.2 million students every month
• Accelerate your learning by 29%
• Unlimited access from just £6.99 per month

# Related AS and A Level Probability & Statistics essays

1. ## The aim of this investigation was to look at the reliability and validity of ...

For a trait theory to be acceptable as a personality theory it must firstly isolate basic traits which describe personality and measure them accurately. This is attempted using a process called Factor Analysis. A pioneer in this field was Raymond Cattell (1965).

2. ## Statistics coursework

IQ Frequency Cumulative Frequency Percentage of total 60<IQ<70 1 1 1.49 70<IQ<80 1 2 2.99 80<IQ<90 4 6 8.96 90<IQ<100 15 21 31.34 100<IQ<110 40 61 91.04 110<IQ<120 6 67 100 120<IQ<130 0 67 100 130<IQ<140 0 67 100 - IQ of girls in year 11 (Table 5)

1. ## Aim: in this task, you will investigate the different functions that best model the ...

Although the population is unlikely to have the same number of babies born each year, it is a better model to use as it doesn't change suddenly - a sudden population change would only occur in times of war etc., which no model would be able to predict.

2. ## Standard addition was used to accurately quantify for quinine in an unknown urine sample ...

Both classes of substances have delocalised ?-electrons that can be placed in low-lying excited singlet states. In polycyclic aromatic systems where the number of ?-electrons available is greater than in benzene, these compounds and their derivatives are usually much more fluorescent than benzene and its derivatives.

1. ## I have been given the task of finding what affects the price of a ...

Predictions * For age I believe there will be a very strong negative correlation as the older the car gets the lower the price. * For MPG I believe there will be a weak positive correlation as the higher the MPG the higher the price but I believe it doesn't affect it that much.

2. ## Statistics Coursework

Using all of these diagrams I will then compare all of the students' attendance for each year. Then I will also analyse all of these graphs and diagrams and actually come to a conclusion that tells me all the information I need (e.g.

1. ## Design an investigation to see if there is a significant relationship between the number ...

may not be as accurate as they would be if more precise equipment was used. For example, I am only measuring the lengths of the fronds accurately to the nearest mm. If I measured to a greater degree of accuracy, my results would become more reliable; however I am limited by the range of equipment available to me.

2. ## Anthropometric Data

shows that all the points lie generally on an upward diagonal and also shows that there is an outlier. Outlier An outlier can occur due the basic linear relationship between (x) and (y), a single outlier occur in the (x)

• Over 160,000 pieces
of student written work
• Annotated by
experienced teachers
• Ideas and feedback to
improve your own work