• Join over 1.2 million students every month
• Accelerate your learning by 29%
• Unlimited access from just £6.99 per month
Page
1. 1
1
2. 2
2
3. 3
3
4. 4
4
5. 5
5
6. 6
6

# Collect data with a view to estimating population parameters using estimation techniques.

Extracts from this document...

Introduction

Statistics Coursework

Task: You are required to collect data with a view to estimating population parameters using estimation techniques. This should involve taking a random sample as well as calculating and comparing confidence intervals.

I have decided to estimate the population parameters for sentence length in 2 different genres of books. I have chosen a horror book and a drama book to see how sentence length varies between them. In theory I would expect the horror book to have much shorter sentences to add suspense whilst I would expect the drama to have longer more descriptive sentences.

Method:

As it would be too time consuming to record the sentence length for the whole population (the whole book). I am going to use sampling. To try and avoid any bias I will use the random number function on a calculator to find a page in the book and then I will record the length of the first full sentence. I will take a 100 samples for each book as this is enough that I will be able to gain accurate estimates for the population parameters but not use too much time. If by chance 2 the random number function produces a number that has already been used I will simply take the length of the second sentence on that page.

Middle

16

129

9

73

7

208

18

98

5

51

145

11

Data For Drama Book

 Page Sentence Length 51 19 148 20 234 29 114 18 195 6 313 4 239 19 115 11 10 2 203 9 191 8 118 21 109 10 317 4 217 9 298 9 241 9 10 6 232 10 57 11 114 32 80 11 196 14 49 11 67 9 282 15 280 31 226 18 71 24 315 16 308 5 203 9 226 14 147 38 224 10 236 19 185 18 257 5 317 11 1 29 169 15 66 9 267 17 106 20 232 28 160 37 300 25 322 8 49 21 26 29 276 41 214 15 233 7 131 9 76 8 71 8 317 9 177 5 155 13 266 6 95 5 308 3 93 6 55 8 96 4 311 6 65 9 128 21 288 18 203 4 210 19 166 20 175 14 280 13 249 8 245 19 182 4 312 19 52 23 73 13 221 6 204 12 73 13 189 9 129 25 50 25 230 6 273 22 218 12 31 39 149 28 96 7 48 14 80 18 13 11 167 4 34 23 43 10 94 7 49 16

The first thing for me to do is to find the Mean, Standard Deviation and Variance of the sample I have taken. As it would be extremely time consuming trying to find the exact mean and variance for 100 results I have set up frequency tables which will allow me

Conclusion

Obviously this is highly impractical but it shows how inaccurate my estimate is due to the fact that I took so few samples. Also I only sampled 1 book from each genre so it is difficult for me to accurately say that all books from these genres will be the same. It is possible that different authors with different writing styles will produce different sentence lengths. For example another horror writer may use longer sentences whilst another drama writer might use shorter sentences.

So if I was to extend this investigation I would firstly take more samples to ensure greater accuracy which would therefore allow greater certainty in any conclusions drawn. Secondly I would compare a number of different horror books against each other to see if their population parameters were similar or if they varied. Another progression could be to sample a number of horror books by the same author to see if they are at all similar in their population parameters.

This student written piece of work is one of many that can be found in our AS and A Level Probability & Statistics section.

## Found what you're looking for?

• Start learning 29% faster today
• 150,000+ documents available
• Just £6.99 a month

Not the one? Search for your essay title...
• Join over 1.2 million students every month
• Accelerate your learning by 29%
• Unlimited access from just £6.99 per month

# Related AS and A Level Probability & Statistics essays

1. ## Statistics coursework

Total of KS2 results Frequency Cumulative Frequency Percentage of total 5<C<8 3 3 2.4 8<C<10 16 19 15.2 10<C<12 42 61 48.8 12<C<14 42 103 82.4 14<C<16 22 125 100 16<C<18 0 125 100 - Total of KS2 results for year 11 (Table 10)

2. ## Anthropometric Data

and a foot length ranging from 138-144 (mm) tend to be repeated at this age group. This also give a firm prediction of how Confident the prediction is in this middle region of determine Small size, Median size, and Large size socks. This can be determining when I have drawn the regression line.

1. ## Design an investigation to see if there is a significant relationship between the number ...

However, if my results are not as I expected and a strong relationship is not shown between length of longest frond and number of bladders, I may plot the graph of frequency of Fucus vesiculosus against number of bladders, in order to analyse my results further: I am then going

2. ## Standard addition was used to accurately quantify for quinine in an unknown urine sample ...

Concentration of the analyte has an effect on fluorescence intensity. The power of fluorescence emission, F, is proportional to the radiant power of the excitation beam that is absorbed by the system. That is, Equation 1 Where P0 is the power of the beam incident upon the solution and P

1. ## Guestimate - investigate how well people estimate the length of lines and the size ...

The modal group for year 7 was 45 < d < 55. My histogram for year 10 had a much smaller range, which shows that it was more consistent and the estimates were less varied. This histogram proves my first hypothesis which states that year 10 are better at estimating angles than year 7.

2. ## Original Writing: Dramatic Monologue - Monologue of a Disturbed Personality

He would have gone straight to the management and told the he was suing them. But I have inherited my mothers 'easy going' nature to all situations in life. "Yes....the sample....just a small scratch of the skin on your face that's all."

1. ## Chebyshevs Theorem and The Empirical Rule

* At least 93.75% of all the ages will lie in the range of . In our case this means that at least 93.75% of the people will have an age in the range of years which simplifies to a range of 16 to 64 years.

2. ## &amp;quot;The lengths of lines are easier to guess than angles. Also, that year 11's ...

These will show the estimates of the line for one individual person plotted against their estimate for the angle. From these scatter graphs you can see whether or not anybody guessed exactly the correct size or length. These things should help me prove or disprove my hypothesis.

• Over 160,000 pieces
of student written work
• Annotated by
experienced teachers
• Ideas and feedback to