Collect data with a view to estimating population parameters using estimation techniques.

Extracts from this document...

Introduction

Statistics Coursework

Task: You are required to collect data with a view to estimating population parameters using estimation techniques. This should involve taking a random sample as well as calculating and comparing confidence intervals.

I have decided to estimate the population parameters for sentence length in two different genres of book. I have chosen a horror book and a drama book to see how sentence length varies between them. In theory, I would expect the horror book to have much shorter sentences, to add suspense, whilst I would expect the drama to have longer, more descriptive sentences.

Method:

As it would be too time-consuming to record the sentence length for the whole population (the whole book), I am going to use sampling. To try to avoid bias, I will use the random number function on a calculator to choose a page in the book, and then I will record the length of the first full sentence on that page. I will take a sample of 100 sentences from each book, as this is large enough to give reasonably accurate estimates of the population parameters without taking too much time. If by chance the random number function produces a page number that has already been used, I will simply take the length of the second sentence on that page.
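The page-selection procedure described above can be sketched in Python. This is only an illustration, not the calculator method actually used: the function name `draw_sample` and the page count of 322 are assumptions made for the example, and the rule for a page drawn a third time (take the third sentence) is an extension of the "second sentence" rule that the method does not state.

```python
import random
from collections import Counter


def draw_sample(num_pages, sample_size, rng=None):
    """Return a list of (page, sentence_index) pairs.

    sentence_index is 1 for the first full sentence on a page, 2 if that
    page has already been drawn once, 3 if it has been drawn twice, etc.
    """
    rng = rng or random.Random()
    times_drawn = Counter()  # how often each page has come up so far
    picks = []
    for _ in range(sample_size):
        page = rng.randint(1, num_pages)  # inclusive at both ends
        times_drawn[page] += 1
        picks.append((page, times_drawn[page]))
    return picks


# Hypothetical book of 322 pages, sample of 100 sentences.
picks = draw_sample(num_pages=322, sample_size=100, rng=random.Random(0))
for page, sentence_index in picks[:5]:
    print(f"page {page}: record sentence number {sentence_index}")
```

Because every (page, sentence number) pair is distinct, no sentence is recorded twice even when the same page comes up again.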

Middle

16

129

9

73

7

208

18

98

5

51

145

11

Data For Drama Book

Page    Sentence Length
51      19
148     20
234     29
114     18
195     6
313     4
239     19
115     11
10      2
203     9
191     8
118     21
109     10
317     4
217     9
298     9
241     9
10      6
232     10
57      11
114     32
80      11
196     14
49      11
67      9
282     15
280     31
226     18
71      24
315     16
308     5
203     9
226     14
147     38
224     10
236     19
185     18
257     5
317     11
1       29
169     15
66      9
267     17
106     20
232     28
160     37
300     25
322     8
49      21
26      29
276     41
214     15
233     7
131     9
76      8
71      8
317     9
177     5
155     13
266     6
95      5
308     3
93      6
55      8
96      4
311     6
65      9
128     21
288     18
203     4
210     19
166     20
175     14
280     13
249     8
245     19
182     4
312     19
52      23
73      13
221     6
204     12
73      13
189     9
129     25
50      25
230     6
273     22
218     12
31      39
149     28
96      7
48      14
80      18
13      11
167     4
34      23
43      10
94      7
49      16

The first thing for me to do is to find the mean, standard deviation and variance of the sample I have taken. As it would be extremely time-consuming to find the exact mean and variance directly from 100 individual results, I have set up frequency tables which will allow me ...
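As a rough illustration of the frequency-table calculation, the sketch below works through the 100 sentence lengths recorded for the drama book. It assumes an ungrouped table with one row per distinct length; the tables used in the coursework may have been grouped into classes, in which case the results would differ slightly. The function name `summarise` is made up for the example.

```python
from collections import Counter
from math import sqrt

# Sentence lengths for the drama book, in the order they were recorded.
drama_lengths = [
    19, 20, 29, 18, 6, 4, 19, 11, 2, 9,
    8, 21, 10, 4, 9, 9, 9, 6, 10, 11,
    32, 11, 14, 11, 9, 15, 31, 18, 24, 16,
    5, 9, 14, 38, 10, 19, 18, 5, 11, 29,
    15, 9, 17, 20, 28, 37, 25, 8, 21, 29,
    41, 15, 7, 9, 8, 8, 9, 5, 13, 6,
    5, 3, 6, 8, 4, 6, 9, 21, 18, 4,
    19, 20, 14, 13, 8, 19, 4, 19, 23, 13,
    6, 12, 13, 9, 25, 25, 6, 22, 12, 39,
    28, 7, 14, 18, 11, 4, 23, 10, 7, 16,
]


def summarise(values):
    """Mean and variance from a frequency table of the values."""
    freq = Counter(values)  # frequency table: length x -> frequency f
    n = sum(freq.values())
    sum_fx = sum(f * x for x, f in freq.items())
    sum_fx2 = sum(f * x * x for x, f in freq.items())
    mean = sum_fx / n
    variance = sum_fx2 / n - mean ** 2  # variance of the sample (divisor n)
    unbiased = variance * n / (n - 1)   # unbiased estimate of the population variance
    return n, mean, variance, unbiased


n, mean, variance, unbiased = summarise(drama_lengths)
print(f"n = {n}, mean = {mean:.2f}")
print(f"sample variance = {variance:.2f}, sample s.d. = {sqrt(variance):.2f}")
print(f"unbiased variance estimate = {unbiased:.2f}, s.d. = {sqrt(unbiased):.2f}")
```

The factor n / (n - 1) converts the variance of the sample into an unbiased estimate of the population variance, which is the quantity normally used when building a confidence interval.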

Conclusion

Obviously this is highly impractical, but it shows how inaccurate my estimate is because I took so few samples. Also, I sampled only one book from each genre, so I cannot say with any confidence that all books from these genres will be the same. It is possible that different authors with different writing styles will produce different sentence lengths. For example, another horror writer may use longer sentences, whilst another drama writer might use shorter ones.

So if I were to extend this investigation, I would firstly take larger samples to improve accuracy, which would allow greater certainty in any conclusions drawn. Secondly, I would compare a number of different horror books against each other to see whether their population parameters were similar or whether they varied. A further extension would be to sample a number of horror books by the same author to see whether their population parameters are at all similar.
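The link between sample size and accuracy can be illustrated with a short sketch of a normal-approximation confidence interval for a mean. The figures used here (a mean of 14 and a standard deviation of 9) are purely hypothetical and are not the coursework's results; the function names are also invented for the example, and the normal approximation is an assumption that is usually reasonable for samples of around 100.

```python
from math import ceil, sqrt
from statistics import NormalDist


def confidence_interval(mean, sd, n, level=0.95):
    """Normal-approximation confidence interval for a population mean."""
    z = NormalDist().inv_cdf(0.5 + level / 2)  # about 1.96 for 95%
    half_width = z * sd / sqrt(n)
    return mean - half_width, mean + half_width


def required_sample_size(sd, half_width, level=0.95):
    """Smallest n for which the interval is no wider than +/- half_width."""
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return ceil((z * sd / half_width) ** 2)


# Hypothetical values, for illustration only.
for n in (100, 400, 1600):
    low, high = confidence_interval(mean=14.0, sd=9.0, n=n)
    print(f"n = {n}: 95% interval from {low:.2f} to {high:.2f}")

print("n needed for +/- 0.5:", required_sample_size(sd=9.0, half_width=0.5))
```

The interval width is proportional to 1/sqrt(n), so halving the width requires a sample four times as large. This is why a very precise estimate quickly becomes impractical to obtain by hand.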

This student written piece of work is one of many that can be found in our AS and A Level Probability & Statistics section.
