# The aim of this investigation is to gain statistical information to show authorship of a text.

Introduction

AS Mathematics (AQA): Statistics Coursework

DESIGN

Introduction: The aim of this investigation is to gain statistical information to show authorship of a text. I will use two pieces of text to investigate authorship. For the investigation to be valid, the two texts should differ broadly in theme, for example in genre or in the age of the intended readers. I had a number of texts to compare, but I decided to use one adult text and one children's text, as this should give a more obvious variation and a clearer expectation.

For both populations I will calculate the mean of the distribution, then the standard deviation and variance, using the unbiased estimator. I will also calculate the standard error and confidence intervals for both populations. My data will be presented in frequency distribution tables, which show the trends that would appear in a frequency distribution graph. Normal distribution diagrams will be used to represent the confidence intervals.

Population: In a statistical enquiry, you often need information about a particular group. This group is known as the POPULATION, and it could be small, large or infinite. The population for my investigation is all the words of each separate book.

Sampling: Sampling is the selection of individual members of a population. The advantage of taking a sample is that it is cheaper and quicker, the results are easier to analyse, and it is appropriate for this type of investigation. Unfortunately, it has some disadvantages that are difficult to avoid: the results may include natural variation or bias, so the sample may not be representative of the whole population, in which case the results are meaningless.
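The statistics named in the design can be written out for data held in a frequency table. This is only a sketch in assumed notation, none of which appears in the coursework itself: x is a word length, f its frequency, n the sample size, and z the normal critical value for whichever confidence level is chosen (for example, roughly 1.96 for 95%). Using z rather than a t-value is also an assumption, reasonable only for fairly large samples.

```latex
\begin{align*}
n &= \sum f, &
\bar{x} &= \frac{\sum f x}{n} \\
s^2 &= \frac{\sum f x^2 - n\bar{x}^2}{n - 1}, &
s &= \sqrt{s^2} \\
\mathrm{SE}(\bar{x}) &= \frac{s}{\sqrt{n}}, &
\text{confidence interval} &= \bar{x} \pm z\,\frac{s}{\sqrt{n}}
\end{align*}
```

Dividing by n - 1 rather than n is what makes s² the unbiased estimator of the population variance.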

Middle

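| No. of letters (x) | Tally | Frequency (f) |
|---|---|---|
| … | … I I I | 7 |
| 7 | I I I | 3 |
| 8 | I I I I | 4 |
| 9 | I I I I I I I | 7 |
| 10 | I I | 2 |
| 11 | I | 1 |
| 12 | I | 1 |

The distribution is not normal, and I will discuss how a certain theorem accounts for this.

Raw Data for Children's Text:

| Word No. | Page | Word | Word Length |
|---|---|---|---|
| 1 | 24 | mum | 3 |
| 2 | 8 | to | 2 |
| 3 | 7 | was | 3 |
| 4 | 29 | their | 5 |
| 5 | 31 | trouble | 7 |
| 6 | 3 | she | 3 |
| 7 | 18 | was | 3 |
| 8 | 45 | there | 5 |
| 9 | 5 | the | 3 |
| 10 | 19 | very | 4 |
| 11 | 37 | a | 1 |
| 12 | 38 | he | 2 |
| 13 | 20 | it | 2 |
| 14 | 45 | to | 2 |
| 15 | 26 | and | 3 |
| 16 | 15 | eggs | 4 |
| 17 | 25 | chris | 5 |
| 18 | 30 | friends | 7 |
| 19 | 35 | archie | 6 |
| 20 | 33 | to | 2 |
| 21 | 40 | yellow | 6 |
| 22 | 2 | hands | 5 |
| 23 | 10 | out | 3 |
| 24 | 43 | house | 5 |
| 25 | 6 | on | 2 |
| 26 | 42 | jacket | 6 |
| 27 | 14 | was | 3 |
| 28 | 38 | oh | 1 |
| 29 | 17 | said | 4 |
| 30 | 25 | there | 5 |
| 31 | 35 | for | 3 |
| 32 | 37 | chris | 5 |
| 33 | 36 | cat | 3 |
| 34 | 10 | coops | 5 |
| 35 | 43 | half | 4 |
| 36 | 40 | of | 2 |
| 37 | 16 | the | 3 |
| 38 | 15 | place | 5 |
| 39 | 33 | bring | 5 |
| 40 | 19 | six | 3 |
| 41 | 45 | picture | 7 |
| 42 | 46 | lots | 4 |
| 43 | 27 | sing | 4 |
| 44 | 41 | down | 4 |
| 45 | 4 | glass | 5 |
| 46 | 36 | the | 3 |
| 47 | 23 | too | 3 |
| 48 | 37 | it | 2 |
| 49 | 14 | the | 3 |
| 50 | 26 | want | 4 |

Frequency Distribution table and graph for Children's Text:

| No. of letters (x) | Tally | Frequency (f) |
|---|---|---|
| 1 | I I | 2 |
| 2 | I I I I I I I I | 8 |
| 3 | I I I I I I I I I I I I I I I | 15 |
| 4 | I I I I I I I I | 8 |
| 5 | I I I I I I I I I I I | 11 |
| 6 | I I I | 3 |
| 7 | I … | … |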
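As a check on the tallying, the frequency distribution and the summary statistics can be recomputed directly from the recorded word lengths. The following is a minimal sketch, not the method used in the coursework: the function names `frequency_table` and `confidence_interval` are hypothetical, the 95% level is an assumption (the text does not state one), and the interval uses a normal approximation.

```python
from collections import Counter
from math import sqrt
from statistics import NormalDist, mean, stdev

# Word lengths as recorded in the children's text raw data, in row order.
word_lengths = [
    3, 2, 3, 5, 7, 3, 3, 5, 3, 4,
    1, 2, 2, 2, 3, 4, 5, 7, 6, 2,
    6, 5, 3, 5, 2, 6, 3, 1, 4, 5,
    3, 5, 3, 5, 4, 2, 3, 5, 5, 3,
    7, 4, 4, 4, 5, 3, 3, 2, 3, 4,
]


def frequency_table(lengths):
    """Return {word length: frequency}, ordered by word length."""
    return dict(sorted(Counter(lengths).items()))


def confidence_interval(lengths, confidence=0.95):
    """Normal-approximation interval for the mean (95% is an assumed level)."""
    n = len(lengths)
    x_bar = mean(lengths)
    s = stdev(lengths)   # square root of the variance estimate that divides by n - 1
    se = s / sqrt(n)     # standard error of the mean
    z = NormalDist().inv_cdf(0.5 + confidence / 2)  # two-sided critical value
    return x_bar - z * se, x_bar + z * se


table = frequency_table(word_lengths)
low, high = confidence_interval(word_lengths)

# Print one row per word length: length, tally marks, frequency.
for x, f in table.items():
    print(f"{x:>2}  {'I ' * f:<30} {f}")
print(f"mean = {mean(word_lengths):.2f}, interval = ({low:.2f}, {high:.2f})")
```

Working from the raw lengths rather than the hand-made tally means any slip in the tally shows up as a mismatch. For the 50 lengths listed, the counts for lengths 1 to 6 agree with the frequency table.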

Conclusion

