Statistical coursework that uses data from 'Mayfield High School.'

Hayley Lloyd-Henry 11R Maths Statistics Coursework

MAYFIELD HIGH SCHOOL

Introduction

I have chosen to do this statistical coursework that uses data from ‘Mayfield High School.’ Although this is a fictitious school, the data is based on a real school. As the data has been collected for me, it is called secondary data.

I believe that this coursework will allow me to illustrate my ability to handle data, use specific techniques and apply higher level statistical maths by using a variety of methods to analyse and compare sets of data. During this project I will be examining the relationships between the attributes of the pupils of Mayfield High School. My aim is to produce a line of enquiry which has two or more statistics regarding the pupils which are related to each other.

This table shows how many boys and girls there are in each year group at Mayfield High.

Year Group   Number of Boys   Number of Girls   Total
7            150              150               300
8            145              125               270
9            120              140               260
10           100              100               200
11           84               86                170

The total number of students at the school is 1200.

Data is provided for each pupil in the following categories:

  • Name
  • Age
  • Year Group
  • IQ
  • Weight
  • Height
  • Hair colour
  • Eye colour
  • Shoe size
  • Distance from home to school
  • Usual method of travel to school
  • Number of brothers or sisters
  • Key stage 2 & 3 results in English, Mathematics and Science

Calculations:

  1. Mean
  2. Mode
  3. Median
  4. Mean & Modal Class for Grouped Continuous Data – This estimates the mean of grouped continuous data from the class midpoints and identifies the class with the highest frequency.
  5. Interquartile Range - The distance between the upper and lower quartiles. As a measure of variability, it is less sensitive than the standard deviation or range to the possible presence of outliers. It is also used to define the box in a box-and-whisker plot.
  6. Standard Deviation - The most commonly used measure of spread. It shows how far the values typically lie from the mean.
  7. Normal distribution - Normal distributions are a family of distributions that have the same general shape. They are symmetric with scores more concentrated in the middle than in the tails. Normal distributions are sometimes described as bell shaped.
  8. Spearman’s Rank Correlation Coefficient - The Spearman's Rank Correlation Coefficient is used to discover the strength of a link between two sets of data.
  9. Equation of Line of Best Fit – The equation of the straight line that shows the underlying trend in a scatter diagram.
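Several of the calculations listed above can be sketched in a few lines of Python. This is only an illustration, not the method used in the coursework itself: it uses the Year 7 girls' heights and weights from the results table, assumes the population form of the standard deviation, takes quartiles with the (n + 1)/4 position rule, handles tied ranks in Spearman's coefficient by giving them their average rank, and assumes a least-squares line of best fit rather than one drawn by eye. All function names are hypothetical.

```python
from statistics import mean, median, multimode, pstdev, quantiles

# Year 7 girls from the results table: heights in metres, weights in kg.
heights = [1.61, 1.61, 1.56, 1.48, 1.50, 1.56, 1.58]
weights = [45, 47, 43, 42, 40, 53, 48]


def interquartile_range(values):
    """Upper quartile minus lower quartile, using the (n + 1)/4 position rule."""
    q1, _, q3 = quantiles(values, n=4)  # default method is 'exclusive'
    return q3 - q1


def average_ranks(values):
    """Rank from 1 (smallest); tied values share the mean of their ranks."""
    ordered = sorted(values)
    return [ordered.index(v) + (ordered.count(v) + 1) / 2 for v in values]


def pearson(xs, ys):
    """Product-moment correlation coefficient of two equal-length lists."""
    mx, my = mean(xs), mean(ys)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


def spearman(xs, ys):
    """Spearman's rank correlation: the correlation of the ranks (works with ties)."""
    return pearson(average_ranks(xs), average_ranks(ys))


def line_of_best_fit(xs, ys):
    """Least-squares gradient m and intercept c for y = m*x + c (an assumption)."""
    mx, my = mean(xs), mean(ys)
    m = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / sum(
        (x - mx) ** 2 for x in xs
    )
    return m, my - m * mx


print("mean weight:", mean(weights))
print("median weight:", median(weights))
print("modal heights:", multimode(heights))
print("IQR of weights:", interquartile_range(weights))
print("standard deviation of weights:", pstdev(weights))
print("Spearman's rank (height vs weight):", spearman(heights, weights))
print("line of best fit (gradient, intercept):", line_of_best_fit(heights, weights))
```

The least-squares line always passes through the point (mean height, mean weight), which is also the point a hand-drawn line of best fit is usually made to pass through.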

Collecting the Data

In order to find my results, I will need to sort the data and put it into tables. As I am using stratified sampling, I have had to count up the number of boys and girls in each year and work out my sample size.
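As a rough illustration of how a stratified sample can be worked out, the sketch below shares a sample between the year groups and genders in proportion to their size in the school. The overall sample size of 60 is purely an assumption for the example (the size actually used is not stated in this extract), and the function names are hypothetical.

```python
import math
import random

# Pupils in each stratum (year group, gender), taken from the table above.
population = {
    (7, "boys"): 150, (7, "girls"): 150,
    (8, "boys"): 145, (8, "girls"): 125,
    (9, "boys"): 120, (9, "girls"): 140,
    (10, "boys"): 100, (10, "girls"): 100,
    (11, "boys"): 84, (11, "girls"): 86,
}


def stratified_sample_sizes(strata, sample_size):
    """Share sample_size between the strata in proportion to their size.

    Each share is stratum size / total x sample size, rounded half up.
    Because of rounding, the shares will not always add up to sample_size.
    """
    total = sum(strata.values())
    return {
        group: math.floor(count * sample_size / total + 0.5)
        for group, count in strata.items()
    }


def pick_pupils(stratum_size, how_many):
    """Randomly choose pupil numbers (1 to stratum_size) without repeats."""
    return sorted(random.sample(range(1, stratum_size + 1), how_many))


sizes = stratified_sample_sizes(population, 60)  # 60 is an assumed sample size
for group, n in sizes.items():
    print(group, n, pick_pupils(population[group], n))
```

Picking the pupils at random within each stratum keeps the sample unbiased, while the proportional shares keep each year group and gender represented fairly.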

Boys (continued)

No.   Year   Height (m)   Weight (kg)
16    9      1.75         63
17    9      1.46         45
18    9      1.5          70
19    9      1.82         66
20    10     1.8          49
21    10     1.6          50
22    10     1.62         52
23    10     1.65         50
24    10     1.77         59
25    11     1.91         82
26    11     1.62         56
27    11     1.74         50
28    11     2            86

 Results

Girls

No.   Year   Height (m)   Weight (kg)
1     7      1.61         45
2     7      1.61         47
3     7      1.56         43
4     7      1.48         42
5     7      1.5          40
6     7      1.56         53
7     7      1.58         48
8     8      1.72         43
9     8      1.62         53
10    8      1.62         54
11    8      1.6          46
12    8      1.75         45
13    8      1.48         46
14    9      1.57         38
15    9      1.62         54
16    9      1.64         40
17    9      1.6          46
18    9      1.8          60
19    9      1.6          51
20    10     1.52         45
21    10     1.72         56
22    10     1.66         45
23    10     1.73         42
24    11     1.7          50
25    11     1.68         48
26    11     1.52         38
27    11     1.62         48

Organising My Results

Although I have already presented my results in two separate tables, one for each gender, the results are not concise enough. To analyse them fully, I will need to put my results into scatter diagrams, histograms and similar diagrams. My results therefore need to be grouped into around 5-8 groups, which must be the same for both genders so that the diagrams for boys and girls can be compared directly. Once I have chosen my groups, I will enter the information into frequency tables and use those for my histograms and scatter diagrams.
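The grouping step can be sketched as follows. This is only an example under stated assumptions: it uses the girls' heights from the table above, converts them to whole centimetres, and groups them into five classes of width 10 cm starting at 140 cm. The class boundaries and the function name are choices made for the illustration, not the groups chosen in the coursework.

```python
# Girls' heights in metres, copied from the results table above.
girls_heights_m = [
    1.61, 1.61, 1.56, 1.48, 1.5, 1.56, 1.58, 1.72, 1.62,
    1.62, 1.6, 1.75, 1.48, 1.57, 1.62, 1.64, 1.6, 1.8,
    1.6, 1.52, 1.72, 1.66, 1.73, 1.7, 1.68, 1.52, 1.62,
]


def frequency_table(values, start, width, n_classes):
    """Count how many values fall in each class lower <= value < upper.

    Returns a list of (lower, upper, frequency) rows. Values outside the
    range covered by the classes are ignored.
    """
    counts = [0] * n_classes
    for value in values:
        index = (value - start) // width
        if 0 <= index < n_classes:
            counts[index] += 1
    return [
        (start + i * width, start + (i + 1) * width, counts[i])
        for i in range(n_classes)
    ]


# Work in whole centimetres so class boundaries are exact integers.
heights_cm = [round(h * 100) for h in girls_heights_m]

# Assumed classes: 140-150, 150-160, ..., 180-190 (five equal-width groups).
table = frequency_table(heights_cm, start=140, width=10, n_classes=5)
for lower, upper, frequency in table:
    print(f"{lower} <= h < {upper}: {frequency}")
```

Passing the boys' heights to the same function with the same start, width and number of classes gives a table that can be compared directly with the girls' one.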


This student written piece of work is one of many that can be found in our AS and A Level Probability & Statistics section.
