• Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month
Page
  1. 1
    1
  2. 2
    2
  3. 3
    3
  4. 4
    4
  5. 5
    5
  6. 6
    6
  7. 7
    7
  8. 8
    8
  9. 9
    9
  10. 10
    10
  11. 11
    11
  12. 12
    12
  13. 13
    13
  14. 14
    14
  15. 15
    15
  16. 16
    16
  17. 17
    17
  18. 18
    18
  19. 19
    19
  20. 20
    20
  21. 21
    21
  • Level: GCSE
  • Subject: Maths
  • Word count: 4082

Statistical investigation.

Extracts from this document...

Introduction

Mathematics GCSE

Mayfield High School

image00.png

Year Group

Number of Boys

Number of Girls

Total

7

151

131

282

8

145

125

270

9

118

143

261

10

106

94

200

11

84

86

170

The total number of students at the school is 1183.

I have been given data on all the students covering a range of different things such as hair colour, eye colour, numbers of brothers or sisters and even favourite music and IQ. There are 27 different categories and there for a total of 31941 datum points.

1183 x 27 =31941

What I first need to do is to decide which line of enquiry I will choose. There are several options but I need to pick I think will show successfully what I can do within the area of statistics. I have decided to compare these four things:

>Year Group

>Sex

>Height

>Weight

I feel that these four sections will enable me to carry out a statistical investigation. I aim to find out whether there is any relationship between them and differences between age and sex.

Collecting Data

The first step I need to take is to take a random sample of the data. Before I do this I will need to decide how many students’ data I want to be analysing. I think 60 would be a good amount allowing me to have 30 boys and 30 girls. Now I need to use stratified sample to find out how many boy and girls I need from each year group.

282/ 1183 x 60 =14.30262

270/ 1183 x 60 =13.694

261/ 1183 x 60 =13.23753

200/ 1183 x 60 =10.1437

170/ 1183 x 60 =8.622147

Total = 60

The reason I have decided to do a stratified sample is because stratified sampling is the best way to represent data in a proportional way. It makes it a much fairer process to randomly select data by giving each one an equal chance of being selected. Because I can’t exactly have 13.694 prices of data I need to simplify each value by rounding it up to the nearest whole number.

...read more.

Middle

Number

8

Male

1.20

36

23

8

Male

1.32

47

87

8

Male

1.69

59

39

8

Male

1.54

42

116

8

Male

1.50

52

86

8

Male

1.42

26

8

8

Male

1.77

54

1

8

Male

1.57

62

128

8

Female

1.62

49

10

8

Female

1.62

54

32

8

Female

1.75

64

53

8

Female

1.41

39

61

8

Female

1.66

72

59

8

Female

1.50

57

46

image05.png

Year Group

Gender

Height (m)

Weight (kg)

Number

9

Male

1.54

42

26

9

Male

1.69

65

74

9

Male

1.56

60

63

9

Male

1.64

35

80

9

Male

1.61

45

30

9

Male

1.73

52

15

9

Female

1.45

51

59

9

Female

1.6

48

55

9

Female

1.56

50

139

9

Female

1.40

41

130

9

Female

1.47

56

141

9

Female

1.68

57

68

9

Female

1.62

55

39

image06.png

Year Group

Gender

Height (m)

Weight (kg)

Number

10

Male

1.74

80

88

10

Male

1.63

60

1

10

Male

1.57

64

84

10

Male

1.82

57

50

10

Male

1.60

47

31

10

Female

1.68

58

59

10

Female

1.60

66

78

10

Female

1.56

56

15

10

Female

1.40

45

5

10

Female

1.68

53

76

image07.png

Year Group

Gender

Height (m)

Weight (kg)

Number

11

Male

1.82

66

33

11

Male

1.65

50

73

11

Male

1.62

48

64

11

Male

1.78

67

20

11

Female

1.03

45

37

11

Female

1.65

52

61

11

Female

1.80

42

40

11

Female

1.69

51

14

11

Female

1.73

50

11

Now that I have 60 pieces of data which represent the population proportionally I can begin my investigation. With the data that I have I can draw several graphs to represent the heights and the weights of the different ages or genders. Because there are so many things I can do with the data I need to decide a systematic way to approach the investigation so that I am not wasting time repeating calculations.  Because height and weight are continuous data I will have to construct histograms to represent the data and in order to do this I need to make cumulative frequency tables.

I am now ready to begin recording my results in a table. To start with I will look at height. When I arrange the data in order of height I wasn’t surprised to find out that the tallest person was in fact in year 11. I expected the shortest person however o be in year 7 so when I found that the shortest male from my data was in year 8 and that the shortest female was in year 11 I was shocked. The girl’s heights varied quite vastly and showed less of a correlation with age.

Boys

Height, h  (cm)

Tally

Frequency

120 ≤ h < 130

1

130 ≤ h < 140

2

140 ≤ h < 150

3

150 ≤ h < 160

7

160 ≤ h < 170

11

170 ≤ h < 180

4

180 ≤ h < 190

2

Girls

Height, h  (cm)

Tally

Frequency

100 ≤ h < 110

1

110 ≤ h < 120

0

120 ≤ h < 130

0

130 ≤ h < 140

1

140 ≤ h < 150

6

150 ≤ h < 160

5

160 ≤ h < 170

14

170 ≤

...read more.

Conclusion

After that I will then calculate spearman’s rank coefficient. The reason I have decided to do this is because I want to find out how correlated the height and weight are. I will calculate the coefficient for the boys, girls and different year groups. That way I will be able to compare the values and decide which year group or gender has a better correlation.

I will then evaluate all my results and make necessary comments to the results I obtained. I need to show my understanding of my results by evaluating all of the outcomes.

I also may carry out research on the BMI. This helps me understand how the relationship between height and weight varies according to age.

Standard Deviation

To begin with I have calculated the standard deviation for the whole population. The standard deviation is a statistic that tells you how tightly all the various examples are clustered around the mean in a set of data. When the examples are pretty tightly bunched together and the bell-shaped curve is steep, the standard deviation is small. When the examples are spread apart and the bell curve is relatively flat, that tells you have a relatively large standard deviation.

The formula for standard deviation is as follows:

image40.png

Standard deviation for the height of the whole population

0.150282

Standard deviation for the height of the boys of the population

0.148333

Standard deviation for the height of the girls of the population

0.15358


image41.png

Standard deviation for the height of the just the year sevens

0.11498686

Standard deviation for the height of the just the year eights

0.163164112

Standard deviation for the height of the just the year nines

0.097848653

Standard deviation for the height of the just the year tens

0.113705272

Standard deviation for the height of the just the year elevens

0.240023147

Scatter graph for year sevens

image42.pngimage28.png

Weight

Scatter graph for year eights

image43.pngimage28.png

Weight

Gradient of the line :  y = 0.9158x + 107.43

Roy Vivasi 11PB

...read more.

This student written piece of work is one of many that can be found in our GCSE Height and Weight of Pupils and other Mayfield High School investigations section.

Found what you're looking for?

  • Start learning 29% faster today
  • 150,000+ documents available
  • Just £6.99 a month

Not the one? Search for your essay title...
  • Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

See related essaysSee related essays

Related GCSE Height and Weight of Pupils and other Mayfield High School investigations essays

  1. A hypothesis is the outline of the idea/ideas which I will be testing and ...

    a very weak positive correlation between the two variables that I am investigating for this hypothesis and this means that that the increase in Hours of TV the higher the IQ on average which is rather bemusing as I had expected the opposite using my general knowledge and everyday theory

  2. Trolley Investigation

    It measures the speed in m/s to 3 decimal places. Diagram Method The apparatus will be set up as above. The jack will be used to raise the ramp to a certain height. The height of the start position of the trolley will be measured and then the height of

  1. Maths Data Handling

    0.556x - 40.6 Now that I have these equations, the weight can be predicted when the height is known, or height when the weight is known. For example to predict the weight of a boy who is 170 cm tall: y = 0.625x - 47.5 x = 170 = 106.25

  2. Mayfield igh Investigation

    Due to the fact I am using 25%, I can just divide the number of students in each year by 4. For year 7, there are 131 females (47% approx), and 150 males (53% approx). For year 11, there are 86 females (51% approx), and 84 males (49% approx).

  1. mayfeild statistics

    The gradient, a, of the line is 73.4. The intercept on the y-axis is the point (0, 64.756) so b = 64.756. Thus the equation is y = 73.4x - 64.756, using the formula 'y = ax + b'. This equation can be used like a formula, to find out the weight of someone, provided that they have the height measurement of the person.

  2. Introduction to arodynamics - Investigation into the design features of aircraft.

    This can then be manipulated to show the Mach number at which MCRIT will appear, M = 0.7 / Cos 40o = 0.91. This shows that a wing with 40o sweep can reach speeds of up to Mach 0.91 before sonic flow appears.

  1. Conduct an investigation comparing height and weight from pupils in Mayfield School.

    1.3 1.3 The first value on my graph is 25, so 25 + 40 = 65 So the equation of my line of best for girls suggests that a person who is 175cm tall will weigh 65kg. I will now find the frequency density for my data, to show the

  2. What affects a persons ability to estimate?

    In addition, Year groups 7,9 and 10's higher sets estimated closer on average than their lower sets. Although you would first think that Higher sets won by 3 to 2, when you look at Year 8 there is very unreliable data which I would predict their higher set to do better than the lower set due to there standard deviation.

  • Over 160,000 pieces
    of student written work
  • Annotated by
    experienced teachers
  • Ideas and feedback to
    improve your own work