Statistics Mayfield High

Statistics / Data Handling Coursework

Statistics Coursework – Mayfield High School Database

This investigation analyses data in the Mayfield High School database, sets out some hypotheses, and concludes whether there is reason to believe that they are true. The Mayfield High database holds information on the 1183 pupils at the school (when the database was compiled).

Hypotheses:

  1. Do girls have a higher IQ than boys? It has often been said that girls do better at exams, but does this mean that they have a higher IQ?

  2. Is there a correlation between IQ and KS2 results? Often, those who have a high IQ have high levels in their KS2 results.

  3. Do those who watch more TV have a lower IQ? Research has suggested that those who watch more TV have a lower IQ, but is this true at Mayfield?

  4. Is it true that girls are taller than boys? Research has said that while boys may be stronger and heavier, girls are taller in general.

Any problem data will usually be removed. If there is no entry for a field, I will delete the entry. To find outliers, I will check whether a value lies more than 1.5 times the Inter Quartile Range below the Lower Quartile or above the Upper Quartile. If it is quite far away from the Lower Quartile or the Upper Quartile, I will discard the data, but if it is not far away, I will decide on whether it should be omitted or not.
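The outlier check can be sketched in Python as follows. This is only an illustration, assuming the usual rule that a value is an outlier when it lies more than 1.5 x IQR below the LQ or above the UQ. The function names `outlier_fences` and `find_outliers` and the sample scores are made up for the example and are not taken from the database.

```python
def outlier_fences(lq, uq, k=1.5):
    """Return the (lower, upper) limits: LQ - k*IQR and UQ + k*IQR."""
    iqr = uq - lq
    return lq - k * iqr, uq + k * iqr


def find_outliers(values, lq, uq, k=1.5):
    """Return the values that lie outside the two limits."""
    low, high = outlier_fences(lq, uq, k)
    return [v for v in values if v < low or v > high]


# Hypothetical IQ scores, invented for illustration only.
scores = [70, 95, 98, 100, 102, 105, 108, 110, 140]

# With LQ = 96 and UQ = 108 the IQR is 12, so the limits are 78 and 126.
print(outlier_fences(96.0, 108.0))
print(find_outliers(scores, 96.0, 108.0))
```

A value flagged this way is only a candidate for removal; the decision to discard it still depends on how far it lies from the quartiles.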

Some abbreviations I will use:

LQ = Lower Quartile

UQ = Upper Quartile

IQR = Inter Quartile Range

PMCC = Product Moment Correlation Coefficient

SD = Standard Deviation

KS2 = Key Stage 2

FD = Frequency Density

PMCC:

$$ r = \frac{\sum xy - n\bar{x}\bar{y}}{\sqrt{\left(\sum x^2 - n\bar{x}^2\right)\left(\sum y^2 - n\bar{y}^2\right)}} $$
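The PMCC formula can be evaluated directly from the sums it contains. The sketch below is a minimal Python version, assuming two equal-length lists of paired values; the function name `pmcc` and the example numbers are hypothetical and do not come from the Mayfield data.

```python
from math import sqrt


def pmcc(xs, ys):
    """Product Moment Correlation Coefficient of paired lists xs and ys."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("xs and ys must be paired and contain at least 2 values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Numerator: sum of xy minus n times the product of the means.
    s_xy = sum(x * y for x, y in zip(xs, ys)) - n * mean_x * mean_y
    # The two bracketed terms under the square root.
    s_xx = sum(x * x for x in xs) - n * mean_x ** 2
    s_yy = sum(y * y for y in ys) - n * mean_y ** 2
    if s_xx == 0 or s_yy == 0:
        raise ValueError("r is undefined when all x values or all y values are equal")
    return s_xy / sqrt(s_xx * s_yy)


# Hypothetical paired values, invented for illustration only.
print(pmcc([1, 2, 3, 4], [2, 4, 6, 8]))
print(pmcc([1, 2, 3], [3, 2, 1]))
```

A result close to 1 indicates strong positive correlation, a result close to -1 strong negative correlation, and a result near 0 little or no linear correlation.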

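SD:

$$ \sigma = \sqrt{\frac{\sum x^2}{n} - \bar{x}^2} $$

Mean:

$$ \bar{x} = \frac{\sum x}{n} $$

Mean of grouped data:

$$ \bar{x} = \frac{\sum fx}{\sum f} $$

x = individual data

n = number of values

f = frequency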

Normal Distribution:

 

  1. To see whether the first hypothesis is true, I will separate the genders within each Year group and then plot the data in a Box and Whisker Plot. I will take a sample of 50 pieces of data from each Year so that the amount of data will be equal. To find out how many pieces of data to take from each gender, I will use stratified sampling, and I will use a random number generator to select the exact entries. The data will only be grouped by Year and by gender. I am planning to work out the mean, median, mode, LQ, UQ, IQR, outliers and SD. I hope that my graphs will show which gender is more intelligent overall; and if not overall, then for each Year.
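The summary statistics planned above could be worked out with Python's standard `statistics` module, as in the sketch below. Several choices here are assumptions rather than the method used in this coursework: the quartiles use the "inclusive" interpolation method, the SD is the population form (dividing by n), and the function name `summarise` and the scores are invented for the example. Other quartile conventions give slightly different LQ and UQ values.

```python
import statistics


def summarise(scores):
    """Return the main summary statistics for a list of scores."""
    # quantiles(..., n=4) returns the three cut points: LQ, median, UQ.
    lq, _, uq = statistics.quantiles(scores, n=4, method="inclusive")
    return {
        "mean": statistics.mean(scores),
        "median": statistics.median(scores),
        "mode": statistics.multimode(scores),  # list, since there may be ties
        "LQ": lq,
        "UQ": uq,
        "IQR": uq - lq,
        "SD": statistics.pstdev(scores),  # population SD (divides by n)
    }


# Hypothetical IQ scores, invented for illustration only.
sample = [88, 95, 100, 100, 102, 103, 107, 110, 124]
for name, value in summarise(sample).items():
    print(name, value)
```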

  • BW1

There are 282 pupils in Year 7: 131 are girls and 151 are boys:

 Girls = 131/282 x 50

         = 23

Therefore I will use 23 pieces of data from the girls

 

Boys = 151/282 x 50

         = 27

Therefore I will use 27 pieces of data from the boys.
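The stratified calculation above, followed by the random selection of entries, can be sketched in Python. The Year 7 counts (131 girls, 151 boys) are the ones quoted above; everything else is an assumption for illustration: the function names `stratified_allocation` and `pick_entries` are made up, and pupils are assumed to be numbered from 1 upwards within each gender.

```python
import random


def stratified_allocation(stratum_sizes, sample_size):
    """Share sample_size between strata in proportion to their sizes."""
    total = sum(stratum_sizes.values())
    # Rounding each share separately can leave the total one above or
    # below sample_size, so the result should always be checked.
    return {
        name: round(size / total * sample_size)
        for name, size in stratum_sizes.items()
    }


def pick_entries(stratum_size, k, rng):
    """Choose k different entry numbers from 1..stratum_size at random."""
    return sorted(rng.sample(range(1, stratum_size + 1), k))


year7 = {"girls": 131, "boys": 151}
allocation = stratified_allocation(year7, 50)
print(allocation)

rng = random.Random(0)  # fixed seed so the selection can be repeated
for gender, k in allocation.items():
    print(gender, pick_entries(year7[gender], k, rng))
```

Checking that the rounded shares still add up to 50 matters, because rounding each gender separately does not always give the intended total.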

The girls' box plot has minimum and maximum values of 88 and 124 respectively. The median is 102.5, the LQ is 100 and the UQ is 109.75, the IQR being 9.75. There is a low outlier at 84. The mean for the girls' data is 104 and the SD is 9.8. The boys' box plot has a minimum value of 87 and a maximum value of 114. The LQ is 96.5, the median is 100 and the UQ is 107; the IQR is 10.5. There is a low outlier at 74, the mean is 99.9 and the SD is 8.6.

The girls' IQR is less than that of the boys, which shows that the middle 50% of the girls' IQs are more consistent. Both quartiles are higher for the girls, which suggests that the majority of the girls have higher IQs, and the girls' whiskers are longer than the boys'. The lowest value, highest value and the median are all higher for the girls, which points to a similar conclusion. Therefore, I think that the Year 7 girls are more intelligent than the boys. Both box plots have a low outlier, but the girls' outlier has a higher value than the boys', again pointing to the conclusion that the girls are more intelligent. However, the top and bottom 25% of the girls' box plot are more spread out, whereas the boys' values are clustered.

  • BW2

There are 270 pupils in Year 8: 127 are girls and 143 are boys:

 Girls = 127/270 x 50

         = 24

Therefore I will use 24 pieces of data from the girls.

 

Boys = 143/270 x 50

         = 26

Therefore I will use 26 pieces of data from the boys.

The girls' minimum value is 94 and the maximum value is 126. The median is 103, the LQ is 100.75 and the UQ is 113; therefore the IQR is 12.25. There is a mean of 106, an SD of 6.9 and no outliers. The boys' lowest value is 91, the maximum value is 126 and the median is 102. The LQ is 100 and the UQ is 110.75, so the IQR is 10.75. There is a low outlier at 74 and a high outlier at 132. The mean is 104 and the SD is 11.2.

 

The boys' IQR is less than that of the girls, which shows that the middle 50% of the boys' IQs are more consistent in Year 8. The boys also have longer whiskers (the girls have short, clustered whiskers) and the highest value once the outlier is included, which suggests that some boys have higher IQs. However, the boys' box plot has a very low outlier (possibly due to a student with a learning disorder like dyslexia). There is also a very high outlier, which suggests a student with a very high IQ. Nevertheless, the girls have a higher LQ, UQ and lowest value. In conclusion, I think that the girls are more intelligent.

  • BW3

There are 261 pupils in Year 9: 142 are girls and 117 are boys:

 

Girls = 142/261 x 50

         = 27

Therefore I will use 27 pieces of data from the girls.

 

Boys = 117/261 x 50

         = 22.4, taken as 23 so that the sample totals 50

Therefore I will use 23 pieces of data from the boys.

The girls' lowest value is 85 and the highest value is 122, the median being 102. The LQ is 96, the UQ is 107 and so the IQR is 11. There is also one low outlier at 78. The mean is 101 and the SD is 9.9. The boys have a lowest value of 89, a highest value of 120 and a median of 102. The LQ is 98.5, the UQ is 110.5 and so the IQR is 12. There are 2 low outliers: one at 69 and one at 74. There is a mean of 120 and the SD is 13.

The boys' IQR is greater than that of the girls, so the middle 50% of the boys' IQs are more spread out. The boys have a higher LQ and UQ, which suggests that the boys are more intelligent than the girls, although the medians are equal at 102. However, the girls' outlier has a higher value than the two that the boys have, and the girls' highest value is greater than that of the boys. The girls' whiskers are also less clustered, which suggests instead that the girls are more intelligent than the boys.

  • BW4

There are 200 pupils in Year 10: 96 are girls and 104 are boys:

 Girls = 96/200 x 50

         = 24

Therefore I will use 24 pieces of data from the girls.

 

Boys = 104/200 x 50

         = 26

Therefore I will use 26 pieces of data from the boys.

The minimum value for the girls' box plot here is 79 and the maximum value is 113. The LQ is 93 and the UQ is 103, the IQR being 10. There is ...
