• Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month
  1. 1
  2. 2
  3. 3
  4. 4
  5. 5
  6. 6
  7. 7
  8. 8
  9. 9
  10. 10
  11. 11
  12. 12
  13. 13
  14. 14
  15. 15
  16. 16
  17. 17
  • Level: GCSE
  • Subject: Maths
  • Word count: 7018

Statistics Mayfield High

Extracts from this document...


Statistics / Data Handling Coursework                                Page  of

Statistics Coursework – Mayfield High School Database

This investigation is to analyze data in the Mayfield High School Database and come up with some hypothesis and to conclude whether there is reason to believe that they are true. I am going to investigate the data from Mayfield High is a database of information from the 1183 pupils at the school (when the database was compiled).


  1. Do Girls have a higher IQ than boys? It has often been said that girls do better at exams but does this mean that they have a higher IQ?
  1. Is there a correlation between the IQ and KS2 results? Often, those that have a high IQ have high levels in their KS2 results.
  1. Do those that watch more TV have a lesser IQ? Research has suggested that those who watch more TV have a lesser IQ but is this true at Mayfield?
  1. Is it true that the girls are taller than boys? Research has said that while boys may be stronger and heavier, Females are taller in general.

Any problem data will usually be removed. If there is no entry for the field, I will delete the entry. If there are any outliers, I will check whether it is 1.5 times greater than the Inter Quartile Range. If it is quite far away from the Lower Quartile or the Upper Quartile, I will discard the data but if it is not far away, I will decide on whether it should be omitted or not.

Some abbreviations I will use:

LQ = Lower Quartile

UQ = Upper Quartile

IQR = Inter Quartile Range

PMCC = Product Moment Correlation Coefficient

SD = Standard Deviation

KS2 = Key Stage 2

FD = Frequency Density


              _ _

              Σxy - nx·y

r =  ____________________________

                 _          _

      √[ (Σx² - nx²)(Σy² - ny²) ]





Mean:                         of grouped data:


image14.pngimage16.png = individual data

image17.png = number of values

image18.png= frequency

Normal Distribution:



...read more.


The Boys IQR is greater than that of the girls which suggests that the boys are more intelligent than the girls. This is supported by the higher median and IQR (as well as higher LQ and UQ). However, the girl’s outlier has a higher value than the two that the boys have and the girl’s highest value is greater than that of the boys. The girl’s whisker lengths are less clustered, suggesting that the girls are more intelligent than boys.

  • BW4

There are 200 pupils in Year 10: 96 are girls and 104 are boys:

 Girls = 96/200 x 50

         = 24

Therefore I will use 24 pieces of data from the girls.






=LQ - (1.5*IQR)










=UQ + (1.5*IQR)



Boys = 104/200 x 50

         = 26

Therefore I will use 26 pieces of data from the boys.






=LQ - (1.5*IQR)










=UQ + (1.5*IQR)



The minimum value for the girl’s box plot here is 79 and the maximum value is 113. The LQ is 93 and the UQ is 103; the IQR being 10. There is an outlier at either end: the low one being at 69 and the high one being at 120. The mean is 98.72 and the SD is 10.8. For the boy’s box plot, the lowest value is 78, the highest value being 117. The LQ is 90 and the 104; the IQR being 14 and the median being100. The mean is 98, the SD is 10.01 and there are no outliers.

The IQ’s of Year 10 girls and boys are very similar.  The median are exactly the same and there is little variation with the length of the whiskers – suggesting that neither gender is more intelligent than the other. Nevertheless, the girls have outliers (including a very low one) which leads to the conclusion that they are more intelligent in Year 10.

  • BW5

...read more.


1.35 ≤ x < 1.45                1.4        0.1        1        10

1.45 ≤ x < 1.55                1.5        0.1        9        90

1.55 ≤ x < 1.65                1.6        0.1        30        300

1.65 ≤ x < 1.75                1.7        0.1        9        90

1.75 ≤ x < 1.85                1.8        0.1        1        10


The lower classes on the boys’ histogram have a fairly low FD compared to the higher classes. The modal class is 1.55 ≤ x < 1.65. The SD is 0.1168 and the mean is 1.642, which shows that the results are a bit skewed but fairly centred on the mean.

The female histogram has a normal distribution with a mean of 1.6 an SD of 0.0721, which shows that the results are very closely distributed around the mean. The modal class is, again, 1.55 ≤ x < 1.65, showing that the majority of the pupils have an average height.  The classes 1.45 ≤ x < 1.55 and 1.55 ≤ x < 1.65 are higher for girls than boys but the other three classes are lower: because the female histogram follows the normal distribution. Therefore, I believe that girls are taller than boys in general.

In Conclusion, I have been able to fully support 3 of my 4 hypothesis; my third hypothesis has been disproved.  Throughout my coursework I have used a range of calculations and graphs in the appropriate places which are useful and have supported my hypotheses to their full extent. I have hand drawn my Box and Whisker Plots because I have to hand draw some of them but also because it is much easier to do them by hand. There are some limits to my work because I had to sample some of my data. This was done for various reasons a few times, mainly to make my work easier to understand. The problem with sampling is that it does not give a full and fair representation of the population though the results may be the same as the un-sampled data; it is likely that is not as you do not use the entirety of the data available. Improvements could have been made to my hypotheses. I think that I have used all the best methods possible and used them to the best of my abilities.

Denis Twomey 10J                                      30/03/2008

...read more.

This student written piece of work is one of many that can be found in our GCSE Height and Weight of Pupils and other Mayfield High School investigations section.

Found what you're looking for?

  • Start learning 29% faster today
  • 150,000+ documents available
  • Just £6.99 a month

Not the one? Search for your essay title...
  • Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

See related essaysSee related essays

Related GCSE Height and Weight of Pupils and other Mayfield High School investigations essays

  1. mayfield high statistics coursework

    20 = 2.7 rounded of to 3 Note * I then applied the same methods in calculating the number of Girls for my stratified sampling and here is a table of results that I will be using for my data sampling for this particular hypothesis.

  2. Mayfield School Mathematics Statistics Coursework

    whether there are any mathematical techniques that could be used to do so. Flicking through the `Advanced Modular Mathematics: Statistics 1' published by Heinemann, I see that you can calculate a measure of the correlation in the scatter diagram. This measure is called the productmoment correlation coefficient and is considered

  1. Mayfield High Statistics Coursework

    The class widths are therefore 6, 5 and 2. The area of a histogram represents the frequency. The areas of our bars should therefore be 6, 15 and 4. This information can be marked on the grid. A frequency polygon is an easy way of comparing two sets of data frequencies as they can be drawn on the same graph.

  2. Edexcel GCSE Statistics Coursework

    which affects its shape. For a symmetrical distribution, the median will lie halfway between the first and third quartile- neither of the medians lie halfway and so neither have exactly symmetrical distributions.

  1. Maths Statistics Coursework

    This is an accurate and unbiased way of using the data because stratifying the data means that I am not picking the numbers myself this makes the investigation fairer and gives me more accurate results. I chose 150 random numbers because I think that the more you use the more

  2. Investigation on the shape and size of limpets on a sheltered rocky shore called ...

    = (25 + 25) - 2 = 48 The Significance Level is always 5%, this is the level normally used in biological investigations. When we reject the null hypothesis at the 5% level we will be correct in doing so 95% of the time.

  1. Mayfield Maths Coursework

    I will use this data to compare the IQ results with the Key Stage 2 results, to see if there is a correlation or pattern between the two results. To represent my findings I will use a scatter graph - to see if there is a correlation, a histogram -

  2. GCSE Maths Coursework: Statistics Project

    From looking at this bar chart I can see that the average weight is 50<w<60. This frequency polygon shows the comparison between the two year groups. The year 7 girls clearly weigh less. I can see that none of the girls in Year 7 weigh over 60kg.

  • Over 160,000 pieces
    of student written work
  • Annotated by
    experienced teachers
  • Ideas and feedback to
    improve your own work