have chosen to show the relationship between height and weight. The main reason for this is because the data for height and weight is continuous, unlike eye and hair colour and KS2 results which are discrete or qualitative

Authors Avatar

Ben Krawiec                File: Maths-PAJ/Mayfield Coursework/Mayfield High School.doc

Mayfield High School – Edexcel GCSE Coursework

Introduction:

The data we have been given is taken from a real school, while Mayfield High School is a fictitious school. The school consists of 1183 pupils, of which there is the following number of pupils:

For each child the following data was provided: Age, Year Group, IQ, Weight, Height, Hair Colour, Eye Colour, Distance from home to school, Usual method of travel, Number of brothers/sisters, & KS2 Result in English, Mathematics and Science. This gives us a total of 31941 datum points (1183 x 27).

There is a number of different possible lines of enquiry that could be follow, some examples of these are:

  1. the variations in hair colour,
  2. the variations in eye colour,
  3. the relationship between the above two colours,
  4. the distance travelled to school,
  5. the relationship between height and weight,
  6. the relationship between KS2 results,
  7. the relationship between IQ and KS2 results,
  8. the height to weight ratio in terms of body mass index.

From the lines of enquires, I have chosen to show the relationship between height and weight. The main reason for this is because the data for height and weight is continuous, unlike eye and hair colour and KS2 results which are discrete or qualitative, continuous data I can put it into Box Plots, Histograms, Pie Charts, Scatter Graphs and Stem & Leaf diagrams.

Hypothesis:

Before analysing the data I can make hypothesises to show, how what results I am expecting to get. These hypothesise are:

  1. There will be a positive correlation between weight and height, as when a person gets taller, they will also become heavy, due to bone and muscle growth.
  2. There will be a stronger correlation in the males’ height and weight than the females’, as males tend to be taller and less bothered about their appearance, this will cause them to be heavier.

Weight against Height:

As we have been given data for 1183 students, it would be very time consuming to analyse all of the data, therefore we need to take a sample. There are two different samples I could use; the first is a random sample. A random sample would be impractical to use as I could end up with the data for all year 7 boys for example and not get a view of the whole school. Instead of a random sample, I must use a random stratified sample must be used. This means I can take a percentage of each age group and sex, and choose a random sample from that so there is no risk of picking everybody with the same weight or height, and still keep the same ratios that the original population had. I first used a 10% stratified sample which gave me the following number of pupils to analyse, I did this to give a large number of pupils, but not too many.

This gave me 119 pupils’ data to analyse. Using Excel software I can create a random sample, by giving each pupil a random number. This is done using a formula: “=RAND()”. After giving each pupil a random number, I then need to sort the data, so that the list is no longer in alphabetical order, this is done by clicking Data, Sort and then selecting the “random” number column and finalises it by selecting “OK”. I then took the first 10% from each stratum.

Before analysing the data I took out any anomalies in the data, and replace this with the next student. Examples of anomalies are those such as a weight of 5 kilograms or a height of 143 metres.

Join now!

From the data I have selected I can create charts to show the correlation between weight and height, using a best fit or trend line. The chart looks as follows:

From the above chart, with a best fit line, I can see that the line shows positive correlation. This is because the points appear to lie close to the trend line and the line is going upwards (positive) direction. On the chart I have put the data for R which is correlation. The figure I get for R is 0.4412, which on a chart with ten points ...

This is a preview of the whole essay