Data handling investigating height and weight.

Authors Avatar

                

Data Handling Coursework

For my data handling coursework I will be investigating Height and Weight.  I will also be investigating how gender and age affect these.  I need to collect information on Height, Weight, Age and Gender.  The information I am using has been collected by a secondary source so the data should be accurate.  I can foresee that, I could have a problem with data if it’s incorrect, so, if I get any incorrect data I will ignore it.

The following table shows information about the number of boys and girls in each year at Mayfield High School.

Hypotheses

  1. My first hypothesis is: There will be a positive correlation between height and weight.

  1. My second hypothesis is: Is Height affected by age and gender.

  1. My third hypothesis is: Does body mass index change for different years and genders.

Hypothesis 1

To test this hypothesis I will firstly collect a random sample of 100 students.  To collect this random sample I will use my calculator: I will use the random number generator and then multiply it by 1183, the number I get will be a number of a pupil.  If the calculator selects a number that I have already chosen I will ignore it.

The Data: Here is the random sample of 100 students from the school

Now I have collected the random sample I can find out if the correlation between height and weight is positive or negative.  To find this out I will plot the results on a scatter diagram, and then put a line of best fit on.  With the line of best fit I can predict results.

As you can see from the scatter diagram that my hypothesis was correct, there is a positive correlation between height and weight.  This also shows that in general the taller you are the more you weigh.  I can also use the line of best fit to predict someone’s height if I have there weight, or vice-versa, for example, if someone is 1.5m tall then they will be around 45Kg.  But if I want a more accurate way of predicting people’s weights then I will need to find out the equation of the graph.

To work out the gradient I will find two points on the line of best fit for example (1.40x1,40y1) and (1.50x2,45y2) then I will subtract x2 form x1 to give me 0.1x and subtract y2 from y1 to give me 5y.  Then I will divide y by x to give me 50.  Now I have the gradient y=50x I need to work out were the line of best fit crosses the y axis:

I now know that the graph crosses the y-axis at –33.  So the equation for my line of best fit is y-50x-33.  But the computer generated equation is y=52.23x-33.04 which is a lot more accurate than mine so I will used this equation to find out someone’s weight if they are 1.5m tall.

Y=52.23x-33.04

=52.23x1.5-33.04

=45.305

=45Kg

Join now!

The scatter diagram also shows that some pupil’s results are a long way from the line of best fit, this could be because I have used the whole school, therefore the results are very spread out, this could be why the line of best fit show that someone who is 0.6m tall would weigh around 0kg.

Hypothesis 2

To test this second hypothesis, which is:  is Height affected by age and gender; I will firstly need to collect a stratified sample of 30 pupils from years 7, 9 and 11.  I have decided to use this sampling technique because ...

This is a preview of the whole essay