Height and weight

Authors Avatar

General Introduction

        I am going to try and prove both hypotheses by comparing the data in numerous ways. I was given as a secondary source that has been sorted out into categories the data is about a fictitious school but contained real data. There are 1200 pupils at Mayfield school with 18 headers of data overall I was given 21600 datum points and have edited out information that will not be used in these inquiries for example ‘Distance from home to school’. I will also need to delete rows that are missing out data which I need for example if someone doesn’t have their height marked into the database that row with that persons information will need to be deleted. Therefore from starting with 1200 pupils and taking away 21 students that are missing data I am left with 1179 as an overall sample. I will be taking a sample of all 1179 students ii is important that the spread is large so that the fairer the random sample will be therefore I will take around 60 students.

 

Hypotheses

Hypothesis 1

‘The People that watch the most television weigh more then the people that watch less television.’

Hypothesis 2

‘There is a relationship between height and weight.’

Hypothesis 1

‘The variation in the amount of hours spent watching television; the students of Mayfield School watch a lot of television.’

        First I must get a random sample of all the data otherwise the relationship will be too hard to find because there is too much data.

         I will do this by using the RND# function on my calculator. Once I have got over 60 random numbers they will need to be timed by the amount of samples because then the number I am given I will select that record and add it to my top 60, but if I times it by the sample and I get a number bellow 0.4 or less when I round it down I will get zero however there is no zero record therefore I will round all numbers up to no decimal places. To compare the data I will be measuring the correlation by drawing a cumulative frequency curve.

        By scanning throw the random 60 and checking the amount of hours watching television I can develop the table above to plot the points of the cumulative frequency curve.

Now I am able to work out each quartile and divide the graph into a smaller section.

Mean:

        Another way to work out the median would be to use the following formula:-

Join now!

Where x with a line over it is equal to the Mean, Σ is the sum of, f is equal to the frequency and x is equal to the mid point of the grouped data. From using this formula we can work out that the mean is:-

Therefore the mean equals 12.96610169

Median:

To work out the median I halved the cumulative frequency by dividing it by 2, find that point on the axis, draw a line across to the curve and then down to the horizontal axis ...

This is a preview of the whole essay