The relationship between height and weight amongst different ages and genders


Statistics Coursework


PLAN

Introduction

I have been given a set of data about pupils attending a school called Mayfield High School. From this I have extracted each pupil’s year group, age, height, weight and gender. I have decided to compare the heights and weights of the pupils and to see how they differ between year groups and genders.

Hypothesis

I think that as the year group increases, the heights and weights will increase, but at a decreasing rate. I base this on my own experience: since year 7 I have increased in height and weight, but not at an increasing rate.

Sampling

The data set which I have been given is very large, so I need to reduce it to make it easier to work with and analyse. Therefore, I will take a stratified sample, which means I will pick a fraction of the data in such a way that the sample still represents the overall data. For example, if 51% of the pupils are male, then 51% of my sample will be male. This is actually the case: approximately 51% of the pupils in the school are boys and 49% are girls. This may not hold in individual year groups, but I don’t have the distribution of boys and girls for each year group, so I will have to presume that the distribution within each year group is the same as for the whole school. I have decided to reduce the data to 120 pupils. I feel that this is a good number to work with because it is enough data to find accurate trends, but not so much that I will be overwhelmed by the volume of data to analyse.
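The stratified sampling described above can be sketched in Python. This is only an illustration under stated assumptions: the population of 1000 pupils, the record layout and the function name `stratified_sample` are hypothetical and are not taken from the Mayfield data. Only the 51%/49% split and the sample size of 120 come from the text.

```python
import random
from collections import defaultdict

def stratified_sample(pupils, key, sample_size, seed=0):
    """Pick about sample_size pupils so each group keeps its share of the population."""
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    groups = defaultdict(list)
    for pupil in pupils:
        groups[key(pupil)].append(pupil)
    total = len(pupils)
    sample = []
    for members in groups.values():
        # Each stratum contributes in proportion to its size. Because of
        # rounding, the final total can differ slightly from sample_size.
        quota = round(sample_size * len(members) / total)
        sample.extend(rng.sample(members, quota))
    return sample

# Hypothetical population: 1000 pupils, 51% boys and 49% girls.
population = (
    [{"id": i, "gender": "M"} for i in range(510)]
    + [{"id": i, "gender": "F"} for i in range(510, 1000)]
)

sample = stratified_sample(population, key=lambda p: p["gender"], sample_size=120)
boys = sum(1 for p in sample if p["gender"] == "M")
girls = len(sample) - boys
print(boys, girls)  # 61 boys and 59 girls, close to the 51%/49% split
```

If the number of boys and girls in each year group were known, the same function could stratify by both at once by passing a key that returns a (year group, gender) pair.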

        The data is from a secondary source, which means that it had already been collected before I received it. If it were primary data, I would have collected it myself. Despite it being secondary data, I still feel that it is reliable because human heights and weights change very little between generations; therefore the data is not yet “out-of-date”.


        If I see any data point which does not fit the general trend of the data (an outlier), then I will remove it, but only if it is the only point which lies far from the rest.
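The outlier rule above could be applied as in the following sketch. The text does not say how "far off" is measured, so this assumes the common 1.5 × interquartile range rule, which is my assumption and not the author's stated method. The heights are made-up values and the function names are hypothetical.

```python
import statistics

def find_outliers(values):
    """Return values lying more than 1.5 * IQR beyond the quartiles (assumed rule)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [v for v in values if v < low or v > high]

def remove_lone_outlier(values):
    """Drop an outlier only when exactly one value is flagged, as the plan states."""
    outliers = find_outliers(values)
    cleaned = list(values)
    if len(outliers) == 1:
        cleaned.remove(outliers[0])
    return cleaned

# Hypothetical heights in metres; 4.65 looks like a recording error.
heights_m = [1.52, 1.55, 1.58, 1.60, 1.61, 1.63, 1.65, 1.68, 1.70, 4.65]

print(find_outliers(heights_m))
print(remove_lone_outlier(heights_m))
```

When several values are flagged, the list is returned unchanged, which matches the decision to remove a point only if it is the only one far from the rest.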

        This table shows the population statistics of the school and my sample:

Calculations

These are the calculations which I will do:

Median vs. mean – I feel that in theory, working out the median will be more reliable in some ways than the mean. This is because the median is not affected by outliers as much as the ...
