Introduction

Mathematics

Statistics Coursework

Aim

My aim for this project is to conduct a statistical investigation. I will form some hypotheses and try to prove or disprove them.

For my project, I have been asked to select data about pupils' height and weight to see if there is a relationship between the two.

This will then help to develop some lines of enquiry for the statistical investigation.

I will use stratified random sampling throughout the investigation.

I will then present my investigation clearly, using scatter graphs, histograms, box plots and cumulative frequency diagrams to help guide the reader through the process of investigating the data and concluding with the final hypothesis.

I am going to use a variety of statistical methods to analyse the data and to compare the relationship between the weight and height of pupils from Year 7 to Year 11.

To help me with my investigation, I have received data from Mayfield High School. The data covers 1889 pupils from Year 7 up to Year 11. I have decided to take a sample of 20% from each year group, as this size is manageable yet large enough to give results that represent the entire population.

Sampling is very important when looking at such a large amount of data. It makes the investigation easier and more enjoyable.

There are many possible sampling methods; however, I have decided to look at three of them and choose the most common and effective one.
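The idea of taking 20% from each year group can be sketched in a few lines of Python. This is only an illustration: the function name `stratified_sample`, the pupil labels and the year-group sizes are made up for the example and are not the Mayfield figures.

```python
import random

def stratified_sample(pupils_by_year, fraction=0.2, seed=None):
    """Pick a simple random sample of the given fraction from each year group."""
    rng = random.Random(seed)
    sample = {}
    for year, pupils in pupils_by_year.items():
        # Number of pupils to take from this year group, to the nearest whole pupil.
        k = round(len(pupils) * fraction)
        # rng.sample picks k different pupils without repeats.
        sample[year] = rng.sample(pupils, k)
    return sample

# Hypothetical year groups, for illustration only.
pupils_by_year = {
    "Year 7": [f"Y7-{i}" for i in range(50)],
    "Year 8": [f"Y8-{i}" for i in range(40)],
}
sample = stratified_sample(pupils_by_year, fraction=0.2, seed=1)
```

Because every year group contributes the same fraction, larger year groups contribute more pupils, so the sample keeps the same proportions as the whole school.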

Systematic Sampling

“If a sample of size s is to be taken from a population of size n, then every n/s member of the population is tested. The starting point is chosen at random.”
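The definition above can be sketched as follows. This is a minimal example, assuming the population is held in a list and that the step n/s is rounded down to a whole number; the function name `systematic_sample` and the numbered population are hypothetical.

```python
import random

def systematic_sample(population, sample_size, seed=None):
    """Take every (n/s)th member of the population, starting at a random point."""
    n = len(population)
    if not 0 < sample_size <= n:
        raise ValueError("sample_size must be between 1 and the population size")
    step = n // sample_size          # the gap between chosen members
    rng = random.Random(seed)
    start = rng.randrange(step)      # random starting point within the first gap
    return [population[start + i * step] for i in range(sample_size)]

# Hypothetical population of 100 numbered pupils, for illustration only.
population = list(range(100))
chosen = systematic_sample(population, 20, seed=3)
```

With 100 pupils and a sample of 20, the step is 5, so every fifth pupil is chosen after the random start.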

I have grouped the data from Years 7, 8 and 9 into grouped data tables. From this information I will create various graphs and diagrams. These tables will also help me to organise my data and make it clearer and neater.

One of the reason why I

Conclusion

As a result of my previous hypothesis, I now believe that my investigation would benefit more if I were to sample a larger variety of pupils from Mayfield High. This would enrich my investigation of the hypothesis by providing a more representative sample of the school, thereby making the results more reliable.

My third hypothesis aimed to prove that Year 7 females would be the lightest pupils in the school, whereas Year 11 males would be the heaviest. I used calculations of the mean and median, as well as frequency diagrams, box plots and histograms. These were all essential in order to provide enough evidence for my investigation. I have also noticed that my graphs show that boys appear to be heavier than girls overall, regardless of year group; however, when the year groups are looked at individually, this is not always true.

If I were to carry out this investigation again, I would make a few changes before starting. One of them would be to perform a vertical dispersion on my scatter graphs, as this would provide more accurate results when looking at the line of best fit and enable me to produce more accurate predictions. Furthermore, I would use a larger sample of data, as this would reduce the effect of outliers and inaccurate results. In addition, the correlation coefficient would then have given a more reliable measure of the relationship. This would help me to get a better idea of whether the correlation for males or females was stronger.

Although there are a few things that I would change if I got the chance to undertake this investigation again, I believe that the Mayfield High School investigation provides reliable evidence that supports my hypothesis.
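As a rough illustration of how the correlation coefficient and the line of best fit could be calculated, the sketch below uses the standard product-moment formula and the least-squares line. The function names and the heights and weights are hypothetical values chosen for the example, not results from the Mayfield data.

```python
from math import sqrt

def pearson_r(xs, ys):
    """Product-moment correlation coefficient between two lists of values."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

def best_fit_line(xs, ys):
    """Gradient and intercept of the least-squares line y = gradient * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    gradient = sxy / sxx
    intercept = mean_y - gradient * mean_x
    return gradient, intercept

# Hypothetical heights (m) and weights (kg), for illustration only.
heights = [1.50, 1.55, 1.60, 1.65, 1.70]
weights = [42.0, 47.0, 49.0, 55.0, 58.0]
r = pearson_r(heights, weights)
gradient, intercept = best_fit_line(heights, weights)
```

A value of r close to 1 would suggest a strong positive correlation, so working out r separately for males and females would allow the two relationships to be compared.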

