Mayfield High School Data Handling Coursework

Introduction

In this investigation, I have been given data from Mayfield High School. Although the name of the school is likely to be fictional, the data itself has come from real students. The data covers students from Year 7 to Year 11, with information on around 170 students in each year. The following information is recorded for each student: year group, surname, forename, age, gender, hair colour, eye colour, handedness, favourite colour, favourite music, favourite sport, favourite subject, favourite TV programme, average hours of TV watched, IQ, height (m), weight (kg), distance travelled to school, means of transport to school, number of siblings, number of pets, and Key Stage 2 results for maths, English and science.

There is too much data to examine properly all at once, so I will use Excel to filter it and concentrate on IQ and KS2 results. Both of these are quantitative (numerical) data, which is what I need. This is good because I will be able to summarise the data using statistics. I will also give each student a reference number, which will help me when I am doing random sampling.

I am going to examine two hypotheses: "Boys are cleverer than girls" and "The higher the IQ, the higher the KS2 results will be". I expect students with a higher IQ to have higher KS2 results because IQ is a measure of their intelligence, so they should be scoring good grades. I am looking at Key Stage 2 results because I expect them to reflect the students' IQ. However, there are some anomalies: students with a high IQ who did not achieve the best results in the KS2 SATs. One possible reason is that the student missed a lot of lessons before the KS2 SATs, which would have had a negative impact on their results. I am looking at IQ and KS2 results because:

* First, the data is numerical, which allows me to work out summary statistics.

* I will be able to use standard deviation, which allows me to compare the genders directly. I can then refer the results back to my hypothesis to see which gender is more intelligent.

* I will also be using bivariate analysis, which means I can represent the data in various ways, such as scatter diagrams or other graphs with a line of best fit.
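
The calculations named in the list above could be sketched in Python as shown here. This is only an illustration, not the method used in the coursework, which relies on Excel. The function names `summarise` and `line_of_best_fit` are my own, the numbers are made up rather than taken from the Mayfield data, and I have assumed the population form of standard deviation and a least-squares line of best fit.

```python
import statistics


def summarise(values):
    """Return (mean, population standard deviation) for a list of numbers."""
    return statistics.mean(values), statistics.pstdev(values)


def line_of_best_fit(xs, ys):
    """Return (gradient, intercept) of the least-squares line y = gradient*x + intercept."""
    mean_x = statistics.mean(xs)
    mean_y = statistics.mean(ys)
    # Sum of products of deviations, and sum of squared deviations in x.
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    gradient = sxy / sxx
    intercept = mean_y - gradient * mean_x
    return gradient, intercept


if __name__ == "__main__":
    # Made-up example values, NOT real Mayfield students.
    example_iq = [92, 98, 101, 105, 110]
    example_ks2_total = [11, 12, 13, 13, 15]

    print("IQ mean and standard deviation:", summarise(example_iq))
    print("Gradient and intercept:", line_of_best_fit(example_iq, example_ks2_total))
```

A smaller standard deviation for one gender would mean that its IQ scores are more closely grouped around the mean, and a positive gradient would support the second hypothesis.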


There are two ways of selecting students at random using the reference number assigned to each of them. I could put the names of all the boys (604) into a hat, pick out 15 at random and record the names. However, this is not appropriate in this case, as there are far too many people and the hat would have to be very big. Instead, I will use the random number button on my calculator.
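
As an illustration only, the same idea as the calculator's random number button can be sketched in Python. The function name `pick_sample` and the `seed` parameter are my own and are not part of the coursework. The sketch assumes that the boys have been given reference numbers from 1 to 604 and that no student should be picked twice.

```python
import random


def pick_sample(population_size, sample_size, seed=None):
    """Pick distinct reference numbers at random from 1..population_size."""
    rng = random.Random(seed)  # a fixed seed makes the sample repeatable
    chosen = rng.sample(range(1, population_size + 1), sample_size)
    return sorted(chosen)


if __name__ == "__main__":
    # 604 boys and a sample of 15, as described in the text above.
    print(pick_sample(604, 15))
```

Using `random.sample` rather than generating one number at a time avoids picking the same reference number twice, which can happen with a calculator and would then need a re-draw.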

Data Collection

There are 812 students and we are going to look at a sample of those students. I am going ...
