Statistics coursework

Authors Avatar

Karen Hemmings

10A1

Maths statistics coursework

Hypothesis: Generally girls have a higher IQ than boys and therefore have a higher total of KS2 (Key Stage 2) results. However, year 7 KS2 results are higher than the year 11’s KS2 results.

I firstly believe girls have a higher IQ than boys as girls find it easier to concentrate and are more able to work hard over a long period of time. This therefore would affect their KS2 results, as it would mean girls are more able to retain all the information needed in SATs.

Although, I think year 7’s KS2 results will be higher than year 11’s as the tests have become easier over the five years and the teaching has become better too.

Plan

First of all I will filter my data so that I am only left with data for all the year 11s and all the year 7s. I then will delete all the inappropriate columns of data i.e. favourite subject, leaving me with the following columns:

Year Group, Gender, IQ, English, Maths and Science KS2 results

Both these steps will remove any data that is unnecessary in my investigation so that I am left with only suitable data making it easier and quicker for me to use the records.

Following this I will look through the data for any anomalies such as a child achieving a level 7 in their KS2 SATs as this is not possible with the tests taken. I would also exclude any records with some data not present. If there are any I will delete them so that my data is accurate and therefore my final conclusion will be truthful. Once I have done this I will count the total number of year 7 girls I have left along with the total number of year 7 boys, year 11 girls and year 11 boys. Subsequently I will find the sum of these figures so that I can use stratified sampling with the equation:

(Number in strata/population) x sample size

I will use stratified sampling as the number of pupils in each year group is different.  By using this method I will be getting a representative proportion of each year group making my data fair.

Once I have calculated the sample size for each strata I will use random sampling (using the Ran# button on the scientific calculator) to select the calculated number of pieces of data from the strata. However, before that I will set my calculator to be fixed on zero decimal places in order to avoid having to round numbers which would increase the chances of repeated figures.

Before I begin to use the data to prove my hypothesis I will first add an extra column on to my table of data. This will be a column with the total KS2 results of each child. This is so no workings out will have to be done with later graphs.

Once I have completed this I will have all the data I need. My first step in my investigation will be to produce a cumulative frequency graph of girls and boys IQ. On the same graph I will have two lines – one for males IQs and one for females IQs. This will make comparing the two easier and will mean my conclusion is likely to be more accurate. I have chosen to use a cumulative frequency graph as it gives you a general idea of any trends. It also allows you to see the median IQ for each gender along with inter-quartile range. This is essential as the inter-quartile range excludes extremes that could affect the data, whilst the median highlights the IQ in the middle of all the data if it were in ascending order.

Subsequently I have decided to produce a box and whisker diagram of girls and boys IQs. This is simply to make the inter-quartile range clearer so that any extremes are obvious. It is also to show how the data is spread around the median, i.e. whether there is a small range above the median or not. This is necessary to see whereabouts the majority of results lie.

After this I have chosen to produce a stem and leaf diagram of girls and boys IQ. This is because a stem and leaf diagram has the same advantages as a bar chart i.e. using clear bars to show the distribution of data, whilst also retaining the original data so mode, median and mean can be calculated from it. This is needed as the mode shows the most common results, the median shows the middle value when the data is in order and the mean shows a representative figure for the data you have collected.

All these diagrams will either prove or disprove the first part of my hypothesis - that girls have a higher IQ than boys. So the next stage will be to compare the IQs of boys and girls to their total KS2 results. I will do this by using a scatter diagram as this enables me to draw a line of best fit making the relationship between IQ and total of KS2 results, if any, clear. To then investigate how this affects the two sexes KS2 results I will draw another cumulative frequency graph. This is to get a general picture as to whether there is a link between girls having higher IQs than boys meaning they must have higher KS2 results too. This will make my end conclusion more reliable as the median and inter-quartile range can also be calculated from this graph.

Join now!

Following this I will set to prove that year 7’s KS2 results are higher than year 11’s KS2 results. I shall do this in the same way I proved the first part of my hypothesis. I will start with a cumulative frequency graph of year 7’s KS2 results and year 11’s too. This is for the same reasons as I used the same type of graph in the first part of my investigation. As before, I will use a box and whisker to show the same data as on the graph but with extremes being made obvious. Then I ...

This is a preview of the whole essay