Statistic Coursework

Authors Avatar


Aims, Design and Strategy

The aim of this investigation is to discover if there is a link between two variables and see whether they are dependant or independent on each other. In order to carry this out with reliable results I will need to collect suitable data which I can use statistical methods to calculate and analyse correlation coefficients and regression lines, taking into account any anomalies that may affect the correlation coefficients and regression.

In this investigation I am going to be looking into whether more females prefer creative subjects to males, and then in turn whether males prefer logical subjects.  I believe that female students will prefer the creative subjects such as Art and Music as when they were younger they may have spent more time practising these with family and friends. I also believe that females are better at creative subjects then males, and if a student is good at a subject they enjoy it more. I feel that this data should provide me with a strong link. I will also investigate whether there is a link between the amount of television watched and gender. If my first hypothesis is proven correct then I will go on to test whether males watch more telvison than females. I believe that if they have chosen more logical subjects than they will also watch more television having spent less time doing the creative things as a child. I hope that this will show a good connection.

I plan to use to use comparative pie charts and comparative cumulative frequency polygons to evaluate the creativity levels of students' favourite subjects by gender and histograms to prove my secondary hypothesis.

Selection and Collection of Data

Join now!

I will be using Mayfield High data provided by the exam board. This has data from students between the ages of 11 and 16. I have chosen to use this data, as my hypothesis requires the use of males of which there are none at my school and although this data is not primary, I believe it to be a more reliable source than my own data.

The total population of the data is 1187, which is too large to manage so I will take a sample size of 75 which should be enough to get accurate results ...

This is a preview of the whole essay