Statistics Coursework - Bivariate Data.

Bivariate Data                Shahida Jaffer

Statistics Coursework

Bivariate Data


Moving to a new area with so much choice, my parents are unsure which middle school my brother should go to.  They want to find a school that doesn’t just do well in one subject at Key Stage 2, but in at least two of the three subjects.  I am going to investigate this by looking at the percentage of children achieving level 5 in maths and science at Key Stage 2, to see whether there is a positive correlation between the two.  This will be shown if a school that does well in maths does just as well in science, and a school that does badly in maths does equally badly in science.


The data I have used was collected from the ‘Department for Education and Skills’ website, under the section of performance tables for primary schools at Key Stage 2.  I chose the data for schools within 15 miles of my postcode (MK5 8BS), with the nearest listed first.

Now I will do a hypothesis test using Pearson's Product Moment Correlation Co-efficient (PPMCC).  This is calculated using the formula below:

[Formula image: image00.png]

This is easier than it looks!  The first step is to calculate the following:
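
The list from the original is not reproduced in this extract. As an assumption, a standard set of summary quantities for the PPMCC is sketched here, taking n as the number of schools, x as each school's maths percentage and y as its science percentage (these symbol choices are illustrative, not taken from the original):

```latex
S_{xx} = \sum x^2 - \frac{\left(\sum x\right)^2}{n}, \qquad
S_{yy} = \sum y^2 - \frac{\left(\sum y\right)^2}{n}, \qquad
S_{xy} = \sum xy - \frac{\left(\sum x\right)\left(\sum y\right)}{n}

r = \frac{S_{xy}}{\sqrt{S_{xx}\, S_{yy}}}
```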


And then put all of these into the formula to find r (which is always between 1 and -1).  Using programs such as Microsoft Excel, you can highlight the data and the computer can automatically calculate the PPMCC.
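
As an alternative to a spreadsheet, the same calculation can be sketched in a few lines of Python. This is a minimal sketch, assuming the standard summary-statistics form of the PPMCC; the function name `ppmcc` and the sample percentages are hypothetical and are not the data used in this coursework.

```python
import math


def ppmcc(xs, ys):
    """Return Pearson's product moment correlation coefficient for paired data."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    n = len(xs)
    sum_x = sum(xs)
    sum_y = sum(ys)
    # Summary statistics Sxx, Syy and Sxy
    s_xx = sum(x * x for x in xs) - sum_x ** 2 / n
    s_yy = sum(y * y for y in ys) - sum_y ** 2 / n
    s_xy = sum(x * y for x, y in zip(xs, ys)) - sum_x * sum_y / n
    if s_xx == 0 or s_yy == 0:
        raise ValueError("r is undefined when one variable has no spread")
    return s_xy / math.sqrt(s_xx * s_yy)


# Hypothetical percentages of pupils at level 5 (NOT the schools in this study)
maths = [20, 35, 40, 55, 60]
science = [30, 38, 52, 60, 71]
r = ppmcc(maths, science)
print(round(r, 4))
```

Working from the sums directly mirrors the hand calculation, so each intermediate value can be checked against a calculator.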

Doing this, the PPMCC is 0.8528.  This backs up my thought that my variables have a strong positive correlation, as the value is close to 1 (perfect correlation is at 1 or -1).

I will now carry out a hypothesis test on the correlation co-efficient, using it to draw a conclusion about ρ (the parent population correlation co-efficient).  The value of r is the test statistic, and this will be a 1-tailed test at a 5% significance level.

Important things to know:

  • The null hypothesis, H0, represents a theory that has been put forward, either because it is believed to be true or because it is to be used as a basis for argument, but has not been proved.
  • The alternative hypothesis, H1, is a statement of what a statistical hypothesis test is set up to establish.
  • The final conclusion once the test has been carried out is always given in terms of the null hypothesis. We either 'reject H0 in favour of H1' or 'do not reject H0'; we never conclude 'reject H1', or even 'accept H1'.
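
The decision rule can be sketched in Python as follows. This is only a rough illustration under stated assumptions: the hypotheses are taken to be H0: ρ = 0 and H1: ρ > 0, the sample size n = 30 is hypothetical (the extract does not give it), and the Fisher z normal approximation is swapped in for the critical-value table that a coursework test would normally use. The function name is also made up for this example.

```python
import math
from statistics import NormalDist


def one_tailed_correlation_test(r, n, alpha=0.05):
    """Approximate 1-tailed test of H0: rho = 0 against H1: rho > 0.

    Uses the Fisher z transformation, which is an approximation and not
    the table look-up normally used for the PPMCC.
    """
    if n <= 3 or not -1 < r < 1:
        raise ValueError("need n > 3 and -1 < r < 1")
    z = math.atanh(r) * math.sqrt(n - 3)   # approximately N(0, 1) under H0
    p_value = 1.0 - NormalDist().cdf(z)    # upper tail only (1-tailed)
    return p_value, p_value < alpha


# r is the value reported above; n = 30 is an assumed sample size
p_value, reject_h0 = one_tailed_correlation_test(0.8528, n=30)
if reject_h0:
    print("Reject H0 in favour of H1")
else:
    print("Do not reject H0")
```

Note that the conclusion is worded in terms of H0 in both branches, in line with the last point above.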


So, one can conclude that if a school has a high percentage of students doing well in maths, then it tends to have a similarly high percentage of students doing well in science and achieving level 5.  Similarly, a school that performs poorly in maths tends to perform similarly poorly in science.

This means that if my parents want to find a good primary school for my brother, then they should choose a school which has a high percentage of students doing well in maths and science.


There are many different ways I could do this if I were to repeat the investigation.  If my parents decide what type of school they want to send my brother to (e.g. public or private), then I could sort the data into these categories first, and then sample and test.  Another thing that I could do is remove all the schools that have percentages below the national average to see if this makes a difference to the outcome of my hypothesis test.

This student written piece of work is one of many that can be found in our AS and A Level Probability & Statistics section.
