• Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

AS statistics coursework - correlation coefficient between height and weight in year 11 boys and girls

Extracts from this document...


Antony Georgiou

Statistics Coursework


   The aim of this investigation is to discover if there is a link between two variables and see whether they are dependant or independent on each other. In order to carry this out with reliable results I will need to collect suitable data which I can use statistical methods to calculate and analyse correlation coefficients and regression lines, taking into account any anomalies that may affect the correlation coefficients and regression.

   In this investigation I am going to look into whether or not there is a connection between the height and weight of year 11 boys and girls (I chose height and weight as my variables as I feel that they will have a strong correlation and should be dependant on each other i.e. the taller the person is the more they should weigh. I also chose the background variable of gender to see if this influences the result). The population from which I shall gather my sample is the boys and girls in year 11 from Wilnecote High school. I will gather data on 35 boys and 35 girls chosen randomly to give me a set of data that represents the whole year group.

   To pick people from the year at random I will get a list of every boy and every girl from year 11 on separate sheets and assign a number to each name (1-135 for boys and 1-124 for girls). To randomly choose which students will be used I will use the random function on my calculator (as there were 135 boys on the calculator you press 135, Ran, and then the equals button 35 times taking note of each number)

...read more.















































x total


y total


Height : Weight Males yr 11

Height in m (x)

...read more.


I was correct in thinking that the boys would have a steeper gradient but that doesn’t necessarily mean my reason is correct it could be many different reasons that I have overlooked such as sport they do etc.

  1. I think that height and weight will be very dependant on each other.

I was correct in thinking that height and weight would be dependant on each other.

Modifications That Would Make The Investigation More Reliable

  • Look at different year groups and compare results
  • it could have been more accurate if I took a larger sample (the larger the more accurate).
  • Gather secondary information from the internet looking at national data for height and weight rather than localised
  • Look at different schools year 11 pupils for a wider range of sources to produce a more reliable result
  • Use more precise measuring instruments which could measure to 3 or possibly 4 decimal figures
  • I could have used a stratified sample which would take into account that there are more boys than girls in year 11
  • Take many small samples and collect the averages together this will give you more accurate readings and will also allow you to negate anomalies
  • Look into the subjects background and see how environmental differences such as wealth has a trend in the data

All of the above would make improvements to the accuracy of my results however I feel that the easiest to do with the most impact on reliability would be to increase the sample i.e. use the whole year group. Also with equal ease you could take a stratified sample which would reverse the error which would be caused by the difference in amount of students boys : girls.

...read more.

This student written piece of work is one of many that can be found in our AS and A Level Probability & Statistics section.

Found what you're looking for?

  • Start learning 29% faster today
  • 150,000+ documents available
  • Just £6.99 a month

Not the one? Search for your essay title...
  • Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

See related essaysSee related essays

Related AS and A Level Probability & Statistics essays

  1. Statistics. I have been asked to construct an assignment regarding statistics. The statistics ...

    Both Birmingham and Chelsea have no mode. The differences between the two means are as shown; Football Club Mean Difference Birmingham City 25,461 15,974 Chelsea FC 41,435 Range The range of the two groups of data will be the highest attendance, minus the lowest attendance, when placed in ascending order.

  2. Statistics coursework

    This is necessary to see whereabouts the majority of results lie. After this I have chosen to produce a stem and leaf diagram of girls and boys IQ. This is because a stem and leaf diagram has the same advantages as a bar chart i.e.

  1. Teenagers and Computers Data And Statistics Project

    the cube, so if the front would be Length - 2 and Height - 2 then multiply the 2 together as you would to find the visible area, the same for other surfaces of the cuboid. As in a general cuboid you would have 3 different lengths as in L, W and H, then add them all together.

  2. Maths Statistics Investigation

    173 Mazda Premacy 18000 7 14495 4495 69 175 Ford Cougar 10000 5 21980 7985 63.7 177 Mercedes A-Class 80000 2 17787 12320 30.7 183 Hyundai Santa Fe 10000 3 17995 14420 19.9 190 Ford Escort 10000 10 12770 1225 90.4 195 Citroen Berlingo 90000 7 8995 2550 71.7 196

  1. Investigate the relationships between height and weight

    118 = 12 * Year 10 Boys 94/1183 X 118 = 9 * Year 10 Girls 106/1183 X 118 = 11 * Year 11 Boys 86/1183 X 118 = 8 * Year 11 Girls 84/1183 X 118 = 11 BOYS GIRLS TOTAL YEAR 7 13 15 28 YEAR 8 12

  2. Investigate the relationship between height and weight and how it changes between gender and ...

    Lq 40 Uq 48.75 Iqr 8.75 The outliers are 26.875 and 61.875 but there are no anomalies is the data Year 7 Males Height (Lower Quartile and Upper Quartile) Lq 147 Uq 159.5 Iqr 12.5 The outliers are 134.5 and 172 Weight Lq 39.5 Uq 49.5 Iqr 12.5 The outliers


    I will use box plots to derive how dispersed the data is, how varied the data is. This will allow me to clear relationships between the samples strata. I will use measure of spread to compare the sample data considering with the same sample but this time excluding all factors.

  2. Statistics Coursework

    the age of the students and their attendance figures at school or there is no relationship at all. However, the students' appreciation of the importance of their attendance figures does and this is why (in my opinion) the attendance figures vary between students.

  • Over 160,000 pieces
    of student written work
  • Annotated by
    experienced teachers
  • Ideas and feedback to
    improve your own work