• Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

height and foot size

Extracts from this document...

Introduction

GCSE Coursework: Statistics Investigation by Stephanie Liu

Hypothesis 1

I predict that the taller the pupil is, the bigger their foot size will be.

Plan

I’ve been given 60 pieces of data from pupils, about their height and foot size.

I will be using a piece of software called Fathom where I will place this information into a scatter graph, to see whether or not my hypothesis is correct. Fathom will produce a line of best fit on my graph and tell me what my r-value is. The r-value shows the product moment correlation coefficient. I am expecting a positive correlation. To prove that my hypothesis is correct, I am looking for a product moment correlation coefficient from something between 0 to 1 and the closer the line of best fit is to 1; the more evidence there is to back up my hypothesis.

The product moment correlation coefficient is a measurement of the degree of scatter. It is usually denoted by “r” sometimes referred to as the “r-value” and “r” can be any value between -1 and +1. It can be used to tell us how strong the correlation between two variables is. A positive value indicates a positive correlation and the higher the value, the stronger the correlation. Similarly, a negative value indicates a negative correlation and the lower the value the stronger the correlation.

...read more.

Middle

This scatter graph only shows the heights and foot sizes of the boys. As I previously expected, there is a strong positive correlation as for this graph the r-value is 0.860 (square root of 0.74). Evidently, this r-value is a better correlation than the original one, meaning that the equation is more accurate to work out the height or foot size of a male. This might be because they may have stopped growing and therefore their feet and height are in a better proportion. Within this scatter graph there are a few anomalies. For example, there is a boy whose foot measures only 24cm and has a height of approximately 180cm, which is almost 30cm taller than that which the line of best fit predicts. As before, looking at the data for males only, I can see that there is a very high positive correlation, which means that long footed males will tend to be tall. The value for the coefficient is higher than for mixed gender, which concurs with my hypothesis.

Now I am going to repeat the same process to find a better equation to work the height and foot size for female pupils, although this suggests that the relationship for females should be weaker. I will plot a scatter graph for females only and see if this is true.

...read more.

Conclusion

Conclusion

I was given 60 pieces data on pupils’ heights and foot sizes from a school then I investigated the relationship between the foot size and height put placing all the provided data in a scatter graph. Next to make my equations more accurate, I separated the genders and investigated foot size and I found out that there was a better equation for males but I couldn’t get one for females. From my results, generally males had a wider range and was taller and had a bigger foot length than females although I cannot be sure and will need to do further testing.

If I were to re-do this investigation I would like to investigate the age of the pupils from who I have received the data. I think this would have made my investigation better because it would give me some clues to why I got the trends that I got. It could also affect my hypothesises. I would extend the number of pupils I used in this investigation so that I got a wider range of data and may give us more accurate views of the relationships for example something like 600 pieces of data would be significant to draw up conclusions whereas now, as far as my hypothesises are concerned, they are inconclusive and I can’t make a judgement about whether their correct or not. In addition, I would like to remove any anomalies and see how this affects the value of r. This may mean that my r-value goes up.

...read more.

This student written piece of work is one of many that can be found in our GCSE Height and Weight of Pupils and other Mayfield High School investigations section.

Found what you're looking for?

  • Start learning 29% faster today
  • 150,000+ documents available
  • Just £6.99 a month

Not the one? Search for your essay title...
  • Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

See related essaysSee related essays

Related GCSE Height and Weight of Pupils and other Mayfield High School investigations essays

  1. A hypothesis is the outline of the idea/ideas which I will be testing and ...

    Hypothesis 2 'Key Stage 4 Students who watch more hours of television on average have a lower IQ Level' Planning For this particular hypothesis I will be only be using the data of the School Year of 10 and 11 due to the fact I have specifically chosen to investigate

  2. Show that different people have different reaction times according to their gender and the ...

    From this graph there doesn't appear to be any correlation of hand spans to reaction times. But I am going to further my investigation into this. I will use Spearmans Rank Coefficient to fine out how much of a correlation there is.

  1. Data Handling - Height and Foot Length

    159 22 123 141 21 138 169 22 139 164 23 140 140 23 144 149 23 Random Male Height MMMR CLASS INTERVAL MID POINT FREQUENCY MID POINT X FREQUENCY 140 < h < 145 142.5 11 1567.5 145 < h < 150 147.5 8 1180 150 < h <

  2. Guesstiamte - investigating whether men or women between the ages of 15-25 are better ...

    is found by adding up all the numbers then dividing this by how many numbers there are. The median is the middle number when all numbers are in numerical order. The mode is the most common number; the number which appears the most.

  1. Maxi Product

    Below is a table proving my rule Partition Product 0,7 0 0.5,6.5 3.25 1,6 6 1.5,5.5 8.25 2,5 10 2.5,4.5 11.25 3,4 12 3.1,3.9 12.09 3.2,3.8 12.16 3.3,3.7 12.12 3.4,3.6 12.24 3.5,3.5 12.25 3.49,3.51 12.2499 My table proves that my algebraic rule m= is correct.

  2. Stratified sampling and Hypotheses - ...

    20 175 Blue F 165 44.5 3 18 213 Blue F 163 54 5.5 19 214 Blue F 163 84 5 17.5 217 Blue F 168 63.2 7.5 20 156 Blue F 162 55 5 18 157 Blue F 160 55 5 18 158 Blue F 174 70 7 20

  1. Maths open box

    46 10 Strange Frank 1.68 69 11 Cooper James 1.71 46 12 Foley Mick 1.70 47 13 Moore Roger 1.32 35 14 Wok Yack 1.75 50 In this scatter diagram is shown a weak correlation between height and weight For this graphs line of best fit, I found the mean

  2. For this investigation I have four hypotheses, which are: 1) ...

    of section of the population to be sampled. This type of sampling is used in market research and can be bias. Convenience Sampling The final type of sampling I could have used was convenience sampling. This type of sampling is very simple as the most convenient sample is chosen.

  • Over 160,000 pieces
    of student written work
  • Annotated by
    experienced teachers
  • Ideas and feedback to
    improve your own work