• Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

Statistical investigation into pupils at Mayfield high school

Extracts from this document...




In this investigation I have set out to explore two hypotheses regarding secondary data supplied to me concerning pupils at Mayfield High School in years 7 to 11.  The secondary data supplied included data on a range of issues including each pupil’s weight, their IQ and the average number of hours each pupil watched TV per week.  My hypotheses below explore specifically these elements from the secondary data supplied to me.

Hypotheses 1 – Correlation between time spent watching TV and weight

My hypothesis is that there will be a connection between the average number of hours spent watching television and the weight of each pupil.  My prediction is that there will be a positive correlation: i.e. that the heavier pupils will be those who spend more hours watching TV.

The reason for my prediction is that I believe that more time spent in front of the television means that there is less time for activity and, therefore, TV watchers are more likely to put on weight.  I also think that TV watchers are more likely to eat snacks and junk food while watching TV, which would also result in weight gain.

In my conclusion below, I consider outside factors which I have not taken into account in the investigation of this hypothesis which may influence my results.  Such factors are: height, gender, age, method of travel.

Hypotheses 2 – Correlation between time spent watching TV and IQ

...read more.























No. in Year


No. of pupils taken to create sample





















I created, from my secondary data, separate charts for each gender and year group and numbered each pupil within that group according to their alphabetical order. To ensure that there is no bias, the actual pupils selected were selected by a random number generator which was a function available in Excel.  The random number generated was always less than or equal to 1.  I, therefore, multipled each number generated with the number of pupils in each gender and year group to ensure that each pupil was capable of being selected.  If the same number in the same group was generated twice, which did happen once, I selected another random number using the random number button on my calculator and, again, multiplying it by the number of pupils in that gender and year group.  

In this way I produced my stratified sample of 50 pupils which I have set out at Appendix 1.  The number column in appendix is the number of the pupil in that year and gender group, assigned through the random number generation process.

Analysis of Stratified Sample

Because both of my hypotheses predict a connection between the two sets of data in each hypothesis, I decided that, for each hypothesis, I should present the data on a scatter graph.

...read more.


It must be remembered that these results have been drawn from a sample only.  More conclusive results are likely to be obtained from a larger collection of data than the 50 in my sample.

I am surprised by my finding of a negative correlation in the TV v Weight comparison.  Possible explanations for such a correlation, if it is genuine (ignoring factors such as dirty data and limited sample), may include the fact that there is a lot of sport on television and it is possible that the more athletic pupils watch it.

However, my investigations into both hypotheses were limited and always unlikely to have proved a link between the two pieces of data in each case, and even less likely, a direct causal relation.  For instance, I did not take into account external factors which could have affected or improved my results.

External factors which I could have taken into account to improve both my investigations include factors like age, height, gender, method of travel to school and programmes watched.  Most of these factors were available to me in data form on the secondary data supplied to me.  I would have to have done further primary investigation (through e.g. questionnaires) into a breakdown of types of programme watched, if I wanted to discover further possible explanations for any correlations found.  A more conclusive investigation is likely to result if this additional material is used and taken into account, which may support a direct causal relationship.

...read more.

This student written piece of work is one of many that can be found in our AS and A Level Probability & Statistics section.

Found what you're looking for?

  • Start learning 29% faster today
  • 150,000+ documents available
  • Just £6.99 a month

Not the one? Search for your essay title...
  • Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

See related essaysSee related essays

Related AS and A Level Probability & Statistics essays

  1. Data Handling - Planning - I intend to investigate the relationship between the number ...

    mean, mode, median and range of the KS2 maths results for males and females. Mean maths results Mean maths result for females = 4.06 Mean maths result for males = 4.66 Mode maths results Mode maths result for females = 4 Mode maths result for males = 5 Median maths

  2. The mathematical genii apply their Statistical Wizardry to Basketball

    The geometric model states that the events must be independent. No distribution could possibly account for the infinite amount of variables/influences that could occur e.g. improving skill as more shots are scored, fatigue etc. The variable would be different in each case.

  1. The aim of this investigation was to look at the reliability and validity of ...

    Ideally a test like this should involve around 25-30 candidates to give us the results we expect, despite this we still had a significant result. The subjects were na�ve so did not have a choice as to whether or not they consented to the test and were not given the right to withdraw.

  2. Design an investigation to see if there is a significant relationship between the number ...

    Only a small number of the species would have possessed this characteristic at one point, but since it has proved to be favourable to the Fucus Vesiculosus species, the algae that had the gene coding for an oily surface thrived, reproduced and consequently meant that algae without this gene could not compete with the seaweed with this adaptation.

  1. Forensic Examination of Drugs by Thin Layer Chromatography.

    Also analytical techniques such as mass spectrometry, infra red spectrometry & 13C NMR spectroscopy could be used before or after separation has been achieved. Materials: The apparatus used in this experiment are listed below. * Filter Paper * Aspirin, Paracetamol & Caffeine samples * Unknown Samples A,B,C &D * Small

  2. Statistics: Survey of Beijing and China during the SARS storm

    In order to see it clearly, I show it as a chart below: I am 95% confidant that the value lie within the range of 99.7037~~122.369. Now we can obtain the normal distribution clearly. In these 61 days (March and April), 121~130 persons died in one day has the highest frequency.

  1. Investigate the relationship between height and weight and how it changes between gender and ...

    With this correlation it is even more likely the hypothesis will be correct so I am pleased with theses results The differences between the height and weight of boys and girls are that boys are heavier and taller then girls in all years.

  2. "The lengths of lines are easier to guess than angles. Also, that year 11's ...

    12.25 6 35 25 8 17 289 4 35 4 8 -4 16 5 40 17.5 17.5 0 0 5 30 17.5 3 14.5 210.25 5 35 17.5 8 9.5 90.25 4.5 40 8.5 14 -5.5 30.25 4.5 45 8.5 20.5 -12 144 5 43 17.5 17.5 0 0 3.5

  • Over 160,000 pieces
    of student written work
  • Annotated by
    experienced teachers
  • Ideas and feedback to
    improve your own work