• Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month
  1. 1
  2. 2
  3. 3
  4. 4
  5. 5
  6. 6
  7. 7
  8. 8
  • Level: GCSE
  • Subject: Maths
  • Word count: 2652

Data Handling Project looking at a database based in Excel where there is data from Key Stage 3 and 4 from Mayfield High School

Extracts from this document...


Introduction. This Data Handling Project is looking at a database based in Excel where there is data from Key Stage 3 and 4 from Mayfield High School. This data consists of several columns containing both Quantative and Qualitative Information. Examples of this data are: * Year Group * Name; Surname, Forename 1 and Forename 2 * Age in Months and Years * Month of Birthday * Gender * Hair Colour * Eye Colour * Left/Right Handed * Favourite Colour * Average number of Hours TV Watched per week * SATS Results etc... In this project I am going to make up several Hypothesises that I will use the data from the Data Base to help me prove. However I will not use all of the data, and for each Hypothesis I will Random Sample using the computer 30 entries which fit into certain restrictions applying to that Aim. The Random Sampling method that I am going to use is a computer generated one. The method of doing this is as follows: 1. Filter or sort the necessary data, copy and paste into a new sheet. Add 2 extra columns before this data. In Column 1 leave blank, and in Column 2 type the numbers 1 to X. 2. In the top of Column 2 type =RAND()*X, this makes a number between 1 and X. In the top of Column 1 put SUM in. In Column 1 next to that number put the number 1 and press enter, a new number will appear, put a 1 next to that number etc... ...read more.


[See following sheets] The graph of the Frequency of Hours watched showed me that the most popular amount of hours of watching TV is between 11 and 15 hours, shown on the graph as 24%. From looking at my Data in my Table the mean value is 14 with 4 entries. From these 4 entries of 14 Hrs of TV there are 4 different weights. If I average these weights the average weight comes to [62+42+57+63=ans/4]=56 Kg. To answer the question of "does the heaviest person in my sample watch the most TV?" I am going to average all of the weights for each different hrs of TV Watched and plot this in a bar graph. [Bar Graph on the next page] The results from this table and graph show me that there is no real relationship between the heaviest person and the fact that they watch the most television, as predicted in my Hypothesis. This is shown from the fact that the heaviest person watches only 15 Hrs of TV and the person who watches the most TV weighs only 68Kg. Unfortunately from looking at my Graphs I think that there is once again no real correlation between the size of someone and the amount of TV that they watch. However if I look at my Scatter Diagram I can see that, ignoring the anomaly (in a yellow circle), there does seem to be some weak negative correlation. However from looking at the Scatter Diagram even more I can see that this weak positive correlation isn't coming from the fact that the heaviest person watches the most TV but more like the less TV that is watched means that they are of an average weight. ...read more.


data. Throughout carrying out these hypothesises I have come across a few anomalies - These I have identified in each section, and explained by reasons behind them. Mainly the reasons were that the data was totally im - practable and must have been mistakes in the entering of the data. If I were to repeat each of these hypothesises again I think that I would do a few things differently. These would be: * Use more amounts of data for each hypothesis. This would provide a larger sample and should mean that I will be able to produce a more strong result. E.g. this could change the difference between a positive and a fairly strong positive correlation in a Scatter Diagram of 2 pieces of Quantative data. * Work out the Mean, Mode, Medians for all sets of data within a hypothesis. This will provide more evidence on which to base my conclusion to my hypothesis. Also using a larger sample it may produce more evidential reasons in a simpler form. * Select equal amounts of boys and girls to form my Sample. E.g. 20 or 15 of each. As I experienced choosing unequal amounts of both Boys and Girls can cause a change in the result. * Use a wider range of Interpreting my Data. E.g., using more graphs and diagrams. Although they may just repeat the same information some graphs may show the same results in different and some clearer ways. Also I think that it would also be better to show and perform more calculations within my data. E.g. Converting my data in Percentages. (%) Rachel Butterfield. 10B. Mayfield High Data Handling Coursework. Page 1 of 8 ...read more.

The above preview is unformatted text

This student written piece of work is one of many that can be found in our GCSE Height and Weight of Pupils and other Mayfield High School investigations section.

Found what you're looking for?

  • Start learning 29% faster today
  • 150,000+ documents available
  • Just £6.99 a month

Not the one? Search for your essay title...
  • Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

See related essaysSee related essays

Related GCSE Height and Weight of Pupils and other Mayfield High School investigations essays

  1. mayfield high statistics coursework

    a very weak positive correlation between the two variables that I am investigating for this hypothesis and this means that that the increase in Hours of TV the higher the IQ on average which is rather bemusing as I had expected the opposite using my general knowledge and everyday theory

  2. Math Coursework-Mayfield High Data Handling

    year 10, 15 females from year 8, 15 females in year 10) and then separate all the data into their genders. As a result, I will have 30 male data in year 8 and 30 male data in year 10.

  1. Data Handling Project

    50 people in each group 5 year groups 50 divided by 5 groups equals 10 people in each group I will now give each group a number so it doesn't get confusing.

  2. I am going to find out the year 10 male average student at Weavers ...

    1996 value which means that the weights of the students in 2009 are less spread out than the student weights in 1996. I found 1 outlier in both years.1996 data mean before the outlier was dealt was 61kg but after dealing with the outlier is 59.1kg so it is more consistent than it was.

  1. Data Handling Project

    52 1.32 47 1.6 70 1.7 37 1.41 31 1.72 51 1.52 37 1.72 51 1.57 60 1.75 57 -- MEAN -- -- MEAN -- 1.59 48.8 Now I will construct a scatter graph with the data that I have collected to see whether my hypothesis is worth investigating or not.

  2. Mayfield High School

    110 12 105 1260 110 < x ? 120 5 115 575 Total 30 3040 Mean= 3040/30 = 101.3 Median= 30+1/2 = 31/2 = 15.5 Therefore the median would come between... 1+1=2 2+11=13 <-- The number 15.5 lies in this so the median is...90 < x ? 100 13+12=25 <-- This is too big so it isn't the median.

  1. Maths: Data Handling Coursework

    I am going to use the method of stratified sampling which would allow me to choose my sample fairly without being biased. Stratified sampling means dividing a whole population into subgroups and choosing a sample size that reflects the properties of each group.

  2. Data handling coursework: Mayfield High School

    1.73 48 1.60 45 1.47 47 1.5 40 1.43 33 1.73 53 1.63 60 1.65 59 1.35 29 1.5 49 Height (m) Weight (kg) 1.7 69 1.72 42 1.65 45 1.71 68 1.6 55 1.85 55 1.7 47 1.72 62 1.63 56 1.71 56 1.73 56 1.8 63 1.55 54

  • Over 160,000 pieces
    of student written work
  • Annotated by
    experienced teachers
  • Ideas and feedback to
    improve your own work