• Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

data handling

Extracts from this document...

Introduction

Data Handling Project

Planning

I intend to investigate the relationship between the number of hours of TV watched per week by students and their KS2 maths results. I think the more TV a student watches the less successful they will be.

Hence, I expect a negative correlation i.e. the two sets of data will be inversely proportional.

Firstly, I will retrieve the relevant data i.e. gender, hours of TV watched and maths results from the spreadsheet provided. As there are on average 200 students in each year, years 7-11, almost over 1000 students, it would be difficult to analyse such large data. Therefore, I will pick one of the five year groups randomly and base my investigation on the selected year group.

I will sort the year group into two sub-groups according to their gender. I will then apply the method of systematic sampling to the data. This will make the data more represent able. I have randomly selected Year 9.

As there are 261 students in Year 9 and I intend to have a sample of 30 students I will therefore select every 8th student and randomly eliminate two, thus leaving me with a sample of 30 students.

261

30

= 8.7

261

32 -2 = 30

8

The collected data sample of 30 students is the raw data.

...read more.

Middle

The evidence from the sample suggests males, on average, scored higher in their KS2 maths results than females, for this particular year.

In order to support the above statement I will compare the mean, mode, median and range of the KS2 maths results for males and females.

Mean maths results

Mean maths result for females = 4.06

Mean maths result for males = 4.66

Mode maths results

Mode maths result for females = 4

Mode maths result for males = 5

Median maths results

Median maths result for females = 4

Median maths result for males =5

Range of maths results

Range of maths results for females = 2

Range of maths results for males =2

I have summarised these results in a table:

Maths results

Mean

Mode

Median

Range

Females

4.06

4

4

2

Males

4.66

5

5

2

Stem and leaf diagrams

Year 9 Females

Stem

Leaf        

Frequency

0

1,6

2

1

0,4,4,6,7,7,8

7

2

1,1,2,4,4

5

3

9

1

4

Year 9 Males

Stem

Leaf        

Frequency

0

4,8

2

1

0,0,0.5,2

4

2

0,0,0,0,1,2

6

3

0,0

2

4

2

1

Averages

Hours of TV watched (hrs)

Mean

Modal class interval

Median

Range

Females

18

10-20

17

38

Males

19

20-30

20

38

From the Year 9 sample, the mean, modal class interval and median were higher for males than for females. The difference in values for these measures for males and females for my Year 9 sample was not too big.

The modal class interval shows that on average males watched more hours of TV yet scored higher grades in their maths results. The range for both males and females for the hours of TV watched was the same. This refutes my original hypothesis for my sample of students from Year 9.

...read more.

Conclusion

15 - 6.25

x 100

15

i.e.  

     = 58.3%

15 - 10

x 100

15

whereas                                        =33.3% of females achieved level 4 and above

                above

15 - 14

x 100

15

Only                                               =6.7% of the males achieved a level 6                

Review

I started with a hypothesis stating that the more time spent watching TV the less successful students would be in their KS2 maths results. Though this is a logical line of enquiry my analysis of the selected data refute this hypothesis.

It was difficult to establish any strong correlation from the scatter diagrams. I have a positive correlation from my scatter diagrams however the gradient of the line of best fit is small, indicating low positive correlation. This could be due to my data being secondary and group sample selective and small. Hence, my results maybe slightly biased.

Considering my analysis for my original selected group i.e. Year 9 and my hypothesis being refuted I extended my project and drew a scatter diagram for a random sample of 60, from the whole school.

Apart from a couple of results at the top of the level 5 column, which are also quite dispersed, I think I have a negative correlation for the overall data. The gradient in this case is more distinctively negative than it was for the first set of data. They are also opposite to each other i.e. the gradient for the line of best fit for Year 9 was positive, while the gradient for the overall group is negative, confirming my hypothesis.

- 25 -

...read more.

This student written piece of work is one of many that can be found in our AS and A Level Probability & Statistics section.

Found what you're looking for?

  • Start learning 29% faster today
  • 150,000+ documents available
  • Just £6.99 a month

Not the one? Search for your essay title...
  • Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

See related essaysSee related essays

Related AS and A Level Probability & Statistics essays

  1. Marked by a teacher

    The heights of 16-18 year old young adults varies between males and females. My ...

    5 star(s)

    � = 64.56 � 1.110 � is in the interval [ 63.45 , 65.67 ] 63.45 < x < 65.67 A 90% confidence interval for the height of females aged 16-18 = 64.56 S� = 9.0864 n = 50 ?

  2. Marked by a teacher

    data handling

    3 star(s)

    Scatter Diagrams A scatter diagram tells you how closely two things are related, the term correlation. A Strong Correlation means the two things are closely related to each other. A Weak Correlation means there is very little relationship. The line of best fit is a line that roughly goes through the middle of the points.

  1. Data Handling - Planning - I intend to investigate the relationship between the number ...

    The average number of hours of TV watched per week, being continuous data, will be analysed by recording the results using histograms. Year 9 Females Hours of TV watched p/week Tally Frequency Midpoint 0 ? t ? 7 II 2 3.5 8 ?

  2. I have been given the task of finding what affects the price of a ...

    5000-10,000 2 2/5000=0.0004 10,000-20,000 4 4/10,000=0.0004 20,000-40,000 11 11/20,000=0.00055 40,000-70,000 18 18/30,000=0.0006 70,000-110,000 5 5/40,000=0.000125 Predictions * I predict that the random histogram will have a much more erratic distribution of car mileage while the stratified distribution will be more of bell shape displaying the majority in the mid range with low or no extreme values displayed.

  1. Investigate if there is any correlation between the GDP per capita ($) of a ...

    : ? = 0 (There is no correlation between the two variables in all the countries in the world) : ? > 0 (Positive Correlation) N= 50 I will be doing a one - tail test at the 5% significant level So the critical value = 0.2353 So 0.833872644 >

  2. Fantasy Football - Maths Coursework - Statistics

    and 7+ ratings affecting the number of points more than Clean Sheets and Goals scored in Hypothesis 2. If I were to pick a fantasy football team based on my findings then I would make sure that I picked defenders because they are more consistent at scoring well.

  1. Anthropometric Data

    as a dependent against the foot length (mm) as the independent variable. Knowing that (y) is a dependent of (x), as (x) is an independent foot breadth will be on the (y) axis and foot length will be on the (x) axis. Scatter graph A scatter it shows a relationship between two variables.

  2. Teenagers and Computers Data And Statistics Project

    x 12 = 96 The 3 face corners will always be 8 6. Formula Explanation The first formula that I worked out was the total no of cubes, this was simple as to measure the volume of a cube is to cube the length that you have.

  • Over 160,000 pieces
    of student written work
  • Annotated by
    experienced teachers
  • Ideas and feedback to
    improve your own work