Handling Data: Driving Tests

Introduction: I have been given a data set of 240 items containing information relating to driving tests.

My aim is to investigate which factors influence a successful outcome, using the following fields from the data sheet:

  • Gender of the Driver
  • Number of 1-hour lessons
  • Number of minor mistakes
  • The driving instructor

Most of the fields will be investigated to see whether there is any pattern connected with successful drivers, apart from:

  • The day and time the test was taken

Initial Analysis: The initial analysis of the entire data set shows that there are:

  • 116 male drivers
  • 124 female drivers
  • 60 learners for Instructor A
  • 100 learners for Instructor B
  • 40 learners for Instructor C
  • 40 learners for Instructor D

The mean number of:

  • Minor mistakes is 16.78
  • 1-hour lessons is 23.03
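
A quick consistency check on the tallies above can be sketched in Python. The counts are the ones quoted in the Initial Analysis; the variable names are my own, and the percentage breakdown is an extra illustration rather than part of the original analysis.

```python
# Tallies quoted in the Initial Analysis.
gender_counts = {"Male": 116, "Female": 124}
instructor_counts = {"A": 60, "B": 100, "C": 40, "D": 40}
total_items = 240  # size of the data set stated in the Introduction

# Each breakdown should account for every driver exactly once.
gender_total = sum(gender_counts.values())
instructor_total = sum(instructor_counts.values())

print(gender_total == total_items)      # True: 116 + 124 = 240
print(instructor_total == total_items)  # True: 60 + 100 + 40 + 40 = 240

# Share of the data set taught by each instructor, as a percentage.
instructor_share = {
    name: round(100 * count / total_items, 1)
    for name, count in instructor_counts.items()
}
print(instructor_share)  # {'A': 25.0, 'B': 41.7, 'C': 16.7, 'D': 16.7}
```

Because Instructor B teaches far more learners than C or D, any comparison between instructors would need to allow for the different group sizes.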

Hypothesis 1: In this data set, men on average made fewer minor mistakes than women in their driving test.

Planning: To investigate this hypothesis I will take a random sample of 30 male drivers and 30 female drivers and compare the number of minor mistakes they made in their driving test. To make this comparison I will construct box-and-whisker diagrams.
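
A box-and-whisker diagram is drawn from five values: the minimum, lower quartile, median, upper quartile and maximum. The sketch below shows one way these could be computed in Python. The function name and the two short lists of mistake counts are made up for illustration and are not taken from the real data set, and the quartile convention used here may differ slightly from the one a graphing package applies.

```python
import statistics

def five_number_summary(values):
    """Return (minimum, lower quartile, median, upper quartile, maximum)."""
    ordered = sorted(values)
    # method="inclusive" interpolates between the smallest and largest values;
    # other quartile conventions can give slightly different answers.
    q1, q2, q3 = statistics.quantiles(ordered, n=4, method="inclusive")
    return ordered[0], q1, q2, q3, ordered[-1]

# Made-up minor-mistake counts, NOT taken from the real data set.
example_male = [9, 12, 14, 15, 17, 18, 20, 22, 25]
example_female = [10, 13, 15, 16, 18, 19, 21, 24, 27]

print(five_number_summary(example_male))    # (9, 14.0, 17.0, 20.0, 25)
print(five_number_summary(example_female))  # (10, 15.0, 18.0, 21.0, 27)
```

Comparing the two medians shows which group made fewer mistakes on average, while comparing the interquartile ranges (upper quartile minus lower quartile) shows which group was more consistent.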

 

Sampling: The samples must be random so that the box-and-whisker diagrams give a reliable comparison. Using Microsoft Excel, I will first number the males and females in the data set. I will then assign each driver a random number between 0 and 1, sort the drivers by these numbers, and select the first 60 drivers from each gender.
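
The same sampling idea (give every driver a random number, sort by it, and keep the first few) can be sketched in Python instead of Excel. The function name and the driver labels such as "M1" are hypothetical stand-ins. The group sizes come from the Initial Analysis, and the sample size is left as a parameter, with 30 used here to follow the Planning paragraph.

```python
import random

def random_sample_by_key(drivers, sample_size, seed=None):
    """Attach a random number in [0, 1) to each driver, sort, keep the first few."""
    rng = random.Random(seed)  # a fixed seed makes the sample repeatable
    keyed = [(rng.random(), driver) for driver in drivers]
    keyed.sort(key=lambda pair: pair[0])
    return [driver for _, driver in keyed[:sample_size]]

# Hypothetical labels standing in for the numbered drivers.
males = [f"M{i}" for i in range(1, 117)]    # 116 male drivers
females = [f"F{i}" for i in range(1, 125)]  # 124 female drivers

male_sample = random_sample_by_key(males, 30, seed=1)
female_sample = random_sample_by_key(females, 30, seed=2)
print(len(male_sample), len(female_sample))  # 30 30
```

Sorting by an independent random number gives every driver the same chance of being chosen and guarantees that nobody is picked twice.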

 

Statistics box

Analysis of Hypothesis 1: Using Autograph, I found that the diagrams agreed with my first hypothesis, although the results were very similar for both genders. The statistics box shows just how close the results were, which leads me on to my next hypothesis to help determine why this is.

Hypothesis 2: In this data set, male drivers on average take more 1-hour lessons than female drivers before their driving test.

Planning: For this hypothesis I will keep the same samples obtained for Hypothesis 1, to make this a reliable and fair investigation. I will also use the same type of diagram (box-and-whisker diagrams) to represent how many 1-hour lessons each gender took before their test.

Statistics box

Analysis of Hypothesis 2: In this sample, men on average did take more 1-hour lessons than women. This could explain why the men sampled for Hypothesis 1 made fewer minor mistakes than the women. Having looked at these results alongside those for the first hypothesis, I would like to investigate the relationship between the two.

Hypothesis 3: In this data set, the more 1-hour lessons a driver takes before their test, the fewer minor mistakes they make on average in the driving test.
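
One possible way to measure a relationship like this is the product-moment correlation coefficient, which is close to -1 when one quantity falls steadily as the other rises. The sketch below is only an illustration of that measure and is not necessarily the method used in this investigation. The function name and the pairs of values are made up and are not taken from the real data set.

```python
import math

def pearson_r(xs, ys):
    """Product-moment correlation coefficient for paired values."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when one list is constant")
    return sxy / math.sqrt(sxx * syy)

# Made-up (lessons, minor mistakes) pairs, NOT taken from the real data set.
example_lessons = [10, 15, 20, 25, 30, 35]
example_mistakes = [28, 24, 19, 17, 12, 9]

r = pearson_r(example_lessons, example_mistakes)
print(round(r, 2))  # negative and close to -1 for these made-up values
```

A value near -1 would support the hypothesis, a value near 0 would suggest no linear relationship, and a positive value would contradict it.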

...