Analysis of Hypothesis 2: In this sample, men on average did take more 1hour lessons than the females did. This could explain why the male samples in hypothesis 1 had a higher success rate than the females. In conclusion of looking at these results and the 1st hypothesis' I would like to investigate the relationship between the two.
Hypothesis 3: In this data set the more 1hour lessons taken by a driver previous to their test is on average to make less minor mistakes in the driving test.
Planning: For this Data set I shall take a sample of 30 drivers from each gender and will plot scatter-graphs of Number of 1hour lessons against the number of minor mistakes and will see if there is a pattern or correlation to give me a result for my hypothesis.
Statistics box
Analysis of Hypothesis 3: From my scatter graphs I can see that there is a negative correlation of each graph. The correlation coefficient (negative in both graphs) is not as strong in the female samples, which would represent that the males with more 1hour lessons would benefit more than the females with more 1hour lessons. Now that the relationships of both influences from hypothesis 1 + 2 have been investigated I would like to decide what the rest of the factors do to contribute to the success rate.
Hypothesis 4: Do certain driving instructors produce more successful drivers than others do?
Planning: For this hypothesis, I will use stratified samples of learners from every instructor and put each in box and whisker diagrams to compare them and find which instructor has the greatest amount of successful learners. I will also use these Samples to make Scatter Graphs, to see the results in an easier manner.
Stratified Samples:
Instructor A- 60 Learners = (60/240)120 = 30
Instructor B- 100 Learners = (100/240)120= 50
Instructor C- 40 Learners = (40/240)120 = 20
Instructor D- 40 Learners = (40/240)120 = 20
Statistics box
Statistics box
Analysis of Hypothesis 4: From comparing the Box and Whisker Diagrams and their statistics, I can see that Instructor A has the most successful amount of drivers whilst Instructor D seems to have the worst success rate but as Box and Whisker Diagrams can produce less accurate results, I decided to further this Hypothesis by using scatter graphs. Funny enough the results collected from my scatter graphs were different from the Box and Whisker Diagrams, showing that Instructor C had the highest success rate and B had the lowest success rate. I know this as the gradient was much steeper and the correlation coefficient was more negative than any other of the scatter graphs. What may have influenced the box and whisker diagrams to have different results than my scatter graphs may have been that each sample was of different size. Instructors A had 30 drivers and B had 50 drivers whilst Instructor C + D had only 20 drivers meaning these instructors had less chance of either failing drivers than Instructor’s A + B would have.
Hypothesis 5: In this Data Set, some Instructors do better/worse with the other Gender.
Planning: This shall be my final Hypothesis, so I intend to make a reliable investigation- by taking a sample of 15 Males and 15 Females from each Instructor in the field of “minor mistakes” and creating Box and Whisker Diagrams with these samples. Once this has been done I will use the samples again and create Scatter Graphs for each. Examining the Number of Lessons each driver took against the number of Mistakes they made for each instructor and writing an analysis of each below them.
Statistics box: Female Samples
Female Samples
Statistics box: Male Samples
Male Samples
Female Statistics box
Female Samples from each Instructor
The correlation coefficient shows a very weak negative
Correlation. The scatter graph represents barely any relationship between the number of lessons and number of mistakes.
The correlation coefficient shows a weak negative correlation. This scatter graph suggests that there is not much of a pattern between the results.
The correlation coefficient shows a strong negative correlation between the results. From this scatter graph I can see that the more lessons a driver take the less minor mistakes they make during their test.
In this scatter graph the correlation coefficient is a strong negative correlation. Showing us that the more lessons a driver takes before their test the better the chance of success.
Male Statistics box
Male Samples from Each Instructor
This scatter graph has a very weak negative correlation coefficient. This means I won’t be able to say whether the number of lessons affects the number of minor mistakes made. This is also the same for the female drivers.
The results of this scatter graph are that there is a Strong negative correlation between the number of lessons and the number of mistakes suggesting the more lessons the higher the success rate. These results differ from the Female drivers who had a weak negative correlation.
Instructor C has a scatter graph with a strong negative correlation which suggests that this instructor is good at teaching female and male drivers as both had a strong negative correlation. This is my final scatter graph and it is from the male samples. Here we can see a moderate negative correlation which is almost the same as the female sample of Instructor D which had a strong negative correlation.
Analysis of Hypothesis 5: The results I gathered from the Box and whisker Diagrams in the female samples were that Instructors A + C had better results with a smaller range of results and Instructors B + D were represented as the worst Instructors with a much wider range of results. For the Male Samples Instructor A had quite clearly the best results with a small range of results whilst Instructors C + D had the worst C having a very wide range of results showing to me that you would have a 50/50 chance of success with this Instructor. When I review all the Instructors together and both Genders I have decided Instructor A has the best success rate in both genders showing that it doesn’t matter whether you are male or female you will more likely pass with this Instructor than any other. From my review of Instructor C he/she seems to work better with Females than Males and Instructor D seems to have a high rate of failing drivers this would suggest you won’t have the best chance of succeeding with this Instructor. Finally Instructor B seems to be quite average when it comes to the genders and it appears not to affect whether or not you would succeed.
From analysis of both the scatter graphs and box and whisker diagrams I have to come to the conclusion that both show a different side to the story and can represent the results in a different manner influencing what I think of each Instructor. This has left me to decide that leaving the analysis of both the box and whisker diagrams separate the safest option as so my conclusion of the Instructors is not completely non existent and that is why my analysis of the scatter graphs is below them and the box and whisker diagrams analysis above in the analysis section.
Probability: In this section I will be counting the number of males and females that passed there test and then see how many passed or failed there test with each instructor. Then whenever I calculate how many males and females have passed I can take that number and use it and the total number who took the test and work out the probability in which males and females have of passing with each instructor as a whole and then I can find the probabilities that males and females will pass with each instructor. So here as follows are the probabilities of Drivers Passing there test with each Instructor from both Genders.
Male Probability of Passing their Driving Test
Female Probability of Passing their Driving Test
Over all Chance of Passing their Driving Test