Maths Data handling Corsework

Authors Avatar

Maths Data Handling Coursework

The aim for this piece of coursework is to make 3 hypotheses as a core plan for my investigations, then process, analyse and interpret information from the data I have been provided with from the school shared area. I will do this by using my data handling skills and using computer software such as Microsoft Excel to help me.

         The data I have been provided with contains information about the fitness of Year 7, 8, 9 and 10 pupils. This data consists of information such as bleep test performances in autumn and spring, cross country-Pe house run positions, and whether pupils are involved in rugby or rowing teams. There is also additional information showing what grade pupils are on at their musical instruments as well as a year 10 sports GCSE class data that shows information about pupils and their abilities in many exercises, mostly in circuit training.

This is an example of the data I have used. It is from the Yr 10 data spreadsheet and shows what class a pupil is in, their number, their Pe Bleep test scores in autumn and spring, their position in the Pe house run, the school team they are in and the grade of their musical instrument that they are on. There is also extra information showing why the pupil has not performed one or more of the pieces of information. This information is shown by:

Abs: Absent

DNR: Did Not Run

Inj: Injured

This data will help me in making my three hypotheses as well as help me produce sensible ones. For example, if a pupil is on a school team, he will be fitter than a pupil who is not, because the school team encourages training whereas a pupil not on a school team would be less fit. This example was sensible because it had logic behind it whereas something like - A pupil with a high musical instrument level would do well in the Pe house run - is not sensible and will lose marks.

The hypotheses I will make will be interpreted in different ways because there will be different graphs for the data. For the first hypothesis I will use a scatter graph, plotting points on the graph and looking for strong, weak, positive or negative correlations. For the second hypothesis, I will use a set of box plots, comparing the different pieces of data. Finally for the third hypothesis I will use cumulative frequency to compare two sets of data presented by curves. Then I will analyse and interpret each one carefully, to see whether my hypotheses were correct or not and why.

Plan

Hypothesis 1:

I predict that the better the pupil’s position in the Pe house run, the higher the pupil’s spring bleep test score

I think this will happen because if a pupil does very well in the Pe house run, coming in first place for example, he is most definitely going get a very high bleep test score because he is physically capable. Whereas if someone does poorly in the Pe house run, coming in last place, he is most likely to get a very low bleep test score because he is not able to run for a long period of time.

I am going to investigate this by using Excel in order to sort my data. For this hypothesis, I am going to use the spreadsheets: Yr 7, Yr 8, Yr 9, and Yr 10 data. All 4 spreadsheets have more than 100 pupils in them. First, I will delete any data that is incomplete which I cannot use. Once that is done I will need to use a fair method of getting a total of 40 pupils’ information.

To get this information fairly I will use Stratified Random Sampling. This method will help me take 40 pupils’ information randomly, thus making it fair. I will first need to find out how many students there are in each year and divide each one by the total number of pupils I will use. Then I will get the 4 figures and multiply them by 40, as I am intending to acquire 40 pupil’s information. This will then tell me how many pupils’ information I will need to take from each year group.

Once I have completed my Stratified Random Sampling, I will need to take a certain number of pupils’ information from each year group. To do this fairly, I will need to take random people from each year group, so I am not in any way bias. This should be done because it may make my results incorrect and therefore, my interpretation of the results will also be incorrect. So, in order to randomise the information of each year group I will use Excel.

After I have sorted my data, I will use Excel to make a scatter graph showing my data. This is what I think the scatter graph will look like. A scatter graph is useful because it compares 2 sets of data and helps in seeing whether any correlation exists.

To find a correlation on a scatter graph, I will take the mean of the two sets of data and use them to divide my scatter graph into quadrants. The     is the mean Pe house run position and the    is the mean spring bleep test score.

Join now!

To find a correlation, the top left and bottom right quadrants (1 and 3) have to be compared with the bottom left and the top right quadrants (2 and 4), in terms of the number of points they have all together. I think that I will see that there are more points in quadrants 1 and 3 than in quadrants 2 and 4, making it a negative correlation. If 2 and 4 had the most points it would be a positive correlation. I think that my graph will have a negative correlation because 1st place in the house run is at ...

This is a preview of the whole essay