Investigate correlation if any, between the price of a used car and it's mileage.

Authors Avatar

PLAN:

The previous data shown is the original data of used cars. I have decided to investigate correlation if any, between the price of a used car and it’s mileage. I did this because I wanted to know why some used cars cost so much more than others.

I intend to take a sample of the data provided, simply because it would be extremely difficult to take all the data and use it.

I am going to take a sample of 50 used cars, half of the total amount of data because this should enable me to have enough data to recognise any correlation between the price and mileage with the smallest possible sample for ease.

I aim, not only to investigate the correlation of the given data, but also, how this compares with other used car data. This will be done to see whether this data of used cars follows the trend of data of other used cars or whether this whole data is totally anomalous comparatively. If there is only a slight difference between the two data, then it will be easy for me to justify why they are different and what could have affected them.

My HYPOTHESIS is that as the mileage on a used car increases, the price decreases. I think this will happen in both sets of data.

There are many ways in which I can take a sample of the given data. For instance I could use a totally random sample, for example by picking them out of a hat. However this method might not on this occasion give fair results. For instance 50 Fords and Nissans could be chosen totally randomly, or 50 cars under the price of £5000 could be chosen. This statistically would not give me very good results, as it would not solve my problem. It would only tell me whether there is correlation between mileage in Ford and Nissan used cars or cars under the price of £5000.  

Join now!

I could use a systematic sampling method. This would mean I would have to systematically choose cars from the given list of used car data. For instance, every other one or every fourth one. However, as the list of data is in not ordered in any sort of strata at all, this method would also statistically not give me good enough results. For example, because the data is listed randomly, and I used every fourth one, I may miss out many cars and by chance could end up will all the Fords and Nissans again, or with all the ...

This is a preview of the whole essay