This piece of coursework is designed to test the use and interpretation of statistics in relation to used car prices.

MATHS COURSEWORK: USED CAR PRICES

This piece of coursework is designed to test the use and interpretation of statistics.

My investigations will include scatter diagrams and correlation. I will also use cumulative frequency curves to achieve the best mark possible.

During this investigation I will try to out what influences the price of a second hand car.

I have been given a database, which contains information about some used cars. The database includes many different makes of cars, their length, no of doors, air conditioning, engine size, mileage and more. All these features have been included to find out whether or not they affect the price of used cars.

Hypotheses

I predict that the original price of the car will affect the price of the car when second hand. This is because, for example, after one year, a car with an original price of £40,000 would be expected to cost more than a car after one year with an original cost of £10,000.
I predict that the age of a car will also influence the second-hand price of a car. The older the car, the bigger the decrease in value.
I predict that the mileage of a car will make a considerable difference to the price of a car, as the higher the mileage, the less value the car will have.
The number of owners that a car has had may also cause a decrease in value as the condition of the car may vary.
Finally, I predict that the extra features of a car, e.g., air conditioning, central locking, etc. will also boost the price of a car. This is due to the fact that many customers will prefer a car with these features and would probably be prepared to pay slightly extra for them.

I have been handed the records of 100 cars for this piece of coursework. I have noticed that various records are incomplete for unknown reasons. These anomalies may interfere with my predictions and findings; therefore, I feel that it is vital that they are removed from the convenient records in order to provide fair and relevant results.

Overall, I removed 7 anomalies, which left 93 records out of 100. The anomalies that were removed are shown below:

To begin this investigation, I will have to collect a sample size of data. This is because the previous collection of 93 records was too large and I felt it would be more convenient to use a sample size, which is neither too small, nor too large. I decided to use 36 records, which seems a suitable sample size.