Car sales

Ben Mister 11N Car Sales

Car Sales Data Handling Project

Introduction

This is a project on sampling fifty cars from a database of over two hundred cars. The sampling was done by random selection.

I am investigating how the age and mileage of a car affect its used price. I will compare the used prices of 50 randomly selected cars with their ages and mileages.

To carry out this investigation I am using an Excel spreadsheet that contains data on cars of differing ages and mileages, along with the other details shown in the screen below:

Hypothesis: The older the car, and the higher its mileage, the lower the used price will be.

Plan

The data could be analysed in many ways.

I will obtain the information from an Excel database named ‘Car Sales’.
I will investigate the relationship through graphs and charts created in Excel.
The fixed variable that I have chosen is the used price of the cars.
I will investigate what factors affect it.
I will be investigating a range of variables to go with used price. These include:

price new
price used
age
colour
engine
fuel
MPG
mileage
service
owners
length of MOT
road tax
insurance
doors
style
central locking
seats
gears
air conditioning
air bags

I will not be investigating every variable. While I am investigating, I expect to find some anomalies: results or points on the graphs that do not fit the correlation and the line of best fit. If I find any anomalies, I will leave them in, but consider their effect during the analysis and in the evaluation and conclusion of my work. All of my graphs and charts will have a line of best fit, to show the correlation clearly.
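To illustrate what a line of best fit and a correlation actually measure, here is a minimal Python sketch. It is only an illustration of the calculation behind such a line; the graphs in this project are made in Excel. The ages and prices below are made-up example values, not figures from the ‘Car Sales’ database, and the function names are hypothetical.

```python
from math import sqrt

def mean(values):
    """Arithmetic mean of a list of numbers."""
    return sum(values) / len(values)

def best_fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line y = slope*x + intercept."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return slope, intercept

def correlation(xs, ys):
    """Pearson correlation coefficient: close to -1 or 1 means a strong straight-line trend."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)

# Made-up example data (NOT taken from the Car Sales database).
ages = [1, 2, 4, 6, 8, 10]                      # age in years
prices = [9000, 8000, 6500, 4500, 3000, 1500]   # used price in pounds

slope, intercept = best_fit_line(ages, prices)
r = correlation(ages, prices)
print(f"price is roughly {intercept:.0f} + ({slope:.0f} x age), r = {r:.2f}")
```

A negative slope together with a correlation close to -1 would support the prediction that older cars have lower used prices. An anomaly would show up as a point lying far from the fitted line.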

‘I predict that the older the car, and the higher the mileage of the car, the lower the used price will be.’

I am going to collect the required information and make some graphs using the used car database provided. I will first take a random sample from it, then extract the necessary information and put it into charts to help show my results.
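The random sampling step can be sketched in Python as follows. This is only an illustration under stated assumptions: the list of 200 row numbers is a stand-in for the real database (which holds over two hundred cars; the exact count is not assumed here), and the function name pick_sample is hypothetical.

```python
import random

# Stand-in for the database: 200 hypothetical row numbers.
row_numbers = list(range(1, 201))

def pick_sample(rows, size=50, seed=None):
    """Choose `size` different rows at random, so every car has an equal chance."""
    rng = random.Random(seed)  # a fixed seed makes the same sample repeatable
    return sorted(rng.sample(rows, size))

sample_rows = pick_sample(row_numbers, size=50, seed=1)
print(len(sample_rows), "rows chosen, for example:", sample_rows[:5])
```

Because random.sample picks without replacement, no car can appear in the sample twice, which keeps the sample of fifty cars fair.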

Once I had taken my random sample across all brands, I removed some of the outliers, as they would skew the results. I did this by checking that ...