The aim of this project is to investigate which factors influence the cost of second-hand cars. The makes we looked into were Ford, Peugeot, Renault and Vauxhall.

Write Up

To aid our research, we were given a bank of secondary data on 199 cars. For each car we were given nine variables: colour, engine size, petrol/diesel, year of manufacture, mileage, cost, preliminary cost, make and model. We later had to add an age column for each car.

We were asked which features make the most difference to the cost. My predictions, ranked from most to least important, are as follows:

FACTOR                  IMPORTANCE               RANGE

  1. Age                most important factor    1yr-16yrs
  2. Mileage                                     1,200-150,000
  3. Cost                                        375-132,500
  4. Petrol/Diesel                               Petrol/Diesel
  5. Engine size                                 1,000-2,500
  6. Year                                        1989-2002
  7. Model                                       106-Vectra
  8. Make                                        Ford-Vauxhall
  9. Preliminary Cost                            6,000-20,020
  10. Colour            least important factor   Black-Yellow

Looking through the data, I can see that it is not perfect. Many fields are missing, and I had to remove one rogue data point, a Renault Laguna: from my general knowledge of cars, I knew its price was too high to be right.

I will draw scatter graphs, histograms and cumulative frequency curves (for cost compared with age across the whole population, followed by the individual makes) to try to identify any correlations (patterns) in the cost distribution.
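As a rough illustration of what goes into a cumulative frequency curve, the sketch below counts how many costs fall at or below each class boundary. The function name `cumulative_frequency`, the class boundaries and the cost values are all assumptions made up for this example; none of them come from the data bank.

```python
def cumulative_frequency(values, upper_bounds):
    """Return (upper bound, running count) pairs.

    Each count is the number of values less than or equal to that bound.
    """
    return [(bound, sum(1 for v in values if v <= bound))
            for bound in upper_bounds]


# Invented costs in pounds, for illustration only (not from the data bank).
example_costs = [500, 1500, 2500, 2500, 4000]
example_bounds = [1000, 2000, 3000, 4000]

for bound, count in cumulative_frequency(example_costs, example_bounds):
    print(f"cost <= {bound}: {count} cars")
```

Plotting each upper bound against its running count and joining the points gives the curve, from which the median and quartiles can be read off.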

What I think will happen is:

As age increases, the cost will decrease.

I expect this because there is a very strong correlation between these factors. I will look at one model as an example because, if I look at a group of similar cars, I can be confident that any differences in cost I observe are because of age and not anything else. For this model, I drew a scatter graph of cost against age.

For this scatter graph, I put Age on the x-axis and Cost on the y-axis, as Cost depends on Age. This makes Cost the dependent variable. I also included a line of best fit and an R² measure of correlation, where R² = 1 indicates a perfect correlation.
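The line of best fit and the R² value described above can be worked out by hand with the usual least-squares formulas. The sketch below is one way to do this; the function name `best_fit` and the ages and costs are invented for illustration and are not taken from the data bank or from whatever software drew the original graph.

```python
def best_fit(xs, ys):
    """Least-squares line y = slope * x + intercept, plus R squared."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of squares and cross-products about the means.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r = sxy / (sxx * syy) ** 0.5  # correlation coefficient R
    return slope, intercept, r * r


# Invented ages (years) and costs (pounds), for illustration only.
ages = [1, 2, 4, 6, 8, 10]
costs = [9000, 8200, 6500, 5000, 3400, 2100]

slope, intercept, r_squared = best_fit(ages, costs)
print(f"cost = {slope:.1f} * age + {intercept:.1f}, R^2 = {r_squared:.3f}")
```

A negative slope means cost falls as age rises, and an R² close to 1 means the points lie close to the line.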

On examining this graph, I can see a very strong negative correlation between cost and age. Based on these findings, I can make my first hypothesis:

There is a fairly strong negative correlation between Cost and Age.

2nd Hypothesis

I am now going to investigate the variable that I believe is the next most important in determining the second-hand cost: Mileage. I think that, as with Age, there may be a negative correlation between Mileage and Cost. As before, I will draw scatter graphs for each make in my sample, this time for Mileage against Cost.

These estimates are only moderately reliable as the correlation between cost and age is only moderately strong. Based on this analysis of my sample (which was a reasonably good representation), I believe I have evidence to support the first hypothesis.

There are clearly other factors influencing the cost, so I will now move on to mileage. Firstly, I will draw box plots for mileage. From these I can see that Ford cars have the lowest median mileage. This seems strange, as they were the oldest on average.

The Vauxhalls' mileages are the most spread out, which fits with their ages also being spread out. I will now draw a scatter graph for the whole sample and for each make. I believe that, on the whole, these broadly confirm my hypothesis. The whole sample shows a weak to moderate negative correlation.
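The medians and spreads read from the box plots above come from a five-number summary of each make's mileages. The sketch below shows one way to compute it, assuming the common classroom convention that the quartiles are the medians of the lower and upper halves (other conventions give slightly different quartiles). The function names and the mileages are invented for illustration and are not from the data bank.

```python
def median(values):
    """Middle value of the data (mean of the two middle values if even)."""
    s = sorted(values)
    n = len(s)
    mid = n // 2
    return s[mid] if n % 2 else (s[mid - 1] + s[mid]) / 2


def five_number_summary(values):
    """Minimum, lower quartile, median, upper quartile and maximum."""
    s = sorted(values)
    n = len(s)
    half = n // 2
    lower = s[:half]
    # For an odd number of values, leave the median out of both halves.
    upper = s[half + 1:] if n % 2 else s[half:]
    return {
        "min": s[0],
        "q1": median(lower),
        "median": median(s),
        "q3": median(upper),
        "max": s[-1],
    }


# Invented mileages for one make, for illustration only.
example_mileages = [12000, 25000, 31000, 40000, 47000, 60000, 88000]
summary = five_number_summary(example_mileages)
print(summary)
print("interquartile range:", summary["q3"] - summary["q1"])
```

Comparing the interquartile ranges of two makes gives a number to back up a statement that one make is more spread out than another.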

It seems to me that I was correct: my initial prediction that age is the most important variable, but that mileage is also important, was true. The next thing I am going to do is mark the mileage and engine size for each point on the cost against age scatter graphs for each make, to see if this gives me any insight into the influence of mileage or engine size.

Having looked at these, I don’t see a very clear pattern. On average, cars with a lower mileage have higher prices but, as always, there are exceptions.
