• Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month
  1. 1
  2. 2
  3. 3
  4. 4
  5. 5
  6. 6
  7. 7
  8. 8
  9. 9
  10. 10
  11. 11
  12. 12

Descriptive Statistics 1. Mean, median and mode.

Extracts from this document...


Descriptive Statistics 1. Mean, median and mode.


At this stage I want to emphasize the practical relevance of averaging, using the discussion of the mean to illustrate the use of mathematical notation (that was introduced last week), and warn you about some of the possible pitfalls of relying on the average without looking at the pattern of values from which it was calculated. First, however, I will attempt to answer the good question, “why bother learning statistical notation”, that seems to crop up every year.

Why learn mathematical notation?

PSY107 aims to provide not just a recipe book for doing statistics problems in isolation, but aims to leave you with some skills that have general relevance. These include computer literacy, the ability to explore data, critical thinking, and a degree of independence in tackling statistical issues in Psychology and elsewhere. The routine tools that I and many of my colleagues use to do statistics are not algebra and equations, but (often computerized) graphing and data analysis methods. I think that many simple statistical concepts can be communicated using graphs and plain English. Why, if many psychologists do not spend their time writing μ and σ is it necessary to get to grips with the basics of statistical notation?

This question has several answers. First, mathematical language is logical, rigorous, and compact. Mathematical notation can also be used to represent the precise relationship between different statistical concepts.

...read more.


Definition of the mean

The arithmetic mean is the sum of the values divided by the number of values. This is shown below using mathematical notation. It is best to get to grips with the symbols while the statistical concepts they represent (e.g. mean average) are simple and familiar.

Sample mean vs. population mean

μ, the population mean, is the mean derived from the entire population under study. Population is a word with a somewhat elastic meaning, but generally it is up to you, the experimenter, to define your population. It might be all the people in the UK, all the people who shop at KwikSave, or all the lecturers in the Newcastle University Psychology Department. With large populations, it is often impractical to find μ.


image02.png, the sample mean, is calculated from a representative sample of the population. This is usually done by selecting individuals from the population at random to avoid sampling bias. You get sampling bias when all the members of the population under study do not stand an equal chance of being measured.


If you wanted to estimate the mean height of people in the UK, it would be stupid to do all your measuring in primary schools. This is an extreme example, but more realistically, suppose you wanted to get a representative 1000 people to complete a questionnaire on social attitudes. If you did the survey by telephone, your sample would be biased towards telephone owners. If you called between 9 and 5, your sample would be biased towards people without day jobs.

...read more.


The Median.

The median is the value that divides the distribution of values exactly in half. To find the median, sort or rank the values and find the middle value (if there are is an odd number of values) or else the mean of the central two values (if there is an even number of values). It is possible to estimate the median from histograms. Use the information on number of scores to estimate the position of the middle score.

Fig. 7


Estimate the position of the “middle person” on the income axis using the information on the frequency axis. Here there are 18 lecturers, so the median income is at the estimated position between 9 and 10. This is simpler to estimate from a cumulative frequency polygon.

The median average can be more representative than the mean in skewed distributions (e.g. annual income, or National Lottery winnings). Remember to look at the data when you calculate the median average

Fig. 8


The Mode

The mode is the score or category that has the greatest frequency. The modal average can be used with nominal data. As with all other averages, look at the data when you calculate the mode.

Fig. 9


Is a three dimensional representation sensible?

Mike Cox


Version 1

...read more.

This student written piece of work is one of many that can be found in our AS and A Level Probability & Statistics section.

Found what you're looking for?

  • Start learning 29% faster today
  • 150,000+ documents available
  • Just £6.99 a month

Not the one? Search for your essay title...
  • Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

See related essaysSee related essays

Related AS and A Level Probability & Statistics essays

  1. The mathematical genii apply their Statistical Wizardry to Basketball

    Same arms used each time. * The weather conditions being similar. In the sports hall there should be no significant alteration of the environment. * Each shot being taken one after the other to gain results, which will be under the most similar conditions.

  2. Reaction Times

    (Mid point - Estimated mean) � (x-x) � Frequency x (Mid point - Estimated mean) � f(x-x) � 2.5 0 2.5 - 14 = -11.5 -11.5� = 132.25 0 x 132.25 = 0 7.5 7 7.5 - 14 = -6.5 -6.5� = 42.25 7 x 42.25 = 295.75 15 13 15 - 14 = 1

  1. Investigate a possible relationship between self-esteem and levels of satisfaction in the undergraduate student ...

    By capturing an audience in this opportunist way, one inevitably ends up with an unrepresentative sample. In this case a group of students from a limited no of courses namely Humanities and Behavioural Studies Degree programmes. They were all first years and all but one, female.

  2. How Can Samples Describe Populations?

    This is referred to as the study population. A complete list is made of the study population, this is known as the sampling frame. [2] Finally, the sample is taken from the sampling frame. The selection in the sampling is an important step in preserving the quality, integrity and most

  1. Teenagers and Computers Data And Statistics Project

    The formula for this was n3 - (n-2)3 - 6[(n-2)2 - 8. I then realized there should be a much simpler formula. Once I took another look at the table I realized it would be 12(n-2).This was because there are 12 edges on a cube and as there are -1

  2. Used Cars - What main factor that affects the price of a second hand ...

    4 SMALL 1200 17395 1 4999 5 SMALL 1200 2760 1 7399 6 SMALL 1300 19880 2 4499 7 SMALL 1300 51000 5 3495 8 SMALL 1400 40 0 8799 9 SMALL 1400 3548 0 7999 10 SMALL 1400 12470 2 7199 11 SMALL 1400 9540 1 9299 12 SMALL

  1. Estimating the length of a line and the size of an angle.

    This is because when I have picked out the number I need I could remove it and choose another number which is a quicker method than putting all the numbers in a hat, mixing them up and picking a number out and a random number table, which can be biased.

  2. "The lengths of lines are easier to guess than angles. Also, that year 11's ...

    Then, I am going to draw some percentage error tables. These will show the error of the estimates and if people estimated below or above the actual size or length of the line. I am then going to draw some scatter graphs showing the errors from the percentage error tables.

  • Over 160,000 pieces
    of student written work
  • Annotated by
    experienced teachers
  • Ideas and feedback to
    improve your own work