• Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

Statistical coursework that uses data from 'Mayfield High School.'

Extracts from this document...

Introduction

Hayley Lloyd-Henry 11R Maths Statistics Coursework

MAYFIELD HIGH SCHOOL

Introduction

I have chosen to do this statistical coursework that uses data from ‘Mayfield High School.’ Although this is a fictitious school, the data is based on a real school. As the data has been collected for me, it is called secondary data.

I believe that this coursework will allow me to illustrate my ability to handle data, use specific techniques and apply higher level statistical maths by being able to use a variety of methods in order to analyse and compare sets of data. During this project I will be examining the relationships between the attributes of the pupils of Mayfield High School. My aim is took produce a line of enquiry which has two or more statistics regarding the pupils which are related to each other.

This table shows how many boys and girls there are in each year group at Mayfield High.

Year Group

Number of Boys

Number of Girls

Total

7

150

150

300

8

145

125

270

9

120

140

260

10

100

100

200

11

84

86

170

The total Number of students at the school is 1200

Data is provided for each pupil in the following categories:

  • Name
  • Age
  • Year Group
  • IQ
  • Weight
  • Height
  • Hair colour
  • Eye colour
  • Shoe size
  • Distance from home to school
  • Usual method if travel to school
  • Number of Brothers or sisters
  • Key stage 2 & 3 results in English, Mathematics and Science

...read more.

Middle

Calculations:

  1. Mean
  2. Mode
  3. Median
  4. Mean & Modal Class for Grouped Continuous Data – This calculates the mean for grouped continuous data.
  5. Interquartile Range - The distance between the upper and lower quartiles. As a measure of variability, it is less sensitive than the standard deviation or range to the possible presence of outliers. It is also used to define the box in a box-and-whisker plot.
  6. Standard Deviation - It is the most commonly used measure of spread.
  7. Normal distribution - Normal distributions are a family of distributions that have the same general shape. They are symmetric with scores more concentrated in the middle than in the tails. Normal distributions are sometimes described as bell shaped.
  8. Spearman’s Rank Correlation Coefficient - The Spearman's Rank Correlation Coefficient is used to discover the strength of a link between two sets of data.
  9. Equation of Line of Best fit – Equation of line that shows underlying spread.

Collecting the Data

In order to find my results, I will need to sort the data and put it into tables. As I am using stratified sampling, I have had to count up the amount of boys and girls in each year and work out my sample size.

...read more.

Conclusion

>

16

9

1.75

63

17

9

1.46

45

18

9

1.5

70

19

9

1.82

66

20

10

1.8

49

21

10

1.6

50

22

10

1.62

52

23

10

1.65

50

24

10

1.77

59

25

11

1.91

82

26

11

1.62

56

27

11

1.74

50

28

11

2

86

 Results

Girls

Year

Height (cm)

Weight (kg)

1

7

1.61

45

2

7

1.61

47

3

7

1.56

43

4

7

1.48

42

5

7

1.5

40

6

7

1.56

53

7

7

1.58

48

8

8

1.72

43

9

8

1.62

53

10

8

1.62

54

11

8

1.6

46

12

8

1.75

45

13

8

1.48

46

14

9

1.57

38

15

9

1.62

54

16

9

1.64

40

17

9

1.6

46

18

9

1.8

60

19

9

1.6

51

20

10

1.52

45

21

10

1.72

56

22

10

1.66

45

23

10

1.73

42

24

11

1.7

50

25

11

1.68

48

26

11

1.52

38

27

11

1.62

48

Organising My Results

Although I have already presented my results into 2 separate tables, one for each gender, the results are not concise enough. In order to fully analyse my results, I will need to put my results into scatter diagrams and histograms etc. Therefore, my results need to be grouped into around 5-8 groups, which are the same for both genders. This is because when I put my results into the scatter diagrams (etc), I will need to compare both genders, thus requiring me to use the same groups for both sexes. Once I have chosen my groups, I will enter the information into the frequency tables and use those for me histograms and scatter diagrams.


...read more.

This student written piece of work is one of many that can be found in our AS and A Level Probability & Statistics section.

Found what you're looking for?

  • Start learning 29% faster today
  • 150,000+ documents available
  • Just £6.99 a month

Not the one? Search for your essay title...
  • Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

See related essaysSee related essays

Related AS and A Level Probability & Statistics essays

  1. Standard addition was used to accurately quantify for quinine in an unknown urine sample ...

    The concentration of analyte in sample is then found by using equation 3. �g cm-� The concentration of the quinine in the unknown urine sample was found to be 121 �g cm-� Discussion The measurements taken during the experimentation may have contained random errors or systematic errors.

  2. I have been given the task of finding what affects the price of a ...

    0-5000 1 1/5000=0.0002 5000-10,000 1 1/5000=0.0002 10,000-20,000 5 5/10,000=0.0005 20,000-40,000 14 14/20,000=0.0007 40,000-70,000 19 19/30,000-0.00063' 70,000-110,000 2 2/40,000=0.00005 Stratified Mileage Group Frequency Frequency Density. 0-5000 1 1/5000=0.0002 5000-10,000 1 1/5000=0.0002 10,000-20,000 5 5/10,000=0.0005 20,000-40,000 14 14/20,000=0.0007 40,000-70,000 19 19/30,000-0.00063' 70,000-110,000 2 2/40,000=0.00005 Mileage Group Frequency Frequency Density 0-5000 1 1/5000=0.0002

  1. Statistics coursework

    This will make comparing the two easier and will mean my conclusion is likely to be more accurate. I have chosen to use a cumulative frequency graph as it gives you a general idea of any trends. It also allows you to see the median IQ for each gender along with inter-quartile range.

  2. Anthropometric Data

    A dependent variable is a variable that dependent on another, the independent variable is said to cause an obvious change in the dependent variable. So I' am going to plot the foot breadth (mm) as a dependent against the foot length (mm)

  1. Statistics. The purpose of this coursework is to investigate the comparative relationships between the ...

    low income). If their lifestyle is abruptly changed, they may consider selling their car for a less/ more expensive model. The car will be passed on during this to people in similar financial lifestyles that the owner had originally. When the vehicle is passed on, essential parts will become to be more reliable.

  2. Teenagers and Computers Data And Statistics Project

    3 1 6 [ (n - 2) 2] 2 12( n-2) 3 8 Total n 3 7. The 10 x 10 x 10 table Number of red faces Number of cubes 0 512 1 384 2 96 3 8 4 0 5 0 6 1000 Total 8.

  1. Investigate the relationships between height and weight

    Anomalies To make sure my data is reliable I will test for anomalies to do this I will use the interquartile range and find out if there are any outliers Year 7 Females Height (Lower Quartile and Upper quartile) Lq 1.03 Uq 1.54 Iqr 0.51 The outliers are 1.03 and

  2. Investigate the relationship between height and weight and how it changes between gender and ...

    spread a collection of data is and is a way of comparing two sets of data in this case which is height and weight. Mean is when a set of data is added together and subtracted by the number of figures there are.

  • Over 160,000 pieces
    of student written work
  • Annotated by
    experienced teachers
  • Ideas and feedback to
    improve your own work