# Carrying out an investigation to research the readability of two articles

Extracts from this document...

Introduction

Carrying out an investigation to research the readability of two articles

Plan

In this investigation I will be comparing the readability of two articles from two different types of media. For example taking one article from a gossip magazine and another from a political newspaper, or taking two articles from the same media, such as two newspapers, that are aimed at different types of people. To compare the readability I could either take a set number of words from the two articles, for example taking 100 words from a 300 word article and a 500 word article. This would however be biased towards the first article as a larger percentage of the words would be take from it, and there would also be no clear indication of which 100 words I should take from the article. Whether I should take the first hundred words, or taken randomly from the middle, this would make the data inaccurate and biased. In view of this I will take a stratified sample of a set percentage from both of my articles, this means that if I were taking 10% stratified from each article then 30 words would be taken from the 300 word article and 50 words from the 500 words article.

Middle

Anything other than what is stated above is to be included in the data count. I will now begin collecting my data for my main investigations.

The two articles I will be comparing are from two different magazines, one a gossip magazine called ‘Reveal’ and the other a music magazine called ‘Kerrang!’. The first article from the ‘Reveal’ magazine has a main title of ‘I Beat Beyonce to No 1’ and has a total of 457 words, the second article from the ‘Kerrang!’ magazine is called ‘The Astonishing Rise Of Guitar Hero’ and has a total of 651 words. There is around a 200 word gap between the two articles, but because I am taking a percentage this shouldn’t make a difference in the final results. I am going to take 20% of the words from each article; this means I will be taking 91 words from the Reveal article and 130 words from the Kerrang! article.

Conclusion

To find the median from the cumulative frequency polygon I need to divide the total cumulative frequency from both articles by 2. To find the Interquartile range I will need to find the lower quartile, which is found by dividing the cumulative frequency total by 4, and the upper quartile is found by dividing the cumulative frequency total by 4 the multiplying it by 3. These calculations will now be plotted on the cumulative frequency polygon.

For the ‘Kerrang!’ article the median and Interquartile range are:

Median – 4.1

Interquartile Range = Upper Quartile – Lower Quartile = 6 – 2.7 = 3.3

For the ‘Reveal’ article the Median and Interquartile Range are:

Median – 3.6

Interquartile Range = Upper Quartile – Lower Quartile = 4.2 - 2.4 = 1.8

I have used the Interquartile Range rather than the range of the data, as the range is not as reliable, this is because it can be affected by very high and very low pieces of data whereas the Interquartile range is only the range between the first and third quartiles. This eliminates any outliers which may affect the data.

I will now put the average results into a table below.

Reveal | Kerrang! | |

Mean | 4.46 (2.d.p) | 4.86 (2.d.p) |

Mode | 3 | 3 |

Median | 3.6 | 4.1 |

The two articles have the same mode; however this does not represent the average properly as it does not use all of the data. The ‘Kerrang!’ article has a higher average of word length than the ‘Reveal’ article.

This student written piece of work is one of many that can be found in our AS and A Level Probability & Statistics section.

## Found what you're looking for?

- Start learning 29% faster today
- 150,000+ documents available
- Just £6.99 a month