Handling data.

Situation

My father enjoys reading both sport and current affairs articles in the Independent newspaper. Although he enjoys both, he prefers to read the sport articles in the morning, as he says they are easier to read. His reason for saying this is that he thinks the words are shorter. I will carry out an investigation to determine whether word lengths are shorter in sports articles than they are in current affairs articles.

Hypothesis

The word length in sports articles is generally shorter than that in current affairs articles.

Sampling

It would be impossible for me to investigate every sport and current affairs article in the Independent due to the size of each section of the paper. Instead of doing this, I will take a sample of data from one article of each genre.

Sampling is taking a small section of data to represent a larger body of information (the population). Two main types of sampling are Stratified and Random.

Stratified Sampling is when the sample is taken in proportion to the size of each stratum. For example, if there were 120 pieces of data in one stratum and 80 in another, the sample would be drawn from them in the ratio 60:40.
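The proportional split can be checked with a short calculation. The sketch below is only an illustration: the function name, the stratum labels and the total sample size of 100 are assumptions chosen so that the shares match the 60:40 ratio.

```python
def stratified_allocation(stratum_sizes, sample_size):
    """Share a sample between strata in proportion to their sizes."""
    total = sum(stratum_sizes.values())
    return {
        name: sample_size * size / total
        for name, size in stratum_sizes.items()
    }


# 120 pieces of data in one stratum and 80 in another.
# A total sample of 100 is an assumed figure for illustration.
strata = {"first stratum": 120, "second stratum": 80}
print(stratified_allocation(strata, 100))
```

With a different total sample size the numbers change, but the ratio between the two strata stays the same.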

Random Sampling is when we select our data at random, so that every piece of data has the same chance of being chosen. The problem with Random Sampling is that it is hard to make a selection by hand truly random, so it is a difficult method to use correctly.
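One way to reduce this problem is to let a computer choose the words. The sketch below is a minimal example, not the method used in this investigation: the function name and the sample sentence are made up, and a computer's random numbers are only pseudo-random, although every word still gets an equal chance of selection.

```python
import random


def random_word_sample(text, sample_size, seed=None):
    """Pick sample_size words from text, each word equally likely."""
    words = text.split()
    # A seed is optional; giving one makes the selection repeatable.
    rng = random.Random(seed)
    return rng.sample(words, sample_size)


# Made-up sentence standing in for a newspaper article.
article = "the home side scored twice in the second half to win the match"
print(random_word_sample(article, 5))
```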


Bias

When dealing with Random Sampling, it is difficult to make certain that our data is truly random. When using a method such as pointing to words on a page, bias becomes a problem. The two sources of bias are:

* Longer words take up more of the area of the page than shorter words, so you are more likely to land on a longer word than a shorter word.

* When you select one word at random, you then tend to aim for a different area of the page, to ...
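The effect of the first point can be estimated with a short calculation. The sketch below assumes that the chance of landing on a word is exactly proportional to its number of letters, which ignores spacing and letter widths. The function names and the example words are made up for illustration.

```python
def mean_length_equal_chance(words):
    """Mean word length when every word is equally likely to be picked."""
    return sum(len(w) for w in words) / len(words)


def mean_length_pointing(words):
    """Expected word length when pointing at the page.

    Assumption: the chance of landing on a word is proportional
    to its length, so each length is weighted by itself.
    """
    lengths = [len(w) for w in words]
    return sum(n * n for n in lengths) / sum(lengths)


# Made-up example: mostly short words and one long word.
words = "the cat sat on the international mat".split()
print(mean_length_equal_chance(words))
print(mean_length_pointing(words))
```

Under this assumption the pointing method gives a larger mean whenever the words are not all the same length, so it would overstate the typical word length in an article.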
