Obtain information to make a report on family size and the structure of families of particular size (number of boys/girls).

Authors Avatar

                                                                14th April 2001.

Probability Models For Data.

Coursework Title

Obtain information to make a report on family size and the structure of families of particular size (number of boys/girls).

To carry out this investigation I carried out a survey of 450 families.  The survey was taken in my school by randomly selecting 450 individuals and asking them how many children their parents have.   I did included half brothers/sisters, but decided not to include step brothers/sisters as I felt this may produce some anomalies to my investigation and also these people are not genetically related.

I have found two separate statistics for the probability of male and female births in the Probability Models For Data textbook.  One states that they are both equally likely and therefore there is a 0.5 chance of each birth being either male or female.  The other statistic I found in the textbook stated that the probability of a male birth was more likely than a female birth, it states that the probability of a male birth is 0.513 and the probability of a female birth 0.487.  These probabilities were calculated from a census taken in 1991.

Once I had completed the survey I organised the data into tables of increasing family size.  The data can de seen on the following pages. I decided that I would analyse the number of male births.  I chose to analyse the data using the binomial model.  I chose this model for the following reasons

  • Outcomes are independent of each other, they are said to be mutually exclusive.
  • A random variable, X is associated with the number of male births.

The first thing I did was work out the probability of a male birth in each of the separate family sizes.  To do this I totalled up the number of boys in the family size and then divided this figure by the total number of children in the family size.  Once these probabilities had been calculated I could then use the binomial model to work out the probability of X number of male births.  The following formula is used.

        n  px  (1-p)n-x

        x

Where: n is family size

         x is  the number of male births

and         p is the probability of a male birth.

        Where 0 ≤ p ≤ 1

  To work out the expected frequency of a male birth in the family size I then multiplied my answers by the number of families of the size I was calculating.  All this is shown on following pages.  Once this data has can be calculated I can then use the chi-squared test to see if the model fits the data.  The test is a measure to see how well the model fits the model.

The formula for the test is:

X2= ∑ (O-E)²

           E

Where O = Observed Frequency (The frequency from the data collected)

        E = Expected Frequency (Calculated from the probabilities)

The smaller the value of  X² the better the model fits the data.

There is a table of values for the X² statistic.  It gives values that denote a specific percentage for a certain number of degrees of freedom.  For example: A set of data with two degrees of freedom gives an X² value of 9.34, in the table you look at the nearest lower value if the exact value is not in the table, here it is 9.21 at a probability of 1%.  This means that the outcome will be 9.34 in less than 1% of cases, this suggests that the model does not fit the data.

Now that all the necessary background information has been stated the following pages show the working out of all the necessary data.

Using The Data Obtained In Survey

Family Of One

Number of families of 0ne = 49

Total number of children = 49

Number of boys = 24.

Probability of a male birth = 24/49 = 0.4898

Therefore the probability of a female birth is 1-0.4898 = 0.5102

P(X=0) = 1  (0.4898)0 (0.5102)1 = 0.5102

            0

P(X=1) = 1  (0.4898)1 (0.5102)0 = 0.4898

            1

Therefore the expected frequencies are: 0.5102 x 49 =25

                                         And 0.4898 x 49 = 24

Family Of Two

Number of families of two =141

Total number of children = 282

Number of boys = 144

Probability of a male birth = 144/282 = 0.5106

Therefore probability of female birth = 1-0.5106 = 0.489

P(X=0) = 2  (0.5106)0 (0.489)2 = 0.2395

            0

P(X=1) = 2  (0.5106)1 (0.489)1 = 0.4998

            1

P(X=2) = 2  (0.5106)2 (0.489)0 = 0.2608

            2

Therefore the expected frequencies are: 0.2395 x 141= 33.77

                                            0.4998 x 141= 70.47

                                            0.2608 x 141= 36.77

Family Of Three

Number of families of three =150

Total number of children = 450

Number of boys = 219

Probability of male birth = 219/450 = 0.4867

Therefore probability of female birth = 1-0.4867 =0.5133

P(X=0) = 3 (0.4867)0 (0.5133)3 = 0.1353

            0

P(X=1) = 3 (0.4867)1 (0.5133)² = 0.3847

            1

P(X=2) = 3  (0.4867)² (0.5133)¹ = 0.3648

            2

P(X=3) = 3  (0.4867)³ (0.5133)0 = 0.1153

            3

Therefore the expected frequencies are: 0.1353 x 150 = 20.29

                                            0.3847 x 150 = 57.71

                                            0.3648 x 150 = 54.71

                                            0.1153 x 150 = 17.29

Family Of Four

Number of families = 86

Total number of children = 344

Number of boys = 151

Probability of male birth = 151/344 = 0.4399

Therefore the probability of a female birth = 1-0.4399 = 0.5610

P(X=0) = 4  (0.4399)0 (0.5610)4 = 0.0991

            0

P(X=1) = 4  (0.4399)¹ (0.5610)³ = 0.3101

            1

P(X=2) = 4  (0.4399)² (0.5610)² = 0.3639

            2

P(X=3) = 4  (0.4399)³ (0.5610)¹ = 0.1898

            3

P(X=4) = 4  (0.4399)4 (0.5610)0 = 0.0371

            4

Therefore the expected frequencies are: 0.0991 x 86 = 8.5

                                            0.3101 x 86 = 26.7

                                            0.3639 x 86 = 31.596

                                            0.1898 x 86 = 16.32

                                            0.0371 x 86 = 3

Family Of Five

Number of families = 15

Total number of children = 75

Number of boys = 43

Probability of male birth = 43/75 = 0.5733

Therefore probability of female birth = 1-0.5733 = 0.4267

Join now!

P(X=0) = 5  (0.5733)0 (0.4267)5 = 0.0141

            0

P(X=1) = 5  (0.5733)¹ (0.4267)4 = 0.0951

            1

P(X=2) = 5  (0.5733)² (0.4627)³ = 0.2553

            2

P(X=3) = 5  (0.5733)³ (0.4627)² = 0.3431

            3

P(X=4) =5  (0.5733)4 (0.4627)1= 0.2305

           4

 

P(X=5) = 5 (0.5733)5 (0.4627)0 = 0.0619

            5

Therefore the expected frequencies are: 0.0141 x 15 = 0.212

                                            0.0951 x 15 = 1.42

                                            0.2553 x 15 = 3.83

                                            0.3431 x 15 = 5.15

                                            0.2305 x 15 = ...

This is a preview of the whole essay