• Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month
  1. 1
  2. 2
  3. 3
  4. 4
  5. 5
  6. 6
  7. 7
  8. 8
  9. 9
  10. 10
  11. 11

Mathematically Modelling Basketball Shots

Extracts from this document...


Mathematically Modelling Basketball Shots


The manager of a professional basketball team is having a tough decision in choosing which of his two top scorers this season are better at free-throw shots. The final decision will go towards picking the team for Saturday’s Cup Final match.

        On a training session one week before the match the coach decides to “go all out” and bring some mathematical genii in to model a situation where Lee Grimes and Dominic Aspbury, the goalscorers, will shoot at the basketball net.

        The mathematical genii are students from Cambridge and are benefiting from this opportunity in that they will be able to show evidence of coursework for their final exam. Their coursework will be using their abilities to collect data and “test the appropriateness of a probability model” on a real situation whilst the coach’s aim will be to pick the better of the two players for the “big game.”

If the random variables X and Y count the number of independent trials before the event, having a probability p, occurs then X and Y have geometric distributions:

P ( X = r ) = q r – 1 p        where  r = 1,2,3,……

X~G ( p )           and         Y~G ( p )

I will define X         as being the number of shots required before Lee shoots a basket. Therefore, Y is defined as the number of shots required before Dom shoots a basket. I will be attempting to see if X and Y have geometric distributions by taking samples of X and Y.

The populations are the infinite range of shots capable from the two throwers taken in a discrete time period under varied conditions at the same level of skill.

...read more.


Therefore P(no score) = 1 – 0.231884058

                            = 0.768115942

Therefore for Dom: P (Y = 2) = 0.768115942 x  0.231884058

                                 = 0.1781138416

                                 P (Y = 3) = 0.768115942(3-1)  x  0.231884058

                                                 = 0.1368120813

Expected Frequencies will be: (Y = 1) = 0.231884058 x 80

                                          = 18.55072464

                                             (Y = 2) = 0.1781138416 x 80

                                          = 14.24910733

Chi Squared Distribution

The chi-squared distribution can be applied to measure the ‘goodness of fit’ for the geometric models. It will examine the ‘goodness’ of the model by considering the number of possible outcomes of the events and will analyse the validity of the assumptions.

        Theimage10.pngvalue will be expected to be small to suggest that the model fits the real distribution. A large value would suggest that the model is unlikely to be correct so I will use a 10% critical region to test it.

  • If theimage10.pngvalue lies within the critical region then, assuming the model is correct, it would mean that there is less then 10% chance of a result as high as this occurring. We reject the model as a consequence and conclude insufficient sampling etc.
  • Alternatively, if the image10.pngvalue lies outside the critical region, the result is valid and there is a larger possibility of the value being what it is. The model is assumed to be correct and the model is accepted. Conclusion would be to state that the statistical model is appropriate to the situation and the assumptions are correct.

In the tables, the expected and observed frequencies were calculated but how close together are the values? The closer the observed value to the expected value the more accurate the geometric model will be.

The goodness of fit statistic is:

image09.pngimage09.pngimage02.png         where O = Observed Frequency

                                                                                  E = Expected Frequency

To find the best measure of goodness of fit, add up all values for each statistic and compare with the image03.png2 probabilitydistribution tables.


...read more.


A problem with this is time, as it would take a year to go through just one season, therefore it is impractical and illogical. The physical form of the player should also alter throughout the season so a random sample of more than one season would have to be made.

        A much better way is to watch all training sessions and take a general overview of who supplies the most points in miniature matches from free throws. This gives more of a view of consistency than “on the day” performance but during game situations the performer will be thinking more logically. A sample of eighty straight baskets is tedious and will affect performance.


  • Use a longer time period. The performers were rushed to collect their sample size within two hours as a result of school timetabling and so one of them had to rush his last twenty shots.
  • Use the same time period i.e. one performer did it one day and the other completed it the next day. Conditions may have been different and morale, energy etc may be variated for both Dom and Lee
  • Use foot-mats on the floor so that it indicates an exact position for the feet to stand instead of just using the line. This may be an insignificant difference but to improve the coursework it is better than no difference at all.
  • Using the same basketball. Half way through the sample collection the basketball was lost leaving us the trouble of having to use another basketball – maybe of different weight, age etc and possibly affecting the results


  • I would like to calculate confidence intervals for both expected values (X and Y) to determine my degree of confidence in Lee being a better freethrower.
  • I would also like to be able to see if my result E[X] = E[Y] was statistically significant

...read more.

This student written piece of work is one of many that can be found in our AS and A Level Probability & Statistics section.

Found what you're looking for?

  • Start learning 29% faster today
  • 150,000+ documents available
  • Just £6.99 a month

Not the one? Search for your essay title...
  • Join over 1.2 million students every month
  • Accelerate your learning by 29%
  • Unlimited access from just £6.99 per month

See related essaysSee related essays

Related AS and A Level Probability & Statistics essays

  1. The mathematical genii apply their Statistical Wizardry to Basketball

    I can use the result to model Y as Y~G ( p ) . The conditions I will have to use are going to be as similar as possible to gain independent and identical shots. This will involve: * Five practice shots beforehand so that the feel of shooting is apparent - a warm up before starting.

  2. The aim of this investigation was to look at the reliability and validity of ...

    d � = 89 rs = 1- 6 ? d � n(n� - 1) = 1- 6 x 89 18 (18� - 1) = 1- 6 x 89 18 (324 - 1) = 1- 6 x 89 18 x 323 = 1- 534 5814 = 1- 0.0918473 = 0.9081527 The Spearman's coefficient of rank correlation for the relationship between

  1. Statistics coursework

    5 4 13 7 Male 106 5 4 5 14 7 Male 100 4 4 4 12 7 Male 110 5 5 5 15 7 Male 96 4 3 4 11 7 Male 97 3 4 4 11 7 Male 108 4 5 5 14 7 Male 101 4 4

  2. Investigating the Relationship Between the Amount of Money a Football Club Receives and its ...

    Another method I will use is to split the sampled teams into groups according to how much money they receive and find the standard deviation of the league position, goal difference and stadium capacity within each group. This is useful as one would expect that clubs that receive similar amounts

  1. Study of the height/diameter ratio of limpets inhabiting the middle shore region of exposed ...

    They will die out and the limpets with the ideal characteristics will survive. It can be said that natural selection is at work on this shore. Selection pressure, taking the form of environmental conditions, has pushed the distribution of the population (in this case we have a sample)

  2. Design an investigation to see if there is a significant relationship between the number ...

    to plot, on separate line graphs, the relative abundance of different components found in quadrats layered on the lower shore and middle shore. I'll provide a key so that the different species are distinguishable on the graphs. This will enable me to consider the competition whilst interpreting my results, and

  1. Statistics Coursework

    So, basically overall, I think the attendance of the students does affect their learning and exams results. Plan 1. In order for me to investigate this problem, I was given a secondary type of data of the attendance of all the students in Hamilton Community College in the 2003 - 2004 academic years from Year 7s to Year 11s.

  2. "The lengths of lines are easier to guess than angles. Also, that year 11's ...

    This next curve is for the year 9 estimates of line 2. The box plot on this curve actually shows that the lower quartile is below the actual length of the line. This is because people guessed a smaller length than the actual one of the line.

  • Over 160,000 pieces
    of student written work
  • Annotated by
    experienced teachers
  • Ideas and feedback to
    improve your own work