TY - JOUR
T1 - Statistical methods for analysing discrete and categorical data recorded in performance analysis
AU - Nevill, Alan M.
AU - Atkinson, Greg
AU - Hughes, Mike D.
AU - Cooper, Stephen Mark
PY - 2002/10
Y1 - 2002/10
N2 - In this paper, we identify appropriate statistical methods for analysing categorical differences in discrete variables or 'performance indicators' resulting from performance analysis. The random mechanisms associated with discrete events do not follow a normal distribution; that is, the normal distribution is a continuous not a discrete probability distribution. We propose appropriate statistical methods based on two key discrete probability distributions, the Poisson and binomial distributions. Two approaches are proposed and compared using examples from notational analysis. The first approach is based on the classic chi-square test of significance (both the goodness-of-fit test and the test of independence). The second approach adopts a more contemporary method based on log-linear and logit models fitted using the statistical software GLIM. Provided relatively simple one-way and two-way comparisons in categorical data are required, both of these approaches result in very similar conclusions. However, as soon as more complex models or higher-order comparisons are required, the approach based on log-linear and logit models is shown to be more effective. Indeed, when investigating those factors and categorical differences associated with binomial or binary response variables, such as the proportion of winners when attempting decisive shots in squash or the proportion of goals scored from all shots in association football, logit models become the only realistic method available. By applying log-linear and logit models to discrete events resulting from notational analysis, greater insight into the underlying mechanisms associated with sport performance can be achieved.
AB - In this paper, we identify appropriate statistical methods for analysing categorical differences in discrete variables or 'performance indicators' resulting from performance analysis. The random mechanisms associated with discrete events do not follow a normal distribution; that is, the normal distribution is a continuous not a discrete probability distribution. We propose appropriate statistical methods based on two key discrete probability distributions, the Poisson and binomial distributions. Two approaches are proposed and compared using examples from notational analysis. The first approach is based on the classic chi-square test of significance (both the goodness-of-fit test and the test of independence). The second approach adopts a more contemporary method based on log-linear and logit models fitted using the statistical software GLIM. Provided relatively simple one-way and two-way comparisons in categorical data are required, both of these approaches result in very similar conclusions. However, as soon as more complex models or higher-order comparisons are required, the approach based on log-linear and logit models is shown to be more effective. Indeed, when investigating those factors and categorical differences associated with binomial or binary response variables, such as the proportion of winners when attempting decisive shots in squash or the proportion of goals scored from all shots in association football, logit models become the only realistic method available. By applying log-linear and logit models to discrete events resulting from notational analysis, greater insight into the underlying mechanisms associated with sport performance can be achieved.
KW - Chi-square tests of significance
KW - Discrete events
KW - Log-linear models
KW - Performance indicators
UR - http://www.scopus.com/inward/record.url?scp=0036786724&partnerID=8YFLogxK
U2 - 10.1080/026404102320675666
DO - 10.1080/026404102320675666
M3 - Review article
C2 - 12363298
AN - SCOPUS:0036786724
SN - 0264-0414
VL - 20
SP - 829
EP - 844
JO - Journal of Sports Sciences
JF - Journal of Sports Sciences
IS - 10
ER -