Data Mining Paths for Standard Weekly Training Load in Sub-Elite Young Football Players: A Machine Learning Approach

José E. Teixeira, Samuel Encarnação, Luís Branquinho, Ryland Morgans, Pedro Afonso, João Rocha, Francisco Graça, Tiago M. Barbosa, António M. Monteiro, Ricardo Ferraz, Pedro Forte

Research output: Contribution to journalArticlepeer-review

2 Citations (Scopus)

Abstract

The aim of this study was to test a machine learning (ML) model to predict high-intensity actions and body impacts during youth football training. Sixty under-15, -17, and -19 sub-elite Portuguese football players were monitored over a 6-week period. External training load data were collected from the target variables of accelerations (ACCs), decelerations (DECs), and dynamic stress load (DSL) using an 18 Hz global positioning system (GPS). Additionally, we monitored the perceived exertion and biological characteristics using total quality recovery (TQR), rating of perceived exertion (RPE), session RPE (sRPE), chronological age, maturation offset (MO), and age at peak height velocity (APHV). The ML model was computed by a feature selection process with a linear regression forecast and bootstrap method. The predictive analysis revealed that the players' MO demonstrated varying degrees of effectiveness in predicting their DEC and ACC across different ranges of IQR. After predictive analysis, the following performance values were observed: DEC (x¯ predicted = 41, β = 3.24, intercept = 37.0), lower IQR (IQR predicted = 36.6, β = 3.24, intercept = 37.0), and upper IQR (IQR predicted = 46 decelerations, β = 3.24, intercept = 37.0). The player's MO also demonstrated the ability to predict their upper IQR (IQR predicted = 51, β = 3.8, intercept = 40.62), lower IQR (IQR predicted = 40, β = 3.8, intercept = 40.62), and ACC (x¯ predicted = 46 accelerations, β = 3.8, intercept = 40.62). The ML model showed poor performance in predicting the players' ACC and DEC using MO (MSE = 2.47-4.76; RMSE = 1.57-2.18: R 2 = -0.78-0.02). Maturational concerns are prevalent in football performance and should be regularly checked, as the current ML model treated MO as the sole variable for ACC, DEC, and DSL. Applying ML models to assess automated tracking data can be an effective strategy, particularly in the context of forecasting peak ACC, DEC, and bodily effects in sub-elite youth football training.

Original languageEnglish
Article number114
Pages (from-to)114
Number of pages1
JournalJournal of Functional Morphology and Kinesiology
Volume9
Issue number3
Early online date28 Jun 2024
DOIs
Publication statusPublished - 28 Jun 2024

Keywords

  • artificial intelligence (AI)
  • big data
  • maturation
  • periodization
  • youth

Cite this