Preprint
Article

This version is not peer-reviewed.

Data-Driven Reduction of External Load Variables in Indoor Team Sports Using Local Positioning System

Submitted:

10 June 2026

Posted:

10 June 2026

You are already at the latest version

Abstract
Objectives: Local positioning systems (LPS) used in indoor team sports generate a large number of external load variables, often exceeding practical monitoring capacity. The redundancy and overlap among these variables make it difficult to identify the most informative metrics for performance analysis and load management. This study aimed to reduce the dimensionality of external load variables derived from LPS data and to identify data-driven external-load observation profiles using principal component analysis and clustering techniques. Methods: A total of 188 observations from indoor team sports (basketball, handball, and futsal) were analyzed. Continuous external load variables were standardized and subjected to principal component analysis (PCA), with component retention based on a ≥90% cumulative explained variance threshold. K-means clustering was applied in both the full standardized feature space and the PCA-reduced space. The optimal number of clusters was determined using silhouette analysis and the elbow method. Agreement between clustering solutions was assessed using Adjusted Rand Index (ARI) and Normalized Mutual Information (NMI). Cluster characteristics were further examined using descriptive statistics and variable separation analysis. Results: The first two principal components explained 53.7% of the total variance, representing high-intensity external load and neuromuscular load dimensions, while 12 components were required to exceed 90% cumulative explained variance. Clustering analysis consistently identified three moderately separated clusters in both the full and PCA-reduced spaces. The PCA-based solution demonstrated improved separation (silhouette = 0.362) compared to the full-space solution (silhouette = 0.319). Agreement between clustering approaches was high (ARI = 0.981; NMI = 0.971), indicating that dimensionality reduction largely preserved the main clustering structure within the analyzed dataset. The most discriminative variables included jump load, acceleration load, metabolic power, and anaerobic activity distance. Conclusions: A large set of external load variables can be reduced into interpretable latent dimensions that supported exploratory external-load profile identification. The combination of PCA and clustering provides an exploratory, structure-preserving, and interpretable framework for simplifying load monitoring in indoor team sports, supporting the selection of key performance indicators and more efficient training load management.
Keywords: 
;  ;  ;  ;  ;  
Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.
Prerpints.org logo

Preprints.org is a free preprint server supported by MDPI in Basel, Switzerland.

Subscribe

Disclaimer

Terms of Use

Privacy Policy

Privacy Settings

© 2026 MDPI (Basel, Switzerland) unless otherwise stated