Overcoming the problem of multicollinearity in sports performance data: A novel application of partial least squares correlation analysis.


Journal

PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081

Informations de publication

Date de publication:
2019
Historique:
received: 18 06 2018
accepted: 22 01 2019
entrez: 15 2 2019
pubmed: 15 2 2019
medline: 7 11 2019
Statut: epublish

Résumé

Professional sporting organisations invest considerable resources collecting and analysing data in order to better understand the factors that influence performance. Recent advances in non-invasive technologies, such as global positioning systems (GPS), mean that large volumes of data are now readily available to coaches and sport scientists. However analysing such data can be challenging, particularly when sample sizes are small and data sets contain multiple highly correlated variables, as is often the case in a sporting context. Multicollinearity in particular, if not treated appropriately, can be problematic and might lead to erroneous conclusions. In this paper we present a novel 'leave one variable out' (LOVO) partial least squares correlation analysis (PLSCA) methodology, designed to overcome the problem of multicollinearity, and show how this can be used to identify the training load (TL) variables that influence most 'end fitness' in young rugby league players. The accumulated TL of sixteen male professional youth rugby league players (17.7 ± 0.9 years) was quantified via GPS, a micro-electrical-mechanical-system (MEMS), and players' session-rating-of-perceived-exertion (sRPE) over a 6-week pre-season training period. Immediately prior to and following this training period, participants undertook a 30-15 intermittent fitness test (30-15IFT), which was used to determine a players 'starting fitness' and 'end fitness'. In total twelve TL variables were collected, and these along with 'starting fitness' as a covariate were regressed against 'end fitness'. However, considerable multicollinearity in the data (VIF >1000 for nine variables) meant that the multiple linear regression (MLR) process was unstable and so we developed a novel LOVO PLSCA adaptation to quantify the relative importance of the predictor variables and thus minimise multicollinearity issues. As such, the LOVO PLSCA was used as a tool to inform and refine the MLR process. The LOVO PLSCA identified the distance accumulated at very-high speed (>7 m·s-1) as being the most important TL variable to influence improvement in player fitness, with this variable causing the largest decrease in singular value inertia (5.93). When included in a refined linear regression model, this variable, along with 'starting fitness' as a covariate, explained 73% of the variance in v30-15IFT 'end fitness' (p<0.001) and eliminated completely any multicollinearity issues. The LOVO PLSCA technique appears to be a useful tool for evaluating the relative importance of predictor variables in data sets that exhibit considerable multicollinearity. When used as a filtering tool, LOVO PLSCA produced a MLR model that demonstrated a significant relationship between 'end fitness' and the predictor variable 'accumulated distance at very-high speed' when 'starting fitness' was included as a covariate. As such, LOVO PLSCA may be a useful tool for sport scientists and coaches seeking to analyse data sets obtained using GPS and MEMS technologies.

Identifiants

pubmed: 30763328
doi: 10.1371/journal.pone.0211776
pii: PONE-D-18-17720
pmc: PMC6375576
doi:

Types de publication

Clinical Trial Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e0211776

Déclaration de conflit d'intérêts

Dan Weaving, Sarah Whitehead and Kevin Till are affiliated to Leeds Rhinos Rugby League club, Ben Jones and Kevin Till to Yorkshire Carnegie Rugby Union club, Ben Jones to The Rugby Football League and Matt Ireton to Warrington Wolves Rugby League club. There are no patents, products in development or marketed products to declare. This does not alter our adherence to all the PLOS ONE policies on sharing data and materials, as detailed online in the guide for authors.

Références

Int J Sports Physiol Perform. 2017 Sep;12(8):1059-1068
pubmed: 28051343
PLoS One. 2016 May 25;11(5):e0155047
pubmed: 27224653
Front Physiol. 2017 Dec 12;8:1024
pubmed: 29311959
Int J Sports Physiol Perform. 2014 Nov;9(6):905-12
pubmed: 24589469
PLoS One. 2016 May 02;11(5):e0153960
pubmed: 27135831
PLoS One. 2014 May 21;9(5):e97638
pubmed: 24847797
Sports Med. 2018 Mar;48(3):641-658
pubmed: 29288436
Methods Mol Biol. 2013;930:549-79
pubmed: 23086857
Neuroimage. 2011 May 15;56(2):455-75
pubmed: 20656037
Int J Sports Physiol Perform. 2017 Apr;12(Suppl 2):S218-S226
pubmed: 27736244
J Strength Cond Res. 2001 Feb;15(1):109-15
pubmed: 11708692
Annu Rev Psychol. 2013;64:499-525
pubmed: 22804773
Int J Sports Physiol Perform. 2017 Apr;12(Suppl 2):S255-S262
pubmed: 27834559
Am J Physiol. 1985 Jul;249(1 Pt 2):R1-12
pubmed: 4014489
Int J Sports Physiol Perform. 2016 May;11(4):515-21
pubmed: 26355239
J Appl Physiol (1985). 2001 May;90(5):1770-6
pubmed: 11299267
Br J Sports Med. 2016 Feb;50(4):231-6
pubmed: 26511006
Neuroimage Clin. 2013 Apr 10;2:534-541
pubmed: 24049744
Int J Sports Physiol Perform. 2016 Apr;11(3):400-3
pubmed: 26217047
J Sports Sci. 2012;30(14):1473-80
pubmed: 22857397
J Strength Cond Res. 2017 Sep 11;:
pubmed: 28930869
Int J Sports Med. 2017 Sep;38(10):735-740
pubmed: 28783849
Int J Sports Physiol Perform. 2011 Sep;6(3):311-21
pubmed: 21911857
J Sci Med Sport. 2017 Dec;20(12):1107-1111
pubmed: 28479281
Int J Sports Physiol Perform. 2018 Feb 1;13(2):163-169
pubmed: 28530450

Auteurs

Dan Weaving (D)

Institute for Sport, Physical Activity and Leisure, Leeds Beckett University, Leeds, West Yorkshire, United Kingdom.
Leeds Rhinos Rugby League club, Leeds, United Kingdom.

Ben Jones (B)

Institute for Sport, Physical Activity and Leisure, Leeds Beckett University, Leeds, West Yorkshire, United Kingdom.
Yorkshire Carnegie Rugby Union club, Leeds, United Kingdom.
The Rugby Football League, Leeds, United Kingdom.

Matt Ireton (M)

Institute for Sport, Physical Activity and Leisure, Leeds Beckett University, Leeds, West Yorkshire, United Kingdom.
Warrington Wolves Rugby League club, Warrington, United Kingdom.

Sarah Whitehead (S)

Institute for Sport, Physical Activity and Leisure, Leeds Beckett University, Leeds, West Yorkshire, United Kingdom.
Leeds Rhinos Rugby League club, Leeds, United Kingdom.

Kevin Till (K)

Institute for Sport, Physical Activity and Leisure, Leeds Beckett University, Leeds, West Yorkshire, United Kingdom.
Leeds Rhinos Rugby League club, Leeds, United Kingdom.
Yorkshire Carnegie Rugby Union club, Leeds, United Kingdom.

Clive B Beggs (CB)

Institute for Sport, Physical Activity and Leisure, Leeds Beckett University, Leeds, West Yorkshire, United Kingdom.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH