- 1Environmental Physiology Laboratory, The University of British Columbia, School of Kinesiology, Vancouver, BC, Canada
- 2Department of Biomedical Physiology and Kinesiology and Sports Analytics Group, Simon Fraser University, Burnaby, BC, Canada
- 3Deptartment of Physical Therapy, The University of British Columbia, Vancouver, BC, Canada
- 4Centre for Heart Lung Innovation, Providence Research, The University of British Columbia and St. Paul’s Hospital, Vancouver, BC, Canada
- 5Department of Family Practice, Vancouver Whitecaps FC, Vancouver, BC, Canada
- 6Division of Sport & Exercise Medicine, The University of British Columbia, Vancouver, BC, Canada
Introduction: Wearable near-infrared spectroscopy (NIRS) measurements of muscle oxygen saturation (SmO2) demonstrated good test–retest reliability at rest. We hypothesized SmO2 measured with the Moxy monitor at the vastus lateralis (VL) would demonstrate good reliability across intensities. For relative reliability, SmO2 will be lower than volume of oxygen consumption (V̇O2) and heart rate (HR), higher than concentration of blood lactate accumulation ([BLa]) and rating of perceived exertion (RPE). We aimed to estimate the reliability of SmO2 and common physiological measures across exercise intensities, as well as to quantify within-participant agreement between sessions.
Methods: Twenty-one trained cyclists completed two trials of an incremental multi-stage cycling test with 5 min constant workload steps starting at 1.0 watt per kg bodyweight (W·kg−1) and increasing by 0.5 W kg−1 per step, separated by 1 min passive recovery intervals until maximal task tolerance. SmO2, HR, V̇O2, [BLa], and RPE were recorded for each stage. Continuous measures were averaged over the final 60 s of each stage. Relative reliability at the lowest, median, and highest work stages was quantified as intraclass correlation coefficient (ICC). Absolute reliability and within-subject agreement were quantified as standard error of the measurement (SEM) and minimum detectable change (MDC).
Results: Comparisons between trials showed no significant differences within each exercise intensity for all outcome variables. ICC for SmO2 was 0.81–0.90 across exercise intensity. ICC for HR, V̇O2, [BLa], and RPE were 0.87–0.92, 0.73–0.97, 0.44–0.74, 0.29–0.70, respectively. SEM (95% CI) for SmO2 was 5 (3–7), 6 (4–9), and 7 (5–10)%, and MDC was 12%, 16%, and 18%.
Discussion: Our results demonstrate good-to-excellent test-retest reliability for SmO2 across intensity during an incremental multi-stage cycling test. V̇O2 and HR had excellent reliability, higher than SmO2. [BLa] and RPE had lower reliability than SmO2. Muscle oxygen saturation measured by wearable NIRS was found to have similar reliability to V̇O2 and HR, and higher than [BLa] and RPE across exercise intensity, suggesting that it is appropriate for everyday use as a non-invasive method of monitoring internal load alongside other metrics.
1. Background
Muscle oxygenation is becoming an increasingly popular physiological marker of the internal load to exercise for athletes, practitioners, and sport scientists (1). With the introduction of wearable near-infrared spectroscopy (NIRS), the potential to monitor muscle oxygen saturation (SmO2) in real-time at the location where the activity occurs is becoming increasingly viable. For measurements intended to be used in everyday training environments, the reliability between sessions, i.e., test–retest reliability is a necessary step before use. Despite its importance, deciphering what change over time is expected in common physiological markers remains challenging. Part of this challenge is due to the variety of ways reliability can be assessed using the same measure for different sporting applications (2).
Both relative and absolute test–retest reliability of muscle oxygenation signals measured by the Moxy muscle oxygen monitor (Fortiori Design LLC., Hutchinson, MN, USA), in particular muscle oxygen saturation, has been investigated in the conditions of rest, vascular occlusion, and exercise (3–5). Relative reliability evaluates the observed measurement error relative to observed differences between participants and relates to the ability to consistently differentiate participants. Absolute reliability evaluates the consistency of repeated measurements (repeatability) within a single individual (2, 6). Agreement evaluates how close any two measurements are, in the original units of the parameter being evaluated (2, 6). For this sensor, comparing the reliability of SmO2 with other common physiological measures such as heart rate (HR), rate of systemic oxygen uptake (V̇O2), concentration of blood lactate ([BLa]), and rating of perceived exertion (RPE), will provide a useful contrast for each of these common markers during the same exercise protocol.
The behaviour of SmO2 has been studied in both incremental ramp exercise and constant workloads (CW) protocols at various intensities (7–14). SmO2 is a measure of the relative concentration of oxygenated and deoxygenated hemoglobin and myoglobin in the tissue under the sensor, and reflects the balance of local O2 delivery and O2 extraction (i.e., O2 supply to match energetic demand) (3, 4, 15). A higher rate of O2 delivery than extraction, all else equal, will tend to produce rising SmO2 (excess supply). A higher rate of O2 extraction over delivery will produce falling SmO2 (excess demand). At higher exercise intensity SmO2 at locomotor muscles tends to be lower, reflecting the greater difficulty matching O2 delivery to extraction. During incremental cycling exercise, SmO2 measured at the vastus lateralis (VL) typically desaturates as a function of increasing exercise intensity, generally following a sigmoidal or segmented linear profile (16–20). The behaviour of SmO2 has been studied for both continuous and constant workloads (CW) at various intensities (7–14). With increasing workload, a desaturation is expected for SmO2, indicating increased oxygen extraction at the observed tissues (19). SmO2 desaturation typically varies monotonically as a function of exercise intensity, generally following a sigmoidal or segmented linear profile (16–20).
During CW cycling exercise, SmO2 at the VL will rapidly deoxygenate before reaching a quasi-steady state within 60–120 s, similar to the primary onset phase of systemic V̇O2 kinetics (21, 22). At this quasi-steady state, SmO2 may gradually rise at lower intensity, and gradually fall at higher intensity (15, 23, 24).
An incremental multi-stage cycling test combines aspects of both incremental ramp and CW exercise in a single protocol. Constant workload stages are performed at progressively increasing workloads up to maximal exercise tolerance. Work steps are ideally 3 min in duration or longer to permit physiological measures to reach a relative steady-state at each workload (25, 26). Between each work step, a 1 min passive recovery interval is added to observe the onset desaturation response of each work step. By employing this protocol, the progressive effects of each workload on all performance and physiological markers can be observed in relative isolation in a time efficient single session (27, 28). Recently, this protocol has been used to compare SmO2 with [BLa], using a similar rationale outlined previously (28). Despite the benefits of segmenting each intensity with an integration of passive rest periods for identifying the internal load response of each work step, reliability of SmO2 during such a protocol remains unclear.
Multiple studies have validated various stationary NIRS sensors with each of the physiological and performance markers outlined above, during various modes of exercise (12, 17, 22, 29–31). Despite these reports, what portable, wearable NIRS test–retest reliability will be in comparison with each of these markers during a common exercise assessment protocol remains unclear. If wearable NIRS is intended as a commercially available, non-invasive instrument to be used by athletes and coaches to provide useful information about internal load during exercise, addressing this question is of great importance. The purpose of our study was to estimate the reliability of muscle oxygenation, alongside common performance, and physiological markers across exercise intensity, as well as to provide absolute agreement in original units that can be expected between sessions for each marker.
We hypothesized that SmO2 measured with the Moxy muscle oxygen monitor at the VL would demonstrate good test–retest reliability across intensity during an incremental multi-stage cycling test incremental protocol (3–5, 16, 28). We also hypothesized that relative reliability scores for SmO2 would be lower than for V̇O2 and HR, higher than [BLa] and RPE, and have larger thresholds for minimal detectable change compared with V̇O2 and HR, but not RPE and [BLa] (32–39).
2. Materials and methods
2.1. Participants
Twenty-one trained cyclists (10 females and 11 males; 29.1 ± 7.8 years of age, 69.6 ± 11.3 kg, 174 ± 11 cm, 58.6 ± 7.9 ml kg min−1 peak oxygen uptake) volunteered and provided written informed consent to participate in this experiment. Descriptive statistics are presented in Table 1. To obtain sufficient power of β = 0.8 with α = 0.05, an a priori sample size calculation was made in G*Power software (version 3.1.9.7, Kiel, Germany) using previously reported data from other groups that compared SmO2 values within and between sessions during ramp incremental tests and severe intensity efforts (5, 15, 16, 40, 41). This study was conducted in accordance with the principles established in the declaration of Helsinki and approved by the research ethics committee of The University of British Columbia (H21-00446). The inclusion criteria were healthy athletes with at least 2 years of training experience in cycling, aged 18–48 years, V̇O2peak ≥45 ml kg−1 min−1 for females and ≥50 ml kg−1 min−1 for males, non-smokers, with no history of cardiovascular disease, and no injuries requiring time away from training within the previous six months (42–44). Sex was classified by self-report and all participants identified as either male or female.
2.2. Experimental design
Participants visited the laboratory on two occasions one or two weeks apart, at the same time of day (±1 h) to assess test–retest reliability of NIRS parameters during an incremental multi-stage cycling test. Trials were performed outside of the participants' competitive season. Participants were instructed to avoid strenuous exercise for 24 h (hrs), avoid alcohol for 12 h, avoid caffeine for 4 h, maintain the same diet for 12 h, and get at least 8 h of sleep prior to both trials.
At the start of the first trial, participants' height and mass were recorded, and skinfold measurements (Harpenden Skinfold Caliper, Baty International, West Sussex, England) were taken at the right vastus lateralis at the placement of the NIRS sensor (detailed below).
Each trial consisted of an incremental multi-stage cycling test, with 5 min CW stages, each followed by a 1-min passive rest interval. The protocol began at 1.0 W kg−1 and progressed by 0.5 W kg−1 per stage. Participants performed the first trial at a self-selected cadence and maintained ±5 cadence range throughout the second trial. Participants performed the first trial to maximal exercise tolerance or until cadence fell by more than 10 rpm for 10 s, despite strong verbal encouragement. This cutoff was selected to minimize the effect of changing cadence on SmO2 (45). In the second trial, the protocol was repeated to the same duration as the first trial (n = 18) or to the limits of tolerance if they were unable to reach the same end point (n = 3). Participants maintained a seated position with both hands on the handlebars in their preferred riding position for the entire exercise protocol.
2.3. Data collection
The cycling protocol was performed on the participant's own bicycle mounted to an electronically controlled stationary trainer (Tacx NEO 2T, Garmin International Inc., Olathe, KS, USA). Resistance was controlled and cycling power output (watts) and cadence [revolutions per minute (rpm)] were recorded at 1 Hz using PerfPRO Studio Software (Hartware Technologies, Rockford, MI, USA) installed on a laptop computer.
Heart rate [beats per minute (bpm)] was recorded from a chest strap monitor (Garmin International Inc., Olathe, KS, USA) at 1 Hz. Capillary [BLa] (mmol·L−1) was sampled from the fingertip during the last 30 s of each work stage and measured with an electrochemical biosensor analyzer (Edge USA, USA). RPE was reported at the same time using the Borg category-ratio 0–10 scale (46). Expired gases were collected and sampled by an open circuit metabolic analyzer (TrueOne 2400, ParvoMedics Inc., Sandy, UT, USA) from a 4 L mixing chamber and V̇O2 was recorded as 15 s average values.
Muscle oxygen saturation was measured at the right vastus lateralis (VL) using a wearable NIRS sensor (Moxy Monitor, Fortiori Design LLC., Hutchinson, MN, USA). The Moxy monitor is a self-contained, relatively low cost, wearable continuous wave NIRS sensor that can resolve an arbitrarily scaled heme volume that represents both hemoglobin and myoglobin concentration in the tissue under illumination, as well as SmO2 on a 0%–100% scale (4). The sensor was positioned on the right VL muscle belly at ⅓ the distance from the patella to greater trochanter with the participant seated with knee bent to 90°. The sensor was secured with adhesive tape along with the manufacturer-supplied light shield to minimize signal interference from ambient light and movement.
Moxy employs four wavelengths of near-infrared light (680, 720, 760, and 800 nm), with a single LED source and two detectors at 12.5 and 25 mm separation, giving a maximum penetration depth of approximately 12.5 mm (3). It is recommended that participant skinfold thickness (SF) be less than the maximum penetration depth (SF < ½ the maximum inter-optode distance); however, to better represent real-world use of NIRS with competitive athletes, no exclusion was made on the basis of skinfold measurements, nor was a correction performed for SF (47). The SmO2 signal was used as the primary muscle oxygenation variable in this study (3–5). SmO2 was recorded every 2 s (0.5 Hz) and smoothed with a 5 s moving average as per manufacturer default settings.
2.4. Data analysis
V̇O2peak was determined as the highest two consecutive 15 s measurements within each trial. Peak workload (Wpeak) was determined for each trial using the following formula:
Where WC is the workload of the prior completed stage, ΔW is the final incremental workload (0.5 W kg−1), and t is the time completed at the final work stage. HRpeak was determined as the highest 1 s value recorded during each trial.
To evaluate test–retest reliability of each parameter across exercise intensity between the two trials, we analyzed the first, median, and last work stages performed by each participant. These will be referred to as lowest, median, and highest workloads. Continuous variables (SmO2, V̇O2, and HR) were averaged from the last 60 s within each work stage, omitting the final 10 s to exclude influence from the end of work stage transition to rest.
During analysis, three sets of measurements were excluded due to measurement error or methodological inconsistencies. First, gas exchange measurements for two participants were excluded. All V̇O2 data from one participant was excluded while the data from the final stage of the other was excluded. Second, technical issues with the ergometer resistance occurred for two participants, which affected the lowest and median work stages, respectively. These issues led to the [BLa], V̇O2, and SmO2 data having to be discarded. Lastly, the HR signal during the first stage in one participant was not acquired.
2.5. Statistical analysis
Data analysis and statistical analysis were carried out in R (v4.1.2, R Foundation for Statistical Computing, Vienna, Austria). Descriptive results are presented as mean ± standard deviation (SD) or 95% confidence interval (CI95%), as indicated. A two-way repeated measures ANOVA was performed to evaluate the effects of workload (lowest, median, highest) and trial (1, 2) on each variable. Pairwise post hoc comparisons were made to evaluate for a main effect of trial and adjusted for multiple comparisons with the Bonferroni method. Significance was set at p < 0.05. Normality was assessed by Shapiro–Wilk tests.
Relative reliability was quantified as intraclass correlation coefficient (ICC2,1) (2, 6). Interpretations of poor (<0.5), moderate (0.5–75), good (0.75–0.9), and excellent (>0.9) were used (48).
Absolute reliability and within-subject agreement were quantified as the SEM in original units, which is appropriate for homoscedastic data (2, 49, 50). All outcome variables evaluated in the present study were found to be homoscedastic from visual analysis and Levene's test, except for [BLa]. Absolute reliability was also quantified as a coefficient of variation (CV) from within-subject repeated measurements (Atkinson & Nevill, 1998). The minimal detectable change (MDC) was calculated from SEM at a CI95% (Hopkins 2000; Weir 2005), as:
3. Results
3.1. Test–retest comparisons
Data from a representative participant performing the full incremental multi-stage cycling protocol, with muscle oxygen saturation (SmO2) response from two trials are displayed in Figure 1. Data from the same participant at the lowest, median, and highest workloads are displayed in Figure 2. Descriptive group results for trials 1 and 2 can be found in Table 2. Test-retest comparisons between trials at the lowest, median, and highest workloads are displayed in Figure 3. Between-trials comparisons showed no significant differences within each workload for all outcome variables. Workload at the lowest, median, and highest stages were 71 ± 11, 199 ± 47, and 303 ± 80 W, respectively. Cadence showed an effect of workload between lowest and median (p < 0.01), and lowest and highest workloads (p < 0.001), with no difference between median and highest (94 ± 8) workloads (p = 0.062) (Table 2).
Figure 1. Data shown for a representative participant performing two trials of an incremental multi-stage cycling test. Work stages are 5 minutes with 1-min passive rest intervals. Workload starts at 1.0 W·kg−1 and increases by 0.5 W·kg−1 each stage. Muscle oxygen saturation (SmO2) overlaid from Trial 1 (light blue) and Trial 2 (dark blue). Participants performed the first trial to the maximal limits of tolerance. The second trial was repeated to the same duration, or to the limits of tolerance if they were unable to reach the same duration as the first trial.
Figure 2. Data shown for a representative participant at the lowest, median, and highest work stages from two trials, with workloads displayed in W·kg−1. Muscle oxygen saturation (SmO2) recorded from Trial 1 (light blue) and Trial 2 (dark blue). Mean SmO2 data from the last 60 seconds of each work stage was used for test-retest comparisons (red shaded areas).
Table 2. Performance and physiological parameters for the lowest, median, and highest workloads (mean ± SD).
Figure 3. Between trial comparisons of (A) muscle oxygen saturation (SmO2), (B) heart rate (HR), (C) systemic oxygen uptake (V̇O2), (D) concentration of blood lactate ([Bla]), and (E) rating of perceived exertion (RPE).
3.2. Relative reliability
ICC for SmO2 were good to excellent. ICC for HR and V̇O2 were good to excellent, apart from V̇O2 at the lowest workload, which was moderate. ICC for [BLa] and RPE were poor to moderate (Table 3).
3.3. Absolute reliability
SEM (95% CI) for SmO2 was 5 (3–7), 6 (4–9), and 7 (5–10)%, and MDC was 12%, 16%, and 18% at the lowest, median, and highest workloads, respectively (Table 3).
4. Discussion
The aim of this study was to estimate and compare the reliability of wearable NIRS with common performance and physiological measures across an incremental multi-stage cycling test on a stationary cycling ergometer. Our results for test–retest analysis supported our hypothesis that SmO2 would demonstrate good reliability across intensity, with excellent ICC scores at the lowest and median workloads, and a good ICC at the highest workload. With regards to relative reliability compared with the remaining variables, SmO2 provided similar ICC values to both HR and V̇O2, scoring higher than V̇O2 at the lowest workload, but not as high as both of these variables at the highest workload. Compared with [BLa] and RPE, SmO2 had better ICC scores across intensity (Table 3). This suggests that for use as an everyday tool for monitoring training load, SmO2 is similarly sensitive to physiological variability as these other measures.
Compared to previous studies of reliability of wearable NIRS (3–5), our results demonstrated higher absolute agreement compared to the studies that examined the reliability of wearable NIRS at rest. These studies reported absolute agreement as a CV of approximately 5% SmO2 (2.5%–9.5%) (3), or using Bland-Altman limits of agreement of approximately 10% SmO2 between trials (4), versus our CV of 11%–44% SmO2 from lowest to highest intensity respectively. However, we observed that SmO2 was homoscedastic, with a similar variability at high and low values (at low and high intensity, respectively). CV is less appropriate for homoscedastic data where variance is not proportional to parameter values, such as for SmO2. However, its common use in sport science reliability studies justified its inclusion (2, 49, 50). Caution is advised with the interpretation of CV for SmO2 in this context. Our ICC scores across workloads were comparable to those by Crum et al., who also looked at the reliability of wearable NIRS during an incremental exercise test (IET) (5). Like our CV results, they found higher CV values (lower absolute reliability) as workload increased. It is worth considering that the direction of change for SmO2 alone is inversely related to intensity, while all other parameters increase along with intensity. Our advice for sport practitioners is to consider the absolute limits of agreement when interpreting test–retest values. This can be quantified by the SEM when considering daily variation, and the MDC for interpreting the results of a longitudinal training intervention.
A study by van Hooff et al., investigated the reliability of PortaMon, wearable NIRS sensor during a continuous IET (51). They reported less variability with CV scores of 6, 8, and 10% at baseline, post exercise recovery, and maximal task tolerance, respectively. Despite the lower variability, their relative reliability scores were lower than the ones detected in our results. Unlike the Moxy sensor used in our study, the PortaMon sensor is more expensive, offering greater illumination depth and sampling rate. A study by McManus et al., compared between the PortaMon and Moxy during rest, exercise, and arterial occlusion (3). They found that Moxy displayed a greater dynamic range during exercise and arterial occlusion. Therefore, it is possible that the differences in both absolute and relative reliability between van Hooff's group and our results is explained by sensor differences.
Our results in Table 3 provide useful information for athletes, practitioners, and sport scientists who wish to monitor changes in VL SmO2 over time using a Moxy wearable sensor. Having SEM and MDC quantified across intensity can aid in understanding the expected day to day variance for SmO2 during regular training and testing. Changes within a range of ±SEM may be meaningful for an individual athlete but should not be considered to represent a significant difference. A longitudinal intervention such as a structured training program can be considered to have a significant effect on SmO2 if the values measured at a given absolute workload change by more than the MDC. Additionally, having all other common measures presented in conjunction with SmO2 improves our understanding of what each expected change should be for specific exercise workloads over time.
4.1. SmO2 compared to V̇O2 and HR
As expected, V̇O2 presented excellent reliability at all but the lowest workload, potentially due to a signal noise ratio issue. Since the early 1980’s, V̇O2 has been shown to demonstrate good-to-excellent test–retest reliability scores with a CV of approximately 10% across intensity during incremental exercise tests under controlled conditions (35, 37). In a recent investigation on the reliability of V̇O2 during an IET, Pallarés et al. (2016), observed the reliability of V̇O2 associated with the gas exchange threshold and the respiratory compensation point (37). These are common thresholds used to demarcate the transitions from moderate to heavy and heavy to severe intensity domains, respectively. They reported excellent test–retest ICC scores for both gas exchange threshold and the respiratory compensation point. In their report, within-subject reliability for both thresholds was found to have slightly lower CV (3.6% and 2.1%, respectively) than those found in our study at the median and highest workload (6.4% and 6.5%) (37).
In another study, reliability of V̇O2 and ventilatory measurements were estimated for the ParvoMedics TrueOne 2400 metabolic cart during a cycling ergometer IET (52). The protocol included 10–12 min work steps with 50 W increments, starting at a resistance of 50 W and progressing to 250 W. For V̇O2, their results showed a CV of approximately 5%, providing further strength to our findings, especially during the highest workload.
As for HR, Montoye et al. observed the reliability of HR using a chest strap during an IET. The protocol included 3 min work steps with 50 W increments, starting at a resistance of 50 W to maximal task tolerance. Their results showed excellent test–retest reliability scores, with CV less than 5% as seen in our results (36). In our results, HR showed CV ranging from 2.6% to 4.7% across intensity, and good to excellent ICC scores, which was higher than SmO2 only at the highest workload.
As expected, both V̇O2 and HR did show better reliability compared with SmO2, but only during the highest intensity. Despite difference in the mechanisms related to each measurement, comparing them side-by-side during an incremental exercise test provides an interesting contrast to better understand how reliable SmO2 is between sessions, in reference to these two common measures.
4.2. SmO2 compared to RPE and [BLa]
Our results showed higher reliability for SmO2 compared to both RPE and [BLa] across intensity. Both RPE and [BLa] ICC scores were either poor or moderate, compared with the good to excellent ICC scores for SmO2. Both of these measures have shown a wide range of reliability scores during incremental exercise tests (32–34, 36, 37). In our study, RPE ICC scores were poor-to-moderate, with ICC at the highest workload demonstrating the lowest ICC score across all outcome variables. As for [BLa], ICC scores were poor at the lowest workload, and moderate at the median and highest workloads. Our findings were in agreement with previous reports by Ettema et al., that showed variable test–retest scores for [BLa], across similar intensities (ICC = 0.44–0.92, MDC = 26%–69%) (38). It is worth highlighting that both [BLa] and RPE were taken using single measurements at the end of each stage, unlike the other continuous measures. They are also more prone to subjective interpretation of RPE scores, and investigator error in accurately acquiring capillary blood samples during exercise (34, 53). Despite [BLa] and RPE measuring different mechanisms, they are commonly used in exercise testing. Thus, contrasting them with SmO2 aids in highlighting how reliable each of these measures are in this context.
5. Study limitations
First, our experimental design did not include a familiarization trial to limit participant burden, which may have negatively affected the test–retest reliability outcomes. As previously mentioned, RPE scales are known to require participant familiarization periods, to ensure accurate self-reporting of exercise exertion (53). They are also dependent on other external factors such as psychological stress, emotional state, and readiness (53). As such, it is possible that the lack of familiarization negatively impacted the RPE reliability outcomes compared with the other measures used in our study. Second, the SmO2 signal of the wearable, self-contained Moxy sensor may be more sensitive to changes in blood flow, and changes in tissue properties with small positioning inconsistencies compared to NIRS signals measured with more advanced stationary technologies (21, 47, 54, 55). These sensor limitations may have affected the quality of the signal, and as a result, the test–retest results presented in our study. In order to include a more representative sample size of healthy, trained male and female cyclists, we did not exclude any participants based on skinfold thickness, even though previous reports suggest signal quality may be affected by skinfold thickness greater than 12.5 mm (3, 4, 56). Our rationale for providing a more generalizable subject group was a compromise that possibly affected reliability outcomes. And third, as muscle recruitment is specific to exercise modalities, NIRS responses will be specific to the location and modality (57, 58). Other locations may demonstrate higher physiological variation, although the measurement uncertainty of the Moxy device itself may be consistent across modalities (59).
6. Conclusions
Our main objectives were estimating the reliability of muscle oxygen saturation, alongside common performance and physiological markers across exercise intensity, as well as to provide absolute agreement in original units that can be expected between sessions for each marker. Our findings show that the commercially available Moxy wearable NIRS sensor provides good-to-excellent test–retest reliability during an incremental multi-stage cycling protocol. Compared to other common physiological metrics, reliability was highest for V̇O2 and HR, followed by SmO2, with [BLa] and RPE having lower reliability. Additionally, estimated values of absolute agreement reported in our results provide practitioners with the ability to differentiate measurement error from “true” physiological change. Athletes and coaches should expect a certain range of day to day variability when assessing and monitoring performance. This knowledge can be applied by practitioners who use this non-invasive, affordable technology for training prescription or monitoring internal load alongside other common measures. Wearable NIRS is an important instrument that if used appropriately, can add valuable insight into understanding muscle metabolic responses to exercise.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving human participants were reviewed and approved by the Research Ethics Committee of The University of British Columbia. The patients/participants provided their written informed consent to participate in this study.
Author contributions
The experiment was designed by MK, DC, BS, JG, JA, and AY; JA, HN, and AY collected the data. JA and AY performed the statistical analysis, and wrote the first draft of the manuscript which was reviewed and revised by MK, JG, BS, DC, and JA. All authors contributed to the article and approved the submitted version.
Funding
This study was funded by a Natural Sciences and Engineering Research Council Discovery Grant and an equipment grant from the British Columbia Sport and Exercise Medicine Research Foundation.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Perrey S, Ferrari M. Muscle oximetry in sports science: a systematic review. Sport Med. (2018) 48(3):597–616. doi: 10.1007/s40279-017-0820-1
2. Weir JP. Quantifying test-retest reliability using the intraclass correlation coefficient and the sem. J Strength Cond Res. (2005) 19:231–40.15705040
3. McManus CJ, Collison J, Cooper CE. Performance comparison of the MOXY and PortaMon near-infrared spectroscopy muscle oximeters at rest and during exercise. J Biomed Opt. (2018) 23(01):1. doi: 10.1117/1.JBO.23.1.015007
4. Feldmann A, Schmitz R, Erlacher D. Near-infrared spectroscopy-derived muscle oxygen saturation on a 0% to 100% scale: reliability and validity of the moxy monitor. J Biomed Opt. (2019) 24(11):1. doi: 10.1117/1.JBO.24.11.115001.full
5. Crum EM, O’Connor WJ, Van Loo L, Valckx M, Stannard SR. Validity and reliability of the moxy oxygen monitor during incremental cycling exercise. Eur J Sport Sci. (2017) 17(8):1037–43. doi: 10.1080/17461391.2017.1330899
6. Bartlett JW, Frost C. Reliability, repeatability and reproducibility: analysis of measurement errors in continuous variables. Ultrasound Obs Gynecol. (2008) 31:466–75. doi: 10.1002/uog.5256
7. Yogev A, Arnold J, Clarke D, Guenette JA, Sporer BC, Koehle MS. Comparing the respiratory compensation point with muscle oxygen saturation in locomotor and non-locomotor muscles using wearable NIRS spectroscopy during whole-body exercise. Front Physiol. (2022) 13:818733. doi: 10.3389/fphys.2022.818733
8. Paquette M, Bieuzen F, Billaut F. The effect of HIIT vs. SIT on muscle oxygenation in trained sprint kayakers. Eur J Appl Physiol. (2021) 121:2743–59. doi: 10.1007/s00421-021-04743-z
9. Paquette M, Bieuzen F, Billaut F. Effect of a 3-weeks training camp on muscle oxygenation, VO2 and performance in elite sprint kayakers. Front Sport Act Living. (2020) 2:1–13. doi: 10.3389/fspor.2020.00047
10. Murias JM, Keir DA, Spencer MD, Paterson DH. Sex-related differences in muscle deoxygenation during ramp incremental exercise: response to peltonen et al. Respir Physiol Neurobiol. (2014) 195(1):61–2. doi: 10.1016/j.resp.2014.01.010
11. Fontana FY, Keir DA, Bellotti C, De Roia GF, Murias JM, Pogliaghi S. Determination of respiratory point compensation in healthy adults: can non-invasive near-infrared spectroscopy help? J Sci Med Sport. (2015) 18(5):590–5. doi: 10.1016/j.jsams.2014.07.016
12. Keir DA, Pogliaghi S, Murias JM. The respiratory compensation point and the deoxygenation break point are valid surrogates for critical power and maximum lactate steady state. Med Sci Sports Exerc. (2018) 50(11):2375–8. doi: 10.1249/MSS.0000000000001698
13. Boone J, Koppo K, Barstow TJ, Bouckaert J. Effect of exercise protocol on deoxy[Hb + Mb]: incremental step versus ramp exercise. Med Sci Sports Exerc. (2010) 42(5):935–42. doi: 10.1249/MSS.0b013e3181c0ecea
14. Boone J, Barstow TJ, Celie B, Prieur F, Bourgois J. The interrelationship between muscle oxygenation, muscle activation, and pulmonary oxygen uptake to incremental ramp exercise: influence of aerobic fitness. Appl Physiol Nutr Metab. (2015) 41(1):55–62. doi: 10.1139/apnm-2015-0261
15. Kirby BS, Clark DA, Bradley EM, Wilkins BW. The balance of muscle oxygen supply and demand reveals critical metabolic rate and predicts time to exhaustion. J Appl Physiol. (2021) 130(6):1915–27. doi: 10.1152/japplphysiol.00058.2021
16. Van Der Zwaard S, Jaspers RT, Blokland IJ, Achterberg C, Visser JM, Den Uil AR, et al. Oxygenation threshold derived from near- infrared spectroscopy: reliability and its relationship with the first ventilatory threshold. PLoS One. (2016) 11(9):1–16. doi: 10.1371/journal.pone.0162914
17. Keir DA, Fontana FY, Robertson TC, Murias JM, Paterson DH, Kowalchuk JM, et al. Exercise intensity thresholds: identifying the boundaries of sustainable performance. Med Sci Sports Exerc. (2015) 47(9):1932–40. doi: 10.1249/MSS.0000000000000613
18. Racinais S, Buchheit M, Girard O, Perrey S. Breakpoints in ventilation, cerebral and muscle oxygenation, and muscle activity during an incremental cycling exercise. Front Physiol. (2014) 5:142.24782786
19. Boone J, Vandekerckhove K, Coomans I, Prieur F, Bourgois JG. An integrated view on the oxygenation responses to incremental exercise at the brain, the locomotor and respiratory muscles. Eur J Appl Physiol. (2016) 116(11–12):2085–102. doi: 10.1007/s00421-016-3468-x
20. Spencer MD, Murias JM, Paterson DH. Characterizing the profile of muscle deoxygenation during ramp incremental exercise in young men. Eur J Appl Physiol. (2012) 112(9):3349–60. doi: 10.1007/s00421-012-2323-y
21. Grassi B, Pogliaghi S, Rampichini S, Quaresima V, Ferrari M, Marconi C, et al. Muscle oxygenation and pulmonary gas exchange kinetics during cycling exercise on-transitions in humans. J Appl Physiol. (2003) 95(1):149–58. doi: 10.1152/japplphysiol.00695.2002
22. Bellotti C, Calabria E, Capelli C, Pogliaghi S. Determination of maximal lactate steady state in healthy adults: can NIRS help? Med Sci Sports Exerc. (2013) 45(6):1208–16. doi: 10.1249/MSS.0b013e3182828ab2
23. Neary JP, McKenzie DC, Bhambhani YN. Effects of short-term endurance training on muscle deoxygenation trends using NIRS. Med Sci Sports Exerc. (2002) 34(11):1725–32. doi: 10.1097/00005768-200211000-00006
24. Bhambhani Y, Buckley S, Susaki T. Muscle oxygenation trends during constant work rate cycle exercise in men and women. Med Sci Sports Exerc. (1999) 1:90–8. doi: 10.1097/00005768-199901000-00015
25. Jamnick NA, Botella J, Pyne DB, Bishop DJ. Manipulating graded exercise test variables affects the validity of the lactate threshold and VO2peak. PLoS One. (2018) 13(7):1–21. doi: 10.1371/journal.pone.0199794
26. Bourdon PC, Woolford SM, Buckley JD. Effects of varying the step duration on the determination of lactate thresholds in elite rowers. Int J Sports Physiol Perform. (2018) 13(6):687–93. doi: 10.1123/ijspp.2017-0258
27. Bentley DJ, Newell J, Bishop D. Incremental exercise test design and analysis: implications for performance diagnostics in endurance athletes. Sports Med (Auckland). (2007) 37:575–86. doi: 10.2165/00007256-200737070-00002
28. Batterson PM, Kirby BS, Hasselmann G, Feldmann A. Muscle oxygen saturation rates coincide with lactate-based exercise thresholds. Eur J Appl Physiol. (2023):1–10. doi: 10.1007/s00421-023-05238-9
29. Caen K, Vermeire K, Bourgois JG, Boone J. Exercise thresholds on trial: are they really equivalent? Med Sci Sports Exerc. (2018) 50(6):1277–84. doi: 10.1249/MSS.0000000000001547
30. Inglis EC, Iannetta D, Keir DA, Murias JM. Training-Induced changes in the RCP, [HHb]BP and MLSS: evidence of equivalence. Int J Sports Physiol Perform. (2019) 6:1–7. doi: 10.1123/ijspp.2019-0046
31. Snyder AC, Parmenter MA. Using near-infrared spectroscopy to determine maximal steady state exercise intensity. J Strength Cond Res. (2009) 23(6):1833–40. doi: 10.1519/JSC.0b013e3181ad3362
32. Aunola S, Rusko H. Reproducibility of aerobic and anaerobic thresholds in 20–50 year old men. Eur J Appl Physiol Occup Physiol. (1984) 53:260–6. doi: 10.1007/BF00776600
33. Potteiger JA, Weber SF. Rating of perceived exertion and heart rate as indicators of exercise intensity in different environmental temperatures. Med Sci Sports Exerc. (1994) 26(6):791–6. doi: 10.1249/00005768-199406000-00020
34. Grant S, McMillan K, Newell J, Wood L, Keatley S, Simpson D, et al. Reproducibility of the blood lactate threshold, 4 mmol·l–1 marker, heart rate and ratings of perceived exertion during incremental treadmill exercise in humans. Eur J Appl Physiol. (2002) 87(2):159–66. doi: 10.1007/s00421-002-0608-2
35. Prud’Homme D, Bouchard C, Leblance C, Landry F, Lortie G, Boulay MR. Reliability of assessments of ventilatory thresholds. J Sports Sci. (1984) 2(1):13–24. doi: 10.1080/02640418408729692
36. Montoye AHK, Vondrasek JD, Hancock JB II. Validity and reliability of the VO2 master pro for oxygen consumption and ventilation assessment. Int J Exerc Sci. (2020) 13(4):1382–401.33042375
37. Pallarés JG, Morán-Navarro R, Ortega JF, Fernández-Elías VE, Mora-Rodriguez R. Validity and reliability of ventilatory and blood lactate thresholds in well-trained cyclists. PLoS One. (2016) 11:e0163389. doi: 10.1371/journal.pone.0163389
38. Ettema M, Brurok B, Baumgart JK. Test-retest reliability of physiological variables during submaximal seated upper-body poling in able-bodied participants. Front Physiol. (2021) 12:749356. doi: 10.3389/fphys.2021.749356
39. Herman L, Foster C, Maher MA, Mikat RP, Porcari JP. Validity and reliability of the session RPE method for monitoring exercise training intensity. S Afr J Sport Med. (2006) 18(1):14–7. doi: 10.17159/2078-516X/2006/v18i1a247
40. Contreras-Briceño F, Espinosa-Ramirez M, Hevia G, Llambias D, Carrasco M, Cerda F, et al. Reliability of NIRS portable device for measuring intercostal muscles oxygenation during exercise. J Sports Sci. (2019) 37(23):2653–9. doi: 10.1080/02640414.2019.1653422
41. Kriel Y, Askew CD, Solomon C. The effect of running versus cycling high-intensity intermittent exercise on local tissue oxygenation and perceived enjoyment in 18–30-year-old sedentary men. PeerJ. (2018) 6:e5026. doi: 10.7717/peerj.5026
42. McKay AKA, Stellingwerff T, Smith ES, Martin DT, Mujika I, Goosey-Tolfrey VL, et al. Defining training and performance caliber: a participant classification framework. Int J Sports Physiol Perform. (2022) 17(2):317–31. doi: 10.1123/ijspp.2021-0451
43. Pauw KD, Roelands B, Cheung SS, de Geus B, Rietjens G, Meeusen R. Guidelines to classify subject groups in sport-science research. Int J Sports Physiol Perform. (2013) 8(2):111–22. doi: 10.1123/ijspp.8.2.111
44. Decroix L, De Pauw K, Foster C, Meeusen R. Guidelines to classify female subject groups in sport-science research. Int J Sports Physiol Perform. (2016) 11(2):204–13. doi: 10.1123/ijspp.2015-0153
45. Takaishi T, Ishida K, Katayama K, Yamazaki K, Yamamoto T, Moritani T. Effect of cycling experience and pedal cadence on the near-infrared spectroscopy parameters. Med Sci Sports Exerc. (2002) 34(12):2062–71. doi: 10.1097/00005768-200212000-00030
46. Borg GA. Psychophysical bases of perceived exertion. Med Sci Sports Exerc. (1982) 14(5):377–81. doi: 10.1249/00005768-198205000-00012
47. Barstow TJ. Understanding near infrared spectroscopy and its application to skeletal muscle research. J Appl Physiol. (2019) 126:1360–76. doi: 10.1152/japplphysiol.00166.2018
48. Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. (2016) 15(2):155–63. doi: 10.1016/j.jcm.2016.02.012
49. Hopkins WG. Measures of reliability in sports medicine and science. Sport Med. (2000) 30(5):375–81. doi: 10.2165/00007256-200030050-00006
50. Atkinson G, Nevill AM. Statistical methods for assessing measurement error (reliability) in variables relevant to sports medicine. Sports Med. (1998) 26(4):217–38. doi: 10.2165/00007256-199826040-00002
51. van Hooff M, Meijer EJ, Scheltinga MRM, Savelberg HHCM, Schep G. Test–retest reliability of skeletal muscle oxygenation measurement using near-infrared spectroscopy during exercise in patients with sport-related iliac artery flow limitation. Clin Physiol Funct Imaging. (2022) 42(2):114–26. doi: 10.1111/cpf.12738
52. Crouter SE, Antczak A, Hudak JR, DellaValle DM, Haas JD. Accuracy and reliability of the ParvoMedics TrueOne 2400 and MedGraphics VO2000 metabolic systems. Eur J Appl Physiol. (2006) 98(2):139–51. doi: 10.1007/s00421-006-0255-0
53. Halperin I, Emanuel A. Rating of perceived effort: methodological concerns and future directions. Sport Med. (2020) 50(4):679–87. doi: 10.1007/s40279-019-01229-z
54. Messere A, Roatta S. Influence of cutaneous and muscular circulation on spatially resolved versus standard beer-Lambert near-infrared spectroscopy. Physiol Rep. (2013) 1(7):e00179. doi: 10.1002/phy2.179
55. Koga S, Poole DC, Kondo N, Oue A, Ohmae E, Barstow TJ. Effects of increased skin blood flow on muscle oxygenation/deoxygenation: comparison of time-resolved and continuous-wave near-infrared spectroscopy signals. Eur J Appl Physiol. (2015) 115(2):335–43. doi: 10.1007/s00421-014-3019-2
56. Niemeijer VM, Spee RF, Jansen JP, Buskermolen ABC, van Dijk T, Wijn PFF, et al. Test–retest reliability of skeletal muscle oxygenation measurements during submaximal cycling exercise in patients with chronic heart failure. Clin Physiol Funct Imaging. (2017) 37(1):68–78. doi: 10.1111/cpf.12269
57. Zinner C, Sperlich B, Born D-P, Michels G. Effects of combined high intensity arm and leg training on performance and cardio-respiratory measures. J Sports Med Phys Fitness. (2017) 57(7–8):969–75. doi: 10.23736/S0022-4707.16.06539-7
58. Craig JC, Broxterman RM, Wilcox SL, Chen C, Barstow TJ. Effect of adipose tissue thickness, muscle site, and sex on near-infrared spectroscopy derived total-[hemoglobin-myoglobin]. J Appl Physiol. (2017) 123(6):1571–8. doi: 10.1152/japplphysiol.00207.2017
Keywords: muscle oxygen saturation, near-infrared spectroscopy, repeatability, metabolic analysis, incremental exercise test, wearable, exercise testing, training zones
Citation: Yogev A, Arnold J, Nelson H, Clarke DC, Guenette JA, Sporer BC and Koehle MS (2023) Comparing the reliability of muscle oxygen saturation with common performance and physiological markers across cycling exercise intensity. Front. Sports Act. Living 5:1143393. doi: 10.3389/fspor.2023.1143393
Received: 12 January 2023; Accepted: 11 July 2023;
Published: 4 August 2023.
Edited by:
Paul William Macdermid, Massey University, New ZealandReviewed by:
Stephane Perrey, Université de Montpellier, FranceDavid Varillas-Delgado, Universidad Francisco de Vitoria, Spain
Scott Nolan Drum, Northern Arizona University, United States
© 2023 Yogev, Arnold, Nelson, Clarke, Guenette, Sporer and Koehle. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Assaf Yogev eW9nZXZhc3NAc3R1ZGVudC51YmMuY2E=