
95% of researchers rate our articles as excellent or good
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.
Find out more
SYSTEMATIC REVIEW article
Front. Physiol. , 20 March 2025
Sec. Exercise Physiology
Volume 16 - 2025 | https://doi.org/10.3389/fphys.2025.1551645
Introduction: The “no pain, no gain” philosophy has long influenced athletic training approaches, particularly in high-intensity workouts like metabolic resistance training (MRT). However, the necessity of discomfort-inducing training for optimal athletic performance remains debatable. This systematic review and meta-analysis examined whether MRT provided comparable or better results than traditional training methods in trained athletes.
Methods: A systematic search of PubMed/MEDLINE, Web of Science, Scopus, and SPORTDiscus (January 2004 - December 2024) identified RCTs comparing MRT with traditional training in athletes. Two reviewers screened studies and assessed bias risk using Cochrane RoB 2. Random - effects meta - analyses were conducted for outcomes like VO2max, peak power, sprint performance, blood lactate, time to exhaustion, and jump height. GRADE was used to evaluate evidence certainty.
Results: Eleven studies (n = 276 participants) met inclusion criteria. MRT demonstrated a statistically significant improvement in sprint performance (SMD = 1.18, 95% CI: 0.00 to 2.36, p < 0.0001) and countermovement jump height (SMD = 0.80, 95% CI: −0.04 to 1.64, p = 0.0007), indicating notable gains in explosive power. VO2max improvements were observed (SMD = 0.30, 95% CI: −0.19 to 0.79, p = 0.10) but did not reach statistical significance. Peak power output showed a moderate but non-significant positive effect (SMD = 0.54, 95% CI: −2.05 to 3.13, p = 0.55), while blood lactate changes varied widely (SMD = −1.68, 95% CI: −8.58 to 5.22, p = 0.29), reflecting high heterogeneity across studies. Time to exhaustion presented a small positive effect (SMD = 0.23, 95% CI: 0.00 to 0.46, p = 0.18), but without statistical significance. Subgroup analyses revealed that younger adults (19–25 years) and experienced athletes benefited the most from MRT, with low-frequency training (≤2 sessions/week) yielding the most favorable adaptations. Moderator analysis confirmed that sprint performance had the strongest response to MRT, while aerobic measures exhibited more variability.
Conclusion: The evidence demonstrates the capacity of MRT to enhance athletic performance comparable to or exceeding traditional training methods while requiring reduced time commitment. These findings suggest that optimal performance adaptations can be achieved through well-designed MRT protocols without necessitating excessive training volumes.
Systematic Review Registration: https://inplasy.com/inplasy-2024-11-0024, identifier: 36 INPLASY2024110024.
The “no pain, no gain” philosophy has profoundly influenced athletic training approaches, particularly in high-intensity workouts like metabolic resistance training (MRT). This training philosophy, which gained prominence through 1970s American bodybuilding culture, reflects deeper assumptions about the relationship between physical discomfort and physiological adaptation in exercise (Kuriyama, 2019). Recent investigations have begun to examine how this philosophy intersects with the contemporary understanding of exercise physiology and performance outcomes (Lee et al., 2023; Lev, 2023; Moen et al., 2016).
MRT represents a targeted approach to high-intensity interval training (HIIT) that strategically combines resistance exercises with limited rest periods to optimize both metabolic and muscular adaptations (Adeel et al., 2021; Jett and Swank, 2013; Schoenfeld, 2021). MRT uniquely integrates multiple training stimuli through compound movements performed at moderate to high intensities, contrasting with traditional resistance training that focuses primarily on mechanical tension for strength development (Schoenfeld, 2021; Contreras, 2014; Woodard, 2014). The training modality typically employs multi-joint exercises with resistance at 60%–65% of one-repetition maximum for 15–20 repetitions per set, creating significant metabolic stress while maintaining mechanical tension. The physiological basis for MRT effectiveness stems from its impact on post-exercise metabolism and hormonal responses. When properly structured, MRT can elevate post-exercise oxygen consumption for up to 38 h after training, substantially increasing total energy expenditure (Adeel et al., 2021; Jones, 2024). This extended metabolic response, combined with acute hormonal changes triggered by compound movements, creates an environment conducive to both muscle growth and fat loss (Chang et al., 2022; Da Silva et al., 2020).
Recent studies examining high-intensity resistance training protocols have demonstrated significant improvements in strength, power, and metabolic conditioning compared to traditional methods (Marín-Pagán et al., 2020). The relationship between exercise intensity and athletic performance has shown complex patterns, particularly regarding delayed onset muscle soreness (DOMS). Lev (Lev, 2023) identified that DOMS often becomes an indispensable experience actively pursued by experienced practitioners, suggesting a psychological component to training adaptation. Research by (Moro et al., 2020; Steele et al., 2019) has challenged the notion that more intense, potentially painful training necessarily leads to better overall outcomes. Their findings suggest that effective adaptations may occur without excessive discomfort when proper programming principles are applied (Lee et al., 2023). further demonstrated that adequate exercise-induced hypoalgesia responses partially contribute to improvements in exercise outcomes, suggesting broader applications for MRT beyond performance enhancement. Several studies have investigated concurrent training combining resistance training (RT) with either HIIT or moderate-intensity continuous training (MICT) (Da Silva et al., 2020). Found that RT + HIIT improved fasting glucose, insulin sensitivity, and blood lipids more effectively than RT + MICT, though both improved strengths similarly. This suggests that high-intensity approaches might offer metabolic benefits beyond traditional methods.
While existing research within the biomedical field has undoubtedly illuminated physiological adaptations to MRT, there remains limited investigation into the lived experience of practitioners (Nunes et al., 2024). Current evidence indicates that optimal performance adaptations depend more on systematic program design elements like exercise selection, progressive overload, and recovery management than on pain tolerance (Moen et al., 2016; Adeel et al., 2021; Da Silva et al., 2020; Nunes et al., 2024; Aagaard et al., 2011). This systematic review with meta-analysis challenges conventional wisdom by examining whether HIIT proves necessary for achieving optimal athlete performance outcomes. The review provides evidence-based insights into whether effective training and long-term athletic success necessarily require MRT or whether similar results can be achieved with traditional cardio training and exercises. This approach could provide evidence that effective training and long-term athletic success do not necessarily require pushing the body to painful limits, thereby promoting more sustainable training methods.
The systematic review protocol was registered with the International Platform of Registered Systematic Review and Meta-Analysis Protocols (INPLASY) (registration number: INPLASY2024110024) on 6 November 2024. The protocol provides detailed methodology including a search strategy, eligibility criteria, outcome measures, and statistical analysis plan for comparing MRT versus traditional cardio approaches for athletic performance outcomes. The complete protocol is available at https://inplasy.com/inplasy-2024-11-0024/(Barker et al., 2021).
A systematic literature search was conducted across four major electronic databases: PubMed/MEDLINE, Web of Science, Scopus, and SPORTDiscus. The search strategy incorporated controlled vocabulary (MeSH terms) and free-text terms, organized around four key concept areas: population (athletes), intervention (metabolic resistance training), comparison (traditional cardio), and outcomes (performance measures). Population concept search terms included variations of “trained athlete,” “resistance trained,” and “athletic population”. The intervention concept encompassed terms such as “metabolic resistance training,” “MRT,” “high-intensity training,” and “resistance exercise”. The comparison concept included terms like “traditional cardio,” “aerobic exercise,” and “moderate-intensity continuous training”. Outcome measures were captured using terms such as “performance,” “strength,” “power output,” “VO2max,” and “body composition”. Each concept was expanded using appropriate synonyms, truncation, and Boolean operators (AND, OR and NOT) to ensure comprehensive coverage.
Prior to implementing the final search strategy (Table A1), a comprehensive initial search across all selected databases without language restrictions was performed to assess the potential impact of language limitation on study selection. This initial search revealed that over 99% of potentially eligible studies were published in English. PubMed/MEDLINE and Scopus searches yielded no non-English studies meeting the search criteria. Web of Science and SPORTDiscus each identified one Italian and Spanish language article, respectively, but upon detailed screening, these papers did not focus on MRT and failed to meet other inclusion criteria. Based on this empirical evidence that all relevant studies investigating MRT in trained athletes were published in English, the final search strategy proceeded with English-language restriction. This methodological decision ensured systematic consistency while maintaining the comprehensiveness of the review, as no eligible studies were excluded based on language criteria. The complete record of both final search strategies for each database is provided in Table A1, ensuring transparency and reproducibility of the search process.
Studies were included when they met the following criteria: 1.Study design: a) Randomized controlled trials (RCTs); b) Controlled clinical trials (CCTs); c) Cross-over trials.2.Language and publication period: a) Published in English; b) Published between 2004 and 2024. This timeframe was selected to capture contemporary training approaches while ensuring a comprehensive analysis of the evolution of MRT.3.Study population: a) Healthy, trained athletes and exercise practitioners (e.g., coaches, trainers) with at least 3 months of training experience; b) Aged above 16 years to ensure population homogeneity in training adaptations (Higgins et al., 2024; Papakostidis and Giannoudis, 2023).4.Intervention focus: a) Interventions specifically targeting MRT; b) Comparison with traditional cardio training approaches.5.Outcome measures: a) Studies reporting quantifiable athletic performance outcomes (Papakostidis and Giannoudis, 2023; Cho and Shin, 2021; Lee, 2019) including: i) Maximal strength; ii) Power output; iii) Cardiovascular fitness; iv) Body composition changes.6.Control group: a) Control groups receiving no intervention beyond regular training were considered valid comparators.
Studies were excluded when they met the following criteria: 1. Athlete condition: a) Athletes and exercise practitioners presenting with acute or chronic injuries to eliminate confounding factors in training adaptations. 2. Study design: a) Studies lacking control groups to maintain methodological rigor in intervention assessment; b) Studies combining multiple training protocols within intervention groups to preserve intervention specificity. 3. Population: a) Research focusing on recreational athletes rather than athletes and exercise practitioners to ensure target population homogeneity (Barker et al., 2021; Lensen, 2023). 4. Outcome measures: a) Studies lacking quantitative performance measures; b) Studies without complete statistical data for effect size calculation. 5. Methodological quality: a) Studies with insufficient methodological details; b) Studies with incomplete outcome reporting; c) Studies with unclear or inadequate statistical analyses to maintain analytical rigor (Papakostidis and Giannoudis, 2023; Cho and Shin, 2021; Lee, 2019). 6. Other exclusions: a) Non-human subject research as physiological responses in animals may not translate to humans (Papakostidis and Giannoudis, 2023; Cho and Shin, 2021; Lee, 2019); b) Non-English language publications following the initial search which revealed that 99% of eligible studies were published in English.
This systematic review followed PRISMA 2020 guidelines (Page et al., 2021), implementing a comprehensive selection strategy(Figure 1). The selection process utilized two specialized software tools: Zotero Reference Management Software (Version 6.0) for citation management and duplicate removal, and ASReview Lab (Version 1.0) automation tools for preliminary screening. The systematic search process incorporated multiple stages of evaluation. Following the initial database identification, all retrieved citations underwent deduplication using Zotero’s built-in detection algorithm, combining both automatic and manual verification processes. Duplicate identification relied on matching titles, authors, and publication years. ASReview Lab automation tools facilitated the preliminary screening phase, employing predefined algorithms to analyze titles and abstracts. Following the automated screening, two independent reviewers conducted manual screening of the remaining records against predetermined inclusion and exclusion criteria using standardized assessment forms. The assessment forms were designed to examine methodological quality, intervention protocols, and outcome measurements.
The full-text assessment followed a structured protocol, with reviewers documenting specific rationales for exclusion decisions. The process maintained methodological rigor through regular calibration meetings between reviewers to ensure consistency in decision-making and assessment standards. When disagreements arose between reviewers, resolution occurred through structured discussion using a predetermined resolution matrix based on the inclusion and exclusion criteria. A third reviewer was available for arbitration when necessary. This systematic approach ensured transparent documentation of the selection process while maintaining strict methodological standards. The selection process prioritized studies featuring MRT as the primary intervention focus, meeting predetermined quality criteria for inclusion in the final analysis.
The risk of bias was assessed using the Cochrane Risk of Bias 2.0 tool (RoB 2), which evaluates five domains: randomization process, deviations from intended interventions, missing outcome data, measurement of outcomes, and selection of reported results (Higgins et al., 2011; Sterne et al., 2019; Moseley et al., 2019). Each domain was rated as “low risk,” “some concerns,” or “high risk” of bias. Two independent reviewers completed the assessments, with a third reviewer consulted to resolve any discrepancies. For crossover trials, the additional domain of period and carryover effects was evaluated. The methodological quality of the included studies was systematically evaluated using three complementary assessment tools. The PEDro scale was applied to assess the methodological quality of each study across eleven distinct criteria, including random allocation, baseline comparability, blinding procedures, and adequate follow-up. Two independent reviewers scored each study, with disagreements resolved through consensus discussion (Moseley et al., 2019; Cashin and McAuley, 2020; de Morton, 2009). A study could achieve a maximum score of 10 points, as the first criterion (eligibility criteria) affects external validity and is not included in the total score. Publication bias was examined through funnel plot analysis using R Studio (version 4.2.1). The standard error of the intervention effect estimate was plotted against the effect estimate for each study, with asymmetry in the resulting plot potentially indicating publication bias (Harrer et al., 2021; Debray et al., 2018; Song et al., 2002). The analysis was conducted separately for each primary outcome measure where sufficient studies were available (minimum of 3 studies). Egger’s test was performed to provide a statistical assessment of funnel plot asymmetry, with a p-value < 0.05 considered indicative of significant publication bias (Harrer et al., 2021; Song et al., 2002; Armijo-Olivo et al., 2015).
The data extraction process involved systematic collection of bibliographic details and methodological characteristics including study design, sport type, and sample demographics. The extracted participant information encompassed sample sizes, age distributions, and training experience levels. Intervention details documented included training protocols, exercise intensity parameters (e.g., %VO2max, RPE ratings), session frequency, intervention duration, and specific training parameters such as work-to-rest ratios and exercise modalities. Performance outcomes underwent systematic extraction and standardization for comparison. Primary outcomes targeted for extraction included: VO2max (measured in mL/kg/min), Peak power output (standardized to W/kg), sprint performance (converted to m/s), blood lactate measurements (mmol/L), time to exhaustion (in seconds) and countermovement jump height (in cm). Secondary measures included program adherence rates and psychological responses where available. Quality indicators and methodological details underwent extraction to assess the risk of bias and study rigor. The extraction process emphasized standardization of outcome measures across studies through careful documentation of measurement protocols and units to enable valid cross-study comparisons. The extraction protocol excluded outcomes appearing in fewer than two studies or lacking standardized reporting methods for meaningful comparison. This methodological approach ensured focus on robust and comparable data points across studies, enhancing reliability. The standardization protocols enabled accurate cross-study comparisons while maintaining methodological rigor in the analysis of both aerobic and anaerobic adaptations associated with different training protocols.
Statistical analyses were conducted using two primary software platforms: Cochrane Review Manager (RevMan Web) for effect size calculations and certainty of evidence and RStudio for publication bias assessment and moderator analyses. A random-effects model using the Restricted Maximum-Likelihood (REML) method was employed for estimating between-study variance (τ2), as this approach provides more accurate estimates than traditional methods when dealing with high heterogeneity and smaller study numbers (Bakbergenuly et al., 2020; Seide et al., 2019). The Hartung-Knapp-Sidik-Jonkman (HKSJ) method was utilized for calculating confidence intervals, providing more conservative and reliable estimates by accounting for uncertainty in variance estimation (Seide et al., 2019; Röver et al., 2015; Weber et al., 2021). The random-effects model was selected due to the inherent biological and methodological variability across training studies and diverse athletic populations and training backgrounds represented in the included studies (Dettori et al., 2022; Zhai and Guyatt, 2024).
Standardized mean differences (SMD) with 95% confidence intervals were calculated for continuous outcomes including VO2max, peak power output, sprint performance, blood lactate measurements, time to exhaustion, and countermovement jump height. Effect sizes were interpreted following established guidelines: small (0.2–0.6), moderate (0.6–1.2), large (1.2–2.0), and very large (>2.0) (Hopkins et al., 2009; Swinton et al., 2023). Heterogeneity assessment involved multiple metrics: Tau2 (τ2) with 95% confidence intervals estimated between-study variance, while the Chi2 test assessed whether observed differences in results were compatible with chance alone. The I2 statistic quantified the proportion of observed variance reflecting real differences in effect size. Heterogeneity magnitude was classified as low (I2 ≤ 25%), moderate (25% > I2 < 75%), or high (I2 ≥ 75%) (Kang et al., 2025; von Hippel, 2015).
Subgroup analyses explored the influence of training experience, program duration, and exercise intensity levels, while meta-regression analyses investigated potential moderating effects. For subgroup analyses, the minimum thresholds of 6 studies per subgroup were established to ensure reliable comparisons. A minimum of 6 studies per subgroup are recommended due to statistical power considerations and recommendations from the Cochrane Handbook for Systematic Reviews (Higgins et al., 2024). This threshold balanced the need for meaningful comparisons with the practical limitations of available data.
Publication bias evaluation occurred through visual inspection of funnel plots examining the relationship between study precision and effect size. Asymmetry in funnel plots indicated potential publication bias or systematic heterogeneity (Harrer et al., 2021; Song et al., 2002; Armijo-Olivo et al., 2015). The Cochrane Collaboration’s Risk of Bias 2 (ROB 2) tool was employed to evaluate trial design, conduct, and reporting reliability across seven domains: random sequence generation, allocation concealment, blinding of participants and personnel, blinding of outcome assessment, incomplete outcome data, selective reporting, and other bias sources (Higgins et al., 2024; Higgins et al., 2011; Sterne et al., 2019).
The certainty of evidence for each outcome underwent assessment using the Grading of Recommendations Assessment, Development and Evaluation (GRADE) approach (Prasad, 2024; Neumann et al., 2013; Ansari et al., 2009). The analysis was performed using GRADEpro GDT software, with outcome data imported directly from RevMan Web through the Summary of Findings functionality (Cochrane Training, 2023). Five key domains for each outcome were evaluated, namely,: risk of bias, inconsistency, indirectness, imprecision, and publication bias (Prasad, 2024; Neumann et al., 2013; Ansari et al., 2009; Cochrane Training, 2023). The certainty of evidence was categorized as high, moderate, low, or very low, reflecting our confidence in the effect estimates (Prasad, 2024; Xun et al., 2023; Pandis et al., 2015). Two reviewers independently assessed the certainty of evidence for each outcome, with disagreements resolved through discussion.
The systematic database search initially identified 332 records across four major electronic databases: PubMed/MEDLINE (n = 55), Scopus (n = 69), Web of Science (n = 72), and SPORTDiscus (n = 136). Following the import of all citations into Zotero Reference Management Software (Version 6.0), duplicate detection and removal processes identified and eliminated 95 duplicate records through both automated and manual verification methods. The preliminary screening phase, utilizing ASReview Lab automation tools with predefined algorithmic parameters, excluded 196 records based on title and abstract analysis. This automated screening was followed by manual verification of the remaining 41 records by two independent reviewers. During this phase, eight studies were excluded due to non-athlete participants, leaving 33 articles eligible for full-text assessment. The full-text review phase resulted in the exclusion of 22 additional studies for the following reasons: 13 studies did not exclusively assess HIIT; 4 studies lacked appropriate control groups; 1 study included intervention groups with multiple protocols; 1 study involved non-trained athletes; 1 study included participants below the age threshold; while 2 studies failed to provide sufficient raw data for analysis. This rigorous selection process yielded 11 studies that met all eligibility criteria and were included in the final systematic review and meta-analysis. These studies demonstrated appropriate methodological quality and provided complete data for quantitative synthesis. The included studies represented research conducted between 2004 and 2024, encompassing various aspects of MRT in athletic populations. The complete study selection process is illustrated in the PRISMA flow diagram (Figure 1), which provides transparent documentation of the identification, screening, eligibility assessment, and final inclusion stages. This systematic approach ensures reproducibility and methodological rigor in the study selection process.
This systematic review and meta-analysis synthesized data from 11 studies (Androulakis-Korakakis et al., 2018; Devereux et al., 2022; Fiorenza et al., 2019; Gantois et al., 2019; Kelly et al., 2021; Kon et al., 2019; Liu et al., 2024; Mallol et al., 2019; Talsnes et al., 2022a; Talsnes et al., 2022b; Wen et al., 2024) (10 RCTs, 1 randomized crossover) examining MRT in trained athletes (Table 1). The studies encompassed diverse athletic populations including cyclists and triathletes (Fiorenza et al., 2019; Mallol et al., 2019), powerlifting (Androulakis-Korakakis et al., 2018), soccer players (Liu et al., 2024; Wen et al., 2024), basketball players (Gantois et al., 2019), cross-country skiers (Talsnes et al., 2022a; Talsnes et al., 2022b), and mixed-sport athletes (Devereux et al., 2022) while (Kelly et al., 2021) and (Kon et al., 2019) involved Gaelic football players and canoe athletes, respectively.
Participant characteristics demonstrated substantial homogeneity in training experience, with most studies requiring a minimum of 2–3 years of sport-specific training (Table 1). Some studies included elite athletes with extensive training backgrounds, such as national-level juniors training over 500 h annually (Talsnes et al., 2022a). Sample sizes ranged from 11 participants (Fiorenza et al., 2019) to 51 participants (Talsnes et al., 2022b), with a total of 276 participants across all studies. Mean participant ages spanned from 17.1 years (Wen et al., 2024) to 42.9 years (Mallol et al., 2019), with the majority of participants (78%) being male athletes.
Intervention protocols showed (Table 2) considerable variation in duration and frequency. Study lengths ranged from single sessions (Fiorenza et al., 2019) to 9 weeks (Talsnes et al., 2022a), with most studies implementing 3–8 weeks protocols. Training frequency typically involved 2–3 sessions per week, though some studies, particularly those with elite athletes, incorporated up to 8 sessions weekly (Kelly et al., 2021; Talsnes et al., 2022a). Exercise intensities were consistently prescribed at 85%–95% of maximum capacity, measured via heart rate (Devereux et al., 2022), VO2max (Mallol et al., 2019), or sport-specific metrics. Several studies employed “all-out” or maximal effort protocols (Fiorenza et al., 2019; Kon et al., 2019). Outcome measures were standardized across studies, with VO2max reported in mL/kg/min [8 studies - (Androulakis-Korakakis et al., 2018; Devereux et al., 2022; Gantois et al., 2019; Kelly et al., 2021; Kon et al., 2019; Mallol et al., 2019; Talsnes et al., 2022a; Talsnes et al., 2022b)], peak power output in W/kg [5 studies (Devereux et al., 2022; Fiorenza et al., 2019; Kelly et al., 2021; Kon et al., 2019; Mallol et al., 2019)], and sprint performance in m/s [3 studies: (Gantois et al., 2019; Liu et al., 2024; Wen et al., 2024)]. The research designs varied in their comparison groups, with some studies comparing different high-intensity protocols while others utilized traditional training control conditions.
The risk of bias assessment revealed key patterns across methodological quality domains. The randomization process demonstrated a moderate level of concern, with nearly two-thirds of studies showing some methodological limitations in their randomization procedures (Figure 2; Table 3). However, no studies exhibited high risk in this domain, suggesting that while randomization methods could be improved, they generally maintained basic methodological integrity. The strongest methodological quality emerged in period/carryover effects and missing outcome data domains, where over 90% of studies demonstrated a low risk of bias. This indicates robust handling of temporal effects in longitudinal measurements and strong participant retention and data collection practices across the included studies. Measurement of outcomes emerged as the most significant methodological challenge, with all studies showing either some concerns (81.8%) or high risk (18.2%) (Figure 2; Table 3). This systematic pattern suggests inherent challenges in maintaining measurement quality across MRT interventions, particularly in blinding outcome assessors and standardizing measurement procedures. Deviations from intended interventions and selection of reported results showed similar patterns, with approximately three-quarters of studies demonstrating low risk and the remainder showing some concerns or high risk. This indicates generally good adherence to planned protocols and pre-specified analyses, though with room for improvement in intervention fidelity and reporting practices (Higgins et al., 2024; Higgins et al., 2011; Sterne et al., 2019). The overall bias assessment revealed that while no studies achieved low-risk status across all domains, the majority (72.7%) maintained acceptable methodological quality with some concerns, while 27.3% showed a high risk of bias. Figure 2 presents the complete risk of bias assessment across all domains.
The PEDro scale (Table 4) assessment yielded a mean score of 7.36 (range: 7–9) across all studies. Most studies (72.7%, 8/11) achieved a score of 7, while two studies (18.2%) scored 9/10. Analysis of specific criteria showed high compliance (81.8%, 9/11 studies) with concealed allocation and adequate follow-up. The main methodological limitations were in the blinding of participants and therapists, with no studies achieving these criteria. Two studies (Liu et al., 2024; Wen et al., 2024) demonstrated the highest methodological quality with PEDro scores of 9/10. The coefficient of variation in quality scores was 11.4%, indicating consistent methodological standards across studies.
Publication bias assessment through funnel plot analysis and Egger’s test revealed distinct patterns across outcomes (Figure 3). The primary outcome, VO2max (n = 8 studies), demonstrated symmetrical distribution (Egger’s test p = 0.72) with standardized mean differences ranging from −0.5 to 1.5. Peak power output analysis (n = 6) showed asymmetrical distribution (Egger’s test p = 0.03) with effect size clustering between −0.8 and 0.2 SMD, while sprint performance and countermovement jump analyses (n = 3 each) exhibited positive effect distributions (p = 0.45 and p = 0.38 respectively). Blood lactate measurements (n = 3) displayed varied precision (SE range: 0.2–0.8) and effect sizes (−0.9 to 0.4 SMD), and time-to-exhaustion analysis (n = 5) demonstrated symmetric distribution around the central axis (p = 0.56). Contour-enhanced funnel plots indicated no evidence of small-study effects for the primary outcomes (Harrer et al., 2021; Song et al., 2002; Armijo-Olivo et al., 2015).
Figure 3. Funnel plots (A): symmetrical distribution around mean effect size (−0.5–1.5) suggests minimal publication bias. Even scatter across precision levels strengthens confidence in findings. (B): clustering on the negative effect side with one notable outlier (Fiorenza et al., 2019). Asymmetry may reflect methodological differences rather than bias. (C): consistent positive effects but limited precision spread. Pattern likely reflects standardized testing protocols rather than selective reporting. (D): wide variance in both precision and effect directions indicates protocol-dependent responses rather than publication bias. (E): relatively symmetric distribution with studies clustered at similar precision levels. Suggests balanced reporting of outcomes. (F): Consistent positive effects with similar precision levels across studies indicate reliable measurement protocols rather than selective reporting.
The meta-analysis examining MRT’s effect on VO2max (Figure 4) included eight studies with 203 participants (104 experimental, 99 control). The overall standardized mean difference showed a small positive effect (SMD = 0.30, 95% CI: −0.19–0.79), though not reaching statistical significance (Z = 1.65, p = 0.10). The analysis demonstrated moderate heterogeneity (I2 = 33%, τ2 = 0.08), indicating relatively consistent findings across studies. Individual effects ranged from SMD = −0.58 (Androulakis-Korakakis et al., 2018) to SMD = 1.58 (Kon et al., 2019), with larger studies (Talsnes et al., 2022a; Talsnes et al., 2022b) showing small positive effects with narrower confidence intervals. The 95% prediction interval (−0.54–1.14) reflects the range within which the true effects of future MRT interventions on VO2max are expected to fall, accounting for both within- and between-study variance.
The analysis of peak power (Figure 5) output encompassed five studies with 97 total participants (49 experimental, 48 control). The standardized mean difference revealed a moderate, non-significant positive effect (SMD = 0.54, 95% CI: −2.05 to 3.13, p = 0.55). The analysis exhibited high heterogeneity (I2 = 93%, τ2 = 3.77), with effect sizes ranging markedly across studies (Fiorenza et al., 2019). Demonstrated the largest positive effect (SMD = 4.30, 95% CI: 2.66–5.93), while (Kon et al., 2019) reported a significant negative effect (SMD = −1.35, 95% CI: −2.40 to −0.30). The wide prediction interval (−5.44–6.52) and substantial between-study variance (τ2 = 3.77, 95% CI: 1.08–37.55) indicate high variability in the observed effects across different training protocols and populations.
The sprint performance analysis (Figure 6) incorporated three studies with 79 participants(40 experimental, 39 control)demonstrating a significant large positive effect (SMD = 1.18, 95% CI: 0.00 to 2.36, p < 0.0001). All included studies (Gantois et al., 2019; Liu et al., 2024; Wen et al., 2024) showed positive standardized mean differences, with (Liu et al., 2024) reporting the largest effect (SMD = 1.65, 95% CI: 0.81–2.50). The analysis exhibited low heterogeneity (I2 = 27%, τ2 = 0.07), indicating consistency in the observed effects across studies. The between-study variance remained minimal (τ2 = 0.07, 95% CI: 0.00–7.63), with a 95% prediction interval of −0.46 to 2.82 for the distribution of true effects.
The analysis of countermovement jump (Figure 7) performance included three studies with 79 participants (40 experimental, 39 control), yielding a statistically significant moderate effect (SMD = 0.80, 95% CI: −0.04 to 1.64, p = 0.0007) (Liu et al., 2024). Demonstrated the largest effect (SMD = 1.18, 95% CI: 0.39–1.96), while (Wen et al., 2024) and (Gantois et al., 2019) showed comparable moderate effects. The analysis revealed zero heterogeneity (I2 = 0%, τ2 = 0.00), with the between-study variance estimated at τ2 = 0.00 (95% CI: 0.00–4.25). The prediction interval matched the confidence interval (−0.04–1.64), reflecting the high consistency of effects across the included studies.
The blood lactate analysis (Figure 8) encompassed three studies with 84 participants (44 experimental, 40 control), revealing a non-significant negative effect (SMD = −1.68, 95% CI: −8.58 to 5.22, p = 0.29). Individual study effects varied substantially, with (Fiorenza et al., 2019) showing a large negative effect (SMD = −5.01, 95% CI: −6.85 to −3.18) and (Talsnes et al., 2022b) demonstrating a small positive effect (SMD = 0.18, 95% CI: −0.41–0.78). The analysis exhibited high heterogeneity (I2 = 97%, τ2 = 7.10), with a wide prediction interval (−15.06 to 11.70) and substantial between-study variance (τ2 = 7.10, 95% CI: 1.54 to >100).
The time to exhaustion analysis (Figure 9) included four studies with 136 participants (70 experimental, 66 control), showing a small positive effect (SMD = 0.23, 95% CI: 0.00 to 0.46, p = 0.18) (Talsnes et al., 2022b). Reported the largest effect (SMD = 0.41, 95% CI: −0.19–1.01), while (Kelly et al., 2021) showed the smallest effect (SMD = 0.13, 95% CI: −0.66–0.91). The analysis demonstrated zero heterogeneity (I2 = 0%, τ2 = 0.00, 95% CI: 0.00–0.13). The prediction interval matched the confidence interval (0.00–0.46), indicating consistent effects across studies despite varying baseline performance levels.
A comprehensive subgroup analysis focused exclusively on VO2max measurements, as this outcome provided sufficient data points across studies for meaningful comparative analysis. The analysis examined six key categorical variables: training experience [novice/moderately trained (≤2 years), experienced (2–5 years), well-trained/elite (>5 years or national level)]; exercise intensity [submaximal (80%–90% maximum), near-maximal (90%–95%), maximal/all-out (≥95%)]; intervention duration [short-term (<4 weeks), medium-term (4–6 weeks), long-term (>6 weeks)]; training frequency [low (≤2 sessions/week], moderate (3–5), high (>5)]; age [youth/adolescent (<18 years), young adult (19–25), adult (>25)]; and gender composition (male-only, female-only, mixed-gender). For subgroup analyses, the minimum thresholds of 6 studies per subgroup were established to ensure reliable comparisons. A minimum of 6 studies per subgroup are recommended due to statistical power considerations and recommendations from the Cochrane Handbook for Systematic Reviews (Higgins et al., 2024). This threshold balanced the need for meaningful comparisons with the practical limitations of available data. The focus on VO2max for subgroup analysis reflected its position as the most consistently reported outcome measure across studies, providing sufficient statistical power for meaningful comparisons. Other outcome measures, while important, lacked adequate representation across subgroups to support reliable comparative analysis.
The subgroup analysis of VO2max adaptations across age groups reveals distinct patterns. Young adults (19–25 years) showed moderate heterogeneity with a non-significant trend toward improvement (SMD = 0.33, 95% CI: −0.74–1.40), characterized by substantial variability between individual studies. The youth/adolescent subgroup (under 18) demonstrated zero heterogeneity and minimal effect (SMD = −0.04, 95% CI: −1.80 to 1.72), suggesting limited VO2max adaptations. The adult subgroup was limited to a single study with a small negative effect. Statistical analysis indicated no significant moderation by age (Chi2 = 0.82, p = 0.66, I2 = 41%), highlighting the complexity of age-related responses to MRT (Table 5). The results emphasize the need for individualized training approaches that consider age-specific physiological characteristics.
The gender subgroup analysis of VO2max adaptations to MRT reveals significant limitations in the available research. Male-only studies demonstrated a moderate heterogeneity (I2 = 73%) with a standardized mean difference of 0.44 (95% CI: −1.08–1.97), indicating substantial variability in response and a lack of consistent effect. The mixed-gender studies showed remarkable consistency (I2 = 0%) with a minimal and statistically non-significant effect (SMD = −0.06, 95% CI: −0.29 to 0.17) (Table 5). Critically, the absence of female-only studies represents a substantial methodological gap in the current literature. This omission severely constrains a comprehensive understanding of gender-specific physiological adaptations to MRT. The statistical analysis further underscores the complexity of gender-based adaptations. The test for subgroup differences (Chi2 = 1.00, df = 1, p = 0.32) and zero heterogeneity in mixed-gender studies indicate that gender may not significantly moderate VO2max responses. However, the wide confidence intervals and limited study diversity necessitate extreme caution in drawing definitive conclusions.
The subgroup analysis of training experience reveals nuanced VO2max adaptations across different athlete proficiency levels. Well-trained/elite athletes exhibited high heterogeneity (I2 = 77%) with a minimal SMD of 0.18 (95% CI: −1.20–1.56), indicating inconsistent responses. Experienced athletes showed low variability (I2 = 26%) and a comparable effect size of 0.22 (95% CI: −1.19–1.62). The single novice/moderately trained study reported a small negative effect of −0.11 (95% CI: −1.09 to 0.87). Statistical analysis suggests no significant moderation by training experience (Chi2 = 0.40, p = 0.85), with substantial variability primarily observed in well-trained cohorts (Table 5). The divergent responses underscore the complexity of physiological adaptations across training backgrounds, challenging uniform training prescription assumptions.
The exercise intensity subgroup analysis reveals distinctive patterns of VO2max adaptations across different training protocols. Maximal/all-out effort protocols exhibited substantial heterogeneity (I2 = 72%) with a modest standardized mean difference of 0.29 (95% CI: −0.77–1.35), indicating significant variability in physiological responses. Near-maximal intensity interventions demonstrated complete consistency (I2 = 0%) with a marginal negative effect (−0.11), suggesting minimal physiological adaptation potential. The single submaximal high-intensity study presented a neutral effect (SMD = 0.11, 95% CI: −0.49–0.70), offering insufficient evidence for comprehensive conclusions. Statistical analysis suggests that exercise intensity does not significantly moderate (Table 5). VO2max adaptations, as indicated by the non-significant test for subgroup differences. These findings underscore the complexity of MRT’s impact on cardiorespiratory fitness, highlighting the need for more comprehensive research exploring intensity-specific physiological mechanisms.
The temporal dynamics of MRT present a critical investigative frontier in understanding physiological adaptations, with training duration emerging as a potential key modulator of VO2max responses. Medium-term interventions (4–6 weeks) demonstrated a slight positive SMD of 0.22 with low heterogeneity (I2 = 26%), suggesting potential consistency in physiological adaptations during this timeframe. The statistical parameters indicate modest variability in VO2max responses, potentially reflecting an optimal training adaptation window. Long-term interventions (>6 weeks) revealed a marginal negative standardized mean difference of −0.12 with zero heterogeneity, indicating uniform minimal decline across studies. This unexpected result challenges conventional assumptions about prolonged training adaptations, suggesting potential physiological accommodation or diminishing returns in extended MRT protocols.
Training frequency represents a fundamental variable in exercise prescription, with potentially significant implications for cardiorespiratory adaptation mechanisms and performance optimization. Low-frequency interventions (≤2 sessions/week) exhibited a standardized mean difference of 0.33 with moderate heterogeneity (I2 = 65%), suggesting variable but potentially positive VO2max adaptations. The statistical parameters indicate significant inter-study variability, highlighting the complexity of training frequency responses. High-frequency interventions (>5 sessions/week) showed a minimal standardized mean difference of −0.04 with zero heterogeneity, representing consistent near-neutral effects. This uniform response across studies suggests that increased training frequency may not necessarily translate to enhanced VO2max adaptations, challenging prevalent assumptions about training volume and physiological improvement.
The moderator analysis reveals significant differences in how MRT affects various performance outcomes (QM = 14.2749, p = 0.0267) (Table 6). Sprint performance showed the strongest positive effect (estimate = 1.21, p = 0.026), indicating it benefits most consistently from this training approach. CMJ and peak power demonstrated moderate positive effects (estimates of 0.90 and 0.87 respectively), though these fell just short of statistical significance (p = 0.096 and p = 0.051). Blood lactate showed a negative effect (estimate = −0.71, p = 0.221), suggesting possible improvements in metabolic efficiency, though this was not statistically significant. VO2max and time to exhaustion displayed the smallest effects (estimates of 0.33 and 0.23 respectively), with neither reaching statistical significance.
The significant residual heterogeneity (QE = 66.92, p < 0.0001) indicates that other factors beyond the measured outcomes influence training effectiveness. This suggests that individual responses to MRT likely depend on various unmeasured factors such as training experience, protocol design, or individual physiological characteristics. The analysis demonstrates that MRT’s effectiveness varies considerably across different performance parameters, with the strongest benefits observed in explosive performance measures like sprinting.
The GRADE assessment reveals distinct hierarchical patterns in evidence certainty across different performance outcomes. The high-certainty evidence for VO2max and countermovement jump performance demonstrates robust methodological quality through standardized protocols, precise estimates, and narrow confidence intervals (Table 7). These outcomes exhibited consistent effect directions and adequate sample sizes, establishing a reliable foundation for understanding training adaptations (Prasad, 2024; Neumann et al., 2013; Pandis et al., 2015). Sprint performance and time to exhaustion achieved moderate certainty ratings due to methodological limitations including heterogeneity (I2 = 40–75%) and wider confidence intervals. This moderate rating reflects balanced strengths and constraints in the evidence base, with consistent positive effects tempered by methodological variations across studies. Peak power output and blood lactate responses received very low certainty ratings due to substantial heterogeneity (I2 > 75%), imprecise estimates and confidence intervals spanning null effects. These limitations stem from systematic methodological variations rather than random error, indicating fundamental challenges in measurement standardization across studies (Thorlund et al., 2012; Zhao, 2013).
The assessment demonstrates clear temporal and methodological patterns, with longer-duration studies and larger sample sizes typically achieving higher certainty ratings, particularly for outcomes utilizing standardized measurement protocols. This relationship between study characteristics and evidence certainty provides essential context for interpreting effect sizes (Thorlund et al., 2012; Zhao, 2013). The systematic application of GRADE criteria exposes significant variations in evidence quality across different performance measures. Outcomes with standardized measurement protocols and clear reporting consistently achieved higher certainty ratings, independent of effect size magnitude (Al Duhailib et al., 2024; Brozek et al., 2021; Hultcrantz et al., 2020). This pattern establishes a clear hierarchy of evidence reliability while highlighting specific areas requiring methodological refinement. The varied certainty levels across outcomes create a nuanced framework for evaluating the effectiveness of MRT interventions. High-certainty evidence provides robust support for specific adaptations, while lower-certainty evidence identifies areas requiring additional methodological standardization and investigation.
The meta-analysis revealed substantial heterogeneity in blood lactate responses (I2 = 97%) and moderate heterogeneity in sprint performance (I2 = 63%). The significant variation in blood lactate measurements appears linked to protocol-specific factors, supported by the GRADE assessment which indicated very low certainty of evidence due to inconsistency and imprecision (Table 7). Studies varied significantly in their blood lactate sampling protocols, with (Fiorenza et al., 2019) showing a large negative effect (SMD = −5.01, 95% CI: −6.85 to −3.18) and (Talsnes et al., 2022b) demonstrating a small positive effect (SMD = 0.18, 95% CI: −0.41–0.78).
The heterogeneity in sprint performance, though moderate, demonstrated more consistent patterns. The subgroup analysis of training frequency revealed that low-frequency interventions (≤2 sessions/week) exhibited a standardized mean difference of 0.33 with moderate heterogeneity (I2 = 65%), while high-frequency training (>5 sessions/week) showed minimal effects with zero heterogeneity (Table 5). Training duration also impacted variability, with medium-term programs (4–6 weeks) showing moderate benefits (SMD = 0.22) and substantial heterogeneity (I2 = 52%), while long-term interventions demonstrated consistent but slightly negative effects (SMD = −0.12, I2 = 0%). Exercise intensity emerged as another significant moderator, with maximal/all-out protocols exhibiting substantial heterogeneity (I2 = 72%) compared to near-maximal intensity interventions showing complete consistency (I2 = 0%). Training experience further explained the variation pattern, with experienced athletes showing low variability (I2 = 26%) compared to well-trained/elite athletes (I2 = 77%). The GRADE assessment assigned moderate certainty to sprint performance outcomes, acknowledging this systematic variation while confirming the reliability of the overall effect.
This meta-analysis provides substantial evidence regarding the effectiveness of MRT across multiple performance domains in trained athletes. The analysis revealed distinct patterns of adaptation across different performance measures, with varying levels of evidence certainty. The most robust finding emerged in countermovement jump performance, where MRT demonstrated a significant positive effect (SMD = 0.80, 95% CI: −0.04 to 1.64, p = 0.0007) with zero heterogeneity (I2 = 0%). This consistency across studies suggests a reliable enhancement in explosive power capabilities through MRT protocols. Sprint performance similarly showed meaningful improvements (SMD = 1.18, 95% CI: 0.00 to 2.36, p < 0.0001) with low heterogeneity (I2 = 27%), indicating that MRT effectively enhances speed-related performance measures.
Regarding cardiorespiratory fitness, the analysis revealed a small positive effect on maximal oxygen consumption (SMD = 0.30, 95% CI: −0.19–0.79), though this did not reach statistical significance (p = 0.10). The moderate heterogeneity in these findings (I2 = 33%) suggests some variability in aerobic adaptations across different athletic populations. Time to exhaustion demonstrated a small positive effect (SMD = 0.23, 95% CI: 0.00 to 0.46, p = 0.18) with notably consistent responses across studies (I2 = 0%). Peak power output showed a moderate positive effect (SMD = 0.54, 95% CI: −2.05–3.13) but with substantial heterogeneity (I2 = 93%), indicating considerable variability in power adaptations across different protocols and populations. Blood lactate responses demonstrated the most variable outcomes (I2 = 97%) with a negative effect (SMD = −1.68, 95% CI: −8.58 to 5.22, p = 0.29), suggesting complex metabolic adaptations that may be highly protocol-dependent.
Subgroup analyses revealed optimal adaptations in younger adults (19–25 years) and experienced athletes, particularly with lower training frequencies (≤2 sessions/week). These findings challenge traditional assumptions about training volume requirements and suggest that well-designed MRT protocols can achieve significant performance improvements without excessive training frequencies. The GRADE assessment of evidence certainty provides crucial context for these findings. High-certainty evidence supports the improvements in countermovement jump performance, while sprint performance showed moderate-certainty evidence. Peak power output and blood lactate responses demonstrated very low certainty, indicating areas requiring additional methodological rigor in future research.
These results collectively indicate that MRT can effectively enhance multiple aspects of athletic performance while requiring relatively modest training volumes. The consistency of positive adaptations across several performance measures, particularly in explosive power and sprint capabilities, suggests that MRT offers a time-efficient approach to athletic development, though the magnitude and reliability of these effects vary across different performance domains.
The current meta-analysis findings demonstrate both convergence and divergence with previous research examining MRT and HIIT adaptations. The analysis of cardiorespiratory fitness (VO2max) reveals noteworthy comparisons with existing literature. While the present analysis found a small positive effect on VO2max (SMD = 0.30, 95% CI: −0.19–0.79), this aligns with findings from (Weston et al., 2014) who reported moderate improvements in VO2max following low-volume HIIT interventions in active non-athletic (SMD = 0.69) and sedentary (SMD = 0.94) populations. The findings also align with findings reported by Milanović et al. (2015), who identified modest improvements in VO2max through HIIT interventions. However, where (Milanović et al., 2015) reported primarily mean differences, the current analysis using standardized mean differences provides a more nuanced understanding of effect magnitude relative to outcome variability. This standardization reveals that while MRT produces positive adaptations in cardiorespiratory fitness, the effect size may be smaller than previously estimated.
Regarding anaerobic performance measures, the current findings of significant improvements in sprint performance (SMD = 1.18, 95% CI: 0.00–2.36) and countermovement jump height (SMD = 0.80, 95% CI: −0.04–1.64) extend beyond previous meta-analyses (Milanović et al., 2015). Reported smaller magnitude effects for similar explosive performance measures, suggesting that MRT protocols may offer enhanced neuromuscular adaptations compared to traditional HIIT approaches.
The observed improvements in time to exhaustion (SMD = 0.23, 95% CI: 0.00–0.46) align with findings from (Liang et al., 2024), who reported significant enhancements in endurance performance following HIIT interventions. However, the current analysis reveals more modest effects, potentially due to differences in training populations and protocol designs. The consistency of these findings across studies (I2 = 0%) contrasts with the higher heterogeneity reported in previous analyses, suggesting more uniform adaptations to MRT protocols.
Peak power output findings (SMD = 0.54, 95% CI: −2.05–3.13) demonstrate greater variability than previous research (Kunz et al., 2019). Reported more consistent power adaptations following low-volume HIIT, though methodological differences in power assessment make direct comparisons challenging. The substantial heterogeneity (I2 = 93%) in the current analysis suggests protocol-specific responses that warrant careful consideration in training design.
Blood lactate responses (SMD = −1.68, 95% CI: −8.58 to 5.22) present a complex picture of metabolic adaptations. These findings partially align with (Kunz et al., 2019), who reported varied metabolic responses to HIIT interventions. The high heterogeneity (I2 = 97%) in lactate responses suggests that metabolic adaptations may be highly individualized and protocol-dependent.
The subgroup analyses revealing optimal adaptations in younger adults (19–25 years) extend findings from previous research. These age-specific responses align with observations by Gantois et al. (2019) regarding enhanced trainability in collegiate-aged athletes. The effectiveness of lower training frequencies (≤2 sessions/week) supports growing evidence that well-designed high-intensity protocols can achieve significant adaptations with modest training volumes, as demonstrated by Liu et al. (2024) and Wen et al. (2024).
These comparative findings demonstrate that MRT protocols can effectively develop multiple fitness qualities simultaneously, potentially offering advantages over traditional training approaches. The evidence suggests particular efficacy in developing explosive power and sprint capabilities while maintaining or enhancing cardiorespiratory fitness. However, the variability in certain outcomes emphasizes the importance of careful protocol design and implementation.
This meta-analysis demonstrates several methodological strengths, including comprehensive risk of bias assessment, robust statistical approaches using standardized mean differences, and systematic evaluation of evidence certainty through GRADE methodology. The analysis of heterogeneity provides valuable insights into the variability of training responses across different outcomes. However, important limitations must be acknowledged. The substantial heterogeneity observed in peak power output (I2 = 93%) and blood lactate responses (I2 = 97%) suggests significant protocol-dependent variations that complicate interpretation. The modest number of studies for certain outcomes, particularly sprint performance and countermovement jump (n = 3 each), limits the generalizability of these findings. Additionally, the asymmetrical funnel plot distribution for peak power output (Egger’s test p = 0.03) indicates potential publication bias. The predominance of male participants in the included studies limits our understanding of gender-specific adaptations to MRT.
The findings provide evidence-based guidance for implementing MRT in athletic populations. The high-certainty evidence supporting improvements in countermovement jump performance (SMD = 0.80, 95% CI: −0.04–1.64) and moderate-certainty evidence for sprint performance enhancements (SMD = 1.18, 95% CI: 0.00–2.36) suggest that MRT effectively develops explosive power and speed capabilities. The consistency of these adaptations, particularly in countermovement jump performance (I2 = 0%), indicates reliable training responses across different populations.
Practitioners should consider that optimal adaptations were observed with lower training frequencies (≤2 sessions/week), particularly in younger adults (19–25 years). This suggests that well-designed MRT protocols can achieve significant performance improvements without requiring high training volumes. The moderate improvements in cardiorespiratory fitness (SMD = 0.30, 95% CI: −0.19–0.79) indicate that MRT can maintain or enhance aerobic capacity while developing other performance qualities. The high variability in certain outcomes, particularly blood lactate responses, emphasizes the importance of individualized monitoring and protocol adjustment. Practitioners should implement careful progression and monitoring strategies, especially when working with different athletic populations or experience levels.
Future research should address several key gaps identified in this analysis. Studies examining female athletes’ responses to MRT are particularly needed, given the current male-dominated evidence base. Investigation of long-term adaptations beyond the typical 4–12-week intervention period would provide valuable insights into the sustainability of performance improvements. Research should also explore the interaction between training frequency and adaptation magnitude, particularly given the effectiveness observed with lower training frequencies. Additionally, standardization of blood lactate measurement protocols and power output assessment methods would help reduce the high heterogeneity observed in these outcomes. Future studies should incorporate standardized reporting of training protocols and physiological responses to facilitate more precise comparisons across investigations.
This systematic review and meta-analysis demonstrate that MRT effectively enhances multiple performance parameters in trained athletes. High-certainty evidence supports significant improvements in countermovement jump performance and sprint capabilities while maintaining cardiorespiratory fitness. The analysis reveals that meaningful adaptations occur with relatively modest training frequencies (≤2 sessions/week), particularly in younger adults (19–25 years) and experienced athletes. The substantial variability observed in peak power output (I2 = 93%) and blood lactate responses (I2 = 97%) indicates that physiological adaptations may be highly protocol-dependent, emphasizing the importance of individualized program design. The GRADE assessment provides crucial context for interpreting these findings, with evidence certainty ranging from high for countermovement jump performance to very low for blood lactate responses. These findings demonstrate that well-designed MRT protocols can achieve significant performance improvements without requiring excessive training volumes. However, the limited number of studies examining female athletes and long-term adaptations highlights important areas for future research. Implementation should focus on systematic progression and comprehensive monitoring to optimize individual training responses while maintaining proper movement quality.
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author.
YT: Conceptualization, Data curation, Formal Analysis, Funding acquisition, Investigation, Methodology, Project administration, Resources, Software, Supervision, Validation, Visualization, Writing–original draft, Writing–review and editing. DC: Data curation, Methodology, Supervision, Writing–original draft, Writing–review and editing.
The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.
Thank you to Capital University of Physical Education and Sports for providing the platform, thank you to my supervisor CD for his careful guidance, and thank you to the coauthors for their strong support.
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declare that no Generative AI was used in the creation of this manuscript.
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Aagaard P., Andersen J. L., Bennekou M., Larsson B., Olesen J. L., Crameri R., et al. (2011). Effects of resistance training on endurance capacity and muscle fiber composition in young top-level cyclists. Scand. J. Med. and Sci. Sports 21 (6), e298–e307. doi:10.1111/j.1600-0838.2010.01283.x
Adeel M., Lai C.-H., Wu C.-W., Kang J.-H., Liou J.-C., Chen H.-C., et al. (2021). Modeling of metabolic equivalents (METs) during moderate resistance training exercises. Appl. Sci. 11 (18), 8773. doi:10.3390/app11188773
Al Duhailib Z., Granholm A., Alhazzani W., Oczkowski S., Belley-Cote E., Møller M. H. (2024). GRADE pearls and pitfalls—Part 1: systematic reviews and meta-analyses. Acta Anaesthesiol. Scand. 68 (5), 584–592. doi:10.1111/aas.14386
Androulakis-Korakakis P., Langdown L., Lewis A., Fisher J. P., Gentil P., Paoli A., et al. (2018). Effects of exercise modality during additional “high-intensity interval training” on aerobic fitness and strength in powerlifting and strongman athletes. J. Strength Cond. Res. 32 (2), 450–457. doi:10.1519/JSC.0000000000001809
Ansari M. T., Tsertsvadze A., Moher D. (2009). Grading quality of evidence and strength of recommendations: a perspective. PLOS Med. 6 (9), e1000151. doi:10.1371/journal.pmed.1000151
Armijo-Olivo S., da Costa B. R., Cummings G. G., Ha C., Fuentes J., Saltaji H., et al. (2015). PEDro or Cochrane to assess the quality of clinical trials? A meta-epidemiological study. PLOS One 10 (7), e0132634. doi:10.1371/journal.pone.0132634
Bakbergenuly I., Hoaglin D. C., Kulinskaya E. (2020). Methods for estimating between-study variance and overall effect in meta-analysis of odds ratios. Res. Synthesis Methods 11 (3), 426–442. doi:10.1002/jrsm.1404
Barker T. H., Migliavaca C. B., Stein C., Colpani V., Falavigna M., Aromataris E., et al. (2021). Conducting proportional meta-analysis in different types of systematic reviews: a guide for synthesisers of evidence. BMC Med. Res. Methodol. 21 (1), 189. doi:10.1186/s12874-021-01381-z
Brozek J. L., Canelo-Aybar C., Akl E. A., Bowen J. M., Bucher J., Chiu W. A., et al. (2021). GRADE guidelines 30: the GRADE approach to assessing the certainty of modeled evidence—an overview in the context of health decision-making. J. Clin. Epidemiol. 129, 138–150. doi:10.1016/j.jclinepi.2020.09.018
Cashin A. G., McAuley J. H. (2020). Clinimetrics: physiotherapy evidence database (PEDro) scale. J. Physiother. 66 (1), 59. doi:10.1016/j.jphys.2019.08.005
Chang Y.-H., Chou Y.-C., Chang Y.-C., Tan K.-H., Wu M.-H. (2022). The effects of high-intensity power training versus traditional resistance training on exercise performance. Int. J. Environ. Res. Public Health 19 (15), 9400. doi:10.3390/ijerph19159400
Cho S.-H., Shin I.-S. (2021). A reporting quality assessment of systematic reviews and meta-analyses in sports physical therapy: a review of reviews. Healthc. (Basel) 9 (10), 1368. doi:10.3390/healthcare9101368
Cochrane Training (2023). Using GRADEpro to perform a GRADE. YouTube. Available online at: https://www.youtube.com.
Contreras B. (2014). Bodyweight strength training anatomy. Champaign, IL: Human Kinetics. Available online at: http://ndl.ethernet.edu.et/bitstream/123456789/35699/1/Bret_Contreras.pdf.
Da Silva M. A. R., Baptista L. C., Neves R. S., De França E., Loureiro H., Lira F. S., et al. (2020). The effects of concurrent training combining both resistance exercise and high-intensity interval training or moderate-intensity continuous training on metabolic syndrome. Front. Physiology 11, 572. doi:10.3389/fphys.2020.00572
Debray T. P. A., Moons K. G. M., Riley R. D. (2018). Detecting small-study effects and funnel plot asymmetry in meta-analysis of survival data: a comparison of new and existing tests. Res. Synthesis Methods 9 (1), 41–50. doi:10.1002/jrsm.1266
de Morton N. A. (2009). The PEDro scale is a valid measure of the methodological quality of clinical trials: a demographic study. Aust. J. Physiother. 55 (2), 129–133. doi:10.1016/s0004-9514(09)70043-1
Dettori J. R., Norvell D. C., Chapman J. R. (2022). Fixed-effect vs random-effects models for meta-analysis: 3 points to consider. Glob. Spine J. 12 (7), 1624–1626. doi:10.1177/21925682221110527
Devereux G., Le Winton H. G., Black J., Beato M. (2022). Effect of a high-intensity short-duration cycling elevation training mask on VO2max and anaerobic power: a randomized controlled trial. Biol. Sport 39 (1), 181–187. doi:10.5114/biolsport.2021.102926
Fiorenza M., Hostrup M., Gunnarsson T. P., Shirai Y., Schena F., Iaia F. M., et al. (2019). Neuromuscular fatigue and metabolism during high-intensity intermittent exercise. Med. and Sci. Sports and Exerc. 51 (8), 1642–1652. doi:10.1249/MSS.0000000000001959
Gantois P., Batista G. R., Aidar F. J., Nakamura F. Y., De Lima-Júnior D., Cirilo-Sousa M. S., et al. (2019). Repeated sprint training improves both anaerobic and aerobic fitness in basketball players. Int. J. Environ. Sci. 27 (2), 97–105. doi:10.3233/IES-182212
Harrer M., Cuijpers P., Furukawa T. A., Ebert D. D. (2021). Doing meta-analysis in R: a hands-on guide. Boca Raton, FL and London: Chapman and Hall/CRC Press. Available online at: https://bookdown.org/MathiasHarrer/Doing_Meta_Analysis_in_R/pub-bias.html.
Higgins J., Thomas J., Chandler J., Cumpston M., Li T., Page M., et al. (2024). Cochrane Handbook for systematic reviews of interventions, 6.5. Cochrane Training Chapter 21. Available online at: https://training.cochrane.org/handbook/current.
Higgins J. P. T., Altman D. G., Gøtzsche P. C., Jüni P., Moher D., Oxman A. D., et al. (2011). The Cochrane collaboration’s tool for assessing risk of bias in randomised trials. BMJ 343, d5928. doi:10.1136/bmj.d5928
Hopkins W. G., Marshall S. W., Batterham A. M., Hanin J. (2009). Progressive statistics for studies in sports medicine and exercise science. Med. and Sci. Sports and Exerc. 41 (1), 3–13. doi:10.1249/MSS.0b013e31818cb278
Hultcrantz M., Mustafa R. A., Leeflang M. M. G., Lavergne V., Estrada-Orozco K., Ansari M. T., et al. (2020). Defining ranges for certainty ratings of diagnostic accuracy: a GRADE concept paper. J. Clin. Epidemiol. 117, 138–148. doi:10.1016/j.jclinepi.2019.05.002
Jett M., Swank A. M. (2013). Metabolic resistance training: a strategy to add “play” to our clinical programs. ACSM’s Health and Fit. J. 17 (2), 31–33. doi:10.1249/FIT.0b013e318282afe9
Jones A. M. (2024). The fourth dimension: physiological resilience as an independent determinant of endurance exercise performance. J. Physiology 602 (17), 4113–4128. doi:10.1113/JP284205
Kang X., Zhang Y., Sun C., Zhang J., Che Z., Zang J., et al. (2025). Effectiveness of virtual reality training in improving outcomes for dialysis patients: systematic review and meta-analysis. J. Med. Internet Res. 27, e58384. doi:10.2196/58384
Kelly D. T., Cregg C. J., O’Connor P. L., Cullen B. D., Moyna N. M. (2021). Physiological and performance responses of sprint interval training and endurance training in gaelic football players. Eur. J. Appl. Physiology 121 (8), 2265–2275. doi:10.1007/s00421-021-04699-0
Kon M., Nakagaki K., Ebi Y. (2019). Effects of all-out sprint interval training under hyperoxia on exercise performance. Physiol. Rep. 7 (14), 141944–e14210. doi:10.14814/phy2.14194
Kunz P., Engel F. A., Holmberg H.-C., Sperlich B. (2019). A meta-comparison of the effects of high-intensity interval training to those of small-sided games and other training protocols on parameters related to the physiology and performance of youth soccer players. Sports Med. - Open 5 (1), 7. doi:10.1186/s40798-019-0180-5
Kuriyama S. (2019). “No pain, No gain” and the history of presence. Representations 146 (1), 91–111. doi:10.1525/rep.2019.146.1.91
Lee S., Neogi T., Senderling B. M., Gheller M., Adamowicz L., Georgiev P., et al. (2023). No pain, No gain: association of pain increase during exercise with exercise-induced hypoalgesia in people with knee osteoarthritis. Osteoarthr. Cartil. 31, S378. doi:10.1016/j.joca.2023.01.428
Lee Y. H. (2019). Strengths and limitations of meta-analysis. Korean J. Med. 94 (5), 391–395. doi:10.3904/kjm.2019.94.5.391
Lensen S. (2023). When to pool data in a meta-analysis (and when not to)? Fertil. Steril. 119 (6), 902–903. doi:10.1016/j.fertnstert.2023.03.015
Lev A. (2023). “The glorious pain”: attaining pleasure and gratification in times of delayed onset muscle soreness (DOMS) among gym goers. J. Contemp. Ethnogr. 52 (3), 295–316. doi:10.1177/08912416221113369
Liang W., Wang X., Cheng S., Jiao J., Zhu X., Duan Y. (2024). Effects of high-intensity interval training on the parameters related to physical fitness and health of older adults: a systematic review and meta-analysis. Sports Med. - Open 10 (1), 98. doi:10.1186/s40798-024-00767-9
Liu G., Wang X., Xu Q. (2024). Supervised offseason training programs are able to mitigate the effects of detraining in youth men soccer players' physical fitness: a randomized parallel controlled study. J. Sports Sci. Med. 23, 219–227. doi:10.52082/jssm.2024.219
Mallol M., Bentley D. J., Norton L., Norton K., Mejuto G., Yanci J. (2019). Comparison of reduced-volume high-intensity interval training and high-volume training on endurance performance in triathletes. Int. J. Sports Physiology Perform. 14 (2), 239–245. doi:10.1123/ijspp.2018-0359
Marín-Pagán C., Blazevich A. J., Chung L. H., Romero-Arenas S., Freitas T. T., Alcaraz P. E. (2020). Acute physiological responses to high-intensity resistance circuit training vs. Traditional strength training in soccer players. Biology 9 (11), 383. doi:10.3390/biology9110383
Milanović Z., Sporiš G., Weston M. (2015). Effectiveness of high-intensity interval training (hit) and continuous endurance training for VO2max improvements: a systematic review and meta-analysis of controlled trials. Sports Med. 45 (10), 1469–1481. doi:10.1007/s40279-015-0365-0
Moen F., Firing K., Vittersø J. (2016). No pain No gain: feelings and performance in sport. J. Athl. Enhanc. 5 (3). doi:10.4172/2324-9080.1000229
Moro T., Marcolin G., Bianco A., Bolzetta F., Berton L., Sergi G., et al. (2020). Effects of 6 Weeks of traditional resistance training or high-intensity interval resistance training on body composition, aerobic power, and strength in Healthy young subjects: a randomized parallel trial. Int. J. Environ. Res. Public Health 17 (11), 4093. doi:10.3390/ijerph17114093
Moseley A. M., Rahman P., Wells G. A., Zadro J. R., Sherrington C., Toupin-April K., et al. (2019). Agreement between the Cochrane risk of bias tool and physiotherapy evidence database (PEDro) scale: a meta-epidemiological study of randomized controlled trials of physical therapy interventions. PLOS One 14 (9), e0222770. doi:10.1371/journal.pone.0222770
Neumann I., Brożek J., Guyatt G., Oxman A. (2013). GRADE Handbook (Version 1.0). The GRADE Working Group. Chapter 3. Available online at: https://book.gradepro.org.
Nunes P. R. P., Castro-e-Souza P., de Oliveira A. A., Camilo B. D. F., Cristina-Souza G., Vieira-Souza L. M., et al. (2024). Effect of resistance training volume on body adiposity, metabolic risk, and inflammation in postmenopausal and older females: systematic review and meta-analysis of randomized controlled trials. J. Sport Health Sci. 13 (2), 145–159. doi:10.1016/j.jshs.2023.09.012
Page M. J., McKenzie J. E., Bossuyt P. M., Boutron I., Hoffmann T. C., Mulrow C. D., et al. (2021). The PRISMA 2020 statement: an updated guideline for reporting systematic reviews. PLOS Med. 18 (3), e1003583. doi:10.1371/journal.pmed.1003583
Pandis N., Fleming P. S., Worthington H., Salanti G. (2015). The quality of the evidence according to GRADE is predominantly low or very low in oral health systematic reviews. PLOS ONE 10 (7), e0131644. doi:10.1371/journal.pone.0131644
Papakostidis C., Giannoudis P. V. (2023). Meta-analysis: what have we learned? Injury 54, S30–S34. doi:10.1016/j.injury.2022.06.012
Prasad M. (2024). Introduction to the GRADE tool for rating certainty in evidence and recommendations. Clin. Epidemiol. Glob. Health 25, 101484. doi:10.1016/j.cegh.2023.101484
Röver C., Knapp G., Friede T. (2015). Hartung-knapp-sidik-jonkman approach and its modification for random-effects meta-analysis with few studies. BMC Med. Res. Methodol. 15 (1), 99. doi:10.1186/s12874-015-0091-1
Schoenfeld B. (2021). Metabolic resistance training: build muscle and torch fat at once! Bodybuilding.com. Available online at: https://www.bodybuilding.com.
Seide S. E., Röver C., Friede T. (2019). Likelihood-based random-effects meta-analysis with few studies: empirical and simulation studies. BMC Med. Res. Methodol. 19 (1), 16. doi:10.1186/s12874-018-0618-3
Song F., Khan K. S., Dinnes J., Sutton A. J. (2002). Asymmetric funnel plots and publication bias in meta-analyses of diagnostic accuracy. Int. J. Epidemiol. 31 (1), 88–95. doi:10.1093/ije/31.1.88
Steele J., Androulakis-Korakakis P., Perrin C., Fisher J. P., Gentil P., Scott C., et al. (2019). Comparisons of resistance training and “cardio” exercise modalities as countermeasures to microgravity-induced physical deconditioning: new perspectives and lessons learned from terrestrial studies. Front. Physiology 10, 1150. doi:10.3389/fphys.2019.01150
Sterne J. A. C., Savović J., Page M. J., Elbers R. G., Blencowe N. S., Boutron I., et al. (2019). RoB 2: a revised tool for assessing risk of bias in randomised trials. BMJ 366, l4898. doi:10.1136/bmj.l4898
Swinton P. A., Shim J. S. C., Pavlova A. V., Moss R., Maclean C., Brandie D., et al. (2023). What are small, medium, and large effect sizes for exercise treatments of tendinopathy? A systematic review and meta-analysis. BMJ Open Sport and Exerc. Med. 9 (1), e001389. doi:10.1136/bmjsem-2022-001389
Talsnes R. K., Engdahl L. J., Sandbakk Ø. (2022a). How do the effects of an 8-week intervention influence subsequent performance development in cross-country skiers? Int. J. Sports Physiology Perform. 17 (4), 594–604. doi:10.1123/ijspp.2021-0356
Talsnes R. K., Van Den Tillaar R., Sandbakk Ø. (2022b). Effects of increased load of low- versus high-intensity endurance training on performance and physiological adaptations in endurance athletes. Int. J. Sports Physiology Perform. 17 (2), 216–225. doi:10.1123/ijspp.2021-0190
Thorlund K., Imberger G., Johnston B. C., Walsh M., Awad T., Thabane L., et al. (2012). Evolution of heterogeneity (I2) estimates and their 95% confidence intervals in large meta-analyses. PLOS One 7 (7), e39471. doi:10.1371/journal.pone.0039471
von Hippel P. T. (2015). The heterogeneity statistic I2 can Be biased in small meta-analyses. BMC Med. Res. Methodol. 15 (1), 35. doi:10.1186/s12874-015-0024-z
Weber F., Knapp G., Glass Ä., Kundt G., Ickstadt K. (2021). Interval estimation of the overall treatment effect in random-effects meta-analyses: recommendations from a simulation study comparing frequentist, bayesian, and bootstrap methods. Res. Synthesis Methods 12 (3), 291–315. doi:10.1002/jrsm.1471
Wen X., Song F., Yang L., Xu Q. (2024). Small-sided soccer games promote greater adaptations on vertical jump and change-of-direction deficit and similar adaptations in aerobic capacity than high-intensity interval training in females. J. Sports Sci. Med. 23, 445–454. doi:10.52082/jssm.2024.445
Weston M., Taylor K. L., Batterham A. M., Hopkins W. G. (2014). Effects of low-volume high-intensity interval training (hit) on fitness in adults: a meta-analysis of controlled and non-controlled trials. Sports Med. 44 (7), 1005–1017. doi:10.1007/s40279-014-0180-z
Woodard T. J. (2014). Comparison of Timed-Based Sets Metabolic Resistance Training vs. Repetition-Based Sets Metabolic Resistance Training on EPOC in Recreationally Active Young Women (Master’s thesis). Southern Illinois University Carbondale. Available online at: https://opensiuc.lib.siu.edu/cgi/viewcontent.cgi?referer=&httpsredir=1&article=1691&context=gs_rp.
Xun Y., Guo Q., Ren M., Liu Y., Sun Y., Wu S., et al. (2023). Characteristics of the sources, evaluation, and grading of the certainty of evidence in systematic reviews in public health: a methodological study. Front. Public Health 11, 998588. doi:10.3389/fpubh.2023.998588
Zhai C., Guyatt G. (2024). Fixed-effect and random-effects models in meta-analysis. Chin. Med. J. Engl. 137 (1), 1–4. doi:10.1097/CM9.0000000000002814
Zhao J.-G. (2013). Identifying and measuring heterogeneity across the studies in meta-analysis. J. Hand Surg. 38 (7), 1449–1450. doi:10.1016/j.jhsa.2013.05.020
We have included our location, terrain, and grain size estimates from field spectroscopy in Table A1.
Keywords: metabolic resistance training (MRT), high-intensity interval training (HIIT), athletic performance, athlete development, meta-analysis
Citation: Tongwu Y and Chuanwei D (2025) The effectiveness of metabolic resistance training versus traditional cardio on athletic performance: a systematic review and meta-analysis. Front. Physiol. 16:1551645. doi: 10.3389/fphys.2025.1551645
Received: 26 December 2024; Accepted: 25 February 2025;
Published: 20 March 2025.
Edited by:
Marianna Bellafiore, University of Palermo, ItalyReviewed by:
Clóvis Albuquerque Maurício, Federal University of Rio de Janeiro, BrazilCopyright © 2025 Tongwu and Chuanweil. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Chuanwei Ding, ODExMTcxNjQ5QHFxLmNvbQ==
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
Research integrity at Frontiers
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.