Identification of Main Influencers of Surgical Efficiency and Variability Using Task-Level Objective Metrics: A Five-Year Robotic Sleeve Gastrectomy Case Series

Tousignant, Mark R.; Liu, Xi; Ershad Langroodi, Marzieh; Jarc, Anthony M.

doi:10.3389/fsurg.2022.756522

ORIGINAL RESEARCH article

Front. Surg., 02 May 2022

Sec. Visceral Surgery

Volume 9 - 2022 | https://doi.org/10.3389/fsurg.2022.756522

This article is part of the Research TopicGastrointestinal Surgery: Emerging techniques, controversies and state of artView all 14 articles

Identification of Main Influencers of Surgical Efficiency and Variability Using Task-Level Objective Metrics: A Five-Year Robotic Sleeve Gastrectomy Case Series

Mark R. Tousignant¹

Xi Liu²^*

Marzieh Ershad Langroodi²

Anthony M. Jarc²

¹Medical Safety and Innovation, Intuitive Surgical Inc., Sunnyvale, CA, United States
²Applied Research, Intuitive Surgical Inc., Peachtree Corners, GA, United States

Objective: Surgical efficiency and variability are critical contributors to optimal outcomes, patient experience, care team experience, and total cost to treat per disease episode. Opportunities remain to develop scalable, objective methods to quantify surgical behaviors that maximize efficiency and reduce variability. Such objective measures can then be used to provide surgeons with timely and user-specific feedbacks to monitor performances and facilitate training and learning. In this study, we used objective task-level analysis to identify dominant contributors toward surgical efficiency and variability across the procedural steps of robotic-assisted sleeve gastrectomy (RSG) over a five-year period for a single surgeon. These results enable actionable insights that can both complement those from population level analyses and be tailored to an individual surgeon's practice and experience.

Methods: Intraoperative video recordings of 77 RSG procedures performed by a single surgeon from 2015 to 2019 were reviewed and segmented into surgical tasks. Surgeon-initiated events when controlling the robotic-assisted surgical system were used to compute objective metrics. A series of multi-staged regression analysis were used to determine: if any specific tasks or patient body mass index (BMI) statistically impacted procedure duration; which objective metrics impacted critical task efficiency; and which task(s) statistically contributed to procedure variability.

Results: Stomach dissection was found to be the most significant contributor to procedure duration (β = 0.344, p< 0.001; R = 0.81, p< 0.001) followed by surgical inactivity and stomach stapling. Patient BMI was not found to be statistically significantly correlated with procedure duration (R = −0.01, p = 0.90). Energy activation rate, a robotic system event-based metric, was identified as a dominant feature in predicting stomach dissection duration and differentiating earlier and later case groups. Reduction of procedure variability was observed between earlier (2015-2016) and later (2017-2019) groups (IQR = 14.20 min vs. 6.79 min). Stomach dissection was found to contribute most to procedure variability (β = 0.74, p < 0.001).

Conclusions: A surgical task-based objective analysis was used to identify major contributors to surgical efficiency and variability. We believe this data-driven method will enable clinical teams to quantify surgeon-specific performance and identify actionable opportunities focused on the dominant surgical tasks impacting overall procedure efficiency and consistency.

Introduction

Surgical efficiency and variability are critical contributors to optimal outcomes, patient and care team experience, and total cost to treat per disease episode (1–3). However, it is often unclear to clinical teams how to objectively quantify their own surgical efficiency and variability. Further, population-level analyses alone are not always able to deliver actionable insights to an individual surgeon due to unique aspects during practice. Therefore, objective methods to characterize surgical workflow and identify actionable areas for improvement with tailored feedback for each surgeon still need to be developed and made widely available.

Although multiple factors influence outcomes and efficiencies, many studies focus on how surgery is performed by describing subjectively initial case series or critical aspects within the procedure. Few studies use objective methods to identify which surgical activities and how surgeon performance affect overall procedure efficiency or surgical outcomes throughout a surgeon's learning curve (4–8). These studies are largely agnostic to the underlying surgical activities by using global subjective rating scales like Global Evaluative Assessment of Robotic Skills (GEARS) (9) or Objective Structured Assessment of Technical Skills (OSATS) (10). Further, although some studies describe the tasks within a surgery (11–13), there is room for improvement through the establishment of quantitative methods to provide actionable objective measures. Finally, task-based objective performance indicators (OPIs) other than total operative time are often neglected despite offering the potential for improved and focused feedback (14–17). There exists an opportunity to develop more objective methods that can scale for broad use (18–20) given a limited number of studies use subjective methods to estimate the impact of a surgeon's technical skills on patient outcomes (21–25). Additionally, these objective methods need to be able to be applied to an individual surgeon, within institutions, or across institutions.

The purpose of this study was to demonstrate a novel, data-driven method that retrospectively identifies dominant factors that influenced a surgeon's performance efficiency and variability over five years when performing robotic-assisted sleeve gastrectomy (RSG) procedures. Specifically, we (1) identified the dominant factors of surgical efficiency and variability within RSG by focusing on surgical tasks, (2) examined the influence of body mass index (BMI), an important patient factor within bariatric surgery (26), on efficiency, and (3) identified OPIs with greatest impact on efficiency of the identified critical step. The data-driven methods developed in this study might also be further generalized for clinical teams, residents during training and educators to quantify performance and identify actionable and scalable changes.

Materials and Methods

Study Design

Seventy-seven RSG procedures performed by a single surgeon from April 21st, 2015 to June 3rd, 2019 were retrospectively reviewed. All procedures were performed using the da Vinci Xi surgical system (Intuitive Surgical Inc., Sunnyvale, CA, USA). Nine surgical tasks were defined that constitute the major steps needed to complete a sleeve gastrectomy (Figure 1). The tasks included stomach dissection, hiatal hernia dissection (optional), lay stomach back, place bougie, stomach stapling, hiatal hernia repair (optional), oversew staple line, leak test and stomach extraction. Any additional surgical activities were defined as “other” and idle time between tasks were defined as “surgical inactivity.”

FIGURE 1

Figure 1. Procedure workflow changes over years. Segmented tasks from 9 example cases. Hiatal hernia dissection and repair were excluded from further analysis. Surgical inactivity time was denoted as the gaps between tasks.

Detailed criteria for task start and stop times was also defined to minimize annotation variability. For example, the start time of stomach dissection was defined as the time when a dissection tool engages with tissue to initiate dissection along the greater curve of the stomach. The start and stop times for each task in each video were then annotated by three professionally trained annotation technicians. An expert surgeon reviewed samples of these annotations to ensure quality. Note that hiatal hernia dissection and repair were optional tasks in RSG procedures and thus excluded from procedure time and subsequent analysis.

To observe and compare the changes in task completion time, we grouped the surgical videos into earlier and later case groups. Specifically, earlier cases included 39 videos from the years 2015 through 2016, and the later cases included 38 videos from the years 2017 through 2019. Note that the earlier cases were a subset from the first 50 cases of the surgeon and the later cases were a subset from the latest 80 cases of the same surgeon.

Procedure Efficiency and Variability

Overall procedure duration was considered as a measure of efficiency, and interquartile range (IQR) of consecutive case durations was used as a measure of variability. To further study the efficiency of the identified task(s), surgeon behavior was characterized by OPIs derived from three major surgical robotic system events: camera movement, energy activation and arm swap. The start and stop timestamps for each event were used to calculate event-based OPIs, including rates of occurrences, and median durations of all occurrences. Identifying the OPIs that contribute to overall improvement of the task assists with identifying the skills that need to be focused on during training to improve efficiency.

Statistical Analysis

We used a three-staged regression analysis to identify main contributor(s) to procedure efficiency: (1) Spearman rank-order correlation test between each independent variable and procedure duration; (2) multivariable regression analysis; (3) recursive feature elimination (RFE) (27). The variables considered in efficiency analysis were task durations and patient BMI. Specifically, the correlation matrix of all independent variables was first checked to ensure no multicollinearity in the data. Task duration, procedure duration, and BMI were then normalized by corresponding median values from the first 5 cases to capture a baseline of surgeon behavior and patient factors. Next, β coefficients from a multivariable linear regression analysis were compared to identify variable(s) with the highest impact on procedure efficiency. Finally, RFE was used to rank the independent variables. This analysis leads to identifying the critical task that can be focused on for further analysis. Confounding effect of BMI on the critical task in association with procedure efficiency was also examined.

To characterize the impact of surgeon behavior on efficiency, we computed event-based OPIs for the identified critical task and investigated the association between OPIs and task duration using the three-staged regression analysis. We also evaluated the ability of these OPIs in differentiating between earlier and later case groups using logistic regression. RFE and LASSO (28, 29) feature selection methods were again used to rank the OPIs.

Finally, we examined association between procedure and task duration variability across all procedures. IQRs of procedure and task durations were computed by applying a sliding window for every five consecutive procedures with a stride of one procedure in earlier and later groups, respectively. Task(s) that contributed most to overall procedure variability was then identified using the same three-staged analysis. Furthermore, logistic regression with RFE was used to identify tasks with most variability between earlier and later groups. p < 0.05 was considered statistically significant in all of our statistical analysis. Statistical analysis was performed using Python's statistical functions (Python 3.7.9; SciPy v1.5.2; scikit-learn 0.23.2).

Results

Procedure Characteristics

Surgical task annotation results of nine example cases ordered chronologically were shown in Figure 1. Each row corresponded to one case and each color bar corresponded to an annotated task in the case. Reductions in procedure duration and task duration and variability can also be observed as the surgeon progressed over years (Figures 1, 2).

FIGURE 2

Figure 2. Trend in task duration change of all 77 cases over five years.

Detailed characteristics of the case series, including the number of occurrences, median value and IQR of different case groups were provided in Table 1. Among the seven surgical tasks, five tasks were identified as frequent tasks across the case series: stomach dissection, place bougie, stomach stapling, oversew staple line and leak test, with occurrences of oversew staple line decreased [earlier 36 (92.3%) vs. later 8 (21.1%)]. When comparing earlier and later case groups at procedure level, median procedure duration and IQR decreased (earlier 41.89 min, IQR = 14.2 min vs. later 27.73 min, IQR = 6.79 min). Similarly, median duration of all frequent tasks decreased except for stomach stapling, and IQRs of all five frequent tasks decreased. The decreases in both median durations and IQRs indicates procedure efficiency improvement and variability reduction between the earlier and later groups. There is no obvious change in patient BMI characteristics (earlier 44.14, IQR = 8.85 vs. later 44.25, IQR = 9.62). Distribution of BMI and procedure time can be found in Figure 3.

TABLE 1

Table 1. Statistics of surgical tasks, procedure duration and BMI.

FIGURE 3

Figure 3. Correlation plot between BMI, task durations and procedure duration. Regression lines are included for each sub-comparison. The 95% confidence intervals were shown as the translucent bands around the regression line. Distributions of (A) BMI, (B) stomach dissection, (C) place bougie, (D) stomach stapling, (E) leak test, (F) surgical inactivity with regard to procedure durations are included for earlier and later case groups, respectively.

Efficiency Analysis

Critical Task Identification

In the first-stage analysis, none of the independent variables were found to be highly correlated with each other (Spearman rank-order correlation coefficients R ranging from −0.24 to 0.66) (detailed correlation matrix is visualized in Supplementary Figure 1). Correlation coefficients between each variable and procedure duration were summarized in Table 2 and visualized in Figure 3. Among all variables, stomach dissection was found to be most significantly correlated with procedure duration (R = 0.81, p < 0.001). BMI was not found to be statistically significantly correlated with procedure duration (R = −0.01, p = 0.90).

TABLE 2

Table 2. Regression models examining procedure duration change by surgical task duration and BMI change.

In the subsequent multivariable regression analysis, all independent variables were normalized by corresponding median durations of the first 5 cases from the surgeon (Table 1) to ensure fair comparison. The β coefficients of each variable were compared among earlier, later and all cases (Table 2). In the earlier group, a unit increase in stomach dissection duration relative to the median duration from the first 5 cases (i.e., increases by 21.18 min) was associated with a 34.3% increase (β = 0.343, 95% CI 0.324 to 0.362, p < 0.001) in baseline procedure duration (i.e., a 34.3% increase from 62.54 min). Compared with all other variables, stomach dissection was found to be associated with the largest β coefficient. Similarly, when considering later cases and all cases, stomach dissection was again associated with the largest β coefficients (Table 2). Surgical inactivity also contributed to procedure duration increase in both earlier and later groups (earlier β = 0.158, 95% CI 0.135 to 0.182, p < 0.001 vs. later β = 0.140, 95% CI 0.138 to 0.143, p < 0.001). In contrast, BMI was not found to be statistically significant in association with procedure duration in all cases (β = 0.001, 95% CI −0.012 to 0.014, p = 0.88; R = −0.01, p = 0.90) and neither in earlier or later groups.

Finally, RFE with linear regression was used to recursively eliminate and rank these eight features in predicting procedure duration change. Stomach dissection, stomach stapling and surgical inactivity were consistently ranked the top three most important features (Table 2). Patient BMI was consistently ranked the lowest across all groups.

Overall, stomach dissection was found to be the major critical task and main contributor to procedure efficiency considering all three stages of analysis. To further examine confounding effect of BMI on stomach dissection, β coefficient of dissection from a univariate linear regression (β = 0.703, 95% CI 0.595 to 0.810, p < 0.001) was compared to the coefficient from a multivariable regression model after adding BMI (β = 0.711, 95% CI 0.604 to 0.819, p < 0.001). The results indicate a 1.14% increase in the coefficient thus showing no confounding effect of BMI on stomach dissection task.

Event-Based Objective Performance Indicator

Five event-based OPIs were computed from surgical system events that occurred during stomach dissection. To investigate the association between OPIs and the critical task (i.e. stomach dissection) efficiency, the same three-staged regression analysis was performed. The absolute values of the correlation coefficients R between each pair of OPIs were in the range of (0.01, 0.44) ensuring no multicollinearity (Supplementary Figure 2). Energy activation rate, median duration of camera movement and camera movement rate were found to be statistically correlated with stomach dissection duration (Table 3). In the subsequent multivariable regression analysis, all variables were normalized to the median of the first 5 cases. Among all variables, energy activation rate was found to be statistically significantly associated with task duration (β =-2.40, 95% CI −3.90 to −0.91, p = 0.002). Finally, RFE was performed along with the linear regression to rank OPIs in association with stomach dissection duration. The rankings indicate that median duration of camera movement and energy activation rate were the two most influential OPIs on task efficiency. Overall, energy activation rate was found to be a consistent indicator of task efficiency considering all three-staged analyses.

TABLE 3

Table 3. Regression models examining stomach dissection duration change by event OPIs.

To further investigate surgeon's behavior change throughout the longitudinal dataset, we identified the OPIs that can best differentiate surgeon's performance in the critical task between earlier and later case groups. Two feature selection methods (LASSO and RFE) with logistic regression were used. Energy activation rate was again selected as the top feature by both methods. All features along with their ranks from RFE feature selection and coefficients from LASSO feature selection were summarized in Table 3. The comparisons of all OPIs for the earlier and later case groups were shown in Figure 4.

FIGURE 4

Figure 4. Stomach dissection OPIs between earlier and later case groups. Comparisons between earlier and later cases groups were provided for OPIs: (A) energy activation rate, (B) energy activation median duration, (C) arm swap rate, (D) camera movement rate, (E) camera movement median duration.

Variability Analysis

We observed decreases in IQRs of procedure and task durations between earlier and later groups (Table 1). To further investigate the association between task and procedure duration variability, IQRs of task and procedure durations were computed for every five consecutive cases in earlier and later groups. The IQRs were then combined to analyze variability among all cases. Five tasks were selected as independent variables to ensure equal occurrences between earlier and later groups. To compare different tasks, all IQRs were normalized by the values from the first five cases (Table 1). None of the independent variables were found to be highly correlated (coefficients ranging from −0.18 to 0.33) (see Supplementary Figure 3).

When considering all cases, stomach dissection variability was found to contribute most to procedure variability with high consistency according to our three-staged analysis (Table 4). Specifically, a unit increase in stomach dissection IQR from 5 consecutive cases compared to the baseline IQR (Table 1) was associated with a 74% increase (β = 0.74, 95% CI 0.31 to 1.17, p = 0.001) in procedure duration IQR (i.e. a 74% increase from the baseline IQR = 7.81 min). Finally, stomach dissection and surgical inactivity were among the top three features in predicting procedure variability ranked by RFE.

TABLE 4

Table 4. Regression models examining procedure variability by surgical task variability.

To identify tasks with most variability between earlier and later groups, we used RFE with logistic regression. These results showed that place bougie, leak test and surgical inactivity contributed most to variability differences between the two groups (Table 4).

Discussion

We believe this study outlines a novel method to identify the dominant influencers to overall procedure efficiency and variability within RSG through surgical task decomposition and task-based OPIs. The multi-staged regression analysis can help to identify dominant factors that influence surgery through quantitative measures, which is critically important to delivering actionable and focused surgeon-specific feedback but may also be generalized to enable objective and scalable insights across institutions. These objective and scalable feedbacks could also be especially helpful for surgeons during training.

In order to gain a deeper insight into RSG, the procedures were segmented into nine distinct surgical tasks based upon clinical relevance, consistency across the case series, and the ability to establish clear definition of start and stop times. Moreover, the nine surgical steps were defined in such a way to accommodate for minor technique changes over the case series (i.e., hiatal hernia dissection and repair and oversew the staple line were not present in all procedures). Stomach dissection and gastric sleeve stapling were two critical tasks within RSG. Additional surgical activities beyond the nine distinct tasks were classified as other or surgical inactivity. The surgical task segmentation is a foundational component that enables the ability to perform focused and granular analysis than conventional learning curve analysis (8, 30, 31) for this RSG case series.

The multi-staged regression analysis was first used to analyze the case series to determine the critical surgical task impacting overall efficiency and variability. As one might expect, overall variability decreased as overall efficiency increased. The critical task the correlates highest with the total procedure efficiency and variability for this single surgeon RSG case series was identified as stomach dissection (Tables 2, 4). Stomach dissection requires a combination of clinical judgment, such as identification of the gastromesenteric ligament, pylorus, and short gastric vessels, as well as technical skill, such as energy use, retraction, dissection, and camera control. Education around clinical knowledge and technique and associated technical skills for this step offer an opportunity for focused gains on efficiency.

Surgical inactivity was another important factor impacting overall efficiency. Efforts to reduce periods of inactivity can be pursued by both the surgeon and OR team by reducing interruptions and training around the equipment and technique required to complete the procedure. Development of repeatable techniques, surgical approach, proficiency, and coordination by both the operating surgeon and OR team are essential to ensure consistency and predictability.

Notably, patient BMI consistently ranked the least dominant feature to impact total operative time. One possible explanation may be due to the fact that these cases were performed robotically, which may eliminate the ergonomic challenges of operating on high-BMI patients seen in conventional laparoscopy, a compelling result within robotic-assisted bariatric surgery. This finding is consistent with those reported in other robotic-assisted bariatric procedures (26, 32, 33). In addition to which steps (or patient factors) influenced efficiency and variability, this study also identified objective metrics that quantify what surgeon behaviors within the most influential step—stomach dissection—differed most over the surgeon learning curve. Specifically, we used OPIs as objective measures, which were derived from three major surgical robotic system events: camera movement, energy activation, and arm swap. In addition to performing the multi-stage regression analysis across the entire case series, a second analysis was performed comparing earlier vs. later cases in the series to determine if there was any change over time. Counts of energy activation per minute was the top ranked OPI, which might be linked to dissection technique and surgeon technical skill using the energy pedals. By focusing training on related surgeon behaviors, one might allow for improved efficiency and reduced variability. Furthermore, the OPIs reported here removed the subjectivity inherent to rating scales (e.g., GEARS) and enabled scalability by eliminating the reliance on experts or crowds of lay people to complete the ratings.

This study has several limitations. First, this was a case series by a single surgeon across two institutions, and thus the identified dominant factors associated with efficiency and variability need to be reproduced by other surgeons and institutions to evaluate generalizability. Additionally, different surgical tasks and additional OPIs could be explored to see if they are more impactful to efficiency or variability. Community consensus across procedures will allow for more robust analysis and broad adoption (34). Finally, this work did not explore correlations between performance and additional, discrete outcomes, such as re-admission, re-operation, and blood transfusion. It will be important to focus future outcomes research in areas that could be significantly impacted by task-based surgeon performance vs. others that might be influenced by surgeon decisions (e.g., length of stay).

In future research, we plan to explore how these methods can be extended to account for variations in how surgery is delivered across institutions and geographies, and to examine other procedures and specialties and their main contributors to efficiency and variability. Additionally, we plan to incorporate more patient factors and outcomes to extend this work beyond efficiency. Related work has shown promising results that link OPIs from critical steps of robotic-assisted prostatectomy to outcomes (13, 14). Finally, we plan to develop machine learning techniques that overcome manual video annotation (11, 13, 35, 36).

Conclusions

This study demonstrated the feasibility of using objective task analysis to identify main factors around surgeon and OR team behavior that influence overall procedure efficiency and variability. In particular, stomach dissection was identified as the most critical step, and energy activation rate within stomach dissection was the most critical behavior. Importantly, BMI did not influence overall efficiency of the surgeon, suggesting robotic-assisted surgery might decouple patient BMI and surgical efficiency. This is particularly important to deliver minimally invasive surgery to bariatric patients. We believe this data-driven objective task analysis approach could be used to provide actionable, surgeon-specific feedback that may also be generalized to be used by clinical teams to quantify and influence best practices for those aspects of surgery contributing most to overall efficiency and consistency.

Data Availability Statement

The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.

Ethics Statement

Ethical review and approval was not required for the study on human participants in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required for this study in accordance with the national legislation and the institutional requirements.

Author Contributions

MT performed the surgical procedures, collected data, contributed to study design, manuscript drafting, and revision. XL and ME performed statistical analysis, contributed to manuscript drafting, and revision. AJ contributed to study design, manuscript drafting, and revision. All authors read and approved the final manuscript.

Conflict of Interest

MT, XL, ME, and AJ were employees of Intuitive Surgical, Inc. However, MT was not affiliated with nor funded by Intuitive Surgical when the procedures were performed.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

The authors would like to thank Linlin Zhou for data processing, Yachna Sharma for task annotation processing, Rory Hand, Shane Braun, and Madeline Dean for annotation support, and Usha Kreaden for statistical consultation.

Supplementary Material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fsurg.2022.756522/full#supplementary-material

References

1. Bodenheimer T, Sinsky C. From triple to Quadruple aim: care of the patient requires care of the provider. Ann Fam Med. (2014) 12:573–6. doi: 10.1370/afm.1713

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Sheetz KH, Ibrahim AM, Nathan H, Dimick JB. Variation in surgical outcomes across networks of the highest-rated US hospitals. JAMA Surg. (2019) 154:510–5. doi: 10.1001/jamasurg.2019.0090

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Flum DR, Fisher N, Thompson J, Marcus-Smith M, Florence M, Pellegrini CA. Washington State's approach to variability in surgical processes/Outcomes: Surgical Clinical Outcomes Assessment Program (SCOAP). Surgery. (2005) 138:821–8. doi: 10.1016/j.surg.2005.07.026

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Wehrtmann FS., de la Garza JR, Kowalewski KF, Schmidt MW, Müller K, Tapking C, et al. Learning curves of laparoscopic Roux-en-Y gastric bypass and sleeve gastrectomy in bariatric surgery: a systematic review and introduction of a standardization. Obes Surg. (2020) 30:640–56. doi: 10.1007/s11695-019-04230-7

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Arnold BN, Thomas DC, Bhatnagar V, Blasberg JD, Wang Z, Boffa DJ, et al. Defining the learning curve in robot-assisted thoracoscopic lobectomy. Surgery (United States). (2019) 165:450–4. doi: 10.1016/j.surg.2018.06.011

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Boone BA, Zenati M, Hogg ME, Steve J, Moser AJ, Bartlett DL, et al. Assessment of quality outcomes for robotic pancreaticoduodenectomy: identification of the learning curve. JAMA Surg. (2015) 150:416–22. doi: 10.1001/jamasurg.2015.17

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Pernar LIM, Robertson FC, Tavakkoli A, Sheu EG, Brooks DC, Smink DS. An appraisal of the learning curve in robotic general surgery. Surg Endosc. (2017) 31:4583–96. doi: 10.1007/s00464-017-5520-2

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Beckmann JH, Bernsmeier A, Kersebaum JN, Mehdorn AS, von Schönfels W, Taivankhuu T, et al. The impact of robotics in learning Roux-en-Y gastric bypass: a retrospective analysis of 214 laparoscopic and robotic procedures: robotic vs. laparoscopic RYGB. Obes Surg. (2020) 30:2403–10. doi: 10.1007/s11695-020-04508-1

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Goh AC, Goldfarb DW, Sander JC, Miles BJ, Dunkin BJ. Global evaluative assessment of robotic skills: validation of a clinical assessment tool to measure robotic surgical skills. J Urol. (2012) 187:247–52. doi: 10.1016/j.juro.2011.09.032

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Martin JA, Regehr G, Reznick R, Macrae H, Murnaghan J, Hutchison C, et al. Objective structured assessment of technical skill (OSATS) for surgical residents. Br J Surg. (1997) 84:273–8. doi: 10.1046/j.1365-2168.1997.02502.x

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Hashimoto DA, Rosman G, Witkowski ER, Stafford C, Navarette-Welton AJ, Rattner DW, et al. Computer vision analysis of intraoperative video: automated recognition of operative steps in laparoscopic sleeve gastrectomy. Ann Surg. (2019) 270:414–21. doi: 10.1097/SLA.0000000000003460

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Addison P, Yoo A, Duarte-Ramos J, Addy J, Dechario S, Husk G, et al. Correlation between operative time and crowd-sourced skills assessment for robotic bariatric surgery. Surg Endosc. (2020) 35:5303–9 doi: 10.1007/s00464-020-08019-z

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Twinanda AP, Shehata S, Mutter D, Marescaux J, De Mathelin M, Padoy N. EndoNet: a deep architecture for recognition tasks on laparoscopic videos. IEEE Trans Med Imaging. (2017) 36:86–97. doi: 10.1109/TMI.2016.2593957

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Hung AJ, Chen J, Che Z, Nilanon T, Jarc A, Titus M, et al. Automated performance metrics and machine learning algorithms to measure surgeon performance and anticipate clinical outcomes in robotic surgery. JAMA Surg. (2018) 32:438–44. doi: 10.1001/jamasurg.2018.1512

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Chen J, Oh PJ, Cheng N, Shah A, Montez J, Jarc A, et al. Use of automated performance metrics to measure surgeon performance during robotic vesicourethral anastomosis and methodical development of a training tutorial. J Urol. (2018) 200:895–902. doi: 10.1016/j.juro.2018.05.080

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Hung AJ, Chen J, Ghodoussipour S, Oh PJ, Liu Z, Nguyen J, et al. A deep-learning model using automated performance metrics and clinical features to predict urinary continence recovery after robot-assisted radical prostatectomy. BJU Int. (2019) 124:487–95. doi: 10.1111/bju.14735

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Hung AJ, Chen J, Che Z, Nilanon T, Jarc A, Titus M, et al. Utilizing machine learning and automated performance metrics to evaluate robot-assisted radical prostatectomy performance and predict outcomes. J Endourol. (2018) 32:438–44. doi: 10.1089/end.2018.0035

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Jarc AM, Curet MJ. Viewpoint matters: objective performance metrics for surgeon endoscope control during robot-assisted surgery. Surg Endosc. (2017) 31:1192–202. doi: 10.1007/s00464-016-5090-8

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Lyman WB, Passeri MJ, Murphy K, Siddiqui IA, Khan AS, Iannitti DA, et al. An objective approach to evaluate novice robotic surgeons using a combination of kinematics and stepwise cumulative sum (CUSUM) analyses. Surg Endosc. (2020) 35:2765–72. doi: 10.1007/s00464-020-07708-z

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Brown KC, Bhattacharyya KD, Kulason S, Zia A, Jarc A. How to bring surgery to the next level: interpretable skills assessment in robotic-assisted surgery. Visc Med. (2020) 36:463–70. doi: 10.1159/000512437

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Birkmeyer JD, Finks JF, O'Reilly A, Oerline M, Carlin AM, Nunn AR, et al. Surgical skill and complication rates after bariatric surgery. N Engl J Med. (2013) 369:1434–42. doi: 10.1056/NEJMsa1300625

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Stulberg JJ, Huang R, Kreutzer L, Ban K, Champagne BJ, Steele SE, et al. Association between surgeon technical skills and patient outcomes. JAMA Surg. (2020) 155:960. doi: 10.1001/jamasurg.2020.3007

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Varban OA, Thumma JR, Finks JF, Carlin AM, Ghaferi AA, Dimick JB. Evaluating the effect of surgical skill on outcomes for laparoscopic sleeve gastrectomy: a video-based Study. Ann Surg. (2019) 273:766–71. doi: 10.1097/SLA.0000000000003385

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Varban OA, Thumma JR, Carlin AM, Finks JF, Ghaferi AA, Dimick JB. Peer assessment of operative videos with sleeve gastrectomy to determine optimal operative technique. J Am Coll Surg. (2020) 231:470–7. doi: 10.1016/j.jamcollsurg.2020.06.016

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Fecso AB, Kuzulugil SS, Babaoglu C, Bener AB, Grantcharov TP. Relationship between intraoperative non-technical performance and technical events in bariatric surgery. Br J Surg. (2018) 105:1044–50. doi: 10.1002/bjs.10811

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Sanchez BR, Mohr CJ, Morton JM, Safadi BY, Alami RS, Curet MJ. Comparison of totally robotic laparoscopic Roux-en-Y gastric bypass and traditional laparoscopic Roux-en-Y gastric bypass. Surg Obes Relat Dis. (2005) 1:549–54. doi: 10.1016/j.soard.2005.08.008

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Guyon I, Weston J, Barnhill S, Vapnik V. Gene selection for cancer classification using support vector machines. Mach Learn. (2002) 46:389–422. doi: 10.1023/A:1012487302797

CrossRef Full Text | Google Scholar

28. Tibshirani R. Regression shrinkage and selection via the Lasso. J R Stat Soc Ser B. (1996) 58:267–88. doi: 10.1111/j.2517-6161.1996.tb02080.x

CrossRef Full Text | Google Scholar

29. Santosa F, Symes WW. Linear inversion of band-limited reflection seismograms. SIAM J Sci Stat Comput. (1986) 7:1307–30. doi: 10.1137/0907087

CrossRef Full Text | Google Scholar

30. Lo HC, Wu SM. Reappraisal learning curve of laparoscopic Roux-en Y gastric bypass: retrospective results of one hundred and eight cases from a low-volume unit. BMC Surg. (2021) 21:1–8. doi: 10.1186/s12893-021-01058-w

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Dughayli M, Shimunov S, Johnson S, Baidoun F. Single-site robotic cholecystectomy: Comparison of clinical outcome and the learning curves in relation to surgeon experience in a community teaching hospital. BMC Surg. (2018) 18:1–7. doi: 10.1186/s12893-018-0373-8

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Jacobsen G, Berger R, Horgan S. The role of robotic surgery in morbid obesity. J Laparoendosc Adv Surg Tech A. (2003) 13:279–83. doi: 10.1089/109264203322333610

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Gray KD, Pomp A, Dakin G, Amanat S, Turnbull ZA, Samuels J, et al. Perioperative outcomes and anesthetic considerations of robotic bariatric surgery in a propensity-matched cohort of super obese and super-super obese patients. Surg Endosc. (2018) 32:4867–73. doi: 10.1007/s00464-018-6241-x

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Ritter EM, Gardner AK, Dunkin BJ, Schultz L, Pryor AD, Feldman L. Video-based assessment for laparoscopic fundoplication: initial development of a robust tool for operative performance assessment. Surg Endosc. (2020) 34:3176–83. doi: 10.1007/s00464-019-07089-y

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Zia A, Guo L, Zhou L, Essa I, Jarc A. Novel evaluation of surgical activity recognition models using task-based efficiency metrics. Int J Comput Assist Radiol Surg. (2019) 14:2155–63. doi: 10.1007/s11548-019-02025-w

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Kitaguchi D, Takeshita N, Matsuzaki H, Takano H, Owada Y, Enomoto T, et al. Real-time automatic surgical phase recognition in laparoscopic sigmoidectomy using the convolutional neural network-based deep learning approach. Surg Endosc. (2020) 34:4924–31. doi: 10.1007/s00464-019-07281-0

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: robotic-assisted surgery, sleeve gastrectomy, objective performance indicators, surgical task, workflow analysis, video analytics

Citation: Tousignant MR, Liu X, Ershad Langroodi M and Jarc AM (2022) Identification of Main Influencers of Surgical Efficiency and Variability Using Task-Level Objective Metrics: A Five-Year Robotic Sleeve Gastrectomy Case Series. Front. Surg. 9:756522. doi: 10.3389/fsurg.2022.756522

Received: 10 August 2021; Accepted: 07 March 2022;
Published: 02 May 2022.

Edited by:

Stefano Rausei, ASST Valle Olona, Italy

Reviewed by:

Alessandro Giardino, Casa di Cura Pederzoli, Italy
Lauren Kennedy-Metz, Harvard Medical School, United States

Copyright © 2022 Tousignant, Liu, Ershad Langroodi and Jarc. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Xi Liu, eGkubGl1QGludHVzdXJnLmNvbQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.