Quantifying the impact of surgical teams on each stage of the operating room process

Meyers, Adam; Daysalilar, Mertcan; Dagal, Arman; Wang, Michael; Kutlu, Onur; Akcin, Mehmet

doi:10.3389/fdgth.2024.1455477

ORIGINAL RESEARCH article

Front. Digit. Health, 03 October 2024

Sec. Health Informatics

Volume 6 - 2024 | https://doi.org/10.3389/fdgth.2024.1455477

This article is part of the Research TopicArtificial Intelligence for Smart Health: Learning, Simulation, and OptimizationView all 11 articles

Quantifying the impact of surgical teams on each stage of the operating room process

Adam Meyers^1*

Mertcan Daysalilar¹

Arman Dagal^2,3

Michael Wang³

Onur Kutlu⁴

Mehmet Akcin^1,4

¹Department of Industrial and Systems Engineering, University of Miami, Coral Gables, FL, United States
²Department of Anesthesiology, Perioperative Medicine, and Pain Management, Miller School of Medicine, University of Miami, Miami, FL, United States
³Department of Neurological Surgery, Miller School of Medicine, University of Miami, Miami, FL, United States
⁴DeWitt Daughtry Family Department of Surgery, Miller School of Medicine, University of Miami, Miami, FL, United States

Introduction: Operating room (OR) efficiency is a key factor in determining surgical healthcare costs. To enable targeted changes for improving OR efficiency, a comprehensive quantification of the underlying sources of variability contributing to OR efficiency is needed. Previous literature has focused on select stages of the OR process or on aggregate process times influencing efficiency. This study proposes to analyze the OR process in more fine-grained stages to better localize and quantify the impact of important factors.

Methods: Data spanning from 2019-2023 were obtained from a surgery center at a large academic hospital. Linear mixed models were developed to quantify the sources of variability in the OR process. The primary factors analyzed in this study included the primary surgeon, responsible anesthesia provider, primary circulating nurse, and procedure type. The OR process was segmented into eight stages that quantify eight process times, e.g., procedure duration and procedure start time delay. Model selection was performed to identify the key factors in each stage and to quantify variability.

Results: Procedure type accounted for the most variability in three process times and for 44.2% and 45.5% of variability, respectively, in procedure duration and OR time (defined as the total time the patient spent in the OR). Primary surgeon, however, accounted for the most variability in five of the eight process times and accounted for as much as 21.1% of variability. The primary circulating nurse was also found to be significant for all eight process times.

Discussion: The key findings of this study include the following. (1) It is crucial to segment the OR process into smaller, more homogeneous stages to more accurately assess the underlying sources of variability. (2) Variability in the aggregate quantity of OR time appears to mostly reflect the variability in procedure duration, which is a subinterval of OR time. (3) Primary surgeon has a larger effect on OR efficiency than previously reported in the literature and is an important factor throughout the entire OR process. (4) Primary circulating nurse is significant for all stages of the OR process, albeit their effect is small.

1 Introduction

Improving operating room (OR) efficiency is a key factor in controlling or reducing surgical healthcare costs (1), which are significant. Aggregate surgical healthcare expenditures comprised 29% of aggregate healthcare expenditures in the United States in 2005, as computed by Muñoz et al. (2). Moreover, aggregate surgical expenditures were forecasted to grow from 4.6% of US GDP in 2005 to 7.3% of US GDP in 2025 (2). In a more recent study by Childers and Maggard-Gibbons (3), the mean cost of ambulatory OR time across California hospitals in fiscal year 2014 was $36.14 per minute with a standard deviation of $19.53 per minute. Cerfolio et al. (4) report a significantly higher cost of $150 per minute of OR time in the main campus ORs at New York University Langone Health. Even with financial considerations aside, improving OR efficiency will likely improve patient safety, experience, and outcomes, decrease patient wait time, increase OR throughput, and improve surgical team and staff satisfaction (5, 6).

Improving OR efficiency is a multifaceted problem, and several metrics have been investigated by researchers.¹ A common approach to improving efficiency is to improve the utilization of the OR, that is, by minimizing both underutilization and overutilization (9, 10). Underutilization occurs when an OR lies unused due to cases being completed earlier than predicted, and overutilization occurs when an OR is used beyond its predicted or allotted time (5). Such inefficiencies are caused in large part by variability in OR time (11, 12), typically defined as the duration of time from when the patient is wheeled into the OR to the time the patient is wheeled out. Indeed, studies by Bokshan et al. (13) and Allen et al. (14) have shown OR time to be a significant driver of increased surgical costs. To reduce inefficiencies and associated costs, researchers have sought to identify the sources of variability in OR time. The primary conclusion in the literature is that procedure characteristics, namely, precise procedure type and type of anesthesia, are the main factors explaining the variation in OR time, followed by surgical team characteristics, primarily the surgeon (11, 12, 15, 16). Other factors such as patient characteristics (e.g., BMI) or other surgical team factors, such as the anesthesiologist, are generally found to be insignificant.

OR time, however, is an aggregate quantity that encompasses several stages of the OR process, and it does not span the entire OR process (Figure 1). As such, it has the following potential downsides. First, OR time does not include all stages of the OR process. In this study’s dataset, which consists of timestamps taken from a surgery center located in a large academic hospital, OR time does not include room setup duration or room cleanup duration, nor any delays in starting the next case or beginning anesthesia induction. In addition, the dataset shows that anesthesia induction begins, on average, approximately two minutes before the patient is wheeled into the OR (i.e., two minutes before OR time begins). Thus, analyzing OR time alone will not allow for ascertaining the sources of variability in all stages of the OR process, and it may also contain some inaccuracies due to the starting and ending points of OR time not lining up with the activities in the OR process. Second, OR time itself covers several different stages of the OR process, including anesthesia induction, procedure duration, and delays in the procedure start time and in the time the patient is wheeled out after the procedure is completed. It is reasonable to hypothesize that the above four stages do not have the same sources of variability, or that shared sources of variability do not account for the same proportion of variability across all stages of the OR process. Therefore, this study’s approach is to segment the OR process into more fine-grained, homogeneous stages and assess the sources of variability within each stage.

Figure 1

Figure 1. Visual depiction of the OR process, including timestamps and the span of time each OR process time covers. Formulas for each process time are given in Table 1.

Past studies have focused on other parts of the OR process besides OR time and the more fine-grained stages that comprise OR time, including surgical procedure duration, anesthesia-related times, start time delays, and turnover time (refer to Section 2.1). However, an effort to quantify the sources of variability across all fine-grained stages of the OR process is currently lacking. Based on timing markers obtained from a surgery center located in a large academic hospital, this paper quantifies the sources of variability in several OR process stages, including first case start time delay, setup duration, anesthesia induction time, procedure start time delay, procedure duration, wheels out delay, cleanup duration, and OR time (refer to Section 3.1). The focus of this paper is on quantifying the extent to which type of procedure and members of the surgical team - primary surgeon, responsible anesthesia provider, and primary circulating nurse - and their interactions explain the variation in the fine-grained stages of the OR process. By better understanding the influence of various important factors, stakeholders and researchers can better pinpoint where interventions to improve efficiency should be targeted.

The rest of this paper is organized as follows. Section 2 provides a literature review on previous approaches to assess or improve efficiency within different stages of the OR process. Section 3 describes the dataset, process times, statistical approach, and model selection. Section 4 describe the results of the statistical analysis, primarily providing a decomposition of variability for each process time. Section 5 discusses the primary findings of this study and comments on this study’s limitations and opportunities for future work. Section 6 provides concluding remarks. Additional tables and figures generated in this study are available in the Supplementary Material.

2 Research background

2.1 Related work in determining the factors driving the stages of the OR process

Numerous studies have investigated the various factors purported to cause or explain the variation in the OR process. Such work is motivated by the idea that, for OR efficiency to be improved, relevant stakeholders must first be informed about the primary factors driving OR inefficiencies. In addition, identifying the primary factors will allow for better predictive modeling, which in turn will allow for more accurate OR case scheduling to reduce OR underutilization and overutilization.

Variability in OR time (i.e., “wheels in” to “wheels out” time) is cited as a primary cause of inefficient OR utilization (11, 12). When a case lasts longer than planned, subsequent cases will either be delayed, potentially leading to OR overutilization, or cancelled, resulting in less OR revenue, patient dissatisfaction, and reduced quotas for surgical teams. When a case lasts shorter than expected, the OR will likely lie underutilized for some period of time, wasting resources. Exploring the factors that explain the variability in OR time, Dexter et al. (15) verified earlier findings, e.g., in Strum et al. (16), that reported the importance of three factors: precise procedure information, surgical team, and anesthetic type in predicting OR time. Eijkemans et al. (11) later identified additional factors, including the surgeon’s estimate of total surgical time, operation characteristics (e.g., number of separate procedures), and team characteristics (e.g., number of surgeons). van Eijk et al. (12) found that type of procedure is the overwhelming predictor of OR time variability, with surgeon having a small but significant effect and anesthesiologist having a negligible effect. Many studies show that patient characteristics (e.g., body mass index) have little effect (11, 12).

Some studies have investigated the sources of variability in other parts of the OR process and in more fine-grained stages. The most commonly examined stage is the (surgical) procedure duration, which is typically the longest stage that comprises OR time. For instance, Strum et al. (16) found the surgeon to be the most important source of variability in procedure duration, followed by anesthesia type. Patoir et al. (17) found surgeon characteristics, center location, and surgical procedure and patient characteristics accounted for much of the variation in procedure duration. Additional factors were explored in the literature, such as surgeon factors (e.g, team composition factors, such as the presence of residents) (18), factors that increase the expected duration (e.g., communication failures) by Gillespie et al. (19), and operational (e.g., OR assignment) and temporal (e.g., whether a case was started after 5:00PM) factors by Kayis et al. (9). However, many of the studies focusing on procedure duration, e.g., Strum et al. (16), Stepaniak et al. (18), and Kayis et al. (9), perform statistical analyses separately for each surgical speciality or coarse-grained category rather than considering holistically how the specific procedure type, as indicated by a fine-grained category such as the American Medical Association’s Current Procedure Terminology codes (refer to Section 3.2), accounts for the variation in procedure duration.

Other parts of the OR process explored in the literature are anesthesia-related times. For instance, Kougias et al. (20) found in their multivariate regression analysis that procedure type, anesthesia type, and BMI were statistically significant predictors of anesthesia induction time, while procedure type, anesthesia type, and operative case length were statistically significant predictors of anesthesia recovery time. van Veen-Berkx et al. (21) found that scheduling accuracy improved when looking at anesthesia-controlled time (ACT) as a proportion of total procedure time.² Few studies, however, have examined the impact of various human factors involved in the OR process on anesthesia-related times, including anesthesiologists.

Other fine-grained stages of the OR process that have been explored include start time delays, such as procedure start time delay, (any) case start time delay, and first case start time delay. Does et al. (23) employed Six Sigma techniques (24) to identify poor planning and scheduling as the primary factor causing delays in the start times of surgical procedures. The authors noted that surgical specialty and anesthesia technique also influence start time delays. A review by Halim et al. (25) identified several factors that can improve start time, including financial incentives for staff, education strategies, perioperative protocols and systems, surgical team communication, the “golden patient” initiative,³ and the “productive operating theatre” scheme⁴ A more specific approach is to look only at delays in the first case of the day, with the justification being to mitigate the cascading effect a delay in the first case has on subsequent cases in the OR. Cox Bauer et al. (27) analyzed data across three high-volume urban hospitals and found that, for cases with a documented reason for delay, the physician was the most reported reason for delay at 52%, followed in descending order by anesthesia, patient, staff, other sources, and facility. The authors did perform a regression analysis finding patient age, occurrence of late arrival, department, and facility to be significant predictors of delay. However, neither approach gives a quantification of the overall impact of a predictor on first case start time delay. Other similar work has looked at more specific events such as delays in the start of a subsequent case when the preceding case was performed by a different surgeon (28) and remaining time to exit the OR after surgical closure begins (29).

An additional stage of the OR process explored in the literature is turnover time, which is the duration of time from when a patient is wheeled out until the next patient is wheeled in. Thus, turnover time is all the remaining time in the OR process not covered by OR time (Figure 1). Bhatt et al. (30) took a systems-level approach to improve turnover time, which focused on developing a consistent “room ready” designation to reduce variability, implementing parallel processing to ensure room readiness and patient readiness occur simultaneously, and improving perioperative communication. Cerfolio et al. (4) piloted a Performance Improvement Team, called “PIT Crew,” that performed lean processing and value mapping to improve efficiency in the turnover time period. Goldhaber et al. (31) reduced turnover times significantly by collecting more granular data within the turnover time period and displaying these data to teams for regular review and accountability. The turnover time period was further divided into the followings segments: wheels out time $\to$ cleanup start time $\to$ cleanup complete time $\to$ setup start time $\to$ time room is ready for patient $\to$ wheels in time. Few studies, however, have taken the approach of quantifying the factors that explain variation in turnover time or the stages that comprise the turnover time period.

2.2 State-of-the-practice methodologies for determining important factors

There are several approaches in the related literature that seek to identify the important factors accounting for the variation in the OR process. Primary methods found in the literature include performing basic statistical analysis, fitting known probability distributions to OR process times, utilizing regression approaches for inference or prediction, utilizing systems-level approaches for improving process efficiency, and, more recently, training machine learning models for prediction.

Traditional statistical analysis, such as descriptive statistics and hypothesis testing methods, is a fundamental approach to gaining insights from gathered datasets. Such analysis dates back many decades but is still utilized today, particularly with healthcare data, as it provides insights and an overview of process efficiency. Dexter et al. (22) used two-group, one-sided $t$ -tests to determine if eliminating ACT would allow for additional cases to be completed during a typical 8-h workday. Martin and Langell (32) used Cuzick’s test for trend to evaluate whether pre-OR timeouts and performance pay improved on-time starts, OR utilization, and OR costs. Simmons et al. (33) was interested in determining if fine-grained CPT codes, compared to coarser-grained surgical specialties, would improve accuracy in surgical scheduling. They utilized the $I^{2}$ statistic and Levine’s test to assess heterogeneity in the means and variances, respectively, of ACTs and surgical-controlled times (SCTs).⁵ While traditional methods of statistical analysis can provide interpretable and meaningful summaries of data to answer questions of interest, such as determining whether differences in groups are significant following an intervention, further quantification capabilities are needed to assess the impact of factors on OR efficiency.

An early line of research involved finding distributions with a good fit to OR process time data. A main contributing paper in this approach is that of Strum et al. (34) in which the authors recommended using the lognormal distribution to model surgical procedure times. Stepaniak et al. (35) mostly corroborated the findings of Strum et al. (34), but Kayis et al. (9) found the lognormal distribution did not generally fit surgery duration well at the procedure level. Joustra et al. (36) more comprehensively fit a number of hazard models. However, as mentioned in Joustra et al. (36), such methods are less concerned with identifying the factors contributing to OR efficiency and more concerned with prediction.

Regression models, on the other hand, do allow for evaluating sources of variability in OR process times. Strum et al. (16) employed main-effects ANOVA modeling with the logarithm of surgical time and total procedure time as separate responses and found primary surgeon and type of anesthesia to be important predictors of variability. Does et al. (23) and Stepaniak et al. (18) also utilized ANOVA models to assess the importance of select factors on start time delays and surgical procedure times. Regression modeling is similarly used to identify factors that influence OR process times. Linear regression is especially utilized for this purpose, such as in Silber et al. (37), Ying Li and Huang (38), Gillespie et al. (19), and van Veen-Berkx et al. (21). Linear regression models also have added functionalities over ANOVA models, such as regularization techniques to avoid overfitting or to perform variable selection, e.g., LASSO used in Wang et al. (39), and incorporating nonlinear terms such as in Wang et al. (40).

The literature above utilizing linear regression methods tends to treat all factors as fixed effects. However, in a fixed effects setting, when certain units, e.g., surgeons, have few observations, parameter estimates may have high sample-to-sample variability. Thus, the parameter estimates may vary substantially from dataset to dataset, implying that the model built on a given dataset may not be reliable (41). In addition, fixed effects models require dummy variables to be created for each unit (e.g., each surgeon), and a coefficient must be estimated for each unit. If a factor contains many units (this study’s dataset contains over one hundred surgeons), then estimating a large number of coefficients reduces the model’s degrees of freedom, diminishes the model’s power, and increases the standard errors of the coefficient estimates (41). Furthermore, the present study is not concerned with estimating the effects of individual surgeons, anesthesiologists, etc., but rather the effect of these groups as a whole. For such reasons, previous papers in assessing the effects of different factors on OR process times have employed linear mixed model (LMM) approaches, which incorporate both fixed and random effects, with great success, e.g., Dexter and Ledolter (42), Eijkemans et al. (11), van Eijk et al. (12). This paper also takes an LMM approach for the above reasons.

More recently, machine learning (ML) has become a popular method for predicting quantities in the OR process. Master et al. (43) found that regression tree methods, such as gradient boosted regression trees, outperformed historical averaging, surgeon expert predictions, and other ML methods in the literature when predicting pediatric surgical durations. ML methods combined with surgeon predictions were also among the top-performing methods in Master et al. (43). Other research has used ML to improve predictions of OR process times (44–48). While ML methods may improve prediction, Wang and Dexter (49) notes that implementing ML software to increase prediction accuracy will not increase productivity unless accompanied by more allotted case time in a typical workday. More importantly, the objective of this paper is to quantify the impact that various human factors have on OR efficiency. LMMs allow for quantifying the proportion of variance explained in a response by each factor of interest. ML methods do have some options for determining similar values of impact, including variable importance in classification and regression trees (CART) (45) and Shapley additive explanations (SHAP) values (46). However, variable importance metrics may not correlate well with model variance explained by features (50), particularly when the model overfits the data on which it’s trained, which is a common issue with CART methods (43). The variable importance values are also typically reported in a relative fashion (to other variables) and thus do not provide an absolute assessment of the impact a factor has on a response. SHAP values may provide a better alternative in these regards, however they are not as well-established as linear regression-based metrics and may have issues as feature importance metrics (51).

3 Materials and methods

3.1 Dataset and subjects

Data spanning from January 2, 2019 to June 30, 2023 were obtained from a surgery center in the University of Miami Hospital. The surgery center incorporates six operating rooms and a dedicated preoperative area and postoperative recovery unit. The dataset originally contained 12,375 cases, before data cleaning was performed (detailed below). The dataset included the following timestamps: setup start time, anesthesia start time, wheels in time (i.e., when the patient enters the OR), anesthesia ready time, procedure start time, procedure complete time, wheels out time (i.e., when the patient exits the OR), and cleanup end time.⁶

This study examined various critical stages of the OR process rather than focusing solely on one stage or on aggregate process times encompassing several stages. The OR process times explored in this study included first case start time delay, setup duration, anesthesia induction time, procedure start time delay, procedure duration, wheels out delay, and cleanup duration.⁷ Each OR process time was defined as the elapsed time between two timestamps as described in Table 1. Figure 1 depicts the timestamps and process times.

Table 1

Table 1. Formulas for calculating each OR process time and number of cases after individual data cleaning for each process time.

There is a strong focus in previous literature on the aggregate quantity, OR time, defined as the elapsed time between when the patient is wheeled into the OR to when the patient is wheeled out. It is hypothesized that the factors driving an aggregate quantity such as OR time, which covers various stages of the OR process (Figure 1), would not necessarily be identical across all stages comprising OR time, nor that shared sources of variability in OR time would explain the same proportion of variation in each stage comprising OR time. To evaluate these hypotheses, OR time was also included as a process time for comparison to the other seven process times.

This study’s statistical analysis (refer to Section 3.2) involved building separate regression models using each OR process time as a univariate response for a total of eight models. The subset of cases containing errors corresponding to one process time were not necessarily the same as the subset of cases containing errors for a different process time. Then, because separate models were developed for each process time, the choice was made to clean the data separately for each process time, maximizing the amount of data available for each model. Data cleaning involved removing any cases with missing data, outliers, or errors. In addition, any process time labeled as a “delay” only included delay times that were positive. For instance, if the first case started on or before the day’s start time, e.g., 7:30 AM, then this case was removed as there was no “delay” in the commencement of the first case. After removal of such cases for first case start time delay, the number of cases available for fitting the statistical model was 3,543 cases (Table 1). If instead the choice was made to remove the same subset of cases for all process times, then while each of the eight models would have a common pool of data, the data size would be significantly reduced and the results would not be as robust. The number of cases available after data cleaning for each process time is provided in Table 1.

All the OR process times exhibited right skewness. For instance, Figures 2(a,b) shows the original distributions of first case start time delay and procedure duration, where the right skewness is evident.⁸ To address this, a common approach in the relevant literature is to use a logarithm transformation. Eijkemans et al. (11) and van Eijk et al. (12) used the log transformation on OR time, and Strum et al. (34) and Stepaniak et al. (35) showed that OR time and procedure duration follow lognormal distributions, implying that log-transforming these process times will approximately yield a normal distribution more appropriate for linear regression modeling methods. Does et al. (23) were concerned with reducing start time delays of procedures, and to address right skewness they opted for a more thorough Box-Cox transformation. However, the optimal choice for the parameter $λ$ in the Box-Cox tranformation was found to be zero in Does et al. (23), which is simply the log transformation. This study investigated several transformations, including log, square root, Box-Cox, and more. For many of the process times, the log transformation was not “optimal” in the sense of producing a distribution that most closely fits a normal distribution relative to all other transformations, however it was near-optimal for all process times. Moreover, given that the previous literature concluded the log transformation is appropriate for several OR process times and that the log transformation has better interpretability (in contrast to, e.g., the Box-Cox transformation), the logarithm was used to transform all process times in this study.

Figure 2

Figure 2. Histograms for (a) first cast start time delay and (b) procedure duration before transformation and (c) first case start time delay and (d) procedure duration after a log transformation.

3.2 Statistical analyses

The primary objective of this study was to quantify the extent to which the variability observed in each OR process time could be attributed to four key factors: type of procedure, primary surgeon, responsible anesthesia provider, and primary circulating nurse. These factors will henceforth be referred to as “procedure,” “surgeon,” “anesthesiologist,” and “circulator,” respectively. Such analyses can provide a more precise account and quantification of the impact each factor has on each fine-grained stage of the OR process.

To quantify sources of variability in the OR process times, a linear mixed model (LMM) approach was used. An LMM was built separately for each of the eight process times, so that the sources of variability for each stage of the OR process could be assessed and quantified. The primary factors of interest, i.e., procedure, surgeon, anesthesiologist, and circulator were treated as random effects. Table 2 shows the number of levels of each factor that occurs in each process time’s corresponding dataset (after data cleaning).

Table 2

Table 2. Number of levels of each random effect in the cleaned dataset for each OR process time.

The four primary factors were treated as random effects for multiple reasons. First, treating a factor as a random effect allows for estimating the factor’s variance and proportion of variance explained in the response (i.e., process time). Second, Table 2 shows that each of the four primary factors has many levels, and treating each as a fixed effect would require estimating tens to hundreds of coefficients, reducing the degrees of freedom in the model. This study is also not concerned with, e.g., a particular surgeon’s effect, but rather the impact of the group of surgeons as a whole. Third, the procedures, surgeons, anesthesiologists, and circulators included in the dataset do not necessarily encompass the entire populations of these factors. Thus, treating the factors as random effects allowed for accomplishing this study’s research objective and was an appropriate choice given the dataset. Note that only random intercepts were used in the LMMs.

Procedure was categorized based on the American Medical Association’s Current Procedure Terminology (CPT) codes (52). Several past studies have identified the importance of categorizing procedures with high granularity, e.g., with CPT codes, rather than with low granularity, e.g., with surgical specialities such as neurosurgery, gynecology, etc. (33, 34, 38). In particular, a recent study by Simmons et al. (33) examined over 30,000 surgical cases in an academic hospital and found that both the mean and variance of ACT and SCT varied significantly between CPT codes within specialities. Their results suggest that the use of more granular categories, specifically CPT codes, will enhance the accuracy of subsequent analysis and scheduling. Accordingly, this study used the primary CPT code for each case as the procedure type.

Other factors were available in the University of Miami Hospital’s database that could influence the process times. Domain expertise of this study’s authors was used to select the factors believed to impact OR process efficiency. Six factors were included; they are shown in Table 3. “Position,” for instance, was included as a proxy measure of the seniority and expertise of the primary surgeon. More experienced and senior surgeons were expected to be more efficient and consequently have a positive impact on OR efficiency. The six factors were treated as fixed effects for the following reasons. First, the factors were of less interest in this study and were expected to only marginally improve the model. The objective of this study was to quantify the sources of variability in the process times, focusing on procedure, surgeon, anesthesiologist, and circulator. Second, every factor had no more than five levels, with the exception of the number of procedures, which had thirty-two possible levels.⁹ Third, the levels of the factors were exhaustive of the population, whereas the levels of the random effects were only a subset of their respective populations.

Table 3

Table 3. Description of fixed effects used in the LMMs.

As stated previously, LMMs were separately built for each of the eight process times.¹⁰ Before model selection was performed, a univariate analysis of each random effect was conducted to quantify the improvement in each model by the addition of a single random effect. Two base models were used - one consisting of a fixed intercept and the other a fixed intercept plus all six fixed effects. To each base model, a single random effect was added and the adjusted intraclass correlation coefficient (ICC) was calculated for each random effect, given by

{ICC}_{( adj)} = \frac{σ_{α}^{2}}{σ_{α}^{2} + σ_{ϵ}^{2}}, (1)

where $σ_{α}^{2}$ refers to the variance of the random effect. ${ICC}_{( adj)}$ may be interpreted as the proportion of variance explained in the logarithm of the process time by the random effect, after controlling for the fixed effects.

After univariate analysis, multivariate analysis was performed to assess the impact of each random effect (in the presence of other significant random effects) on the process time and to control for fixed effects. Model selection proceeded as follows.¹¹

First, a base model was developed, given by

\begin{matrix} y_{i} = β_{0} + X_{i} β + α_{j [i]} + ϵ_{i}, \\ α_{j} \sim N (0, σ_{α}^{2}), \\ ϵ_{i} \sim N (0, σ_{ϵ}^{2}), \end{matrix} (2)

where $i = 1, \dots, n$ and $j = 1, \dots, J$ are the indices of the observations and procedure levels, respectively, $y_{i}$ represents the $i$ th observation of the logarithm of the respective process time, $β_{0}$ is the fixed intercept, $β$ is the vector of fixed slopes, $X_{i}$ is the vector of the $i$ th observations of all variables associated with the fixed effects (Table 3),¹² $α_{j [i]}$ is a random intercept for procedure, $j [i]$ denotes to which procedure the $i$ th observation belongs, $ϵ_{i}$ is the error term, and $σ_{α}^{2}$ and $σ_{ϵ}^{2}$ are the variances of the random effect and error, respectively.

Second, note that the base model in Equation 2 only includes a random intercept for procedure. Procedure was previously found in multiple studies to be the primary source of variability in various OR process times (11, 12, 16, 35). Thus, with procedure ostensibly explaining much of the variation in the process times, it was reasonable to begin the base model with only procedure as a random intercept. Each additional random effect was subsequently and cumulatively added to determine if the additional random effect should be retained in the final model. A chi-squared test was used to determine the significance¹³ of a model with one additional random effect compared to the (previous) model without the random effect. Akaike information criterion (AIC) was also reported as it penalizes the addition of more terms to the LMM. However, the chi-squared test was solely used for determining which random effects to keep, since AIC is more appropriate for prediction which is not the objective of this study.

Third, fixed effects were individually examined to determine whether each should be retained in the final model for each process time. For a given process time, a new base model was formed by adding all significant random effects found in step two above to Equation 2. Each of the six fixed effects were individually removed from the new base model, while retaining all other fixed effects, and chi-squared tests were performed and AIC values were computed. If the new base model (containing all fixed effects and significant random effects from step two above) was found significant over the new base model without the individual fixed effect (according to the chi-squared test), then the fixed effect was retained for the final model.

Fourth, the final model for a given process time was formed by adding all significant random effects from step two above and removing all fixed effects according to the procedure described in step three above. To assess the impact of each random effect retained in the final model, ${ICC}_{( adj)}$ in Equation 1 was calculated for each random effect. In addition, model ICC values were calculated to give the overall proportion of variance explained in the logarithm of the process time by all random effects. Both unadjusted and adjusted model ICC values were reported. The unadjusted model ICC, denoted ${ICC}_{LMM}$ , and adjusted model ICC, denoted ${ICC}_{LMM( adj)}$ , are given by

{ICC}_{LMM} = \frac{σ_{r}^{2}}{σ_{r}^{2} + σ_{\,f}^{2} + σ_{ϵ}^{2}} and {ICC}_{LMM( adj)} = \frac{σ_{r}^{2}}{σ_{r}^{2} + σ_{ϵ}^{2}}, (3)

where $σ_{r}^{2}$ and $σ_{\,f}^{2}$ are the variances explained by all random and fixed effects, respectively.

4 Results

Figure 3 shows a summary of the data, after cleaning, for each (untransformed) process time and random effect. To calculate the values of a given box plot, the times of the corresponding process time (e.g., procedure duration) were grouped according to the levels of the corresponding random effect (e.g., surgeon) and the median was taken for each level. For all process times, the random effect of procedure displays the highest dispersion through larger interquartile ranges and wider outliers (dots). As seen in Table 2, procedure also has significantly more levels than any other random effect. However, this alone does not explain the higher dispersion observed in procedure. Rather, it is likely that procedure is an important factor, which also agrees with much of the literature concluding that procedure is the primary source of variability in various OR process times (11, 12, 16, 35). Figure 3 also shows that all process times are right-skewed, as indicated by the median line being closer to the first quartile (bottom of box) and the upper tails and outliers extending far upwards, particularly in the box plots corresponding to the random effect of procedure. Some process times only show mild skewness, such as cleanup duration.¹⁴

Figure 3

Figure 3. Box plots by random effect for each (untransformed) process time: (A) first case start time delay, (B) setup duration, (C) anesthesia induction time, (D) procedure start time delay, (E) procedure duration, (F) wheels out delay, (G) cleanup duration, (H) OR time. The values used to generate each box plot were the median times for each level of a given random effect and for each process time.

Table 4 and Supplementary Tables S2–S8 provide an assessment of the individual impact of each random effect, with and without fixed effects. In Table 4 and Supplementary Tables S2–S8, ${ICC}_{(adj)}$ shows a small reduction when fixed effects are included; thus, the fixed effects included in this study do not explain much variation in the logarithms of the process times. Of all main random effects, procedure shows the largest ${ICC}_{(adj)}$ for most of the process times. This observation is supported by Figure 3 in that the box plots associated with procedure show the largest variation. Exceptions include first case start time delay (Supplementary Table S2) and cleanup duration (Supplementary Table S7) in which surgeon shows the largest ${ICC}_{(adj)}$ .¹⁵ In other cases, surgeon is not far behind procedure in terms of ${ICC}_{(adj)}$ , including setup duration (Supplementary Table S3), anesthesia induction time (Supplementary Table S4), procedure start time delay (Supplementary Table S5), and wheels out delay (Supplementary Table S6). Procedure duration (Table 4) and OR time (Supplementary Table S8) are the process times where procedure explains moderately more variation than surgeon.¹⁶ While the fact that procedure and surgeon accounting for the most variability could in part be due to both factors having many levels, circulator also shows approximately the same number of levels as surgeon (Table 2), yet it typically accounted for very little of the variability. One final observation from Table 4 and Supplementary Tables S2–S8 is that the interaction terms typically have higher ${ICC}_{(adj)}$ than their main effect counterparts, but the gain is marginal.

Table 4

Table 4. Univariate assessment of random effects when using procedure duration as the response.

Model selection was performed for each process time as described in Section 3.2. The model selection process is illustrated with procedure duration (Table 5).¹⁷ The $p$ -values were determined from performing chi-squared tests between each model and its previous (nested) model in Table 5, and they were used to determine whether to retain a particular random effect for the final LMM. In the case of procedure duration (and OR time; Supplementary Table S15), all main effects and interactions were determined to be significant and were retained for the final model. The gain in AIC exhibited by every random effect in addition to procedure indicates that including each term will likely improve prediction. The number of final models in which each random effect appeared is shown in Table 6. Procedure was by default included in every model, but surgeon, circulator, and the interaction of procedure and circulator also appeared in every model. Anesthesiologist appeared in all models except for that of first case start time delay.

Table 5

Table 5. Model selection for choosing random effects in the LMM where procedure duration is the response.

Table 6

Table 6. Number of final models in which each random and fixed effect appeared.

A new model for each process time was formed by augmenting the base model (Equation 2) with the significant random effects shown in Table 5 and Supplementary Tables S9–S15. Then the individual impact of each fixed effect on the performance of the augmented LMM was assessed (refer to Section 3.2). Table 7 shows the performance associated with the fixed effects for procedure duration.¹⁸ Based on the $p$ -values, the fixed effects retained for the final model for procedure duration were the number of procedures, procedure level, and patient class. Table 6 shows the number of final models for which each fixed effect was retained. Patient class was found significant for every process time and the number of procedures was found significant for all process times except first case start time delay (Supplementary Table S16) and cleanup duration (Supplementary Table S21).

Table 7

Table 7. Model selection for choosing fixed effects in the LMM where procedure duration is the response.

${ICC}_{( adj)}$ (Equation 1) was calculated for each random effect appearing in each final model, and model ICC values, ${ICC}_{LMM}$ and ${ICC}_{LMM( adj)}$ (Equation 3), were also calculated for each process time (Table 8). From Table 8, it is observed that surgeon is the random effect with the highest ${ICC}_{( adj)}$ value for five of the process times, including first case start time delay, setup duration, procedure start time delay, wheels out delay, and cleanup duration. However, the highest ${ICC}_{( adj)}$ value surgeon obtains is 21.1% for procedure start time delay. Procedure has the highest ${ICC}_{( adj)}$ value for all other process times, including anesthesia induction time, procedure duration, and OR time. Procedure accounted for 44.2% and 45.5% of variability in the logarithm of procedure duration and OR time, respectively. For all other process times, procedure accounted for approximately 11% of variation or less. While anesthesiologist was found significant for seven process times, it accounted for at most 1.1% of variation (wheels out delay). Interestingly, circulator was found significant for all models and accounted for as much as 3.4% of variation (wheels out delay). However, both anesthesiologist and circulator do not individually account for much variation.

Table 8

Table 8. ${ICC}_{( adj)}$ values for each random effect (Equation 1) appearing in each final model.

Table 8 also shows several significant interaction terms. In particular, the interaction of procedure and circulator was significant for all models. In many cases, this interaction term accounted for more variation than circulator individually. This suggests that the effect of the primary circulating nurse is significant but their effect can depend on the type of procedure. In addition, the interaction of procedure and surgeon was significant for five models and accounted for 2.2%–8.7% of variation. This also suggests the effect of the surgeon depends on the procedure. Lastly, the interaction of surgeon and circulator accounted for a modest amount of variance in the logarithms of setup duration (6.5%) and cleanup duration (8.6%), suggesting a synergistic effect of surgical teams in some stages of the OR process.

Overall, Table 8 shows that the primary factors examined in this study - procedure type, primary surgeon, responsible anesthesia provider, and primary circulating nurse - are most impactful on procedure duration and OR time, accounting for 67.5% and 69.7% of variation (in the logarithms), respectively, after fixed effects have been accounted for. The primary factors also explained a moderate amount of variation in the logarithm of procedure start time delay (43.5%), and were mildly impactful on setup duration (32.1%), anesthesia induction time (22.3%), wheels out delay (26.5%), and cleanup duration (28.7%). The primary factors accounted for very little of the variation in the logarithm of first case start time delay (11.6%). Finally, it is noted that there were little differences between ${ICC}_{LMM( adj)}$ and ${ICC}_{LMM}$ , further reinforcing that the fixed effects included in this study had little impact on the process times.

5 Discussion

5.1 Primary findings

The present study made several findings that both complement and add to the existing literature on OR efficiency. First, this study shows that, when investigating the impact of factors on the OR process, a fine-grained approach is necessary to pinpoint where in the process, and by how much, each factor makes an impact. In Section 1, it was hypothesized that the fine-grained stages of the OR process do not consist of the same sources of variability, nor that the common sources of variability account for the same proportion of variance in each stage. The results of this study support the above hypotheses (Table 8). Notably, OR time is an aggregate quantity consisting of the stages of the OR process in which the patient is present in the OR (i.e., “wheels in” to “wheels out”). However, the results of this study indicate that the quantification of variability in OR time mainly reflects the quantification of variability in procedure duration. Comparing the two process times in Table 8, their variabilities roughly decompose in the same way. For example, procedure accounted for 45.5% and 44.2% and surgeon for 13.3% and 10.8% of variability in the logarithms of OR time and procedure duration, respectively. Moreover, the random effects overall accounted for 69.7% and 67.5% of variability in the logarithms of OR time and procedure duration, respectively. In addition to procedure duration, OR time also comprises the time intervals associated with (a large proportion of) anesthesia induction time, procedure start time delay, and wheels out delay. However, the decompositions of variability for the latter three process times bear little resemblance to that of OR time. Thus, what happens in the OR during the procedure is mostly what is driving the aggregate quantity of OR time. As a result, interventions for improving efficiency in OR time should be focused on the procedure stage.

The second primary finding regards the impacts of each human factor. In particular, the primary surgeon had a larger impact in this study than what was previously reported in the literature. For instance, van Eijk et al. (12) found that the primary surgeon and second surgeon¹⁹ only accounted for a combined 4.8% of the variability in the logarithm of OR time. In the present study, however, primary surgeon alone accounted for 13.3% of variability in the logarithm of OR time (Table 8). Surgeon also accounted for a substantial 21.1% of variability in the logarithm of procedure start time delay and for at least 7% in the logarithms of all other process times (Table 8). The above results suggest that the primary surgeon (and other surgeons in the team) have moderate impacts not only on procedure duration, but also on many stages of the OR process. The importance of the surgeon was stressed in previous literature, e.g., Strum et al. (16), however a quantification of the variability due to surgeon was usually not provided. Moreover, the impact of the surgeon depends in part on the procedure, as seen by the significant interaction term of procedure and surgeon (Table 8). Indeed, Strum et al. (16) found that variability in surgical time increased as procedure time increased, indicating an interaction effect between type of procedure and surgeon.

In agreement with previous literature, responsible anesthesia provider was often a significant factor but not impactful on OR efficiency (12, 16). Surprisingly, responsible anesthesia provider had little impact on the anesthesia-controlled times, including anesthesia induction time and wheels out delay, the latter of which includes the patient’s emergence from anesthesia. Other factors not included in this study, such as patient and operation characteristics, may be important for accounting for variability in anesthesia-controlled times (54).

Lastly, this study found the primary circulating nurse, a less studied human factor in the literature regarding OR efficiency, to be a significant factor in all stages of the OR process. This is reasonable because the circulating nurse, sometimes called the “perioperative” nurse, is involved before the surgery (e.g., transporting the patient and preparing the patient for surgery), during the surgery (e.g., assisting with equipment), and after the surgery (e.g., monitoring the patient) (55). In this study, the circulating nurse had their largest effect on wheels out delay and setup duration, accounting for 3.4% and 2.3% of variability (Table 8), respectively. In addition, the interaction of procedure and circulator was also significant for every process time, and the interaction of surgeon and circulator was significant for four process times and reached an ${ICC}_{( adj)}$ value as high as 8.6% (cleanup duration; Table 8). Thus, the effect of the circulating nurse depends on the procedure type and, for some stages of the OR process, also on the particular attending surgeon, indicating some team synergistic effect on OR efficiency. Indeed, studies have found that nursing staff characteristics and team effects are important components of OR efficiency (56–58). More work is needed though to investigate the role of nursing staff on OR efficiency and to design interventions with nursing staff as a central component.

5.2 Clinical implications

The primary findings of this paper have the following clinical implications. First, OR process prediction models may be improved by incorporating significant factors and interactions found in this study. Improving prediction models will improve scheduling accuracy and increase OR utilization (i.e., decrease under- and over-utilization) which directly impacts OR efficiency.²⁰ This paper helps to fill a gap by quantifying the effect of key members of the surgical team and procedure type on various stages of the OR process. For instance, it may be beneficial for models predicting procedure duration to not only include the procedure type and primary surgeon, but also consider their interaction (Table 8). There is likely less variability in procedure duration among surgeons for simple procedures than for more complex procedures. Thus, prediction models should take into account that a surgeon’s variability itself will vary depending on the type of procedure performed. In addition, this study uniquely identifies the primary circulating nurse and various interaction terms as significant; therefore, researchers can more comprehensively consider the members of surgical teams and their synergistic effects when designing prediction models.

Second, case scheduling may also be improved by incorporating significant factors and interactions found in this study. The effect of a particular individual, e.g., the attending primary surgeon, can be considered when allocating portions of time to each stage for a given case. The particular individual can be used in a more advanced prediction model as mentioned above or, more simply, the individual’s historical data can be considered when allocating times. The same process can be done regarding surgical teams or combinations of surgical team members. Also, knowledge of particular surgical team members and teams themselves can inform strategies for case scheduling. For instance, if a particular surgeon or surgical team is known to have higher variability or expected completion times for a particular case, then such a case could be scheduled earlier or first in the day to allow for dynamic scheduling after the case’s completion, which could allow for the completion of more cases in a day (16).

Third, OR efficiency can be improved by minimizing the variability in the stages of the OR process attributable to members of the surgical team and combinations of team members. The present study highlights areas of higher variability for surgical team members. Efforts could be made to reduce variability by identifying inefficiencies in a surgical team’s or team member’s practice and providing relevant training. If the area of improvement is in teamwork, for instance, training could seek to promote effective, assertive, and closed-loop communication among surgical teams to help minimize team performance variability.²¹ Moreover, surgical teams can be further streamlined to match surgeons, anesthesiologists, and nurses who consistently work well together, which will in turn reduce performance variability.

5.3 Study limitations and future work

A limitation of this work was the use of a linear modeling approach and transformations to conform to model assumptions. Figure 4 shows model diagnostic plots for procedure duration.²² Figure 4(a) shows some departure from a normal distribution for the logarithm of procedure duration; the distribution shows heavier tails as indicated by the upper right and lower left portions of the curve “peeling away” from the red line. Heavier tails indicate the presence of outliers in both directions. In addition, Figure 4(b) suggests mild heteroscedasticity in the residuals as the variation appears to decrease as fitted values increase in absolute value. Finally, the distribution of procedure duration exhibited right skewness, which was corrected by a log transformation. The above three observations were true for many of the process times. While the results provided in this paper are still relatively robust due to the large sample size of the dataset, more accurate results could possibly be obtained through the use of robust regression methods suited to handle outliers and heteroscedasticity. Moreover, generalized linear mixed models could be explored to handle the non-normality of the process times (59).

Figure 4

Figure 4. Diagnostic plots for the final LMM where procedure duration is the response. (a) Normal probability plot of residuals; (b) residuals vs. fitted values; (c) histogram of residuals; (d) residuals vs. observation order.

Another limitation of this work was the lack of inclusion of many potentially important fixed-effect variables. Previous literature has explored a wide range of factors that may contribute to OR efficiency (refer to Section 2). There are likely important factors missing from this analysis as they were not available in the database at the University of Miami hospital. Future work could explore a more comprehensive list of factors to maximize the potential of data to reveal OR inefficiencies. Moreover, even with a more comprehensive and retrospective assessment of influential factors, more proactive measures are needed that implement realistic interventions, in collaboration with members of surgical teams, to bring greater efficiency to the OR suite.

6 Conclusions

The primary goal of this paper was to quantify the extent to which the procedure type and key members of the surgical team accounted for variation in the fine-grained stages of the OR process. Some of the stages of the OR process and more aggregate process times have been analyzed previously in the literature (refer to Section 2.1). However, a comprehensive analysis of the impact of the primary surgical team members on the many stages comprising the OR process is lacking. This study helps to fill this gap by developing eight different linear mixed models that quantify the variability of several OR process times with respect to procedure type, primary surgeon, responsible anesthesiology provider, and primary circulating nurse.

This study found that, to more accurately account for sources of variability in the OR process, it is necessary to break up the OR process into smaller, homogeneous stages. For instance, this study found that OR time, defined as the “wheels in” to “wheels out” time of a patient in the OR, largely reflects procedure duration and is therefore not homogeneous across its entire time span. In addition, this study found that surgeon has a larger impact than previously reported in the literature and that the circulating nurse accounted for a significant, albeit small, proportion of variability in all eight process times studied. This study can serve as a foundation for quantifying the impact of important members of the surgical team on various stages of the OR process and for more targeted interventions seeking to realize more efficient and cost-effective OR suites.

Data availability statement

The datasets presented in this article are not readily available because of concerns regarding data privacy. Requests to access the datasets should be directed to Adam Meyers,YXhtODMzNkBtaWFtaS5lZHU=.

Author contributions

AM: Conceptualization, Data curation, Formal Analysis, Investigation, Methodology, Resources, Software, Supervision, Visualization, Writing – original draft, Writing – review & editing. MD: Data curation, Formal Analysis, Investigation, Methodology, Software, Visualization, Writing – original draft, Writing – review & editing. AD: Conceptualization, Project administration, Resources, Supervision, Writing – review & editing. MW: Conceptualization, Resources, Supervision, Writing – review & editing. OK: Resources, Supervision, Writing – review & editing. MA: Conceptualization, Data curation, Project administration, Resources, Supervision, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Acknowledgments

The authors would like to thank the DeWitt Daughtry Family Department of Surgery and the University of Miami Hospital for generously providing the data used in this study.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fdgth.2024.1455477/full#supplementary-material

Footnotes

1. ^For comprehensive reviews, refer to Lee et al. (7) and Dexter and Epstein (8).

2. ^ACT is defined in Dexter et al. (22) as the sum of the time from when the patient enters the OR to when the positioning or skin preparation begins, plus the time from when the surgical dressing is completed to when the patient is wheeled out of the OR.

3. ^The “golden patient” initiative is a strategy where the first patient on the operating list is medically fit, thoroughly investigated, and has a clear surgical plan (25).

4. ^The “productive operating theatre” scheme is a three-step intervention to increase OR efficiency (25, 26).

5. ^Surgical-controlled time is defined as the duration of time from surgical incision to surgical closure.

6. ^Further description of the dataset in terms of procedural categories and the range of CPT codes used are provided in Supplementary Table S1.

7. ^Initially, the process times of anesthesia start time delay, computed as anesthesia start minus wheels in, and next case start time delay, computed as setup start (of next case) minus cleanup end (of previous case), were included in this study. However, after data cleaning, there were too little data to build LMMs with the desired factors; thus these process times were excluded from the analysis.

8. ^Supplementary Figures S1–S6 show the corresponding histograms for all other process times.

9. ^However, most cases involved only a few procedures.

10. ^All LMMs were fitted using the lmer function from the R package lme4 (53). A reference on this package is provided by Bates et al. (53).

11. ^A similar model selection procedure to that of van Eijk et al. (12) was used in this study.

12. ^Because all fixed effects were categorical with many levels, several dummy variables were created which would be included in the vector $X_{i}$ .

13. ^The standard 0.05 level of significance was used.

14. ^Skewness of the process times is further supported by the histograms displayed in Figure 2 and Supplementary Figures S1–S6.

15. ^For cleanup duration (Supplementary Table S7), surgeon is only higher than procedure in ${ICC}_{(adj)}$ when fixed effects are included.

16. ^As concluded in this paper, though, the variability of OR time largely reflects that of procedure duration; thus, procedure type has a significantly higher impact on procedure duration than the primary surgeon, but this is not the case for the other stages of the OR process as examined in this study.

17. ^The corresponding tables for all other process times are included in the Supplementary Tables S9–S15.

18. ^Supplementary Tables S16–S22 show the performance associated with the fixed effects for all other process times.

19. ^“Second surgeon” is defined in van Eijk et al. (12) as the first registered assistant surgeon during a procedure.

20. ^See Dexter and Epstein (8) where “OR efficiency” is defined as minimizing the “inefficiency of use of OR time,” the latter of which is calculated using costs and times associated with under- and over-utilization.

21. ^Granted, targeted research is needed to identify areas of inefficiency.

22. ^Diagnostic plots for all other process times are provided in Supplementary Figures S7–S13.

References

1. Scott EJ. Editorial commentary: improved operating room efficiency is the best way to control orthopaedic costs. Arthroscopy. (2024) 40:1527–8. doi: 10.1016/j.arthro.2024.01.005

PubMed Abstract | Crossref Full Text | Google Scholar

2. Muñoz E, Muñoz III W, Wise L. National and surgical health care expenditures, 2005–2025. Ann Surg. (2010) 251:195–200. doi: 10.1097/SLA.0b013e3181cbcc9a

Crossref Full Text | Google Scholar

3. Childers CP, Maggard-Gibbons M. Understanding costs of care in the operating room. JAMA Surg. (2018) 153:e176233. doi: 10.1001/jamasurg.2017.6233

PubMed Abstract | Crossref Full Text | Google Scholar

4. Cerfolio RJ, Ferrari-Light D, Ren-Fielding C, Fielding G, Perry N, Rabinovich A, et al. Improving operating room turnover time in a New York city academic hospital via lean. Ann Thorac Surg. (2019) 107:1011–6. doi: 10.1016/j.athoracsur.2018.11.071

PubMed Abstract | Crossref Full Text | Google Scholar

5. Dexter F, Traub RD, Qian F. Comparison of statistical methods to predict the time to complete a series of surgical cases. J Clin Monit Comput. (1999) 15:45–51. doi: 10.1023/A:1009999830753

PubMed Abstract | Crossref Full Text | Google Scholar

6. Rothstein DH, Raval MV. Operating room efficiency. Semin Pediatr Surg. (2018) 27:79–85. doi: 10.1053/j.sempedsurg.2018.02.004

PubMed Abstract | Crossref Full Text | Google Scholar

7. Lee DJ, Ding J, Guzzo TJ. Improving operating room efficiency. Curr Urol Rep. (2019) 20:1–8. doi: 10.1007/s11934-019-0895-3

PubMed Abstract | Crossref Full Text | Google Scholar

8. Dexter F, Epstein RH. Fundamentals of operating room allocation and case scheduling to minimize the inefficiency of use of the time. Perioper Care Oper Room Manag. (2024) 35:100379. doi: 10.1016/j.pcorm.2024.100379

Crossref Full Text | Google Scholar

9. Kayis E, Wang H, Patel M, Gonzalez T, Jain S, Ramamurthi R, et al. Improving prediction of surgery duration using operational and temporal factors. In: AMIA Annual Symposium Proceedings. Vol. 2012. Washington, DC: American Medical Informatics Association (2012). p. 456. doi: 10.1016/j.surg.2021.12.032

Crossref Full Text | Google Scholar

10. Lee S-H, Dai T, Phan PH, Moran N, Stonemetz J. The association between timing of elective surgery scheduling and operating theater utilization: a cross-sectional retrospective study. Anesth Analg. (2022) 134:455–62. doi: 10.1213/ANE.0000000000005871

PubMed Abstract | Crossref Full Text | Google Scholar

11. Eijkemans MJC, van Houdenhoven M, Nguyen T, Boersma E, Steyerberg EW, Kazemier G. Predicting the Unpredictable: A New Prediction Model for Operating Room Times Using Individual Characteristics and the Surgeon’s Estimate. Anesthesiology. (2010) 112:41–9. doi: 10.1097/ALN.0b013e3181c294c2

PubMed Abstract | Crossref Full Text | Google Scholar

12. van Eijk RP, van Veen-Berkx E, Kazemier G, Eijkemans MJ. Effect of individual surgeons and anesthesiologists on operating room time. Anesth Analg. (2016) 123:445–51. doi: 10.1213/ANE.0000000000001430

PubMed Abstract | Crossref Full Text | Google Scholar

13. Bokshan SL, Mehta S, DeFroda SF, Owens BD. What are the primary cost drivers of anterior cruciate ligament reconstruction in the united states? a cost-minimization analysis of 14,713 patients. Arthrosc J Arthrosc Relat Surg. (2019) 35:1576–81. doi: 10.1016/j.arthro.2018.12.013

PubMed Abstract | Crossref Full Text | Google Scholar

14. Allen AE, Sakheim ME, Mahendraraj KA, Nemec SM, Nho SJ, Mather III RC, et al. Time-driven activity-based costing analysis identifies use of consumables and operating room time as factors associated with increased cost of outpatient primary hip arthroscopic labral repair. Arthrosc J Arthrosc Relat Surg. (2023) 40:1517–26. doi: 10.1016/j.arthro.2023.10.050

PubMed Abstract | Crossref Full Text | Google Scholar

15. Dexter F, Dexter EU, Masursky D, Nussmeier NA. Systematic review of general thoracic surgery articles to identify predictors of operating room case durations. Anesth Analg. (2008) 106:1232–41. doi: 10.1213/ane.0b013e318164f0d5

PubMed Abstract | Crossref Full Text | Google Scholar

16. Strum DP, Sampson AR, May JH, Vargas LG. Surgeon and Type of Anesthesia Predict Variability in Surgical Procedure Times. Anesthesiology. (2000) 92:1454–66. doi: 10.1097/00000542-200005000-00036

PubMed Abstract | Crossref Full Text | Google Scholar

17. Patoir A, Payet C, Peix J-L, Colin C, Pascal L, Kraimps J-L, et al. Determinants of operative time in thyroid surgery: a prospective multicenter study of 3454 thyroidectomies. PLoS One. (2017) 12:e0181424. doi: 10.1371/journal.pone.0181424

PubMed Abstract | Crossref Full Text | Google Scholar

18. Stepaniak PS, Heij C, De Vries G. Modeling and prediction of surgical procedure times. Stat Neerl. (2010) 64:1–18. doi: 10.1111/j.1467-9574.2009.00440.x

Crossref Full Text | Google Scholar

19. Gillespie BM, Chaboyer W, Fairweather N. Factors that influence the expected length of operation: results of a prospective study. BMJ Qual Saf. (2012) 21:3–12. doi: 10.1136/bmjqs-2011-000169

PubMed Abstract | Crossref Full Text | Google Scholar

20. Kougias P, Tiwari V, Barshes NR, Bechara CF, Lowery B, Pisimisis G, et al. Modeling anesthetic times. predictors and implications for short-term outcomes. J Surg Res. (2013) 180:1–7. doi: 10.1016/j.jss.2012.10.007

PubMed Abstract | Crossref Full Text | Google Scholar

21. van Veen-Berkx E, Bitter J, Elkhuizen SG, Buhre WF, Kalkmadn CJ, Gooszen HG, et al. The influence of anesthesia-controlled time on operating room scheduling in Dutch university medical centres. Can J Anaesth. (2014) 61:524–32. doi: 10.1007/s12630-014-0134-9

PubMed Abstract | Crossref Full Text | Google Scholar

22. Dexter F, Coffin S, Tinker JH. Decreases in anesthesia-controlled time cannot permit one additional surgical operation to be reliably scheduled during the workday. Anesth Analg. (1995) 81:1263–8. doi: 10.1097/00000539-199512000-00024

PubMed Abstract | Crossref Full Text | Google Scholar

23. Does RJ, Vermaat TM, Verver JP, Bisgaard S, Van Den Heuvel J. Reducing start time delays in operating rooms. J Qual Technol. (2009) 41:95–109. doi: 10.1080/00224065.2009.11917763

Crossref Full Text | Google Scholar

24. Snee RD. Six–sigma: the evolution of 100 years of business improvement methodology. Int J Six Sigma Compet Adv. (2004) 1:4–20. doi: 10.1504/IJSSCA.2004.005274

Crossref Full Text | Google Scholar

25. Halim UA, Khan MA, Ali AM. Strategies to improve start time in the operating theatre: a systematic review. J Med Syst. (2018) 42:1–11. doi: 10.1007/s10916-018-1015-5

Crossref Full Text | Google Scholar

26. Ahmed K, Khan N, Anderson D, Watkiss J, Challacombe B, Khan MS, et al. Introducing the productive operating theatre programme in urology theatre suites. Urol Int. (2013) 90:417–21. doi: 10.1159/000345312

PubMed Abstract | Crossref Full Text | Google Scholar

27. Cox Bauer CM, Greer DM, Vander Wyst KB, Kamelle SA. First-case operating room delays: patterns across urban hospitals of a single health care system. J Patient Cent Res Rev. (2016) 3:125–35. doi: 10.17294/2330-0698.1265

Crossref Full Text | Google Scholar

28. Dexter F, Bayman EO, Pattillo JC, Schwenk ES, Epstein RH. Influence of parameter uncertainty on the tardiness of the start of a surgical case following a preceding surgical case performed by a different surgeon. Perioper Care Oper Room Manag. (2018) 13:12–7. doi: 10.1016/j.pcorm.2018.11.001

Crossref Full Text | Google Scholar

29. Epstein RH, Dexter F, Maga JM, Marian AA. Evaluation of the start of surgical closure as a milestone for forecasting the time remaining to exit the operating room: a retrospective, observational cohort study. Perioper Care Oper Room Manag. (2022) 29:100280. doi: 10.1016/j.pcorm.2022.100280

Crossref Full Text | Google Scholar

30. Bhatt AS, Carlson GW, Deckers PJ. Improving operating room turnover time: a systems based approach. J Med Syst. (2014) 38:1–8. doi: 10.1007/s10916-014-0148-4

PubMed Abstract | Crossref Full Text | Google Scholar

31. Goldhaber NH, Schaefer RL, Martinez R, Graham A, Malachowski E, Rhodes LP, et al. Surgical pit crew: initiative to optimise measurement and accountability for operating room turnover time. BMJ Health Care Inform. (2023) 30:e100741. doi: 10.1136/bmjhci-2023-100741

PubMed Abstract | Crossref Full Text | Google Scholar

32. Martin L, Langell J. Improving on-time surgical starts: the impact of implementing pre-or timeouts and performance pay. J Surg Res. (2017) 219:222–5. doi: 10.1016/j.jss.2017.05.092

PubMed Abstract | Crossref Full Text | Google Scholar

33. Simmons CG, Alvey NJ, Kaizer AM, Williamson K, Faruki AA, Kacmar RM, et al. Benchmarking of anesthesia and surgical control times by current procedural terminology (CPT®) codes. J Med Syst. (2022) 46:19. doi: 10.1007/s10916-022-01798-z

PubMed Abstract | Crossref Full Text | Google Scholar

34. Strum DP, May JH, Vargas LG. Modeling the Uncertainty of Surgical Procedure Times: Comparison of Log-normal and Normal Models. Anesthesiology. (2000) 92:1160–7. doi: 10.1097/00000542-200004000-00035

PubMed Abstract | Crossref Full Text | Google Scholar

35. Stepaniak PS, Heij C, Mannaerts GH, de Quelerij M, de Vries G. Modeling procedure and surgical times for current procedural terminology-anesthesia-surgeon combinations and evaluation in terms of case-duration prediction and operating room efficiency: a multicenter study. Anesth Analg. (2009) 109:1232–45. doi: 10.1213/ANE.0b013e3181b5de07

PubMed Abstract | Crossref Full Text | Google Scholar

36. Joustra P, Meester R, van Ophem H. Can statisticians beat surgeons at the planning of operations? Empir Econ. (2013) 44:1697–718. doi: 10.1007/s00181-012-0594-0

Crossref Full Text | Google Scholar

37. Silber JH, Rosenbaum PR, Zhang X, Even-Shoshan O. Influence of Patient and Hospital Characteristics on Anesthesia Time in Medicare Patients Undergoing General and Orthopedic Surgery. Anesthesiology. (2007) 106:356–64. doi: 10.1097/00000542-200702000-00025

PubMed Abstract | Crossref Full Text | Google Scholar

38. Li Y, Zhang S, Baugh RF, Huang JZ. Predicting surgical case durations using ill-conditioned CPT code matrix. IIE Trans. (2009) 42:121–35. doi: 10.1080/07408170903019168

Crossref Full Text | Google Scholar

39. Wang J, Cabrera J, Tsui K-L, Guo H, Bakker M, Kostis JB. Clinical and nonclinical effects on operative duration: evidence from a database on thoracic surgery. J Healthc Eng. (2020) 2020:3582796. doi: 10.1155/2020/3582796

PubMed Abstract | Crossref Full Text | Google Scholar

40. Wang J, Cabrera J, Tsui K-L, Guo H, Bakker M, Kostis JB. Clinical and non-clinical effects on surgery duration: statistical modeling and analysis. arXiv [Preprint] arXiv:1801.04110 (2018). Available online at: https://arxiv.org/abs/1801.04110. doi: 10.48550/arXiv.1801.04110

Crossref Full Text | Google Scholar

41. Clark TS, Linzer DA. Should I use fixed or random effects? Polit Sci Res Methods. (2015) 3:399–408. doi: 10.1017/psrm.2014.32

Crossref Full Text | Google Scholar

42. Dexter F, Ledolter J. Bayesian Prediction Bounds and Comparisons of Operating Room Times Even for Procedures with Few or No Historic Data. Anesthesiology. (2005) 103:1259–167. doi: 10.1097/00000542-200512000-00023

PubMed Abstract | Crossref Full Text | Google Scholar

43. Master N, Zhou Z, Miller D, Scheinker D, Bambos N, Glynn P. Improving predictions of pediatric surgical durations with supervised learning. Int J Data Sci Anal. (2017) 4:35–52. doi: 10.1007/s41060-017-0055-0

Crossref Full Text | Google Scholar

44. Bartek MA, Saxena RC, Solomon S, Fong CT, Behara LD, Venigandla R, et al. Improving operating room efficiency: machine learning approach to predict case-time duration. J Am Coll Surg. (2019) 229:346–354.e3. doi: 10.1016/j.jamcollsurg.2019.05.029

PubMed Abstract | Crossref Full Text | Google Scholar

45. Fairley M, Scheinker D, Brandeau ML. Improving the efficiency of the operating room environment with an optimization and machine learning model. Health Care Manag Sci. (2019) 22:756–67. doi: 10.1007/s10729-018-9457-3

PubMed Abstract | Crossref Full Text | Google Scholar

46. Kendale S, Bishara A, Burns M, Solomon S, Corriere M, Mathis M. Machine learning for the prediction of procedural case durations developed using a large multicenter database: algorithm development and validation study. JMIR AI. (2023) 2:e44909. doi: 10.2196/44909

PubMed Abstract | Crossref Full Text | Google Scholar

47. Martinez O, Martinez C, Parra CA, Rugeles S, Suarez DR. Machine learning for surgical time prediction. Comput Methods Programs Biomed. (2021) 208:106220. doi: 10.1016/j.cmpb.2021.106220

PubMed Abstract | Crossref Full Text | Google Scholar

48. Tuwatananurak JP, Zadeh S, Xu X, Vacanti JA, Fulton WR, Ehrenfeld JM, et al. Machine learning can improve estimation of surgical case duration: a pilot study. J Med Syst. (2019) 43:1–7. doi: 10.1007/s10916-019-1160-5

Crossref Full Text | Google Scholar

49. Wang Z, Dexter F. More accurate, unbiased predictions of operating room times increase labor productivity with the same staff scheduling provided allocated hours are increased. Perioper Care Oper Room Manag. (2022) 29:100286. doi: 10.1016/j.pcorm.2022.100286

Crossref Full Text | Google Scholar

50. Molnar C. Interpretable Machine Learning. Morrisville, NC: Lulu Press, Inc. (2020).

Google Scholar

51. Kumar IE, Venkatasubramanian S, Scheidegger C, Friedler S. Problems with Shapley-value-based explanations as feature importance measures. In: Proceedings of the 37th International Conference on Machine Learning, eds. H. D. III and A. Singh (PMLR), vol. 119 of Proceedings of Machine Learning Research. (2020). p. 5491–500.

Google Scholar

52. [Dataset] American Academy of Professional Coders (2023). CPT Code Lookup. Available online at: https://www.aapc.com/codes/cpt-codes-range (accessed December 19, 2023).

Google Scholar

53. Bates D, Mächler M, Bolker B, Walker S. Fitting linear mixed-effects models using lme4. J Stat Softw. (2015) 67:1–48. doi: 10.18637/jss.v067.i01

Crossref Full Text | Google Scholar

54. Brown ML, Staffa SJ, Quinonez LG, DiNardo JA, Nasr VG. Predictors of anesthesia ready time: analysis and benchmark data. JTCVS Open. (2023) 15:446–53. doi: 10.1016/j.xjon.2023.06.016

PubMed Abstract | Crossref Full Text | Google Scholar

55. Mathenge C. The importance of the perioperative nurse. Commun Eye Health. (2020) 33:44.

Google Scholar

56. Kayış E, Khaniyev TT, Suermondt J, Sylvester K. A robust estimation model for surgery durations with temporal, operational, and surgery team effects. Health Care Manag Sci. (2015) 18:222–33. doi: 10.1007/s10729-014-9309-8

PubMed Abstract | Crossref Full Text | Google Scholar

57. Parikh N, Gargollo P, Granberg C. Improving operating room efficiency using the six sigma methodology. Urology. (2021) 154:141–7. doi: 10.1016/j.urology.2021.02.049

PubMed Abstract | Crossref Full Text | Google Scholar

58. Xiao Y, Jones A, Zhang B, Bennett M, Mears SC, Mabrey JD, et al. Team consistency and occurrences of prolonged operative time, prolonged hospital stay, and hospital readmission: a retrospective analysis. World J Surg. (2015) 39:890–6. doi: 10.1007/s00268-014-2866-7

PubMed Abstract | Crossref Full Text | Google Scholar

59. Nakagawa S, Johnson PC, Schielzeth H. The coefficient of determination $R^{2}$ and intra-class correlation coefficient from generalized linear mixed-effects models revisited and expanded. J R Soc Interface. (2017) 14:20170213. doi: 10.1098/rsif.2017.0213

PubMed Abstract | Crossref Full Text | Google Scholar

Keywords: operating room, surgery, efficiency, case delay, duration, surgical team, human factors, linear mixed model

Citation: Meyers A, Daysalilar M, Dagal A, Wang M, Kutlu O and Akcin M (2024) Quantifying the impact of surgical teams on each stage of the operating room process. Front. Digit. Health 6:1455477. doi: 10.3389/fdgth.2024.1455477

Received: 27 June 2024; Accepted: 18 September 2024;
Published: 3 October 2024.

Edited by:

Nathan Gaw, Air Force Institute of Technology, United States

Reviewed by:

Dessislava Pachamanova, Babson College, United States
Alana Delaforce, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Australia

Copyright: © 2024 Meyers, Daysalilar, Dagal, Wang, Kutlu and Akcin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Adam Meyers, YXhtODMzNkBtaWFtaS5lZHU=

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.