Skip to main content

ORIGINAL RESEARCH article

Front. Psychiatry, 10 March 2023
Sec. Addictive Disorders
This article is part of the Research Topic Benzodiazepine Addiction: From Lab to Street View all 16 articles

Predicting benzodiazepine prescriptions: A proof-of-concept machine learning approach

\r\nKerry L. Kinney,*Kerry L. Kinney1,2*Yufeng Zheng,Yufeng Zheng2,3Matthew C. Morris,Matthew C. Morris1,2Julie A. Schumacher,Julie A. Schumacher1,2Saurabh B. Bhardwaj,Saurabh B. Bhardwaj1,2James K. Rowlett,James K. Rowlett1,2
  • 1Department of Psychiatry and Human Behavior, University of Mississippi Medical Center, Jackson, MS, United States
  • 2Center for Innovation and Discovery in Addictions, University of Mississippi Medical Center, Jackson, MS, United States
  • 3Department of Data Science, University of Mississippi Medical Center, Jackson, MS, United States

Introduction: Benzodiazepines are the most commonly prescribed psychotropic medications, but they may place users at risk of serious adverse effects. Developing a method to predict benzodiazepine prescriptions could assist in prevention efforts.

Methods: The present study applies machine learning methods to de-identified electronic health record data, in order to develop algorithms for predicting benzodiazepine prescription receipt (yes/no) and number of benzodiazepine prescriptions (0, 1, 2+) at a given encounter. Support-vector machine (SVM) and random forest (RF) approaches were applied to outpatient psychiatry, family medicine, and geriatric medicine data from a large academic medical center. The training sample comprised encounters taking place between January 2020 and December 2021 (N = 204,723 encounters); the testing sample comprised data from encounters taking place between January and March 2022 (N = 28,631 encounters). The following empirically-supported features were evaluated: anxiety and sleep disorders (primary anxiety diagnosis, any anxiety diagnosis, primary sleep diagnosis, any sleep diagnosis), demographic characteristics (age, gender, race), medications (opioid prescription, number of opioid prescriptions, antidepressant prescription, antipsychotic prescription), other clinical variables (mood disorder, psychotic disorder, neurocognitive disorder, prescriber specialty), and insurance status (any insurance, type of insurance). We took a step-wise approach to developing a prediction model, wherein Model 1 included only anxiety and sleep diagnoses, and each subsequent model included an additional group of features.

Results: For predicting benzodiazepine prescription receipt (yes/no), all models showed good to excellent overall accuracy and area under the receiver operating characteristic curve (AUC) for both SVM (Accuracy = 0.868–0.883; AUC = 0.864–0.924) and RF (Accuracy = 0.860–0.887; AUC = 0.877–0.953). Overall accuracy was also high for predicting number of benzodiazepine prescriptions (0, 1, 2+) for both SVM (Accuracy = 0.861–0.877) and RF (Accuracy = 0.846–0.878).

Discussion: Results suggest SVM and RF algorithms can accurately classify individuals who receive a benzodiazepine prescription and can separate patients by the number of benzodiazepine prescriptions received at a given encounter. If replicated, these predictive models could inform system-level interventions to reduce the public health burden of benzodiazepines.

1. Introduction

Benzodiazepines are the most commonly prescribed psychotropic medications in the U.S. (1), with approximately 12.5% of U.S. adults reporting past-year benzodiazepine use (2). They are known for their anxiolytic, sedative, hypnotic, relaxant, and anticonvulsant effects, and they are primarily indicated for short-term use in anxiety and sleep disorders (3, 4). Benzodiazepine use is associated with risk of serious adverse effects, such as psychomotor impairment, cognitive decline, falls, accidents, opioid overdose, substance use disorders, and death (5, 6), suggesting benzodiazepines pose a significant public health burden. Moreover, simultaneous receipt of multiple benzodiazepine prescriptions is considered a suboptimal and potentially high-risk prescribing pattern, as it can lead to increased plasma concentrations and risk of toxicity (7, 8), but there is a paucity of research on the correlates of multiple benzodiazepine prescription receipt. The present study aims to develop an algorithm to predict whether a patient is likely to receive a benzodiazepine prescription and the number of benzodiazepine prescriptions they are likely to receive at a given medical encounter, which could reduce the public health burden of benzodiazepine use and misuse by connecting patients with evidence-based treatments for anxiety or sleep disorders before they receive a prescription.

Research suggests access to benzodiazepines differs by demographic factors such as race, sex, and age. Indeed, multiple studies have found that in the U.S., White individuals are more likely than other racial groups to receive a benzodiazepine prescription (9, 10). Differences in the need for anxiety or insomnia treatment is unlikely to explain the variation in benzodiazepine prescriptions by race (11). Although the discrepant nature of benzodiazepine prescription rates by race may safeguard individuals from minoritized backgrounds from the risks associated with benzodiazepine use, they are also indicative of underlying disparities in screening for and treating anxiety and insomnia. Benzodiazepine rates have also been shown to differ by insurance status, such that individuals who are insured are more likely to receive a benzodiazepine prescription compared to patients without insurance coverage (12, 13). Another long-standing finding with benzodiazepine use is that women are more likely to use benzodiazepines than men (14), and female gender is associated with higher mean cumulative dosage of benzodiazepines (15). Moreover, male prescribers are more likely to prescribe benzodiazepines to female compared to male patients (15), which could indicate physician bias (e.g., male physicians may view their female patients as more anxious and in greater need of medication to treat their distress). Age is also associated with the likelihood of receiving a benzodiazepine prescription, with older patients more commonly receiving a benzodiazepine prescription (12, 16). Thus, a machine learning approach to identifying who is likely to receive a benzodiazepine prescription could not only help hospital systems begin to develop strategies to reduce the public health burden of benzodiazepines, but also identify disparities in the identification and treatment of anxiety and sleep disturbance by raising awareness of non-clinical factors that play a role in prescription prediction.

Additional research suggests individuals who are at the greatest risk of adverse benzodiazepine-related outcomes have an increased likelihood of receiving a benzodiazepine prescription (12). For example, patients with depression, schizophrenia, or a substance use disorder are prescribed benzodiazepines at higher rates than those without these conditions (12, 1719). Similarly, individuals who are prescribed an antidepressant are more likely to be prescribed a benzodiazepine than those who do not use antidepressants (12). Individuals with a comorbid psychiatric or substance use disorder are at elevated risk of misusing benzodiazepines and of negative outcomes related to benzodiazepine use compared to the general population (17, 20, 21). Indeed, research suggests benzodiazepines are associated with new onset and worsening of depression symptoms (22) and that concurrent use of benzodiazepines with alcohol or opioids is associated with increased risk of emergency department visits, injury, overdose, and death (2328). Furthermore, one study found that individuals with more severe chronic obstructive pulmonary disease (COPD) were more likely to receive multiple benzodiazepine prescriptions compared to those with less severe COPD (8). This finding is especially concerning given benzodiazepines’ respiratory depressant effect (29). As far as we are aware, no research has examined other clinical predictors of receipt of multiple benzodiazepine prescriptions. Developing an algorithm to predict who is likely to receive a benzodiazepine prescription and to stratify patients by the number of benzodiazepine prescriptions they are likely to receive at a given encounter represents an important first step toward reducing benzodiazepine prescriptions in these vulnerable populations.

To our knowledge, there is no predictive algorithm that exists to classify patients by their likelihood of receiving a benzodiazepine prescription or to stratify patients by the number of benzodiazepine prescriptions they are likely to receive at a given encounter. Machine learning uses computational modeling to learn from existing data, thereby improving predictive performance (30). The emergence of electronic medical records has led to the creation of large, rich sources of data that are ripe for health-related analyses which use machine learning to answer clinical questions more efficiently than traditional approaches (31). Specifically, machine learning methods can efficiently handle large numbers of predictors; capture complex, multidirectional, and non-linear relationships between variables; and classify clinically important populations (32). Prior research suggests machine learning can be used to predict patients’ risk for a variety of negative health outcomes (30, 31, 33), including sustained opioid prescription (34) or opioid overdose (35). Importantly, such an approach may help hospital systems begin to address issues of disparities in access to treatment for anxiety and sleep disorders and the use of potentially inappropriate prescriptions by raising awareness of non-clinical factors that are related to prescribing. The current study aims to apply machine learning methods to develop algorithms for stratifying patients by the likelihood of receiving a benzodiazepine prescription and the number of benzodiazepine prescriptions they are likely to receive at a given encounter.

2. Materials and methods

2.1. Data source

Electronic health record data were obtained from a research data warehouse at an academic medical center (36). Data for this study were de-identified and date-shifted and thus did not include any protected health information. The data warehouse compiles data from the electronic records system, Epic, based on encounters at all of the institution’s hospitals and clinics. The present study included data from encounters in three specialties: family medicine, outpatient psychiatry, and geriatric medicine, as benzodiazepines are most commonly prescribed in these settings (14). All patients who identified their race as either Black/African-American or White/Caucasian were included; all other races were excluded, as only 4.2% of encounters in the training dataset and 3.6% of encounters in the test dataset were with patients who identified as another race (see Figure 1 for a diagram of included encounters). This proof-of-concept study builds on prior research by using machine learning to identify demographic and clinical factors to improve prediction of benzodiazepine use in patients seen at Mississippi’s only academic medical center.

FIGURE 1
www.frontiersin.org

Figure 1. Encounter flow diagram.

2.2. Features and outcomes

The following sets of features were selected: anxiety and sleep diagnoses (four features: primary anxiety disorder diagnosis, any anxiety disorder diagnosis, primary sleep disorder diagnosis, any sleep disorder diagnosis), demographic characteristics (three features: age, gender, race), medications (four features: opioid prescription at encounter, number of opioid prescriptions at encounter, antidepressant prescription at encounter, antipsychotic prescription at encounter), other clinical variables (four features: any mood disorder diagnosis, any psychotic disorder diagnosis, any neurocognitive disorder diagnosis, prescriber specialty), and insurance status (two features: any insurance, type of insurance). Breathing-related sleep disorder diagnoses were excluded because benzodiazepines are contraindicated for these disorders (6). The outcomes of interest were whether a benzodiazepine was prescribed at the encounter (yes/no) and the number of benzodiazepine prescriptions given to the patient at the encounter (0, 1, 2+). Benzodiazepine prescriptions included: alprazolam, clonazepam, diazepam, chlordiazepoxide, clorazepate, lorazepam, midazolam, and temazepam.

2.3. Analytic approach

We used the Statistical Package for Social Sciences (SPSS, version 28.0) for data management and MATLAB R2020b for analysis. We used multiple machine learning approaches to determine how to optimize prediction of benzodiazepine prescriptions. Specifically, we applied support vector machine (SVM) and random forest approaches to the de-identified electronic health record data for encounters in psychiatry, family medicine, and geriatrics at an academic medical center. To train and test the prediction models, two separate datasets were collected. The training dataset was collected between January 2020 and December 2021, while the test dataset was collected between January 2022 and March 2022. Overall patient-level sample characteristics are presented for the training and test datasets in Supplementary Table 1. No cross validations (e.g., k-fold) were applied to our analyses, as the test dataset was completely separated from the training dataset. All prediction results reported in the following tables and figures were derived from the test dataset. To compare the performance of different algorithms, the true positive rate (i.e., the ratio of values that are predicted to be positive and are actually positive to all positive values), the true negative rate (i.e., the ratio of values that are predicted to be negative and are actually negative to all negative values), and the overall accuracy were calculated. Some research suggests the area under the receiver operating characteristic curve (AUC) is a better measure for evaluating the predictive ability of machine learning algorithms compared to accuracy (37); therefore the AUC was also calculated for the models predicting benzodiazepine receipt. The AUC was not calculated for models predicting the number of prescriptions, as receiver operating characteristic curves are not suitable for multi-class classifications. The AUC is a function of both sensitivity and specificity and can be interpreted such that a value of 1.0 is a perfect test of classification, 0.90–0.99 is considered excellent, 0.80–0.89 is considered good, 0.70–0.79 is considered fair, 0.51–0.69 is a poor test, and a value of 0.5 corresponds with no improvement in prediction over chance (38, 39). A total of 17 features were selected a priori based on existing literature. Given the small number of features and the large sample size, we opted to manually combine different sets of features to test the classification accuracy, which can be more easily interpreted than using data-driven approaches to feature selection. A model building approach was used to determine which sets of features would maximize predictive accuracy. The models tested were as follows:

Model 1: Anxiety and Sleep Diagnoses Only.

Model 2: Anxiety and Sleep Diagnoses + Demographic Characteristics.

Model 3: Anxiety and Sleep Diagnoses + Demographic Characteristics + Co-Prescriptions.

Model 4: Anxiety and Sleep Diagnoses + Demographic Characteristics + Co-Prescriptions + Other Clinical Variables.

Model 5: Anxiety and Sleep Diagnoses + Demographic Characteristics + Co-Prescriptions + Other Clinical Variables + Insurance.

All training samples were used for training. Undersampling was employed on the training dataset to avoid bias given the unequal distribution of negative responses (i.e., did not receive a benzodiazepine prescription) compared to positive responses (i.e., received a benzodiazepine prescription). Suppose there are m samples of benzodiazepine prescription (yes), and n (typically n > m) samples of non-benzodiazepine prescription (no). We randomly selected n′ (= m) samples from n samples. Each model was repeated 30 times, yielding 30 different n′ samples. A multivariate analysis of variance (MANOVA) was performed for each model to compare whether mean performance differed between the random forest and SVM approaches. The SVM and random forest algorithms were implemented on the Matlab R2020b platform using default settings, except where noted otherwise.

2.3.1. Support vector machine

Support vector machine is a supervised learning model that analyzes data and performs non-linear classification (40). When provided a set of training data, in which each observation is coded as belonging to a group, an SVM training algorithm uses the data to build a model that can assign new data points to a specific category. An SVM creates a hyperplane (or set of hyperplanes), or a separating line between data belonging to different classes, for classification. It seeks to identify the optimal hyperplane by maximizing the distance between the hyperplane and the closest data points in each class. By maximizing the distance between the hyperplane and the nearest data points in each class, the SVM model minimizes the generalization error of the classifier (41).

For the present analyses, in the SVM method, we used a Gaussian kernel function and a one-versus-one coding design, which yields two (or three) binary learners and for two (or three) classes. To create a receiver operating characteristic curve, we transformed SVM classification scores to class posterior probabilities, which are obtained by predicting the maximum class posterior probability at each point in a grid.

2.3.2. Random forest

Random forest is another supervised learning model that can be used for classification. It uses ensemble learning, meaning it combines multiple models to solve complex problems, rather than using an individual model (32). The random forest algorithm relies on bagging or bootstrap aggregating to improve accuracy. It uses random subsets of a training dataset to generate individual decision trees for each subsample. Each decision tree will produce an output (i.e., a classification). The final output is chosen based on “majority voting;” in other words, the random forest output is the class that is chosen by the most trees. The random forest approach can reduce the effects of overfitting in individual decision trees (42).

For the present study, in the random forest model, we trained an ensemble of 100 classification trees using the entire training dataset. A random subset of predictors was used at each decision split. The selection of the split predictors aims to maximize the split-criterion gain over all possible splits of all predictors. The number of candidate predictors considered for each tree (i.e., mtry) differed for each model such that mtry=roundup(#Features). Random subsets of the training dataset were sampled with replacement. The final classifications are the combined results of all trees.

2.3.3. Feature selection

It should be noted that a model with few predictors is preferred, as it is less costly and time-consuming to use (43). To address this concern, many choose to employ data-driven feature selection approaches, e.g., (44) to remove non-informative features from models. Methods for data-driven feature selection include wrapper methods, which evaluate multiple models by adding and/or removing features to optimize model performance, and filter methods, which assess the relevance of features separately from the predictive models and only include predictors that meet specified criteria in the final model (43). However, both approaches have disadvantages. Wrapper methods involve the evaluation of many models, which significantly increases computation time, and it can increase the risk of over-fitting the model (43). In contrast, filter methods are more computationally efficient, but they involve using selection criteria that are not necessarily related to the optimization of the model. Moreover, because each feature is evaluated separately, it is possible that redundant features are selected for the final model, while interactions between features are not quantified during the feature selection process (43).

In addition, tree-based algorithms, such as random forest, conduct feature selection automatically. For instance, during the construction of a tree, if a feature is not employed in any split, the model is effectively independent of the feature (43). In fact, prior research suggests tuning random forest models can reduce the effect of non-informative features (45), precluding the need for feature selection in random forest approaches. Conversely, random forest is a powerful classifier because it can utilize weak features, which may be suppressed by methods such as principal component analysis, to boost the classification performance.

In the present study, we used relatively few features (i.e., 17 features), which were selected a priori based on existing literature. We have previously employed this approach (46), resulting in improved accuracy when compared to data-driven feature selection. Given the small number of features and the large sample size, we opted to manually combine different sets of features to test the classification accuracy, which can be more easily interpreted than using data-driven approaches to feature selection.

3. Results

In the training dataset, collected between January 2020 and December 2021, there were a total of 204,723 encounters taking place at outpatient psychiatry, family medicine, or geriatric medicine (involving 37,979 patients); there were 4,424 encounters at which a patient received at least one benzodiazepine prescription, while there were 200,299 encounters where a patient received no such prescription. Of these, there were 3,988 encounters where a patient received one benzodiazepine prescription and 436 encounters where a patient received two or more benzodiazepine prescriptions. Patient-level characteristics for the training dataset are presented by benzodiazepine prescription status in Supplementary Table 2.

In the test dataset, collected between January 2022 and March 2022, there were a total of 28,631 encounters (involving 14,404 patients); there were 842 encounters at which a patient received at least one benzodiazepine prescription and 27,789 where a patient received no such prescription. In the test data, there were 792 encounters where a patient received one benzodiazepine prescription and 50 encounters where a patient received two or more benzodiazepine prescriptions. Because the number of “positive” observations (i.e., received a benzodiazepine prescription) is significantly lower than the number of “negative” (i.e., did not receive a benzodiazepine prescription) observations, the number of positive and negative observations were balanced prior to model training in order to avoid bias. Patient-level characteristics for the test dataset are presented by benzodiazepine prescription status in Supplementary Table 3.

All prediction results (e.g., accuracy, AUC) reported in the following tables and figures were derived from the test dataset.

3.1. Prescription receipt

Table 1 and Figure 2 display the results for the models predicting whether a patient received a benzodiazepine prescription at a given encounter (yes/no). As depicted in Figure 2A, for the SVM approach, overall accuracy did not improve after including the first set of features (i.e., anxiety and sleep diagnoses). For the random forest approach, Model 2 maximized overall accuracy when predicting whether a patient received a benzodiazepine prescription at a given encounter (yes/no), and the random forest model slightly outperformed the SVM model (Random Forest benzodiazepine prescription receipt Model 2 accuracy = 0.887; SVM benzodiazepine prescription receipt Model 1 accuracy = 0.883, F(1, 58) = 52.892, p < 0.001). However, as shown in Figure 2B and Figure 3, when examining the AUC, Model 4 maximized the AUC for the SVM approach when predicting benzodiazepine prescription receipt at an encounter (SVM benzodiazepine prescription receipt Model 4 AUC = 0.924), while Model 5 maximized the AUC for the random forest approach when predicting whether a patient received a benzodiazepine prescription (Random Forest benzodiazepine prescription receipt Model 5 AUC = 0.953).

TABLE 1
www.frontiersin.org

Table 1. Benzodiazepine prescription receipt (yes/no) prediction results.

FIGURE 2
www.frontiersin.org

Figure 2. Accuracy (A) and area under the receiver operating characteristic curve (B) of each model in predicting benzodiazepine prescription receipt at a given encounter (yes/no). BZ, benzodiazepine; SVM, support vector machine; RF, random forest; AUC, area under the receiver operating characteristic curve.

FIGURE 3
www.frontiersin.org

Figure 3. Receiver operating characteristic curve for classification Model 4 (A) and for classification Model 5 (B). ROC, receiver operating characteristic curve; BZ, benzodiazepine; SVM, support vector machine; RF, random forest.

3.2. Number of prescriptions

Table 2 and Figure 4 display the results for the models predicting the number of benzodiazepine prescriptions received at a given encounter (0, 1, 2+). As demonstrated in Figure 4, Model 2 maximized overall accuracy when predicting how many benzodiazepine prescriptions a patient received at an encounter (0, 1, 2+), with the random forest model slightly outperforming the SVM model (Random Forest number of benzodiazepines Model 2 accuracy = 0.878; SVM number of benzodiazepines Model 2 accuracy = 0.877, F(1, 28) = 0.808, p = 0.372), though this difference was not statistically significant.

TABLE 2
www.frontiersin.org

Table 2. Number of benzodiazepine prescriptions (0, 1, 2+) prediction results.

FIGURE 4
www.frontiersin.org

Figure 4. Accuracy of each model in predicting the number of benzodiazepines a patient receives at a given encounter (0, 1, 2+). BZ, benzodiazepine; SVM, support vector machine; RF, random forest.

4. Discussion

Benzodiazepines, which are associated with risk of serious adverse effects (5, 6), represent a significant public health burden (2). Research suggests certain clinical (12, 1719) and demographic factors (9, 10, 12, 1416) are associated with benzodiazepine use. However, to our knowledge there is no predictive algorithm which exists that can classify patients by whether they are likely to receive a benzodiazepine prescription and the number of benzodiazepine prescriptions they are likely to receive at a given encounter. The present study used SVM and random forest approaches to develop an algorithm to predict whether a patient is likely to receive a benzodiazepine prescription at a given encounter and how many benzodiazepine prescriptions they are likely to receive at a given encounter, which could facilitate efforts to reduce the public health burden of benzodiazepine use and misuse. We took a step-wise approach to developing a prediction model in order to determine which categories of features are needed to predict benzodiazepine prescriptions accurately. Based on this analysis, both SVM and random forest algorithms may accurately classify individuals who receive a benzodiazepine prescription and can separate patients by the number of benzodiazepine prescriptions received, though there are some differences in performance between the approaches. This proof-of-concept study demonstrates the potential of machine learning approaches in identifying individuals to target for prevention efforts to reduce the burden of benzodiazepine use and inadequately treated anxiety and sleep disorders.

For the SVM approach, overall accuracy did not improve beyond Model 1 (i.e., anxiety and sleep disorder diagnoses), while for the random forest approach, Model 2 (i.e., anxiety and sleep disorder diagnoses and demographic characteristics) maximized overall accuracy when predicting whether a patient received a benzodiazepine prescription at a given encounter (yes/no). For both machine learning approaches, Model 2 maximized overall accuracy when predicting the number of benzodiazepine prescriptions received at an encounter (0, 1, 2+). The random forest model slightly outperformed the SVM model for both outcomes of interest. Of note, including additional groups of features beyond anxiety and sleep diagnoses and demographic characteristics did not improve overall accuracy and, in fact, decreased accuracy slightly. This runs counter to prior research suggesting co-prescriptions, comorbid conditions, and insurance status are important predictors of receiving a benzodiazepine prescription (12). It is possible that the predictive value of those factors is better accounted for by sleep and anxiety disorder diagnoses or patients’ demographic characteristics (i.e., race, age, or gender).

It should be noted that although overall accuracy did not improve when more categories of features were added, including co-prescribed medications in both the SVM and the random forest models improved the true positive rate for benzodiazepine prescription receipt, as well as the number of benzodiazepines prescribed, at a given encounter. Furthermore, including other clinical variables (i.e., any mood disorder diagnosis, any psychotic disorder diagnosis, any neurocognitive disorder diagnosis, prescriber specialty) and insurance status (i.e., whether the patient has insurance, type of insurance) improved the true positive rate for two or more benzodiazepine prescriptions. This suggests that decisions about which categories of features to include in a model may be driven by whether the system employing these machine learning methods is motivated primarily by maximizing sensitivity or specificity. For example, given that an intervention to reduce or prevent benzodiazepine prescribing represents a low risk to the patient, some hospital systems may prefer to use a prediction model that maximizes sensitivity, as false positives would not be a major concern. In contrast, if a hospital system is extremely resource-limited, they may prefer to maximize specificity.

In light of interpretation guidelines (38, 39), all of the SVM and random forest models predicting whether a patient received a benzodiazepine prescription at a given encounter (yes/no) tested in the present study demonstrate good to excellent predictive ability. Model 4 yielded the maximum AUC value for the SVM approach, suggesting that including the most relevant diagnoses, demographic characteristics, co-prescribed medications, and other clinical variables maximizes the predictive value for SVM. However, when using the random forest approach, Model 5 yielded the maximum AUC, suggesting insurance status offers additional predictive value. Both of these approaches yielded AUC values in the excellent range, with random forest slightly outperforming SVM. Thus, employing a random forest approach that utilizes all of the categories of features tested in the present study yields the maximum predictive value when evaluated via AUC.

One finding of note in the present study is that although overall accuracy is high for both the prediction of whether a patient will receive a benzodiazepine prescription at a given encounter and the number of benzodiazepines received, the true positive rate for identifying patients who received two or more benzodiazepine prescriptions at a given encounter was relatively low for both the SVM and random forest approach. This may be due to the relatively low base rate of patients receiving multiple benzodiazepine prescriptions at an encounter. To account for this obstacle, in the present study the number of positive and negative observations were balanced prior to model training in an attempt to avoid bias. However, despite low base rate questions being widely recognized as a concern in machine learning, the best method for accounting for this imbalance remains an open question (47). Further research is needed to determine how to best predict the likelihood of receiving two or more benzodiazepine prescriptions.

The present proof-of-concept study suggests that we can predict whether an individual is likely to receive a benzodiazepine prescription at a given encounter and how many benzodiazepine prescriptions they are likely to receive based on information from their electronic health record, with good to excellent predictive ability. Future research is needed to determine whether these predictive models could be useful in a clinical context by alerting providers to a patient’s classification and offering suggestions for how to proceed in light of the risks benzodiazepines can pose to patients’ health (5, 6). For example, if the predictive models used in the present study were employed by a hospital system, a message could be triggered by the algorithm in a patient’s chart that informs a provider of the patient’s risk, provides information on first-line treatments for anxiety and sleep conditions, and makes treatment recommendations. This may include suggesting that the provider refer the patient to cognitive behavioral therapy for anxiety or sleep disturbance (4851), attempt treatment with a selective-serotonin reuptake inhibitor for anxiety (52), and/or offer the patient educational materials on sleep hygiene and coping skills. Further research is needed to determine whether such an intervention reduces the public health burden of benzodiazepine use and inadequately treated anxiety and sleep disorders. Moreover, the same machine learning methods used in the present study could be applied to examine who is likely to convert to higher risk use (e.g., long-term or high-dose use) (5) if provided a benzodiazepine prescription. Similar methods have been successfully applied to the prediction of opioid use disorder onset (53), sustained opioid prescription (34), and opioid overdose (35). In addition, future research should investigate the utility of employing these machine learning models in longitudinal follow-up data to identify patients who, when prescribed a benzodiazepine, are at elevated risk of side effects or other complications. This would allow for prevention efforts to be targeted at patients who are at the greatest risk of suffering the negative consequences of benzodiazepine use.

The present findings should be interpreted in light of the study’s limitations. First, due to the approach used in the present study, we were unable to ascertain which specific features had the best predictive value. Additionally, the models used in the present study did not provide information on the direction of the relationship between features and the likelihood of receiving a benzodiazepine prescription, although the extant literature provides clues. Moreover, we did not control for benzodiazepine prescription history. Therefore, it is possible that a patient had already received a benzodiazepine prescription prior to the encounters examined in the current study or that patients received benzodiazepine prescriptions by other providers not captured in the current dataset. Finally, there may be additional features that were not included in our models but have value in predicting benzodiazepine prescriptions.

Taken together, the present study suggests SVM and random forest predictive models based on anxiety and sleep diagnoses and demographic characteristics can accurately classify individuals who receive a benzodiazepine prescription and can separate patients by the number of benzodiazepine prescriptions received, with random forest slightly outperforming SVM approaches. Moreover, including additional features can improve the AUC. If results are replicated, machine learning approaches may be useful in determining who to target for prevention efforts to reduce the public health burden of benzodiazepine use and misuse.

Data availability statement

The data analyzed in this study is subject to the following licenses/restrictions: The data warehouse from which the information for this study is derived has been de-identified and date-shifted so that it does not include any protected health information (PHI). Pursuant to 45 CFR 46, use of this database does not meet the definition of human subjects’ research and does not require IRB review. UMMC Faculty, Staffs, and Students can access this dataset. Requests to access these datasets should be directed to Center for Informatics and Analytics, Y2lhQHVtYy5lZHU=.

Author contributions

KK designed the study in consultation with all co-authors and drafted the manuscript. YZ conducted the machine learning analyses and provided feedback on the methods and results sections. MM, JS, and JR participated in the design of the study. SB provided consultation on clinicians’ benzodiazepine prescription decision-making processes. All authors provided feedback on the manuscript and approved the final manuscript.

Funding

This work was supported in part by grants from the National Institutes of Health (R01DA011792 and R01DA043204).

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Author disclaimer

The content is solely the responsibility of the authors and does not necessarily represent the official views of the NIH.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2023.1087879/full#supplementary-material

References

1. Greenblatt D, Harmatz J, Shader R. Psychotropic drug prescribing in the United States: extent, costs, and expenditures. Philadelphia: LWW (2011). p. 1–3. doi: 10.1097/JCP.0b013e318209cf05

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Maust D, Lin L, Blow F. Benzodiazepine use and misuse among adults in the United States. Psychiatr Serv. (2019) 70:97–106. doi: 10.1176/appi.ps.201800321

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Ashton H. Guidelines for the rational use of benzodiazepines. Drugs. (1994) 48:25–40. doi: 10.2165/00003495-199448010-00004

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Simon G, Ludman E. Outcome of new benzodiazepine prescriptions to older adults in primary care. Gen Hosp Psychiatry. (2006) 28:374–8. doi: 10.1016/j.genhosppsych.2006.05.008

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Lader M. Benzodiazepines revisited–will we ever learn? Addiction. (2011) 106:2086–109. doi: 10.1111/j.1360-0443.2011.03563.x

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Guina J, Merrill B. Benzodiazepines I: upping the care on downers: the evidence of risks, benefits and alternatives. J Clin Med. (2018) 7:17. doi: 10.3390/jcm7020017

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Tamblyn R, McLeod P, Abrahamowicz M, Monette J, Gayton D, Berkson L, et al. Questionable prescribing for elderly patients in Quebec. CMAJ. (1994) 150:1801.

Google Scholar

8. Vozoris N, Fischer H, Wang X, Anderson G, Bell C, Gershon A, et al. Benzodiazepine use among older adults with chronic obstructive pulmonary disease. Drugs Aging. (2013) 30:183–92. doi: 10.1007/s40266-013-0056-1

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Friedman J, Kim D, Schneberk T, Bourgois P, Shin M, Celious A, et al. Assessment of racial/ethnic and income disparities in the prescription of opioids and other controlled medications in California. JAMA Intern Med. (2019) 179:469–76. doi: 10.1001/jamainternmed.2018.6721

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Cook B, Creedon T, Wang Y, Lu C, Carson N, Jules P, et al. Examining racial/ethnic differences in patterns of benzodiazepine prescription and misuse. Drug Alcohol Depend. (2018) 187:29–34. doi: 10.1016/j.drugalcdep.2018.02.011

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Ribas Roca J, Everett T, Dongarwar D, Salihu HM. Racial-ethnic disparities in benzodiazepine prescriptions for anxiety in US emergency departments. J Racial Ethn Health Disparities. (2022) 10:334–42. doi: 10.1007/s40615-021-01224-z

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Kroll D, Nieva H, Barsky A, Linder J. Benzodiazepines are prescribed more frequently to patients already at risk for benzodiazepine-related adverse events in primary care. J Gen Intern Med. (2016) 31:1027–34. doi: 10.1007/s11606-016-3740-0

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Santo L, Rui P, Ashman J. Physician office visits at which benzodiazepines were prescribed: findings from 2014-2016 national ambulatory medical care survey. Natl Health Stat Rep. (2020) 137:1–16.

PubMed Abstract | Google Scholar

14. Agarwal S, Landon B. Patterns in outpatient benzodiazepine prescribing in the United States. JAMA Netw Open. (2019) 2:e187399. doi: 10.1001/jamanetworkopen.2018.7399

PubMed Abstract | CrossRef Full Text | Google Scholar

15. McIntyre R, Chen V, Lee Y, Lui L, Majeed A, Subramaniapillai M, et al. The influence of prescriber and patient gender on the prescription of benzodiazepines: evidence for stereotypes and biases? Soc Psychiatry Psychiatr Epidemiol. (2021) 56:1083–9. doi: 10.1007/s00127-020-01989-4

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Cunningham C, Hanley G, Morgan S. Patterns in the use of benzodiazepines in British Columbia: examining the impact of increasing research and guideline cautions against long-term use. Health Policy. (2010) 97:122–9. doi: 10.1016/j.healthpol.2010.03.008

PubMed Abstract | CrossRef Full Text | Google Scholar

17. McHugh R, Peckham A, Bjorgvinsson T, Korte F, Beard C. Benzodiazepine misuse among adults receiving psychiatric treatment. J Psychiatr Res. (2020) 128:33–7. doi: 10.1016/j.jpsychires.2020.05.020

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Wu C, Lin Y, Liu S. Benzodiazepine use among patients with schizophrenia in Taiwan: a nationwide population-based survey. Psychiatr Serv. (2011) 62:908–14. doi: 10.1176/ps.62.8.pss6208_0908

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Schuster J, Hoertel N, von Gunten A, Seigneurie A, Limosin F, Csa Study group. Benzodiazepine use among older adults with schizophrenia spectrum disorder: prevalence and associated factors in a multicenter study. Int Psychogeriatr. (2020) 32:441–51. doi: 10.1017/S1041610219000358

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Votaw V, Witkiewitz K, Valeri L, Bogunovic O, McHugh R. Nonmedical prescription sedative/tranquilizer use in alcohol and opioid use disorders. Addict Behav. (2019) 88:48–55. doi: 10.1016/j.addbeh.2018.08.010

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Votaw V, Geyer R, Rieselbach M, McHugh R. The epidemiology of benzodiazepine misuse: a systematic review. Drug Alcohol Depend. (2019) 200:95–114.

Google Scholar

22. van Vliet P, van der Mast R, van den Broek M, Westendorp R, de Craen A. Use of benzodiazepines, depressive symptoms and cognitive function in old age. Int J Geriatr Psychiatry. (2009) 24:500–8. doi: 10.1002/gps.2143

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Bannon M, Lapansie A, Jaster A, Saad M, Lenders J, Schmidt C. Opioid deaths involving concurrent benzodiazepine use: assessing risk factors through the analysis of prescription drug monitoring data and postmortem toxicology. Drug Alcohol Depend. (2021) 225:108854. doi: 10.1016/j.drugalcdep.2021.108854

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Jones C, McAninch J. Emergency department visits and overdose deaths from combined use of opioids and benzodiazepines. Am J Prev Med. (2015) 49:493–501. doi: 10.1016/j.amepre.2015.03.040

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Park T, Saitz R, Ganoczy D, Ilgen M, Bohnert A. Benzodiazepine prescribing patterns and deaths from drug overdose among US veterans receiving opioid analgesics: case-cohort study. BMJ. (2015) 350:h2698. doi: 10.1136/bmj.h2698

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Gomes T, Mamdani M, Dhalla I, Paterson J, Juurlink D. Opioid dose and drug-related mortality in patients with nonmalignant pain. Arch Intern Med. (2011) 171:686–91. doi: 10.1001/archinternmed.2011.117

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Jones C, Paulozzi L, Mack K. Alcohol involvement in opioid pain reliever and benzodiazepine drug abuse–related emergency department visits and drug-related deaths—United States, 2010. Morb Mortal Wkly Rep. (2014) 63:881.

PubMed Abstract | Google Scholar

28. Yarborough B, Stumbo S, Stoneburner A, Smith N, Dobscha S, Deyo R, et al. Correlates of benzodiazepine use and adverse outcomes among patients with chronic pain prescribed long-term opioid therapy. Pain Med. (2019) 20:1148–55.

Google Scholar

29. Gaudreault P, Guay J, Thivierge R, Verdy I. Benzodiazepine poisoning. Drug Saf. (1991) 6:247–65. doi: 10.2165/00002018-199106040-00003

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Shung D, Au B, Taylor R, Tay J, Laursen S, Stanley A, et al. Validation of a machine learning model that outperforms clinical risk scoring systems for upper gastrointestinal bleeding. Gastroenterology. (2020) 158:160–7. doi: 10.1053/j.gastro.2019.09.009

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Weiss J, Natarajan S, Peissig P, McCarty C, Page D. Machine learning for personalized medicine: predicting primary myocardial infarction from electronic health records. Ai Mag. (2012) 33:33–45. doi: 10.1609/aimag.v33i4.2438

CrossRef Full Text | Google Scholar

32. Galatzer-Levy I, Ruggles K, Chen Z. Data science in the research domain criteria era: relevance of machine learning to the study of stress pathology, recovery, and resilience. Chronic Stress. (2018) 2:2470547017747553. doi: 10.1177/2470547017747553

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Han J, Yee J, Cho S, Kim M, Moon J, Jung D, et al. A risk scoring system utilizing machine learning methods for hepatotoxicity prediction one year after the initiation of tyrosine kinase inhibitors. Front Oncol. (2022) 12:790343. doi: 10.3389/fonc.2022.790343

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Karhade A, Ogink P, Thio Q, Broekman M, Cha T, Hershman S, et al. Machine learning for prediction of sustained opioid prescription after anterior cervical discectomy and fusion. Spine J. (2019) 19:976–83. doi: 10.1016/j.spinee.2019.01.009

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Lo-Ciganic W, Huang J, Zhang H, Weiss J, Wu Y, Kwoh C, et al. Evaluation of machine-learning algorithms for predicting opioid overdose risk among medicare beneficiaries with opioid prescriptions. JAMA Netw Open. (2019) 2:e190968.

Google Scholar

36. University of Mississippi Medical Center. Center for Informatics and Analytics Patient Cohort Explorer (PCE) (Version 2) figshare Software. Jackson: University of Mississippi Medical Center (2020).

Google Scholar

37. Huang J, Ling C. Using AUC and accuracy in evaluating learning algorithms. IEEE Trans knowl Data Eng. (2005) 17:299–310. doi: 10.1109/TKDE.2005.50

CrossRef Full Text | Google Scholar

38. Carter J, Pan J, Rai S, Galandiuk S. ROC-ing along: evaluation and interpretation of receiver operating characteristic curves. Surgery. (2016) 159:1638–45. doi: 10.1016/j.surg.2015.12.029

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Hanley J, McNeil B. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology. (1982) 143:29–36. doi: 10.1148/radiology.143.1.7063747

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Burges CJ. A tutorial on support vector machines for pattern recognition. Data Min Knowl Discov. (1998) 2:121–67.

Google Scholar

41. Hastie T, Tibshirani R, Friedman J, Friedman J. The elements of statistical learning: data mining, inference, and prediction. Berlin: Springer (2009). doi: 10.1007/978-0-387-84858-7

CrossRef Full Text | Google Scholar

42. Breiman L. Random forests. Mach Learn. (2001) 45:5–32. doi: 10.1023/A:1010933404324

CrossRef Full Text | Google Scholar

43. Kuhn M, Johnson K. Applied predictive modeling. Berlin: Springer (2013). doi: 10.1007/978-1-4614-6849-3

CrossRef Full Text | Google Scholar

44. Drożdż K, Nabrdalik K, Kwiendacz H, Hendel M, Olejarz A, Tomasik A, et al. Risk factors for cardiovascular disease in patients with metabolic-associated fatty liver disease: a machine learning approach. Cardiovasc Diabetol. (2022) 21:1–12. doi: 10.1186/s12933-022-01672-9

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Kuhn M, Johnson K. Recursive feature elimination| feature engineering and selection: a practical approach for predictive models. Abingdon: Taylor & Francis Group (2019). doi: 10.1201/9781315108230

CrossRef Full Text | Google Scholar

46. Zheng Y, Christman B, Morris M, Hillegass W, Zhang Y, Douglas K, et al. editors. Adolescent behavioral risk analysis and prediction using machine learning: a foundation for precision suicide prevention. Multimodal image exploitation and learning 2022. Bellingham: SPIE (2022). doi: 10.1117/12.2620105

CrossRef Full Text | Google Scholar

47. Krawczyk B. Learning from imbalanced data: open challenges and future directions. Prog Artif Intell. (2016) 5:221–32. doi: 10.1007/s13748-016-0094-0

CrossRef Full Text | Google Scholar

48. Hofmann S, Smits J. Cognitive-behavioral therapy for adult anxiety disorders: a meta-analysis of randomized placebo-controlled trials. J Clin Psychiatry. (2008) 69:621–32. doi: 10.4088/JCP.v69n0415

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Olatunji B, Cisler J, Deacon B. Efficacy of cognitive behavioral therapy for anxiety disorders: a review of meta-analytic findings. Psychiatr Clin North Am. (2010) 33:557–77.

Google Scholar

50. van der Zweerde T, Bisdounis L, Kyle S, Lancee J, van Straten A. Cognitive behavioral therapy for insomnia: a meta-analysis of long-term effects in controlled studies. Sleep Med Rev. (2019) 48:101208. doi: 10.1016/j.smrv.2019.08.002

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Mitchell M, Gehrman P, Perlis M, Umscheid C. Comparative effectiveness of cognitive behavioral therapy for insomnia: a systematic review. BMC Fam Pract. (2012) 13:40. doi: 10.1186/1471-2296-13-40

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Vaswani M, Linda F, Ramesh S. Role of selective serotonin reuptake inhibitors in psychiatric disorders: a comprehensive review. Prog Neuropsychopharmacol Biol Psychiatry. (2003) 27:85–102. doi: 10.1016/S0278-5846(02)00338-X

PubMed Abstract | CrossRef Full Text | Google Scholar

53. Wadekar A editor. Predicting opioid use disorder (OUD) using a random forest. Proceedings of the 2019 IEEE 43rd annual computer software and applications conference (COMPSAC). Piscataway, NJ: IEEE (2019). doi: 10.1109/COMPSAC.2019.00161

CrossRef Full Text | Google Scholar

Keywords: benzodiazepine, prescriptions, machine learning, support vector machine, random forest

Citation: Kinney KL, Zheng Y, Morris MC, Schumacher JA, Bhardwaj SB and Rowlett JK (2023) Predicting benzodiazepine prescriptions: A proof-of-concept machine learning approach. Front. Psychiatry 14:1087879. doi: 10.3389/fpsyt.2023.1087879

Received: 02 November 2022; Accepted: 20 February 2023;
Published: 10 March 2023.

Edited by:

Benjamin Rolland, Université Claude Bernard Lyon 1, France

Reviewed by:

Jakub Nalepa, Silesian University of Technology, Poland
Juliana Gomes, Federal University of Pernambuco, Brazil

Copyright © 2023 Kinney, Zheng, Morris, Schumacher, Bhardwaj and Rowlett. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Kerry L. Kinney, a2tpbm5leUB1bWMuZWR1

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.