- 1Maternal, Fetal & Neonatal Research Center, Family Health Research Institute, Tehran University of Medical Sciences, Tehran, Iran
- 2Center for Orthopedic Trans-disciplinary Applied Research (COTAR), Tehran University of Medical Sciences, Tehran, Iran
- 3Sports Medicine Research Center, Neuroscience Institute, Tehran University of Medical Sciences, Tehran, Iran
- 4Department of Neuroscience (DNS), Padua Neuroscience Center, University of Padova, Padua, Italy
- 5Padua Neuroscience Center, University of Padova, Padua, Italy
- 6Department of Psychological Medicine, Institute of Psychiatry, Psychology and Neuroscience, Kings College London, London, United Kingdom
- 7Social and Affective Neuroscience Group, MoMiLab, Institutions, Markets, Technologies (IMT) School for Advanced Studies Lucca, Lucca, Italy
- 8Department of Pathophysiology and Transplantation, University of Milan, Milan, Italy
- 9Department of Neurosciences and Mental Health, Fondazione Istituto di Ricovero e Cura a Carattere Scientifico (IRCCS) Ca’ Granda Ospedale Maggiore Policlinico, Milan, Italy
Background: Schizophrenia spectrum disorders (SSD) can be associated with an increased risk of violent behavior (VB), which can harm patients, others, and properties. Prediction of VB could help reduce the SSD burden on patients and healthcare systems. Some recent studies have used machine learning (ML) algorithms to identify SSD patients at risk of VB. In this article, we aimed to review studies that used ML to predict VB in SSD patients and discuss the most successful ML methods and predictors of VB.
Methods: We performed a systematic search in PubMed, Web of Sciences, Embase, and PsycINFO on September 30, 2023, to identify studies on the application of ML in predicting VB in SSD patients.
Results: We included 18 studies with data from 11,733 patients diagnosed with SSD. Different ML models demonstrated mixed performance with an area under the receiver operating characteristic curve of 0.56-0.95 and an accuracy of 50.27-90.67% in predicting violence among SSD patients. Our comparative analysis demonstrated a superior performance for the gradient boosting model, compared to other ML models in predicting VB among SSD patients. Various sociodemographic, clinical, metabolic, and neuroimaging features were associated with VB, with age and olanzapine equivalent dose at the time of discharge being the most frequently identified factors.
Conclusion: ML models demonstrated varied VB prediction performance in SSD patients, with gradient boosting outperforming. Further research is warranted for clinical applications of ML methods in this field.
1 Introduction
Schizophrenia disorders are characterized by delusions, hallucinations, disordered thinking, disorganized behavior, and blunted or inappropriate affects (1, 2). The disorders profoundly impact an individual’s quality of life and can also pose a risk to others, especially when they lead to violent behaviors (VB) (3). People with schizophrenia are frequently stigmatized as having a higher potential for violence, resulting in discrimination (4). Moreover, recent research has shown that schizophrenia spectrum disorders (SSD) – including schizophrenia, schizoaffective disorder, and other delusional disorders – have been linked with an increased risk of VB in various studies conducted worldwide (5–8).
The definition of VB is diverse, but it generally encompasses any manifestation of verbal or physical aggression directed at objects, others, or oneself (9, 10). The impact of VB is widespread, affecting not only the patients themselves, who may lose property, relationships, and well-being, but also their caregivers, such as family, friends, or healthcare workers, who can be traumatized by the experience (11, 12). Additionally, VB can increase the burden on the healthcare system for patients with SSD (13). A recent systematic review and meta-analysis reported a prevalence of 17.19 - 23.83% for different types of VB other than homicide among SSD patients (5). Another systematic review and meta-analysis, which pooled data from 15 countries, reported an odds ratio of 4.5 for interpersonal VB among SSD individuals compared to a general population group without these disorders (7).
Given the significant impact that VB can have on patients and those in their environment, it is critical to accurately predict the risk of VB to help prevent these behaviors. To date, many studies have investigated the risk factors for VB in SSD patients, including sociodemographic factors, disease characteristics, and previous patients’ medical history (14–16). However, most of these studies could not predict the risk of VB accurately, due to the complex and multifactorial nature of violence occurrence (17).
Machine learning (ML) is a subset of artificial intelligence that uses algorithms to learn from data, identify patterns, and make predictions (18, 19). By analyzing large amounts of data, ML algorithms can identify complex relationships and hidden links behind phenomena that are not obvious to human observers (20). The key aspect of ML is its capability to build predictive models, demonstrated by its ability to anticipate clinical outcomes such as suicidal ideation, impulsivity, and VB (19, 21, 22). This attribute renders ML a promising instrument for unraveling the intricate interplay between schizophrenia and VB, thereby aiding healthcare providers in the early identification of individuals susceptible to VB (23, 24). This, in turn, holds the potential to optimize resource allocation, diminish lay times, and fortify the safety of both staff and patients (25). Ultimately, the trajectory of ML in healthcare portends the evolution of medical prediction tools, envisaging their integration into routine clinical practice to proactively avert instances of VB and alleviate the burden of schizophrenia within this context (26).
This systematic review aims to investigate the potential of ML in predicting VB in patients with SSD, which we believe will offer a better understanding of the potential of ML in this clinical context and will be of interest to researchers and healthcare providers seeking to use ML to identify patients at risk of VB. Our main objectives are: 1) to discuss the most robust algorithms used for the prediction of VB; 2) to assess the general accuracy that has been achieved in predicting VB using ML; and 3) to review the effective factors that have enhanced ML’s ability to predict VB.
2 Materials and methods
2.1 Search strategy
We performed a systematic search in PubMed, Web of Sciences, Embase, and PsycINFO for relevant studies published before September 30, 2023. The search keywords consisted of three groups of keywords related to (a) ML, (b) SSD, and (c) VB. In this systematic review, the PICO (Population, Intervention, Comparison, Outcome) framework was employed with the following criteria:
Population: Schizophrenia spectrum disorder (SSD), including schizophrenia, schizoaffective disorder, and other delusional disorders (27);
Intervention: Machine learning models (ML);
Comparison: Medical records of patients or clinical violence risk assessment scales;
Outcome: Violent behavior (VB), defining as an attempt or action to harm a target, assault, robbery, aggression toward property, actions resulting in physical injury, child abuse, sexual abuse, threatening or causing injury with a weapon, verbal aggression or threatening, and violent crimes, e.g., attempted or completed homicide (28, 29).
This study was conducted in concordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) (30) and Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies (CHARMS) guidelines (31).
2.2 Inclusion and exclusion criteria
All studies developing ML models for predicting VB in SSD patients were included. The development of a ML model in medicine includes the following stages: data acquisition, data preparation, ML model development, model evaluation, hyperparameter tuning, and model validation (32). We aimed to review the articles that developed and evaluated ML models for the prediction of VB in SSD patients. Hence, studies that only employed statistical models by using an ML subset (e.g., logistic regression) and did not either evaluate or validate the performance of their generated model were not included in this review. The exclusion criteria consisted of 1) Records that did not study patients with an SSD diagnosis, 2) records that did not predict VB, 3) records that did not employ an ML method, 4) records that were not available in the English language full-text, 5) editorials, commentaries, letters, conference abstracts, books, and review articles, and 6) animal studies.
2.3 Study selection
The selection process began with removing the duplicated records. Then, two authors (MP and AA) independently reviewed the article titles and abstracts and selected the relevant papers for the full-text screening process. The same authors (MP and AA) independently conducted the full-text screening of the selected records for eligibility. Any discrepancies were settled by discussion and, if necessary, referred to a third author (GC).
2.4 Data extraction
Two authors (AA and MT) conducted the data extraction. We collected data about the authors, year of publication, sample size, characteristics of the patients, ML model and validation techniques, input variables (i.e., demographics), output variables (VB), additional assessments, and key findings from every included record. Also, reported measures of the area under the receiver operating characteristic curves (AUROC), balanced accuracy, predictive power, P-value, sensitivity, specificity, positive predictive values (PPV), and negative predictive values (NPV) were collected. Cross-validation performance was defined as a training dataset because it involved data “seen” by the machine, whereas “unseen” data from a held-out test set or external cohort was treated as validation.
2.5 Data synthesis
To bypass the limitations of meta-analyzing heterogeneous datasets, one author (MT) implemented a novel comparative approach, ranking each ML model’s performance within individual studies and then averaging ranks across studies to identify the best overall performing ML model.
2.6 Risk of bias assessment
To assess the risk of bias (ROB), we employed the Prediction Model Risk of Bias Assessment Tool (PROBAST) (33). It is a tool for assessing ROB and the applicability of diagnostic and prognostic prediction model studies. PROBAST evaluates 4 domains of participants, predictors, outcome, and analysis in the study by 20 signaling questions. signaling questions of the PROBAST checklist and its guidance notes for rating ROB and applicability are fully provided in PROBAST checklist section of the Supplementary Material. These questions facilitate structured judgment of ROB in the studies of predictive models. We used the explanation and elaboration document that describes the rationale for including each domain and signaling question and guides researchers to use them to assess the ROB and applicability concerns. Also, to assess the ROB in the studies that employed more than one ML model, we selected the ML model with the best performance (best AUROC or accuracy).
3 Results
3.1 Study selection
The search strategy employed in this systematic review yielded 3941 articles. Following the removal of duplicates, 2142 articles remained for further assessment. After assessing the abstracts, 250 articles were deemed suitable for full-text screening. A total of 18 articles satisfied the eligibility criteria and were included in the final analysis (Figure 1). Table 1 shows the characteristics and extracted data of the included articles.
3.2 Study characteristics
3.2.1 General features
The 18 included studies were conducted in Switzerland (n=8), China (n=8), and Canada (n=2). A total of 11,733 patients diagnosed with SSD were systematically reviewed in the present study, with diagnostic criteria including Diagnostic and Statistical Manual of Mental Disorders (DSM)-III, IV, and V, International Classification of Diseases (ICD)-9 and 10. Of the patients, 7,330 (62.47%) were male, and 4,403 (37.53%) were female. Three studies included exclusively male participants (38, 43, 44). Except for one study that recruited outpatients (34), all other studies recruited participants from inpatient settings. Among these studies, four employed ML models to predict VB during the current admission (35, 41, 47, 51). Additionally, nine studies categorized patients based on the occurrence of VB prior to their current admission (38–40, 43, 44, 46, 48–50), while another four classified patients into violent and non-violent groups by retrospectively reviewing their medical records since their disease onset (36, 37, 42, 45). Moreover, eight studies were part of a larger project investigating the relationship between SSD and offending and used the same dataset of offender patients as their sample population (39, 41, 42, 45, 46, 48–50).
3.2.2 Input measures
Most of the included studies utilized only sociodemographic and clinical features of patients to predict VB. Of these studies, five evaluated a large number of features (over 100 features) as predictors (39, 41, 45, 49, 50). Tzeng et al. (2004) explored the role of schizophrenia patients’ insight about their disease as a variable in addition to the sociodemographic features to predict the occurrence of VB (34). Additionally, Sun et al. (2021) explored the correlation between different psychotic symptoms and violence among schizophrenia patients (40). Likewise, Kirchebner et al. (2022) analyzed the role of accumulation and types of stressors in the patient’s history in increasing the severity of an offense (42). Furthermore, Machetanz et al. (2022, 2023) in two separate studies evaluated the differences between offender and non-offender SSD patients regarding psychiatric prescription patterns and illness-related factors (46, 49). Also, ten studies analyzed the relationship between different rating tools scores and VB in patients with SSD (36, 38, 39, 41, 43, 45, 46, 48–50), including the Brief Psychiatric Rating Scale (BPRS) (38, 43, 52), the Psychopathy Checklist: Screening Version (PCL-SV), the Historical, Clinical and Risk management (HCR-20) scale (38, 53), The Barratt Impulsiveness Scale version 11 (BIS-11) (38, 54), the Positive And Negative Symptom Scale (PANSS) (36, 39, 41, 43, 45, 46, 48–50, 55), the Social Disability Screening Schedule (SDSS) (43), Insight and Treatment Attitude Questionnaire (ITAQ) (47, 56), Family Adaptation, Partnership, Growth, Affection and Resolve (APGAR) (47, 57), Social Support Rating Scale (SSRS) (47, 58), and Family Burden Scale of Disease (FBS) (47, 59). Furthermore, two studies evaluated neuroimaging data of patients as VB predictors, along with sociodemographic features. Specifically, Gou et al. (2021) attempted to combine three modalities of neuroimaging data – T1-weighted magnetic resonance imaging (MRI), functional magnetic resonance imaging (fMRI), and diffusion tensor imaging (DTI) – with patients’ clinical features to improve the prediction power of the ML model (38). Similarly, Yu et al. (2022) assessed the effects of structural MRI (sMRI) features such as gray matter volume (GMV), cortical surface area, and cortical thickness in differentiating between violent and non-violent schizophrenia patients (44).
Moreover, two other studies examined the role of biochemical markers in indicating VB. Chen et al. (2015) examined the relationship between the violence trajectories, baseline clinical features, and lipid levels to develop a model to predict more violent trajectories (35), while Chen et al. (2020) tried to identify the metabolic characteristic of violent schizophrenia patients, including amino acids, lipids, and carbohydrates metabolism, by performing untargeted metabolomics and analyzing their plasma metabolites (36).
3.2.3 Output measures
The definition of VB varied significantly across studies due to the use of different criteria, scales, or aims. While some studies defined verbal aggression as VB, others only included physical aggression, and some differentiated offenses based on their severity. Four studies utilized the Modified Overt Aggression Scale (MOAS) (60) criteria, but with different thresholds (37, 38, 44, 47): Wang et al. (2020) considered the outcome as physical aggression, irrespective of the aim or the outcome of VB (37), Gou et al. (2021) considered it as physical aggression aimed at others and leading to injury (38), and finally Yu et al. (2022) and Cheng et al. (2023) defined VB as a minimum MOAS score of 5 or 4 respectively, which could be achieved by various VBs without restricting the type or the target of it (44, 47). Additionally, four studies employed different scales for the VB definition: Tzeng et al. (34) used the Violence and Suicide Assessment (VAS-A) (61), Chen et al. (35) utilized the Violence Scale (28), Chen et al. (36) employed the MacArthur Violence Risk Assessment Study (MVRAS) (62), and Watts et al. (51) used the Aggressive Incidents Scale (AIS) (63). Meanwhile, three other studies simply defined VB without the use of any scale: Sun et al. (2021) and You et al. (2022) focused on physical VB aimed at others (40, 43), while Hoffman et al. (2022) included physical VB regardless of the aim (41). On the other hand, six studies used a shared database to distinguish between violent and non-violent offenses (39, 42, 46, 48–50). In a seventh study, they attempted to predict the risk of homicide among other offenses (45).
3.3 Machine learning
3.3.1 Overview of algorithms
None of the 18 studies utilized unsupervised learning (clustering), which is consistent with the nature of the subject – since the classes and the target of classification is given (64). Instead, all of them used supervised learning (classification or regression), with three studies (43, 44, 47) incorporating deep learning through the neural network (NNET) or multi-layer perceptron (MLP) model. Among the top classification methods of supervised learning, support vector machine (SVM) was utilized in fifteen studies, decision trees (including random forests (RF) in fifteen, and k-nearest neighbor (KNN) in eleven. For the top regression methods of supervised learning, logistic regression (LR) (including stepwise LR) was utilized by twelve studies, while least absolute shrinkage and selection operator (LASSO) was used by five. While thirteen studies compared different ML models’ functions in violence prediction, others focused on developing a single prediction model (34–36, 38, 40). See Supplementary Material for detailed information regarding the model development and validation across the reviewed studies.
3.3.2 Model development
In most of the studies, some details were unclear about model development, with few providing information about hyperparameter tuning, an essential part of model development. Hyperparameters are parameters set before the training process begins and affect how the model learns from and generalizes the data (65). Tuning hyperparameters can significantly impact model performance and determine the complexity/flexibility of the model (65). Among the eighteen studies, four provided some explanation about the hyperparameter tuning (34, 35, 38, 47), two used default settings without optimization (41, 45), and the other twelve studies did not mention anything about hyperparameter optimization.
One study did not develop a prediction model but sought to find the best predictors of violence in SSD by using SVM and LR separately (36). Then they identified overlapping best predictors among metabolic biomarkers. By using two different models separately, they aimed to minimize overfitting – a common bias where models fit too closely to the training data, producing good predictions for data points in the training set but do not generalize well to new data, performing poorly on new samples (65) – as it is unlikely for two different algorithms to overfit the same way.
The remaining studies developed and assessed models for violence prediction in SSD. They employed feature selection or cross-validation to overcome overfitting bias and achieve more accurate model development. Seven studies employed data-driven feature selection by ML before model training to control overfitting: one utilized LASSO (38), three used RF (39, 41, 45), one applied boosted tree (42), one utilized both LASSO and LR (43), and one selected features after calculation of variable importance for each employed model separately (51). Sixteen studies used cross-validation, with two using 10-fold cross-validation (43, 44), one using 7-fold (36), nine using 5-fold (37, 39, 41, 42, 45, 46, 48–50), one using 4-fold (47), and one using 3-fold (34). Two studies did not use cross-validation (35, 40).
Furthermore, only sixteen studies acknowledged the implementation of imputation methods on their respective training set data (39, 41, 45, 46, 49–51). Imputation methods refers to techniques for estimating or imputing missing values within datasets to enhance overall completeness and analytical suitability (66). Notably, 5 studies opted for a common practice wherein missing continuous values were imputed with the mean observed values pertaining to the respective variable, while categorical variables underwent replacement with the mode of observed values (39, 41, 45, 46, 49, 50). However, one study imputed missing continuous variables with either the observed mean or median values, concurrently addressing missing categorical variables based on the mode of observed values (51).
The choice of ML models is often influenced by the type of data being used. According to a survey (67), deep learning models, such as NNET and MLP, are commonly employed for interpreting imagery data. Among the studies we reviewed, two specifically utilized brain imaging data to train ML models: In one study, LASSO was employed for image interpretation, while SVM was used for integrating image and clinical data and making final predictions (38). In the other study, seven models, including NNET, were compared to assess their performance (44).
3.3.3 Model validation
Regarding model validation and generalization assessment, six studies reported results on the training set (34–38, 42), while the rest of the studies performed internal validation by evaluating unseen portions of their training set. However, none of the studies conducted external validation using an independent and unseen set of data. This further implies that the prediction accuracy reported in these studies was based on a retrospective estimate rather than a prospective prediction and none of the studies tested their algorithms’ accuracy on future cases.
3.3.4 Models results
Primary outcome measures for evaluating model performance included area under the receiver operating characteristic curve (AUROC), accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV), with AUROC and accuracy being the most frequently used performance metrics. Regarding each metric, the ranges, the proportion of studies reaching values ≥75%, and the best-performing study were as the following: AUROC (0.56 – 0.95; 15/17 studies reached ≥75% (38), reached the best value), sensitivity (8.33 – 95.23%, 11/14 studies reached ≥75% (38, 51), reached the best value), specificity (24.38 – 98.39%, 12/14 studies reached ≥75% (43), reached the best value), accuracy (50.27 – 90.67%, 12/15 studies reached ≥75% (38), reached the best value), PPV (14.58 – 94.45%, 6/10 studies reached ≥75% (45), reached the best value), NPV (20.48 – 99.34%, 8/10 studies reached ≥75% (39, 51), reached the best value). Additionally, twelve studies achieved values above 75% for both AUROC and accuracy.
3.3.5 Models comparison
Running a meta-analysis on diverse studies with varying datasets, features, and variable distributions was impossible; Therefore, we adopted a particular approach to overcome the challenge of integrating and comparing the results of these studies. We specifically targeted studies that were designed to compare different models, as they offered valuable insights for our analysis. By extracting the rankings of different models, we could assess their relative performance, independent of the specific magnitude of each function indicator. This allowed us to overcome the limitations associated with diverse study designs and datasets, enabling a more meaningful comparison (Table 2).
As mentioned earlier, thirteen studies were designed to compare different models (37, 39, 41–51). However, two of these studies utilized imaging data (38, 44), which differed from the data used in the other studies. Since each ML model typically performs well with specific types of data (65), combining the results of these two studies with the others was not appropriate. Therefore, we excluded these studies from the analysis to maintain standardization across the dataset, which left us with eleven studies.
The performance rank of each model across the different studies was aggregated to generate a final rank. This approach allowed us to understand the average success rate of each model. To enhance the interpretability of the results, we took two steps. Firstly, we excluded models used in less than half of the eleven studies. Secondly, we standardized the ranks so that they fell within a range of 0 to 7. (For the studies that compared N models, all rankings were multiplied by 7/N.) By doing so, we ensured that the final ranks accurately reflected the relative performance of each model. A lower final rank indicated a better average performance across the studies.
Finally, in terms of both accuracy and AUROC, the gradient boosting (GB) model consistently achieved the highest performance rank among the six models with a substantial margin compared to the next highest-ranked model. However, given that meta-analysis was not possible, it is not feasible to assess whether this margin was significant or not. This suggests that the GB model shows promising performance in predicting violence among SSD patients using clinical data.
3.4 Discriminative features
Various features were identified in the included studies as the predictor variables of VB in SSD patients. We can classify most of them into sociodemographic, clinical, metabolic, and neuroimaging groups. Most of the features were consistent in multiple studies, except for some discrepancies, which will be elaborated upon.
3.4.1 Sociodemographic features
Some studies identified age (34, 37, 43, 45, 47), gender (34), and educational level (38, 43) as factors that contribute to the prediction of VB. However, other studies reported that these factors do not have a significant relationship with the occurrence of such behavior (35, 37, 44).
3.4.2 Clinical features
Psychotic symptoms are associated with VB in SSD patients. Different studies consistently demonstrate that negative symptoms, such as flat affect and poverty of thought, decrease the risk of VB (35, 40). However, there is inconsistency in the results concerning the impact of positive symptoms on the occurrence of VB. Some studies suggest an increased risk of VB associated with positive symptoms (35, 43), while others propose a diminishing impact of specific positive symptoms, including delusion of persecution and auditory hallucination, on VB occurrence (40). Furthermore, various studies reported that daily dosage of prescribed olanzapine-equivalent at the time of discharge from previous psychiatric hospitalization of SSD patients can predict the occurrence of VB among them (39, 45, 46, 48, 49). However, their results were divergent, with four studies demonstrating a positive association between the olanzapine-equivalent dosage and risk of VB (39, 46, 48, 49), and one study reporting a negative association (45).
Patients’ past stresses also can contribute to VB. Patients who have experienced a higher number of past stressors had an increased risk of engaging in VB (42, 51). Consistently, history of previous outpatient psychiatric treatment was found to be associated with an increased risk of VB in patients (46, 48–50). In addition, specific stressors, including a history of coercive psychiatric treatment and separation from main caregivers in childhood or adolescence, have also been found to be related to VB (42). There is a lack of consensus on the relationship between patients’ employment status and VB. While Kirchebner et al. (2022) found a significant correlation between unemployment and VB (42), Chen et al. (2015) and Wang et al. (2020) reported no statistical relevance between a patient’s employment status and the likelihood of VB (35, 37).
Additionally, scores of several rating tools are significantly associated with VB. The BPRS total score, BPRS hostility score, BPRS withdrawal factors score (38), ITAQ score, family APGAR score, SSRS score, and FBS score (47) were all found to correlate with the risk of VB. Moreover, the PANSS total score at admission and discharge (39, 45), and PANSS anxiety and lack of spontaneity scores (50) are significantly related to VB. Other statistically relevant clinical features are presented in Table 1.
3.4.3 Neuroimaging features
Two studies explored potential neuroimaging features for predicting VB. Gou et al. (2021) identified brain features associated with regional homogeneity (ReHo), gray matter volume (GMV), and fractional anisotropy (FA) as effective predictors of VB in schizophrenia patients (38). Significant GMV alterations were observed in the striatum system (including the putamen and pallidum), median cingulate, and paracingulate gyri, as well as temporal, occipital, and anterior parts of the parietal lobe. In addition, ReHo was most predictive in the anterior cingulate, dorsolateral part of the superior frontal gyrus, temporal pole, parietal lobe, and subcortical areas of the striatum, such as the caudate and pallidum. Also, the left superior longitudinal fasciculus was found to play a crucial role in FA predictions. Overall, the study identified the cingulate gyrus, dorsolateral part of superior frontal gyrus, temporal lobe (inferior temporal gyrus and temporal pole), supplementary motor area, and pallidum as the key regions for predicting VB in schizophrenia patients using sMRI and fMRI (38). On the other hand, Yu et al. (2022) found that the measurement of whole-brain GMV, right areas of superior temporal sulcus cortical thickness, right inferior parietal cortical thickness, and left frontal pole GMV correlated to the likelihood of violent tendencies (44).
3.4.4 Metabolic features
Three plasma metabolites were recognized as potentially effective biomarkers for predicting VB. In the study by Chen et al. (2022) the ratio of L-asparagine to L-aspartic acid, vanillylmandelic acid, and glutaric acid was found to be associated with an increased likelihood of VB (36). Specifically, a decrease in the ratio of L-asparagine to L-aspartic acid and glutaric acid level and an increase in the vanillylmandelic acid level appear to be correlated with violent tendencies. Furthermore, altered specific metabolic pathways seemed to predispose individuals toward violence. Specifically, the glycerolipid metabolism pathway, characterized by an up-regulation of glycerol and a down-regulation of glycerol-3-phosphate, and the phenylalanine, tyrosine, and tryptophan biosynthesis pathway, marked by a down-regulation of 4-hydroxyphenylpyruvic, have been associated with violent tendencies (36). Moreover, it has been demonstrated that raised triglyceride levels were associated with a reduced likelihood of engaging in VB (35).
3.5 Risk of bias assessment
Based on the results of our ROB assessment using the PROBAST guidelines, all studies except for two (43, 47), had some bias due to a small sample size, different violence definitions, and the inability to satisfy the study’s purpose. Although most of the studies had high ROB, the most important limitation arises from their limited sample sizes. According to the PROBAST guidelines, to achieve a low ROB in the analysis domain, the number of participants with the outcome relative to the number of the input variables should be equal to or higher than 20 (33). Only four reviewed studies had low ROB in the analysis domain (41, 42, 44, 47). Another reason for high ROB arises from the divergent definitions of violence and the use of different scales across the studies. We defined VB as an attempt or action to harm a target, assault, child or sexual abuse, and violent crimes. Whereas, Hofmann et al. (2022) included verbal aggression in the definition of VB (41) and four studies evaluated the ability of ML models to classify patients with previous criminal offenses (including VB) from non-offenders (46, 48–50). Also, some studies have evaluated the power of ML models in predicting VB (e.g., homicide) among offenders with SSD disorders (39, 42, 45). Although many studies represented high ROB in at least one field according to the PROBAST guideline (33), most of them (11/18) showed low concerns regarding applicability in the field of violence prediction in patients with SSD. Table 3 and Figure 2 illustrate the results of the quality assessment process.
4 Discussion
4.1 Key findings
Previous research has shown an acceptable power for ML models in predicting VB in populations broader than SSD patients (68, 69). In this article, we reviewed the role of ML in predicting VB in patients with SSD. According to our findings, the predictive performances of the ML models varied across the reviewed papers. However, ML models performed better in studies that employed more intricate methodologies for model development and evaluation. These findings suggest that a well-designed ML model could be a potential tool for VB prediction in SSD patients, and could be beneficial in warning the caregivers to seek prevention techniques and stop them from further harmful acts in clinical and forensic settings. Among the reviewed ML models, GB showed the best performance in VB detection. Also, we reviewed the most discriminating features in violence prediction of SSD patients. Age (34, 37, 43, 45, 47) and olanzapine equivalent dose at the time of discharge (39, 45, 46, 48, 49) were the most repetitive variables found to be associated with violence across the studies.
4.2 Machine learning models
While direct comparison of results among studies was challenging due to the differences in sample characteristics, some insights were obtained. First, about two third of the studies (11/18) could reach values above 75% for both AUROC and accuracy, indicating that ML can be a promising tool for the accurate prediction of VB among SSD patients. Second, the studies demonstrated diverse performance in predicting VB among SSD patients, with an AUROC ranging from 0.56 to 0.95 and an accuracy range of 50.27% to 90.67%. However, the performance ranges within each study were narrower when comparing different ML models. Considering that many studies employed similar ML models and input variables, the observed diversity in performance appears to be partly influenced by the variations in study designs. This suggests that future similar studies could enhance their results not only by focusing on ML model selection or input variable choices but also by paying attention to the details of model development to mitigate biases.
In addition, there exists considerable divergence among the reviewed studies with regard to the methodologies employed for both feature selection and cross-validation. These two components play pivotal roles in the trajectory of ML model development, serving to mitigate overfitting and augment overall model performance (70). Within the included studies, a mere seven undertook data-driven feature selection utilizing ML techniques prior to model training as a preemptive measure against overfitting (38, 39, 41–43, 45). Notably, one study adopted a post hoc approach, selecting features subsequent to the computation of variable importance for each employed model independently (51). Additionally, sixteen studies embraced diverse methods for cross-validation, while two studies opted to forgo its application (35, 40). This heterogeneity in model development practices across the reviewed studies poses a significant obstacle to synthesizing their respective findings.
Therefore, our significant challenge was comparing ML models by integrating the results of different studies due to variations in sample characteristics, including differences in input and output variables distribution. To address this challenge, we devised a ranking method that enabled us to assess the overall success rate of commonly used methods. Based on our findings, the GB model exhibited notably superior average performance. However, it is essential to note that this does not necessarily imply inherent weakness in the other models. Instead, it highlights the favorable results achieved by the GB model in the specific context of the studied field.
GB is a subset of ensemble machine learning models, which also includes common models like classification trees and RF (32). This approach enables the effective handling of big data and also the handling of missing values in the predictors (32). While common ensemble techniques like RF rely on straightforward averaging of models within the ensemble, GB stands out for its step-by-step, sequential strategy for selecting the best predictor (71). This notable flexibility empowers GB to be highly adaptable to specific data-driven tasks (71). Due to its unique characteristics, GB outperformed other ML models in predicting VB in SSD patients, particularly due to its effective handling of a large number of predictors. Nevertheless, it is noteworthy that several other ML models, including SVM, LASSO, NNET, RF, decision trees, PDA, MLP, elastic net, and LR, in the studies reviewed, also achieved AUROC values exceeding 0.9 (41, 44, 47, 51). This highlights the substantial predictive potential of these alternative ML models in addition to GB.
4.3 Discriminative features
Notably, various studies have explored the influence of age on VB risk, yielding diverse findings. For instance, Tzeng et al. (2004) associated younger age with a higher risk of VB (34). In contrast, four other studies observed that older age correlated with an increased tendency for VB in SSD patients (37, 44, 45, 47). Yet, Chen et al. (2015) found no significant correlation between patients’ age and VB risk. While the majority of previous research aligns with Chen et al. (2015) and negates the association between age and VB risk (35, 72), there are outliers such as Soyka et al. (2007) who identified older ages as linked to a higher VB risk in SSD patients (73). This variability underscores the need for further research to ascertain the precise impact of age on VB occurrence in SSD patients.
Contrary to age, the reported gender effect on VB occurrence risk was quite consistent among studies, which showed that male sex was associated with a higher risk of VB (34). These findings confirmed most of the previous studies that reported a higher prevalence of VB among male SSD patients (73, 74), as the general population (17). Furthermore, Gou et al. (2021) and Yu et al. (2022), but not Chen et al. (2015) and Yu et al. (2022a), found that lower educational levels could predict VB occurrence (38, 43). This was in line with previous research that found lower educational levels to be significant predictors of VB among SSD patients (75, 76) and the general population (17). These disparities suggest that further studies on larger populations are required to determine the exact effect of the educational level of SSD patients on their VB tendency.
Moreover, in terms of the effect of occupational status on VB tendency, Kirchebner et al. (2022) reported a significant relationship between unemployment and VB in SSD patients (42), which confirmed the findings by Karabekiroğlu et al. (2016). Conversely, Chen et al. (2015) and Wang et al. (2020) found no correlation between employment status and VB (35, 37). This divergence could be a result of different definitions of violence in these studies; indeed, Kirchebner et al. (2022) and Karabekiroğlu et al. (2016) studies, unemployment was able to differentiate SSD patients with serious VB (e.g., homicide) from patients with minor VB (e.g., property damage). On the other hand, two other studies trained ML models to differentiate SSD patients with any kind of VB (serious or minor) from patients without VB (35, 37). This suggests that unemployment does not seem to be associated with the overall risk of VB among SSD patients, but it increases the risk of serious VB among offenders.
Regarding the clinical features, two studies reported that the presence of positive symptoms (35, 43) was correlated with an increased risk of VB, which was consistent with previous research (72, 74, 77). However, another study suggested that the presence of specific positive symptoms, including delusion of persecution and auditory hallucination, decreases the risk of VB (40). This controversy indicates that different types of delusion may have varying effects on the occurrence of VB (40). Also, Sonnweber et al. (2021, 2022) found a favorable predictive power for the PANSS total score of the patients in two different studies (39, 45). This is in line with previous studies that demonstrated higher PANSS total scores in violent patients, compared to non-violent patients (78, 79). Moreover, consistent with previous research (80), Gou et al. (2021) found that the risk of VB occurrence is higher among SSD patients with higher scores in the BPRS hostility subscale. Furthermore, higher scores in BPRS total score, BPRS withdrawal factors, PCL-SV, HCR-20 (38), and SDSS (43) successfully predicted VB in SSD patients across the reviewed studies. While the BPRS and PANSS scales assess various domains of SSD, including positive and negative symptoms (55, 81), the PCL-SV scale is specifically designed to evaluate psychopathic traits in patients, which is not directly associated with SSD (82). This indicates that aside from psychotic symptoms, additional symptoms like patients’ personality profiles, including psychopathy and impulsivity, may have relevance in predicting VB among individuals with SSD. Altogether, these suggest that by training ML models with certified psychiatric rating tools, we can significantly improve the accuracy of predicting VB in SSD patients, which can be highly beneficial in clinical applications.
Chen et al. (2015) found negative symptoms to be correlated with a decreased risk of VB (35), which is in line with previous studies that found depressive and other negative symptoms to be associated with a lower occurrence of VB in SSD patients (73, 77). Furthermore, the effect of the age of disease onset was controversial across the reviewed studies. While Sonnweber et al. (2021, 2022) reported that younger age of disease onset correlated with the probability of VB, Chen et al. (2015) and Wang et al. (2020) did not find a significant relationship between the age of disease onset and VB occurrence. The findings of previous research in this field are also divergent. Indeed, while Caqueo-Urízar et al. (2016) found VB to be more prevalent among patients with younger age of illness onset, Nolan et al. (1999) did not find any significant differences between the age of onset of violent and non-violent patients. Therefore, further research is warranted to determine the effects of disease onset on the VB occurrence in SSD patients, as it can help the early detection and treatment of patients at higher risk of VB.
While most studies evaluating the prescribed daily olanzapine-equivalent dose at the time of discharge from previous hospitalizations have reported a positive association with the risk of VB (39, 46, 48, 49), there is an exception in one study that reported the opposite (45). The divergence in findings can be attributed to the different focus of the Sonnweber (2022) study, which specifically differentiated between homicide committers and patients who committed other types of VB (45). It is logical to assume that higher doses of antipsychotics are prescribed to patients with more enduring symptoms, as they are reported to be more prone to engaging in VB in some studies (83). However, some previous studies found no significant association between the disease severity or prescribed dosage of antipsychotics and the risk of VB in patients with SSD (84, 85). This highlights the need for further research to better comprehend the relationship between disease severity and prescribed antipsychotic dosages in the occurrence of VB among SSD patients.
Previous research has shown that SSD patients’ previous history of violence is significantly correlated with increased risks of VB, such as recent violence episodes (86), history of a recent assault (87), previous history of aggression (74), and a previous violent conviction (87). Consistently, Sonnweber et al. (2021) reported previous conviction history as a significant predictor of VB in SSD patients (39). Moreover, Wang et al. (2020) found that a history of more than five times of hospitalization increased the likelihood of VB tendency in patients. However, Tzeng et al. (2004) reported that the lifetime number of hospitalizations was not correlated with an increased risk of VB occurrence in SSD patients. This disparity could be due to the differences in the psychiatric history assessment across the studies, as Tzeng et al. (2004) evaluated a broader variable (lifetime hospitalization), while other studies assessed the recent hospitalization history (88, 89) or a more distinguishing variable (≥ 5-lifetime hospital admissions) (37).
Finally, two studies have observed that neuroimaging variables were robust predictors of VB in SSD patients. Yu et al. (2022) found decreased whole-brain gray matter volume, right inferior parietal thickness, and left frontal pole volume to be predictors of VB. Consistently, Gou et al. (2021) reported disruption in the structural and functional MRI of the temporal, frontal lobes, cingulate gyrus, and striatum can predict VB in SSD patients. Also, a systematic review of 21 studies, revealed that reduced volumes of the frontal lobe in patients with schizophrenia are associated with a higher rate of VB occurrence (90). This is not surprising, as previous research mentioned a prominent role for frontal and temporal lobes and cingulate gyrus disruptions in developing VB (91). Considering the role of the frontal cortex in controlling disinhibited behaviors (e.g., impulsiveness, aggressiveness, and violence), patients with disrupted frontal cortex are more likely to present VB (91, 92). Although previous research established the involvement of the hippocampus and amygdala in emotional processing and in the development of VB (91), the predictive value of these regions was not assessed across the reviewed studies. In conclusion, our knowledge in the field of ML-based prediction of VB in SSD patients by training MRI data is still limited, and future research is required to clarify its potential.
4.4 Limitations and further directions
This study has some limitations. First, the sample sizes of most studies were small, considering the number of input variables, which can influence their analysis results. Second, the study samples across the reviewed articles were heterogeneous, as most of them studied clinical inpatients, while some studied forensic inpatients, and one included only outpatients’ data. Also, some studies only included male patients. Third, the outcome definitions differed within studies. For example, while most of the studies classified SSD patients into violent and non-violent, some others distinguished patients with serious types of VB (e.g., homicide) from other types of VB. Fourth, the reviewed studies were conducted in countries with different healthcare systems, which could have a significant impact on violence among SSD patients. Fifth, most of the studies did not select time-dependent features for VB prediction, which substantially lowers the ML model performance. Finally, none of the reviewed articles performed external validation, which can significantly diminish the generalizability of their findings. Therefore, future research with more homogenous methodologies and both internal and external validations seems to be necessary.
4.5 Conclusions
The outcomes of the ML models employed by the reviewed studies have yielded compelling findings, highlighting the significance of continuing along this research trajectory for further exploration and advancement. More in detail, while the ML models’ performance in VB prediction among SSD patients was divergent, yet promising, our comparative analysis demonstrated that GB outperformed other ML models. Considering the heterogeneity of ML model applications and study populations across the reviewed articles, there is substantial potential for further research in this field. Furthermore, the absence of external validation in the majority of the included articles reduces the generalizability of their findings. Indeed, subsequent research endeavors, employing comparable models, outcomes, and predictors, in extensive clinical samples, are imperative to substantiate the certainty of the current findings and ascertain the applicability of the developed ML algorithms.
Moreover, given the rapidly growing trend in the application of various artificial intelligence tools in medical contexts, it appears likely that in the next years ML models can be also utilized for VB prediction in SSD patients. Indeed, while the performance of ML models varied across the reviewed studies; several models demonstrated excellent predictive abilities with an AUROC exceeding 0.9. This highlights the potential for developing reliable ML models through further well-designed studies. Upon validation through external assessments, these models could effectively predict VB in real-world clinical settings. Consequently, the development of clinical assessment tools integrating patient data could facilitate the early identification of individuals highly susceptible to VB, whether in outpatient or inpatient settings. The utilization of such tools enables timely preventive interventions, such as providing social support and rehabilitation, adjusting medications, and considering more intensive therapeutic approaches, like electroconvulsive therapy. Implementing these measures could significantly alleviate the burden of VB on patients, healthcare systems, and society at large.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.
Author contributions
MP: Conceptualization, Investigation, Project administration, Resources, Visualization, Writing – original draft. AA: Data curation, Investigation, Visualization, Writing – original draft. MT: Data curation, Formal Analysis, Writing – original draft. HS: Writing – original draft. GC: Funding acquisition, Supervision, Writing – review & editing. FS: Supervision, Writing – review & editing. AP: Data curation, Writing – review & editing. PB: Project administration, Supervision, Writing – review & editing. GD: Funding acquisition, Methodology, Supervision, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. GC was supported by a grant from Cassa di Risparmio di Padova e Rovigo (CARIPARO). The study was partially supported by the Italian Ministry of Health (ricerca corrente 2023).
Conflict of interest
The authors declare the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2024.1384828/full#supplementary-material
Glossary
References
1. Mancuso SG, Morgan VA, Mitchell PB, Berk M, Young A, Castle DJ. A comparison of schizophrenia, schizoaffective disorder, and bipolar disorder: Results from the Second Australian national psychosis survey. J Affect Disord. (2015) 172:30–7. doi: 10.1016/j.jad.2014.09.035
2. Karbalaee M, Jameie M, Amanollahi M, TaghaviZanjani F, Parsaei M, Basti FA, et al. Efficacy and safety of adjunctive therapy with fingolimod in patients with schizophrenia: A randomized, double-blind, placebo-controlled clinical trial. Schizophr Res. (2023) 254:92–8. doi: 10.1016/j.schres.2023.02.020
3. Millier A, Schmidt U, Angermeyer MC, Chauhan D, Murthy V, Toumi M, et al. Humanistic burden in schizophrenia: a literature review. J Psychiatr Res. (2014) 54:85–93. doi: 10.1016/j.jpsychires.2014.03.021
4. James A. Stigma of mental illness. Foreword. Lancet. (1998) 352:1048. doi: 10.1016/S0140-6736(98)00019-1
5. Guo Y, Yang X, Wang D, Fan R, Liang Y, Wang R, et al. Prevalence of violence to others among individuals with schizophrenia in China: A systematic review and meta-analysis. Front Psychiatry. (2022) 13:939329. doi: 10.3389/fpsyt.2022.939329
6. Simpson AI, Penney SR, Jones RM. Homicide associated with psychotic illness: What global temporal trends tell us about the association between mental illness and violence. Aust N Z J Psychiatry. (2022) 56:1384–8. doi: 10.1177/00048674211067164
7. Whiting D, Gulati G, Geddes JR, Fazel S. Association of schizophrenia spectrum disorders and violence perpetration in adults and adolescents from 15 countries: A systematic review and meta-analysis. JAMA Psychiatry. (2022) 79:120–32. doi: 10.1001/jamapsychiatry.2021.3721
8. Moghaddam HS, Parsaei M, Taghavizanjani F, Cattarinussi G, Aarabi MH, Sambataro F. White matter alterations in affective and non-affective early psychosis: A diffusion MRI study. J Affect Disord. (2024) 351:615–23. doi: 10.1016/j.jad.2024.01.238
9. Monahan J, Vesselinov R, Robbins PC, Appelbaum PS. Violence to others, violent self-victimization, and violent victimization by others among persons with a mental illness. Psychiatr Serv. (2017) 68:516–9. doi: 10.1176/appi.ps.201600135
10. Whiting D, Lichtenstein P, Fazel S. Violence and mental disorders: a structured review of associations by individual diagnoses, risk factors, and risk assessment. Lancet Psychiatry. (2021) 8:150–61. doi: 10.1016/S2215-0366(20)30262-5
11. Kageyama M, Solomon P. Post-traumatic stress disorder in parents of patients with schizophrenia following familial violence. PloS One. (2018) 13:e0198164. doi: 10.1371/journal.pone.0198164
12. Tasa-Vinyals E, Álvarez MJ, Puigoriol-Juvanteny E, Roura-Poch P, García-Eslava JS, Escoté-Llobet S. Intimate partner violence among patients diagnosed with severe mental disorder. J Nerv Ment Dis. (2020) 208:749–54. doi: 10.1097/NMD.0000000000001207
13. Cloutier M, Aigbogun MS, Guerin A, Nitulescu R, Ramanakumar AV, Kamat SA, et al. The economic burden of schizophrenia in the United States in 2013. J Clin Psychiatry. (2016) 77:764–71. doi: 10.4088/JCP.15m10278
14. Halmai T, Tényi T, Gonda X. Symptom profiles and parental bonding in homicidal versus non-violent male schizophrenia patients. Ideggyogy Sz. (2017) 70:43–52. doi: 10.18071/isz.70.0043
15. Oakley C, Harris S, Fahy T, Murphy D, Picchioni M. Childhood adversity and conduct disorder: A developmental pathway to violence in schizophrenia. Schizophr Res. (2016) 172:54–9. doi: 10.1016/j.schres.2016.01.047
16. Witt K, van Dorn R, Fazel S. Risk factors for violence in psychosis: systematic review and meta-regression analysis of 110 studies. PloS One. (2013) 8:e55942. doi: 10.1371/journal.pone.0055942
17. Coid JW, Ullrich S, Kallis C, Freestone M, Gonzalez R, Bui L, et al. Improving risk management for violence in mental health services: a multimethods approach. Programme Grants Appl Res. 2016 4(16). doi: 10.3310/pgfar04160
18. Iniesta R, Stahl D, McGuffin P. Machine learning, statistical learning and the future of biological research in psychiatry. Psychol Med. (2016) 46:2455–65. doi: 10.1017/S0033291716001367
19. Parsaei M, Taghavizanjani F, Cattarinussi G, Moghaddam HS, Di Camillo F, Akhondzadeh S, et al. Classification of suicidality by training supervised machine learning models with brain MRI findings: A systematic review. J Affect Disord. (2023) 340:766–91. doi: 10.1016/j.jad.2023.08.034
20. Günther MP, Kirchebner J, Lau S. Identifying direct coercion in a high risk subgroup of offender patients with schizophrenia via machine learning algorithms. Front Psychiatry. (2020) 11:415. doi: 10.3389/fpsyt.2020.00415
21. Jiménez S, Angeles-Valdez D, Rodríguez-Delgado A, Fresán A, Miranda E, Alcalá-Lozano R, et al. Machine learning detects predictors of symptom severity and impulsivity after dialectical behavior therapy skills training group in borderline personality disorder. J Psychiatr Res. (2022) 151:42–9. doi: 10.1016/j.jpsychires.2022.03.063
22. Verrey J, Ariel B, Harinam V, Dillon L. Using machine learning to forecast domestic homicide via police data and super learning. Sci Rep. (2023) 13:22932. doi: 10.1038/s41598-023-50274-2
23. Vijeikis R, Raudonis V, Dervinis G. Efficient violence detection in surveillance. Sensors. (2022) 22:2216. doi: 10.3390/s22062216
24. Bakhshi A, García-Gómez J, Gil-Pita R, Chalup S. Violence detection in real-life audio signals using lightweight deep neural networks. Proc Comput Science. (2023) 222:244–51. doi: 10.1016/j.procs.2023.08.162
25. Gould C, Mufamadi D. Costs and benefits of preventing violence. ISS South Afr Rep. (2021) 2021:1–14.
26. Verma AA, Murray J, Greiner R, Cohen JP, Shojania KG, Ghassemi M, et al. Implementing machine learning in medicine. Cmaj. (2021) 193:E1351–e7. doi: 10.1503/cmaj.202434
27. Wright M. Schizophrenia and schizophrenia spectrum disorders. JAAPA. (2020) 33:46–7. doi: 10.1097/01.JAA.0000662412.51169.bf
28. Morrison EF. The measurement of aggression and violence in hospitalized psychiatric patients. Int J Nurs Stud. (1993) 30:51–64. doi: 10.1016/0020-7489(93)90092-9
29. Monahan J, Steadman H, Silver E, Appelbaum P, Robbins P, Mulvey E, et al. Rethinking risk assessment: the macArthur study of mental disorder and violence. Thomas Grisso. (2002) 1:147–52. doi: 10.1093/oso/9780195138825.001.0001
30. Stewart LA, Clarke M, Rovers M, Riley RD, Simmonds M, Stewart G, et al. Preferred Reporting Items for Systematic Review and Meta-Analyses of individual participant data: the PRISMA-IPD Statement. Jama. (2015) 313:1657–65. doi: 10.1001/jama.2015.3656
31. Moons KG, de Groot JA, Bouwmeester W, Vergouwe Y, Mallett S, Altman DG, et al. Critical appraisal and data extraction for systematic reviews of prediction modelling studies: the CHARMS checklist. PloS Med. (2014) 11:e1001744. doi: 10.1371/journal.pmed.1001744
32. Arbet J, Brokamp C, Meinzen-Derr J, Trinkley KE, Spratt HM. Lessons and tips for designing a machine learning study using EHR data. J Clin Transl Sci. (2020) 5:e21. doi: 10.1017/cts.2020.513
33. Wolff RF, Moons KGM, Riley RD, Whiting PF, Westwood M, Collins GS, et al. PROBAST: A tool to assess the risk of bias and applicability of prediction model studies. Ann Intern Med. (2019) 170:51–8. doi: 10.7326/M18-1376
34. Tzeng H-M, Lin Y-L, Hsieh J-G. Forecasting violent behaviors for schizophrenic outpatients using their disease insights: development of a binary logistic regression model and a support vector model. Int J Ment Health. (2004) 33:17–31. doi: 10.1080/00207411.2004.11043366
35. Chen SC, Chu NH, Hwu HG, Chen WJ. Trajectory classes of violent behavior and their relationship to lipid levels in schizophrenia inpatients. J Psychiatr Res. (2015) 66-67:105–11. doi: 10.1016/j.jpsychires.2015.04.022
36. Chen X, Xu J, Tang J, Dai X, Huang H, Cao R, et al. Dysregulation of amino acids and lipids metabolism in schizophrenia with violence. BMC Psychiatry. (2020) 20:97. doi: 10.1186/s12888-020-02499-y
37. Wang KZ, Bani-Fatemi A, Adanty C, Harripaul R, Griffiths J, Kolla N, et al. Prediction of physical violence in schizophrenia with machine learning algorithms. Psychiatry Res. (2020) 289:112960. doi: 10.1016/j.psychres.2020.112960
38. Gou N, Xiang Y, Zhou J, Zhang S, Zhong S, Lu J, et al. Identification of violent patients with schizophrenia using a hybrid machine learning approach at the individual level. Psychiatry Res. (2021) 306:114294. doi: 10.1016/j.psychres.2021.114294
39. Sonnweber M, Lau S, Kirchebner J. Violent and non-violent offending in patients with schizophrenia: Exploring influences and differences via machine learning. Compr Psychiatry. (2021) 107:152238. doi: 10.1016/j.comppsych.2021.152238
40. Sun L, Han X, Wang K, Xu C, Song Z, Zhang J, et al. Candidate symptomatic markers for predicting violence in schizophrenia: A cross-sectional study of 7711 patients in a Chinese population. Asian J Psychiatr. (2021) 59:102645. doi: 10.1016/j.ajp.2021.102645
41. Hofmann LA, Lau S, Kirchebner J. Advantages of machine learning in forensic psychiatric research-uncovering the complexities of aggressive behavior in schizophrenia. Appl Sciences-Basel. (2022) 12:819. doi: 10.3390/app12020819
42. Kirchebner J, Sonnweber M, Nater UM, Günther M, Lau S. Stress, schizophrenia, and violence: A machine learning approach. J Interpers Violence. (2022) 37:602–22. doi: 10.1177/0886260520913641
43. Yu T, Zhang X, Liu X, Xu C, Deng C. The prediction and influential factors of violence in male schizophrenia patients with machine learning algorithms. Front Psychiatry. (2022) 13:799899. doi: 10.3389/fpsyt.2022.799899
44. Yu T, Pei W, Xu C, Zhang X, Deng C. Prediction of violence in male schizophrenia using sMRI, based on machine learning algorithms. BMC Psychiatry. (2022) 22:676. doi: 10.1186/s12888-022-04331-1
45. Sonnweber M, Lau S, Kirchebner J. Exploring characteristics of homicide offenders with schizophrenia spectrum disorders via machine learning. Int J Offender Ther Comp Criminol. (2022), 306624x221102799. doi: 10.21428/cb6ab371
46. Machetanz L, Günther MP, Lau S, Kirchebner J. High risk, high dose?-pharmacotherapeutic prescription patterns of offender and non-offender patients with schizophrenia spectrum disorder. Biomedicines. (2022) 10:3243. doi: 10.3390/biomedicines10123243
47. Cheng N, Guo M, Yan F, Guo Z, Meng J, Ning K, et al. Application of machine learning in predicting aggressive behaviors from hospitalized patients with schizophrenia. Front Psychiatry. (2023) 14:1016586. doi: 10.3389/fpsyt.2023.1016586
48. Kirchebner J, Lau S, Machetanz L. Offenders and non-offenders with schizophrenia spectrum disorders: Do they really differ in known risk factors for aggression? Front Psychiatry. (2023) 14:1145644. doi: 10.3389/fpsyt.2023.1145644
49. Machetanz L, Hofmann AB, Möhrke J, Kirchebner J. Offenders and non-offenders with schizophrenia spectrum disorders: the crime-preventive potential of sufficient embedment in the mental healthcare and support system. Front Psychiatry. (2023) 14:1231851. doi: 10.3389/fpsyt.2023.1231851
50. Machetanz L, Lau S, Habermeyer E, Kirchebner J. Suicidal offenders and non-offenders with schizophrenia spectrum disorders: A retrospective evaluation of distinguishing factors using machine learning. Brain Sci. (2023) 13:97. doi: 10.3390/brainsci13010097
51. Watts D, Mamak M, Moulden H, Upfold C, de Azevedo Cardoso T, Kapczinski F, et al. The HARM models: Predicting longitudinal physical aggression in patients with schizophrenia at an individual level. J Psychiatr Res. (2023) 161:91–8. doi: 10.1016/j.jpsychires.2023.02.030
52. Woerner MG, Mannuzza S, Kane JM. Anchoring the BPRS: an aid to improved reliability. Psychopharmacol Bull. (1988) 24:112–7.
53. Telles LE, Day VP, Folino JO, Taborda JG. Reliability of the Brazilian version of HCR-20 assessing risk for violence. Braz J Psychiatry. (2009) 31:253–6. doi: 10.1590/S1516-44462009005000001
54. Stanford MS, Mathias CW, Dougherty DM, Lake SL, Anderson NE, Patton JH. Fifty years of the Barratt Impulsiveness Scale: An update and review. Pers Individ Differences. (2009) 47:385–95. doi: 10.1016/j.paid.2009.04.008
55. Kay SR, Fiszbein A, Opler LA. The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophr Bull. (1987) 13:261–76. doi: 10.1093/schbul/13.2.261
56. McEvoy JP, Apperson LJ, Appelbaum PS, Ortlip P, Brecosky J, Hammill K, et al. Insight in schizophrenia. Its relationship to acute psychopathology. J Nerv Ment Dis. (1989) 177:43–7. doi: 10.1097/00005053-198901000-00007
57. Smilkstein G, Ashworth C, Montano D. Validity and reliability of the family APGAR as a test of family function. J Fam Pract. (1982) 15:303–11.
58. Xiao S-Y. The theoretical basis and research application of social support rating scale. J Clin Psychiatry. (1994) 4:98–100.
59. Pai S, Kapur RL. Impact of treatment intervention on the relationship between dimensions of clinical psychopathology, social dysfunction and burden on the family of psychiatric patients. Psychol Med. (1982) 12:651–8. doi: 10.1017/S0033291700055756
60. Kay SR, Wolkenfeld F, Murrill LM. Profiles of aggression among psychiatric patients. I. Nature and prevalence. J Nerv Ment Dis. (1988) 176:539–46. doi: 10.1097/00005053-198809000-00007
61. Feinstein R, Plutchik R. Violence and suicide risk assessment in the psychiatric emergency room. Compr Psychiatry. (1990) 31:337–43. doi: 10.1016/0010-440X(90)90040-Y
62. Monahan J, Appelbaum PS, Mulvey EP, Robbins PC, Lidz CW. Ethical and legal duties in conducting research on violence: lessons from the MacArthur Risk Assessment Study. Violence Vict. (1993) 8:387–96. doi: 10.1891/0886-6708.8.4.387
63. Cook AN, Moulden HM, Mamak M, Lalani S, Messina K, Chaimowitz G. Validating the hamilton anatomy of risk management–forensic version and the aggressive incidents scale. Assessment. (2016) 25:432–45. doi: 10.1177/1073191116653828
64. Hamsagayathri P, Vigneshwaran S. (2021). Symptoms based disease prediction using machine learning techniques, in: 2021 Third international conference on intelligent communication technologies and virtual mobile networks (ICICV) Tirunelveli, Indiadoi: IEEE. doi: 10.1109/ICICV50876.2021.9388603
66. Emmanuel T, Maupong T, Mpoeleng D, Semong T, Mphago B, Tabona O. A survey on missing data in machine learning. J Big Data. (2021) 8:140. doi: 10.1186/s40537-021-00516-9
67. Garg A, Mago V. Role of machine learning in medical research: A survey. Comput Sci review. (2021) 40:100370. doi: 10.1016/j.cosrev.2021.100370
68. Parmigiani G, Barchielli B, Casale S, Mancini T, Ferracuti S. The impact of machine learning in predicting risk of violence: A systematic review. Front Psychiatry. (2022) 13:1015914. doi: 10.3389/fpsyt.2022.1015914
69. Tay JL, Li Z, Sim K. Effectiveness of artificial intelligence methods in personalized aggression risk prediction within inpatient psychiatric treatment settings-A systematic review. J Pers Med. (2022) 12:1470. doi: 10.3390/jpm12091470
70. Meyer H, Reudenbach C, Hengl T, Katurji M, Nauss T. Improving performance of spatio-temporal machine learning models using forward feature selection and target-oriented validation. Environ Model Software. (2018) 101:1–9. doi: 10.1016/j.envsoft.2017.12.001
71. Natekin A, Knoll A. Gradient boosting machines, a tutorial. Front Neurorobot. (2013) 7:21. doi: 10.3389/fnbot.2013.00021
72. Martínez-Martín N, Fraguas D, García-Portilla MP, Sáiz PA, Bascarán MT, Arango C, et al. Self-perceived needs are related to violent behavior among schizophrenia outpatients. J Nerv Ment Dis. (2011) 199:666–71. doi: 10.1097/NMD.0b013e318229d0d5
73. Soyka M, Graz C, Bottlender R, Dirschedl P, Schoech H. Clinical correlates of later violence and criminal offences in schizophrenia. Schizophr Res. (2007) 94:89–98. doi: 10.1016/j.schres.2007.03.027
74. Araya T, Ebnemelek E, Getachew R. Prevalence and Associated Factors of Aggressive Behavior among Patients with Schizophrenia at Ayder Comprehensive Specialized Hospital, Ethiopia. BioMed Res Int. (2020) 2020:7571939. doi: 10.1155/2020/7571939
75. Karabekiroğlu A, Pazvantoğlu O, Karabekiroğlu K, Böke Ö, Korkmaz IZ. Associations with violent and homicidal behaviour among men with schizophrenia. Nord J Psychiatry. (2016) 70:303–8. doi: 10.3109/08039488.2015.1109139
76. Wang J, Li C, Zhu XM, Zhang SM, Zhou JS, Li QG, et al. Association between schizophrenia and violence among Chinese female offenders. Sci Rep. (2017) 7:818. doi: 10.1038/s41598-017-00975-2
77. Swanson JW, Swartz MS, Van Dorn RA, Elbogen EB, Wagner HR, Rosenheck RA, et al. A national study of violent behavior in persons with schizophrenia. Arch Gen Psychiatry. (2006) 63:490–9. doi: 10.1001/archpsyc.63.5.490
78. Arango C, Calcedo Barba A, González S, Calcedo Ordóñez A. Violence in inpatients with schizophrenia: a prospective study. Schizophr Bull. (1999) 25:493–503. doi: 10.1093/oxfordjournals.schbul.a033396
79. Yi Y, Huang Y, Chen Q, Yang H, Li H, Feng Y, et al. Violence, neurocognitive function and clinical correlates in patients with schizophrenia. Front Psychiatry. (2022) 13:1087372. doi: 10.3389/fpsyt.2022.1087372
80. Abushua'leh K, Abu-Akel A. Association of psychopathic traits and symptomatology with violence in patients with schizophrenia. Psychiatry Res. (2006) 143:205–11. doi: 10.1016/j.psychres.2005.05.017
81. Shafer A. Meta-analysis of the brief psychiatric rating scale factor structure. Psychol Assess. (2005) 17:324–35. doi: 10.1037/1040-3590.17.3.324
82. Hare R, Hart S, Cox D. The hare psychopathy checklist: screening version (PCL-SV). Toronto: Multi-Health Systems Inc (1995).
83. Cho W, Shin WS, An I, Bang M, Cho DY, Lee SH. Biological aspects of aggression and violence in schizophrenia. Clin Psychopharmacol Neurosci. (2019) 17:475–86. doi: 10.9758/cpn.2019.17.4.475
84. Ullrich S, Keers R, Coid JW. Delusions, anger, and serious violence: new findings from the MacArthur Violence Risk Assessment Study. Schizophr Bull. (2014) 40:1174–81. doi: 10.1093/schbul/sbt126
85. Tasmim S, Kolla NJ, Dada O, Bani-Fatemi A, De Luca V. Correlation between violence and antipsychotic dosage in schizophrenia: A secondary analysis of the clinical antipsychotic trials for intervention effectiveness (CATIE) dataset. Pharmacopsychiatry. (2019) 52:217–21. doi: 10.1055/a-0826-4935
86. Bobes J, Fillat O, Arango C. Violence among schizophrenia out-patients compliant with medication: prevalence and associated factors. Acta Psychiatr Scand. (2009) 119:218–25. doi: 10.1111/j.1600-0447.2008.01302.x
87. Walsh E, Gilvarry C, Samele C, Harvey K, Manley C, Tattan T, et al. Predicting violence in schizophrenia: a prospective study. Schizophr Res. (2004) 67:247–52. doi: 10.1016/S0920-9964(03)00091-4
88. Kageyama M, Solomon P, Kita S, Nagata S, Yokoyama K, Nakamura Y, et al. Factors related to physical violence experienced by parents of persons with schizophrenia in Japan. Psychiatry Res. (2016) 243:439–45. doi: 10.1016/j.psychres.2016.06.036
89. Caqueo-Urízar A, Fond G, Urzúa A, Boyer L, Williams DR. Violent behavior and aggression in schizophrenia: Prevalence and risk factors. A multicentric study from three Latin-America countries. Schizophr Res. (2016) 178:23–8. doi: 10.1016/j.schres.2016.09.005
90. Fjellvang M, Grøning L, Haukvik UK. Imaging violence in schizophrenia: A systematic review and critical discussion of the MRI literature. Front Psychiatry. (2018) 9:333. doi: 10.3389/fpsyt.2018.00333
91. Siever LJ. Neurobiology of aggression and violence. Am J Psychiatry. (2008) 165:429–42. doi: 10.1176/appi.ajp.2008.07111774
Keywords: artificial intelligence, machine learning, schizophrenia, schizophrenia spectrum disorder, violent behavior
Citation: Parsaei M, Arvin A, Taebi M, Seyedmirzaei H, Cattarinussi G, Sambataro F, Pigoni A, Brambilla P and Delvecchio G (2024) Machine Learning for prediction of violent behaviors in schizophrenia spectrum disorders: a systematic review. Front. Psychiatry 15:1384828. doi: 10.3389/fpsyt.2024.1384828
Received: 10 February 2024; Accepted: 08 March 2024;
Published: 21 March 2024.
Edited by:
Massimo Tusconi, University of Cagliari, ItalyReviewed by:
Stefano Barlati, University of Brescia, ItalyArtemis Igoumenou, University College London, United Kingdom
Serdar M. Dursun, University of Alberta, Canada
Johannes Kirchebner, University of Zurich, Switzerland
Copyright © 2024 Parsaei, Arvin, Taebi, Seyedmirzaei, Cattarinussi, Sambataro, Pigoni, Brambilla and Delvecchio. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Giuseppe Delvecchio, Z2l1c2VwcGUuZGVsdmVjY2hpb0Bwb2xpY2xpbmljby5taS5pdA==
†These authors have contributed equally to this work and share first authorship