Machine Learning for prediction of violent behaviors in schizophrenia spectrum disorders: a systematic review

Parsaei, Mohammadamin; Arvin, Alireza; Taebi, Morvarid; Seyedmirzaei, Homa; Cattarinussi, Giulia; Sambataro, Fabio; Pigoni, Alessandro; Brambilla, Paolo; Delvecchio, Giuseppe

doi:10.3389/fpsyt.2024.1384828

SYSTEMATIC REVIEW article

Front. Psychiatry , 21 March 2024

Sec. Schizophrenia

Volume 15 - 2024 | https://doi.org/10.3389/fpsyt.2024.1384828

This article is part of the Research Topic Violence and Mental Health. Focus on Schizophrenia Spectrum and Psychotic Disorders View all 6 articles

Machine Learning for prediction of violent behaviors in schizophrenia spectrum disorders: a systematic review

Mohammadamin Parsaei^1†

Alireza Arvin^2†

Morvarid Taebi²

Homa Seyedmirzaei³

Giulia Cattarinussi^4,5,6

Fabio Sambataro^4,5

Alessandro Pigoni^7,8

Paolo Brambilla^7,8,9

Giuseppe Delvecchio^9*

¹Maternal, Fetal & Neonatal Research Center, Family Health Research Institute, Tehran University of Medical Sciences, Tehran, Iran
²Center for Orthopedic Trans-disciplinary Applied Research (COTAR), Tehran University of Medical Sciences, Tehran, Iran
³Sports Medicine Research Center, Neuroscience Institute, Tehran University of Medical Sciences, Tehran, Iran
⁴Department of Neuroscience (DNS), Padua Neuroscience Center, University of Padova, Padua, Italy
⁵Padua Neuroscience Center, University of Padova, Padua, Italy
⁶Department of Psychological Medicine, Institute of Psychiatry, Psychology and Neuroscience, Kings College London, London, United Kingdom
⁷Social and Affective Neuroscience Group, MoMiLab, Institutions, Markets, Technologies (IMT) School for Advanced Studies Lucca, Lucca, Italy
⁸Department of Pathophysiology and Transplantation, University of Milan, Milan, Italy
⁹Department of Neurosciences and Mental Health, Fondazione Istituto di Ricovero e Cura a Carattere Scientifico (IRCCS) Ca’ Granda Ospedale Maggiore Policlinico, Milan, Italy

Background: Schizophrenia spectrum disorders (SSD) can be associated with an increased risk of violent behavior (VB), which can harm patients, others, and properties. Prediction of VB could help reduce the SSD burden on patients and healthcare systems. Some recent studies have used machine learning (ML) algorithms to identify SSD patients at risk of VB. In this article, we aimed to review studies that used ML to predict VB in SSD patients and discuss the most successful ML methods and predictors of VB.

Methods: We performed a systematic search in PubMed, Web of Sciences, Embase, and PsycINFO on September 30, 2023, to identify studies on the application of ML in predicting VB in SSD patients.

Results: We included 18 studies with data from 11,733 patients diagnosed with SSD. Different ML models demonstrated mixed performance with an area under the receiver operating characteristic curve of 0.56-0.95 and an accuracy of 50.27-90.67% in predicting violence among SSD patients. Our comparative analysis demonstrated a superior performance for the gradient boosting model, compared to other ML models in predicting VB among SSD patients. Various sociodemographic, clinical, metabolic, and neuroimaging features were associated with VB, with age and olanzapine equivalent dose at the time of discharge being the most frequently identified factors.

Conclusion: ML models demonstrated varied VB prediction performance in SSD patients, with gradient boosting outperforming. Further research is warranted for clinical applications of ML methods in this field.

1 Introduction

Schizophrenia disorders are characterized by delusions, hallucinations, disordered thinking, disorganized behavior, and blunted or inappropriate affects (1, 2). The disorders profoundly impact an individual’s quality of life and can also pose a risk to others, especially when they lead to violent behaviors (VB) (3). People with schizophrenia are frequently stigmatized as having a higher potential for violence, resulting in discrimination (4). Moreover, recent research has shown that schizophrenia spectrum disorders (SSD) – including schizophrenia, schizoaffective disorder, and other delusional disorders – have been linked with an increased risk of VB in various studies conducted worldwide (5–8).

The definition of VB is diverse, but it generally encompasses any manifestation of verbal or physical aggression directed at objects, others, or oneself (9, 10). The impact of VB is widespread, affecting not only the patients themselves, who may lose property, relationships, and well-being, but also their caregivers, such as family, friends, or healthcare workers, who can be traumatized by the experience (11, 12). Additionally, VB can increase the burden on the healthcare system for patients with SSD (13). A recent systematic review and meta-analysis reported a prevalence of 17.19 - 23.83% for different types of VB other than homicide among SSD patients (5). Another systematic review and meta-analysis, which pooled data from 15 countries, reported an odds ratio of 4.5 for interpersonal VB among SSD individuals compared to a general population group without these disorders (7).

Given the significant impact that VB can have on patients and those in their environment, it is critical to accurately predict the risk of VB to help prevent these behaviors. To date, many studies have investigated the risk factors for VB in SSD patients, including sociodemographic factors, disease characteristics, and previous patients’ medical history (14–16). However, most of these studies could not predict the risk of VB accurately, due to the complex and multifactorial nature of violence occurrence (17).

Machine learning (ML) is a subset of artificial intelligence that uses algorithms to learn from data, identify patterns, and make predictions (18, 19). By analyzing large amounts of data, ML algorithms can identify complex relationships and hidden links behind phenomena that are not obvious to human observers (20). The key aspect of ML is its capability to build predictive models, demonstrated by its ability to anticipate clinical outcomes such as suicidal ideation, impulsivity, and VB (19, 21, 22). This attribute renders ML a promising instrument for unraveling the intricate interplay between schizophrenia and VB, thereby aiding healthcare providers in the early identification of individuals susceptible to VB (23, 24). This, in turn, holds the potential to optimize resource allocation, diminish lay times, and fortify the safety of both staff and patients (25). Ultimately, the trajectory of ML in healthcare portends the evolution of medical prediction tools, envisaging their integration into routine clinical practice to proactively avert instances of VB and alleviate the burden of schizophrenia within this context (26).

This systematic review aims to investigate the potential of ML in predicting VB in patients with SSD, which we believe will offer a better understanding of the potential of ML in this clinical context and will be of interest to researchers and healthcare providers seeking to use ML to identify patients at risk of VB. Our main objectives are: 1) to discuss the most robust algorithms used for the prediction of VB; 2) to assess the general accuracy that has been achieved in predicting VB using ML; and 3) to review the effective factors that have enhanced ML’s ability to predict VB.

2 Materials and methods

2.1 Search strategy

We performed a systematic search in PubMed, Web of Sciences, Embase, and PsycINFO for relevant studies published before September 30, 2023. The search keywords consisted of three groups of keywords related to (a) ML, (b) SSD, and (c) VB. In this systematic review, the PICO (Population, Intervention, Comparison, Outcome) framework was employed with the following criteria:

Population: Schizophrenia spectrum disorder (SSD), including schizophrenia, schizoaffective disorder, and other delusional disorders (27);

Intervention: Machine learning models (ML);

Comparison: Medical records of patients or clinical violence risk assessment scales;

Outcome: Violent behavior (VB), defining as an attempt or action to harm a target, assault, robbery, aggression toward property, actions resulting in physical injury, child abuse, sexual abuse, threatening or causing injury with a weapon, verbal aggression or threatening, and violent crimes, e.g., attempted or completed homicide (28, 29).

This study was conducted in concordance with the Preferred Reporting Items for Systematic Reviews and Meta-Analysis (PRISMA) (30) and Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies (CHARMS) guidelines (31).

2.2 Inclusion and exclusion criteria

All studies developing ML models for predicting VB in SSD patients were included. The development of a ML model in medicine includes the following stages: data acquisition, data preparation, ML model development, model evaluation, hyperparameter tuning, and model validation (32). We aimed to review the articles that developed and evaluated ML models for the prediction of VB in SSD patients. Hence, studies that only employed statistical models by using an ML subset (e.g., logistic regression) and did not either evaluate or validate the performance of their generated model were not included in this review. The exclusion criteria consisted of 1) Records that did not study patients with an SSD diagnosis, 2) records that did not predict VB, 3) records that did not employ an ML method, 4) records that were not available in the English language full-text, 5) editorials, commentaries, letters, conference abstracts, books, and review articles, and 6) animal studies.

2.3 Study selection

The selection process began with removing the duplicated records. Then, two authors (MP and AA) independently reviewed the article titles and abstracts and selected the relevant papers for the full-text screening process. The same authors (MP and AA) independently conducted the full-text screening of the selected records for eligibility. Any discrepancies were settled by discussion and, if necessary, referred to a third author (GC).

2.4 Data extraction

Two authors (AA and MT) conducted the data extraction. We collected data about the authors, year of publication, sample size, characteristics of the patients, ML model and validation techniques, input variables (i.e., demographics), output variables (VB), additional assessments, and key findings from every included record. Also, reported measures of the area under the receiver operating characteristic curves (AUROC), balanced accuracy, predictive power, P-value, sensitivity, specificity, positive predictive values (PPV), and negative predictive values (NPV) were collected. Cross-validation performance was defined as a training dataset because it involved data “seen” by the machine, whereas “unseen” data from a held-out test set or external cohort was treated as validation.

2.5 Data synthesis

To bypass the limitations of meta-analyzing heterogeneous datasets, one author (MT) implemented a novel comparative approach, ranking each ML model’s performance within individual studies and then averaging ranks across studies to identify the best overall performing ML model.

2.6 Risk of bias assessment

To assess the risk of bias (ROB), we employed the Prediction Model Risk of Bias Assessment Tool (PROBAST) (33). It is a tool for assessing ROB and the applicability of diagnostic and prognostic prediction model studies. PROBAST evaluates 4 domains of participants, predictors, outcome, and analysis in the study by 20 signaling questions. signaling questions of the PROBAST checklist and its guidance notes for rating ROB and applicability are fully provided in PROBAST checklist section of the Supplementary Material. These questions facilitate structured judgment of ROB in the studies of predictive models. We used the explanation and elaboration document that describes the rationale for including each domain and signaling question and guides researchers to use them to assess the ROB and applicability concerns. Also, to assess the ROB in the studies that employed more than one ML model, we selected the ML model with the best performance (best AUROC or accuracy).

3 Results

3.1 Study selection

The search strategy employed in this systematic review yielded 3941 articles. Following the removal of duplicates, 2142 articles remained for further assessment. After assessing the abstracts, 250 articles were deemed suitable for full-text screening. A total of 18 articles satisfied the eligibility criteria and were included in the final analysis (Figure 1). Table 1 shows the characteristics and extracted data of the included articles.

Figure 1

Figure 1 Study selection process flow diagram.

Table 1

Table 1 Characteristics of the included studies.

3.2 Study characteristics

3.2.1 General features

The 18 included studies were conducted in Switzerland (n=8), China (n=8), and Canada (n=2). A total of 11,733 patients diagnosed with SSD were systematically reviewed in the present study, with diagnostic criteria including Diagnostic and Statistical Manual of Mental Disorders (DSM)-III, IV, and V, International Classification of Diseases (ICD)-9 and 10. Of the patients, 7,330 (62.47%) were male, and 4,403 (37.53%) were female. Three studies included exclusively male participants (38, 43, 44). Except for one study that recruited outpatients (34), all other studies recruited participants from inpatient settings. Among these studies, four employed ML models to predict VB during the current admission (35, 41, 47, 51). Additionally, nine studies categorized patients based on the occurrence of VB prior to their current admission (38–40, 43, 44, 46, 48–50), while another four classified patients into violent and non-violent groups by retrospectively reviewing their medical records since their disease onset (36, 37, 42, 45). Moreover, eight studies were part of a larger project investigating the relationship between SSD and offending and used the same dataset of offender patients as their sample population (39, 41, 42, 45, 46, 48–50).

3.2.2 Input measures

Most of the included studies utilized only sociodemographic and clinical features of patients to predict VB. Of these studies, five evaluated a large number of features (over 100 features) as predictors (39, 41, 45, 49, 50). Tzeng et al. (2004) explored the role of schizophrenia patients’ insight about their disease as a variable in addition to the sociodemographic features to predict the occurrence of VB (34). Additionally, Sun et al. (2021) explored the correlation between different psychotic symptoms and violence among schizophrenia patients (40). Likewise, Kirchebner et al. (2022) analyzed the role of accumulation and types of stressors in the patient’s history in increasing the severity of an offense (42). Furthermore, Machetanz et al. (2022, 2023) in two separate studies evaluated the differences between offender and non-offender SSD patients regarding psychiatric prescription patterns and illness-related factors (46, 49). Also, ten studies analyzed the relationship between different rating tools scores and VB in patients with SSD (36, 38, 39, 41, 43, 45, 46, 48–50), including the Brief Psychiatric Rating Scale (BPRS) (38, 43, 52), the Psychopathy Checklist: Screening Version (PCL-SV), the Historical, Clinical and Risk management (HCR-20) scale (38, 53), The Barratt Impulsiveness Scale version 11 (BIS-11) (38, 54), the Positive And Negative Symptom Scale (PANSS) (36, 39, 41, 43, 45, 46, 48–50, 55), the Social Disability Screening Schedule (SDSS) (43), Insight and Treatment Attitude Questionnaire (ITAQ) (47, 56), Family Adaptation, Partnership, Growth, Affection and Resolve (APGAR) (47, 57), Social Support Rating Scale (SSRS) (47, 58), and Family Burden Scale of Disease (FBS) (47, 59). Furthermore, two studies evaluated neuroimaging data of patients as VB predictors, along with sociodemographic features. Specifically, Gou et al. (2021) attempted to combine three modalities of neuroimaging data – T1-weighted magnetic resonance imaging (MRI), functional magnetic resonance imaging (fMRI), and diffusion tensor imaging (DTI) – with patients’ clinical features to improve the prediction power of the ML model (38). Similarly, Yu et al. (2022) assessed the effects of structural MRI (sMRI) features such as gray matter volume (GMV), cortical surface area, and cortical thickness in differentiating between violent and non-violent schizophrenia patients (44).

Moreover, two other studies examined the role of biochemical markers in indicating VB. Chen et al. (2015) examined the relationship between the violence trajectories, baseline clinical features, and lipid levels to develop a model to predict more violent trajectories (35), while Chen et al. (2020) tried to identify the metabolic characteristic of violent schizophrenia patients, including amino acids, lipids, and carbohydrates metabolism, by performing untargeted metabolomics and analyzing their plasma metabolites (36).

3.2.3 Output measures

The definition of VB varied significantly across studies due to the use of different criteria, scales, or aims. While some studies defined verbal aggression as VB, others only included physical aggression, and some differentiated offenses based on their severity. Four studies utilized the Modified Overt Aggression Scale (MOAS) (60) criteria, but with different thresholds (37, 38, 44, 47): Wang et al. (2020) considered the outcome as physical aggression, irrespective of the aim or the outcome of VB (37), Gou et al. (2021) considered it as physical aggression aimed at others and leading to injury (38), and finally Yu et al. (2022) and Cheng et al. (2023) defined VB as a minimum MOAS score of 5 or 4 respectively, which could be achieved by various VBs without restricting the type or the target of it (44, 47). Additionally, four studies employed different scales for the VB definition: Tzeng et al. (34) used the Violence and Suicide Assessment (VAS-A) (61), Chen et al. (35) utilized the Violence Scale (28), Chen et al. (36) employed the MacArthur Violence Risk Assessment Study (MVRAS) (62), and Watts et al. (51) used the Aggressive Incidents Scale (AIS) (63). Meanwhile, three other studies simply defined VB without the use of any scale: Sun et al. (2021) and You et al. (2022) focused on physical VB aimed at others (40, 43), while Hoffman et al. (2022) included physical VB regardless of the aim (41). On the other hand, six studies used a shared database to distinguish between violent and non-violent offenses (39, 42, 46, 48–50). In a seventh study, they attempted to predict the risk of homicide among other offenses (45).

3.3 Machine learning

3.3.1 Overview of algorithms

None of the 18 studies utilized unsupervised learning (clustering), which is consistent with the nature of the subject – since the classes and the target of classification is given (64). Instead, all of them used supervised learning (classification or regression), with three studies (43, 44, 47) incorporating deep learning through the neural network (NNET) or multi-layer perceptron (MLP) model. Among the top classification methods of supervised learning, support vector machine (SVM) was utilized in fifteen studies, decision trees (including random forests (RF) in fifteen, and k-nearest neighbor (KNN) in eleven. For the top regression methods of supervised learning, logistic regression (LR) (including stepwise LR) was utilized by twelve studies, while least absolute shrinkage and selection operator (LASSO) was used by five. While thirteen studies compared different ML models’ functions in violence prediction, others focused on developing a single prediction model (34–36, 38, 40). See Supplementary Material for detailed information regarding the model development and validation across the reviewed studies.

3.3.2 Model development

In most of the studies, some details were unclear about model development, with few providing information about hyperparameter tuning, an essential part of model development. Hyperparameters are parameters set before the training process begins and affect how the model learns from and generalizes the data (65). Tuning hyperparameters can significantly impact model performance and determine the complexity/flexibility of the model (65). Among the eighteen studies, four provided some explanation about the hyperparameter tuning (34, 35, 38, 47), two used default settings without optimization (41, 45), and the other twelve studies did not mention anything about hyperparameter optimization.

One study did not develop a prediction model but sought to find the best predictors of violence in SSD by using SVM and LR separately (36). Then they identified overlapping best predictors among metabolic biomarkers. By using two different models separately, they aimed to minimize overfitting – a common bias where models fit too closely to the training data, producing good predictions for data points in the training set but do not generalize well to new data, performing poorly on new samples (65) – as it is unlikely for two different algorithms to overfit the same way.

The remaining studies developed and assessed models for violence prediction in SSD. They employed feature selection or cross-validation to overcome overfitting bias and achieve more accurate model development. Seven studies employed data-driven feature selection by ML before model training to control overfitting: one utilized LASSO (38), three used RF (39, 41, 45), one applied boosted tree (42), one utilized both LASSO and LR (43), and one selected features after calculation of variable importance for each employed model separately (51). Sixteen studies used cross-validation, with two using 10-fold cross-validation (43, 44), one using 7-fold (36), nine using 5-fold (37, 39, 41, 42, 45, 46, 48–50), one using 4-fold (47), and one using 3-fold (34). Two studies did not use cross-validation (35, 40).

Furthermore, only sixteen studies acknowledged the implementation of imputation methods on their respective training set data (39, 41, 45, 46, 49–51). Imputation methods refers to techniques for estimating or imputing missing values within datasets to enhance overall completeness and analytical suitability (66). Notably, 5 studies opted for a common practice wherein missing continuous values were imputed with the mean observed values pertaining to the respective variable, while categorical variables underwent replacement with the mode of observed values (39, 41, 45, 46, 49, 50). However, one study imputed missing continuous variables with either the observed mean or median values, concurrently addressing missing categorical variables based on the mode of observed values (51).

The choice of ML models is often influenced by the type of data being used. According to a survey (67), deep learning models, such as NNET and MLP, are commonly employed for interpreting imagery data. Among the studies we reviewed, two specifically utilized brain imaging data to train ML models: In one study, LASSO was employed for image interpretation, while SVM was used for integrating image and clinical data and making final predictions (38). In the other study, seven models, including NNET, were compared to assess their performance (44).

3.3.3 Model validation

Regarding model validation and generalization assessment, six studies reported results on the training set (34–38, 42), while the rest of the studies performed internal validation by evaluating unseen portions of their training set. However, none of the studies conducted external validation using an independent and unseen set of data. This further implies that the prediction accuracy reported in these studies was based on a retrospective estimate rather than a prospective prediction and none of the studies tested their algorithms’ accuracy on future cases.

3.3.4 Models results

Primary outcome measures for evaluating model performance included area under the receiver operating characteristic curve (AUROC), accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV), with AUROC and accuracy being the most frequently used performance metrics. Regarding each metric, the ranges, the proportion of studies reaching values ≥75%, and the best-performing study were as the following: AUROC (0.56 – 0.95; 15/17 studies reached ≥75% (38), reached the best value), sensitivity (8.33 – 95.23%, 11/14 studies reached ≥75% (38, 51), reached the best value), specificity (24.38 – 98.39%, 12/14 studies reached ≥75% (43), reached the best value), accuracy (50.27 – 90.67%, 12/15 studies reached ≥75% (38), reached the best value), PPV (14.58 – 94.45%, 6/10 studies reached ≥75% (45), reached the best value), NPV (20.48 – 99.34%, 8/10 studies reached ≥75% (39, 51), reached the best value). Additionally, twelve studies achieved values above 75% for both AUROC and accuracy.

3.3.5 Models comparison

Running a meta-analysis on diverse studies with varying datasets, features, and variable distributions was impossible; Therefore, we adopted a particular approach to overcome the challenge of integrating and comparing the results of these studies. We specifically targeted studies that were designed to compare different models, as they offered valuable insights for our analysis. By extracting the rankings of different models, we could assess their relative performance, independent of the specific magnitude of each function indicator. This allowed us to overcome the limitations associated with diverse study designs and datasets, enabling a more meaningful comparison (Table 2).

Table 2

Table 2 Performance ranks of each machine learning model across the different studies.

As mentioned earlier, thirteen studies were designed to compare different models (37, 39, 41–51). However, two of these studies utilized imaging data (38, 44), which differed from the data used in the other studies. Since each ML model typically performs well with specific types of data (65), combining the results of these two studies with the others was not appropriate. Therefore, we excluded these studies from the analysis to maintain standardization across the dataset, which left us with eleven studies.

The performance rank of each model across the different studies was aggregated to generate a final rank. This approach allowed us to understand the average success rate of each model. To enhance the interpretability of the results, we took two steps. Firstly, we excluded models used in less than half of the eleven studies. Secondly, we standardized the ranks so that they fell within a range of 0 to 7. (For the studies that compared N models, all rankings were multiplied by 7/N.) By doing so, we ensured that the final ranks accurately reflected the relative performance of each model. A lower final rank indicated a better average performance across the studies.

Finally, in terms of both accuracy and AUROC, the gradient boosting (GB) model consistently achieved the highest performance rank among the six models with a substantial margin compared to the next highest-ranked model. However, given that meta-analysis was not possible, it is not feasible to assess whether this margin was significant or not. This suggests that the GB model shows promising performance in predicting violence among SSD patients using clinical data.

3.4 Discriminative features

Various features were identified in the included studies as the predictor variables of VB in SSD patients. We can classify most of them into sociodemographic, clinical, metabolic, and neuroimaging groups. Most of the features were consistent in multiple studies, except for some discrepancies, which will be elaborated upon.

3.4.1 Sociodemographic features

Some studies identified age (34, 37, 43, 45, 47), gender (34), and educational level (38, 43) as factors that contribute to the prediction of VB. However, other studies reported that these factors do not have a significant relationship with the occurrence of such behavior (35, 37, 44).

3.4.2 Clinical features

Psychotic symptoms are associated with VB in SSD patients. Different studies consistently demonstrate that negative symptoms, such as flat affect and poverty of thought, decrease the risk of VB (35, 40). However, there is inconsistency in the results concerning the impact of positive symptoms on the occurrence of VB. Some studies suggest an increased risk of VB associated with positive symptoms (35, 43), while others propose a diminishing impact of specific positive symptoms, including delusion of persecution and auditory hallucination, on VB occurrence (40). Furthermore, various studies reported that daily dosage of prescribed olanzapine-equivalent at the time of discharge from previous psychiatric hospitalization of SSD patients can predict the occurrence of VB among them (39, 45, 46, 48, 49). However, their results were divergent, with four studies demonstrating a positive association between the olanzapine-equivalent dosage and risk of VB (39, 46, 48, 49), and one study reporting a negative association (45).

Patients’ past stresses also can contribute to VB. Patients who have experienced a higher number of past stressors had an increased risk of engaging in VB (42, 51). Consistently, history of previous outpatient psychiatric treatment was found to be associated with an increased risk of VB in patients (46, 48–50). In addition, specific stressors, including a history of coercive psychiatric treatment and separation from main caregivers in childhood or adolescence, have also been found to be related to VB (42). There is a lack of consensus on the relationship between patients’ employment status and VB. While Kirchebner et al. (2022) found a significant correlation between unemployment and VB (42), Chen et al. (2015) and Wang et al. (2020) reported no statistical relevance between a patient’s employment status and the likelihood of VB (35, 37).

Additionally, scores of several rating tools are significantly associated with VB. The BPRS total score, BPRS hostility score, BPRS withdrawal factors score (38), ITAQ score, family APGAR score, SSRS score, and FBS score (47) were all found to correlate with the risk of VB. Moreover, the PANSS total score at admission and discharge (39, 45), and PANSS anxiety and lack of spontaneity scores (50) are significantly related to VB. Other statistically relevant clinical features are presented in Table 1.

3.4.3 Neuroimaging features

Two studies explored potential neuroimaging features for predicting VB. Gou et al. (2021) identified brain features associated with regional homogeneity (ReHo), gray matter volume (GMV), and fractional anisotropy (FA) as effective predictors of VB in schizophrenia patients (38). Significant GMV alterations were observed in the striatum system (including the putamen and pallidum), median cingulate, and paracingulate gyri, as well as temporal, occipital, and anterior parts of the parietal lobe. In addition, ReHo was most predictive in the anterior cingulate, dorsolateral part of the superior frontal gyrus, temporal pole, parietal lobe, and subcortical areas of the striatum, such as the caudate and pallidum. Also, the left superior longitudinal fasciculus was found to play a crucial role in FA predictions. Overall, the study identified the cingulate gyrus, dorsolateral part of superior frontal gyrus, temporal lobe (inferior temporal gyrus and temporal pole), supplementary motor area, and pallidum as the key regions for predicting VB in schizophrenia patients using sMRI and fMRI (38). On the other hand, Yu et al. (2022) found that the measurement of whole-brain GMV, right areas of superior temporal sulcus cortical thickness, right inferior parietal cortical thickness, and left frontal pole GMV correlated to the likelihood of violent tendencies (44).

3.4.4 Metabolic features

Three plasma metabolites were recognized as potentially effective biomarkers for predicting VB. In the study by Chen et al. (2022) the ratio of L-asparagine to L-aspartic acid, vanillylmandelic acid, and glutaric acid was found to be associated with an increased likelihood of VB (36). Specifically, a decrease in the ratio of L-asparagine to L-aspartic acid and glutaric acid level and an increase in the vanillylmandelic acid level appear to be correlated with violent tendencies. Furthermore, altered specific metabolic pathways seemed to predispose individuals toward violence. Specifically, the glycerolipid metabolism pathway, characterized by an up-regulation of glycerol and a down-regulation of glycerol-3-phosphate, and the phenylalanine, tyrosine, and tryptophan biosynthesis pathway, marked by a down-regulation of 4-hydroxyphenylpyruvic, have been associated with violent tendencies (36). Moreover, it has been demonstrated that raised triglyceride levels were associated with a reduced likelihood of engaging in VB (35).

3.5 Risk of bias assessment

Based on the results of our ROB assessment using the PROBAST guidelines, all studies except for two (43, 47), had some bias due to a small sample size, different violence definitions, and the inability to satisfy the study’s purpose. Although most of the studies had high ROB, the most important limitation arises from their limited sample sizes. According to the PROBAST guidelines, to achieve a low ROB in the analysis domain, the number of participants with the outcome relative to the number of the input variables should be equal to or higher than 20 (33). Only four reviewed studies had low ROB in the analysis domain (41, 42, 44, 47). Another reason for high ROB arises from the divergent definitions of violence and the use of different scales across the studies. We defined VB as an attempt or action to harm a target, assault, child or sexual abuse, and violent crimes. Whereas, Hofmann et al. (2022) included verbal aggression in the definition of VB (41) and four studies evaluated the ability of ML models to classify patients with previous criminal offenses (including VB) from non-offenders (46, 48–50). Also, some studies have evaluated the power of ML models in predicting VB (e.g., homicide) among offenders with SSD disorders (39, 42, 45). Although many studies represented high ROB in at least one field according to the PROBAST guideline (33), most of them (11/18) showed low concerns regarding applicability in the field of violence prediction in patients with SSD. Table 3 and Figure 2 illustrate the results of the quality assessment process.

Table 3

Table 3 Findings of the ROB assessment based on the PROBAST statement.

Figure 2

Figure 2 Assessment of Risk of Bias based on PROBAST.

4 Discussion

4.1 Key findings

Previous research has shown an acceptable power for ML models in predicting VB in populations broader than SSD patients (68, 69). In this article, we reviewed the role of ML in predicting VB in patients with SSD. According to our findings, the predictive performances of the ML models varied across the reviewed papers. However, ML models performed better in studies that employed more intricate methodologies for model development and evaluation. These findings suggest that a well-designed ML model could be a potential tool for VB prediction in SSD patients, and could be beneficial in warning the caregivers to seek prevention techniques and stop them from further harmful acts in clinical and forensic settings. Among the reviewed ML models, GB showed the best performance in VB detection. Also, we reviewed the most discriminating features in violence prediction of SSD patients. Age (34, 37, 43, 45, 47) and olanzapine equivalent dose at the time of discharge (39, 45, 46, 48, 49) were the most repetitive variables found to be associated with violence across the studies.

4.2 Machine learning models

While direct comparison of results among studies was challenging due to the differences in sample characteristics, some insights were obtained. First, about two third of the studies (11/18) could reach values above 75% for both AUROC and accuracy, indicating that ML can be a promising tool for the accurate prediction of VB among SSD patients. Second, the studies demonstrated diverse performance in predicting VB among SSD patients, with an AUROC ranging from 0.56 to 0.95 and an accuracy range of 50.27% to 90.67%. However, the performance ranges within each study were narrower when comparing different ML models. Considering that many studies employed similar ML models and input variables, the observed diversity in performance appears to be partly influenced by the variations in study designs. This suggests that future similar studies could enhance their results not only by focusing on ML model selection or input variable choices but also by paying attention to the details of model development to mitigate biases.

In addition, there exists considerable divergence among the reviewed studies with regard to the methodologies employed for both feature selection and cross-validation. These two components play pivotal roles in the trajectory of ML model development, serving to mitigate overfitting and augment overall model performance (70). Within the included studies, a mere seven undertook data-driven feature selection utilizing ML techniques prior to model training as a preemptive measure against overfitting (38, 39, 41–43, 45). Notably, one study adopted a post hoc approach, selecting features subsequent to the computation of variable importance for each employed model independently (51). Additionally, sixteen studies embraced diverse methods for cross-validation, while two studies opted to forgo its application (35, 40). This heterogeneity in model development practices across the reviewed studies poses a significant obstacle to synthesizing their respective findings.

Therefore, our significant challenge was comparing ML models by integrating the results of different studies due to variations in sample characteristics, including differences in input and output variables distribution. To address this challenge, we devised a ranking method that enabled us to assess the overall success rate of commonly used methods. Based on our findings, the GB model exhibited notably superior average performance. However, it is essential to note that this does not necessarily imply inherent weakness in the other models. Instead, it highlights the favorable results achieved by the GB model in the specific context of the studied field.

GB is a subset of ensemble machine learning models, which also includes common models like classification trees and RF (32). This approach enables the effective handling of big data and also the handling of missing values in the predictors (32). While common ensemble techniques like RF rely on straightforward averaging of models within the ensemble, GB stands out for its step-by-step, sequential strategy for selecting the best predictor (71). This notable flexibility empowers GB to be highly adaptable to specific data-driven tasks (71). Due to its unique characteristics, GB outperformed other ML models in predicting VB in SSD patients, particularly due to its effective handling of a large number of predictors. Nevertheless, it is noteworthy that several other ML models, including SVM, LASSO, NNET, RF, decision trees, PDA, MLP, elastic net, and LR, in the studies reviewed, also achieved AUROC values exceeding 0.9 (41, 44, 47, 51). This highlights the substantial predictive potential of these alternative ML models in addition to GB.

4.3 Discriminative features

Notably, various studies have explored the influence of age on VB risk, yielding diverse findings. For instance, Tzeng et al. (2004) associated younger age with a higher risk of VB (34). In contrast, four other studies observed that older age correlated with an increased tendency for VB in SSD patients (37, 44, 45, 47). Yet, Chen et al. (2015) found no significant correlation between patients’ age and VB risk. While the majority of previous research aligns with Chen et al. (2015) and negates the association between age and VB risk (35, 72), there are outliers such as Soyka et al. (2007) who identified older ages as linked to a higher VB risk in SSD patients (73). This variability underscores the need for further research to ascertain the precise impact of age on VB occurrence in SSD patients.

Contrary to age, the reported gender effect on VB occurrence risk was quite consistent among studies, which showed that male sex was associated with a higher risk of VB (34). These findings confirmed most of the previous studies that reported a higher prevalence of VB among male SSD patients (73, 74), as the general population (17). Furthermore, Gou et al. (2021) and Yu et al. (2022), but not Chen et al. (2015) and Yu et al. (2022a), found that lower educational levels could predict VB occurrence (38, 43). This was in line with previous research that found lower educational levels to be significant predictors of VB among SSD patients (75, 76) and the general population (17). These disparities suggest that further studies on larger populations are required to determine the exact effect of the educational level of SSD patients on their VB tendency.

Moreover, in terms of the effect of occupational status on VB tendency, Kirchebner et al. (2022) reported a significant relationship between unemployment and VB in SSD patients (42), which confirmed the findings by Karabekiroğlu et al. (2016). Conversely, Chen et al. (2015) and Wang et al. (2020) found no correlation between employment status and VB (35, 37). This divergence could be a result of different definitions of violence in these studies; indeed, Kirchebner et al. (2022) and Karabekiroğlu et al. (2016) studies, unemployment was able to differentiate SSD patients with serious VB (e.g., homicide) from patients with minor VB (e.g., property damage). On the other hand, two other studies trained ML models to differentiate SSD patients with any kind of VB (serious or minor) from patients without VB (35, 37). This suggests that unemployment does not seem to be associated with the overall risk of VB among SSD patients, but it increases the risk of serious VB among offenders.

Regarding the clinical features, two studies reported that the presence of positive symptoms (35, 43) was correlated with an increased risk of VB, which was consistent with previous research (72, 74, 77). However, another study suggested that the presence of specific positive symptoms, including delusion of persecution and auditory hallucination, decreases the risk of VB (40). This controversy indicates that different types of delusion may have varying effects on the occurrence of VB (40). Also, Sonnweber et al. (2021, 2022) found a favorable predictive power for the PANSS total score of the patients in two different studies (39, 45). This is in line with previous studies that demonstrated higher PANSS total scores in violent patients, compared to non-violent patients (78, 79). Moreover, consistent with previous research (80), Gou et al. (2021) found that the risk of VB occurrence is higher among SSD patients with higher scores in the BPRS hostility subscale. Furthermore, higher scores in BPRS total score, BPRS withdrawal factors, PCL-SV, HCR-20 (38), and SDSS (43) successfully predicted VB in SSD patients across the reviewed studies. While the BPRS and PANSS scales assess various domains of SSD, including positive and negative symptoms (55, 81), the PCL-SV scale is specifically designed to evaluate psychopathic traits in patients, which is not directly associated with SSD (82). This indicates that aside from psychotic symptoms, additional symptoms like patients’ personality profiles, including psychopathy and impulsivity, may have relevance in predicting VB among individuals with SSD. Altogether, these suggest that by training ML models with certified psychiatric rating tools, we can significantly improve the accuracy of predicting VB in SSD patients, which can be highly beneficial in clinical applications.

Chen et al. (2015) found negative symptoms to be correlated with a decreased risk of VB (35), which is in line with previous studies that found depressive and other negative symptoms to be associated with a lower occurrence of VB in SSD patients (73, 77). Furthermore, the effect of the age of disease onset was controversial across the reviewed studies. While Sonnweber et al. (2021, 2022) reported that younger age of disease onset correlated with the probability of VB, Chen et al. (2015) and Wang et al. (2020) did not find a significant relationship between the age of disease onset and VB occurrence. The findings of previous research in this field are also divergent. Indeed, while Caqueo-Urízar et al. (2016) found VB to be more prevalent among patients with younger age of illness onset, Nolan et al. (1999) did not find any significant differences between the age of onset of violent and non-violent patients. Therefore, further research is warranted to determine the effects of disease onset on the VB occurrence in SSD patients, as it can help the early detection and treatment of patients at higher risk of VB.

While most studies evaluating the prescribed daily olanzapine-equivalent dose at the time of discharge from previous hospitalizations have reported a positive association with the risk of VB (39, 46, 48, 49), there is an exception in one study that reported the opposite (45). The divergence in findings can be attributed to the different focus of the Sonnweber (2022) study, which specifically differentiated between homicide committers and patients who committed other types of VB (45). It is logical to assume that higher doses of antipsychotics are prescribed to patients with more enduring symptoms, as they are reported to be more prone to engaging in VB in some studies (83). However, some previous studies found no significant association between the disease severity or prescribed dosage of antipsychotics and the risk of VB in patients with SSD (84, 85). This highlights the need for further research to better comprehend the relationship between disease severity and prescribed antipsychotic dosages in the occurrence of VB among SSD patients.

Previous research has shown that SSD patients’ previous history of violence is significantly correlated with increased risks of VB, such as recent violence episodes (86), history of a recent assault (87), previous history of aggression (74), and a previous violent conviction (87). Consistently, Sonnweber et al. (2021) reported previous conviction history as a significant predictor of VB in SSD patients (39). Moreover, Wang et al. (2020) found that a history of more than five times of hospitalization increased the likelihood of VB tendency in patients. However, Tzeng et al. (2004) reported that the lifetime number of hospitalizations was not correlated with an increased risk of VB occurrence in SSD patients. This disparity could be due to the differences in the psychiatric history assessment across the studies, as Tzeng et al. (2004) evaluated a broader variable (lifetime hospitalization), while other studies assessed the recent hospitalization history (88, 89) or a more distinguishing variable (≥ 5-lifetime hospital admissions) (37).

Finally, two studies have observed that neuroimaging variables were robust predictors of VB in SSD patients. Yu et al. (2022) found decreased whole-brain gray matter volume, right inferior parietal thickness, and left frontal pole volume to be predictors of VB. Consistently, Gou et al. (2021) reported disruption in the structural and functional MRI of the temporal, frontal lobes, cingulate gyrus, and striatum can predict VB in SSD patients. Also, a systematic review of 21 studies, revealed that reduced volumes of the frontal lobe in patients with schizophrenia are associated with a higher rate of VB occurrence (90). This is not surprising, as previous research mentioned a prominent role for frontal and temporal lobes and cingulate gyrus disruptions in developing VB (91). Considering the role of the frontal cortex in controlling disinhibited behaviors (e.g., impulsiveness, aggressiveness, and violence), patients with disrupted frontal cortex are more likely to present VB (91, 92). Although previous research established the involvement of the hippocampus and amygdala in emotional processing and in the development of VB (91), the predictive value of these regions was not assessed across the reviewed studies. In conclusion, our knowledge in the field of ML-based prediction of VB in SSD patients by training MRI data is still limited, and future research is required to clarify its potential.

4.4 Limitations and further directions

This study has some limitations. First, the sample sizes of most studies were small, considering the number of input variables, which can influence their analysis results. Second, the study samples across the reviewed articles were heterogeneous, as most of them studied clinical inpatients, while some studied forensic inpatients, and one included only outpatients’ data. Also, some studies only included male patients. Third, the outcome definitions differed within studies. For example, while most of the studies classified SSD patients into violent and non-violent, some others distinguished patients with serious types of VB (e.g., homicide) from other types of VB. Fourth, the reviewed studies were conducted in countries with different healthcare systems, which could have a significant impact on violence among SSD patients. Fifth, most of the studies did not select time-dependent features for VB prediction, which substantially lowers the ML model performance. Finally, none of the reviewed articles performed external validation, which can significantly diminish the generalizability of their findings. Therefore, future research with more homogenous methodologies and both internal and external validations seems to be necessary.

4.5 Conclusions

The outcomes of the ML models employed by the reviewed studies have yielded compelling findings, highlighting the significance of continuing along this research trajectory for further exploration and advancement. More in detail, while the ML models’ performance in VB prediction among SSD patients was divergent, yet promising, our comparative analysis demonstrated that GB outperformed other ML models. Considering the heterogeneity of ML model applications and study populations across the reviewed articles, there is substantial potential for further research in this field. Furthermore, the absence of external validation in the majority of the included articles reduces the generalizability of their findings. Indeed, subsequent research endeavors, employing comparable models, outcomes, and predictors, in extensive clinical samples, are imperative to substantiate the certainty of the current findings and ascertain the applicability of the developed ML algorithms.

Moreover, given the rapidly growing trend in the application of various artificial intelligence tools in medical contexts, it appears likely that in the next years ML models can be also utilized for VB prediction in SSD patients. Indeed, while the performance of ML models varied across the reviewed studies; several models demonstrated excellent predictive abilities with an AUROC exceeding 0.9. This highlights the potential for developing reliable ML models through further well-designed studies. Upon validation through external assessments, these models could effectively predict VB in real-world clinical settings. Consequently, the development of clinical assessment tools integrating patient data could facilitate the early identification of individuals highly susceptible to VB, whether in outpatient or inpatient settings. The utilization of such tools enables timely preventive interventions, such as providing social support and rehabilitation, adjusting medications, and considering more intensive therapeutic approaches, like electroconvulsive therapy. Implementing these measures could significantly alleviate the burden of VB on patients, healthcare systems, and society at large.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Author contributions

MP: Conceptualization, Investigation, Project administration, Resources, Visualization, Writing – original draft. AA: Data curation, Investigation, Visualization, Writing – original draft. MT: Data curation, Formal Analysis, Writing – original draft. HS: Writing – original draft. GC: Funding acquisition, Supervision, Writing – review & editing. FS: Supervision, Writing – review & editing. AP: Data curation, Writing – review & editing. PB: Project administration, Supervision, Writing – review & editing. GD: Funding acquisition, Methodology, Supervision, Writing – review & editing.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. GC was supported by a grant from Cassa di Risparmio di Padova e Rovigo (CARIPARO). The study was partially supported by the Italian Ministry of Health (ricerca corrente 2023).

Conflict of interest

The authors declare the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyt.2024.1384828/full#supplementary-material

Glossary

www.frontiersin.org

References

1. Mancuso SG, Morgan VA, Mitchell PB, Berk M, Young A, Castle DJ. A comparison of schizophrenia, schizoaffective disorder, and bipolar disorder: Results from the Second Australian national psychosis survey. J Affect Disord. (2015) 172:30–7. doi: 10.1016/j.jad.2014.09.035

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Karbalaee M, Jameie M, Amanollahi M, TaghaviZanjani F, Parsaei M, Basti FA, et al. Efficacy and safety of adjunctive therapy with fingolimod in patients with schizophrenia: A randomized, double-blind, placebo-controlled clinical trial. Schizophr Res. (2023) 254:92–8. doi: 10.1016/j.schres.2023.02.020

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Millier A, Schmidt U, Angermeyer MC, Chauhan D, Murthy V, Toumi M, et al. Humanistic burden in schizophrenia: a literature review. J Psychiatr Res. (2014) 54:85–93. doi: 10.1016/j.jpsychires.2014.03.021

PubMed Abstract | CrossRef Full Text | Google Scholar

4. James A. Stigma of mental illness. Foreword. Lancet. (1998) 352:1048. doi: 10.1016/S0140-6736(98)00019-1

CrossRef Full Text | Google Scholar

5. Guo Y, Yang X, Wang D, Fan R, Liang Y, Wang R, et al. Prevalence of violence to others among individuals with schizophrenia in China: A systematic review and meta-analysis. Front Psychiatry. (2022) 13:939329. doi: 10.3389/fpsyt.2022.939329

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Simpson AI, Penney SR, Jones RM. Homicide associated with psychotic illness: What global temporal trends tell us about the association between mental illness and violence. Aust N Z J Psychiatry. (2022) 56:1384–8. doi: 10.1177/00048674211067164

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Whiting D, Gulati G, Geddes JR, Fazel S. Association of schizophrenia spectrum disorders and violence perpetration in adults and adolescents from 15 countries: A systematic review and meta-analysis. JAMA Psychiatry. (2022) 79:120–32. doi: 10.1001/jamapsychiatry.2021.3721

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Moghaddam HS, Parsaei M, Taghavizanjani F, Cattarinussi G, Aarabi MH, Sambataro F. White matter alterations in affective and non-affective early psychosis: A diffusion MRI study. J Affect Disord. (2024) 351:615–23. doi: 10.1016/j.jad.2024.01.238

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Monahan J, Vesselinov R, Robbins PC, Appelbaum PS. Violence to others, violent self-victimization, and violent victimization by others among persons with a mental illness. Psychiatr Serv. (2017) 68:516–9. doi: 10.1176/appi.ps.201600135

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Whiting D, Lichtenstein P, Fazel S. Violence and mental disorders: a structured review of associations by individual diagnoses, risk factors, and risk assessment. Lancet Psychiatry. (2021) 8:150–61. doi: 10.1016/S2215-0366(20)30262-5

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Kageyama M, Solomon P. Post-traumatic stress disorder in parents of patients with schizophrenia following familial violence. PloS One. (2018) 13:e0198164. doi: 10.1371/journal.pone.0198164

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Tasa-Vinyals E, Álvarez MJ, Puigoriol-Juvanteny E, Roura-Poch P, García-Eslava JS, Escoté-Llobet S. Intimate partner violence among patients diagnosed with severe mental disorder. J Nerv Ment Dis. (2020) 208:749–54. doi: 10.1097/NMD.0000000000001207

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Cloutier M, Aigbogun MS, Guerin A, Nitulescu R, Ramanakumar AV, Kamat SA, et al. The economic burden of schizophrenia in the United States in 2013. J Clin Psychiatry. (2016) 77:764–71. doi: 10.4088/JCP.15m10278

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Halmai T, Tényi T, Gonda X. Symptom profiles and parental bonding in homicidal versus non-violent male schizophrenia patients. Ideggyogy Sz. (2017) 70:43–52. doi: 10.18071/isz.70.0043

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Oakley C, Harris S, Fahy T, Murphy D, Picchioni M. Childhood adversity and conduct disorder: A developmental pathway to violence in schizophrenia. Schizophr Res. (2016) 172:54–9. doi: 10.1016/j.schres.2016.01.047

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Witt K, van Dorn R, Fazel S. Risk factors for violence in psychosis: systematic review and meta-regression analysis of 110 studies. PloS One. (2013) 8:e55942. doi: 10.1371/journal.pone.0055942

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Coid JW, Ullrich S, Kallis C, Freestone M, Gonzalez R, Bui L, et al. Improving risk management for violence in mental health services: a multimethods approach. Programme Grants Appl Res. 2016 4(16). doi: 10.3310/pgfar04160

CrossRef Full Text | Google Scholar

18. Iniesta R, Stahl D, McGuffin P. Machine learning, statistical learning and the future of biological research in psychiatry. Psychol Med. (2016) 46:2455–65. doi: 10.1017/S0033291716001367

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Parsaei M, Taghavizanjani F, Cattarinussi G, Moghaddam HS, Di Camillo F, Akhondzadeh S, et al. Classification of suicidality by training supervised machine learning models with brain MRI findings: A systematic review. J Affect Disord. (2023) 340:766–91. doi: 10.1016/j.jad.2023.08.034

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Günther MP, Kirchebner J, Lau S. Identifying direct coercion in a high risk subgroup of offender patients with schizophrenia via machine learning algorithms. Front Psychiatry. (2020) 11:415. doi: 10.3389/fpsyt.2020.00415

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Jiménez S, Angeles-Valdez D, Rodríguez-Delgado A, Fresán A, Miranda E, Alcalá-Lozano R, et al. Machine learning detects predictors of symptom severity and impulsivity after dialectical behavior therapy skills training group in borderline personality disorder. J Psychiatr Res. (2022) 151:42–9. doi: 10.1016/j.jpsychires.2022.03.063

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Verrey J, Ariel B, Harinam V, Dillon L. Using machine learning to forecast domestic homicide via police data and super learning. Sci Rep. (2023) 13:22932. doi: 10.1038/s41598-023-50274-2

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Vijeikis R, Raudonis V, Dervinis G. Efficient violence detection in surveillance. Sensors. (2022) 22:2216. doi: 10.3390/s22062216

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Bakhshi A, García-Gómez J, Gil-Pita R, Chalup S. Violence detection in real-life audio signals using lightweight deep neural networks. Proc Comput Science. (2023) 222:244–51. doi: 10.1016/j.procs.2023.08.162

CrossRef Full Text | Google Scholar

25. Gould C, Mufamadi D. Costs and benefits of preventing violence. ISS South Afr Rep. (2021) 2021:1–14.

Google Scholar

26. Verma AA, Murray J, Greiner R, Cohen JP, Shojania KG, Ghassemi M, et al. Implementing machine learning in medicine. Cmaj. (2021) 193:E1351–e7. doi: 10.1503/cmaj.202434

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Wright M. Schizophrenia and schizophrenia spectrum disorders. JAAPA. (2020) 33:46–7. doi: 10.1097/01.JAA.0000662412.51169.bf

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Morrison EF. The measurement of aggression and violence in hospitalized psychiatric patients. Int J Nurs Stud. (1993) 30:51–64. doi: 10.1016/0020-7489(93)90092-9

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Monahan J, Steadman H, Silver E, Appelbaum P, Robbins P, Mulvey E, et al. Rethinking risk assessment: the macArthur study of mental disorder and violence. Thomas Grisso. (2002) 1:147–52. doi: 10.1093/oso/9780195138825.001.0001

CrossRef Full Text | Google Scholar

30. Stewart LA, Clarke M, Rovers M, Riley RD, Simmonds M, Stewart G, et al. Preferred Reporting Items for Systematic Review and Meta-Analyses of individual participant data: the PRISMA-IPD Statement. Jama. (2015) 313:1657–65. doi: 10.1001/jama.2015.3656

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Moons KG, de Groot JA, Bouwmeester W, Vergouwe Y, Mallett S, Altman DG, et al. Critical appraisal and data extraction for systematic reviews of prediction modelling studies: the CHARMS checklist. PloS Med. (2014) 11:e1001744. doi: 10.1371/journal.pmed.1001744

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Arbet J, Brokamp C, Meinzen-Derr J, Trinkley KE, Spratt HM. Lessons and tips for designing a machine learning study using EHR data. J Clin Transl Sci. (2020) 5:e21. doi: 10.1017/cts.2020.513

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Wolff RF, Moons KGM, Riley RD, Whiting PF, Westwood M, Collins GS, et al. PROBAST: A tool to assess the risk of bias and applicability of prediction model studies. Ann Intern Med. (2019) 170:51–8. doi: 10.7326/M18-1376

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Tzeng H-M, Lin Y-L, Hsieh J-G. Forecasting violent behaviors for schizophrenic outpatients using their disease insights: development of a binary logistic regression model and a support vector model. Int J Ment Health. (2004) 33:17–31. doi: 10.1080/00207411.2004.11043366

CrossRef Full Text | Google Scholar

35. Chen SC, Chu NH, Hwu HG, Chen WJ. Trajectory classes of violent behavior and their relationship to lipid levels in schizophrenia inpatients. J Psychiatr Res. (2015) 66-67:105–11. doi: 10.1016/j.jpsychires.2015.04.022

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Chen X, Xu J, Tang J, Dai X, Huang H, Cao R, et al. Dysregulation of amino acids and lipids metabolism in schizophrenia with violence. BMC Psychiatry. (2020) 20:97. doi: 10.1186/s12888-020-02499-y

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Wang KZ, Bani-Fatemi A, Adanty C, Harripaul R, Griffiths J, Kolla N, et al. Prediction of physical violence in schizophrenia with machine learning algorithms. Psychiatry Res. (2020) 289:112960. doi: 10.1016/j.psychres.2020.112960

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Gou N, Xiang Y, Zhou J, Zhang S, Zhong S, Lu J, et al. Identification of violent patients with schizophrenia using a hybrid machine learning approach at the individual level. Psychiatry Res. (2021) 306:114294. doi: 10.1016/j.psychres.2021.114294

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Sonnweber M, Lau S, Kirchebner J. Violent and non-violent offending in patients with schizophrenia: Exploring influences and differences via machine learning. Compr Psychiatry. (2021) 107:152238. doi: 10.1016/j.comppsych.2021.152238

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Sun L, Han X, Wang K, Xu C, Song Z, Zhang J, et al. Candidate symptomatic markers for predicting violence in schizophrenia: A cross-sectional study of 7711 patients in a Chinese population. Asian J Psychiatr. (2021) 59:102645. doi: 10.1016/j.ajp.2021.102645

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Hofmann LA, Lau S, Kirchebner J. Advantages of machine learning in forensic psychiatric research-uncovering the complexities of aggressive behavior in schizophrenia. Appl Sciences-Basel. (2022) 12:819. doi: 10.3390/app12020819

CrossRef Full Text | Google Scholar

42. Kirchebner J, Sonnweber M, Nater UM, Günther M, Lau S. Stress, schizophrenia, and violence: A machine learning approach. J Interpers Violence. (2022) 37:602–22. doi: 10.1177/0886260520913641

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Yu T, Zhang X, Liu X, Xu C, Deng C. The prediction and influential factors of violence in male schizophrenia patients with machine learning algorithms. Front Psychiatry. (2022) 13:799899. doi: 10.3389/fpsyt.2022.799899

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Yu T, Pei W, Xu C, Zhang X, Deng C. Prediction of violence in male schizophrenia using sMRI, based on machine learning algorithms. BMC Psychiatry. (2022) 22:676. doi: 10.1186/s12888-022-04331-1

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Sonnweber M, Lau S, Kirchebner J. Exploring characteristics of homicide offenders with schizophrenia spectrum disorders via machine learning. Int J Offender Ther Comp Criminol. (2022), 306624x221102799. doi: 10.21428/cb6ab371

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Machetanz L, Günther MP, Lau S, Kirchebner J. High risk, high dose?-pharmacotherapeutic prescription patterns of offender and non-offender patients with schizophrenia spectrum disorder. Biomedicines. (2022) 10:3243. doi: 10.3390/biomedicines10123243

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Cheng N, Guo M, Yan F, Guo Z, Meng J, Ning K, et al. Application of machine learning in predicting aggressive behaviors from hospitalized patients with schizophrenia. Front Psychiatry. (2023) 14:1016586. doi: 10.3389/fpsyt.2023.1016586

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Kirchebner J, Lau S, Machetanz L. Offenders and non-offenders with schizophrenia spectrum disorders: Do they really differ in known risk factors for aggression? Front Psychiatry. (2023) 14:1145644. doi: 10.3389/fpsyt.2023.1145644

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Machetanz L, Hofmann AB, Möhrke J, Kirchebner J. Offenders and non-offenders with schizophrenia spectrum disorders: the crime-preventive potential of sufficient embedment in the mental healthcare and support system. Front Psychiatry. (2023) 14:1231851. doi: 10.3389/fpsyt.2023.1231851

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Machetanz L, Lau S, Habermeyer E, Kirchebner J. Suicidal offenders and non-offenders with schizophrenia spectrum disorders: A retrospective evaluation of distinguishing factors using machine learning. Brain Sci. (2023) 13:97. doi: 10.3390/brainsci13010097

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Watts D, Mamak M, Moulden H, Upfold C, de Azevedo Cardoso T, Kapczinski F, et al. The HARM models: Predicting longitudinal physical aggression in patients with schizophrenia at an individual level. J Psychiatr Res. (2023) 161:91–8. doi: 10.1016/j.jpsychires.2023.02.030

PubMed Abstract | CrossRef Full Text | Google Scholar

52. Woerner MG, Mannuzza S, Kane JM. Anchoring the BPRS: an aid to improved reliability. Psychopharmacol Bull. (1988) 24:112–7.

PubMed Abstract | Google Scholar

53. Telles LE, Day VP, Folino JO, Taborda JG. Reliability of the Brazilian version of HCR-20 assessing risk for violence. Braz J Psychiatry. (2009) 31:253–6. doi: 10.1590/S1516-44462009005000001

PubMed Abstract | CrossRef Full Text | Google Scholar

54. Stanford MS, Mathias CW, Dougherty DM, Lake SL, Anderson NE, Patton JH. Fifty years of the Barratt Impulsiveness Scale: An update and review. Pers Individ Differences. (2009) 47:385–95. doi: 10.1016/j.paid.2009.04.008

CrossRef Full Text | Google Scholar

55. Kay SR, Fiszbein A, Opler LA. The positive and negative syndrome scale (PANSS) for schizophrenia. Schizophr Bull. (1987) 13:261–76. doi: 10.1093/schbul/13.2.261

PubMed Abstract | CrossRef Full Text | Google Scholar

56. McEvoy JP, Apperson LJ, Appelbaum PS, Ortlip P, Brecosky J, Hammill K, et al. Insight in schizophrenia. Its relationship to acute psychopathology. J Nerv Ment Dis. (1989) 177:43–7. doi: 10.1097/00005053-198901000-00007

PubMed Abstract | CrossRef Full Text | Google Scholar

57. Smilkstein G, Ashworth C, Montano D. Validity and reliability of the family APGAR as a test of family function. J Fam Pract. (1982) 15:303–11.

PubMed Abstract | Google Scholar

58. Xiao S-Y. The theoretical basis and research application of social support rating scale. J Clin Psychiatry. (1994) 4:98–100.

Google Scholar

59. Pai S, Kapur RL. Impact of treatment intervention on the relationship between dimensions of clinical psychopathology, social dysfunction and burden on the family of psychiatric patients. Psychol Med. (1982) 12:651–8. doi: 10.1017/S0033291700055756

PubMed Abstract | CrossRef Full Text | Google Scholar

60. Kay SR, Wolkenfeld F, Murrill LM. Profiles of aggression among psychiatric patients. I. Nature and prevalence. J Nerv Ment Dis. (1988) 176:539–46. doi: 10.1097/00005053-198809000-00007

PubMed Abstract | CrossRef Full Text | Google Scholar

61. Feinstein R, Plutchik R. Violence and suicide risk assessment in the psychiatric emergency room. Compr Psychiatry. (1990) 31:337–43. doi: 10.1016/0010-440X(90)90040-Y

PubMed Abstract | CrossRef Full Text | Google Scholar

62. Monahan J, Appelbaum PS, Mulvey EP, Robbins PC, Lidz CW. Ethical and legal duties in conducting research on violence: lessons from the MacArthur Risk Assessment Study. Violence Vict. (1993) 8:387–96. doi: 10.1891/0886-6708.8.4.387

PubMed Abstract | CrossRef Full Text | Google Scholar

63. Cook AN, Moulden HM, Mamak M, Lalani S, Messina K, Chaimowitz G. Validating the hamilton anatomy of risk management–forensic version and the aggressive incidents scale. Assessment. (2016) 25:432–45. doi: 10.1177/1073191116653828

PubMed Abstract | CrossRef Full Text | Google Scholar

64. Hamsagayathri P, Vigneshwaran S. (2021). Symptoms based disease prediction using machine learning techniques, in: 2021 Third international conference on intelligent communication technologies and virtual mobile networks (ICICV) Tirunelveli, Indiadoi: IEEE. doi: 10.1109/ICICV50876.2021.9388603

CrossRef Full Text | Google Scholar

65. Alpaydin E. Introduction to machine learning. Cambridge, Massachusetts: The MIT Press. (2020).

Google Scholar

66. Emmanuel T, Maupong T, Mpoeleng D, Semong T, Mphago B, Tabona O. A survey on missing data in machine learning. J Big Data. (2021) 8:140. doi: 10.1186/s40537-021-00516-9

PubMed Abstract | CrossRef Full Text | Google Scholar

67. Garg A, Mago V. Role of machine learning in medical research: A survey. Comput Sci review. (2021) 40:100370. doi: 10.1016/j.cosrev.2021.100370

CrossRef Full Text | Google Scholar

68. Parmigiani G, Barchielli B, Casale S, Mancini T, Ferracuti S. The impact of machine learning in predicting risk of violence: A systematic review. Front Psychiatry. (2022) 13:1015914. doi: 10.3389/fpsyt.2022.1015914

PubMed Abstract | CrossRef Full Text | Google Scholar

69. Tay JL, Li Z, Sim K. Effectiveness of artificial intelligence methods in personalized aggression risk prediction within inpatient psychiatric treatment settings-A systematic review. J Pers Med. (2022) 12:1470. doi: 10.3390/jpm12091470

PubMed Abstract | CrossRef Full Text | Google Scholar

70. Meyer H, Reudenbach C, Hengl T, Katurji M, Nauss T. Improving performance of spatio-temporal machine learning models using forward feature selection and target-oriented validation. Environ Model Software. (2018) 101:1–9. doi: 10.1016/j.envsoft.2017.12.001

CrossRef Full Text | Google Scholar

71. Natekin A, Knoll A. Gradient boosting machines, a tutorial. Front Neurorobot. (2013) 7:21. doi: 10.3389/fnbot.2013.00021

PubMed Abstract | CrossRef Full Text | Google Scholar

72. Martínez-Martín N, Fraguas D, García-Portilla MP, Sáiz PA, Bascarán MT, Arango C, et al. Self-perceived needs are related to violent behavior among schizophrenia outpatients. J Nerv Ment Dis. (2011) 199:666–71. doi: 10.1097/NMD.0b013e318229d0d5

PubMed Abstract | CrossRef Full Text | Google Scholar

73. Soyka M, Graz C, Bottlender R, Dirschedl P, Schoech H. Clinical correlates of later violence and criminal offences in schizophrenia. Schizophr Res. (2007) 94:89–98. doi: 10.1016/j.schres.2007.03.027

PubMed Abstract | CrossRef Full Text | Google Scholar

74. Araya T, Ebnemelek E, Getachew R. Prevalence and Associated Factors of Aggressive Behavior among Patients with Schizophrenia at Ayder Comprehensive Specialized Hospital, Ethiopia. BioMed Res Int. (2020) 2020:7571939. doi: 10.1155/2020/7571939

PubMed Abstract | CrossRef Full Text | Google Scholar

75. Karabekiroğlu A, Pazvantoğlu O, Karabekiroğlu K, Böke Ö, Korkmaz IZ. Associations with violent and homicidal behaviour among men with schizophrenia. Nord J Psychiatry. (2016) 70:303–8. doi: 10.3109/08039488.2015.1109139

PubMed Abstract | CrossRef Full Text | Google Scholar

76. Wang J, Li C, Zhu XM, Zhang SM, Zhou JS, Li QG, et al. Association between schizophrenia and violence among Chinese female offenders. Sci Rep. (2017) 7:818. doi: 10.1038/s41598-017-00975-2

PubMed Abstract | CrossRef Full Text | Google Scholar

77. Swanson JW, Swartz MS, Van Dorn RA, Elbogen EB, Wagner HR, Rosenheck RA, et al. A national study of violent behavior in persons with schizophrenia. Arch Gen Psychiatry. (2006) 63:490–9. doi: 10.1001/archpsyc.63.5.490

PubMed Abstract | CrossRef Full Text | Google Scholar

78. Arango C, Calcedo Barba A, González S, Calcedo Ordóñez A. Violence in inpatients with schizophrenia: a prospective study. Schizophr Bull. (1999) 25:493–503. doi: 10.1093/oxfordjournals.schbul.a033396

PubMed Abstract | CrossRef Full Text | Google Scholar

79. Yi Y, Huang Y, Chen Q, Yang H, Li H, Feng Y, et al. Violence, neurocognitive function and clinical correlates in patients with schizophrenia. Front Psychiatry. (2022) 13:1087372. doi: 10.3389/fpsyt.2022.1087372

PubMed Abstract | CrossRef Full Text | Google Scholar

80. Abushua'leh K, Abu-Akel A. Association of psychopathic traits and symptomatology with violence in patients with schizophrenia. Psychiatry Res. (2006) 143:205–11. doi: 10.1016/j.psychres.2005.05.017

PubMed Abstract | CrossRef Full Text | Google Scholar

81. Shafer A. Meta-analysis of the brief psychiatric rating scale factor structure. Psychol Assess. (2005) 17:324–35. doi: 10.1037/1040-3590.17.3.324

PubMed Abstract | CrossRef Full Text | Google Scholar

82. Hare R, Hart S, Cox D. The hare psychopathy checklist: screening version (PCL-SV). Toronto: Multi-Health Systems Inc (1995).

Google Scholar

83. Cho W, Shin WS, An I, Bang M, Cho DY, Lee SH. Biological aspects of aggression and violence in schizophrenia. Clin Psychopharmacol Neurosci. (2019) 17:475–86. doi: 10.9758/cpn.2019.17.4.475

PubMed Abstract | CrossRef Full Text | Google Scholar

84. Ullrich S, Keers R, Coid JW. Delusions, anger, and serious violence: new findings from the MacArthur Violence Risk Assessment Study. Schizophr Bull. (2014) 40:1174–81. doi: 10.1093/schbul/sbt126

PubMed Abstract | CrossRef Full Text | Google Scholar

85. Tasmim S, Kolla NJ, Dada O, Bani-Fatemi A, De Luca V. Correlation between violence and antipsychotic dosage in schizophrenia: A secondary analysis of the clinical antipsychotic trials for intervention effectiveness (CATIE) dataset. Pharmacopsychiatry. (2019) 52:217–21. doi: 10.1055/a-0826-4935

PubMed Abstract | CrossRef Full Text | Google Scholar

86. Bobes J, Fillat O, Arango C. Violence among schizophrenia out-patients compliant with medication: prevalence and associated factors. Acta Psychiatr Scand. (2009) 119:218–25. doi: 10.1111/j.1600-0447.2008.01302.x

PubMed Abstract | CrossRef Full Text | Google Scholar

87. Walsh E, Gilvarry C, Samele C, Harvey K, Manley C, Tattan T, et al. Predicting violence in schizophrenia: a prospective study. Schizophr Res. (2004) 67:247–52. doi: 10.1016/S0920-9964(03)00091-4

PubMed Abstract | CrossRef Full Text | Google Scholar

88. Kageyama M, Solomon P, Kita S, Nagata S, Yokoyama K, Nakamura Y, et al. Factors related to physical violence experienced by parents of persons with schizophrenia in Japan. Psychiatry Res. (2016) 243:439–45. doi: 10.1016/j.psychres.2016.06.036

PubMed Abstract | CrossRef Full Text | Google Scholar

89. Caqueo-Urízar A, Fond G, Urzúa A, Boyer L, Williams DR. Violent behavior and aggression in schizophrenia: Prevalence and risk factors. A multicentric study from three Latin-America countries. Schizophr Res. (2016) 178:23–8. doi: 10.1016/j.schres.2016.09.005

PubMed Abstract | CrossRef Full Text | Google Scholar

90. Fjellvang M, Grøning L, Haukvik UK. Imaging violence in schizophrenia: A systematic review and critical discussion of the MRI literature. Front Psychiatry. (2018) 9:333. doi: 10.3389/fpsyt.2018.00333

PubMed Abstract | CrossRef Full Text | Google Scholar

91. Siever LJ. Neurobiology of aggression and violence. Am J Psychiatry. (2008) 165:429–42. doi: 10.1176/appi.ajp.2008.07111774

PubMed Abstract | CrossRef Full Text | Google Scholar

92. Bonelli RM, Cummings JL. Frontal-subcortical circuitry and behavior. Dialogues Clin Neurosci. (2007) 9:141–51. doi: 10.31887/DCNS.2007.9.2/rbonelli

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: artificial intelligence, machine learning, schizophrenia, schizophrenia spectrum disorder, violent behavior

Citation: Parsaei M, Arvin A, Taebi M, Seyedmirzaei H, Cattarinussi G, Sambataro F, Pigoni A, Brambilla P and Delvecchio G (2024) Machine Learning for prediction of violent behaviors in schizophrenia spectrum disorders: a systematic review. Front. Psychiatry 15:1384828. doi: 10.3389/fpsyt.2024.1384828

Received: 10 February 2024; Accepted: 08 March 2024;
Published: 21 March 2024.

Edited by:

Massimo Tusconi, University of Cagliari, Italy

Reviewed by:

Stefano Barlati, University of Brescia, Italy
Artemis Igoumenou, University College London, United Kingdom
Serdar M. Dursun, University of Alberta, Canada
Johannes Kirchebner, University of Zurich, Switzerland

Copyright © 2024 Parsaei, Arvin, Taebi, Seyedmirzaei, Cattarinussi, Sambataro, Pigoni, Brambilla and Delvecchio. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Giuseppe Delvecchio, Z2l1c2VwcGUuZGVsdmVjY2hpb0Bwb2xpY2xpbmljby5taS5pdA==

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Machine Learning for prediction of violent behaviors in schizophrenia spectrum disorders: a systematic review

1 Introduction

2 Materials and methods

2.1 Search strategy

2.2 Inclusion and exclusion criteria

2.3 Study selection

2.4 Data extraction

2.5 Data synthesis

2.6 Risk of bias assessment

3 Results

3.1 Study selection

3.2 Study characteristics

3.2.1 General features

3.2.2 Input measures

3.2.3 Output measures

3.3 Machine learning

3.3.1 Overview of algorithms

3.3.2 Model development

3.3.3 Model validation

3.3.4 Models results

3.3.5 Models comparison

3.4 Discriminative features

3.4.1 Sociodemographic features

3.4.2 Clinical features

3.4.3 Neuroimaging features

3.4.4 Metabolic features

3.5 Risk of bias assessment

4 Discussion

4.1 Key findings

4.2 Machine learning models

4.3 Discriminative features

4.4 Limitations and further directions

4.5 Conclusions

Data availability statement

Author contributions

Funding

Conflict of interest

Publisher’s note

Supplementary material

Glossary

References

95% of researchers rate our articles as excellent or good