
95% of researchers rate our articles as excellent or good
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.
Find out more
ORIGINAL RESEARCH article
Front. Psychol. , 28 February 2025
Sec. Psychopathology
Volume 16 - 2025 | https://doi.org/10.3389/fpsyg.2025.1500200
Background: In a recent study of Norwegian children (N = 874), Halse et al. used random-intercept cross-lagged panel models (RI-CLPM) and concluded that their findings supported the assumption that deficiencies in executive functions and psychopathology are both a cause and a consequence of the other. However, it is known that RI-CLPM can give biased results.
Methods: We reanalyzed data simulated to resemble the data used by Halse et al. with several complementary models, e.g., latent change score models (LCSM).
Results: The analyzed models indicated contradictory simultaneous increasing and decreasing effects between executive deficits and symptoms of psychopathology.
Conclusion: The present contradictory findings suggested that prospective effects between executive deficits and symptoms of psychopathology may have been spurious rather than truly increasing. Consequently, conclusions by Halse et al. appear to have been premature. It is important for researchers to bear in mind that correlations, including cross-lagged effects in RI-CLPM, do not prove causality. Careful interpretation of RI-CLPM results is of utmost importance in, for example, research in clinical and developmental psychology. We recommend researchers to use, as we did here, triangulation to scrutinize findings from analyses of observational data.
In random-intercept cross-lagged panel models (RI-CLPM), longitudinally measured scores on two (or more) constructs are regressed on stable, trait-like, latent variables (the random intercepts). Then, auto-regressive and cross-lagged effects are estimated between residuals of the measured scores not accounted for by individuals’ stable levels. These residuals could, for example, be occasion-specific deviations from individuals’ stable levels of anxiety and depression and the cross-lagged effects would estimate how deviation in anxiety at time T predicts deviation in depression at time T + 1 when adjusting for deviation in depression at time T and vice versa. In this way, effects are presumably estimated within individuals (Hamaker et al., 2015; Mulder and Hamaker, 2021). Within-individual effects should give better estimations of true increasing or decreasing effects compared with effects in traditional cross-lagged panel models (CLPM), where effects are conflated by between-individual differences (Usami et al., 2019).
Halse et al. (2022), used RI-CLPM and concluded that their findings supported the assumption that deficiencies in executive functions and psychopathology are both a cause and a consequence of the other. However, there are indications that RI-CLPM, similarly as CLPM, may be susceptible to spurious findings (Lüdtke and Robitzsch, 2021; Lucas, 2023; Sorjonen et al., 2023; Murayama and Gfrörer, 2024). For example, if longitudinal scores on two constructs X and Y are affected by common auto-correlated state factors, RI-CLPM will tend to indicate statistically significant, but spurious, cross-lagged effects (Sorjonen et al., 2024b). For example, imagine some individuals with the same within-individual score on anxiety at time T (i.e., they deviate equally much from their stable levels of anxiety) but who differ in their within-individual score on depression at time T. In this situation, we should expect that those with a higher score on within-individual depression have received a lower score on within-individual anxiety than they should have (i.e., a more negative residual) while those with a lower within-individual score on depression have received a higher score than they should have (i.e., a more positive residual). However, residuals tend to regress toward a mean value of zero between measurements. Consequently, we should expect a more positive, but spurious, change in within-individual anxiety between time T and time T + 1 for those with a higher within-individual score on depression at time T compared with those with the same within-individual score on anxiety but a lower within-individual score on depression at time T. Consequently, positive cross-lagged effects between within-individual depression and within-individual anxiety in an analysis with RI-CLPM could be due to correlations with residuals and regression to the mean rather than genuine (i.e., non-spurious) increasing effects.
Given indications that results from RI-CLPM may be spurious, the objective of the present study was to reanalyze data simulated to resemble the data used by Halse et al. (2022) and to evaluate their conclusion of causal effects between deficiencies in executive functions and psychopathology. We used simulated data as the empirical data used by Halse et al. were not available to us. Our reanalyses followed recommendations to triangulate findings from analyses of correlational (i.e., non-experimental) data (Munafò and Davey Smith, 2018; Hammerton and Munafò, 2021; Sorjonen and Melin, 2024). If results from different models converge, conclusions of causality are corroborated. If, on the other hand, results from different models diverge, conclusions of causality would appear premature.
We refer to Halse et al. (2022) for more comprehensive information on the study sample, used instruments, research procedure, etc. In short, data on deficits in executive functions and symptoms of depression, oppositional defiant disorder and conduct disorder (ODD/CD), attention-deficit hyperactivity disorder (ADHD), and anxiety were collected for children (N = 874) born in Norway between 2003 and 2004. Data were collected on five occasions (every second year) when the children were between 6 and 14 years old.
We used simulated data as the empirical data used by Halse et al. were not available to us. It is important to bear in mind that RI-CLPM, as well as other types of structural equation models (SEM), extract covariances or correlations (for standardized parameters) from raw data and use these as input for the analyses. Consequently, although slight deviations can be expected due to missing values and non-normality in empirical data, analyses of simulated data should yield very similar results as analyses of empirical data. For example, a cross-lagged effect of X1 on Y2 when adjusting for Y1 is given by correlations between the variables (Equation 1, Cohen et al., 2003). This means that the cross-lagged effect would be the same in simulated data with the same correlations between variables as in empirical data.
We fitted five different models (Figure 1) on the simulated data, separately for each of the four measures of psychopathology: (A) Original RI-CLPM, where within-individual residuals of executive deficits at time T predicted within-individual residuals of symptoms of psychopathology at time T + 1 when adjusting for within-individual residuals of symptoms of psychopathology at time T, and vice versa; (B) Reversed RI-CLPM, where within-individual residuals of executive deficits at time T predicted within-individual residuals of symptoms of psychopathology at time T when adjusting for within-individual residuals of symptoms of psychopathology at time T + 1; (C) A second reversed RI-CLPM, where within-individual residuals of symptoms of psychopathology at time T predicted within-individual residuals of executive deficits at time T when adjusting for within-individual residuals of executive deficits at time T + 1; (D) Latent change score model (LCSM) (McArdle, 2009; Ghisletta and McArdle, 2012; Kievit et al., 2018), where executive deficits at time T predicted latent change in symptoms of psychopathology between times T and T + 1 and vice versa; (E) Model of spurious longitudinal associations (MoSLA) (Sorjonen and Melin, 2024), where the five measurements of executive deficits and symptoms of psychopathology were regressed on stable trait-like levels of the constructs, i.e., random intercepts, as well as on auto-correlated state factors affecting executive deficits and psychopathology measured at the same occasion. The trait-like levels of executive deficits and psychopathology were affected, i.e., confounded, by a common trait-factor. Analyses were conducted with R 4.3.1 statistical software (R Core Team, 2024) employing the lavaan (Rosseel, 2012) and MASS (Venables and Ripley, 2002) packages. The analytic script, which also generates the simulated data, is available at the Open Science Framework at https://osf.io/fs8c7/.
Figure 1. The five analyzed models. (A) Original RI-CLPM, where initial within-individual levels of executive deficits predicted subsequent within-individual levels of psychopathology when adjusting for initial within-individual levels of psychopathology and vice versa; (B) Reversed RI-CLPM, where initial within-individual levels of executive deficits predicted initial within-individual levels of psychopathology when adjusting for subsequent within-individual levels of psychopathology. Parameters between random intercepts and the measured scores not shown; (C) A second reversed RI-CLPM, where initial within-individual levels of psychopathology predicted initial within-individual levels of executive deficits when adjusting for subsequent within-individual levels of executive deficits. Parameters between random intercepts and the measured scores not shown; (D) Latent change score model (LCSM), where initial levels of executive deficits predicted subsequent latent change in levels of psychopathology and vice versa; (E) Model of spurious longitudinal associations (MoSLA), where scores on executive deficits and psychopathology were regressed on general, trait-like, random intercepts as well as on auto-correlated state factors. ED, executive deficits; PP, psychopathology; CT, common trait-factor; 1, 2, 3 = measurements at waves 1, 2, and 3, respectively; gED/gPP, general, trait-like, levels (i.e., random intercepts) of executive deficits and psychopathology, respectively; rED/rPP, within-individual residuals of executive deficits and psychopathology, respectively; dED/dPP, latent change in executive deficits and psychopathology, respectively; St, state factor; be/bp, effects of main interest; analyzed data contained five waves of measurement rather than three as illustrated here.
If executive deficits and psychopathology had, as concluded by Halse et al. (2022), causal increasing effects on each other, we should expect focal effects (labeled be and bp in Figure 1) to be positive in models A and D but negative in models B and C. Although negative effects may appear counterintuitive, they would indicate that a low initial within-individual value on executive deficits had compensated for a high initial within-individual value on psychopathology and allowed individuals to reach the same subsequent within-individual level of psychopathology as individuals with a lower initial within-individual value on psychopathology but a higher initial within-individual value on executive deficits, and vice versa. For example, an effect of −0.5 of initial within-individual executive deficits on initial within-individual psychopathology when adjusting for subsequent within-individual psychopathology would mean that among individuals with the same subsequent within-individual level of psychopathology (e.g., zero), those with lower initial within-individual executive deficits (e.g., −1) had had a higher initial within-individual level of psychopathology (−1 × −0.5 = 0.5) compared with those with higher initial within-individual executive deficits (e.g., 1 and 1 × −0.5 = −0.5). This would, in turn, mean that those with lower initial within-individual executive deficits had experienced a more negative change in within-individual psychopathology between the measurements (0–0.5 = −0.5) compared with those with the same subsequent within-individual level of psychopathology but with a higher initial within-individual level of executive deficits (0 – (−0.5) = 0.5). As an analogy, given that pouring water into a glass has an increasing effect on its total weight (i.e., including its content), if we pour different amounts of water into glasses that then have the same subsequent weight, those glasses that received least water should have had the highest initial weight. This corresponds to a negative effect of the amount of poured water on initial weight when adjusting for subsequent weight and would indicate that a high initial weight had compensated for receiving little water and allowed the glass to have the same subsequent weight as glasses that received more water. Moreover, an adequate fit of model E would indicate that data may have been generated without any genuine effects between executive deficits and psychopathology, i.e., the apparent increasing effects in the RI-CLPM may have been spurious.
The size of focal effects (labeled be and bp in Figure 1) as well as model fits are presented in Table 1. With the possible exception of LCSM (model D), the models exhibited adequate fit for all four measures of symptoms of psychopathology. In the original RI-CLPM (model A), the positive focal effects indicated that when adjusting for initial within-individual levels of psychopathology, initial within-individual levels of executive deficits had an increasing effect on subsequent change in within-individual levels of psychopathology and vice versa. However, the positive focal effects in the two reversed RI-CLPMs (models B and C) indicated contradictory decreasing effects between within-individual levels of executive deficits and psychopathology. For example, among individuals with the same subsequent within-individual level of depression (e.g., zero), those with a low initial within-individual level of executive deficits (e.g., −1) were predicted to have had a lower initial within-individual level of depression (−1 × 0.13 = −0.13) compared with those with a high initial within-individual level of executive deficits (e.g., 1 and 1 × 0.13 = 0.13). Consequently, those with a low initial within-individual level of executive deficits were predicted to have increased in within-individual level of depression between measurements (0 – (−0.13) = 0.13) while those with a high initial within-individual level of executive deficits were predicted to have decreased in within-individual level of depression between measurements (0–0.13 = −0.13). The positive focal effects in models B and C suggest that high, not low, initial within-individual levels of executive deficits had compensated for high initial within-individual levels of psychopathology and allowed individuals to reach the same subsequent within-individual levels of psychopathology as individuals with lower initial within-individual levels of executive deficits and psychopathology and vice versa. Moreover, the LCSMs (model D) did not indicate increasing effects of initial executive deficits on subsequent change in symptoms of psychopathology and vice versa. Rather, these effects tended to be negative. Furthermore, the adequate fit of the MoSLA (model E) suggested that data may have been generated without any genuine effects between executive deficits and symptoms of psychopathology, i.e., the apparent increasing effects in the RI-CLPM may have been spurious. In summary, conclusions of reciprocal prospective increasing effects between executive deficits and symptoms of psychopathology were supported by the original RI-CLPM (Model A) but contested by the other models (Models B–E).
Table 1. Effects between deficits in executive functions and symptoms of psychiatric disorders, separately for four different disorders and five different models.
The present study set out to conduct reanalyses and to evaluate the conclusion by Halse et al. (2022) of causal effects between deficiencies in executive functions and psychopathology. Random-intercept cross-lagged panel models (RI-CLPM) indicated, in agreement with Halse et al., increasing effects between within-individual levels of executive deficits and symptoms of psychopathology. However, reversed RI-CLPMs and latent change score models (LCSM) indicated contradictory decreasing effects between executive deficits and symptoms of psychopathology. Moreover, models of spurious longitudinal associations (MoSLAs) indicated that data may have been generated without any genuine effects between executive deficits and symptoms of psychopathology, i.e., the apparent increasing effects in the RI-CLPM may have been spurious. Taken together, results from the present reanalyses and triangulation suggested that data analyzed by us and by Halse et al. do not support conclusions of causal effects between executive deficits and symptoms of psychopathology and, consequently, that the conclusions by Halse et al. in this regard were premature. In extension, the present findings also suggest that trying to alleviate symptoms of psychopathology through measures targeting executive deficits, and vice versa, may not be the best use of limited clinical resources.
The present reanalyses followed recommendations to triangulate findings from analyses of observational data (Munafò and Davey Smith, 2018; Hammerton and Munafò, 2021; Sorjonen and Melin, 2024). We agree with this recommendation, because correlations do not prove causality. This is true both of zero-order correlations and more advanced correlations like adjusted cross-lagged effects in traditional CLPM and in RI-CLPM. With longitudinal data, triangulations may utilize the fact that if a variable X (e.g., amount of water poured into a glass) has an increasing effect on a variable Y (e.g., the total weight of the glass), X should also compensate for a low initial value on Y. This means that with a genuine increasing effect of X on Y, X should have a positive effect on subsequent Y when adjusting for initial Y, a positive effect on the subsequent Y - initial Y difference, as well as a negative effect on initial Y when adjusting for subsequent Y. Without this combination of expected effects, observed effects would appear to be spurious and conclusions of causality premature. We (Sorjonen et al., 2023, 2024a, 2024b; Sorjonen and Melin, 2024) have previously used this type of triangulation and challenged conclusions in studies using RI-CLPM and claiming effects of need of cognition on anxiety and depression symptoms (Zainal and Newman, 2022), of income on self-esteem (Bleidorn et al., 2023), of curiosity on creativity and vice versa (Ma and Wei, 2023), and of self-esteem on eating pathology and vice versa (Beckers et al., 2023). We recommend researchers to use similar triangulations to scrutinize findings from analyses of observational data.
As we did not have access to the empirical data used by Halse et al. (2022), we reanalyzed data simulated to have the same sample size and correlations between variables as in their empirical dataset. Some may view this as a major limitation of the present study. However, it is important to bear in mind that RI-CLPM, as well as other types of structural equation models (SEM), extract covariances or correlations (for standardized parameters) from raw data and use these as input for the analyses. Consequently, although slight deviations can be expected due to missing values and non-normality in empirical data, analyses of simulated data should yield very similar results as analyses of empirical data. In the present case, this conclusion was corroborated by the fact that original RI-CLPMs indicated, in agreement with Halse et al., increasing effects between executive deficits and symptoms of psychopathology. The crucial difference between our analyses and analyses by Halse et al. was not that we used simulated data, but that we, differently from them, used triangulation and fitted several complementary models to the same data, which allowed us to discriminate between genuine and spurious increasing effects. Moreover, reliability is a function of correlations. Consequently, as the correlations in our simulated dataset were the same as in the original empirical dataset used by Halse et al., there were no differences in measurement reliability between the two datasets.
The data collection may not have utilized optimal instruments, study procedures, etc. However, it is, again, important to bear in mind that such factors were constant across the analyzed models, meaning that they cannot explain why the models indicated simultaneous and contradictory increasing and decreasing effects between executive deficits and symptoms of psychopathology. Consequently, such possible deficiencies do not pose a threat against our conclusion that the data analyzed by Halse et al., and reanalyzed by us, do not allow conclusions of causal increasing effects between executive deficits and symptoms of psychopathology.
The present study was based on data simulating data collected in a, presumably quite homogenous, sample of Norwegian children. It is unclear to what degree the present main finding, that prospective effects between executive deficits and symptoms of psychopathology appear to be spurious rather than genuinely increasing, generalizes to other populations. It should be noted that a potential limitation in generalizability is due to where and how the original data were collected and not a graver threat against our study employing simulated data than against the original study by Halse et al.
Halse et al. (2022) conducted analyses with random-intercept cross-lagged panel models (RI-CLPM) and concluded that their findings supported the assumption that deficiencies in executive functions and psychopathology are both a cause and a consequence of the other. Here, we reanalyzed data simulated to resemble the data used by Halse et al. Our findings suggested that prospective effects between executive deficits and symptoms of psychopathology may have been spurious rather than truly increasing. Consequently, conclusions by Halse et al. appear to have been premature. It is important for researchers to bear in mind that correlations, including cross-lagged effects in RI-CLPM, do not prove causality. We recommend researchers to use, as we did here, triangulation to scrutinize findings from analyses of observational data.
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found below: the analytic script, which also generates the simulated data, is available at the Open Science Framework at https://osf.io/fs8c7/.
KS: Conceptualization, Data curation, Formal analysis, Investigation, Methodology, Project administration, Visualization, Writing – original draft, Writing – review & editing. BM: Conceptualization, Investigation, Supervision, Validation, Writing – review & editing.
The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Beckers, D., Burk, W. J., Larsen, J. K., and Cillessen, A. H. N. (2023). The bidirectional associations between self-esteem and problematic eating behaviors in adolescents. Int. J. Eat. Disord. 57, 104–115. doi: 10.1002/eat.24083
Bleidorn, W., Kretzschmar, A., Rauthmann, J. F., Orth, U., Denissen, J. J. A., and Hopwood, C. J. (2023). Self-esteem and income over time. Psychol. Sci. 34, 1163–1172. doi: 10.1177/09567976231185129
Cohen, J., Cohen, P., West, S. G., and Aiken, L. S. (2003). Applied multiple regression/correlation analysis for the behavioral sciences. Third Edn. Mahwah, NJ: Lawrence Erlbaum Associates.
Ghisletta, P., and McArdle, J. J. (2012). Latent curve models and latent change score models estimated in R. Struct. Equ. Model. Multidiscip. J. 19, 651–682. doi: 10.1080/10705511.2012.713275
Halse, M., Steinsbekk, S., Hammar, Å., and Wichstrøm, L. (2022). Longitudinal relations between impaired executive function and symptoms of psychiatric disorders in childhood. J. Child Psychol. Psychiatry 63, 1574–1582. doi: 10.1111/jcpp.13622
Hamaker, E. L., Kuiper, R. M., and Grasman, R. P. (2015). A critique of the cross-lagged panel model. Psychol. Methods 20, 102–116. doi: 10.1037/a0038889
Hammerton, G., and Munafò, M. R. (2021). Causal inference with observational data: the need for triangulation of evidence. Psychol. Med. 51, 563–578. doi: 10.1017/S0033291720005127
Kievit, R. A., Brandmaier, A. M., Ziegler, G., van Harmelen, A.-L., de Mooij, S. M. M., Moutoussis, M., et al. (2018). Developmental cognitive neuroscience using latent change score models: a tutorial and applications. Dev. Cogn. Neurosci. 33, 99–117. doi: 10.1016/j.dcn.2017.11.007
Lucas, R. E. (2023). Why the cross-lagged panel model is almost never the right choice. Adv. Methods Pract. Psychol. Sci. 6:25152459231158378. doi: 10.1177/25152459231158378
Lüdtke, O., and Robitzsch, A. (2021). A critique of the random intercept cross-lagged panel model. Available online at: https://osf.io/preprints/psyarxiv/6f85c_v1 (Accessed February 22, 2022).
Ma, J., and Wei, W. (2023). Curiosity causes creativity? Revealing the reinforcement circle between state curiosity and creativity. J. Creat. Behav. 57, 488–494. doi: 10.1002/jocb.606
McArdle, J. J. (2009). Latent variable modeling of differences and changes with longitudinal data. Annu. Rev. Psychol. 60, 577–605. doi: 10.1146/annurev.psych.60.110707.163612
Mulder, J. D., and Hamaker, E. L. (2021). Three extensions of the random intercept cross-lagged panel model. Struct. Equ. Model. Multidiscip. J. 28, 638–648. doi: 10.1080/10705511.2020.1784738
Munafò, M. R., and Davey Smith, G. (2018). Robust research needs many lines of evidence. Nature 553, 399–401. doi: 10.1038/d41586-018-01023-3
Murayama, K., and Gfrörer, T. (2024). Thinking clearly about time-invariant confounders in cross-lagged panel models: a guide for choosing a statistical model from a causal inference perspective. Psychol. Methods. doi: 10.1037/met0000647
R Core Team (2024). R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing.
Rosseel, Y. (2012). Lavaan: an R package for structural equation modeling. J. Stat. Softw. 48, 1–36. doi: 10.18637/jss.v048.i02
Sorjonen, K., and Melin, B. (2024). Using triangulation to evaluate findings from random-intercept cross-lagged panel models: an application with data on curiosity and creativity. Available online at: https://osf.io/preprints/psyarxiv/z83y5_v1 (Accessed March 21, 2024).
Sorjonen, K., Melin, M., and Melin, B. (2024a). Inconclusive evidence for within-individual increasing effects of income on self-esteem: A reanalysis and comment on Bleidorn et al. (2023). Available online at: https://osf.io/preprints/psyarxiv/q9uyr_v1 (Accessed April 22, 2024).
Sorjonen, K., Nilsonne, G., Ghaderi, A., and Melin, B. (2024b). Spurious effects in random-intercept cross-lagged panel models: results from simulations and reanalyses of data on self-esteem and problematic eating behaviors used by Beckers et al. (2023). Available online at: https://osf.io/preprints/psyarxiv/hferw_v1 (Accessed February 4, 2024).
Sorjonen, K., Nilsonne, G., Melin, B., and Ingre, M. (2023). Uncertain inference in random intercept cross-lagged panel models: an example involving need for cognition and anxiety and depression symptoms. Personal. Individ. Differ. 201:111925. doi: 10.1016/j.paid.2022.111925
Usami, S., Murayama, K., and Hamaker, E. L. (2019). A unified framework of longitudinal models to examine reciprocal relations. Psychol. Methods 24, 637–657. doi: 10.1037/met0000210
Venables, W. N., and Ripley, B. D. (2002). Modern applied statistics with S. Fourth Edn. New York: Springer.
Keywords: executive deficits, random-intercept cross-lagged panel models, reanalyses, spurious effects, symptoms of psychopathology, triangulation
Citation: Sorjonen K and Melin B (2025) Inconclusive effects between executive functions and symptoms of psychiatric disorders in random-intercept cross-lagged panel models: a simulated reanalysis and comment on Halse et al. (2022). Front. Psychol. 16:1500200. doi: 10.3389/fpsyg.2025.1500200
Received: 22 September 2024; Accepted: 19 February 2025;
Published: 28 February 2025.
Edited by:
Preethi Premkumar, London South Bank University, United KingdomReviewed by:
Rodrigo Erick Escartín-Pérez, National Autonomous University of Mexico, MexicoCopyright © 2025 Sorjonen and Melin. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Kimmo Sorjonen, a2ltbW8uc29yam9uZW5Aa2kuc2U=
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
Research integrity at Frontiers
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.