- 1Institute of Psychology, University of Kiel, Kiel, Germany
- 2Department of Psychology, University of Bonn, Bonn, Germany
- 3Institute of Psychology, University of London, London, United Kingdom
This study highlights psychometric properties and evidence of construct validity on parcel-level for questionnaires on the original and revised reinforcement sensitivity theory. Our data (N = 1,076) suggest good to very good psychometric properties and moderate to excellent internal consistencies. Confirmatory factor analysis (CFA) models suggest a very good model fit for the first-order, four factor models of the Carver-White BIS/BAS scales, Reinforcement Sensitivity Theory – Personality Questionnaire (RST-PQ), the two-factor model of revised Reinforcement Sensitivity Theory-Questionnaire (rRST-Q) and for the bifactor model of the Conflict Monitoring Questionnaire (CMQ-44). The CMQ-44 extends the psychometric measurement of previous trait-(r)BIS and trait-BAS scales. Factor scores of CMQ-44 cognitive demand correlate positively with factor scores of Carver-White BIS and all Carver-White BAS subfactors except RST-PQ-Impulsivity suggesting that CMQ-44 cognitive demand addresses Carver-White trait-BIS specifically and more generally the trait-BAS core. CMQ-44 anticipation of negative consequences and response adaptation correlate negatively with trait-BAS, whereas the second-order factor performance monitoring extends the rRST trait-space and correlates positively with trait-BAS.
Introduction
The reinforcement sensitivity theory (RST, Gray, 1987) and its latest revision in 2000 (rRST, Gray and McNaughton, 2000) have motivated a number of questionnaire developments in English starting more than 30 years ago (Wilson et al., 1989, 1990) but also in recent years (Corr, 2008; Corr and Cooper, 2016). One of the most important implications of rRST compared to former RST versions is its differentiation of the functioning of the behavioral inhibition system (BIS), the behavioral approach system (BAS), and the Fight-Flight-Freeze system (FFFS). In rRST, the BIS is presumed to detect and solve conflicts that have implications for behavioral adaptations like BAS-related and/or FFFS-related approach behavior (e.g., Fight) and FFFS-related withdrawal behavior such as Flight or Freeze (Corr, 2008). That is, in case of conflicting information the BIS changes from checking (e.g., observing, comparing mode) to control mode by innervating behavioral changes of the BAS and the FFFS (Gray and McNaughton, 2000). The abbreviation “rRST” is used to indicate that scales of the “newer” rRST (Gray and McNaughton, 2000; Corr, 2008) are discussed or compared to scales of the “older” RST (Gray, 1987).
The present study incorporates a construct validation of rRST questionnaires in German language that have been published between 2015 and 2021 (Reuter et al., 2015; Pugnaghi et al., 2018; Leue and Beauducel, 2021). For the purpose of comparison, the construct validation of the more recent rRST questionnaires in this study also includes the German version of the Carver and White (1994) BIS/BAS scales developed based on the “older” RST (Strobel et al., 2001). The number of RST-related personality questionnaires developed in different languages between the 1980s and 2021 is larger than presented here (see Corr, 2008; Krupić et al., 2016; Walker and Jackson, 2016; Leue and Beauducel, 2021, for further questionnaires). Among psychometric studies, different models for disentangling rRST scales have been tested using confirmatory factor analysis (CFA). Therefore, the aim of the present study is four-fold: (1) We highlight the psychometric properties (i.e., item means, part-whole corrected item-total correlations, different types of reliabilities) and descriptive statistics. (2) We present evidence of factorial validity for different trait-models of the (r)RST questionnaires. (3) We describe whether (r)RST latent factors are measurement-equivalent across gender. (4) We aim at presenting a priori predicted convergent and discriminant construct validity (Campbell and Fiske, 1959) based on factor scores.
Psychometric properties and confirmatory factor analysis of (r)RST trait-scales
For all (r)RST-related questionnaires examined in this study, psychometric properties have been reported as a part of the construct validation (Strobel et al., 2001; Reuter et al., 2015; Pugnaghi et al., 2018; Leue and Beauducel, 2021). In terms of published criteria, all (r)RST questionnaires in the present study reveal good to excellent psychometric properties (i.e., positive and ≥0.10 part-whole corrected item total correlations) and good to excellent internal consistencies (Nunnally and Bernstein, 1994). Therefore, the present study seeks to investigate psychometric properties of the (r)RST questionnaires in relation to conceptual replication issues (research question 1).
The Carver and White BIS/BAS scales (Carver and White, 1994) with their German translation (Strobel et al., 2001) measure trait-BIS as a sensitivity to aversive reinforcement (Gray, 1987). Moreover, the Carver and White BIS/BAS scales assess trait-BAS as a total scale. The total trait-BAS scale incorporates three BAS subscales entitled as BAS-Drive, BAS-Reward Responsiveness, and BAS-Fun Seeking. FFFS-related behavior has not been psychometrically assessed in the Carver and White BIS/BAS scales. The Reuter-Montag Reinforcement Sensitivity Questionnaire (Reuter and Montag’s rRST-Q, Reuter et al., 2015) includes trait-BAS, trait-BIS and trait-FFFS in accordance with rRST. Similarly, the Reinforcement Sensitivity Theory – Personality Questionnaire (RST-PQ, Corr and Cooper, 2016) and its German translation (Pugnaghi et al., 2018) measures trait-BIS, trait-FFFS, and trait-BAS in terms of rRST. Trait-BIS and trait-FFFS are conceived as “unitary defensive factors” (Pugnaghi et al., 2018, p. 2) including four and three subscales, respectively. Trait-BAS incorporates four subscales. Table 1 summarizes short definitions of the trait-BIS, trait-BAS and trait-FFFS scales.
The factorial validity of the German Carver-White BIS/BAS scales, the RST-PQ, and the rRST-Q has been confirmed by means of CFAs (Strobel et al., 2001; Reuter et al., 2015; Pugnaghi et al., 2018). Further studies have examined alternative models of the Carver-White BIS/BAS scales—-especially a two-factor model of the trait-BIS scale in addition to the three trait-BAS subscales (Johnson et al., 2003; Heym et al., 2008; Levinson et al., 2011; Müller et al., 2013; Maack and Ebesutani, 2018). In addition to CFA studies for the Carver-White BIS/BAS scales, further CFA models were tested for the RST-PQ (Krupić et al., 2016; Wytykowska et al., 2017; Eriksson et al., 2019). For a short RST-PQ scale see Veccione and Corr (2020).
As a new development, the conflict monitoring questionnaire (CMQ, Leue and Beauducel, 2021) investigates determinants of revised trait-BIS-related conflict monitoring by means of anticipation of negative consequences (Gray and McNaughton, 2000; Corr, 2008) and cognitive demand (Botvinick, 2007; Leue et al., 2012, 2014). Response adaptation and uncertainty of reinforcement are assessed as behavioral consequences following stimulus-related conflict monitoring (named as “response patterns following conflict monitoring” in Leue and Beauducel, 2021, Table 1). The CMQ-44 (i.e., “44” deriving from the fact that 44 of 60 items with the best psychometric properties are analyzed, see “Materials and Methods” section) has been developed based on facet theory (Shye et al., 1994; Guttman and Greenbaum, 1998; Hackett, 2014) meaning that each item contains a determinant and a consequence of conflict monitoring (Leue and Beauducel, 2021). In sum, the present study aims at investigating construct validity (Cronbach and Meehl, 1955; Campbell and Fiske, 1959) of the (r)RST-related questionnaires within one sample allowing also to compare pre-processing issues of the items (see section Parceling issues; research question 2).
Measurement equivalence
Effects of measurement equivalence have been rarely addressed in previous CFA studies on (r)RST questionnaires. Beyond CFA modeling, effects of gender and/or age groups have been reported on the scale-level for the German version of the Carver-White BIS/BAS scales (Strobel et al., 2001) and the rRST-Q of Reuter et al. (2015). Correspondingly, trait-BIS and trait-FFFS (Flight and Freeze) mean values were slightly higher in female compared to male participants, but not for trait-BAS (Reuter et al., 2015, their Table 3). Trait-BIS and trait-BAS were significantly higher for females than males (Strobel et al., 2001). Age group effects were not significant (Strobel et al., 2001). Leue and Beauducel (2021) also reported gender effects meaning that women reported higher CMQ-44 cognitive demand and CMQ-44 performance monitoring than men. Therefore, this study investigates a Multiple-Indicator-Multiple-Cause (MIMIC) model for the Carver-White BIS/BAS scales, the RST-PQ, the rRST-Q, and for the CMQ-44. A MIMIC model has been preferred over other statistical models (e.g., multiple group confirmatory factor analysis) in purpose of comparison with the original factorial validation of the CMQ-44 published in Leue and Beauducel (2021). We address gender as a MIMIC factor in all CFA models (research question 3).
Additionally, none of the (r)RST CFA studies investigated a hierarchical structure of the BIS/BAS sub-scales. As Leue and Beauducel (2021) indicated, the CMQ-44 allows for a hierarchical factor model including performance monitoring (G) as a second-order factor and four first-order factors (cognitive demand, anticipation of negative consequences, response adaptation, uncertainty of reinforcement). Performance monitoring has been correlated with the other (r)RST first-order factor scores (Results section) to evaluate the generality of the first-order factors in non-hierarchical (r)RST models.
Previous results on convergent and discriminant validity among (r)RST trait-scales
Previous studies revealed positive inter-correlations between trait-BIS scales of the Carver-White BIS/BAS scales (Strobel et al., 2001), the RST-PQ (Pugnaghi et al., 2018), and the rRST-Q (Reuter et al., 2015). Similarly, positive inter-correlations have been reported between trait-BAS scales of the Carver-White BIS/BAS scales (Strobel et al., 2001), the RST-PQ (Pugnaghi et al., 2018), and the rRST-Q (Reuter et al., 2015). Inter-correlations between trait-BIS and trait-BAS scales were often significantly negative or non-significant. Therefore, we presume evidence of convergent validity among trait-BIS and among trait-BAS scales, respectively. In contrast, we presume evidence of discriminant validity between trait-BIS and trait-BAS scales.
Cognitive demand has been discussed in the context of the conflict-monitoring-theory (Botvinick, 2007) as a determinant that enhances conflict monitoring. Cognitive demand of the CMQ-44 measures the tendency to respond to situational or experimental requirements of higher cognitive demand by an intensification of conflict monitoring, performance monitoring and subsequently cognitive control (Leue et al., 2012, 2014). In this line, we predict that cognitive demand of the CMQ-44 correlates positively with other personality scales of (r)RST questionnaires that aim to measure trait-BIS (i.e., the tendency to detect and control for conflict information). As self-reports of higher cognitive demand should be related to cautious behavior, we predict negative correlations with behavioral approach tendencies measured with the trait-BAS scales. According to rRST (Gray and McNaughton, 2000; Corr, 2008), individuals with higher trait-BIS scores anticipate negative consequences of errors and, therefore, invest in a more intense stimulus monitoring compared to individuals with lower trait-BIS scores and individuals with higher compared to lower reasoning ability (Leue et al., 2014). Therefore, we presume that the CMQ-44 anticipation of negative consequences is positively correlated with (r)RST trait-BIS scales. In contrast, we predict no substantial or negative correlations with (r)RST trait-BAS scales because we expect the anticipation of negative consequences scale to be rather related to stimulus monitoring and evaluation than to reward-related approach behavior as measured by means of trait-BAS scales.
The CMQ-44 response adaptation scale is thought to be positively related to previous (r)RST-related trait-BIS scales because CMQ-44 response adaptation can serve to inhibit behavior especially when response adaptation occurs in a less adaptive manner. When CMQ-44 response adaptation is performed in a reactive, more flexible manner, higher scores of the CMQ-44 response adaptation scale could also correlate positively with trait-BAS scales and negatively with previous trait-(r)BIS scales. If the CMQ-44 response adaptation correlates negatively with previous trait-(r)BIS scales this would indicate that response adaptation extends previous trait-BIS scales by measuring a more flexible, reactive adaptation of responses instead of a fixed, proactive adaptation of responses (Braver, 2012).
CMQ-44 uncertainty of reinforcement measures the tendency to be sensitive for situations that are ambiguously reinforcing because situations are rewarding as well as punishing or they are so complex for the decision process that the reinforcing value cannot be defined. CMQ-44 uncertainty of reinforcement should enhance behavioral inhibition tendencies and, therefore, correlates positively with (r)RST-related trait-BIS scales. Moreover, higher scores of CMQ-44 uncertainty of reinforcement should reduce behavioral approach tendencies and, therefore, -if at all- is negatively related to (r)RST trait-BAS scales. Presuming that people take the risk of errors into account, higher scores of CMQ-44 uncertainty of reinforcement could be positively related to approach behavior as measured with trait-BAS scales. The predictions of the CMQ-44 subscales with previous (r)RST-related trait-BIS and trait-BAS scales are summarized in Table 2 and address research question 4.
Parceling issues in (r)RST questionnaires
Factorial validity of the (r)RST questionnaires – except the CMQ-44 – have been mainly investigated on the item-level. Parceling items (Humphreys, 1962) has been mainly applied for items constructed based on facet theory (Liepmann et al., 2007; Beauducel and Kersting, 2010; Beauducel et al., 2010; Leue and Beauducel, 2021). Parceling items allows us to capture systematic item content prior to the investigation of factorial validity by means of CFA models and prior to the calculation of unit-weighted sum scales or factor scores. Due to a rational, theory-driven item construction, researchers define a priori which items are thought to measure the latent constructs or at least parts of those constructs (Süß and Beauducel, 2005). In this respect, parceling items has the advantage that items can be systematized based on their a priori defined item content and tested for their model fit with the conceptually intended scales. Beyond these advantages, item parceling has been an issue of psychometric critics (Marsh et al., 2013). Marsh et al. (2013) argue that parceling items is “never appropriate a priori” (p. 258) because item misspecifications are ignored otherwise. The argument probably neglects the fact that items for the measurement of personality traits and intelligence have been sometimes constructed in an inductive, empirical manner rather than in a theory-driven approach as recommended by means of facet theory (Guttman and Greenbaum, 1998). Thus, item parceling for items constructed based on theoretical predictions as (r)RST and facet theory particularly summarizes those items that have been conceptualized a priori to belong to a certain construct or a sub-facet of a construct. In this respect, item parceling prior to the investigation of the model fit in CFA models can be conceived as a necessary pre-processing step – not as a prevention of item misspecifications. Moreover, results of CFA models indicate that –despite item parceling– not all theory-driven CFA models show a sufficient or very good model fit (see Results section). To investigate construct validity in a test-fair manner for all (r)RST questionnaires in this study, items of all (r)RST questionnaires have been parceled with regard to their a priori defined construct content (Sterba and Rights, 2017). This procedure ensures that theory-driven item development and item development based on facet theory can be tested for comparable and test-fair item units. If item parceling would not be applied in a comparable way to all (r)RST questionnaires, construct validity of the (r)RST items would have been compared on different construct levels. Performing the CFA models for all (r)RST questionnaires allows us to save factor scores of all latent factors. The factor scores for the Carver-White BIS/BAS scales, the RST-PQ, and the rRST-Q were applied to investigate the inter-correlations with the factor scores of the CMQ-44 subscales. Otherwise, inter-correlations that would have been calculated for unit-weighted sum scales in some (r)RST questionnaire and factor scores in the CMQ-44 might underestimate or overestimate inter-correlations because of scaling issues.
Aims and research questions
Based on prior findings we investigate the following research questions. (1) Are psychometric properties for the (r)RST-related questionnaires comparable to prior findings? (2) Can factorial validity of the four German (r)RST questionnaires (i.e., Carver-White BIS/BAS scales, RST-PQ, RST-Q, and CMQ-44) be confirmed based on the parcel level? (3) Are (r)RST latent factors equivalent across gender? (4) Do the factor scores for the best fitting CFA models provide evidence of a priori predicted convergent and discriminant validity (Table 2)?
Materials and methods
Sample
A total of N = 1,127 participants took part in a Unipark survey that was performed in collaboration with Respondi AG1 between November 2020 and January 2021 (n = 88 were assessed via Unipark by a research assistant in the team of the first author). The psychometric survey along with the research questions was approved in September 2020 by the Ethics committee of the Medical Faculty at the University of Kiel, Germany. Hypotheses in Table 2 were not pre-registered but formulated a priori (i.e., prior to data collection). We recruited participants in three examination intervals via Respondi AG. The first examination was performed between 24-November-2020 and 13-December-2020 with n = 44 participants in a pre-test and n = 471 participants in the main test. The second examination interval included n = 2 participants in the pre-test and n = 523 in the main test lasting from 13-January-2021 until 25-January-2021. The third Unipark assessment started on 17-December-2020 and ended on 20-February-2021 with n = 5 participants in a pre-test and n = 83 participants who took part in the main test (n = 88 see above). Overall, of the N = 1,128 participants we excluded n = 51 pre-test participants because pre-tests included slight changes in the Unipark programming. One participant younger than 18 years was excluded.
The final sample comprised N = 1,076 participants aged between 18 and 66 years (M = 38.38 years, SD = 12.93) for statistical analysis. We planned a widely equal recruitment of four age groups between 18 and 28 years (n = 317), 29 and 39 years (n = 250), 40 and 50 years (n = 286), and 51 and 66 years (n = 223). A total of n = 514 female and n = 559 male participants took part in this study (for gender proportions in Germany2). Three participants classified their gender as diverse. Participants received a reimbursement credit via Respondi AG redirects of about 5 €. The study plan was pre-registered in Hogrefe Verlag, Germany (proposal sent in April 2020 and discussed with a member of the Hogrefe Verlag in the beginning of May 2020). Data acquisition was funded by the University of Kiel, Germany.
Inventories
Participants were asked to answer demographic variables (e.g., federal state, age, gender, school grade, profession, income per month). Afterward participants answered the items of four questionnaires in German language in the following sequence: (1) Conflict monitoring questionnaire CMQ-44 (Leue and Beauducel, 2021), (2) Reinforcement Sensitivity Theory – Personality Questionnaire (RST-PQ, Pugnaghi et al., 2018), (3) BIS/BAS scales (Strobel et al., 2001), and (4) Reuter-Montag’s rRST questionnaire (rRST-Q, Reuter et al., 2015). For item examples the publications cited in (1) to (4) should be consulted.
The CMQ originally includes 60 items with a 6-point Likert response scale: 1 = trifft überhaupt nicht zu (does not correspond at all), 2 = trifft überwiegend nicht zu (does mainly not correspond), 3 = trifft eher nicht zu (does rather not correspond), 4 = trifft eher zu (does rather correspond), 5 = trifft überwiegend zu (does mainly correspond), 6 = trifft vollständig zu (does completely correspond). The CMQ incorporates four latent factors named as structs in terms of facet theory (Table 1). Two latent factors describe determinants of conflict monitoring: cognitive demand and anticipation of negative consequences. Two further latent factors differentiate behavioral consequences of conflict monitoring and are entitled as response adaptation and uncertainty of reinforcement. Higher self-reported cognitive demand and anticipation of negative consequences are thought to be related to more intense conflict monitoring. Higher self-reported response adaptation and experience of uncertainty of reinforcement are thought to result from more intense conflict monitoring (for factor meanings see Leue and Beauducel, 2021, section 3.3 “Quality assessment”). Of the 60 items, the shorter versions CMQ-44 and CMQ-28 comprise Cronbach’s Alpha coefficients between 0.72 and 0.89 and revealed sufficient to very good psychometric properties (Leue and Beauducel, 2021, their Table 6).
The German version of the RST-PQ (Pugnaghi et al., 2018) includes 65 items with four response categories of a 4-point Likert type with 1 = überhaupt nicht (not at all), 2 = etwas (slightly), 3 = mäßig (moderately), 4 = sehr (highly) but varying verbal coding compared to Strobel et al. (2001) and Reuter et al. (2015). The BAS scale differentiates four subscales entitled BAS – reward interest, BAS – goal drive persistence, BAS – reward reactivity, BAS – impulsivity. The BIS scale incorporates four subscales named as BIS – cautious risk assessment, BIS – motor planning interruption, BIS – behavioral disengagement, and BIS – obsessive thoughts. The FFFS scale includes Flight, Active Avoidance, and Freezing. All personality scales and sub-scales revealed a Cronbach’s Alpha between 0.67 and 0.91 (Pugnaghi et al., 2018, their Table 1). The BAS subscales in Strobel et al. (2001) are entitled reward responsiveness, fun seeking and drive. Thus, whereas Reuter et al. (2015) and Pugnaghi et al. (2018) disentangled the FFFS subscales, Strobel et al. (2001) and Pugnaghi et al. (2018) differentiated the BAS subscales. The German BIS/BAS scales as a translation of the English BIS/BAS scales (Carver and White, 1994) consist of 24 items (four dummy items are not included into statistical analysis). Cronbach’s Alpha has been reported for the Carver-White BIS/BAS scales with 0.67–0.81 (Strobel et al., 2001, Table 3). The rRST-Q incorporates 31 items with a Cronbach’s Alpha reliability ranging between 0.75 and 0.78 (Reuter et al., 2015, Table 4). The rRST-Q measures trait-BIS, trait-BAS and trait-FFFS (including Fight, Flight and Freezing behavior). Both the BIS/BAS scales and the rRST-Q apply a 4-point Likert-type response format with 1 = trifft für mich gar nicht zu (I strongly disagree), 2 = trifft für mich eher nicht zu (I disagree), 3 = trifft für mich eher zu (I rather agree), and 4 = trifft für mich genau zu (I strongly agree).
Procedure
At the start of the Unipark link, participants were informed about the study, duration per examination (about 30–40 min), and contact persons who were prepared to answer questions on the study. Participants were instructed to answer the items in a well-lit, quiet room with no disturbance during item answering and no participation of others. When participants gave written informed consent, they obtained demographic and questionnaire items for answering. Respondi AG handled the recruitment and reimbursement (mingle points which could be converted in the Respondi AG portal) of most participants (except n = 88 who were recruited and reimbursed at the University of Kiel, Germany, in the team of the first author).
Statistical analysis
Statistical analysis was performed using IBM SPSS statistics version 26 and Mplus version 8.3 (Muthén and Muthén, (1998–2017)). Preprocessing of data included the investigation of missing values and normal distribution. There were no missing values because participants answered all items. By using SPSS 26, we performed Mardia’s test of multivariate kurtosis to test for multivariate normal distribution (DeCarlo, 1997). The Mardia’s test was performed on parcel level. Parcels were performed based on an a priori questionnaire construction, i.e., items belonging to the same item content were grouped into parcels (Little et al., 2002). That means by reading the published items parceling was performed. We performed sum scores to establish the item parcels. Supplementary Table 1 provides an overview of the items per parcel. Each item was applied once for computing a parcel to hold the criterion of a theory-related item-to-parcel allocation (Sterba and Rights, 2017). Mardia’s test was significant for all (r)RST questionnaires included in this study (Supplementary Table 1) suggesting that the multivariate normal distribution was not given. Therefore, we applied a maximum likelihood estimator with robust standard errors entitled as MLR in our CFA models (Luo, 2018). Mardia’s test was preferred over Q–Q plots because Mardia’s test allows for a statistical instead of a graphical evaluation of multivariate normal distribution.
We report model fit (Hu and Bentler, 1999; Beauducel and Wittmann, 2005) for the following indices: Comparative Fit Index (CFI), Root Mean Square Error of Approximation (RMSEA), Standardized Root Mean Square Residual (SRMR). Results showing a CFI of better than 0.90 (Beauducel and Wittmann, 2005) and 0.95 (Hu and Bentler, 1999) were evaluated as very good. We decided to evaluate a range for the CFI because the thresholds for the description of the CFI differ with regard to software, tested factor loading thresholds and number of factors in a CFA model (Hu and Bentler, 1999; Beauducel and Wittmann, 2005). A RMSEA of ≤0.06 and a SRMR of ≤0.04 (Beauducel and Wittmann, 2005, their Table 3) was evaluated as a very good model fit. Construct validation in this study incorporates the investigation of the factorial validity of the four questionnaires on parcel-level by means of CFA and inter-correlations of factor scores. We report factor loadings and MIMIC findings for gender of the STDYX matrix. Spearman Rank correlations were calculated to report findings on convergent and discriminant validity for the factor scores. Partial correlations are reported to indicate effects of gender on the convergent and discriminant results. As all inventories are part of an on-going construct validation process original data, code books, or program code will be made available in PsyArxiv upon request to the first author and depending on further validation studies: https://osf.io/9vu8e/?view_only=9655b511443c4c5e95f9393fcb15622c.
Results
Psychometric and descriptive data (research question 1)
In a sample of N = 1,076 participants we observed the following psychometric properties for trait-BIS and trait-BAS scales, and FFFS scales (Table 3). Excellent reliabilities (≥0.90, given in bold in Table 3) were rare for the (r)RST questionnaires (Table 3 and Supplementary Table 2). Most reliabilities were moderate (0.80–0.90) and are given in italics in accordance with George and Mallery (2020). Reliabilities were comparable or even higher in the present study (Table 3 and Supplementary Table 2) compared to previous studies (Strobel et al., 2001; Reuter et al., 2015; Pugnaghi et al., 2018; Leue and Beauducel, 2021). Thus, reliabilities suggest moderate to high data quality for the present online data. Supplementary Table 2 summarizes descriptive statistics of all (r)RST questionnaires.
Factorial validity of (r)RST trait-BIS-related, trait-FFFS-related and trait-BAS-related subscales (research question 2)
Table 4 summarizes all performed CFA MIMIC models. Item parcels were performed based on the ascending item number for each latent, theory-driven content factor (i.e., for BIS and BAS items, see Supplementary Table 2) of the RST-PQ, the rRST-Q, and the Carver-White BIS/BAS scales. Briefly, our results show that a very good model fit occurred for the four-factors Carver-White-BIS/BAS model and for the 4-factors RST-PQ model in terms of CFI and RMSEA (Table 4, model fit indices marked in bold). For the two-factors rRST-Q, and for the CMQ-44 or 28 bifactor models the model fit was very good in terms of CFI, RMSEA and SRMR (Table 4, model fit indices marked in bold).
All three two-factor models of the Carver-White BIS/BAS scales, the RST-PQ, and of the rRST-Q are primary-order factor models (Table 4). The two-factors Carver-White trait-BIS/trait-BAS model including two trait-BIS parcels with three to four items per parcel and three trait-BAS parcels with four to five items per parcel (Supplementary Table 1) showed a very good model fit for the CFI and a moderate model fit for the RMSEA and SRMR. In accordance with Strobel et al. (2001), the inter-correlation between trait-BIS and trait-BAS was set to 0.17 to perform the MIMIC models (Table 4).
The two-factor RST-PQ model for trait-BIS and trait-BAS including parcels of three to four items (Supplementary Table 1) suggest again a very good model fit for the CFI and a moderate model fit for the RMSEA and the SRMR (Table 4). In accordance with Pugnaghi et al. (2018, their Table 1), prior to the calculation of the MIMIC model we set the inter-correlation between trait-BIS and trait-BAS to –0.02 which corresponds to the mean inter-correlations between trait-BIS and the trait-BAS subscales. For the three-factor model of the RST-PQ the model fit was poor. The mean inter-correlation between trait-BAS and trait-FFFS was 0.08. Trait-BIS and trait-FFFS were set to an inter-correlation of 0.46 (see Pugnaghi et al., 2018, their Table 1).
The two-factor rRST-Q trait-BIS and trait-BAS model showed a very good model fit in terms of CFI, RMSEA, and SRMR based on parcel-level (Table 4) with three to four items per parcel (Supplementary Table 1). The model fit of the three-factor model including trait-BIS, trait-BAS, and trait-FFFS was poor (Table 4). In accordance with Reuter et al. (2015, their Table 6) we set the inter-correlation between trait-BIS and trait-BAS to –0.29, the inter-correlations between trait-BIS and trait-FFFS to 0.45 and to –0.41 for trait-BAS and trait-FFFS prior to the calculation of the MIMIC model. Figure 1 summarizes the standardized factor loadings (STDYX) on parcel level for the two-factor models of the Carver-White BIS/BAS questionnaire, the RST-PQ and the rRST-Q.
Figure 1. Standardized factor loadings (STDYX) of two-factor models for the Carver-White (CW) BIS/BAS questionnaire (A), RST-PQ (B), and Reuter-Montag (RM) rRST-Q (C). p1–p8, parcel 1–parcel 8. All p-values p < 0.01, two-tailed.
As trait-BIS and trait-BAS subfactors have been reported for the Carver-White BIS/BAS questionnaire and for the RST-PQ, we additionally performed primary factor MIMIC CFAs for more than two latent factors (Table 4 and Figure 2A). For the Carver-White BIS/BAS questionnaire, we performed a model including trait-BIS and three latent factors entitled trait-BAS – reward responsiveness, trait-BAS – fun seeking, and trait-BAS – drive. Item parcels were performed based on item content (i.e., items that were thought to belong to the respective sub-scale). Item parcels for the trait-BAS subscales incorporate two to three items of a conceptually comparable content per parcel. Each BAS subfactor comprised two parcels. The four-factor model with trait-BIS and three trait-BAS subscales of the Carver-White BIS/BAS scales fitted the data well in terms of CFI and RMSEA (Table 4).
Figure 2. Standardized factor loadings (STDYX) of four-factor models for the Carver-White BIS/BAS questionnaire including one BIS and three BAS subfactors (A) and for the RST_PQ including four BIS and four BAS subfactors (B). BD, behavioral disengagement; CRA, cautious risk assessment; D, drive; GDP, goal-drive persistence; FS, fun seeking; IMP, impulsivity; MPI, motor planning interruption; OT, obsessive thoughts; RI, reward interest; RR, reward reactivity; CW-trait-BAS-RR, reward responsiveness. All p-values p < 0.01, two-tailed.
For the RST-PQ we performed a 4-factor model including four trait-BIS subfactors and four trait-BAS subfactors (Table 4 and Figure 2B). The trait-BIS subscales are entitled as Motor planning interruption, Cautious risk assessment, Obsessive thoughts, and Behavioral disengagement. Item parcels of the RST-PQ trait-BIS subfactors incorporate two to four items per parcel. The trait-BAS subfactors for the RST-PQ are named as Reward interest, Goal-drive persistence, Reward reactivity, and Impulsivity. Again, item parcels of the RST-PQ trait-BAS subfactors comprise three to four items per parcel. The four-subfactors model for trait-BIS and the four-subfactors model for trait-BAS fitted the data very well in terms of CFI and RMSEA (Table 4). The inter-correlations between the four trait-BAS factors were chosen as reported in Pugnaghi et al. (2018). No inter-correlations were reported for the four trait-BAS factors in Pugnaghi et al. (2018).
Regarding the model fit of the CMQ-44, we performed a four factors primary-order MIMIC model including cognitive demand, anticipation of negative consequences, response adaptation, and uncertainty of reinforcement. The model did not fit the data well (Table 4). In contrast, a bifactor MIMIC factor model of the CMQ-44 showed a very good model fit (Table 4) as recently reported in Leue and Beauducel (2021). The standardized factor loadings for the bifactor MIMIC factor model were – except a few loadings – significant (Table 5 and Supplementary Figure 1). For CMQ-28 (Supplementary Figure 2 and Supplementary Table 4), the model fit results were pretty comparable to the CMQ-44 for the CFI, RMSEA, and SRMR. For the CMQ-44 and for the CMQ-28 the latent factors were presumed to be orthogonal (i.e., no factor inter-correlations were specified for the MIMIC models).
Table 5. Standardized factor loadings (STDYX) of the Bifactor MIMIC model of the CMQ-44 (N = 1,076).
Measurement equivalence across gender (research question 3)
Effects of measurement equivalence across gender have not been predicted a priori as directed hypotheses in this study. This is due to the fact that gender effects in Strobel et al. (2001), Reuter et al. (2015), and Leue and Beauducel (2021) (section “Characteristics of included studies”) were calculated based on quite different statistical methods (ANOVA, MIMIC models). Significant gender differences were observed for the trait-BIS factor (β = 0.36, p < 0.01) and for trait-BAS in the Carver-White BIS/BAS two factor model (β = 0.18, p < 0.01) with women (n = 514) and individuals who classified themselves as diverse (n = 3) showing higher trait-BIS and higher trait-BAS values than men (n = 559). In the Carver-White BIS/BAS four factor model, female and diverse individuals showed higher trait-BIS, trait-BAS-reward reactivity and trait-BAS-drive values than men (for all three latent factors: β = 0.36, 0.13, 0.07, ps < 0.05).
For the RST-PQ two-factor model, women revealed higher trait-BIS values (β = 0.20, p < 0.01) and higher trait-BAS values (β = 0.07, p < 0.05) than men. Regarding the RST-PQ four-factor models for trait-BIS and trait-BAS, respectively, women and individuals who classified themselves as diverse scored higher than men for all four trait-BIS subscales (β = 0.17 to 0.25, ps < 0.001). For the trait-BAS subfactors exclusively Goal-drive persistence and Reward reactivity were higher for female and diverse participants compared to men (both latent factors: β = 0.10, ps < 0.01).
For the rRST-Q, women and individuals who classified themselves as diverse indicated higher trait-BIS values than men (β = 0.18, p < 0.01), whereas no gender differences were observed for trait-BAS (β = 0.01, p = 0.76). Finally, for the CMQ-44, women and participants who classified themselves as diverse reported higher Cognitive demand values than men (β = 0.14, p < 0.01). Men showed higher CMQ-44 anticipation of negative consequences values than women and individuals who classified themselves as diverse (β = –0.13, p < 0.05). Response adaptation was higher in female and diverse participants compared to male participants (β = 0.08, p < 0.05). Moreover, Performance monitoring (G) was higher in male individuals compared to female and diverse participants (β = –0.24, p < 0.01). No significant gender differences were observed for CMQ-44 uncertainty of reinforcement (β = 0.04, p = 0.27). For the CMQ-28, gender differences were not robust compared with CMQ-44 (Cognitive demand: β = –0.02, p = 0.78; Anticipation of negative consequences: β = 0.07, p = 0.13; Response adaptation: β = 0.04, p = 0.37). In contrast to the CMQ-44, CMQ-28 uncertainty of reinforcement indicated higher factor scores for female and diverse participants compared to male participants (β = 0.12, p < 0.05). Comparable to the CMQ-44, CMQ-28 performance monitoring revealed higher path coefficient for men compared to women and diverse individuals (β = –0.23, p < 0.01).
Convergent and discriminant validity along with evidence of robustness (research question 4)
We investigated evidence of convergent and discriminant validity of the bifactor CMQ-44 model based on factor scores for those CFA models of the (r)RST questionnaires that indicated the best model fit in terms of two or even three model fit indices (see Table 4). We indicate correlational results that correspond with our hypotheses (Tables 6, 7) in bold.
Cognitive demand correlated positively exclusively with the Carver-White BIS factor score (Table 6). Contrary to prediction (Table 2), CMQ-44 anticipation of negative consequences, CMQ-44 response adaptation, CMQ-44 uncertainty of reinforcement, and CMQ-44 performance monitoring correlate negatively with the factor scores of the other (r)RST questionnaires indicating that these CMQ-44 factors measure different contents of trait-BIS that are not represented in the other (r)RST trait-BIS factors (Table 6). The negative inter-correlations of CMQ-44-anticipation of negative consequences, CMQ-44 response adaptation, CMQ-44 uncertainty of reinforcement, CMQ-44 performance monitoring indicate that higher CMQ-44 factor scores go along with lower (r)RST factor scores of the Carver-White BIS, the RST-PQ BIS and the rRST-Q BIS factor scores. That is, the CMQ-44 factors (except cognitive demand) do not just measure preparations of behavioral inhibition. CMQ-44 anticipation of negative consequences (ANC), response adaptation (RA), uncertainty of reinforcement (UR), and performance monitoring (G) rather measure cognitive-motivational weigh-offs prior to behavioral withdrawal. Thus, the three primary-order factors (ANC, RA, UR) and G of the CMQ-44 provide psychometric measures that are promising for investigating information processing steps before checking mode switches to control mode of the BIS and behavioral withdrawal related to Flight or Freezing (Corr, 2008). The Spearman rank correlations were performed to account for non-normality of the data. Spearman Rank correlations (Table 6) hold even when we performed partial correlations controlling for gender.
We observed positive and mainly significant inter-correlations of CMQ-44 cognitive demand with the factor scores of the other trait-BAS factors revealing that CMQ-44 cognitive demand facilitates reward-related behavior (Kool et al., 2011). Negative and mainly significant Spearman rank correlations occurred for CMQ-44 anticipation of negative consequences and CMQ-44 response adaptation indicating evidence of discriminant validity. These correlations indicate that CMQ-44 ANC and RA are not identical to BAS-related behavioral approach. CMQ-44 uncertainty of reinforcement and CMQ-44 performance monitoring appeared to correlate positively and significantly with the factor scores of the other trait-BAS factors. In this respect, it is noteworthy that CMQ-44 UR is contrary to impulsive BAS-related behavior but evokes approach behavior as does CMQ-44 performance monitoring (G). It can be supposed that BAS-oriented approach tendencies of CMQ-44 UR and G might be due to preparations from checking to control model of the BIS. The Spearman Rank correlations hold when controlled for gender in partial correlations.
Discussion
The present study investigated psychometric properties (research question 1), evidence of factorial validity of (r)RST questionnaires (research question 2), effects of measurement equivalence for the latent (r)RST factors (research question 3), evidence of convergent and discriminant validity (research question 4). Our data reveal comparable and convincing evidence for the psychometric properties of the (r)RST questionnaires. Factorial validity has been confirmed for all (r)RST questionnaires. Best model fits have been observed for the four factor models of the Carver-White BIS/BAS scales (i.e., trait-BIS and three trait-BAS subscales), the four trait-BIS and four trait-BAS factors of the RST-PQ, the two-factor model of the RST-Q and for the CMQ-44 as well as CMQ-28 bifactor models. Gender effects were found for all inserted (r)RST questionnaires limiting measurement equivalence of the latent factors. Convergent validity for CMQ-44 cognitive demand has been exclusively observed with the Carver-White trait-BIS scales. Overall, the other CMQ-44 factors (anticipation of negative consequences, response adaptation, uncertainty of negative reinforcement, performance monitoring) rather extend the previous trait-BIS and trait-BAS space.
Correlating positively with most of the previous trait-BAS factors, the CMQ-44 cognitive demand factor appeared to be a BAS-facilitating factor (Kool et al., 2011). A similar effect occurred for RST-PQ trait-BIS subscales with RST-PQ trait-BAS Impulsivity and Carver-White BAS-Reward Responsiveness (Supplementary Table 7). While anticipation of negative consequences and response adaptation revealed evidence of discriminant validity with previous trait-BAS factors, uncertainty of negative consequences and performance monitoring extend the trait-BAS space by means of mainly significant and positive inter-correlations with previous trait-BAS factors. Higher factor scores of response adaptation of the CMQ-44 can be rather interpreted as a reactive, more flexible manner to adapt behavior (Braver, 2012; Botvinick and Braver, 2015). The small and mainly negative inter-correlations of response adaptation with the previous trait-BAS factors reveal that response adaptation is not an impulsive, spontaneous behavioral tendency. It is worth noting that individuals with more intense performance monitoring show more BAS-related behavior. Moreover, participants of the present study reported higher BAS-related approach tendencies in the previous (r)RST questionnaires even when they reported about situations with more uncertainty of reinforcement. Our data suggest evidence of convergent and discriminant validity although all included questionnaires belong to the same personality theory. The present study illustrates that personality scales in the context of (r)RST establish a nomological network. Among this nomological network the (r)RST-related personality scales operationalize different more or less overlapping parts of the trait-BIS, trait-BAS and trait-FFFS continuum. These conceptual similarities and dissimilarities between (r)RST personality scales can be documented in terms of correlations (see Tables 6, 7) and might be extended by second-order factor analyses. That is, it is a strength of the present study to include those personality scales that establish the psychometric framework of more than 20 years of psychometric (r)RST research starting with the German version of the BIS/BAS scales in 2001 (Strobel et al., 2001) and continuing to the CMQ-44 published in 2021 (Leue and Beauducel, 2021). The present study illustrates for the first time a nomological network of (r)RST questionnaires which extends the quite rare examples of nomological nets given except for the Five-factor model in the field of personality research (Ziegler et al., 2013). The fact that different researchers (Carver and White, 1994; Strobel et al., 2001; Reuter et al., 2015; Corr and Cooper, 2016; Pugnaghi et al., 2018; Leue and Beauducel, 2021) could develop independently different personality questionnaires that are suitable to comprise predictions on trait-BIS, trait-BAS, and trait-FFFS indicates in an impressive manner that (r)RST has been developed to a substantiative personality theory with extensive psychometric and neuroscientific perspectives (Gray, 1970, 1987; Gray and McNaughton, 2000).
Experimental studies investigating neural activations (e.g., frontal stimulus-locked N2 component and response-locked error-related negativity component, ERN/Ne) will elucidate the contextual foundations and individual differences of the CMQ-44 factors to further our understanding on changes between checking and control mode of the BIS (Corr, 2008). For the CMQ-44, especially anticipation of negative consequences and performance monitoring were higher in men than women. As in Leue and Beauducel (2021), CMQ-44 self-reported cognitive demand was higher in female than male participants. Overall, our psychometric data suggest that gender effects at least partly modulate individual differences of BIS/BAS scores.
Limitations and future directions
The present data motivate further research on emic and etic issues of (r)RST questionnaires in English-speaking samples. As (r)RST questionnaires have been applied in clinical samples (Farrell and Walker, 2019), forensic samples (Leue et al., 2008; Donahu and Caraballo, 2015), and in work settings (Corr et al., 2017), it would be of interest to investigate predictions of the newly validated CMQ-44 and previous (r)RST questionnaires in forensic and clinical settings. In terms of test fairness, future research might address further evidence of measurement equivalence (e.g., for age groups). To further elucidate the nomological network we should investigate the CMQ-44 factors in relation to the five-factor model, perfectionism and with regard to intelligence (Borkenau and Ostendorf, 2008; Beauducel et al., 2010; Stoeber and Corr, 2015). To elucidate the neuroscientific basis of reward-facilitating investment of cognitive demand, individual differences of CMQ-44 cognitive demand and performance monitoring should be experimentally assessed in a study measuring event-related potentials like N2, error-related negativity (ERN/Ne), and feedback negativity (FN). The items of the included questionnaires were not developed based on psychopharmacological predictors. Future research might investigate which of those items are sufficient for correlations with psychopharmacological predictors. For examples of item developments based on psychopharmacological predictions see West and Ussher (2010). Overall, based on the scale definitions presented in Table 1 and CFA evidence we argue in favor of holding all (r)RST questionnaires for future research. When researchers wish to investigate FFFS-related trait-variations RST-PQ and RST-Q (Reuter et al., 2015; Corr, 2016; Pugnaghi et al., 2018) are recommended. When determinants and behavioral consequences of conflict monitoring are the research focus CMQ-44 and CMQ-28 are promising (Leue and Beauducel, 2021). To disentangle trait-BIS or trait-BAS responses the BIS/BAS scales, the RST-PQ and the CMQ-cognitive demand scale are sufficient psychometric candidates. The Carver-White BIS/BAS scales are required to psychometrically compare results on BAS subscales (Reuter et al., 2015) and individual differences on the N2 (Leue et al., 2012, 2014, 2020). Future research on (r)RST questionnaires might also investigate other statistical models like multiple group CFAs to compare the (r)RST personality scales for configural, metric and scalar invariances and group factors like gender (Seib-Pfeifer et al., 2017; Hein et al., 2021). In the present study, we used a construct-related parceling algorithm because all included questionnaires comprise theoretically well-defined latent constructs (Sterba and Rights, 2017). “Alternative parcel allocations” might be performed such as random item-to-parcel allocations (Sterba and Rights, 2017). Those alternative allocations have not been tested in this study. A more detailed argumentation on pros and cons of item-to-parcel allocations is given in the cited studies (Matsunaga, 2008; Little et al., 2013; Marsh et al., 2013). Further research is warranted to elucidate the trait-neurotransmitter relationship especially for the newly published (r)RST questionnaires (for an example see Reuter et al., 2006). In this respect alternative item-to-parcel allocations could be tested in order to disentangle whether a theory-related item-to-parcel allocation results in more precise trait-neurotransmitter relations than more random item-to-parcel allocations which might increase parcel-allocation variability (Sterba and Rights, 2017). Moreover, studies on the trait-neurotransmitter relation would further our knowledge on the multi-trait-multi-method matrix of (r)RST-related personality questionnaires as would be other studies combining psychometric data and data from the field of personality neuroscience (DeYoung, 2010; Leue, 2015; Asendorpf et al., 2016).
Conclusion
The present data suggest that the Carver-White BIS/BAS scales, the RST-PQ, the rRST-Q and the CMQ-44/28 are promising personality questionnaires for the (r)RST trait space with sufficient psychometric properties. Confirmatory factor models have been mainly confirmed for trait-BIS and trait-BAS. Gender effects matter for the assessment of trait-BIS and trait-BAS. We provide evidence that factor scores are a promising tool compared to unit-weighted sum scales to elucidate convergent and discriminant validity of the trait-BIS and trait-BAS factors. These data are promising to investigate changes of the BIS from checking to control (Corr, 2008), to predict individual differences of reactive and proactive cognitive control (Braver, 2012; Botvinick and Braver, 2015), and to investigate conflict monitoring and the affect signaling hypothesis (Dignath et al., 2020).
Data availability statement
As all inventories are part of an on-going construct validation process original data, code books, or program code will be made available in PsyArxiv upon request to the first author and depending on further validation studies: https://osf.io/9vu8e/?view_only=9655b511443c4c5e95f9393fcb15622c.
Ethics statement
The studies involving human participants were reviewed and approved by Ethics Committee of the Medical Faculty, University of Kiel, Germany. The participants provided their written informed consent to participate in this study.
Author contributions
AL: conceptualization, data curation and collection with team support and data processing, analysis, and wrote the manuscript. UE, MR, and PC: co-authors of the previous questionnaires for measuring the personality traits of (revised) reinforcement sensitivity theory, read, and commented on the manuscript prior to submission.
Acknowledgments
We are grateful to M.Sc.-Psych. Clara Haufschild and Dipl.-Psych. Selin Geyik for their support in preparing the Unipark survey. We are grateful to Prof. Dr. André Beauducel and his co-authors Dr. Norbert Hilger and Dr. Christopher Harms for guiding us through their syntax when we calculated Hancock’s H (Beauducel et al., 2016). Moreover, we very much appreciate Prof. Dr. Alexander Strobel’s conceptual feedback to the manuscript.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fpsyg.2022.1026894/full#supplementary-material
Footnotes
References
Asendorpf, J. B., Beauducel, A., and Leue, A. (2016). “Persönlichkeit, neurowissenschaftliche ansätze [Personality, neuroscientific approaches],” in Dorsch–psychologisches wörterbuch, ed. M. A. Wirtz (Bern: Huber).
Beauducel, A., Harms, C., and Hilger, N. (2016). Reliability estimates for three factor score estimators. Int. J. Stat. Probab. 5, 1–14. doi: 10.5539/ijsp.v5n6p94
Beauducel, A., and Kersting, M. (2010). Start-P. Persönlichkeitstest mit berufsbezug für Jugendliche und junge erwachsene. Göttingen: Hogrefe.
Beauducel, A., Liepmann, D., Horn, S., and Brocke, B. (2010). Intelligence-structure-test English version of the intelligenz-struktur-test 2000 R (I-S-T 2000 R). Göttingen: Hogrefe.
Beauducel, A., and Wittmann, W. W. (2005). Simulation study on fit indexes in CFA based on data with slightly distorted simple structure. Struct. Equ. Model. 12, 41–75. doi: 10.1207/s15328007sem1201_3
Borkenau, P., and Ostendorf, F. (2008). Neo-fünf-faktoren-inventar nach costa und mccrae [Neo-five-factor-inventory]. Göttingen: Hogrefe.
Botvinick, M. M. (2007). Conflict monitoring and decision making: Reconciling two perspectives on anterior cingulate function. Cogn. Affect. Behav. Neurosci. 7, 356–366. doi: 10.3758/cabn.7.4.356
Botvinick, M. M., and Braver, T. (2015). Motivation and cognitive control: From behavior to neural mechanism. Annu. Rev. Psychol. 66, 83–113. doi: 10.1146/annurev-psych-010814-015044
Botvinick, M. M., and Rosen, Z. B. (2009). Anticipation of cognitive demand during decision-making. Psychol. Res. 73, 835–842. doi: 10.1007/s00426-008-0197-8
Braver, T. (2012). The variable nature of cognitive control: A dual-mechanisms framework. Trends Cogn. Sci. 16, 106–113. doi: 10.1016/j.tics.2011.12.010
Campbell, D. T., and Fiske, D. W. (1959). Convergent and discriminant validation by the multitrait-multimethod matrix. Psychol. Bull. 56, 81–105. doi: 10.1037/h0046016
Carver, C. S., and White, T. L. (1994). Behavioral inhibition, behavioral activation, and affective responses to impending reward and punishment: The BIS/BAS scales. J. Per. Soc. Psychol. 67, 319–333. doi: 10.1037/0022-3514.67.2.319
Corr, P. J. (2008). The reinforcement sensitivity theory. Cambridge: Cambridge University Press. doi: 10.1017/CBO9780511819384
Corr, P. J. (2016). Reinforcement sensitivity theory of personality questionnaires: Structural survey with recommendations. Pers. Individ. Differ. 89, 60–64. doi: 10.1016/j.paid.2015.09.045
Corr, P. J., and Cooper, A. J. (2016). The reinforcement sensitivity theory of personality questionnaire (RST-PQ): Development and validation. Psychol. Assess. 28, 1427–1440. doi: 10.1037/pas0000273
Corr, P. J., McNaughton, N., Wilson, M. R., Hutchison, A., Burch, G., and Poropat, A. (2017). “Neuroscience of motivation and organizational behavior: Putting the reinforcement sensitivity theory (RST) to work,” in Recent developments in neuroscience research on human motivation advances in motivation and achievement, Vol. 39, eds S. Kim, J. Reeve, and M. Bong (Bingley: Emerald Group Publishing Limited), 65–92. doi: 10.1108/S0749-742320160000019010
Cronbach, L. J., and Meehl, P. E. (1955). Construct validity in psychological tests. Psychol. Bull. 52, 281–302. doi: 10.1037/h0040957
DeCarlo, L. T. (1997). On the meaning and use of kurtosis. Psychol. Methods 2, 292–307. doi: 10.1037/1082-989X.2.3.292
DeYoung, C. G. (2010). Personality neuroscience and the biology of traits. Soc. Pers. Psychol. Compass 4, 1165–1180. doi: 10.1111/j.1751-9004.2010.00327.x
Dignath, D., Eder, A. B., Steinhauser, M., and Kiesel, A. (2020). Conflict monitoring and the affective-signaling hypothesis–an integrative review. Psychon. Bull. Rev. 27, 193–216. doi: 10.3758/s13423-019-01668-9
Donahu, J. J., and Caraballo, L. J. (2015). Examining the triarchic model of psychopathy using revised reinforcement sensitivity theory. Pers. Individ. Differ. 80, 125–130. doi: 10.1016/j.paid.2015.02.031
Eriksson, L. J. K., Jansson, B., and Sundin, Ö (2019). Psychometric properties of a Swedish version of the reinforcement sensitivity theory of personality questionnaire. Nord. Psychol. 7, 134–145. doi: 10.1080/19012276.2018.1516563
Farrell, N., and Walker, B. R. (2019). Reinforcement sensitivity theory and problem gambling in a general population sample. J. Gambl. Stud. 35, 1163–1175. doi: 10.1007/s10899-019-09850-3
George, D., and Mallery, P. (2020). IBM SPSS statistics 26 step by step: A simple guide and reference, 16th Edn. New York, NY: Routledge. doi: 10.4324/9780429056765
Gray, J. A. (1970). The psychophysiological basis of introversion-extraversion. Behav. Res. Ther. B 8, 249–266. doi: 10.1016/0005-7967(70)90069-0
Gray, J. A. (1987). “The neuropsychology of emotion and personality,” in Cognitive neurochemistry, eds S. M. Stahl, S. D. Iversen, and E. C. Godman (Oxford: Oxford University press).
Gray, J. A., and McNaughton, N. (2000). The neuropsychology of anxiety. Oxford: Oxford University Press.
Guttman, R., and Greenbaum, C. W. (1998). Facet theory: Its development and current status. Eur. Psychol. 3, 13–34. doi: 10.1027//1016-9040.3.1.13
Hackett, P. M. W. (2014). Facet theory and the mapping sentence: Evolving philosophy, use and application. Hampshire: Palgrave Macmillan. doi: 10.1057/9781137345929
Hein, F.-E., Scheuble, V., Beauducel, A., and Leue, A. (2021). Psychometric properties of a German online version of the Gudjonsson suggestibility scale 1. Front. Psychol. 12:718805. doi: 10.3389/fpsyg.2021.718805
Heym, N., Ferguson, E., and Lawrence, C. (2008). An evaluation of the relationship between Gray’s revised RST and Eysenck’s PEN: Distinguishing BIS and FFFS in Carver and White’s BIS/BAS scales. Pers. Individ. Differ. 45, 709–715. doi: 10.1016/j.paid.2008.07.013
Hu, L.-T., and Bentler, P. M. (1999). Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria vs. new alternatives. Struct. Equ. Model. 6, 1–55. doi: 10.1080/10705519909540118
Humphreys, L. (1962). The origanization of human abilities. Am. Psychol. 17, 475–483. doi: 10.1037/h0041550
Johnson, S. L., Turner, R. J., and Iwata, N. (2003). BIS/BAS levels and psychiatric disorder: An epidemiological study. J. Psychopathol. Behav. Assess. 25, 25–36. doi: 10.1023/A:1022247919288
Kool, W., Mcguire, J. T., Rosen, Z. B., and Botvinick, M. M. (2011). Decision making and the avoidance of cognitive demand. J. Exp. Psychol. Gen. 139, 665–682. doi: 10.1037/a0020198
Krupić, D., Corr, P. J., Ručević, S., Križanić, V., and Gračanin, A. (2016). Five reinforcement sensitivity theory (RST) of personality questionnaires: Comparison, validity and generalization. Pers. Individ. Differ. 97, 19–24. doi: 10.1016/j.paid.2016.03.012
Leue, A. (2015). Psychophysiologische konfliktkonzepte [Psychophysiological conflict concepts]. Aachen: Shaker.
Leue, A., and Beauducel, A. (2021). A facet theory approach for the psychometric measurement of conflict monitoring. Pers. Individ. Differ. 171:110479. doi: 10.1016/j.paid.2020.110479
Leue, A., Brocke, B., and Hoyer, J. (2008). Reinforcement sensitivity of sex offenders and non-offenders: An experimental and psychometric study of reinforcement sensitivity theory. Br. J. Psychol. 99, 361–378. doi: 10.1348/000712607X228519
Leue, A., Lange, S., and Beauducel, A. (2012). Modulation of the conflict monitoring intensity: The role of aversive reinforcement, cognitive demand, and trait-BIS. Cogn. Affect. Behav. Neurosci. 12, 287–307. doi: 10.3758/s13415-012-0086-x
Leue, A., Nieden, K., Scheuble, V., and Beauducel, A. (2020). Individual differences of conflict monitoring and feedback processing during reinforcement learning in a mock forensic context. Cogn. Affect. Behav. Neurosci. 20, 408–426. doi: 10.3758/s13415-020-00776-7
Leue, A., Weber, B., and Beauducel, A. (2014). How do working-memory-related demand, reasoning ability and aversive reinforcement modulate conflict monitoring? Front. Hum. Neurosci. 8:210. doi: 10.3389/fnhum.2014.00210
Levinson, C. A., Rodebaugh, T. L., and Frye, T. (2011). An examination of the factor, convergent, and discriminant validity of the behavioral inhibition system and behavioral activation system scales. J. Psychopathol. Behav. Assess. 33, 87–100. doi: 10.1007/s10862-010-9202-9
Liepmann, D., Beauducel, A., Brocke, B., and Amthauer, R. (2007). Intelligenz-struktur-test 2000 R Göttingen. Göttingen: Hogrefe.
Little, T. D., Cunningham, W. A., Shahar, G., and Widaman, K. F. (2002). To parcel or not to parcel: Exploring the question, weighing the merits. Struct. Equ. Model. 9, 151–173. doi: 10.1207/S15328007SEM0902_1
Little, T. D., Rhemtulla, M., Gibson, K., and Schoemann, A. M. (2013). Why the items versus parcels controversy needn’t be one. Psychol. Methods 18, 285–300. doi: 10.1037/a0033266
Luo, Y. (2018). A short note on estimating the testlet model with different estimators in Mplus. Educ. Psychol. Meas. 78, 517–529. doi: 10.1177/0013164417717314
Maack, D. J., and Ebesutani, C. (2018). A re-examination of the BIS/BAS scales: Evidence for BIS and BAS as unidimensional scales. Int. J. Methods Psychiatry Res. 27:e1612. doi: 10.1002/mpr.1612
Marsh, H. W., Lüdtke, O., Nagengast, B., and Morin, A. J. S. (2013). Why item parcels are (almost) never appropriate: Two wrongs do not make a right–camouflaging misspecification with item parcels in CFA models. Psychol. Methods 18, 257–284. doi: 10.1037/a0032773
Matsunaga, M. (2008). Item parceling in structural equation modeling: A primer. Commun. Methods Meas. 2, 260–293. doi: 10.1080/19312450802458935
Müller, A., Smits, D., Claes, C., and De Zwaan, N. (2013). Faktorenstruktur der deutschsprachigen version der BIS/BAS-Skalen in einer Bevölkerungsstichprobe [Factor stuctuer of the German version of the BIS/BAS scales in a population sample]. Fortschr. Neurol. Psychiatrie 81, 75–80. doi: 10.1055/s-0032-1330482
Pugnaghi, G., Cooper, A., Ettinger, U., and Corr, P. J. (2018). The psychometric properties of the German language reinforcement sensitivity theory-personality questionnaire (RST-PQ). J. Individ. Differ. 5, 182–190. doi: 10.1027/1614-0001/a000262
Reuter, M., Cooper, A. J., Smillie, L. D., Markett, S., and Montag, C. (2015). A new measure for the revised reinforcement sensitivity theory: Psychometric criteria and genetic validation. Front. Syst. Neurosci. 9:38. doi: 10.3389/fnsys.2015.00038
Reuter, M., Schmitz, A., Corr, P. J., and Hennig, J. (2006). Molecular genetics support Gray’s personality theory: The interaction of COMT and Drd2 polymorphisms predicts the behavioural approach system. Int. J. Neuropsychopharmacol. 9, 155–166. doi: 10.1017/S1461145705005419
Seib-Pfeifer, L. E., Pugnaghi, G., Beauducel, A., and Leue, A. (2017). On the replication of factor structures of the positive and negative affect schedule (Panas). Pers. Individ. Differ. 107, 201–207. doi: 10.1016/j.paid.2016.11.053
Shye, S., Elizur, D., and Hoffman, M. (1994). Introduction to facet theory: Content design and intrinsic data analysis in behavioral research. Thousand Oaks, CA: Sage. doi: 10.4135/9781412984645
Smith, E. H., Horga, G., Yates, M. J., Mikell, C. B., Banks, G. P., Pathak, Y., et al. (2019). Widespread temporal coding of cognitive control in the human prefrontal cortex. Nat. Neurosci. 22, 1883–1891. doi: 10.1038/s41593-019-0494-0
Sterba, S. K., and Rights, J. D. (2017). Effects of parceling on model selection: Parcel-allocation variability in model ranking. Psychol. Methods 22, 47–68. doi: 10.1037/met0000067
Stoeber, J., and Corr, P. J. (2015). Perfectionism, personality, and affective experiences: New insights from revised reinforcement sensitivity theory. Pers. Individ. Differ. 86, 354–359. doi: 10.1016/j.paid.2015.06.045
Strobel, A., Beauducel, A., Debener, S., and Brocke, B. (2001). Psychometrische und strukturelle Merkmale einer deutschsprachigen version des BIS/BAS Fragebogens von Carver und White. Z. Differ. Diagnost. Psychol. 22, 216–227. doi: 10.1024//0170-1789.22.3.216
Süß, H.-M., and Beauducel, A. (2005). “Faceted models of intelligence,” in Understanding and measuring intelligence, eds O. Wilhelm and R. Engle (Thousand Oaks, CA: Sage). doi: 10.4135/9781452233529.n18
Veccione, M., and Corr, P. J. (2020). Development and validation of a short version of the reinforcement sensitivity theory of personality questionnaire (RST-PQ-S). J. Pers. Assess. 103, 535–546. doi: 10.1080/00223891.2020.1801702
Walker, R. J., and Jackson, C. J. (2016). Examining the validity of the revised reinforcement sensitivity theory scales. Pers. Individ. Differ. 106, 90–94. doi: 10.1080/00223980.2017.1419158
West, R., and Ussher, M. (2010). Is the ten-item questionnaire of smoking urges (QSU-brief) more sensitive to abstinence than shorter craving measures? Psychopharmacology 208, 427–432. doi: 10.1007/s00213-009-1742-x
Wilson, G. D., Barrett, P. T., and Gray, J. A. (1989). Human reactions to reward and punishment: A questionnaire examination of Gray’s personality theory. Br. J. Psychol. 80, 509–515. doi: 10.1111/j.2044-8295.1989.tb02339.x
Wilson, G. D., Gray, J. A., and Barrett, P. T. (1990). A factor analysis of the Gray-Wilson personality questionnaire. Pers. Individ. Differ. 11, 1037–1045. doi: 10.1016/0191-8869(90)90131-A
Wytykowska, A., Fajkowska, M., Domaradzka, E., and Jankowski, K. S. (2017). Construct validity of the polish version of the reinforcement sensitivity theory-personality questionnaire. Pers. Individ. Differ. 109, 172–180. doi: 10.1016/j.paid.2016.12.054
Keywords: conflict monitoring, trait-BIS/BAS, CFA, item parceling, construct validity
Citation: Leue A, Reuter M, Corr PJ and Ettinger U (2022) Construct validity of questionnaires for the original and revised reinforcement sensitivity theory. Front. Psychol. 13:1026894. doi: 10.3389/fpsyg.2022.1026894
Received: 24 August 2022; Accepted: 20 October 2022;
Published: 21 November 2022.
Edited by:
Monika Fleischhauer, MSB Medical School Berlin, GermanyReviewed by:
Richard Anthony Inman, Lusíada University of Porto, PortugalPetar Čolović, University of Novi Sad, Serbia
Copyright © 2022 Leue, Reuter, Corr and Ettinger. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Anja Leue, bGV1ZUBwc3ljaG9sb2dpZS51bmkta2llbC5kZQ==
†ORCID: Anja Leue, orcid.org/0000-0002-2588-5226; Martin Reuter, orcid.org/0000-0003-1050-9655; Philip J. Corr, orcid.org/0000-0002-7618-0058; Ulrich Ettinger, orcid.org/0000-0002-0160-0281