- 1Department Tranzo, Tilburg School of Social and Behavioral Sciences, Tilburg University, Tilburg, Netherlands
- 2Clinical Centre of Excellence for Body, Mind, and Health, GGz Breburg, Tilburg, Netherlands
- 3Department of Methodology and Statistics, Tilburg School of Social and Behavioral Sciences, Tilburg University, Tilburg, Netherlands
Introduction: The Bermond–Vorst Alexithymia Questionnaire (BVAQ) has been validated in student samples and small clinical samples, but not in the general population; thus, representative general-population norms are lacking.
Aim: We examined the factor structure of the BVAQ in Longitudinal Internet Studies for the Social Sciences panel data from the Dutch general population (N = 974).
Results: Factor analyses revealed a first-order five-factor model and a second-order two-factor model. However, in the second-order model, the factor interpreted as analyzing ability loaded on both the affective factor and the cognitive factor. Further analyses showed that the first-order test scores are more reliable than the second-order test scores. External and construct validity were addressed by comparing BVAQ scores with a clinical sample of patients suffering from somatic symptom and related disorder (SSRD) (N = 235). BVAQ scores differed significantly between the general population and patients suffering from SSRD, suggesting acceptable construct validity. Age was positively associated with alexithymia. Males showed higher levels of alexithymia.
Discussion: The BVAQ is a reliable alternative instrument for measuring alexithymia.
Introduction
Sifneos (1) introduced the terminology of alexithymia to describe emotional deficiencies in patients suffering from classic psychosomatic disorders and epilepsy (2, 3). These patients were unaware of their feelings and their unawareness was accompanied by an inability to fantasize about their inner thoughts, feelings, and attitudes. Several studies report that alexithymia is a relatively stable trait rather than a state-dependent phenomenon (4–6). Alexithymia has been linked to neurobiological and neuropsychological characteristics such as the functioning of the “visceral” or “limbic” brain [e.g., MacLean (3)]. Furthermore, alexithymia has been associated with somatization (7, 8), somatoform disorder (9), and psychosomatic symptoms (10), and is considered a risk factor for the development of major depression (11), schizophrenia (12), psychosis (13), and eating disorders (14). Moreover, emotional deficiencies were found to have a negative impact on one’s health and were a potential obstacle for successful psychological treatment (15). Recently, De Berardis et al. (16) evaluated the relationship between alexithymia and suicide risk and recommended the assessment of alexithymia in clinical practice. This renders alexithymia important in research on understanding the onset and progress of medically unexplained symptoms and to further improve the effectiveness of psychotherapeutic interventions.
The conceptualization of alexithymia is ongoing, and several questionnaires have been developed to assess alexithymia: the two most frequently used questionnaires are the Bermond–Vorst Alexithymia Questionnaire (BVAQ) (17) and the twenty-item Toronto Alexithymia Scale (TAS-20) (18). Both questionnaires are self-report measures and both have good reliability (19). The TAS-20 operationalizes alexithymia as a constellation of three cognitive factors: difficulty identifying feelings, difficulty describing feelings, and external-oriented thinking (18). However, the TAS does not cover fantasizing, which Bagby et al. (20) and Bermond et al. (19) conceived as another essential feature of alexithymia. The absence of fantasizing motivated Bagby and colleagues to develop the Toronto Structured Interview for Alexithymia (TSIA) (21), which also measures fantasizing. The third factor of the TAS-20 and the TSIA (the externally oriented thinking factor) actually reflects how people have cognitions about their feelings to guide their behaviors, and so, it describes a possible connection between cognition and emotions, as is reflected in confirmatory factor analyses of the original TSIA (21). Another study of the TSIA (22) showed a four-factor model with difficulty identifying feelings and difficulty describing feelings item-facet sets nested under one higher-order factor labeled affect awareness, and the externally oriented thinking and the imaginal processes item-facet sets nested under a second higher-order factor labeled operatory thinking, consistent with the conceptualization of alexithymia as involving different domains in emotional processing and emotional experience.
The BVAQ takes the emotional aspects of alexithymia into account in a more explicit way. It uses a more comprehensive definition of alexithymia by operationalizing alexithymia as a constellation of five basic factors: ability to fantasize about virtual matters (fantasizing), ability to identify emotions (identifying), looking for an explanation of emotional reactions (analyzing), ability to describe and/or communicate about emotional reactions (verbalizing), and ability to be emotionally aroused (emotionalizing). The inclusion of emotionalizing as a distinctive factor distinguishes the BVAQ from the TSIA and the TAS-20.
According to Vorst and Bermond (17), emotionalizing refers to the degree of emotional arousal by emotion-inducing events. However, considering emotionalizing as an aspect of alexithymia is subject to debate (23), because emotionalizing might not describe differences in awareness of feelings but rather differences in physiological arousal (20). The BVAQ enables the clinician to assess both cognitive and affective aspects of alexithymia in a more explicit way. Hence, the BVAQ provides clinicians with clinically relevant information.
Internal Validity of the BVAQ
For justifiable use of the BVAQ, both in research and clinical settings, it is important that its psychometric properties are well understood. Although the factorial structure and the psychometric properties of the BVAQ have been the subject of several studies [e.g., Bagby et al. (20); Bekker et al. (24); Bermond et al. (25)], six potential issues necessitate further research: (a) indeterminacy of the BVAQ’s factor structure, (b) use of inadequate groups such as student samples, (c) use of small sample sizes, (d) invalid respondent answers due to lack of motivation to fill out the BVAQ, (e) lack of comparison of the BVAQ between groups with expected different alexithymia levels, and (f) factor structures for indicative and counter-indicative items.
Several studies replicated the first-order five-factor structure of the BVAQ, including the factors identifying, verbalizing, analyzing, fantasizing, and emotionalizing [e.g., Bagby et al. (20); Bekker et al. (24); Bermond et al. (25); Deborde et al. (26); Vorst and Bermond (17)], but Hornsveld and Kraaimaat (27) found poor fit. Bermond et al. (25) reduced the five factors to two second-order factors, representing a cognitive dimension and an affective dimension. These two second-order factors were obtained using principal component analysis (PCA) followed by both orthogonal (varimax) and oblique (oblimin) rotation and were corroborated by findings in neuropsychological research (28). Other studies were unable to replicate the second-order factors (20) or the affective dimension (24). Hence, our study aims at addressing internal validity by exploring the first-order and second-order factor structure of the BVAQ using exploratory factor analyses (EFA).
External Validity
Different explanations may be given for the ambiguity in the first-order and the second-order factor structures, some of which pertain to the external validity of the BVAQ. Most studies used small clinical samples or student samples, usually psychology students, which may have contributed to the ambiguous findings regarding factor structure so far. Student samples cannot be considered to adequately represent the populations of interest, such as the general population or clinical populations. Another problem is that PCA or EFA using small samples may be overly sensitive to sampling fluctuation (29), limiting the generalizability of the sample results to the population. Sample size limitations were rarely recognized in the literature. Hence, in this study, we used a large sample and we explored the external validity of the BVAQ in several ways.
Ecological Validity
In this study, we used panel data from a large sample from the general population. A disadvantage of panel data is that respondents complete the questionnaire under artificial conditions because the outcomes of the BVAQ are of no personal interest to the respondent. As a result, respondents may not be motivated to complete the selected questionnaires (thus inducing selection bias), may complete the questionnaire randomly, or may tend to give only extreme responses (i.e., either 1 or 5 scores). This might result in data having questionable validity that provide a biased picture of the questionnaire’s ecological validity. Invalid data may also explain ambiguous factor-analysis results. Person-fit analysis (30) may signal traitedness for a limited number of respondents, thus casting doubt on the validity of their data (31).
Construct Validity, Differences Between Populations
The BVAQ makes it possible to assess differences in alexithymia scores between populations. Differences are likely to be found between the general population and patients suffering from somatic symptom and related disorder (SSRD) (32), which replaced the somatoform disorders (33). Somatoform disorders were related to alexithymia, and we expect that the same relationship exists for patients suffering from SSRD. Therefore, for investigating construct validity, patients with a high expected likelihood of alexithymia were included in the study. We anticipated that these patients would score higher on alexithymia than non-patients. Previous studies suggest that alexithymia mediated the effectiveness of psychotherapy (34). Patients were recruited from a specialty mental health outpatient clinic for patients suffering from SSRD. The data were collected during intake for treatment; hence, patients might be more honest with respect to their possible alexithymia symptoms than people from the general population, who were investigated without a personal treatment objective. Observed mean differences in BVAQ scores between the general population and SSRD patients would provide further evidence of the questionnaire’s construct validity.
Construct Validity, Indicative and Counter-Indicative Items
Another validity issue with the BVAQ is the use of indicative and counter-indicative items. Questionnaires containing indicative and counter-indicative items, in the literature often referred to as balanced scales [e.g., Vigneau and Cormier (35)], may reveal additional factors related to response styles, or additional factors may arise because positively and negatively worded items might tap slightly different attributes, thus limiting construct validity. Subtle differences between subpopulations with respect to the interpretation of indicative and counter-indicative items might also explain differences between the factorial structures found in different BVAQ studies. Interpretation differences have received little attention so far. To further understand the possible wording effects and possible implications for using the BVAQ in clinical practice, we performed two EFAs, one for the indicative items (e.g., I find it difficult to express my feelings verbally) and one for the counter-indicative items (e.g., I often use my imagination).
Scoring
Bermond–Vorst Alexithymia Questionnaire item scores may be added to obtain test scores for items loading on the first-order factors, the second-order factors, and for all the items in the questionnaire. In general, sum scores are more reliable when the number of items grows larger, but when additional items tap different traits, the conceptual interpretation of the scores may be less clear. For example, total BVAQ scores are most reliable, but equal scores might reflect different alexithymia profiles, thus hampering the clinical interpretation of total scores. Therefore, sum scores have to be based on subsets of items allowing a clear interpretation. Vorst and Bermond (17) advocated the use of second-order BVAQ scores because these scores preserve about 70% of the variance of the first-order scores and maintain a clear meaning. Researchers and clinicians may want to use first-order scores to investigate how different alexithymia aspects correlate with other variables, but then, the question arises whether first-order scores have additional value compared to second-order scores. Reise et al. (36) showed that, under certain conditions, total scores on a multi-factor questionnaire may provide more reliable information about specific trait aspects than scores based on single factors. We compared the psychometric properties of sum scores based on first-order factors and second-order factors, including sum-score reliability, and explored whether or not first-order test scores were more reliable than the second-order test scores.
Finally, we provided norms based on normative data from the general population to enhance the interpretation of individual BVAQ sum scores. Because, in former studies, results regarding gender and age differences were ambiguous (17, 37), we explored gender and age differences with respect to the BVAQ.
Materials and Methods
Participants
General Population Sample
Data were used from the Longitudinal Internet Studies for the Social Sciences (LISS) panel (www.lissdata.nl) collected by CentERdata (Tilburg University, The Netherlands). The LISS panel constitutes a representative panel that consists of 4,500 households, comprising 7,000 Dutch-speaking adults from the general population, permanently residing in the Netherlands, who participate in monthly internet surveys. The panel was drawn from the population register by Statistics Netherlands. Households without access to the internet were provided with a computer and an internet connection. Panel members complete online questionnaires for about 15–30 min on a monthly basis. Relevant ethical safeguards were met with respect to the participant’s confidentiality and consent. More detailed information about the LISS panel is provided in Scherpenzeel et al. (38).
For this study, a random sample of 1,434 panel members from the LISS panel was invited by email to complete an online questionnaire that included the BVAQ, but 335 respondents (23.4%) did not respond. Thirteen participants (1.2%) started filling out the BVAQ but did not complete the survey and were considered non-responders. Hence, 1,086 (98.8%) participants completed the questionnaire. Table 1 shows the sample characteristics of both responders (47% males and 53% females) and non-responders (44.8% males and 55.2% females). In the analysis sample, men were on average older than women [t(972) = −2.95, p = 0.003, d = 0.19]. Responders were significantly older [mean difference = 12.3, t(1,432) = 12.72, p < 0.001, d = 0.78], better educated (p = 0.03, Cramer’s V = 0.10), and more often engaged in a relationship (p < 0.001, Cramer’s V = 0.20) than non-responders. Figure 1 shows how the final sample was obtained. The Dutch version of the BVAQ was digitized and administered to the LISS panel. After data collection, the raw data were transformed following the scoring syntax suggested by Vorst and Bermond (17).
Outpatient Clinic Sample
A sample of patients suffering from SSRD (N = 235) was used for external validation. All consecutive patients referred to the Clinical Centre of Excellence for Body, Mind, and Health (CLGG) situated in Tilburg, The Netherlands between August 2013 and April 2016 were included. The BVAQ was self-administered during the standard intake procedure. The Commission of Scientific Research of GGz Breburg approved the conduct of this study (file number: CWO 2014-09). Patients gave consent to the use of their intake data for scientific research purposes. The inclusion criterion was an age of at least 18 years. Patients were excluded if they were engaged in profession injury or personal procedures, were unable to come to CLGG, had a primary focus that was not physically related, had a psychosis, were at active (threatening) suicide risk, or were substance dependent.
Instrument
Bermond–Vorst Alexithymia Questionnaire
Alexithymia was measured by means of the Dutch BVAQ. The BVAQ comprises 40 items; half of the items are alexithymia indicative and the other half are counter-indicative. Respondents rated their answer on a 5-point Likert scale ranging from “this definitely applies” to “this in no way applies.” All items were scored 1 through 5 such that higher scores reflect higher levels of alexithymia (17). The questionnaire comprises five subscales of eight items each, which are identifying, verbalizing, analyzing, fantasizing, and emotionalizing, each in accordance with the five-factor model of alexithymia (17). Given item scores ranging from 1 to 5, the first-order test scores range from 8 to 40. Test scores on the cognitive factor were obtained by adding the total scores on the subscales identifying, analyzing, and verbalizing, meaning that test scores can range from 24 through 120. Test scores of the affective factor were obtained by adding the total scores on the subscales emotionalizing and fantasizing, thus producing test scores ranging from 16 through 80. Hence, high cognitive test scores represent problems with respect to the conscious experience of arousal accompanying emotions and high affective test scores reflect difficulties with respect to emotionalizing and fantasizing.
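The scoring rules above can be illustrated with a minimal Python sketch. It assumes that the item scores have already been recoded so that higher scores reflect higher alexithymia, and it assumes a cyclic item-to-subscale assignment (items 1, 6, 11, ... verbalizing; 2, 7, ... fantasizing; 3, 8, ... identifying; 4, 9, ... emotionalizing; 5, 10, ... analyzing). That assignment is inferred from the item numbers mentioned in this article and should be checked against the official scoring syntax; the function names are hypothetical.

```python
SUBSCALES = ("verbalizing", "fantasizing", "identifying",
             "emotionalizing", "analyzing")


def subscale_of(item_number):
    """Return the subscale of an item (ASSUMED cyclic item order, see text)."""
    return SUBSCALES[(item_number - 1) % 5]


def score_bvaq(responses):
    """Sum recoded item scores (1-5, higher = more alexithymic) into test scores.

    `responses` maps each item number (1-40) to its recoded item score.
    """
    if sorted(responses) != list(range(1, 41)):
        raise ValueError("expected scores for items 1 through 40")
    scores = {name: 0 for name in SUBSCALES}
    for item, value in responses.items():
        if value not in (1, 2, 3, 4, 5):
            raise ValueError(f"item {item}: score must be an integer from 1 to 5")
        scores[subscale_of(item)] += value
    # Second-order test scores are sums of first-order test scores.
    scores["cognitive"] = (scores["identifying"] + scores["analyzing"]
                           + scores["verbalizing"])
    scores["affective"] = scores["emotionalizing"] + scores["fantasizing"]
    scores["total"] = scores["cognitive"] + scores["affective"]
    return scores


# Boundary check against the score ranges stated in the text.
lowest = score_bvaq({item: 1 for item in range(1, 41)})
highest = score_bvaq({item: 5 for item in range(1, 41)})
print(lowest["identifying"], highest["identifying"])  # 8 40
print(lowest["cognitive"], highest["cognitive"])      # 24 120
print(lowest["affective"], highest["affective"])      # 16 80
```

The two boundary cases reproduce the ranges given above: 8 to 40 for a first-order score, 24 to 120 for the cognitive factor, and 16 to 80 for the affective factor.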
Data Analysis
Internal Validity
Validity was investigated in a series of analyses. Because data were collected in low-stakes conditions, some respondents may not have been motivated to complete questionnaires seriously. Others may have used idiosyncratic response styles. Resulting aberrant item-response patterns were identified using person-fit analysis (30). Aberrant patterns were removed from the sample prior to EFA to obtain a sample without invalid item-response patterns. For person-fit analysis, we used the average normed number of Guttman errors (denoted GN) (40) across the subscales. Statistic GN can assume values between 0 (perfect fit) and 1 (extreme misfit). Following Emons et al. (41), we removed the highest 10% of the cases, which amounts to removing cases whose GN value was above 0.326. This cutoff is consistent with cutoffs suggested by Emons (40), based on simulations. This resulted in two (overlapping) samples, the complete sample and the analysis sample.
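As a hedged illustration of the removal step only, the sketch below flags the 10% of cases with the highest GN values. It assumes that a GN value has already been computed for every respondent (the computation of normed Guttman errors is not shown), and the example values are hypothetical rather than study data. The function name and the use of `statistics.quantiles` are choices made for this sketch.

```python
from statistics import quantiles


def split_by_person_fit(gn_values, n_groups=10):
    """Split case indices at the highest cut point of the GN distribution.

    With n_groups=10 the cut point is the 90th percentile, so roughly the
    10% of cases with the largest GN values end up in `remove`.
    """
    cutoff = quantiles(gn_values, n=n_groups)[-1]
    keep = [i for i, gn in enumerate(gn_values) if gn <= cutoff]
    remove = [i for i, gn in enumerate(gn_values) if gn > cutoff]
    return cutoff, keep, remove


# HYPOTHETICAL GN values for 100 respondents (not the study data).
example_gn = [i / 100 for i in range(100)]
cutoff, keep, remove = split_by_person_fit(example_gn)
print(len(keep), len(remove))  # 90 10
```

In the study, the corresponding empirical cut point was a GN value of 0.326.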
Exploratory factor analysis was done as follows. First, we used parallel analysis (42) in combination with minimum rank factor analysis (MRFA) [Ten Berge and Kiers (43), Timmerman and Lorenzo-Seva (44)] to determine the number of common factors. Like any factor-analysis approach, MRFA maximizes the item communalities given the number of factors (43), but MRFA does this such that the reduced correlation matrix is statistically correct. Therefore, MRFA allows valid estimates of the explained common variance (ECV) (45), which expresses the proportion of common variance explained by the hypothesized factors. Parallel analysis compares the percentage of variance explained by the factors with the percentage of variance explained by the same number of factors resulting from randomly generated data. In total, 500 random correlation matrices were generated by means of permutation of the raw data and subsequently analyzed by means of MRFA. Factors were considered meaningful if the percentage of variance these factors explained exceeded the percentage of variance the random-data factors explained. Because the BVAQ comprises ordinal items, showing both positive and negative skewness, some also showing excessive kurtosis, factor analysis of the polychoric correlation matrix was preferred (46). Parallel analysis was conducted by means of the free software program FACTOR version 10.3.01 (47).
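The logic of permutation-based parallel analysis can be sketched in a few lines of Python. This is a simplified stand-in and not the procedure implemented in FACTOR: it uses eigenvalues of the Pearson correlation matrix instead of MRFA on polychoric correlations, it compares observed eigenvalues with the mean of the permuted-data eigenvalues (an assumed criterion), and it uses 100 rather than 500 permutations to keep the run time short. Because the number of items is fixed, comparing eigenvalues is equivalent to comparing percentages of explained variance. All function names and the simulated data are hypothetical.

```python
import math
import random
from statistics import mean, pstdev


def correlation_matrix(rows):
    """Pearson correlations between the columns of row-major data."""
    standardized = []
    for col in zip(*rows):
        m, sd = mean(col), pstdev(col)
        standardized.append([(x - m) / sd for x in col])
    n = len(rows)
    return [[sum(a * b for a, b in zip(ci, cj)) / n for cj in standardized]
            for ci in standardized]


def symmetric_eigenvalues(matrix, max_sweeps=50, tol=1e-12):
    """Eigenvalues of a symmetric matrix via cyclic Jacobi rotations, descending."""
    a = [list(row) for row in matrix]
    n = len(a)
    for _ in range(max_sweeps):
        off = sum(a[i][j] ** 2 for i in range(n) for j in range(i + 1, n))
        if off < tol:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if a[p][q] == 0.0:
                    continue
                diff = a[q][q] - a[p][p]
                theta = math.pi / 4 if diff == 0 else 0.5 * math.atan(2 * a[p][q] / diff)
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):  # rotate columns p and q
                    akp, akq = a[k][p], a[k][q]
                    a[k][p], a[k][q] = c * akp - s * akq, s * akp + c * akq
                for k in range(n):  # rotate rows p and q
                    apk, aqk = a[p][k], a[q][k]
                    a[p][k], a[q][k] = c * apk - s * aqk, s * apk + c * aqk
    return sorted((a[i][i] for i in range(n)), reverse=True)


def parallel_analysis(rows, n_permutations=100, seed=0):
    """Count leading factors whose eigenvalue exceeds the permuted-data mean."""
    rng = random.Random(seed)
    observed = symmetric_eigenvalues(correlation_matrix(rows))
    columns = [list(col) for col in zip(*rows)]
    totals = [0.0] * len(columns)
    for _ in range(n_permutations):
        shuffled = []
        for col in columns:
            permuted = col[:]
            rng.shuffle(permuted)  # break associations between items
            shuffled.append(permuted)
        eigs = symmetric_eigenvalues(correlation_matrix(list(zip(*shuffled))))
        totals = [t + e for t, e in zip(totals, eigs)]
    reference = [t / n_permutations for t in totals]
    retained = 0
    for obs, ref in zip(observed, reference):
        if obs <= ref:
            break
        retained += 1
    return retained, observed, reference


def simulate_two_factor_data(n_respondents=300, seed=1):
    """SYNTHETIC continuous data: items 1-3 and items 4-6 form two factors."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n_respondents):
        f1, f2 = rng.gauss(0, 1), rng.gauss(0, 1)
        rows.append([f + 0.5 * rng.gauss(0, 1) for f in (f1, f1, f1, f2, f2, f2)])
    return rows


retained, observed, reference = parallel_analysis(simulate_two_factor_data())
print("factors retained:", retained)
```

Permuting each item column independently keeps the marginal score distributions intact while removing the associations between items, which is why the permuted-data eigenvalues serve as a chance baseline.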
Once the number of factors was determined, we investigated the factor structure using the configuration of the factor loadings. Promax rotation (48) was used to obtain the final rotated factor loadings. The presence of second-order factors was investigated by factor-analyzing the correlations between the first-order factors obtained using the first-order factor model. The final factor solution was again obtained using promax rotation. The final structures were inspected for adherence to a simple structure (48) and compared with the factorial structure Vorst and Bermond (17) found. EFAs were run in Mplus 7.1, using weighted least squares means and variance adjusted estimation (49), and in the R package psych (50).
Total-score reliability is commonly examined using coefficient alpha (51). The accompanying 95% confidence intervals (CIs) of coefficient alpha were obtained using the method of Feldt et al. (52) as implemented in the package cocron (53), which is also available in R (54).
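For readers who want to see the computation, coefficient alpha for a set of k items can be written as alpha = k/(k − 1) × (1 − sum of item variances / variance of the sum score). The Python sketch below implements this textbook formula. It does not reproduce the Feldt confidence interval, which requires quantiles of the F distribution that are not available in the Python standard library. The function name and the example scores are hypothetical.

```python
from statistics import variance


def coefficient_alpha(rows):
    """Coefficient alpha for row-major item scores (one row per respondent)."""
    k = len(rows[0])
    if k < 2:
        raise ValueError("coefficient alpha needs at least two items")
    item_variances = [variance(col) for col in zip(*rows)]
    total_variance = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# HYPOTHETICAL scores of three respondents on two items (not study data).
example_rows = [[1, 1], [2, 3], [3, 2]]
alpha = coefficient_alpha(example_rows)
print(round(alpha, 3))  # 0.667
```

In the example, both item variances equal 1 and the sum-score variance equals 3, so alpha = 2 × (1 − 2/3) ≈ 0.667.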
Construct and External Validity
To examine construct validity, we ran EFAs separately for the indicative and for the counter-indicative items. EFAs were run in Mplus 7.1, using weighted least squares means and variance adjusted estimation (49), and in the R package psych (50).
To examine external validity, we compared the BVAQ scores of the general population with the scores of SSRD patients to explore the degree to which the BVAQ discriminates between groups. Independent-samples t-tests were used to compare the first-order and second-order BVAQ scores, and effect sizes were estimated using Cohen’s d.
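The group comparison can be sketched as follows. The sketch assumes the pooled-variance (Student) form of the independent-samples t-test and the pooled-SD form of Cohen’s d; the article does not state which variants were used. The sign convention (first group minus second group) is also an assumption, and the example scores are hypothetical.

```python
import math
from statistics import mean, variance


def compare_groups(group_a, group_b):
    """Return the pooled-variance t statistic, its df, and Cohen's d (a minus b)."""
    n_a, n_b = len(group_a), len(group_b)
    df = n_a + n_b - 2
    pooled_var = ((n_a - 1) * variance(group_a)
                  + (n_b - 1) * variance(group_b)) / df
    diff = mean(group_a) - mean(group_b)
    d = diff / math.sqrt(pooled_var)
    t = diff / math.sqrt(pooled_var * (1 / n_a + 1 / n_b))
    return t, df, d


# HYPOTHETICAL subscale scores for two small groups (not study data).
t_value, df, cohens_d = compare_groups([1, 2, 3], [3, 4, 5])
print(round(t_value, 3), df, round(cohens_d, 3))  # -2.449 4 -2.0
```

A negative d under this convention means that the first group scored lower than the second group.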
Scoring
To examine whether first-order test scores provided additional diagnostic information about the first-order factors that is more reliable than the information provided by the aggregated total scores, we used Haberman’s procedure (55). This procedure uses the proportional reduction in mean squared errors (PRMSE). The PRMSE is conceptually similar to the reliability, and for first-order test scores, the PRMSE is equivalent to coefficient alpha. Large PRMSEs are desirable. PRMSEs were obtained using the R-package sirt (56).
Because of the expected differences between gender groups and age groups with respect to alexithymia, it might be useful to have separate norms for males and females and for different age groups. We first examined the relationship of gender and age with alexithymia to decide if separate norms for men and women and different age groups were needed. In case gender or age was associated with alexithymia, we used regression analysis to derive normative data [e.g., Oosterhuis et al. (57)]. This was done as follows. First, we regressed BVAQ scores on gender and age using a linear model with main effects only. The regression model provides estimates of the mean BVAQ score as a function of gender and age. Second, for each respondent i, we computed a residual; that is, $e_i$ = observed test score − expected test score, based on the estimated regression model. The distribution of the residuals served as the normative reference distribution. The residuals were standardized using $z_i = e_i / SD_e$, in which $SD_e$ is the SD of the residuals. The standardized residual indicates the relative position of the individual’s score with respect to the mean in the population of persons having the same gender and the same age. To facilitate the interpretation of the standardized residuals, we converted standardized residuals to percentile values by means of the standard normal cumulative distribution. Model assumptions were tested by means of graphical inspection of the residuals. Analyses were done in Statistical Package for the Social Sciences for Windows version 22.0 (58).
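The norming steps described above (expected score from the regression model, residual, standardized residual, percentile) can be sketched as follows. The regression coefficients and the residual SD in the example are placeholders chosen for illustration only; they are not the estimates reported in Table 7. The function name and the coding of gender (1 = male, 0 = female) are assumptions of this sketch.

```python
from statistics import NormalDist


def norm_score(observed, age, is_male, intercept, b_age, b_male, residual_sd):
    """Convert an observed test score into a regression-based percentile.

    Returns the expected score for the person's age and gender, the
    standardized residual z, and the percentile under the standard
    normal distribution.
    """
    expected = intercept + b_age * age + b_male * (1 if is_male else 0)
    residual = observed - expected
    z = residual / residual_sd
    percentile = 100 * NormalDist().cdf(z)
    return expected, z, percentile


# PLACEHOLDER coefficients for illustration (NOT the Table 7 estimates).
coefficients = dict(intercept=45.0, b_age=0.1, b_male=1.0, residual_sd=8.0)
expected, z, percentile = norm_score(observed=58.0, age=40, is_male=True,
                                     **coefficients)
print(expected, z, round(percentile, 1))  # 50.0 1.0 84.1
```

In this illustration, a 40-year-old man with an observed score of 58 scores one residual SD above the mean expected for his age and gender, which corresponds to roughly the 84th percentile.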
Results
Comparison of the background characteristics in the original sample and the analysis sample did not show any differences. Inspection of the misfitting cases showed unsystematic patterns. Six respondents scored “3” on all items, suggesting they did not seriously fill out the BVAQ. Consequently, they were considered as cases showing extreme response styles. The corresponding data records were removed from the sample, thus producing an analysis sample of 974 participants to be used for EFA.
Factor Structure
Parallel analysis suggested five common factors. Model fit of the first-order five-factor model was acceptable (Comparative Fit Index = 0.94; Root Mean Square Error of Approximation = 0.046; Root Mean Square of the Residuals = 0.032). The first-order five-factor model explained 45.7% of the total variance and 68.3% of the common variance. Extracting a sixth factor only marginally improved the ECV to 71.7%, thus accounting for only a small proportion of common variance between the items. Therefore, we retained five first-order factors for further analysis.
Table 2 (columns 2–12) shows the standardized factor loadings for the first-order five-factor model and promax rotated factors, for the full sample and the analysis sample (only loadings above 0.3 are reported). In both samples, the loadings approximated a simple structure [e.g., Gorsuch (48)]; that is, for each factor, at least a few items only loaded predominantly on that specific factor. However, the pattern of loadings differed from the postulated five-factor structure (17), and results differed between the complete sample and the analysis sample. Based on the literature (17, 25), we initially labeled the factors as follows: verbalizing (F1), fantasizing (F2), identifying (F3), emotionalizing (F4), and analyzing (F5).
Table 2. Standardized factor loadings of the five-factor model for complete and analysis sample (i.e., without aberrant response patterns). Items are listed in clusters according to the subscales suggested by Vorst and Bermond. Only loadings of 0.3 or higher are reported.
Comparison of the factor loadings between the full sample and the analysis sample showed a few notable differences. Deletion of the aberrant item-score patterns removed the cross loadings for items in the subscales verbalizing and fantasizing. In the complete sample, the identifying items 8, 18, 23, and 33 loaded on analyzing instead of identifying, but in the analysis sample, all items loaded on the postulated factors, with low cross loadings for items 13 and 28 on analyzing. Interestingly, these items are the counter-indicative items, and the results suggest that these items are indicators of analyzing rather than identifying. For the analyzing items, the factor loadings showed an unsystematic pattern in the complete sample. In the analysis sample, the indicative items (10, 20, 30, and 40) loaded on the postulated factor but only item 40 had a substantial loading (>0.60), two items (25 and 35) had weak cross loadings on other factors, and the other items (5 and 15) loaded on none of the factors. Hence, the subscale analyzing could not be replicated in either the complete or the analysis sample.
Table 3 shows the estimated factor correlations and the second-order factor structure based on the estimated factor correlations, in both the complete and the analysis sample. Results for verbalizing, fantasizing, identifying, and emotionalizing were consistent across different EFAs, corroborating the presence of a cognitive and an affective domain within the BVAQ. The affective dimension emotionalizing showed a substantive cross loading with the cognitive dimension in the complete sample but not in the analysis sample. Results for analyzing were ambiguous.
Table 3. Correlations between the first-order factors of the five-factor model and standardized second-order factor loadings in the analysis sample.
Reliability
Table 4 (column 3) shows coefficient alpha and corresponding 95% CIs for the first-order test scores and the second-order test scores. Coefficient alpha ranged from 0.75 to 0.89. PRMSEs for the first-order test scores (column 4) were higher than the PRMSE for the second-order test scores or total scores (column 5). Table 4 (column 2) also shows the range of item-rest correlations of the items constituting the first-order test scores and the second-order test scores in the general population. Item-rest correlations suggested adequate assignment of the individual items to the subscales. These results also showed that some items are weak indicators of the general attribute of alexithymia. In particular, item 2 (Before I fall asleep, I imagine all kinds of events, encounters and conversations), item 5 (I hardly ever consider my feelings), and item 15 (When I feel uncomfortable, I will not trouble myself even more by asking myself why) are weak indicators.
Table 4. Reliability and additional values (PRMSEs) of the first-order and second-order scores of the BVAQ (results obtained in the total sample).
External Validity
Table 5 shows results for EFAs for the indicative and the counter-indicative items. For both sets of items, the five-factor model fitted the data well and all items loaded on the corresponding factor. Cross loadings were absent. These results suggest that the items can be clustered into subscales as intended, but the counter-indicative items of analyzing may represent a slightly different conceptualization than the indicative items. Figure 2 shows a visualization of the factor structure for the indicative and the counter-indicative items.
Table 5. Standardized factor loadings of the five-factor model in the LISS panel data of the analysis sample, for the indicative items and counter-indicative items.
Figure 2. Second-order factor structure of the Bermond–Vorst Alexithymia Questionnaire based on exploratory factor analyses on the indicative items and on the counter-indicative items (analysis sample).
Table 3 also shows estimated factor correlations and the second-order factor structure based on the estimated factor correlations for indicative and counter-indicative items. Correlations of analyzing with the other factors were lower when analyzing all 40 items together than for indicative and counter-indicative items separately. The different factor correlations for indicative and counter-indicative items might also explain the differences between the second-order factor structures. Results suggest that indicative and counter-indicative analyzing items refer to slightly different attributes, which is obscured when analyzing all items together.
Scoring
Table 6 (columns 2–5) shows the means and SDs of the first-order test scores and the second-order test scores for the SSRD sample and for the general-population sample. Table 6 (columns 6–7) also shows the p-values and Cohen’s d for the comparison between the SSRD sample and the general-population sample. The mean scores of emotionalizing (p < 0.001, d = 0.57) and the affective dimension (p = 0.003, d = 0.22) were significantly higher in the general-population sample. The mean scores on identifying (p < 0.001, d = −0.57), verbalizing (p < 0.001, d = −0.35), and the cognitive dimension (p < 0.001, d = −0.33) were significantly lower in the general-population sample than the SSRD sample.
Table 6. Descriptive statistics of the first-order and second-order scores of the BVAQ in the general population (results obtained in the total sample), descriptive statistics for the SSRD sample (N = 234), and statistical comparison of BVAQ scores between the SSRD sample and the general population.
Inspection of the residuals suggested that BVAQ total scores were linearly related to age and that heteroscedasticity was absent. Table 7 (columns 2–4) shows the estimated unstandardized regression coefficients for predicting first-order test scores and second-order test scores by age and gender. Age and gender explained 2% (identifying) to 15% (emotionalizing) of the variance of the first-order test scores (Table 7, column 5), which amounts to small to medium effects according to Cohen’s (59) rules of thumb. A significant effect of gender was found for all subscales except fantasizing. Significant effects of age were found for the subscales fantasizing and analyzing and for the affective factor. To gauge the practical importance of age given the estimated regression model, we looked at differences between predicted scores for the youngest and the oldest respondents. The predicted score for an 18-year-old male equaled 46.9, whereas the predicted score for an 89-year-old male equaled 53.3, which represents a score difference of 6.4 units. Based on the distribution of the residuals (i.e., SD = 8.84; see Table 7), a score difference of 6.4 units amounts to Cohen’s d of 0.73 (6.4/8.84), meaning a large effect size. Therefore, it is important to control for age.
Table 7. Multiple regression analysis predicting first-order scores or second-order scores from age and gender, and distribution of the residuals.
Table 7 (columns 6–8) also describes the distribution of the residuals (i.e., SD, skewness, and kurtosis). In all models, residuals were obtained for the model including both age and gender as predictors. The residuals were normally distributed. The coefficients in Table 7 can be used to compute norm scores that take age and gender differences into account. An Excel template for this purpose, as well as norm tables for each age and gender group, is available upon request from the corresponding author.
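A minimal sketch of how regression-based norming of this kind could work is given below. It assumes a linear model with an intercept, an age coefficient, and a gender coefficient (gender coded 1 = male, 0 = female), and it converts the standardized residual to a percentile under the normality of the residuals reported above. The function name `normed_score`, the gender coding, and all coefficient values are hypothetical placeholders; the actual values are in Table 7 and are not reproduced here.

```python
from statistics import NormalDist


def normed_score(observed, age, male, intercept, b_age, b_male, resid_sd):
    """Convert an observed score to an age- and gender-conditional z-score and percentile.

    `male` is coded 1 for males and 0 for females (assumed coding).
    """
    predicted = intercept + b_age * age + b_male * male
    z = (observed - predicted) / resid_sd
    # Percentile under the assumption of normally distributed residuals.
    percentile = 100 * NormalDist().cdf(z)
    return predicted, z, percentile


# Hypothetical coefficients and score for illustration only (NOT from Table 7).
predicted, z, percentile = normed_score(
    observed=62.0, age=50, male=1,
    intercept=45.0, b_age=0.1, b_male=2.0, resid_sd=10.0,
)
print(f"predicted = {predicted:.1f}, z = {z:.2f}, percentile = {percentile:.1f}")
```

In this sketch, a z-score of 0 means that the respondent scores exactly as expected for his or her age and gender; positive values indicate more alexithymia than expected for that group.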
Discussion
This study was the first to validate the BVAQ for the general population. Aberrant item responses due to extreme responders were removed prior to the EFA in an effort to better validate the BVAQ factor structure. Removal of aberrant item-response patterns produced a factor structure that was consistent with the conceptualization of alexithymia. This study showed that person-fit analysis may contribute to a better understanding of the factor structure.
The results suggest that items indicative of analyzing represent a conceptually different attribute than counter-indicative items. A competing explanation for the different results might be the wording of the items. For example, indicative items are phrased in terms of “unclear” whereas counter-indicative items are phrased in terms of “understand.” Such small differences may invoke different cognitive processes, producing responses that represent different attributes. Because this was the first study in the general population, it is unclear whether such wording effects are typical of the general population or whether these results also generalize to other populations. This is a topic for future research. Because the results showed a clear difference with respect to the second-order factor structure for the indicative and the counter-indicative items, and because analyzing ability loaded on the affective factor as well as on the cognitive factor, our analysis of indicative and counter-indicative items may explain why the construct validity of the BVAQ was found to be suboptimal in earlier studies.
We found that the BVAQ is a reliable instrument. Additional analyses showed that when scores are aggregated to second-order test scores, reliable information about the constituent components is lost. Consequently, this study provided support for the use of first-order test scores to provide diagnostic information for understanding alexithymia at a more detailed level. Because first-order test scores have added value over second-order test scores, clinicians and researchers would do better to rely on the first-order test scores for clinical judgment.
This was also the first study that compared alexithymia scores in the general population with those in a patient population suffering from SSRD. We hypothesized that these patients would have more difficulty expressing their feelings and thoughts about their symptoms and would therefore score higher on an alexithymia scale than the general population. Another possibility was that patients gave a more involved opinion about their symptoms because the data were collected in connection with their intake for treatment. We checked the likelihood of these alternative explanations. Because higher alexithymia scores were found in the SSRD group, support for construct validity was found.
Regression analyses of alexithymia on age and gender corroborated the trends found in other studies. Males had higher mean alexithymia scores than females and a positive effect of age was found, similar to findings in studies using clinical populations [e.g., Salminen et al. (37), Franz et al. (60), Joukamaa et al. (61), Mattila et al. (62), Pasini et al. (63)]. Caution should be exercised when drawing conclusions about within-person change in alexithymia over time based on cross-sectional data. Individuals in different cohorts may grow up in different social contexts, which may produce between-person variation in mean alexithymia across age groups, while alexithymia remains stable within persons. Longitudinal data are needed to study within- and between-person differences in alexithymia over time while controlling for physical conditions. This is a topic for future research.
Normative data were reported, both unconditional and conditional on age and gender. Both types of norms have practical value, but should be used carefully. When using age- and gender-specific norms, one implicitly assumes that gender and age differences in alexithymia are related to contextual factors and not to the construct itself. Contextual factors may include social environment and time-specific social norms. For example, two persons with the same BVAQ scores but of different ages may not be considered equally alexithymic because the older person grew up in times when talking about emotions was less socially accepted, while the younger person is more used to it. Likewise, a male and a female having the same BVAQ scores may not be equally alexithymic because the female may have learned to express her emotions when she was young while the male did not. Under this assumption, gender differences result from social norms and not from the trait itself, and this effect should be partialed out when comparing BVAQ scores between males and females. However, in clinical practice, where the BVAQ is used for screening and treatment decisions, one may not want to treat males and females with the same BVAQ total scores differently. In such cases, clinicians can use the unconditional norms. Note that screening using unconditional norms may result in different prevalence rates for males and females or across age cohorts, while prevalence rates will be the same when using conditional norms.
Previous studies showed a relationship between alexithymia and distress (64–69). Distress can be an outcome or a determinant of alexithymia (70), but this topic has not yet received much attention. Tominaga et al. (71) suggested that alexithymia hampers the successful regulation of negative affect and leads to increased distress. Distress has also been shown to coincide with alexithymia as a state-dependent phenomenon (72, 73). Because the role of distress in alexithymia is unclear, future studies should address this topic.
Significant differences were found between responders and non-responders with respect to age, educational level, and marital status. Because age is associated with alexithymia, caution should be exercised when generalizing results to the general population. Another limitation involves the use of panel data. However, we corrected for extreme responders to mitigate this limitation. The BVAQ is more reliable than the TAS-20, which is possibly due to the former questionnaire’s greater number of items.
The development of tools to assess alexithymia is continuing. The TSIA (21) enables the measurement of fantasizing, which was lacking in the TAS-20 and is considered important (19, 20). Also, a recent study reported that the externally oriented thinking subscale of the TAS-20 has weak psychometric properties in younger adolescents (74). Another study corroborated this finding (75) and concluded that the psychometric properties of the externally oriented thinking subscale are poorer than those of the identifying and describing feelings subscales. Based on the findings of this study, the BVAQ could be a reliable alternative to the TAS-20 that is based on a different operationalization of the alexithymia construct.
As far as we know, this was the first study exploring the BVAQ factor structure in the general population, taking external validity into account, and comparing the general population with a patient population expected to score higher on alexithymia. The currently existing treatment options for alexithymia are not effective and the development of evidence-based treatments is necessary (76). The psychotherapeutic process relies primarily on the ability of patients to access their emotions. Patients who are unable to do so are difficult for the therapist to treat. In order to develop evidence-based treatments, a proper assessment of alexithymia is pivotal. Hence, the results of our study provide clinicians with a valuable tool for assessing alexithymia in the clinic by using the norm scores and offer clinicians and scientists a starting point for the development of evidence-based treatment options. This study thus provided insight into the reliability of the BVAQ and provided norm scores for use in clinical practice in the Netherlands.
Ethics Statement
The Commission of Scientific Research of GGz Breburg approved the conduct of this study (file number: CWO 2014-09). Patients gave consent for the use of their intake data for scientific research purposes.
Author Contributions
LV drafted the manuscript. LV and WE were responsible for the design and analysis of the data. WE, KS, and CvdF-C revised the draft. All authors approved the final manuscript.
Conflict of Interest Statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Funding
This study was financially supported by GGz Breburg, Tilburg, the Netherlands.
References
1. Sifneos PE. The prevalence of ‘alexithymic’ characteristics in psychosomatic patients. Psychother Psychosom (1973) 22(2–6):255–62. doi:10.1159/000286529
2. Nemiah JC, Sifneos PE. Psychosomatic illness: a problem in communication. Psychother Psychosom (1970) 18(1–6):154–60. doi:10.1159/000286074
3. MacLean PD. Psychosomatic disease and the “Visceral Brain”: recent developments bearing on the Papez Theory of Emotion. Psychosom Med (1949) 11(6):338–53. doi:10.1097/00006842-194911000-00003
4. De Timary P, Luts A, Hers D, Luminet O. Absolute and relative stability of alexithymia in alcoholic inpatients undergoing alcohol withdrawal: relationship to depression and anxiety. Psychiatry Res (2008) 157(1):105–13. doi:10.1016/j.psychres.2006.12.008
5. Luminet O, Rokbani L, Ogez D, Jadoulle V. An evaluation of the absolute and relative stability of alexithymia in women with breast cancer. J Psychosom Res (2007) 62(6):641–8. doi:10.1016/j.jpsychores.2007.01.003
6. Luminet O, Bagby RM, Taylor GJ. An evaluation of the absolute and relative stability of alexithymia in patients with major depression. Psychother Psychosom (2001) 70(5):254–60. doi:10.1159/000056263
7. Kellner R. Somatization: theories and research. J Nerv Ment Dis (1990) 178(3):150–60. doi:10.1097/00005053-199003000-00002
8. Rief W, Broadbent E. Explaining medically unexplained symptoms-models and mechanisms. Clin Psychol Rev (2007) 27(7):821–41. doi:10.1016/j.cpr.2007.07.005
9. Waller E, Scheidt CE. Somatoform disorders as disorders of affect regulation: a development perspective. Int Rev Psychiatry (2006) 18(1):13–24. doi:10.1080/09540260500466774
10. Taylor GJ, Bagby RM, Parker JD. Disorders of Affective Regulation: Alexithymia in Medical and Psychiatric Illness. Cambridge, UK: Cambridge University Press (1999).
11. Honkalampi K, Koivumaa-Honkanen H, Lehto SM, Hintikka J, Haatainen K, Rissanen T, et al. Is alexithymia a risk factor for major depression, personality disorder, or alcohol use disorders? A prospective population-based study. J Psychosom Res (2010) 68(3):269–73. doi:10.1016/j.jpsychores.2009.05.010
12. Van’t Wout M, Aleman A, Bermond B, Kahn RS. No words for feelings: alexithymia in schizophrenia patients and first-degree relatives. Compr Psychiatry (2007) 48(1):27–33. doi:10.1016/j.comppsych.2006.07.003
13. Van der Velde J, Swart M, van Rijn S, van der Meer L, Wunderink L, Wiersma D, et al. Cognitive alexithymia is associated with the degree of risk for psychosis. PLoS One (2015) 10(6):e0124803. doi:10.1371/journal.pone.0124803
14. Westwood H, Kerr-Gaffney J, Stahl D, Tchanturia K. Alexithymia in eating disorders: systematic review and meta-analyses of studies using the Toronto Alexithymia Scale. J Psychosom Res (2017) 99:66–81. doi:10.1016/j.jpsychores.2017.06.007
15. Lumley MA, Neely LC, Burger AJ. The assessment of alexithymia in medical settings: implications for understanding and treating health problems. J Pers Assess (2007) 89(3):230–46. doi:10.1080/00223890701629698
16. De Berardis D, Fornaro M, Orsolini L, Valchera A, Carano A, Vellante F, et al. Alexithymia and suicide risk in psychiatric disorders: a mini-review. Front Psychiatry (2017) 8:148. doi:10.3389/fpsyt.2017.00148
17. Vorst HC, Bermond B. Validity and reliability of the Bermond-Vorst Alexithymia Questionnaire. Pers Individ Dif (2001) 30:413–34. doi:10.1016/S0191-8869(00)00033-7
18. Bagby RM, Parker JD, Taylor GJ. The twenty-item Toronto Alexithymia Scale – I. Item selection and cross-validation of the factor structure. J Psychosom Res (1994) 38(1):23–32. doi:10.1016/0022-3999(94)90005-1
19. Bermond B, Oosterveld P, Vorst HCM. Measures of alexithymia. In: Boyle GJ, Saklofske DH, Matthews G, editors. Measures of Personality and Social Psychological Constructs. London, UK: Academic Press (2015). p. 227–56.
20. Bagby RM, Quilty LC, Taylor GJ, Grabe HJ, Luminet O, Verissimo R, et al. Are there subtypes of alexithymia? Pers Individ Dif (2009) 47(5):413–8. doi:10.1016/j.paid.2009.04.012
21. Bagby RM, Taylor GJ, Parker JD, Dickens SE. The development of the Toronto Structured Interview for Alexithymia: item selection, factor structure, reliability and concurrent validity. Psychother Psychosom (2006) 75(1):25–39. doi:10.1159/000089224
22. Caretti V, Porcelli P, Solano L, Schimmenti A, Bagby RM, Taylor GJ. Reliability and validity of the Toronto Structured Interview for Alexithymia in a mixed clinical and nonclinical sample from Italy. Psychiatry Res (2011) 187(3):432–6. doi:10.1016/j.psychres.2011.02.015
23. Bagby RM, Taylor GJ, Quilty LC, Parker JD. Reexamining the factor structure of the 20-item Toronto Alexithymia Scale: commentary on Gignac, Palmer, and Stough. J Pers Assess (2007) 89(3):258–64. doi:10.1080/00223890701629771
24. Bekker MH, Bachrach N, Croon MA. The relationships of antisocial behavior with attachment styles, autonomy-connectedness, and alexithymia. J Clin Psychol (2007) 63(6):507–27. doi:10.1002/jclp.20363
25. Bermond B, Clayton K, Liberova A, Luminet O, Maruszewski T, Ricci Bitti PE, et al. A cognitive and an affective dimension of alexithymia in six languages and seven populations. Cogn Emot (2007) 21(5):1125–36. doi:10.1080/02699930601056989
26. Deborde A-S, Berthoz S, Wallier J, Fermanian J, Falissard B, Jeammet P, et al. The Bermond-Vorst Alexithymia Questionnaire cutoff scores: a study in eating-disordered and control subjects. Psychopathology (2007) 41(1):43–9. doi:10.1159/000109955
27. Hornsveld R, Kraaimaat F. Alexithymia in Dutch violent forensic psychiatric outpatients. Psychol Crime Law (2012) 18(9):833–46. doi:10.1080/1068316X.2011.568416
28. Bermond B, Vorst HC, Moormann PP. Cognitive neuropsychology of alexithymia: implications for personality typology. Cogn Neuropsychiatry (2006) 11(3):332–60. doi:10.1080/13546800500368607
29. Pett MA, Lackey NR, Sullivan JJ. Making Sense of Factor Analysis: the Use of Factor Analysis for Instrument Development in Health Care Research. Thousand Oaks, CA: SAGE (2003).
30. Meijer RR, Sijtsma K. Methodology review: evaluating person fit. Appl Psychol Meas (2001) 25(2):107–35. doi:10.1177/01466210122031957
31. Reise SP, Waller NG. Traitedness and the assessment of response pattern scalability. J Pers Soc Psychol (1993) 65(1):143. doi:10.1037/0022-3514.65.1.143
32. American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders (DSM-5). Washington, DC: American Psychiatric Association (2013).
33. American Psychiatric Association. DSM-IV-TR: Diagnostic and Statistical Manual of Mental Disorders, Text Revision. Washington, DC: American Psychiatric Association (2000).
34. Ogrodniczuk JS, Piper WE, Joyce AS. Effect of alexithymia on the process and outcome of psychotherapy: a programmatic review. Psychiatry Res (2011) 190(1):43–8. doi:10.1016/j.psychres.2010.04.026
35. Vigneau F, Cormier S. The factor structure of the State-Trait Anxiety Inventory: an alternative view. J Pers Assess (2008) 90(3):280–5. doi:10.1080/00223890701885027
36. Reise SP, Bonifay WE, Haviland MG. Scoring and modeling psychological measures in the presence of multidimensionality. J Pers Assess (2013) 95(2):129–40. doi:10.1080/00223891.2012.725437
37. Salminen JK, Saarijärvi S, Äärelä E, Toikka T, Kauhanen J. Prevalence of alexithymia and its association with sociodemographic variables in the general population of Finland. J Psychosom Res (1999) 46(1):75–82. doi:10.1016/S0022-3999(98)00053-1
38. Scherpenzeel A, Das M. True longitudinal and probability-based internet panels: evidence from the Netherlands. In: Das M, Ester P, Kaczmirek L, editors. Social and Behavioral Research and the Internet: Advances in Applied Methods and New Research Strategies. Boca Raton, FL: Taylor & Francis (2011). p. 77–103.
40. Emons WH. Nonparametric person-fit analysis of polytomous item scores. Appl Psychol Meas (2008) 32(3):224–47. doi:10.1177/0146621607302479
41. Emons WH, Sijtsma K, Meijer RR. Global, local, and graphical person-fit analysis using person-response functions. Psychol Methods (2005) 10(1):101–19. doi:10.1037/1082-989X.10.1.101
42. Horn JL. A rationale and test for the number of factors in factor analysis. Psychometrika (1965) 30(2):179–85. doi:10.1007/BF02289447
43. Ten Berge JM, Kiers HA. A numerical approach to the approximate and the exact minimum rank of a covariance matrix. Psychometrika (1991) 56(2):309–15. doi:10.1007/BF02294464
44. Timmerman ME, Lorenzo-Seva U. Dimensionality assessment of ordered polytomous items with parallel analysis. Psychol Methods (2011) 16(2):209. doi:10.1037/a0023353
45. Ferrando PJ, Lorenzo-Seva U. Unrestricted Item Factor Analysis and Some Relations with Item Response Theory. Tarragona: Department of Psychology, Universitat Rovira i Virgili (2013). Technical Report. Available from: http://psico.fcep.urv.cat/utilitats/factor (Accessed: August 25, 2015).
46. Muthen B, Kaplan D. A comparison of some methodologies for the factor analysis of non-normal Likert variables: a note on the size of the model. Br J Math Stat Psychol (1992) 45(1):19–30. doi:10.1111/j.2044-8317.1992.tb00975.x
47. Lorenzo-Seva U, Ferrando PJ. FACTOR: a computer program to fit the exploratory factor analysis model. Behav Res Methods (2006) 38(1):88–91. doi:10.3758/BF03192753
50. Revelle W. psych: Procedures for Personality and Psychological Research (R-package, Version 1.7.5) [Computer Program]. (2015). Available from: https://cran.r-project.org/src/contrib/Archive/psych/ (Accessed: August 25, 2015).
51. Cronbach LJ. Coefficient alpha and the internal structure of tests. Psychometrika (1951) 16(3):297–334. doi:10.1007/BF02310555
52. Feldt LS, Woodruff DJ, Salih FA. Statistical inference for coefficient alpha. Appl Psychol Meas (1987) 11(1):93–103. doi:10.1177/014662168701100107
53. Diedenhofen B. cocron: Statistical Comparisons of Two or More Alpha Coefficients (R-package, Version 1.0-1) [Computer Program]. (2016). Available from: http://comparingcronbachalphas.org/ (Accessed: November 18, 2016).
54. R Core Team. R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing (2014).
55. Haberman SJ. When can subscores have value? J Educ Behav Stat (2008) 33:204–29. doi:10.3102/1076998607302636
56. Robitzsch A. sirt: Supplementary Item Response Theory Models (R-package, Version 1.10-0) [Computer Program]. (2016). Available from: https://cran.r-project.org/src/contrib/Archive/sirt/ (Accessed: December 5, 2016).
57. Oosterhuis HE, van der Ark LA, Sijtsma K. Sample size requirements for traditional and regression-based norms. Assessment (2016) 23(2):191–202. doi:10.1177/1073191115580638
59. Cohen J. Statistical Power Analysis for the Behavioral Sciences. Hillsdale, NJ: Lawrence Erlbaum Associates (1988). p. 20–6.
60. Franz M, Popp K, Schaefer R, Sitte W, Schneider C, Hardt J, et al. Alexithymia in the German general population. Soc Psychiatry Psychiatr Epidemiol (2008) 43(1):54–62. doi:10.1007/s00127-007-0265-1
61. Joukamaa M, Saarijärvi S, Muuriaisniemi M-L, Salokangas RK. Alexithymia in a normal elderly population. Compr Psychiatry (1996) 37(2):144–7. doi:10.1016/S0010-440X(96)90576-3
62. Mattila AK, Kronholm E, Jula A, Salminen JK, Koivisto AM, Mielonen RL, et al. Alexithymia and somatization in general population. Psychosom Med (2008) 70(6):716–22. doi:10.1097/PSY.0b013e31816ffc39
63. Pasini A, Delle Chiaie R, Seripa S, Ciani N. Alexithymia as related to sex, age, and educational level: results of the Toronto Alexithymia Scale in 417 normal subjects. Compr Psychiatry (1992) 33(1):42–6. doi:10.1016/0010-440X(92)90078-5
64. Barbosa F, Mota C, Patrício P, Alcântara C, Ferreira C, Barbosa A. The relationship between alexithymia and psychological factors in systemic lupus erythematosus. Compr Psychiatry (2011) 52(6):754–62. doi:10.1016/j.comppsych.2010.11.004
65. Castelli L, Tesio V, Colonna F, Molinaro S, Leombruni P, Bruzzone M, et al. Alexithymia and psychological distress in fibromyalgia: prevalence and relation with quality of life. Clin Exp Rheumatol (2012) 30(6 Suppl 74):70–7.
66. Evren B, Evren C, Guler MH. Clinical correlates of alexithymia in patients with fibromyalgia. Pain Clin (2006) 18(1):1–9. doi:10.1163/156856906775249857
67. Lumley MA, Cohen JL, Borszcz GS, Cano A, Radcliffe AM, Porter LS, et al. Pain and emotion: a biopsychosocial review of recent research. J Clin Psychol (2011) 67(9):942–68. doi:10.1002/jclp.20816
68. Malt EA, Olafsson S, Lund A, Ursin H. Factors explaining variance in perceived pain in women with fibromyalgia. BMC Musculoskelet Disord (2002) 3(1):12. doi:10.1186/1471-2474-3-12
69. Van Middendorp H, Lumley MA, Jacobs JW, van Doornen LJ, Bijlsma JW, Geenen R. Emotions and emotional approach and avoidance strategies in fibromyalgia. J Psychosom Res (2008) 64(2):159–67. doi:10.1016/j.jpsychores.2007.08.009
70. Margalit D, Har LB, Brill S, Vatine J-J. Complex regional pain syndrome, alexithymia, and psychological distress. J Psychosom Res (2014) 77(4):273–7. doi:10.1016/j.jpsychores.2014.07.005
71. Tominaga T, Choi H, Nagoshi Y, Wada Y, Fukui K. Relationship between alexithymia and coping strategies in patients with somatoform disorder. Neuropsychiatr Dis Treat (2014) 10:55–62. doi:10.2147/ndt.s55956
72. Haviland MG, Shaw DG, Cummings MA, MacMurray JP. Alexithymia: subscales and relationship to depression. Psychother Psychosom (1988) 50(3):164–70. doi:10.1159/000288115
73. Honkalampi K, Hintikka J, Saarinen P, Lehtonen J, Viinamäki H. Is alexithymia a permanent feature in depressed patients? Psychother Psychosom (2000) 69(6):303–8. doi:10.1159/000012412
74. Craparo G, Faraci P, Gori A. Psychometric properties of the 20-item Toronto Alexithymia Scale in a group of Italian younger adolescents. Psychiatry Investig (2015) 12(4):500–7. doi:10.4306/pi.2015.12.4.500
75. Bagby RM, Ayearst LE, Morariu RA, Watters C, Taylor GJ. The internet administration version of the 20-item Toronto Alexithymia Scale. Psychol Assess (2014) 26(1):16. doi:10.1037/a0034316
Keywords: psychometric properties, alexithymia, Bermond–Vorst Alexithymia Questionnaire, validation study, somatic symptom and related disorders
Citation: de Vroege L, Emons WHM, Sijtsma K and van der Feltz-Cornelis CM (2018) Psychometric Properties of the Bermond–Vorst Alexithymia Questionnaire (BVAQ) in the General Population and a Clinical Population. Front. Psychiatry 9:111. doi: 10.3389/fpsyt.2018.00111
Received: 31 December 2017; Accepted: 19 March 2018;
Published: 23 April 2018
Edited by:
Michael Noll-Hussong, Universitätsklinikum des Saarlandes, Germany
Reviewed by:
Adriano Schimmenti, Kore University of Enna, Italy
Domenico De Berardis, Azienda Usl Teramo, Italy
Petr Bob, Charles University, Czechia
Copyright: © 2018 de Vroege, Emons, Sijtsma and van der Feltz-Cornelis. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Lars de Vroege, l.devroege@ggzbreburg.nl