- 1Center for Research in Neuropsychology and Cognitive and Behavioral Intervention (CINEICC), University of Coimbra, Coimbra, Portugal
- 2Psychological Assessment and Psychometrics Laboratory (PsyAssessmentLab), University of Coimbra, Coimbra, Portugal
- 3Universidade Europeia, Faculty of Health Sciences, Lisbon, Portugal
- 4Institute of Clinical and Forensic Psychology (MIND), Lisbon, Portugal
- 5Faculty of Psychology and Educational Sciences, University of Coimbra, Coimbra, Portugal
- 6College of Arts & Sciences, Texas A&M University Central Texas, Killeen, TX, United States
Introduction: Originally published in the United States of America in 1991, the Personality Assessment Inventory (PAI) has been translated and adapted to a growing number of countries, but Portugal had yet to study its adequacy to the Portuguese population.
Methods: The current study aimed to investigate the Portuguese normative data, the predictive effect of sociodemographic variables on the PAI scores, and the reliability of the Portuguese version of the PAI. Additionally, results were compared with other international versions of the PAI. The sample was comprised of 900 participants (age: M = 43.13, SD = 14.28, range = 18–75), recruited from various regions of Portugal.
Results: Findings showed that the Portuguese sample scored higher than the U.S. and other international versions of the PAI in most scales. Sociodemographic variables (e.g., gender, age, and educational level) were significant predictors on PAI scores. The internal consistency of the Portuguese sample revealed lower values on the validity scales, but adequate on the clinical, treatment, and interpersonal scales. Overall, the Portuguese PAI revealed adequate psychometric properties, with normative results often superior to other international versions of the inventory.
Discussion: It is a crucial step into the Portuguese adaptation and validation of this instrument, a measure with considerable potential in clinical, forensic, and research contexts. This adaptation may lead to the growth and development of the psychological assessment field in Portugal, and the opportunity to develop future cross-cultural studies with other international versions of the PAI.
Introduction
The Personality Assessment Inventory (PAI) is a multiscale self-report measure of psychopathology and personality developed in 1991 by Leslie Morey. This inventory aims to provide information on critical variables for diagnostic and clinical decision-making purposes. The PAI includes 344 items, organized into 22 non-overlapping scales, resulting from a careful theoretical and empiric review, and items within each scale were selected based on their substantial associations with relevant constructs (e.g., response validity, clinical syndromes, interpersonal style, treatment complications, and environment characteristics) while considering their minimal associations with items of different constructs. Thus, in contrast to other self-report personality inventories (e.g., the Minnesota Multiphasic Personality Inventories or the Millon Clinical Multiaxial Inventories), none of the PAI scales overlapped with the remaining ones (Morey and McCredie, 2019). The PAI includes four validity scales, 11 clinical scales, five treatment consideration scales, and two interpersonal scales (Morey, 1991) (see Table 1).
The four validity scales – Inconsistency (ICN), Infrequency (INF), Negative Impression (NIM), and Positive Impression (PIM) – were developed to measure systematic or random profile distortions (Morey and McCredie, 2020). The clinical scales measure the most relevant clinical domains in the field of mental illness, which gathered the most focus in contemporary diagnostic practices (Morey, 1991). These include the Somatic Complaints (SOM), Anxiety (ANX), Anxiety-Related Disorders (ARD), Depression (DEP), Mania (MAN), Paranoia (PAR), Schizophrenia (SCZ), Borderline Features (BOR), Antisocial Features (ANT), Alcohol Problems (ALC), and Drug Problems (DRG) scales (Morey, 1991). These scales (excepting the ALC and DRG scales) are organized into several subscales, each of them related to a particular clinical syndrome. The treatment consideration scales aim to assist professionals in decision-making process by providing information regarding the risk the examinees present, regarding themselves and others. These scales include the Aggression (AGG), Suicidal Ideation (SUI), Stress (STR), Nonsupport (NON), and Treatment Rejection (RXR) scales. Additionally, the AGG scale incorporates some subscales, covering different components of aggression (Morey, 1991). Lastly, the interpersonal scales – Dominance (DOM) and Warmth (WRM) – assess interpersonal behaviors, making them an important tool in the therapeutic process (Kiesler, 1996). The interpersonal scales also provide information concerning variations in normal personality and are sensitive to the presence of possible mental pathologies, following the belief that the most important expressions of personality occur in events that involve more than one person (Pincus, 2005).
In addition to the scales, several supplemental indexes are presented in the professional manual, namely the Malingering Index (MAL), the Rogers Discriminant Function (RDF), the Defensiveness Index (DEF), the Cashel Discriminant Function (CDF), the Suicide Potential Index (SPI), the Violence Potential Index, and the Treatment Process Index (TPI) (Morey, 1991). Even though the RDF and CDF have lost support in past decades as reliable indicators of response style (e.g., Hawes and Boccaccini, 2009), other indicators, such as the Multi-Feign Index (MFI; Gaines et al., 2013) and the Cognitive Bias Scale (CBS; Gaasedelen et al., 2019), have steadily been gaining support as relevant supplemental indexes (e.g., Ingram et al., 2024).
The PAI can be completed individually or in group resorting to an Answer Sheet or an online administration and scoring via PARiConnect. Minimal differences have been found between these administration methods, so results should be considered interchangeable if the appropriate guidelines are followed (Finger and Ones, 1999; Morey, 2007; American Psychological Association, 2020). For example, a recent study compared Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF) scores of police candidates who completed the test in an in-person condition with the scores of candidates who completed it remotely. The results revealed that the normative data and psychometric properties were equivalent regardless of whether the test was administered in-person or remotely, with group differences no greater than two T-score points per scale (Menton et al., 2022). Overall, a growing body of research has supported the use of telehealth alternatives to in-person assessment procedures, with minimal to no loss of scale effectiveness (e.g., Corey and Ben-Porath, 2020; Agarwal et al., 2023).
The PAI has shown great utility in different settings, such as research and training (e.g., Schroeder et al., 2016; Stedman et al., 2017), and in forensic (e.g., Ackerman et al., 2021) and neuropsychology contexts (e.g., Martin et al., 2015). For instance, in personality assessment training and practice settings, it is one of the most used instruments, providing clinical practice and higher self-report competency to trainees (e.g., Mihura et al., 2017; Ingram et al., 2022), and, in forensic cases, it appears to be a promising measure of malingering and a good predictor of inmate violence, misconduct, and recidivism (e.g., Reidy et al., 2016).
PAI reliability has been heavily studied and the professional manual (Morey, 1991) presented adequate internal consistency (full scale Cronbach alpha mean of 0.81 in a census sample, 0.82 in a college sample, and 0.86 in a clinical sample) and test–retest reliability (average r = 0.82 across the 22 scales). The lower Cronbach alphas belonged to two validity scales, particularly the Inconsistency (e.g., 0.23 in the clinical sample) and Infrequency (e.g., 0.40 in the clinical sample). Boyle and Lennon (1994) also found adequate alpha coefficients (median = 0.83) in a sample of 211 subjects. Similar findings were also reported by other international adaptations of the inventory. For example, Burneo-Garcés et al. (2020) found alpha coefficients ranging from 0.49 (PIM) to 0.89 (DRG) for the Spanish adaptation of the PAI, Groves and Engel (2007) reported alphas ranging from 0.26 (INF) to 0.91 (SOM) for the German version of the inventory, and Bach-Nguyen and Morey (2018) found alphas ranging from 0.43 (ICN) to 0.94 (SUI) for the Vietnamese version. In general, the lowest alphas belonged to the validity scales, particularly ICN and INF which were designed as measures of random error. Morey (1991) argued that because these two validity scales do not measure substantive theoretical constructs, but instead are comprised of indicators of random error variance, it would be expected that the items could be uncorrelated.
The influence of sociodemographic variables on the PAI scales (e.g., age, gender, and education) were observed in the original study of Morey (1991). The author found that age exhibited some influence on PAI scores, where younger adults scored significantly higher in PAR, BOR, and ANT. Interestingly, SOM scores appeared to be stable across age ranges, suggesting that this scale does not reflect the types of physical disorders experienced by elderly people. Differences in gender were small, except on the ANT and ALC scales, where men scored significantly more than women. Finally, education appeared to present consistent effect across scales; therefore, people with lower years of education scored, generally, above the mean. Despite these findings, no other studies that focused on the impact of sociodemographic variables on PAI scores in a normative sample were found.
The PAI has been translated and adapted to a growing number of countries, including China, Germany, Spain, Greece, Italy, Mexico, Argentina, Vietnam, Iran, South-Korea, and Canada. Of these international versions of the PAI, South-Korea presented the highest standardization sample (n = 2,212), while Iran resorted to the smallest one (n = 462).
Overall, the translation and adaptation of psychological tests allows the evaluation of people from different countries by well-established measures in their own language and culture. Furthermore, test translations and adaptations, if done according to recognized guidelines and procedures (e.g., International Test Commission (ITC), 2017), facilitate the development of cross-cultural studies, reducing costs and saving time in developing new tests, while providing increasingly robust data and support to the validity and reliability of the adapted measures (Iliescu, 2017; Hernández et al., 2020). To ensure an adequate test adaptation and validation, researchers should provide evidence supporting the norms, reliability, and validity of the adapted version, thereby guaranteeing that the obtained norms are adequate for interpreting test scores (Hernández et al., 2020). The establishment of norms is of particular importance, given that they are statistical data that summarize the distribution of test scores for one or more groups, with the aim of representing a given reference to the population (American Educational Research Association et al., 2014).
The current study aims to investigate the normative data, the predictive effect of sociodemographic variables on the scales scores, and the reliability of the Portuguese version of the PAI. We also compared the Portuguese version with other international versions.
Method
Participants
Data collection was carried out from April 2021 to March 2022. The initial sample was comprised of 2,362 participants, recruited online from various regions of Portugal (i.e., North, Center, Lisbon, Alentejo, Algarve, Madeira, and Azores). In accordance with the PAI Manual (Morey, 1991), participants were dropped out if the scores of the validity scales were above the cut-off scores. Sociodemographic variables were collected, and participants were selected based on specific stratified criteria (i.e., gender, age, education, and region) close to the Portuguese census (PORDATA, 2021).1 Participants were also excluded if they answered Yes to either of these two questions: “In the last year, did you receive Neurological or Psychiatric treatment?” or “In the last year, did you have any problems with the law?”
From the original 2,362 participants, 1,297 were eliminated from the final sample due to stratification criteria, and 165 were eliminated due to invalid profiles. Consequently, a total of 900 individuals were included in the final sample (Portuguese standardization sample of the PAI), 381 (42.30%) of which were male and 519 (57.70%) were female. Descriptive statistics for the sociodemographic characteristics of the final sample are reported in Table 2 and cumulative elevation frequencies related to various T score cutoffs across gender are presented in Table 3.
Table 3. PAI normative data for the Portuguese sample by gender (raw scores) and frequency of scale elevations (T-scores).
Participation was voluntary and all participants read and signed the informed consent form prior to taking part in the study. Participants did not receive compensation for their involvement in the study.
Measures
Personality Assessment Inventory (PAI; Morey, 1991). The PAI is a self-report personality inventory comprised of 344 items, organized into 22 scales (i.e., 4 validity scales, 11 clinical scales, 5 treatment consideration scales, and 2 interpersonal scales), and 10 subscales (Morey, 1991). The Portuguese version of the PAI was developed and made available by Hogrefe Publishing, which approved the current adaptation and validation study (e.g., translation, back translation, preliminary version, and normative data).
Statistical analyses
The statistical analyses were performed using the IBM Statistical Package for the Social Sciences Statistics (SPSS) version 26. Group differences were analyzed using Student’s t test and analysis of variance (ANOVA, Tukey post hoc). Cohen’s d and partial η2 were calculated to determine the effect size of the difference between groups.
To determine the influence of gender, age, and education level on PAI scores, linear regression analyses were also conducted. The total variance (R2) of the regression model and the standardized (β) regression coefficients for each independent variable were calculated.
Results
Portuguese normative data and comparison with other international versions
The M and SD of the PAI raw scores for the Portuguese standardization sample are presented in Table 4. As expected, the raw scores of the clinical scales were low when compared to the maximum scores. As Morey (1991) suggested, these scales measure pathological content, which is expected to be limited in a community-based sample (Morey, 1991).
Table 4. Descriptive statistics for the Portuguese standardization sample and for international samples raw scores
A one-sample t-test was conducted to compare the raw scores of all the 22 PAI scales for the Portuguese version to other international versions. The raw scores of the Portuguese sample were statistically higher than the U.S. raw scores, except for the MAN, ANT, ALC, DRG, AGG, DOM, and WRM scales (U.S. > Portuguese, p < 0.05) (see Table 4). Non-significant differences were only found in three scales: ICN, ANT, and SUI. The magnitude of the differences ranged from very small (e.g., d = 0.00 in the ICN scale) to medium (e.g., d = 0.62 in the PAR scale).
Table 4 also presents comparisons between raw scores of the Portuguese version and other two European versions (Greek and German). Compared to the Portuguese sample, the Greek normative sample scored lower in two validity scales (NIM and PIM). In the clinical scales the Portuguese version revealed statistically higher scores in all scales, except for MAN, ANT, ALC, and DRG (Greek > Portuguese, p < 0.001). The same can be found for the treatment scales, where the Portuguese sample scored higher, except for in AGG. Finally, with regards to the interpersonal scales, the Greek sample scored higher in WRM. Effect sizes ranged from d = 0.03 to 0.65. In the comparisons between the Portuguese and German versions, we found that the Portuguese sample scored statistically higher in all validity and clinical scales except for PIM, MAN, DRG, and ALC (German > Portuguese, p < 0.001). Similarly, the Portuguese sample also scored higher in all treatment and interpersonal scales, except for RXR and WRM. Effect sizes ranged from d = 0.02 to 0.81.
Differences and predictive effect of sociodemographic variables over the PAI scales
The analysis of the relationships between the PAI scales and sociodemographic variables showed that gender, age, and educational level had significant effects in some PAI scales.
As shown in Table 3, gender appears to exhibit some degree of influence on PAI scores. Men scored significantly higher in ICN, INF, MAN, PAR, ANT, ALC, DRG, AGG, and NON, while women scored higher in SOM, ANX, ARD, and WRM. No significant differences were found for the remaining scales. Following methodology employed by Ingram et al. (2020) in their study with the MMPI-2-RF, cumulative elevation frequencies associated with various T score cutoffs, across gender, are presented in Table 3.
In relation to age, the youngest age group (i.e., between 18 and 24 years) scored higher on the ICN, ANX, ARD, DEP, MAN, PAR, SCZ, BOR, ANT, DRG, SUI, STR, and NON scales when compared to older age groups In contrast, older age groups (e.g., between 60 and 74 years) scored significantly higher in PIM, AGG, RXR, DOM, and WRM. In the NIM scale mixed findings were found with participants aged 18 to 24 scoring higher when compared to people older than 40 years, but individuals aged 60 to 74 scored higher when compared to people aged 25 to 39 (see Table 5). Effect sizes ranged from small (partial η2 = 0.002 for SOM) to medium to (partial η2 = 0.090 for ANX) (Cohen, 1988; Richardson, 2011).
Education level also appears to have an effect across scales, but mostly for participants who attended 10th to 12th grades. As seen in Table 6, this group scored significantly higher in ICN, NIM, SOM, ANX, ARD, DEP, PAR, SCZ, BOR, ANT, AGG, SUI, STR and NON, when compared to participants who attended University. On the other hand, individuals who attended 7th to 9th grade scored higher than people who attended University in INF, ARD, PAR, and SCZ. Participants who attended University only registered higher scores in RXR and DOM when compared with participants that attended 10th to 12th grades. Effect sizes ranged from small (partial η2 = 0.001 for MAN) to medium (partial η2 = 0.057 for PAR) (Cohen, 1988; Richardson, 2011).
Linear regression analyses were also conducted to analyze the predictive effect of gender, age, and education level on PAI scores. The beta weights, presented in Table 7, suggested that gender contributes most to predicting scores in INF, MAN, ANT, ALC, DRG, AGG, and NON. Age was the most relevant predictor in ICN, NIM, PIM, ANX, ARD, SCZ, BOR, SUI, RXR, DOM, and WRM. On the other hand, education level contributes most in predicting scores in SOM, DEP, PAR, and STR. The highest adjusted mean R2 value is 0.13 for ALC, which indicates that 13% of the variance on the ALC scores was explained by these three predictors.
Reliability
Table 8 shows the internal consistency (Cronbach alpha) for the PAI scales of the Portuguese standardization sample and other international versions. Overall, the internal consistency of the Portuguese standardization sample revealed lower Cronbach alphas on the validity scales (0.08–0.68), but adequate on the clinical (0.73–0.92, except for DRG), treatment (0.75–0.91, except for STR), and interpersonal (0.78–0.82) scales. We found lower alphas on the DRG and STR scales (0.58 and 0.63, respectively), and in the four validity scales.
The Portuguese PAI’s internal consistency was also compared to other international versions, namely the U.S., German, Spanish, South-Korean, Chilean, Greek, and Vietnamese versions. Overall, the results showed that the Portuguese version of the PAI showed higher Cronbach alphas than the other international versions in most clinical scales (i.e., ANX, ARD, DEP, MAN, PAR, SCZ, BOR, ANT, ALC, and DRG), except when compared to the Vietnamese version, which revealed similar alphas for ANX, SCZ, and BOR. Internal consistency in the validity scales was somewhat lower than in other countries, particularly when compared to the Greek sample. Regarding the interpersonal scales, the Portuguese version had the second highest Cronbach alpha in WRM, and the third highest in DOM.
Discussion and conclusions
Psychological test translation and adaptation is a growing phenomenon in the last couple of decades, and relates with several areas of study, including the study of personality. The MMPI (also adapted for the Portuguese population) is the most used measure worldwide (Friedman et al., 2014). Nevertheless, it is worth recognizing that the PAI has been gaining popularity since its birth as a useful, valid, reliable, and robust measure of personality, with multiple benefits and with great potential in several contexts (e.g., clinical, forensic). The PAI has been increasingly referenced in empirical studies centered around the use of personality measures as the second most used inventory in forensic settings (e.g., Ackerman et al., 2021), in the neuropsychology context (e.g., Martin et al., 2015), in the study of malingering (e.g., Schroeder et al., 2016), and in the training of recently graduated psychologists (e.g., Stedman et al., 2017). In the clinical context, it is also widely recognized as the third most used measure of personality (after MMPI-2/MMPI-2-RF and MCMI-III) (e.g., Wright et al., 2017).
PAI’s adaptation and validation to the Portuguese population is of great interest, not only because it would allow further research focused on the PAI, supporting its growth and dissemination, but also because it would allow Portuguese psychologists to use a valid and robust alternative to the MMPI-2-RF, that benefits from adequate reliability (i.e., internal consistency and retest values) and validity (i.e., discriminant and convergent). Compared to measures such as the MMPI-2 and MCMI-III, the PAI use of non-overlapped scales provides distinct advantages, particularly with respect of discriminant validity (Wise et al., 2010). Additionally, in forensic settings its use is particularly valuable due to its ability to correctly predict inmate violence and institutional misconduct (e.g., Reidy et al., 2016).
Comparing the raw scores of the Portuguese and U.S. versions of the PAI, we verified that, in general, the Portuguese sample manifested significantly higher scores in all scales, except in the MAN, ALC, DRG, AGG, DOM, and WRM scales. Furthermore, the largest effect sizes were found in the PAR, ANX, and DEP scales. The Portuguese sample also had higher scores in all clinical samples than the German (Groves and Engel, 2007) and Greek (Lyrakos, 2011) standardization samples, except for SOM, ALC, MAN, ANT, and DRG. These higher scores in the clinical scales may be due to nuances of translation, cultural differences related to the concepts measured by the scales, sampling issues related to the use of an online sample for normative data collection, and/or due to the higher prevalence of mental health disorders found in Portugal. Indeed, Portugal is the second European country with the highest psychiatric disorder prevalence, and anxiety disorders are particularly common (Pinto-Meza et al., 2012; Antunes et al., 2018).
Regarding to gender, women appeared to score significantly higher than men in SOM, ANX, ARD, and WRM, and men scored higher in ICN, INF, MAN, PAR, ANT, ALC, DRG, AGG, and NON. These results are in accordance with literature that associates a higher prevalence of somatization problems (e.g., Wool and Barsky, 1994) and anxiety (e.g., McLean et al., 2011) in women; but manic episodes (e.g., Azorin et al., 2013), antisociality (e.g., Glenn et al., 2013), alcoholism (e.g., Ceylan-Isik et al., 2010), and drug abuse (e.g., Cotto et al., 2010) in men. Furthermore, our results are in line with the original manual of the PAI, where Morey (1991) also found higher scores in the SOM, ANX, ARD, and WRM scales in women, but lower in ICN, INF, MAN, PAR, ANT, ALC, DRG, AGG, and NON.
Differences in age showed that younger participants appeared to score higher in scales such as ICN, NIM, ANX, ARD, DEP, MAN, PAR, SCZ, BOR, ANT, DRG, SUI, STR, and NON, and that older participants tended to score higher in PIM, AGG, RXR, DOM, and WRM. These data were consistent with Morey’s (2007) findings that younger adults tend to score higher in PAR, BOR and ANT, and also with other studies that attributes less personality stability in younger ages and that recognizes a greater prevalence of personality disorders in these age groups (e.g., Johnson et al., 2000).
The effect of education level on PAI scores was unexpected. Contrary to what is reported in the original manual, in our study, the groups with lower years of education (i.e., 1st to 6th grade) did not score higher than the groups with higher levels of education (Morey, 1991). Instead, our results revealed significant differences between respondents who completed 7th to 9th grades and individuals who attended University (7th to 9th grades scored higher in INF, ANX, PAR, and SCZ), and between participants who completed 10th to 12th and those who attended University (10th to 12th grades scored higher in ICN, NIM, SOM, ANX, ARD, DEP, PAR, SCZ, BOR, ANT, AGG, SUI, STR, and NON; but participants who attended University scored higher in RXR and WRM). Therefore, although these results generally reproduce the expected effect (i.e., lower education level leading to higher scores), they do not reproduce it in the way it was expected (i.e., participants who completed 1st to 6th grade did not score significantly higher than people who attended University), which could reflect differences in expectations around education in the two cultures.
The present study also aimed to investigate the psychometric properties of the PAI to the Portuguese population by analyzing its reliability. Furthermore, similarly to what other studies with international versions of the PAI did (e.g., Jeffay et al., 2021, for the French-Canadian version), we compared the newly developed Portuguese version with the original U.S. sample, as well as with other international adaptations of the inventory. Our sample (n = 900), albeit smaller than some international samples (e.g., Chinese, n = 1884; Italian, n = 1,548; French-Canadian, n = 1,120), is similar to the Argentinian (n = 998) and Mexican (n = 961) samples, and higher than other international samples (e.g., German, n = 749; Chilean, n = 569; Iranian, n = 462).
Regarding to the Portuguese PAI’s reliability, the results indicated overall adequate internal consistency. The Portuguese version revealed higher Cronbach alphas than the U.S. version in 9 out of 22 scales of the PAI, but lower values in the remaining 13 scales (Cronbach alphas <0.70 were identified in 7 scales: ICN, INF, NIM, PIM, DRG, and STR). Some Cronbach alphas were very low, particularly ICN, INF, and DRG; however, most of these scales are validity scales, which is consistent with the assertion that the ICN and INF scales possibly measuring variance error indicators and, therefore, leading to low internal consistency values (Morey, 1991). The lower Cronbach alphas observed in the NIM, PIM, and DRG scales were similar, albeit smaller, than the values found in the German version (Groves and Engel, 2007), and the DRG alpha was similar to the South-Korean (Lee et al., 2020) and higher than the French-Canadian alphas (Jeffay et al., 2021). The smaller STR values were also similar to the alphas reported in the Chilean version (Ortiz-Tallo et al., 2011).
The reliability data also suggested that the Portuguese version showed higher alphas in most clinical scales (except for SOM, ARD, MAN, ANT, ALC, and DRG), when compared to the other versions. The Vietnamese version, however, presented the same values in ANX, SCZ, and BOR. The highest internal consistency values in the Portuguese version of the PAI (i.e., > 0.85) belong to the SOM, ANX, DEP, PAR, BOR, and SUI scales, and these alphas were overall higher than in other international versions. In general, the scales of the Portuguese demonstrated internal consistency of the same or greater magnitude than the other international versions of the PAI.
Despite the strengths of the present study, it is not without some limitations. The sample was collected entirely online, which may be problematic. Indeed, online data collection has been a growing phenomenon in the last decade, because of an increase in demands for higher statistical power associated with larger sample sizes (Sassenberg and Ditrich, 2019). Online experiments have several advantages by reducing costs, providing procedure uniformity, and allowing wider accessibility, while also maintaining ethical standards (Dandurand et al., 2008). So, although it may bring some disadvantages (e.g., increased environmental variability during testing, possible multiple submissions, and bigger drop-out rates), many of these can be easily avoidable (e.g., by asking participants to participate in the study in a particular environment, by asking for personal data, and by using IP address verification). In addition, some studies have shown that self-selected web samples do not differ systematically from samples obtained from other ways (e.g., in-person), even for cognitive and perceptual demanding tasks (Dandurand et al., 2008; Germine et al., 2012; Hilbig, 2016; Corey and Ben-Porath, 2020; Agarwal et al., 2023), although other studies using the PAI suggest that online participants may be somewhat more socially withdrawn than those participating in in-person studies (McCredie and Morey, 2019). Future studies should compare results between the data collected online and in-person, to check for possible differences in the normative data and psychometric properties. The analysis of normative data on supplemental indexes (e.g., MFI, CBS) would also contribute to validation efforts of these response style measures and reinforce their utility in many settings. An analysis of creating norms based on sociodemographic variables such as gender, age, and educational level may be equally relevant. Differences in scores among minority groups should be further examined in future studies, with regards to clinical and validity scales. Furthermore, it would be also interesting to include forensic samples (e.g., victims, offenders, forensic professionals) and clinical samples, considering that they may influence the psychometric properties and the normative data of the PAI.
In conclusion, this study provides evidence regarding the psychometric properties of the Portuguese version of the PAI. It is a crucial step into the Portuguese adaptation and validation of this instrument, a measure with considerable potential in clinical, forensic, and research contexts. This adaptation may lead to the growth and development of the psychological assessment field in Portugal, and the opportunity to develop future cross-cultural studies with other international versions of the PAI.
Data availability statement
The supporting data presented in this article is not readily available because of ethical and commercial rights (Hogrefe Publisher, Portugal). Requests to access the datasets should be directed to Hogrefe Publisher (Portugal).
Ethics statement
The studies involving humans were approved by Faculty of Psychology and Education Sciences of the University of Coimbra Ethics Board (CEDI/FPCEUC:78/R_5). The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
MP: Writing – original draft, Writing – review & editing. MM: Writing – original draft, Writing – review & editing. OM: Investigation, Methodology, Supervision, Writing – review & editing. DR: Conceptualization, Writing – review & editing. LM: Conceptualization, Formal analysis, Supervision, Validation, Writing – review & editing. MS: Conceptualization, Formal analysis, Investigation, Supervision, Validation, Writing – review & editing, Writing – original draft.
Funding
The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.
Conflict of interest
In accordance with our ethical obligation as researchers, we are reporting that the author LM is the author of the Personality Assessment Inventory and receives royalties from its sale. MP, MM, OM, DR, and MS are part of the team responsible for adapting the Personality Assessment Inventory, in Portugal. The authors received no financial support for the research, authorship, and/or publication of this article.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Footnotes
1. ^For example, according to the Portuguese Census, in 2021, 47.6% of the Portuguese population was male, and 52.4% was female.
References
Ackerman, M. J., Bow, J. N., and Mathy, N. (2021). Child custody evaluation practices: where we were, where we are, and where we are going. Prof. Psychol. Res. Pract. 52, 406–417. doi: 10.1037/pro0000380
Agarwal, L. P., Keen, M. A., Morris, C. S., and Ingram, P. B. (2023). Contrasting MMPI-3 validity scale effectiveness differences across in-person and telehealth administration procedures. Psychol. Assess. 35, 925–937. doi: 10.1037/pas0001258
American Educational Research Association, American Psychological Association, & National Council on Measurement in Education (2014). Standards for educational and psychological testing. Washington, DC: American Educational Research Association.
American Psychological Association . (2020). Guidance on psychological tele-assessment during the COVID-19 crisis. Available at: https://www.apaservices.org/practice/reimbursement/health-codes/testing/tele-assessment-covid-19
Antunes, A., Frasquilho, D., Azeredo-Lopes, S., Neto, D., Silva, M., Cardoso, G., et al. (2018). Disability and common mental disorders: results from the world mental health survey initiative Portugal. Eur. Psychiatry 49, 56–61. doi: 10.1016/j.eurpsy.2017.12.004
Azorin, J. M., Belzeaux, R., Kaladjian, A., Adida, M., Hantouche, E., Lancrenon, S., et al. (2013). Risks associated with gender differences in bipolar I disorder. J. Affect. Disord. 151, 1033–1040. doi: 10.1016/j.jad.2013.08.031
Bach-Nguyen, M., and Morey, Leslie C. (2018). Assessing mental health in Vietnam with the Personality Assessment Inventory: Cross-cultural comparability, International. Int. J. Cult. Ment. Health. doi: 10.1080/17542863.2018.1428641
Boyle, G. J., and Lennon, T. J. (1994). Examination of the reliability and validity of the personality assessment inventory. J. Psychopathol. Behav. Assess. 16, 173–187. doi: 10.1007/BF02229206
Burneo-Garcés, C., Fernández-Alcántara, M., and Aguayo-Estremera, R. and Pérez-García, M. (2020). Psychometric properties of the spanish adaptation of the personality assessment inventory in orrectional settings: an esem study. J Pers Assess. 102, 75–87. doi: 10.1080/00223891.2018.1481858
Ceylan-Isik, A. F., McBride, S. M., and Ren, J. (2010). Sex difference in alcoholism: who is at a greater risk for development of alcoholic complication? Life Sci. 87, 133–138. doi: 10.1016/j.lfs.2010.06.002
Cohen, J. W. (1988). Statistical power analysis for the behavioral sciences (2nd edition). Hillsdale, NJ: Lawrence Erlbaum Associates.
Corey, D. M., and Ben-Porath, Y. S. (2020). Practical guidance on the use of the MMPI instruments in remote psychological testing. Prof. Psychol. Res. Pract. 51, 199–204. doi: 10.1037/pro0000329
Cotto, J. H., Davis, E., Dowling, G. J., Elcano, J. C., Staton, A. B., and Weiss, S. R. (2010). Gender effects on drug use, abuse, and dependence: a special analysis of results from the National Survey on drug use and health. Gend. Med. 7, 402–413. doi: 10.1016/j.genm.2010.09.004
Dandurand, F., Shultz, T. R., and Onishi, K. H. (2008). Comparing online and lab methods in a problem-solving experiment. Behav Res Methods 40, 428–434. doi: 10.3758/BRM.40.2.428
Finger, M. S., and Ones, D. S. (1999). Psychometric equivalence of the computer and booklet forms of the MMPI: a meta-analysis. Psychol. Assess. 11, 58–66. doi: 10.1037/1040-3590.11.1.58
Friedman, A. F., Bolinskey, P. K., Levak, R. W., and Nichols, D. S. (2014). Psychological assessment with the MMPI-2/MMPI-2-RF. New York, NY: Routledge.
Gaasedelen, O. J., Whiteside, D. M., Altmaier, E., Welch, C., and Basso, M. R. (2019). The construction and the initial validation of the cognitive Bias scale for the personality assessment inventory. Clin. Neuropsychol. 33, 1467–1484. doi: 10.1080/13854046.2019.1612947
Gaines, M. V., Giles, C. L., and Morgan, R. D. (2013). The detection of feigning using multiple PAI scale elevations: a new index. Assessment 20, 437–447. doi: 10.1177/1073191112458146
Germine, L., Nakayama, K., Duchaine, B. C., Chabris, C. F., Chatterjee, G., and Wilmer, J. B. (2012). Is the web as good as the lab? Comparable performance from web and lab in cognitive/perceptual experiments. Psychon. Bull. Rev. 19, 847–857. doi: 10.3758/s13423-012-0296-9
Glenn, A. L., Johnson, A. K., and Raine, A. (2013). Antisocial personality disorder: a current review. Curr. Psychiatry Rep. 15, 1–8. doi: 10.1007/s11920-013-0427-7
Groves, J. A., and Engel, R. R. (2007). The German adaptation and standardization of the personality assessment inventory (PAI). J. Pers. Assess. 88, 49–56. doi: 10.1080/00223890709336834
Hawes, S. W., and Boccaccini, M. T. (2009). Detection of overreporting of psychopathology on the personality assessment inventory: a meta-analytic review. Psychol. Assess. 21, 112–124. doi: 10.1037/a0015036
Hernández, A., Hidalgo, M. D., Hambleton, R. K., and Gómez-Benito, J. (2020). International test commission guidelines for test adaptation: a criterion checklist. Psicothema 32, 390–398. doi: 10.7334/psicothema2019.306
Hilbig, B. E. (2016). Reaction time effects in lab-versus web-based research: experimental evidence. Behav Res Methods 48, 1718–1724. doi: 10.3758/s13428-015-0678-9
Iliescu, D. (2017). Adapting tests in linguistic and cultural situations. New York: Cambridge University Press.
Ingram, P. B., Armistead-Jehle, P., Herring, T. T., and Morris, C. S. (2024). Cross validation of the personality assessment inventory (PAI) cognitive Bias scale of scales (CB-SOS) over-reporting indicators in a military sample. Mil. Psychol. 36, 192–202. doi: 10.1080/08995605.2022.2160151
Ingram, P. B., Schmidt, A. T., Bergquist, B. K., and Currin, J. M. (2022). Coursework, instrument exposure, and perceived competence in psychological assessment: a national survey of practices and beliefs of health service psychology trainees. Train. Edu.Prof. Psychol 16, 10–19. doi: 10.1037/tep0000348
Ingram, P. B., Tarescavage, A. M., Ben-Porath, Y. S., and Oehlert, M. E. (2020). Patterns of MMPI-2-restructured form (MMPI-2-RF) validity scale scores observed across veteran affairs settings. Psychol. Serv. 17, 355–362. doi: 10.1037/ser0000339
International Test Commission (ITC) (2017). ITC guidelines for translating and adapting tests. Int. J. Test. 18, 101–134. doi: 10.1080/15305058.2017.1398166
Jeffay, E., Sekely, A., Lacerte, M., and Zakzanis, K. K. (2021). Reliability of the French-Canadian adaptation of the personality assessment inventory: medical–legal implications. Psychiatry Psychol. Law 28, 135–148. doi: 10.1080/13218719.2020.1767716
Johnson, J. G., Cohen, P., Kasen, S., Skodol, A. E., Hamagami, F., and Brook, J. S. (2000). Age-related change in personality disorder trait levels between early adolescence and adulthood: a community-based longitudinal investigation. Acta Psychiatr. Scand. 102, 265–275. doi: 10.1034/j.1600-0447.2000.102004265.x
Kiesler, D. (1996). Contemporary interpersonal theory and research: Personality, psychopathology, and psychotherapy. New York: Wiley.
Lee, S., Lee, E., Kim, J., Hong, S., Hwang, S., Kweon, H., et al. (2020). Restandardization of the Korean personality assessment inventory: comparisons with the original Korean version. J. Korean Neuropsychiatr. Assoc. 59:142. doi: 10.4306/jknpa.2020.59.2.142
Lyrakos, D. G. (2011). The development of the Greek personality assessment inventory. Psychology 02, 797–803. doi: 10.4236/psych.2011.28122
Martin, P. K., Schroeder, R. W., and Odland, A. P. (2015). Neuropsychologists’ validity testing beliefs and practices: a survey of north American professionals. Clin. Neuropsychol. 29, 741–776. doi: 10.1080/13854046.2015.1087597
McCredie, M. N., and Morey, L. C. (2019). Who are the Turkers? A characterization of MTurk workers using the personality assessment inventory. Assessment 26, 759–766. doi: 10.1177/1073191118760709
McLean, C. P., Asnaani, A., Litz, B. T., and Hofmann, S. G. (2011). Gender differences in anxiety disorders: prevalence, course of illness, comorbidity and burden of illness. J. Psychiatr. Res. 45, 1027–1035. doi: 10.1016/j.jpsychires.2011.03.006
Menton, W. H., Corey, D. M., and Ben-Porath, Y. S. (2022). Evidence for the comparability of local and remote administrations of the MMPI-2-RF in police candidate evaluations. Psychol. Assess. 34, 98–104. doi: 10.1037/pas0001088
Mihura, J. L., Roy, M., and Graceffo, R. A. (2017). Psychological assessment training in clinical psychology doctoral programs. J. Pers. Assess. 99, 153–164. doi: 10.1080/00223891.2016.1201978
Morey, L. C. (1991). The personality assessment inventory: Professional manual. Odessa, FL: Psychological Assessment Resources.
Morey, L. C. (2007). The personality assessment inventory: Professional manual. 2nd Edn: Odessa, FL: Psychological Assessment Resources.
Morey, L. C., and McCredie, M. N. (2019). “Personality assessment inventory” in The Cambridge handbook of clinical assessment diagnosis. eds. M. Sellbom and J. A. Suhr (Cambridge: Cambridge University Press), 231–248.
Morey, L. C., and McCredie, M. N. (2020). “Personality assessment inventory” in Encyclopedia of personality and individual differences. eds. V. Zeigler-Hill and T. K. Shackelford (Cham: Springer International Publishing), 3739–3743.
Ortiz-Tallo, M., Santamaría, P., Cardenal, V., and Sánchez, M. P. (2011). Adaptación española del Inventario de Evaluación de la Personalidad (PAI). Madrid: TEA Ediciones.
Pincus, A. L. (2005). “A contemporary integrative interpersonal theory of personality disorders” in Major theories of personality disorder. eds. M. F. Lenzenweger and J. F. Clarkin (New York: Guilford Press), 282–331.
Pinto-Meza, A., Moneta, M. V., Alonso, J., Angermeyer, M. C., Bruffaerts, R., Almeida, J. M. C., et al. (2012). Social inequalities in mental health: results from the EU contribution to the world mental health surveys initiative. Soc. Psychiatry Psychiatr. Epidemiol. 48, 173–181. doi: 10.1007/s00127-012-0536-3
PORDATA (2021). População residente: Censos. [Resident population: Census] Available at: https://www.pordata.pt/subtema/portugal/populacao+residente-558
Reidy, T. J., Sorensen, J. R., and Davidson, M. (2016). Testing the predictive validity of the personality assessment inventory (PAI) in relation to inmate misconduct and violence. Psychol. Assess. 28, 871–884. doi: 10.1037/pas0000224
Richardson, J. T. (2011). Eta squared and partial eta squared as measures of effect size in educational research. Educ. Res. Rev. 6, 135–147. doi: 10.1016/j.edurev.2010.12.001
Sassenberg, K., and Ditrich, L. (2019). Research in social psychology changed between 2011 and 2016: larger sample sizes, more self-report measures, and more online studies. Adv. Methods Pract. Psychol. Sci. 2, 107–114. doi: 10.1177/2515245919838781
Schroeder, R. W., Martin, P. K., and Odland, A. P. (2016). Expert beliefs and practices regarding neuropsychological validity testing. Clin. Neuropsychol. 30, 515–535. doi: 10.1080/13854046.2016.1177118
Stedman, J. M., McGreary, C. A., and Essery, J. (2017). Current patterns of training in personality assessment during internship. J. Clin. Psychol. 74, 398–406. doi: 10.1002/jclp.22496
Wise, E. A., Streiner, D. L., and Walfish, S. (2010). A review and comparison of the reliabilities of the MMPI-2, MCMI-III, and PAI presented in their respective test manuals. Meas. Eval. Couns. Dev. 42, 246–254. doi: 10.1177/0748175609354594
Wool, C. A., and Barsky, A. J. (1994). Do women somatize more than men?: gender differences in somatization. Psychosomatics 35, 445–452. doi: 10.1016/S0033-3182(94)71738-2
Keywords: Personality Assessment Inventory (PAI), Portuguese validation, psychometric properties, normative data, reliability
Citation: Paulino M, Moniz M, Moura O, Rijo D, Morey L and Simões MR (2024) Psychometric properties of the Portuguese version of the Personality Assessment Inventory: normative data and reliability. Front. Psychol. 15:1359793. doi: 10.3389/fpsyg.2024.1359793
Edited by:
Sao Luis Castro, University of Porto, PortugalReviewed by:
Paul Ingram, Texas Tech University, United StatesFranca Crippa, University of Milano-Bicocca, Italy
Copyright © 2024 Paulino, Moniz, Moura, Rijo, Morey and Simões. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Mauro Paulino, bXBhdWxpbm9fcHNpY0B5YWhvby5jb20=