Skip to main content

ORIGINAL RESEARCH article

Front. Immunol., 08 January 2024
Sec. Autoimmune and Autoinflammatory Disorders : Autoimmune Disorders
This article is part of the Research Topic Community Series in Towards Precision Medicine for Immune-Mediated Disorders: Advances in Using Big Data and Artificial Intelligence to Understand Heterogeneity in Inflammatory Responses, Volume II View all 13 articles

Large-scale epidemiological analysis of common skin diseases to identify shared and unique comorbidities and demographic factors

  • 1Department of Biostatistics, University of Michigan, Ann Arbor, MI, United States
  • 2Department of Dermatology, University of Michigan, Ann Arbor, MI, United States
  • 3The Center for Autoimmune Genomics and Etiology, Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, United States
  • 4Rheumatology, Internal Medicine, University of Michigan, Ann Arbor, MI, United States
  • 5Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, United States

Introduction: The utilization of large-scale claims databases has greatly improved the management, accessibility, and integration of extensive medical data. However, its potential for systematically identifying comorbidities in the context of skin diseases remains unexplored.

Methods: This study aims to assess the capability of a comprehensive claims database in identifying comorbidities linked to 14 specific skin and skin-related conditions and examining temporal changes in their association patterns. This study employed a retrospective case-control cohort design utilizing 13 million skin/skin-related patients and 2 million randomly sampled controls from Optum’s de-identified Clinformatics® Data Mart Database spanning the period from 2001 to 2018. A broad spectrum of comorbidities encompassing cancer, diabetes, respiratory, mental, immunity, gastrointestinal, and cardiovascular conditions were examined for each of the 14 skin and skin-related disorders in the study.

Results: Using the established type-2 diabetes (T2D) and psoriasis comorbidity as example, we demonstrated the association is significant (P-values<1x10-15) and stable across years (OR=1.15-1.31). Analysis of the 2014-2018 data reveals that celiac disease, Crohn’s disease, and ulcerative colitis exhibit the strongest associations with the 14 skin/skin-related conditions. Systemic lupus erythematosus (SLE), leprosy, and hidradenitis suppurativa show the strongest associations with 30 different comorbidities. Particularly notable associations include Crohn’s disease with leprosy (odds ratio [OR]=6.60, 95% confidence interval [CI]: 3.09-14.08), primary biliary cirrhosis with SLE (OR=6.07, 95% CI: 4.93-7.46), and celiac disease with SLE (OR=6.06, 95% CI: 5.49-6.69). In addition, changes in associations were observed over time. For instance, the association between atopic dermatitis and lung cancer demonstrates a marked decrease over the past decade, with the odds ratio decreasing from 1.75 (95% CI: 1.47-2.07) to 1.02 (95% CI: 0.97-1.07). The identification of skin-associated comorbidities contributes to individualized healthcare and improved clinical management, while also enhancing our understanding of shared pathophysiology. Moreover, tracking these associations over time aids in evaluating the progression of clinical diagnosis and treatment.

Discussion: The findings highlight the potential of utilizing comprehensive claims databases in advancing research and improving patient care in dermatology.

1 Introduction

Dermatological disorders are among the most common human diseases: more than a third of the global population suffers from some form of skin condition (15). While most skin disorders are not fatal, the burden on patients and society is severe; in fact, skin disorders are ranked the fourth leading cause of nonfatal disease burden globally (1). For instance, in a previous study, 60% of working patients noted significant work time lost, and 40% of non-working patients attributed their lack of work to psoriasis (6). In 1984, it was estimated that the cost for 2.3 million psoriasis outpatients in the US reached $1.5 billion per year (7), and a recent study reviewing the yearly cost for psoriasis nationwide increased the estimate to a range between $51.7 and $63.2 billion (8). Atopic dermatitis (AD) is another common skin condition that affects over 30 million patients in the US with a total annual cost of $4.2 billion in 2004 and $5.4 billion in 2016 (9). Although systemic lupus erythematosus (SLE), in which up to 70% patients exhibit skin manifestations, is relatively less common with a prevalence rate of around 10 per 10,000 in the US (10), the economic burden is significant, with a total annual cost estimated to be $13,735-$20,926 per patient (11). With these significant medical burden for the wide spectrum of dermatological disorders (12), the prevention and treatment of these conditions are critical issues for public health.

The associated comorbidities (i.e. co-occurrence of two different diseases (13)) for skin conditions contribute significantly to health and social burden. Numerous studies have found that skin disorders can be early manifestations of systemic diseases (13). Thus, it is important to assess patients’ risk for having other conditions in addition to their primary skin disorder; furthermore, understanding skin-associated comorbidities can further the development of better healthcare management (14) by facilitating early diagnosis of associated systemic conditions (13). Comorbidity information can also advance the identification of shared pathophysiology and risk factors, which play an important role in preventive medicine. For instance, cardiovascular disease has been found to have a significant association with psoriasis and contributes largely to the 5-year shorter life expectancy of psoriatic patients (15). Although this connection has been well publicized, a survey conducted between 2009 to 2012 showed that many physicians were unaware of this association potentially increasing the risk of delayed diagnosis and inadequate treatment of the associated cardiovascular comorbidity (16, 17).

While small cohort studies have been conducted to identify associated demographic variables or co-occurring conditions for specific skin-diseases (4, 18, 19) and the availability of large-scale claims databases has advanced precision medicine and comorbidity identification (20), limited research has investigated the potential of using these resources to identify, in a systematic fashion, associated skin conditions and comorbidities. A prominent claims data system is Optum’s de-identified Clinformatics® Data Mart Database (CDM) (21, 22), an organized medical claims database that supports large-scale retrospective cohort studies. By utilizing medical records dating from 2001 to 2018, we revealed specific/shared comorbidities for 14 different skin diseases. With the 18-year time span, the trajectory of disease-comorbidity associations was also studied (23).

Our work highlights that most of the potential skin/skin-related condition-comorbidity pairs are positively associated. We calculated the trend of the skin-comorbidity associations over time and illustrated that the association between type-2 diabetes (T2D) and psoriasis over time is significant, stable, and consistent with previously published studies, confirming the validity of using CDM data in the identification of skin/skin-related disease comorbidities. However, analysis of some disease conditions can be biased, for instance, the association between psoriatic arthritis (PsA) and rheumatoid arthritis (RA) can be inflated when using unrestricted CDM data. This observation manifests potential misdiagnosis for some disease pairs in claims data. The CDM data processing and analyses in skin disease comorbidity identification can help inform the potentials and challenges in using large-scale claims data to study comorbidities and facilitate the development of individualized health care and optimization of clinical management.

2 Materials and methods

2.1 Data preparation

The data used in this study comes from CDM (21), a de-identified patient-level database provided by Optum, a national healthcare management company. The CDM database includes medical claims from various sources, including commercially insured patients, administrative services only patients, legacy medicare choice patients prior to 2006, and medicare advantage patients after 2006. It covers a span of 18 years, from 2001 to 2018, and includes over 63 million patients from all 50 U.S. states. However, the CDM cohort does not include patients insured by Medicaid, so the socioeconomic spectrum of the entire U.S. population is not fully represented in this dataset (22).

Our analysis focused on identifying comorbidities related to skin diseases. We began by selecting a total of 13,934,335 patients with at least one of the 14 skin conditions. These conditions were categorized into three groups: immune-mediated skin diseases (acne, rosacea, alopecia areata, vitiligo, psoriasis, atopic dermatitis, hidradenitis suppurativa, prurigo nodularis), non-immune-mediated skin diseases (aging, leprosy, pigmentation, melanoma), and skin-related disorders (systemic lupus erythematosus, psoriatic arthritis). For the control group, we randomly sampled 2 million unique patients from the entire CDM database, excluding those with any of the aforementioned 14 skin/skin-related diseases. We extracted and adjusted several demographic and socioeconomic variables for analysis, including age, sex, race, education level, income level, home ownership, and the number of adults and children in the household, to account for the higher socioeconomic sampling bias. To account for non-recorded comorbidities resulting from patients leaving the healthcare system, we also included the length of time patients stayed in the system as a covariate. In the subsequent analysis, we only included individuals with complete demographic and socioeconomic information leaving 7,553,273 patients and 726,230 controls. If a patient was diagnosed with two diseases within a 5-year time span, we considered those conditions to be co-occurring. This time range is based on empirical observations of the duration patients stay in the CDM system. We divided the full dataset into consecutive 5-year subsets (e.g., 2001-2005, 2002-2006,…, 2014-2018) and conducted separate analyses for each time interval. Figure 1 provides an overview of our study.

Figure 1
www.frontiersin.org

Figure 1 Data preprocessing and model fitting workflow. The flowchart illustrates the selection process for patients with skin-related conditions and the control group. Patients with the 14 skin-related conditions are initially extracted, and a separate control group of 2,000,000 patients is randomly sampled from the remaining cohort. Quality control steps are applied to remove patients with incomplete records. The subsequent statistical analyses involve comparing the extracted skin condition patients with the randomly sampled non-skin condition patients as the control group (psoriasis is used as an example in the above pipeline).

2.2 Statistical analysis

A descriptive analysis was performed to provide an overview of the dataset and the distribution of all covariates. Categorical variables such as sex, race, education level, home ownership, and income level were summarized as percentages for each category. Continuous/Integer variables such as age, the number of children, and the number of adults in the household were summarized as mean values with their corresponding standard deviations.

Logistic regression was employed to model the association between each skin disease and comorbidity pair while accounting for potential confounding covariates. Treating either skin/skin-related disorders or comorbidities as outcome variable can achieve this goal. Since the other aim of this work is to model the risk of skin/skin-related disorders, therefore, in the following analysis we treat skin/skin-related disorders as outcome variable and comorbidities and other demographics as predictors. Age was categorized into specific ranges (e.g., <10, 10-20, 20-30,…, 70-80, >80), allowing for non-linear patterns, with the reference category being age<10. As weight and height information was unavailable, the obesity diagnosis code was used as a surrogate to control for the impact of low or high BMI on disease associations. Male and European ancestry were chosen as the reference categories for sex and race, respectively. Education level was categorized as “below high school,” “high school,” “bachelor,” and “above bachelor,” with “below high school” as the reference category. Annual household income was categorized as “<$40k,” “$40k-$49k,” “$50k-$59k,” “$60k-$74k,” “$75k-$99k,” and “$100k>,” with “<$40k” as the reference category. The time lengths for each patient in the system were calculated as the number of years between the first and last recorded diagnosis. For patient i, the logistic regression model for the following comorbidity analysis is thus:

logit{Pr(Skini|Xi)}  =β0+βcomorbidity×Xicomorbidity+βobesity×Xiobesity+βage   ×Xiage+βsex×Xisex+βrace×Xirace+βeducation×Xieducation   +βincome×Xiincome+βchild×Xichild+βadult×Xiadult+βtime×Xitime,

where βcomorbidityis the parameter of interest indicating the association levels for a pair of skin/skin-related condition and comorbidity, which can be interpreted as the log odds ratio of developing the skin/skin-related disease between patients with or without the comorbidity.

3 Results

3.1 Summary statistics

The summary information for the cases and controls during the period of 2014-2018 is presented in Table 1 in addition to the US general population characteristics. When comparing the randomly controlled samples with the US general population, the CDM data represents older, higher income and education US population with less ethnic minorities. This further justifies controlling the socioeconomic factors in the logistic regression model for subsequent analysis. Consistent with previous studies (2428), certain skin or skin-related disorders show a higher prevalence among women. For example, rosacea, alopecia areata, SLE, acne, and hidradenitis suppurativa (HS) have 67.6%, 73.7%, 86.3%, 67.6%, and 72.5% female patients, respectively, compared to 50.7% in the control group. We also found a higher proportion of European ancestry associated with the diagnosis of rosacea, aging (chronic exposure to sun or non-ionizing radiation), melanoma, and pigmentation (e.g. hyperpigmentation and freckles; detailed definition can be found in Supplementary Table 1), with percentages of 82.6%, 87.6%, 88.7%, and 81.8%, respectively, compared to the baseline composition of 72.2% Europeans in the control population. Conversely, the Hispanic and African American populations have lower proportions in most skin diseases compared to the control group, except for vitiligo (16.5%) and leprosy (14.5%) among Hispanics (control: 12.5%), and SLE (15.5%) and HS (18.6%) among African Americans (control: 10.5%). Patients of Asian heritage have a lower proportion of melanoma (0.9%) but a higher proportion of vitiligo (7.4%) and leprosy (8.9%) compared to the control group (4.8%). Furthermore, we observed that a higher education level is associated with a larger number of medical claims for skin disorders. Rosacea (30.5% above college), acne (35.4% above college), and pigmentation (30.1% above college) have the most significant elevation compared to the control group (18.8% above college). Similarly, a higher income level is linked to a stronger association with medical claims for skin conditions, with rosacea (53.4% income >$100k), acne (61.0% >$100k), and pigmentation (53.1% >$100k) showing the largest contrast compared to the control population (39.2% >$100k).

Table 1
www.frontiersin.org

Table 1 Descriptive analysis for CDM data.

Figure 2 provides an overview of the demographic variables in our study. Figure 2A displays the prevalence of each skin disease and control categorized by gender. AD, pigmentation, and acne are the most prevalent skin conditions in the CDM data, and their prevalence remains consistent when comparing 2014-2018 records to those from 2001-2005 (Supplementary Figure 1A). The gender distributions for different skin conditions also remain consistent. Figure 2B presents the density of the time (in years) that patients stay in the CDM system, showing that approximately 60% of the patients stay within a 5-year time span. Figure 2C displays the age distribution of the control group and each skin disease group for the period between 2014-2018. This represents the ages of patients with skin-related disorders diagnosis in the system, and not necessarily represent the disease age of onset. Each disease exhibits a unique age distribution compared to the control group. For example, acne patients tend to be younger (29), while AD shows a bimodal pattern in age distribution, which is consistent with previous studies (30). We also observed that the median age for all skin conditions, except for acne, tends to be earlier in the 2001-2005 cohort (Supplementary Figure 1B) compared to the 2014-2018 cohort, whereas the age distribution for acne remains consistent over time.

Figure 2
www.frontiersin.org

Figure 2 Data summary. (A) Gender-specific prevalence of each skin disease/control between 2014-2018. Females generally exhibit a higher prevalence than males in developing immune-mediated skin diseases. (B) Distribution of patients’ time in the system, spanning from 2001 to 2018. Most patients stayed in the system for less than 5 years. (C) Age distribution of different skin/skin-related diseases/control between 2014-2018. Most skin diseases show a similar age distribution compared to the control group, while acne, AD, and HS tend to have a higher proportion of younger patients.

3.2 Skin-comorbidity association trends across time

We first investigated the trend of associations between psoriasis and T2D (18, 31), a comorbidity pair that has been extensively studied before. Figure 3A provides a summary of adjusted Odds Ratios (ORs) with 95% confidence intervals (CIs) from the logistic regression model. We observed consistent and stable estimated ORs across different time periods, ranging between 1.15 and 1.31. To compare our findings with previous studies (18, 31) on the association between psoriasis and T2D, we included their OR estimates and corresponding 95% CIs. Due to smaller sample sizes, the 95% CIs of these earlier studies are wider compared to our analysis. Although their estimates show some variability, their point estimates for OR align closely with ours, and their 95% CIs encompass most of our estimates.

Figure 3
www.frontiersin.org

Figure 3 Forest plots of association across a year-to-year period. (A) Forest plot illustrating the odds ratio (OR) with confidence intervals (CIs) for the association between psoriasis and type 2 diabetes (T2D) in comparison to non-T2D patients. The OR and CI from this study are shown, along with the corresponding OR and CI from two previous studies for comparison. The findings indicate that the OR estimate from this study aligns with previous results, but featuring more precise CIs. (B) Forest plot showcasing the parameter estimate for the OR with CIs of developing atopic dermatitis (AD) in lung cancer patients compared to lung cancer-free patients. The results exhibit a declining trend in the association, which ultimately dissipates. (C) Forest plot displaying the parameter estimate for the OR with CIs between psoriatic arthritis (PsA) and rheumatoid arthritis (RA) based on all clinics and providers (black) and solely rheumatology clinics (red). The estimated associations derived from rheumatology clinics is weaker than that from all clinics, with both estimates showing a steady downward trend. This suggests the potential for more precise diagnoses in rheumatology clinics, as well as improved diagnosis accuracy over time in general. (D) Regression analysis of PsA vs RA odds ratios based on all clinics and rheumatology clinics, incorporating first-order and second-order time covariates. Estimates and P-values of the second-order time coefficients are shown in the legend. The significant second-order time coefficient from the rheumatology clinic estimate suggests a significant deceleration in the rate of change for ORs, while the rate of change for ORs from all clinics demonstrates a steady decline. For all figures, the control group consists of randomly sampled patients from the general CDM population.

Furthermore, we explored the association trends of other disease pairs and highlighted notable findings in Figure 3. For instance, the association between AD and lung cancer (Figure 3B) has transitioned from a significant positive association in the period 2001-2005 (OR: 1.62, 95% CI [1.34-1.97]) to a non-significant association in 2014-2018 (OR: 1.02, 95% CI [0.97-1.07]). While earlier studies from 2005 and 2012 reported positive associations between AD and lung cancer (32, 33), a more recent study in 2020 found that after adjusting for potential mediators such as smoking or smoking-related diseases, this association disappears (34). These findings suggest that improved treatment for AD in recent years or changes in modifying behaviors (such as smoking) may have played a role in reducing the risk of cancer for AD patients. In Figure 3C, we observed strong associations between PsA and RA across different years. Since many clinical measures of PsA are adopted from RA (35) and the specific diagnosis of RA and PsA require knowledge from rheumatologists (36), the strong associations may be attributed to miscoding. To explore this further, we conducted separate analyses for patients diagnosed exclusively in rheumatology clinics (red lines in Figure 3C), in addition to the analysis based on all clinics or providers (black lines in Figure 3C). The associations between PsA and RA from rheumatology clinics consistently exhibit weaker associations compared to the findings from the unrestricted data, while both analyses demonstrate a decreasing trend over time. Although this finding could indicate improving diagnosis accuracy for both rheumatology clinics and other clinics over time, special care is still needed when using medical claims to study disease comorbidities. Additionally, we also observed diminishing differences between the ORs estimated from rheumatology clinics and all clinics (i.e. unrestricted data). We regressed these ORs on both the first-order and second-order time covariates (Figure 3D), and found that the second-order term in the regression for all clinics is not significant (p = 0.452), indicating that the rate of ORs changing across years remains relatively constant. In contrast, the second-order term in the regression for rheumatology clinics is significant (p < 1×10-7), suggesting that the changing rate of ORs decreases across years.

3.3 Large-scale comorbidity identification

We conducted a large-scale association study to identify the comorbidities for the 14 skin/skin-related conditions using data from the period 2014-2018. We evaluated a total of 420 skin disease-comorbidity pairs by associating the concurrence of these conditions with 30 common human disorders, including respiratory, cancer, mental, immunological, gastrointestinal, cardiovascular, and diabetes conditions (Figure 4 with detailed association estimates, sample sizes and P-values in Supplementary Table 2). For the large-scale comorbidity analysis, we found that most of the skin/skin-related condition-comorbidity associations are significant and positive, with the most prominent associated pairs being Crohn’s disease and leprosy (OR=6.60, 95% CI: 3.09-14.08); primary biliary cirrhosis (PBC) and SLE (OR=6.07, 95% CI: 4.93-7.46); as well as celiac disease (CD) and SLE (OR=6.06, 95% CI: 5.49-6.69). These associations are consistent with previous literature: for instance, different studies have reported overlapping genetic signals between Crohn’s disease and leprosy (3739). For PBC and SLE, researchers have found the odds of developing PBC is 2.23 (CI: 1.26-3.96) times higher if patients have a family history of SLE (40). A 2016 study estimated the CD and SLE association to be 3.92 in OR (CI 2.55-6.03) (41). Our findings also reveal that patients diagnosed with melanoma have higher rates of being diagnosed with multiple cancers, including ovarian, lung, and prostate cancers. Additionally, we observed that diabetes has either no association or significant negative associations with acne, rosacea, aging, pigmentation, and melanoma. However, among all the skin conditions studied, leprosy patients exhibit the highest odds of co-diagnosis with type I diabetes (OR: 2.71, CI: 1.53-4.80). Our findings align with previous research demonstrating that the incidence of diabetes among leprosy patients is over seven times higher compared to control groups (14.2% vs. 2%) (42). Notably, when compared to the 2001-2005 cohort, the most notable associations remain consistent (Supplementary Figure 2), while less associations are observed for multiple cancers.

Figure 4
www.frontiersin.org

Figure 4 Heatmap of large-scale association results between 2014-2018. Heatmap representation of the associations between overall skin/skin-related conditions and potential comorbidities during the period of 2014-2018. The color intensity reflects the level of odds ratio (OR) association, while asterisks indicate the significance levels (***: P<10-3; **: 10-3≤P<10-2; *: 10-2≤P<0.05; ·: 0.05≤P<0.01). The findings suggest that the majority of associations between skin and skin-related conditions and comorbidities are both significant and positive. Particularly notable pairings include Crohn’s disease with leprosy, primary biliary cirrhosis with systemic lupus erythematosus (SLE), and celiac disease with SLE. # The comorbidity analysis does not include rheumatological conditions due to the ambiguity of the phenotyping when using ICD codes and misdiagnosis.

We presented the effect sizes (in log OR) of all comorbidities for each skin/skin-related condition in the 2014-2018 cohort in Supplementary Figure 3A. This highlights that patients with SLE, leprosy, and HS are more susceptible to other comorbid diagnoses. In Supplementary Figure 3B, we showed the effect sizes of skin/skin-related conditions within each comorbidity, revealing that celiac disease, Crohn’s disease, and ulcerative colitis have the strongest average associations with the multiple different skin conditions studied in our analysis. We also provided the results for the 2001-2005 cohort in Supplementary Figure 4, which generally align with the findings from the 2014-2018 cohort. Additionally, we summarized the 2014-2018 prevalence of the most prevalent comorbidities within controls and patients with skin/skin-related diseases in Supplementary Table 3. These results further support that celiac disease is one of the most common comorbidities for patients suffering from skin/skin-related conditions.

4 Discussion

Identifying potential comorbidities, particularly those with modest associations, often requires a large sample size for adequate statistical power. Skin conditions, despite being prevalent, are known to have a high percentage of patients who do not seek medical advice, estimated at 73% (43). Consequently, studies in this domain may suffer from limited sample sizes and reduced power to detect weak associations (18, 31). However, leveraging the extensive sample size provided by the claims-based CDM database, we were able to uncover comorbidities even with mild associations. It is worth noting, however, that the CDM database does not include patients insured by Medicaid, which may impact the generalizability of the findings. To validate the CDM dataset, we evaluated the population summary statistics and confirmed their consistency with previous findings regarding overall prevalence, as well as age, ethnicity, and gender distributions. Additionally, we have showcased the well-established link between psoriasis and T2D as a proof-of-concept to further substantiate the validity of the CDM data. We also investigated other skin/skin-related diseases and comorbidities to determine association trends over time. We found that the PsA and RA association decreased dramatically across years. For a long time, PsA was considered to be a variant of RA (44, 45) due to limited knowledge and lack of more specific biomarkers (46). Since the proposition and clinical application of dactylitis as a hallmark and distinct feature of PsA, compared to RA in 1996 (47), and the CASPAR criteria for PsA diagnosis in 2006 (48), our analysis suggests that potential mis-diagnosis is decreasing over time.

We also adopted a different approach to examine the comorbidity: for a particular skin condition (e.g. psoriasis) we randomly selected control patients from the remaining 13 cohorts consisting of patients with different skin conditions. The pipeline and results of this alternative analysis, depicted in Supplementary Figures 5 and 6, indicate a generally lower association between psoriasis and T2D compared to the original analysis. This suggests the existence of associations between T2D and other skin conditions within the dataset.

The comorbidity of skin diseases can arise from various mechanisms, and understanding these mechanisms can contribute to a deeper comprehension of disease pathogenesis and enhance diagnostic accuracy. The information on disease co-occurrence would enable researchers to explore shared pathogenesis between these related conditions, thereby advancing the understanding of both conditions. Additionally, comorbidities play a crucial role in dermatological diagnoses, aiding dermatologists in distinguishing different diseases more accurately. The presence of comorbidities can be influenced by treatments administered to patients. In other words, different therapeutic interventions, such as medications, surgeries, or other medical procedures, can have an impact on the occurrence or development of concurrent diseases in individuals with skin conditions. For instance, certain medications used to treat one condition may influence the immune system or physiological processes that could potentially lead to the onset or exacerbation of other diseases. Additionally, the side effects or interactions of medications can also contribute to the development of comorbidities. Moreover, confounding factors such as patients’ lifestyle, quality of life, and living environment can also lead to disease co-occurrence (49). In this analysis, we accounted for potential confounders by adjusting for demographic and socioeconomic variables in the model. Lastly, misdiagnosis can contribute to the observed co-occurrence of two diseases. For example, PsA and RA are susceptible to misdiagnosis, as reported in previous studies (50). In our analysis, we observed a high association between these conditions; however, we also noticed a consistent temporal decrease in this association. This may be attributed to improved diagnostic criteria and a better understanding of disease mechanisms. Nevertheless, it is important to note that our association analysis does not completely eliminate the potential of misdiagnosis. We recommend that future systematic studies consider employing machine learning methods to correct phenotyping and address misdiagnosis as a preliminary step (51, 52).

Data availability statement

The data analyzed in this study is subject to the following licenses/restrictions: The data analyzed in this study was obtained from Optum’s de-identified Clinformatics® Data Mart Database, and cannot be directly shared to the public. Requests to access these datasets should be directed to Optum, connected@optum.com.

Author contributions

QL: Data curation, Formal Analysis, Investigation, Methodology, Software, Writing – original draft. MP: Data curation, Methodology, Supervision, Writing – review & editing. SS: Data curation, Writing – review & editing. JK: Supervision, Writing – review & editing. JMK: Writing – review & editing. JG: Writing – review & editing. KH: Methodology, Supervision, Writing – review & editing. LT: Conceptualization, Funding acquisition, Methodology, Project administration, Supervision, Writing – review & editing.

Funding

The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was supported by the Michigan Mcubed 3.0 Classic Cube fund (JiK, KH, LT), National Psoriasis Foundation (LT, MP, and JG), and awards from the National Institutes of Health (K01AR072129 and R01AR080662 to LT; 1P30AR075043 to LT, MP, and JG; UC2 AR081033 to LT and JG). MP was also supported by the Dermatology Foundation.

Conflict of interest

JG has served as a consultant to AbbVie, Eli Lilly, Almirall, Celgene, BMS, Janssen, Prometheus, TimberPharma, Galderma, Novatis, MiRagen, AnaptysBio and has received research support from AbbVie, SunPharma, Eli Lilly, Kyowa Kirin, Almirall, Celgene, BMS, Janssen, Prometheus, and TimberPharma. LT has received research support from Janssen, Novartis, and Galderma.

The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fimmu.2023.1309549/full#supplementary-material

References

1. Hay RJ, Johns NE, Williams HC, Bolliger IW, Dellavalle RP, Margolis DJ, et al. The global burden of skin disease in 2010: an analysis of the prevalence and impact of skin conditions. J Invest Dermatol (2014) 134(6):1527–34. doi: 10.1038/jid.2013.446

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Johnson M-LT, Roberts J. Skin conditions and related need for medical care among persons 1-74 years, United States, 1971-1974. Department Health Education Welfare Public Health Service Office (1978).

Google Scholar

3. Bickers DR, Lim HW, Margolis D, Weinstock MA, Goodman C, Faulkner E, et al. The burden of skin diseases: 2004: A joint project of the American Academy of Dermatology Association and the Society for Investigative Dermatology. J Am Acad Dermatol (2006) 55(3):490–500. doi: 10.1016/j.jaad.2006.05.048

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Schofield J, Grindlay D, William H. Skin conditions in the UK: a health needs assessment (University of Nottingham: Centre of Evidence Based Dermatology UK) (2009) 2013.

Google Scholar

5. Hay RJ, Fuller LC. The assessment of dermatological needs in resource-poor regions. Int J Dermatol (2011) 50(5):552–7. doi: 10.1111/j.1365-4632.2011.04953.x

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Finlay AY, Coles E. The effect of severe psoriasis on the quality of life of 369 patients. Br J Dermatol (1995) 132(2):236–44. doi: 10.1111/j.1365-2133.1995.tb05019.x

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Krueger G, Koo J, Lebwohl M, Menter A, Stern RS, Rolstad T. The impact of psoriasis on quality of life: results of a 1998 National Psoriasis Foundation patient-membership survey. Arch Dermatol (2001) 137(3):280–4.

PubMed Abstract | Google Scholar

8. Brezinski EA, Dhillon JS, Armstrong AW. Economic burden of psoriasis in the United States: a systematic review. JAMA Dermatol (2015) 151(6):651–8. doi: 10.1001/jamadermatol.2014.3593

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Adamson AS. The economics burden of atopic dermatitis. Manage Atopic Dermatitis (2017) 1027:79–92. doi: 10.1007/978-3-319-64804-0_8

CrossRef Full Text | Google Scholar

10. Khamashta M, Bruce I, Gordon C, Isenberg D, Ateka-Barrutia O, Gayed M, et al. The cost of care of systemic lupus erythematosus (SLE) in the UK: annual direct costs for adult SLE patients with active autoantibody-positive disease. Lupus (2014) 23(3):273–83. doi: 10.1177/0961203313517407

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Slawsky KA, Fernandes AW, Fusfeld L, Manzi S, Goss TF. A structured literature review of the direct costs of adult systemic lupus erythematosus in the US. Arthritis Care Res (2011) 63(9):1224–32. doi: 10.1002/acr.20502

CrossRef Full Text | Google Scholar

12. Laughter MR, Maymone MB, Karimkhani C, Rundle C, Hu S, Wolfe S, et al. The burden of skin and subcutaneous diseases in the United States from 1990 to 2017. JAMA Dermatol (2020) 156(8):874–81. doi: 10.1001/jamadermatol.2020.1573

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Wakkee M, Nijsten T. Comorbidities in dermatology. Dermatologic clinics. (2009) 27(2):137–47. doi: 10.1016/j.det.2008.11.013

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Brunner PM, Silverberg JI, Guttman-Yassky E, Paller AS, Kabashima K, Amagai M, et al. Increasing comorbidities suggest that atopic dermatitis is a systemic disorder. J Invest Dermatol (2017) 137(1):18–25. doi: 10.1016/j.jid.2016.08.022

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Balta I, Balta S, Demirkol S, Celik T, Ekiz O, Cakar M, et al. Aortic arterial stiffness is a moderate predictor of cardiovascular disease in patients with psoriasis vulgaris. Angiology (2014) 65(1):74–8. doi: 10.1177/0003319713485805

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Parsi KK, Brezinski EA, Lin T-C, Li C-S, Armstrong AW. Are patients with psoriasis being screened for cardiovascular risk factors? A study of screening practices and awareness among primary care physicians and cardiologists. J Am Acad Dermatol (2012) 67(3):357–62. doi: 10.1016/j.jaad.2011.09.006

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Kimball AB, Wu Y. Cardiovascular disease and classic cardiovascular risk factors in patients with psoriasis. Int J Dermatol (2009) 48(11):1147–56. doi: 10.1111/j.1365-4632.2009.04075.x

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Wan MT, Shin DB, Hubbard RA, Noe MH, Mehta NN, Gelfand JM. Psoriasis and the risk of diabetes: a prospective population-based cohort study. J Am Acad Dermatol (2018) 78(2):315–22. doi: 10.1016/j.jaad.2017.10.050

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Hemminki K, Liu X, Försti A, Sundquist J, Sundquist K, Ji J. Subsequent type 2 diabetes in patients with autoimmune disease. Sci Rep (2015) 5(1):1–8. doi: 10.1038/srep13871

CrossRef Full Text | Google Scholar

20. Frey LJ, Bernstam EV, Denny JC. Precision medicine informatics. J Am Med Inf Assoc (2016) 23(4):668–70. doi: 10.1093/jamia/ocw053

CrossRef Full Text | Google Scholar

21. Gunaseelan V, Kenney B, Lee JS-J, Hu HM. Databases for surgical health services research: Clinformatics Data Mart. Surgery (2019) 165(4):669–71. doi: 10.1016/j.surg.2018.02.002

PubMed Abstract | CrossRef Full Text | Google Scholar

22. O'Byrne ML, DeCost G, Katcoff H, Savla JJ, Chang J, Goldmuntz E, et al. Resource utilization in the first 2 years following operative correction for tetralogy of fallot: study using data from the optum's de-identified clinformatics data mart insurance claims database. J Am Heart Assoc (2020) 9(15):e016581. doi: 10.1161/JAHA.120.016581

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Desai RJ, Solomon DH, Jin Y, Liu J, Kim SC. Temporal trends in use of biologic DMARDs for rheumatoid arthritis in the United States: a cohort study of publicly and privately insured patients. J managed Care specialty pharmacy. (2017) 23(8):809–14. doi: 10.18553/jmcp.2017.23.8.809

CrossRef Full Text | Google Scholar

24. Kyriakis KP, Palamaras I, Terzoudi S, Emmanuelides S, Michailides C, Pagana G. Epidemiologic aspects of rosacea. J Am Acad Dermatol (2005) 53(5):918–9. doi: 10.1016/j.jaad.2005.05.018

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Lundin M, Chawa S, Sachdev A, Bhanusali D, Seiffert-Sinha K, Sinha AA. Gender differences in alopecia areata. J Drugs dermatology: JDD. (2014) 13(4):409–13.

Google Scholar

26. Wasef SZY. Gender differences in systemic lupus erythematosus. Gender Med (2004) 1(1):12–7. doi: 10.1016/S1550-8579(04)80006-8

CrossRef Full Text | Google Scholar

27. Skroza N, Tolino E, Proietti I, Bernardini N, La Viola G, Nicolucci F, et al. Women and acne: any difference from males? A review of the literature. Giornale italiano di dermatologia e venereologia: organo ufficiale Societa italiana di dermatologia e sifilografia. (2016) 151(1):87–92.

PubMed Abstract | Google Scholar

28. Yee D, Collier EK, Atluri S, Jaros J, Shi VY, Hsiao JL. Gender differences in sexual health impairment in hidradenitis suppurativa: A systematic review. Int J Women's Dermatol (2021) 7(3):259–64. doi: 10.1016/j.ijwd.2020.10.010

CrossRef Full Text | Google Scholar

29. Cunliffe W, Gould D. Prevalence of facial acne vulgaris in late adolescence and in adults. Br Med J (1979) 1(6171):1109–10. doi: 10.1136/bmj.1.6171.1109

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Saeki H, Furue M, Furukawa F, Hide M, Ohtsuki M, Katayama I, et al. Guidelines for management of atopic dermatitis. J Dermatol (2009) 36(10):563–77. doi: 10.1111/j.1346-8138.2009.00706.x

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Dubreuil M, Rho YH, Man A, Zhu Y, Zhang Y, Love TJ, et al. Diabetes incidence in psoriatic arthritis, psoriasis and rheumatoid arthritis: a UK population-based cohort study. Rheumatology (2014) 53(2):346–52. doi: 10.1093/rheumatology/ket343

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Hagströmer L, Ye W, Nyrén O, Emtestam L. Incidence of cancer among patients with atopic dermatitis. Arch Dermatol (2005) 141(9):1123–7. doi: 10.1001/archderm.141.9.1123

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Hwang CY, Chen YJ, Lin MW, Chen TJ, Chu SY, Chen CC, et al. Cancer risk in patients with allergic rhinitis, asthma and atopic dermatitis: a nationwide cohort study in Taiwan. Int J cancer. (2012) 130(5):1160–7. doi: 10.1002/ijc.26105

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Mansfield KE, Schmidt SA, Darvalics B, Mulick A, Abuabara K, Wong AY, et al. Association between atopic eczema and cancer in England and Denmark. JAMA Dermatol (2020) 156(10):1086–97. doi: 10.1001/jamadermatol.2020.1948

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Brent LH. Inflammatory arthritis: an overview for primary care physicians. Postgraduate Med (2009) 121(2):148–62. doi: 10.3810/pgm.2009.03.1987

CrossRef Full Text | Google Scholar

36. Giacomelli R, Gorla R, Trotta F, Tirri R, Grassi W, Bazzichi L, et al. Quality of life and unmet needs in patients with inflammatory arthropathies: results from the multicentre, observational RAPSODIA study. Rheumatology (2015) 54(5):792–7. doi: 10.1093/rheumatology/keu398

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Grant AV, Alter A, Huong NT, Orlova M, Van Thuc N, Ba NN, et al. Crohn's disease susceptibility genes are associated with leprosy in the Vietnamese population. J Infect diseases. (2012) 206(11):1763–7. doi: 10.1093/infdis/jis588

CrossRef Full Text | Google Scholar

38. Schurr E, Gros P. A common genetic fingerprint in leprosy and Crohn's disease? Mass Med Soc; (2009) p:2666–8. doi: 10.1056/NEJMe0910690

CrossRef Full Text | Google Scholar

39. Jung S, Park D, Lee H-S, Kim Y, Baek J, Hwang SW, et al. Identification of shared loci associated with both Crohn’s disease and leprosy in East Asians. Hum Mol Genet (2022) 31(22):3934–44. doi: 10.1093/hmg/ddac101

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Gershwin ME, Selmi C, Worman HJ, Gold EB, Watnik M, Utts J, et al. Risk factors and comorbidities in primary biliary cirrhosis: a controlled interview-based study of 1032 patients. Hepatology (2005) 42(5):1194–202. doi: 10.1002/hep.20907

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Dahan S, Shor DB-A, Comaneshter D, Tekes-Manova D, Shovman O, Amital H, et al. All disease begins in the gut: celiac disease co-existence with SLE. Autoimmun Rev (2016) 15(8):848–53. doi: 10.1016/j.autrev.2016.06.003

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Nigam P, Dayal S, Srivastava P, Joshi L, Goyal B, Gupta M. Diabetic status in leprosy. Hansenologia Internationalis: hanseníase e outras doenças infecciosas. (1979) 4(1):7–14. doi: 10.47878/hi.1979.v4.35626

CrossRef Full Text | Google Scholar

43. Basra MK, Shahrukh M. Burden of skin diseases. Expert Rev Pharmacoeconomics Outcomes Res (2009) 9(3):271–83. doi: 10.1586/erp.09.23

CrossRef Full Text | Google Scholar

44. Wright V. Psoriatic arthritis. Bull Rheum Dis (1971) 21:627–32. doi: 10.1016/0049-0172(73)90035-8

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Gladman DD, Ritchlin C. Clinical manifestations and diagnosis of psoriatic arthritis. UpToDate (2020). v18.

Google Scholar

46. Avila R, Pugh DG, Slocumb CH, Winkelmann R. Psoriatic arthritis: a roentgenologic study. Radiology (1960) 75(5):691–702. doi: 10.1148/75.5.691

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Roberton D, Cabral D, Malleson P, Petty R. Juvenile psoriatic arthritis: followup and evaluation of diagnostic criteria. J Rheumatol (1996) 23(1):166–70.

PubMed Abstract | Google Scholar

48. Taylor W, Gladman D, Helliwell P, Marchesoni A, Mease P, Mielants H. Classification criteria for psoriatic arthritis: development of new criteria from a large international study. Arthritis Rheumatism: Off J Am Coll Rheumatol (2006) 54(8):2665–73. doi: 10.1002/art.21972

CrossRef Full Text | Google Scholar

49. Liu J-T, Yeh H-M, Liu S-Y, Chen K-T. Psoriatic arthritis: epidemiology, diagnosis, and treatment. World J orthopedics. (2014) 5(4):537. doi: 10.5312/wjo.v5.i4.537

CrossRef Full Text | Google Scholar

50. Merola JF, Espinoza LR, Fleischmann R. Distinguishing rheumatoid arthritis from psoriatic arthritis. RMD Open (2018) 4(2):e000656. doi: 10.1136/rmdopen-2018-000656

PubMed Abstract | CrossRef Full Text | Google Scholar

51. Tang B, Wu Y, Jiang M, Denny JC, Xu H. Recognizing and encoding discorder concepts in clinical text using machine learning and vector space model. CLEF (Working Notes). (2013) 665.

Google Scholar

52. Chen ML. Machine Learning for the Classification of Dementia in the Presence of Mis-labelled Data. (Imperial College London) (2013).

Google Scholar

Keywords: epidemiology, claims, skin disease, comorbidity, Optum

Citation: Li Q, Patrick MT, Sreeskandarajan S, Kang J, Kahlenberg JM, Gudjonsson JE, He Z and Tsoi LC (2024) Large-scale epidemiological analysis of common skin diseases to identify shared and unique comorbidities and demographic factors. Front. Immunol. 14:1309549. doi: 10.3389/fimmu.2023.1309549

Received: 08 October 2023; Accepted: 12 December 2023;
Published: 08 January 2024.

Edited by:

Paola Savoia, Università degli Studi del Piemonte Orientale, Italy

Reviewed by:

Yan-na Wang, Guangdong Provincial People’s Hospital, China
Robert Swerlick, Emory University, United States

Copyright © 2024 Li, Patrick, Sreeskandarajan, Kang, Kahlenberg, Gudjonsson, He and Tsoi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Lam C. Tsoi, YWxleHRzb2lAbWVkLnVtaWNoLmVkdQ==

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.