AUTHOR=Li Wuquan , Guan Xiaoran , Wang Yong , Lv Yan , Wu Yuyong , Yu Min , Sun Yeying TITLE=Cuproptosis-related gene identification and immune infiltration analysis in systemic lupus erythematosus JOURNAL=Frontiers in Immunology VOLUME=14 YEAR=2023 URL=https://www.frontiersin.org/journals/immunology/articles/10.3389/fimmu.2023.1157196 DOI=10.3389/fimmu.2023.1157196 ISSN=1664-3224 ABSTRACT=Background

Systemic lupus erythematosus (SLE) is an autoimmune disease characterized by loss of tolerance to self-antigen, autoantibody production, and abnormal immune response. Cuproptosis is a recently reported cell death form correlated with the initiation and development of multiple diseases. This study intended to probe cuproptosis-related molecular clusters in SLE and constructed a predictive model.

Methods

We analyzed the expression profile and immune features of cuproptosis-related genes (CRGs) in SLE based on GSE61635 and GSE50772 datasets and identified core module genes associated with SLE occurrence using the weighted correlation network analysis (WGCNA). We selected the optimal machine-learning model by comparing the random forest (RF) model, support vector machine (SVM) model, generalized linear model (GLM), and the extreme gradient boosting (XGB) model. The predictive performance of the model was validated by nomogram, calibration curve, decision curve analysis (DCA), and external dataset GSE72326. Subsequently, a CeRNA network based on 5 core diagnostic markers was established. Drugs targeting core diagnostic markers were acquired using the CTD database, and Autodock vina software was employed to perform molecular docking.

Results

Blue module genes identified using WGCNA were highly related to SLE initiation. Among the four machine-learning models, the SVM model presented the best discriminative performance with relatively low residual and root-mean-square error (RMSE) and high area under the curve (AUC = 0.998). An SVM model was constructed based on 5 genes and performed favorably in the GSE72326 dataset for validation (AUC = 0.943). The nomogram, calibration curve, and DCA validated the predictive accuracy of the model for SLE as well. The CeRNA regulatory network includes 166 nodes (5 core diagnostic markers, 61 miRNAs, and 100 lncRNAs) and 175 lines. Drug detection showed that D00156 (Benzo (a) pyrene), D016604 (Aflatoxin B1), D014212 (Tretinoin), and D009532 (Nickel) could simultaneously act on the 5 core diagnostic markers.

Conclusion

We revealed the correlation between CRGs and immune cell infiltration in SLE patients. The SVM model using 5 genes was selected as the optimal machine learning model to accurately evaluate SLE patients. A CeRNA network based on 5 core diagnostic markers was constructed. Drugs targeting core diagnostic markers were retrieved with molecular docking performed.