
94% of researchers rate our articles as excellent or good
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.
Find out more
ORIGINAL RESEARCH article
Front. Oncol. , 19 February 2025
Sec. Gastrointestinal Cancers: Hepato Pancreatic Biliary Cancers
Volume 15 - 2025 | https://doi.org/10.3389/fonc.2025.1505376
This article is part of the Research Topic Advancing Cancer Imaging Technologies: Bridging the Gap from Research to Clinical Practice View all 13 articles
Purpose: This study aimed to construct a differential diagnostic model to distinguish autoimmune pancreatitis (AIP) from pancreatic ductal adenocarcinoma (PDCA) using ultrasound clinical features and machine learning algorithms.
Methods: Retrospective ultrasound clinical data of patients with AIP and PDCA from three different centers were used as the training cohort, external validation cohort 1, and external validation cohort 2. Feature selection was conducted via variance filtering and LASSO regression, followed by the construction of a random forest (RF) model. The hyperparameters were optimized in the training cohort, and the final model was evaluated in the external validation cohorts. The model’s performance was assessed using sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), the F1 score, accuracy, and area under curve(AUC). The clinical application value of the model was clarified through a comparison between humans and machines.
Results: An RF model was constructed using six features: Ca 19-9 level, abdominal pain, jaundice, focal/diffuse-type AIP, blood flow signals, and morphology. In external validation cohort 1, the model’s sensitivity, specificity, PPV, NPV, F1 score, accuracy, and AUC were 86.0%, 80.0%, 81.0%, 86.0%, 84.0%, 83.0%, and 89.0%, respectively; in external validation cohort 2, these values were 72.0%, 94.0%, 93.0%, 77.0%, 81.0%, 83.0%, and 91.0%, respectively. The predictive performance of experienced radiologists using clinical information and ultrasound images demonstrated a sensitivity of 81%, specificity of 79%, PPV of 78%, NPV of 76%, F1 score of 80%, and accuracy of 80%. For radiologists with intermediate experience, the sensitivity was 75%, specificity was 74%, PPV was 73%, NPV was 76%, F1 score was 74%, and accuracy was 75%. less experienced radiologist had a sensitivity of 55%, specificity of 56%, PPV of 62%, NPV of 49%, F1 score of 58%, and accuracy of 55%.
Conclusion: The RF model constructed using clinical ultrasound features achieved diagnostic levels comparable to those of experienced radiologists and can assist in differentiating AIP from PDCA, potentially guiding clinical practice.
Pancreatic ductal adenocarcinoma (PDCA) is the most common pathological type of pancreatic cancer, accounting for 80–90% of all pancreatic cancers. PDCA has become the third-leading cause of cancer-related deaths in the United States, surpassing breast cancer (1). Chemotherapy and surgery are the main treatment options for PDCA. AIP is a rare type of pancreatitis characterized by distinctive histological features and a positive response to steroids. In 1995, Yoshida and colleagues first described autoimmune pancreatitis (AIP) (2). Due to the elevated IgG4 levels and associated systemic diseases often observed in patients with AIP, AIP is classified as an IgG4-related disease (IgG4-RD) (3). In 2011, the International Association of Pancreatology released the International Consensus Diagnostic Criteria (ICDC) for AIP. After existing standards were reviewed, criteria based on imaging of the pancreatic parenchyma and ducts, serology, involvement of other organs, pancreatic histopathology, and response to corticosteroid treatment were established (4). AIP is a type of autoimmune-mediated pancreatitis characterized by pancreatic enlargement, obstructive jaundice, lymphoplasmacytic infiltration combined with fibrosis, and a significant response to steroid treatment. The main symptoms of AIP are obstructive jaundice and abdominal discomfort. Most clinical symptoms and imaging findings lack distinctive specificity, especially in mass-forming AIP, which is often misdiagnosed as pancreatic cancer. This misdiagnosis leads to the unnecessary use of medical resources and surgical procedures.
Ultrasound examination, as a convenient, real-time, and cost-effective diagnostic tool, holds significant value for the diagnosis of AIP. The typical ultrasound manifestations of AIP include diffuse, focal, and multifocal types. The characteristic appearance of diffuse AIP on two-dimensional ultrasound is sausage-like pancreatic enlargement with reduced echogenicity, with or without main pancreatic duct dilation. However, the two-dimensional ultrasound appearance of mass-forming AIP often overlaps with that of pancreatic cancer, presenting as single or multiple hypoechoic masses, which frequently leads to misdiagnosis. Experienced ultrasound radiologists integrate patient history and clinical manifestations, carefully observe sonographic features, and consider indirect signs such as the degree of bile duct and main pancreatic duct dilation, involvement of surrounding organs, and the presence of retroperitoneal lymph node enlargement, to comprehensively differentiate these two diseases. An ultrasound diagnosis is dependent on the operator’s experience. Due to the rarity of AIP, primary ultrasound radiologists may have limited awareness and diagnostic experience regarding this disease. The serum IgG4 levels should be measured in patients with suspected AIP. Serum IgG4 elevation is the most specific serological indicator for the diagnosis of IgG4-AIP. The clinical incidence of PDCA is much higher than that of AIP; therefore, routine serum IgG4 testing for all patients is economically unfeasible. Additionally, while serum IgG4 levels are a specific indicator for diagnosing AIP, they are not elevated in all patients with AIP, and elevated levels may be observed in patients with other conditions. Therefore, IgG4 levels alone cannot provide a definitive diagnosis (5).
Machine learning has been widely applied across various fields, including computer science, statistics, and information theory. It uses algorithms to analyze data, explore hidden patterns, and predict new information with high accuracy, speed, and scalability. These attributes have shown significant potential in medical diagnosis, natural language processing, and image recognition. Machine learning technology reduces the dependence on operators, standardizes image interpretation, provides stable results, enables quick decision-making, and alleviates the heavy workload of radiologists. Interest in applying machine learning technology to ultrasound has grown exponentially in recent years. This study aims to (1) explore the diagnostic value of clinical ultrasound features for the differentiation of PDCA and AIP, (2) develop a quantitative machine learning model using clinical ultrasound features to differentiate PDCA and AIP, and (3) clarify the clinical auxiliary diagnostic value via a comparison between humans and machines. The workflow is shown in(Figure 1).
This multicenter retrospective study was approved by the ethics committees of Fuzhou University Affiliated Provincial Hospital, the First Affiliated Hospital of Fujian Medical University, and the Union Hospital of Fujian Medical University, and the requirement of informed consent was waived. We systematically reviewed the computer databases regarding AIP at the three hospitals from July 2014 to September 2024, identifying 90, 70, and 85 patients with AIP, respectively. We also reviewed the computer databases for patients with PDCA confirmed via surgery or biopsy from March 2022 to June 2024 at these hospitals; these patients were used as controls. We identified 100, 105, and 125 patients with PDCA from the three hospitals. The data were screened according to the inclusion and exclusion criteria. Inclusion criteria: (1) PDCA confirmed via surgical pathology or biopsy, or AIP diagnosed according to the 2011 ICDC; (2) ultrasound examination prior to biopsy or surgery; and (3) complete clinical pathology data available. Exclusion criteria: (1) no ultrasound examination prior to biopsy or surgery; (2) ultrasound images not suitable for imaging analysis; (3) incomplete clinical pathology data; or (4) PDCA with distant metastasis. Finally, 20, 17, and 18 patients with AIP and 74, 80, and 105 patients with PDCA were included from the hospitals, respectively. The clinical records, ultrasound images, and pathological data were recorded. AIP has been reported to occur mostly in middle-aged and elderly men (6). Propensity Score Matching(PSM) was used to match the patients with AIP and PDCA at a ratio of 1:1 according to sex and age, resulting in 40, 34, and 36 matched pairs from the three hospitals. The cohort from Fuzhou University Affiliated Provincial Hospital was used as the training cohort, the cohort from the First Affiliated Hospital of Fujian Medical University was used as external validation cohort 1, and the cohort from the Union Hospital of Fujian Medical University was used as external validation cohort 2. A flowchart of the patient selection process is provided (Figure 2).
Ultrasound examinations were performed using Philips IU22, Philips EPIQ5, SIEMENS S3000, and Mindray DC-8 color Doppler ultrasound diagnostic instruments and an abdominal convex array probe with a frequency of 1.0–6.0 MHz. Patients fasted for at least eight hours prior to the examination. During the examination, patients were placed in a supine position, and the convex array probe was applied to the upper abdomen to perform two-dimensional and color Doppler ultrasound scans of the pancreas. The extent of lesion involvement, location, boundary, echo, blood flow signal grading (0, I, II, or III), and associated findings such as calcification, main pancreatic duct dilation, and intrahepatic and extrahepatic bile duct dilation were observed. Blood flow was graded according to the Alder blood flow classification (7): grade 0: no blood flow within the mass; grade I: 1–2 punctate or short rod-like blood flows within the mass; grade II: 3–4 punctate blood flows or 1 long strip-like blood flow within the mass; and grade III: 5 or more punctate blood flows or 2 or more long strip-like blood flows within the mass. Ultrasound examinations and image analyses were completed by three experienced physicians from the three centers, and the results were recorded. Representative ultrasound images are shown in (Supplementary Material). All ultrasound images were outputted through the hospital’s picture archiving and communication system (PACS) in DICOM format. The three ultrasound physicians re-evaluated the images and recorded the ultrasound features and clinical characteristics of each patient with AIP or PDCA using the electronic medical record platform.
In this study, the CA 19-9 level is the only continuous ultrasound feature, the remaining features are categorical features. The categorical variables included lesion involvement (focal AIP or diffuse AIP), location (head, neck, body, or tail of the pancreas), number, boundary (clear or unclear), echo (hypoechoic, isoechoic, or hyperechoic), presence of calcification, main pancreatic duct dilation, intrahepatic bile duct dilation, abdominal pain, jaundice, fever, and blood flow signal grading (0, I, II, or III). Most patients had a blood flow grade of 0 or I, with very few patients having grade II blood flow, and no patients having grade III blood flow. Therefore, grade 0 was categorized separately from grade I and grade II blood flow. CA 19-9 level reported by Chang et al. (2014) (8), who suggested that a CA 19-9 value > 85 U/ml had the best accuracy (85.6%) in differentiating pancreatic cancer from AIP, were used in this study. We used 85 U/ml as the cut-off value to divide the patients into two groups. During the screening process, the variance filtering method was first used to exclude variables with a variance threshold of 0. LASSO regression feature screening was then used to eliminate collinear variables.
During the model training phase, 10-fold cross-validation and grid search were used to optimize the hyperparameters in the training set from Fuzhou University Affiliated Provincial Hospital. The highest-scoring hyperparameters were selected as the model parameters. The constructed model was independently evaluated using datasets from the First Affiliated Hospital of Fujian Medical University and Fujian Union Hospital. The model’s performance was evaluated by calculating the sensitivity, specificity, positive predictive value, negative predictive value, F1 score, accuracy, and AUC.
Three ultrasound radiologists with 10, 5, and 2 years of experience differentiated between AIP and PDCA using the ultrasound images and clinical information. The ultrasound radiologists were blinded to the pathological results. The sensitivity, specificity, positive predictive value, negative predictive value, F1 score, and accuracy of the ultrasound radiologists’ diagnoses were determined. The performance of the machine learning model was compared with the predictions of the experienced radiologist (10 years), the intermediately experienced radiologist (5 years), and the less experienced radiologist (2 years).
Model construction was performed using Python software (version 3.9.13), and all statistical analyses were conducted using R software (version 4.3.2). Normally distributed continuous data are expressed as the mean ± standard deviation and were compared between the two groups using the Student’s t-test. Non-normally distributed continuous data are expressed as the median [upper quartile, lower quartile] and were compared between the two groups using the Wilcoxon test. Categorical data are expressed as frequency or percentage and were compared using Fisher’s exact test. ROC curves were used to compare the models, and the DeLong test was used to compare the ROC curves. A p- value <0.05 was used to indicate statistical significance.
A total of 110 patients were included in this study, with 55 patients with AIP and 55 patients having PDCA (101 males and 9 females; age range: 36–87 years). The CA 19-9 level, presence of abdominal pain, jaundice, focal/diffuse-type AIP, and morphology were significantly different between the AIP and PDCA groups (Table 1). There were no significant differences in sex or age between the AIP and PDCA groups.
This study included 15 ultrasound clinical features: sex, age, abdominal pain, fever, jaundice, focal/diffuse type, location of the lesion, boundary, morphology, echo of the lesion, pancreatic duct dilation, calcification, intrahepatic bile duct dilation, blood flow signals, and Ca 19-9 level. PSM was used to match patients with AIP and patients with PDCA at a ratio of 1:1 based on sex and age. Almost all patients with AIP or PDCA did not have fever, and the masses were hypoechoic. Therefore, sex, age, fever, and echo were initially excluded. To understand the independent predictive performance of the ultrasound clinical features, logistic regression models were constructed separately for the remaining 11 features, and their predictive performances were evaluated using the external validation cohorts. The AUCs and accuracies of the models constructed with individual features were not high, and there was severe overfitting (Figure 3). The specific performance indicators of each feature are shown (Table 2). These findings indicate that individual ultrasound clinical features cannot be used to accurately differentiate AIP and PDCA.
Figure 3. Predictive performance of individual clinical ultrasound features in external validation cohorts.
Variance filtering was used to exclude ultrasound features with a variance threshold of 0, eliminating two features: fever and echo. LASSO regression with 10-fold cross-validation was used to exclude features with a coefficient of 0, including boundary, pancreatic duct dilation, intrahepatic bile duct, location, and calcification. The remaining six features with nonzero coefficients included abdominal pain, jaundice, focal/diffuse AIP, blood flow signals, CA 19-9 level, and morphology (Figures 4, 5). The six features were ranked by importance, with focal/diffuse-type AIP being the most important, followed by jaundice, CA 19-9 level, abdominal pain, morphology, and blood flow signals (Figure 6). (Supplementary Material) shows the distributions of these six features across the three datasets. Abdominal pain was more common in patients with PDCA than in those with AIP, and patients with PDCA had significantly higher CA 19-9 level than those with AIP. Diffuse lesions and jaundice were more common in the AIP group than in the PDCA group. PDCA lesions were more irregular in shape than AIP lesions. Both the PDCA and AIP groups exhibited poor blood supply, with most patients having grade 0-I blood flow signals. In the training cohort and external validation cohort 1, most patients in the AIP group had grade I blood flow signals, whereas most patients in the PDCA group had grade 0 blood flow signals. In external validation cohort 2, most patients in the PDCA group had grade I blood flow signals, whereas most patients in the AIP group had grade 0 blood flow signals.
Five models were used, including the support vector machine, random forest (RF), logistic regression, decision tree, and k-nearest neighbors. Using the grid search method and 10-fold cross-validation in the sklearn library, the hyperparameters of the models were optimized in the training cohort. Among the models, RF performed the best and was selected as the final model. The parameters of the RF model were set to criterion=‘gini’, max_depth=4, min_samples_leaf=4, and n_estimators=10. The model’s sensitivity, specificity, positive predictive value, negative predictive value, F1 score, accuracy, and AUC were 86.0%, 86.0%, 86.0%, 86.0%, 70.0%, 86.0%, and 95.0%, respectively, in the training cohort. In external validation cohort 1, the sensitivity, specificity, positive predictive value, negative predictive value, F1 score, accuracy, and AUC were 86.0%, 80.0%, 81.0%, 86.0%, 84.0%, 83.0%, and 89.0%, respectively. In external validation cohort 2, the sensitivity, specificity, positive predictive value, negative predictive value, F1 score, accuracy, and AUC were 72.0%, 94.0%, 93.0%, 77.0%, 81.0%, 83.0%, and 91.0%, respectively. The DeLong test revealed no significant differences between the ROC curves of the training cohort, external validation cohort 1, and external validation cohort 2 (P>0.05) (Table 3, Figure 7). The calibration bias curves of the training cohort and external validation cohorts closely overlapped with the ideal line (Figure 8).
We compared the performance differences of the model when the CA 19-9 level was included as a continuous variable or a categorical variable. The model performed better when the CA 19-9 level was included as a categorical variable with a threshold of 85 U/ml (Table 4). Considering the inconsistency in the distribution of blood flow signals in external validation cohort 2 compared to that of the training cohort and external validation cohort 1 and the potential influence of different equipment and operator adjustments, we attempted to exclude this feature. The model’s performance decreased when the blood flow signal feature was removed (Table 5).
Table 4. The model's performance when the CA 19-9 level was included as a continuous variable or a categorical variable with a threshold of 85 U/ml.
The predictive performance of experienced radiologists based on clinical information and ultrasound images showed a sensitivity of 81%, specificity of 79%, positive predictive value of 78%, negative predictive value of 76%, F1 score of 80%, and accuracy of 80%. The predictive performance of the radiologist with intermediate experience showed a sensitivity of 75%, specificity of 74%, positive predictive value of 73%, negative predictive value of 76%, F1 score of 74%, and accuracy of 75%. The predictive performance of the radiologist with two years of experience showed a sensitivity of 55%, specificity of 56%, positive predictive value of 62%, negative predictive value of 49%, F1 score of 58%, and accuracy of 55%. The classification results were visualized using a confusion matrix between the predicted and ground-truth labels (Supplementary Material). The experienced ultrasound radiologist’s evaluation indicators were superior to those of the radiologist with intermediate experience, and those of the ultrasound radiologist with intermediate experience were superior to those of the less experienced ultrasound radiologist (Figure 9).
This study included data from three medical centers and found that clinical ultrasound features have certain value in differentiating AIP and PDCA, though individual features exhibit poor performance and overfitting issues. Through variable selection, six ultrasound features were identified: focal/diffuse-type AIP, jaundice, CA 19-9 level, abdominal pain, morphology, and blood flow signals. The RF model constructed with these features can be used for the differential diagnosis of AIP and PDCA, as AUC values of 89% and 91% were obtained in the external validation cohorts 1 and 2, respectively. The model reached the classification level of experienced ultrasound radiologists, outperforming those with intermediate and low experience. Therefore, the model can be used in primary medical institutions to assist primary care doctors in differentiating AIP and PDCA and may help avoid unnecessary biopsies and surgeries. To our knowledge, this work describes the first machine learning model based on clinical ultrasound features to differentiate AIP and PDCA, and it achieves good performance in multicenter cohorts.
Standard imaging techniques for diagnosing AIP include CT, MRI, and EUS. The typical CT manifestation of AIP includes clear, low-density, cystic margins around the pancreas, and enhanced CT typically reveals diffuse pancreatic enlargement. The typical enhanced CT manifestation of pancreatic cancer is a hypovascular focal mass. However, simple pancreatic enlargement cannot accurately differentiate AIP from PDCA and other malignancies (9). MRI is necessary to visualize the pancreatic duct and its secondary branches, with typical MRCP manifestations include multiple segmental strictures of the pancreatic duct often related to pancreatic lesions. However, MRI examination is not specific for diffuse, focal, or segmental enlargement and is more costly and technically demanding compared to CT and EUS (10). EUS combines the advantages of electronic endoscopy and ultrasound, allowing for the visualization of structures not observed via endoscopy, such as hemodynamic changes, the depth of lesion tissue infiltration, and extra-luminal organs. EUS is often used to assess the staging of gastrointestinal tumors (11). Moreover, EUS-guided fine-needle aspiration (EUS-FNA) can be used to reveal pancreatic lesion pathology and has significant value in differentiating pancreatic cancer from AIP (12). However, EUS-FNA may yield insufficient tissue samples due to the small sample size, leading to false-negative results. In addition, EUS-FNA has limitations, such as the inability of some patients to tolerate endoscopy, which prevents lesion tissue acquisition or causes secondary injuries (13–15).
In addition, serological tests play an important role in distinguishing AIP and PDCA. Hamano et al. (16). reported that a serum IgG4 concentration threshold of 1.35 g/L distinguishes AIP from pancreatic cancer, with diagnostic accuracy, sensitivity, and specificity all exceeding 95%; however, elevated serum IgG4 levels are also observed in other diseases, such as tumors and infections (17). Serum CA 19-9 level increase differently inin the presence of malignant tumors, benign tumors and inflammatory responses of the digestive system, especially in patients with pancreatic cancer, making it the most commonly used biomarker for pancreatic cancer screening. Serum CA 19-9 is expressed at low levels in normal pancreatic tissue, however, its expression gradually increases when pancreatic duct epithelial cells undergo malignant transformation, and the diagnostic sensitivity for pancreatic cancer is significantly greater than that of other tumor markers. In this study, patients with PDCA had significantly higher CA 19-9 level than patients with AIP, which is consistent with the findings of Chang et al. in 2014 (8).
Abdominal ultrasound, which is convenient, non-invasive, cost-effective, and repeatable, is a routine screening method for pancreatic lesions and provides a convenient and effective method for posttreatment follow-up of AIP. Almost all patients with AIP or PDCA undergo ultrasound examination prior to other tests, and the data provided by ultrasound doctors plays an important role in subsequent clinical decision-making. It is difficult to differentiate between AIP and PDCA using routine abdominal ultrasound alone. This study revealed that while individual clinical ultrasound features are significantly different between the two groups, models constructed with single features exhibited low AUCs and accuracies as well as significant overfitting issues. Therefore, the use of individual clinical ultrasound features to differentiate AIP and PDCA is insufficient. Ultrasound radiologists typically do not use individual ultrasound features for differential diagnosis. Therefore, clinical, serological, and ultrasound features were used to construct the model, and six features were identified via feature selection. These features were (ranked by importance): focal/diffuse-type AIP, jaundice, CA 19-9 level, abdominal pain, morphology, and blood flow signals.
Patients with PDCA had focal lesions, high CA19-9 levels, abdominal pain, irregular shapes, and grade 0 blood flow more often than those with AIP and had significant jaundice less frequently than patients with AIP. AIP often presents with painless obstructive jaundice, which may be due to external compression of the pancreatic head by inflammatory processes, leading to stenosis of the bile duct in the pancreatic region or IgG4-RD with concurrent sclerosing cholangitis (18). An international multicenter survey (19) on AIP found that the incidence of obstructive jaundice in patients with AIP is 75%. The first symptom of PDCA is often abdominal pain, and with improved awareness and instrument resolution, tumors are often detected when they are smaller, making obstructive jaundice less common in patients with PDCA than in those with AIP. According to the extent of involvement, AIP can be classified into diffuse, focal, and segmental types on ultrasound. Diffuse- type AIP is the most common characterized by significant lymphoplasmacytic infiltration and fibrosis in the pancreatic parenchyma, which eliminates the normal feathery structure and causes sausage-like swelling with fat gap loss (20). PDCA is characterized by infiltrative growth, presenting as an irregularly shaped mass on imaging. The blood flow signals were often grade 0 in patients with PDCA in this study, which may be due to tumor cell infiltration and destruction of internal blood vessels, thrombosis, and arteriovenous shunting. The AIP group had relatively more patients with grade I or II blood flow signals than the PDCA group, which may be due to the pathological changes of AIP, involving inflammatory lymphocyte infiltration, interstitial fibrosis, and fibrous tissue hyperplasia without significant destruction and proliferation of microvessels within the lesion. Therefore, the blood flow signals of the lesions are not significantly different from those of the surrounding normal pancreatic tissue.
We found that experienced ultrasound radiologists have strong differentiation abilities, with an accuracy of 0.80, which is superior to that of ultrasound radiologists with intermediate and low experience. Experienced ultrasound radiologists typically rely on years of experience to identify subtle differences in sonographic images and data regarding clinical manifestations and laboratory tests to make comprehensive diagnoses, achieving high classification accuracy. However, the diagnostic performance of a less experienced ultrasound radiologist (accuracy of 55%) was equivalent to that of random guessing as AIP is rare in clinical practice, and less experienced ultrasound radiologist lack awareness and experience. The ultrasound radiologist do not sufficiently combine laboratory indicators to recognize diseases, especially AIP. Less experienced ultrasound radiologist also have less mature machine adjustment capabilities, leading to deviations in the display of some ultrasound sonographic features, such as lesion blood flow signals and morphology. Therefore, differentiating between AIP and PDCA relies heavily on the ultrasound radiologist’s experience, requiring significant time and money for training. The model constructed in this study is expected to reduce the impact of subjective judgments when using ultrasound images for diagnoses, and it can be promoted in primary medical institutions to better differentiate AIP and PDCA, reducing unnecessary biopsies and surgeries.
Limitations of our study:
1. Despite collecting data from three centers over a ten-year follow-up period, the sample size remains limited, which increases the risk of overfitting in the machine learning models. Future studies should aim to include larger sample sizes and conduct multicenter validation to enhance the model’s generalizability.
2. The ultrasound features used in the model—such as focal/diffuse-type AIP, jaundice, CA 19-9 level, abdominal pain, morphology, and blood flow signals—are generally accessible in clinical settings. However, blood flow signals may vary due to differences in equipment or operator technique, potentially introducing bias into the model.
3. The model requires the interpretation of ultrasound features by a clinician, which necessitates professional expertise and may limit its widespread adoption, particularly in settings with less specialized staff. As data volume increases, AI could potentially assist in further refining the differentiation between AIP and PDCA, reducing the reliance on specialized knowledge.
4. Combining the predictive outputs of the machine learning algorithm with serological tests, such as IgG4 detection, could enhance the model’s diagnostic accuracy in distinguishing AIP from PDCA. Future prospective studies should validate this integrated approach.
The Random Forest (RF) model developed using clinical ultrasound features demonstrates significant clinical value in differentiating AIP from PDCA. Further validation and refinement of this model with larger and more diverse datasets will be crucial for its broader clinical application.
The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.
The studies involving humans were approved by Ethics Committee of Fujian Provincial Hospital Branch for Medical Research and Clinical Technology Application Ethics Committee of First Affiliated Hospital of Fujian Medical University. The studies were conducted in accordance with the local legislation and institutional requirements. The human samples used in this study were acquired from primarily isolated as part of your previous study for which ethical approval was obtained. Written informed consent for participation was not required from the participants or the participants’ legal guardians/next of kin in accordance with the national legislation and institutional requirements.
LZ: Writing – original draft, Writing – review & editing, Data curation, Formal analysis. XaC: Data curation, Writing – original draft. ZC: Validation, Writing – review & editing. WC: Data curation, Writing – original draft. JZ: Writing – original draft, Data curation. MZ: Data curation, Writing – original draft. XiC: Supervision, Validation, Writing – review & editing.
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was supported by Fujian Provincial Natural Science Foundation of China (2024J08249).
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
The author(s) declare that no Generative AI was used in the creation of this manuscript.
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2025.1505376/full#supplementary-material
1. Rahib L, Smith BD, Aizenberg R, Rosenzweig AB, Fleshman JM, Matrisian LM, et al. Projecting g cancer incidence and deaths to 2030:the unexpected burden of thyroid, liver, and pancreas cancers in the United States. Cancer Res. (2014) 74:2913–21. doi: 10.1158/0008-5472.CAN-14-0155
2. Yoshida K, Toki F, Takeuchi T, Watanabe S, Shiratori K, Hayashi N, et al. Chronic pancreatitis caused by an autoimmune abnormality. Proposal of the concept of autoimmune pancreatitis. Dig Dis Sci. (1995) 40:1561–8. doi: 10.1007/BF02285209
3. Deng H, Zhao S, Yue Y, Shan L, Liu L, Zhang Y, et al. IgG4-related disease of pul-monary artery causing pulmonary hypertension. MEDI- CINE. (2018) 97:e10698. doi: 10.1097/MD.0000000000010698
4. Shimosegawa T, Chari ST, Frulloni L, Kamisawa T, Kawa S, Mino-Kenudson M, et al. International consensus diagnostic criteria for autoimmune pancreatitis: Guidelines of the International Association of Pancreatology. Pancreas. (2011) 40:352–8. doi: 10.1097/MPA.0b013e3182142fd2
5. Deheragoda MG, Church NI, Rodriguez-Justo M, Munson P, Sandanayake N, Seward EW, et al. The use of immunoglobulin g4 immunostaining in diagnosing pancreatic and extrapancreatic involvement in autoimmune pancreatitis. Clin Gastroenterol Hepatol. (2007) 5:1229–34. doi: 10.1016/j.cgh.2007.04.023
6. Hart PA, Kamisawa T, Brugge WR, William R, et al. Long-term outcomes of autoimmune pancreatitis: a multicentre, international analysis. Gut. (2013) 62:1771–6. doi: 10.1136/gutjnl-2012-303617
7. Alder DD, Carson PL, Rubin JM, Quinn SF, Haller JO, Diaconis JN, et al. Doppler ultra- sound color flow imaging in the study of breast cancer:Prelimi- nary findings. Ultrasound Med Biol. (1990) 16:553–9.
8. Chang M C, Liang PC, Jan S, Yang CY, Kuo SH, Wang HP, et al. Increase diagnostic accuracy in differentiating focal type autoimmune pancreatitis from pancreatic cancer with combined serum IgG4 and CA19-9 levels. Pancreatology. (2014) 14:366–72. doi: 10.1016/j.pan.2014.07.010
9. Vasaitis L. IgG4-related disease: A relatively new concept for clinicians. Eur J Intern Med. (2016) 27:1–9. doi: 10.1016/j.ejim.2015.09.022
10. Iglesias-García J, Lindkvist Björn, Lariño-Noia José, Abdulkader-Nallib I, Domínguez-Muñoz JE, Caamaño-Freire M, et al. The role of EUS in relation to other imaging modalities in the differential diagnosis between mass forming chronic pancreatitis,autoimmune pancreatitis and ductal pancreatic adenocarcinoma. Rev Esp Enferm Dig. (2012) 104:315–21.
11. Farrell JJ, Garber J, Sahani D, Brugge WR, Deshpande V, Mino-Kenudson M, et al. EUS findings in patients with autoimmune pancreatitis. Gastrointest Endosc. (2004) 60:927–36. doi: 10.1016/S0016-5107(04)02230-8
12. Lee S, Kim JH, Kim SY, Byun JH, Kim HJ, Lee MG, et al. Comparison of diagnostic performance between CT and MRI in differentiating non-diffuse-type autoimmune pancreatitis from pancreatic ductal adenocarcinoma. Eur Radiol. (2018) 28:5267–74. doi: 10.1007/s00330-018-5565-1
13. Dite P, Novotny I, Dvorackova J, Kala Z, Vanasek T, Zadorova Z, et al. Pancreatic solid focal lesions: differential diagnosis between autoimmune pancreatitis and pancreatic cancer. Dig Dis. (2019) 37:416–21. doi: 10.1159/000499762
14. Ishikawa T, Kawashima H, Ohno E, Ishida K, Iida T, Suzuki H, et al. Usefulness of endoscopic ultrasound-guided fine-needle biopsy for the diagnosis of autoimmune pancreatitis using a 22-gauge Franseen needle:a prospective multicenter study. Endoscopy. (2020) 52:978–85. doi: 10.1055/a-1155-9270
15. Krishna NB, LaBundy JL, Saripalli S, Safdar R, Agarwal B, Arnoletti JP, et al. Diagnostic value of EUS- FNA in patients suspected of having pancreatic cancer with a focal lesion on CT scan/MRI but without obstructive jaundice. Pancreas. (2009) 38:625–30. doi: 10.1097/MPA.0b013e3181ac35d2
16. Hamano H, Kawa S, Horiuchi A, Unno H, Furuya N, Akamatsu T, et al. High srum IgG4 concentrations in patients with sclerosing pancreatitis. N Engl J Med. (2001) 344:732–8. doi: 10.1056/NEJM200103083441005
17. Carruthers MN, Khosroshahi A, Augustin T, Deshpande V, Stone JH, Perugino CA, et al. The diagnostic utility of serum IgG4 concentrations in IgG4-related disease. Ann Rheumatic Dis. (2015) 74:14–8. doi: 10.1136/annrheumdis-2013-204907
18. Okazaki K, Uchida K. Current perspectives on autoimmune pancreatitis and IgG4 - related disease. Proc Jpn Acad Ser B Phys Biol Sci. (2018) 94:412–27. doi: 10.2183/pjab.94.027
19. Kamisawa T, Chari ST, Giday SA, Kim MH, Chung JB, Okazaki K, et al. Clinical profile of autoimmune pancreatitis its histological subtypes: an inierational multicenter survey. Pancreas. (2011) 40:809–14. doi: 10.1097/MPA.0b013e3182258a15
Keywords: pancreatic ductal adenocarcinoma, autoimmune pancreatitis, ultrasound clinical features, reader study, random forest
Citation: Zhang L, Chen X, Chen Z, Chen W, Zheng J, Zhuo M and Chen X (2025) Differentiating pancreatic ductal adenocarcinoma and autoimmune pancreatitis using a machine learning model based on ultrasound clinical features. Front. Oncol. 15:1505376. doi: 10.3389/fonc.2025.1505376
Received: 02 October 2024; Accepted: 31 January 2025;
Published: 19 February 2025.
Edited by:
Abhishek Mahajan, The Clatterbridge Cancer Centre, United KingdomReviewed by:
Wenming Cao, City University of Hong Kong, Hong Kong SAR, ChinaCopyright © 2025 Zhang, Chen, Chen, Chen, Zheng, Zhuo and Chen. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Xing Chen, amlhbnl1aGFvaGFpMjAyNEBmam11LmVkdS5jbg==
†These authors have contributed equally to this work and share first authorship
Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
Research integrity at Frontiers
Learn more about the work of our research integrity team to safeguard the quality of each article we publish.