Skip to main content

SYSTEMATIC REVIEW article

Front. Oncol., 18 June 2024
Sec. Head and Neck Cancer

Application of CT and MRI images based on artificial intelligence to predict lymph node metastases in patients with oral squamous cell carcinoma: a subgroup meta-analysis

Cheng DengCheng DengJun HuJun HuPing TangPing TangTao XuTao XuLing HeLing HeZesheng ZengZesheng ZengJianfeng Sheng*Jianfeng Sheng*
  • Department of Thyroid, Head, Neck and Maxillofacial Surgery, the Third Hospital of Mianyang & Sichuan Mental Health Center, Mianyang, Sichuan, China

Background: The performance of artificial intelligence (AI) in the prediction of lymph node (LN) metastasis in patients with oral squamous cell carcinoma (OSCC) has not been quantitatively evaluated. The purpose of this study was to conduct a systematic review and meta-analysis of published data on the diagnostic performance of CT and MRI based on AI algorithms for predicting LN metastases in patients with OSCC.

Methods: We searched the Embase, PubMed (Medline), Web of Science, and Cochrane databases for studies on the use of AI in predicting LN metastasis in OSCC. Binary diagnostic accuracy data were extracted to obtain the outcomes of interest, namely, the area under the curve (AUC), sensitivity, and specificity, and compared the diagnostic performance of AI with that of radiologists. Subgroup analyses were performed with regard to different types of AI algorithms and imaging modalities.

Results: Fourteen eligible studies were included in the meta-analysis. The AUC, sensitivity, and specificity of the AI models for the diagnosis of LN metastases were 0.92 (95% CI 0.89–0.94), 0.79 (95% CI 0.72–0.85), and 0.90 (95% CI 0.86–0.93), respectively. Promising diagnostic performance was observed in the subgroup analyses based on algorithm types [machine learning (ML) or deep learning (DL)] and imaging modalities (CT vs. MRI). The pooled diagnostic performance of AI was significantly better than that of experienced radiologists.

Discussion: In conclusion, AI based on CT and MRI imaging has good diagnostic accuracy in predicting LN metastasis in patients with OSCC and thus has the potential for clinical application.

Systematic Review Registration: https://www.crd.york.ac.uk/PROSPERO/#recordDetails, PROSPERO (No. CRD42024506159).

1 Introduction

Oral squamous cell carcinoma (OSCC) is the most common type of oral cancer, accounting for approximately 90% of oral cancer cases and with a poor prognosis (1, 2). Assessing the lymph node (LN) status determines the staging, management, and ultimately the survival outcomes of patients with OSCC (35). Cervical LN metastasis is associated with poor prognosis and is one of the most important independent prognostic factors in OSCC (6, 7). The dissection of metastatic LNs at the time of resection of the primary tumor can significantly reduce the rate of regional recurrence and enhance the survival of patients with OSCC (8, 9). Thus, the accurate determination of the clinical LN status is critical for the treatment and prognosis of these patients. Computed tomography (CT) and magnetic resonance imaging (MRI) are widely used to evaluate the status of cervical LNs in patients with OSCC (10, 11). However, the diagnostic accuracy of these methods is affected by multiple factors and remains inadequate for assessing the LN status in these patients (12, 13).

Artificial intelligence (AI) has recently been applied to the evaluation of radiology images, as it excels at automatically recognizing complex patterns in imaging data and providing quantitative, rather than qualitative, assessments of radiographic characteristics (14). Currently, radiological feature-based AI plays an important role in tumor diagnosis and staging, as well as in predicting the treatment response and prognosis, demonstrating its potential as a non-invasive auxiliary tool for personalized medicine (15, 16). In recent years, several studies have reported the application of AI algorithms [machine learning (ML) or deep learning (DL)] in predicting LN metastasis in OSCC patients, and they exhibited promising performance. However, available information on the use of AI-based methods for the prediction of OSCC LN status is scattered, and thus, a systematic review to evaluate and summarize the prediction performance of AI-based methods using CT and MRI is needed.

Thus, the purpose of this study was to conduct a systematic review and meta-analysis of published data on the performance of CT and MRI based on AI algorithms for predicting LN metastases in patients with OSCC. In addition, to elaborate on the predictive performance of AI and increase the reliability of the evidence, a subgroup meta-analysis was performed by grouping studies according to the type of AI algorithm and image modality.

2 Methods

A systematic search of the Cochrane Library, PubMed (Medline), Embase, and Web of Science was performed using Preferred Reporting Items for Systematic Reviews and Meta-analysis (PRISMA) guidelines (17). This study was prospectively registered in PROSPERO (No. CRD42024506159).

2.1 Search strategy

All potentially relevant studies from the beginning up to February 2024 were identified. The following MeSH terms were used: “oral squamous cell carcinoma,” “oral tongue squamous cell carcinoma,” “head and neck squamous cell carcinoma,” “artificial intelligence,” “deep learning,” “convolutional neural network,” “machine learning,” “automatic detection,” “radiomics,” “radiomic,” “CT”, “MRI,” “lymph node,” and “lymph node metastasis.” The details of the search formula are shown in Supplementary Table 1: “Search strategy.” Additional studies were identified by hand-searching reference lists of all relevant articles. Any disagreements in the results of the search process between two authors were resolved by a discussion or consultation with a third author (the corresponding author).

2.2 Selection criteria

Articles were included if they met the following criteria: 1) included patients with histopathological diagnosis of OSCC; 2) developed or used ML or DL to assess CT and MRI preoperative lymph node metastasis prediction; 3) can estimate the values of true-positive (TP), false-positive (FP), false-negative (FN), and true-negative (TN); and 4) published in the English language. The exclusion criteria were as follows: 1) studies with valid outcomes data that could not be extracted; 2) reviews, guidelines, and meta-analyses; 3) animal experiments, case reports, abstracts, conference proceedings, or expert opinions; and 4) duplicate publications.

2.3 Data extraction

Two authors independently extracted the data from each included study and cross-checked the extracted data. If the data were unclear, the corresponding author of the study was contacted by email to obtain insight into the original data set. Any disagreements in the results of the search process between the two authors were resolved by a discussion or consultation with a third author (the corresponding author).

The following data were then extracted from each study: study type, area of the study, total patient number, sample size for diagnostic accuracy, targeted area, image modality, AI algorithm, and reference gold standard. To obtain diagnostic accuracy data, we extracted TP, FP, TN, FN, and area under the receiver operating curve (AUC) along with other parameters of the AI models. If the included studies presented comparative diagnostic performance of AI models versus radiologists, the TP, FP, FN, and TN values of the radiologists in each study were also extracted.

For subgroup analysis, the authors extracted the following variables from each included study: type of AI algorithms (ML or DL) and type of image modality (CT or MRI).

2.4 Quality assessment

Two authors assessed the methodological quality of the final included articles using the second version of the Quality Assessment of Diagnostic Accuracy Studies (QUADAS-2). Two authors assessed each part as having a high, low, or unclear risk of bias.

2.5 Publication bias

Publication bias was evaluated using the Deek funnel plot asymmetry test.

2.6 Statistical analysis

For the quantitative meta-analysis, TP, FP, TN, and FN extracted from the test set were used. If results were not reported in an independent test set, cross-validation results are reported. When different AI models were tested within the same paper, the proposed model in the paper with the highest diagnostic performance was used for pooled analysis and the needed model was used for subgroup analysis, respectively. Sensitivity analyses were carried out by sequentially removing individual studies to evaluate the robustness of the pooled results. Additionally, a pooled analysis was performed to estimate the accuracy of the radiologist’s assessment derived from studies that reported this. The corresponding AUC, sensitivities, and specificities of the radiologists were extracted in the same way as described above.

Statistical analyses were performed with STATA (version 15.1; Stata Corporation, College Station, TX, USA) software, including the packages metandi and midas. The bivariate method and the hierarchical summary receiver operating characteristic (HSROC) method were applied for meta-analysis (18, 19). A forest plot of the sensitivity and specificity and a summary receiver-operating characteristic (SROC) curve were generated using the bivariate method and HSROC method, respectively. To assess heterogeneity between studies, the inconsistency index (I2) was used. I2 values below 50% indicated low heterogeneity, while values above 50% indicated substantial heterogeneity (20). All tests were two-tailed tests and a difference of P <0.05 was considered statistically significant. Differences between subgroups were assessed by inspection of the subgroups’ confidence intervals. Non-overlapping confidence intervals for any two subgroups indicated a statistically significant difference between the subgroups (21, 22).

3 Results

3.1 Study selection

A total of 219 studies were identified and 138 remained after removing duplicates. A review of the titles and abstracts left 28 studies for full-text review. Finally, 14 (1932) articles were included in the systematic review and meta-analysis. The flowchart of the search and screening results for the relevant studies is shown in Figure 1.

Figure 1
www.frontiersin.org

Figure 1 Steps of the process of screening documents.

3.2 Study characteristics

The characteristics of the included studies are summarized in Table 1. The included studies were published between 2019 and 2023 and all of them were retrospective. Seven studies were performed in China and the rest were performed in Japan (five studies) and Italy (two studies). In terms of AI algorithms, five studies used the DL algorithm and 10 studies used the ML algorithm (one of them used both AI algorithms and separately evaluated the performance of the fusion model). For radiomic features, LNs or tumor features in MRI were extracted by five studies and features in CT were extracted by nine studies. Regarding the extracting method, only two studies automatically extracted the feature, while others extracted the feature manually by experienced radiologists. Seven studies presented the comparative diagnostic performance of AI models versus experienced radiologists.

Table 1
www.frontiersin.org

Table 1 Characteristics of the included studies.

3.3 Quality assessment and publication bias

According to the QUADAS-2 tool, the overall risk of bias in the selection of patients was high in one (7%) study and low in 13 (93%) studies. Flow and timing was assessed in only five (36%) studies with a low risk of bias. Overall applicability concerns were low except for one (7%) study with a high risk of bias in the patient selection. Individual evaluation of the risk of bias and its applicability is shown in Figure 2. The Deek funnel plot exhibited a symmetrical shape with respect to the regression line (Figure 3), and the asymmetry test showed no evidence of publication bias (P = 0.97). The detailed results of the quality assessment for each study are shown in Supplementary Table 2: “Quality assessment.”

Figure 2
www.frontiersin.org

Figure 2 Quality assessment of the 14 included studies by the QUADAS-2 tool. “+” denotes a low risk of bias, “?” denotes an unclear risk of bias, and “−” denotes a high risk of bias.

Figure 3
www.frontiersin.org

Figure 3 Deek funnel plot of artificial intelligence (AI) models for the prediction of lymph node (LN) metastasis.

3.4 Meta-analysis of diagnostic accuracy

Among the 14 studies for the meta-analysis, the pooled AUC, sensitivity, specificity, positive likelihood ratio, negative likelihood ratio, and DOR of AI models for the diagnosis of LN metastases were 0.92 (95% CI 0.89–0.94), 0.79 (95% CI 0.72–0.85), 0.90 (95% CI 0.86–0.93), 7.9 (95% CI 5.5–11.4), 0.23 (95% CI 0.16–0.32), and 35 (95% CI 19–62), respectively (Table 2). The SROC curve with a 95% confidence region and prediction region is illustrated in Figure 4. The forest plots of sensitivity and specificity are illustrated in Figure 5. To investigate the clinical utility of the AI models, a Fagan nomogram was generated. Assuming a 50% prevalence of LN metastasis, the Fagan nomogram shows that the posterior probability of LN metastasis was 89% if the test was positive, and the posterior probability of LN metastasis was 19% if the test was negative (Figure 6). For sensitivity analyses, after each exclusion of a single study, there was no large variation in the results, suggesting the stability of the findings; however, high heterogeneity among the studies remains.

Table 2
www.frontiersin.org

Table 2 Summary of results.

Figure 4
www.frontiersin.org

Figure 4 The summary receiver-operating characteristic (SROC) curves of AI models for the prediction of LN metastasis.

Figure 5
www.frontiersin.org

Figure 5 Coupled forest plots of sensitivity and specificity in AI models for the prediction of LN metastasis.

Figure 6
www.frontiersin.org

Figure 6 The Fagan nomogram for the prediction of LN metastasis. LR, likelihood ratio; Post_Prob_Pos, the posterior probability of LN metastasis if the finding of the model was positive; Post_Prob_Neg, the posterior probability of LN metastasis if the finding of the model was negative.

3.5 Subgroup analysis of different types of AI algorithms and image modalities

For the subgroup analysis of different types of AI algorithms, 10 studies used the ML algorithm and five studies used the DL algorithm. The pooled AUC, sensitivity, and specificity of the ML model for predicting LN metastases were 0.91 (95% CI 0.88–0.93), 0.84 (95% CI 0.74–0.90), and 0.87 (95% CI 0.82–0.90), respectively. The pooled AUC, sensitivity, and specificity of the DL model for predicting LN metastases were 0.92 (95% CI 0.89–0.94), 0.74 (95% CI 0.44–0.91), and 0.90 (95% CI 0.81–0.95), respectively. Our results showed that both ML and DL algorithms have the potential to predict the LN metastasis and no significant difference was found between the ML model and the DL model (Supplementary Figures 1–4).

For the subgroup analysis of different types of image modality, nine studies used CT features to train AI models, and the pooled AUC, sensitivity, and specificity for predicting LN metastases were 0.94 (95% CI 0.92–0.96), 0.81 (95% CI 0.62–0.92), and 0.92 (95% CI 0.87–0.95), respectively. Five studies used MRI features to train AI models, and the pooled AUC, sensitivity, and specificity for predicting LN metastases were 0.89 (95% CI 0.86–0.91), 0.84 (95% CI 0.69–0.93), and 0.84 (95% CI 0.77–0.90), respectively. The pooled results showed that the AI model based on CT has the potential to predict LN metastasis, and the pooled AUC of the AI model based on MRI for predicting LN metastases in OSCC was significantly lower than that of the model based on CT (Supplementary Figures 5–8).

The outcomes of the subgroup analysis are shown in Table 2; Supplementary Figures.

3.6 Comparison of diagnostic performances with experienced radiologists

Six studies with sufficient data compared the diagnostic performance of AI models and experienced radiologists. The pooled AUC, sensitivity, and specificity of the AI model for predicting LN metastases were 0.93 (95% CI 0.90–0.95), 0.83 (95% CI 0.73–0.90), and 0.90 (95% CI 0.82–0.95), respectively. For the experienced radiologists, the pooled AUC, sensitivity, and specificity for predicting LN metastases were 0.81 (95% CI 0.78–0.85), 0.73 (95% CI 0.65–0.79), and 0.90 (95% CI 0.83–0.95), respectively. The pooled results showed that experienced radiologists have a good ability to predict LN metastasis, but the pooled AUC for predicting LN metastases was significantly lower than that of the AI model (Supplementary Figures 9–12).

The comparison outcomes are shown in Table 2; Supplementary Figures.

4 Discussion

To the best of our knowledge, this is the first meta-analysis to quantitatively assess the diagnostic value of AI in the prediction of LN metastasis in patients with OSCC. The findings showed that AI based on CT and MRI images performed extremely well in predicting LN metastasis in these patients. The respective pooled sensitivity and specificity values of 0.79 and 0.90, the AUC value of 0.92, and the practical values in the Fagan nomogram indicated the potential for using AI models in clinical practice.

At present, the surgical strategy for OSCC with LN metastasis (LNM) includes both radical resection of the primary tumor and dissection of the neck LNs to varying degrees (37). Unnecessary surgical dissection of non-metastatic LNs can lead to postoperative complications, while delayed dissection of metastatic LNs can result in tumor progression. Therefore, the accurate evaluation of LN status is closely associated with the prediction of prognosis and the choice of surgical strategy. Although ultrasound-guided fine-needle aspiration is now performed for determining LN status with high specificity, the method is invasive and has major drawbacks such as reliance on the skill and experience of the surgeon, sampling error, and post-biopsy complications (3840). CT and MRI are widely used non-invasive tools for diagnosing LN metastasis in patients with OSCC. However, even when interpreted by experienced radiologists, both imaging modalities can show poor efficacy in the diagnosis of LNM, with sensitivity and specificity values of CT between 67%–77% and 68%–72%, respectively, while the sensitivity and specificity values of MRI were 0.66% and 0.68%, respectively (10, 41).

In recent years, medical imaging-based AI techniques have been developed and deployed in clinics. AI technology is based on large-scale data training and deep learning algorithms, which can extract features from medical images and perform accurate analysis and judgment (42). Radiomics is one of the hand-crafted feature-based models that allow high-throughput mining of quantitative image features in medical imaging, which are then used as inputs for ML models that are trained to classify patients in ways that can support clinical decision-making (14). DL represents a fundamentally different paradigm to ML and can automatically extract the higher-level features from medical images without human intervention, thus precisely preserving both the objectivity and nature of the data, and has achieved an outstanding performance in various medical tasks (43, 44). Compared to conventional diagnostic methods, diagnostic models using AI algorithms have the advantages of reproducibility, objectivity, and immediacy. Currently, ML and DL models based on medical imaging are being actively evaluated for the determination of LN status and are showing great potential (4547).

Our findings revealed that AI algorithms exhibited commendable performance with an AUC of 0.92, significantly surpassing the performance of established conventional diagnostic imaging methods. This is better than the performance of experienced radiologists reported in six of the included studies (AUC of 0.81). This may be associated with the faster image-processing rates, the ability to work continuously, and the recognition of certain imaging features that could not be detected by radiologists. Considering the real-life clinical adaptation of using both radiologists and AI models, an AI–radiologist combination would drive developments in the AI field and reduce the burden on the healthcare system. For example, AI may be considered a virtual assistant that can assist the radiologist in increasing both productivity and diagnostic accuracy, which would be particularly useful for less-experienced radiologists.

ML is a branch of AI and DL is a subset of ML. In contrast to ML, DL learns directly by navigating the data space without the need for explicit feature predefinition or selection (14). Given the growing number of applications of AI in medical imaging, several studies have compared DL algorithms with those of ML and have reported substantial improvements in the performance with DL (48, 49). In this meta-analysis, most of the included studies employed ML (n = 10), with only five studies using DL (one study used both AI algorithms). The pooled results showed that both ML and DL algorithms have a good ability to predict LN metastasis, and no significant difference was found between the two algorithms. It is undeniable that DL algorithms based on automatic segmentation can significantly reduce the time spent on manual labeling. However, the requirement for more data is a limiting factor due to its being more prone to overfitting. Chen et al. (33) reported that a DL–ML fusion model improved the preoperative identification of LN metastasis in OSCC and outperformed other single-algorithm models. Thus, fusion modeling may also be a trend for AI applications, allowing AI to meet the stringent requirements for clinical utility in the future.

CT and MRI are effective tools for the diagnosis of LN metastasis in patients with OSCC. In the context of AI, it is not clear whether there is a difference between CT and MRI in diagnosing LN metastasis. In our subgroup analysis, we found that CT-based AI models had a higher pooled AUC than the MRI-based models [0.94 (95% CI 0.92–0.96) vs. 0.89 (95% CI 0.86–0.91)], and there was no significant difference in terms of sensitivity and specificity, possibly because AI can extract more imaging features about LN metastasis from CT. It is also possible that AI models with better predictive performance may be developed in the future based on CT. Nevertheless, MRI is better able to accurately evaluate submucosal diffusion and invasion of adjacent structures while defining the LN status due to its high resolution of soft tissue (50, 51). Therefore, the selection of optimal imaging tools requires a comprehensive consideration of patient characteristics, cost-effectiveness, and clinical needs.

There were several limitations in this meta-analysis. First, because all included studies were retrospective in nature, potential bias, such as case selection, could not be fully eliminated from the analysis. Second, varying degrees of heterogeneity were present in both the pooled analysis and subgroup analysis. It is assumed that the high heterogeneity can be attributed to variations in the different patient populations, scanner technology, sample sizes, and targeted areas. Thus, the summary estimate values must be interpreted with caution. Third, most of the included studies were performed in Asia, and there is thus a potential for geographical bias. Last, few studies have conducted external tests to verify the performance of the AI models. Due to the unique characteristics of patients in different institutions, AI models developed using data from a single institution are usually limited in their broader implementation. Hence, high-quality multicenter prospective studies are needed to overcome biases in AI implementation and verify the prediction performance of AI models in the future.

In summary, the results of the analysis suggested that the use of AI based on CT and MRI for predicting LN metastasis in patients with OSCC has significant potential. More high-quality AI research under more stringent benchmarks is needed for clinical practice in the future.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Author contributions

CD: Data curation, Formal analysis, Visualization, Writing – original draft. JH: Data curation, Formal analysis, Writing – original draft. PT: Visualization, Writing – review & editing. TX: Data curation, Writing – review & editing. LH: Visualization, Writing – review & editing. ZZ: Writing – review & editing. JS: Conceptualization, Methodology, Writing – review & editing.

Funding

The author(s) declare that no financial support was received for the research, authorship, and/or publication of this article.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2024.1395159/full#supplementary-material

References

1. Pulte D, Brenner H. Changes in survival in head and neck cancers in the late 20th and early 21st century: a period analysis. Oncologist. (2010) 15:994–1001. doi: 10.1634/theoncologist.2009-0289

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Lingen MW, Kalmar JR, Karrison T, Speight PM. Critical evaluation of diagnostic aids for the detection of oral cancer. Oral Oncol. (2008) 44:10–22. doi: 10.1016/j.oraloncology.2007.06.011

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Zhao TC, Liang SY, Ju WT, Fu Y, Zhou ZH, Wang LZ, et al. High-risk lymph node ratio predicts worse prognosis in patients with locally advanced oral cancer. J Oral Pathol Med. (2020) 49:787–95. doi: 10.1111/jop.13043

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Worthington HV, Bulsara VM, Glenny AM, Clarkson JE, Conway DI, Macluskey M. Interventions for the treatment of oral cavity and oropharyngeal cancers: surgical treatment. Cochrane Database Syst Rev. (2023) 8:Cd006205. doi: 10.1002/14651858

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Matos LL, Guimarães YLM, Leite AK, Cernea CR. Management of stage III oral cavity squamous cell carcinoma in light of the new staging system: a critical review. Curr Oncol Rep. (2023) 25:107–13. doi: 10.1007/s11912-022-01353-6

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Voss JO, Freund L, Neumann F, Mrosk F, Rubarth K, Kreutzer K, et al. Prognostic value of lymph node involvement in oral squamous cell carcinoma. Clin Oral Investig. (2022) 26:6711–20. doi: 10.1007/s00784-022-04630-7

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Arduino PG, Carrozzo M, Chiecchio A, Broccoletti R, Tirone F, Borra E, et al. Clinical and histopathologic independent prognostic factors in oral squamous cell carcinoma: a retrospective study of 334 cases. J Oral Maxillofac Surg. (2008) 66:1570–9. doi: 10.1016/j.joms.2007.12.024

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Abu-Ghanem S, Yehuda M, Carmel NN, Leshno M, Abergel A, Gutfeld O, et al. Elective neck dissection vs observation in early-stage squamous cell carcinoma of the oral tongue with no clinically apparent lymph node metastasis in the neck: A systematic review and meta-analysis. JAMA Otolaryngol Head Neck Surg. (2016) 142:857–65. doi: 10.1001/jamaoto.2016.1281

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Ren ZH, Xu JL, Li B, Fan TF, Ji T, Zhang CP, et al. Elective versus therapeutic neck dissection in node-negative oral cancer: Evidence from five randomized controlled trials. Oral Oncol. (2015) 51:976–81. doi: 10.1016/j.oraloncology.2015.08.009

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Thoenissen P, Heselich A, Burck I, Sader R, Vogl T, Ghanaati S. The role of magnetic resonance imaging and computed tomography in oral squamous cell carcinoma patients' preoperative staging. Front Oncol. (2023) 13:972042. doi: 10.3389/fonc.2023.972042

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Lwin CT, Hanlon R, Lowe D, Brown JS, Woolgar JA, Triantafyllou A, et al. Accuracy of MRI in prediction of tumour thickness and nodal stage in oral squamous cell carcinoma. Oral Oncol. (2012) 48:149–54. doi: 10.1016/j.oraloncology.2011.11.002

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Madsen CB, Rohde M, Gerke O, Godballe C, Sørensen JA. Diagnostic accuracy of up-front PET/CT and MRI for detecting cervical lymph node metastases in T1-T2 oral cavity cancer-A prospective cohort study. Diagnostics (Basel). (2023) 13:3414. doi: 10.3390/diagnostics13223414

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Bae MR, Roh JL, Kim JS, Lee JH, Cho KJ, Choi SH, et al. (18)F-FDG PET/CT versus CT/MR imaging for detection of neck lymph node metastasis in palpably node-negative oral cavity cancer. J Cancer Res Clin Oncol. (2020) 146:237–44. doi: 10.1007/s00432-019-03054-3

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Hosny A, Parmar C, Quackenbush J, Schwartz LH, Aerts HJWL. Artificial intelligence in radiology. Nat Rev Cancer. (2018) 18:500–10. doi: 10.1038/s41568-018-0016-5

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Bera K, Braman N, Gupta A, Velcheti V, Madabhushi A. Predicting cancer outcomes with radiomics and artificial intelligence in radiology. Nat Rev Clin Oncol. (2022) 19:132–46. doi: 10.1038/s41571-021-00560-7

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Zhang YP, Zhang XY, Cheng YT, Li B, Teng XZ, Zhang J, et al. Artificial intelligence-driven radiomics study in cancer: the role of feature engineering and modeling. Mil Med Res. (2023) 10:22. doi: 10.1186/s40779-023-00458-8

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Liberati A, Altman DG, Tetzlaff J, Mulrow C, Gøtzsche PC, Ioannidis JP, et al. The PRISMA statement for reporting systematic reviews and meta-analyses of studies that evaluate health care interventions: explanation and elaboration. J Clin Epidemiol. (2009) 62:e1–34. doi: 10.1016/j.jclinepi.2009.06.006

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Reitsma JB, Glas AS, Rutjes AW, Scholten RJ, Bossuyt PM, Zwinderman AH. Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnostic reviews. J Clin Epidemiol. (2005) 58:982–90. doi: 10.1016/j.jclinepi.2005.02.022

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Rutter CM, Gatsonis CA. A hierarchical regression approach to meta-analysis of diagnostic test accuracy evaluations. Stat Med. (2001) 20:2865–84. doi: 10.1002/sim.942

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Higgins JP, Thompson SG. Quantifying heterogeneity in a meta-analysis. Stat Med. (2002) 21:1539–58. doi: 10.1002/sim.1186

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Wei S, Wo BL, Qi HP, Xu H, Luo ZC, Roy C, et al. Early amniotomy and early oxytocin for prevention of, or therapy for, delay in first stage spontaneous labour compared with routine care. Cochrane Database Syst Rev. (2013) 8):Cd006794. doi: 10.1002/14651858

CrossRef Full Text | Google Scholar

22. Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, Welch VA eds. Cochrane Handbook for Systematic Reviews of Interventions. 2nd Edition. Chichester (UK): John Wiley & Sons (2019).

Google Scholar

23. Ariji Y, Fukuda M, Kise Y, Nozawa M, Yanashita Y, Fujita H, et al. Contrast-enhanced computed tomography image assessment of cervical lymph node metastasis in patients with oral cancer by using a deep learning system of artificial intelligence. Oral Surg Oral Med Oral Pathol Oral Radiol. (2019) 127:458–63. doi: 10.1016/j.oooo.2018.10.002

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Tomita H, Yamashiro T, Heianna J, Nakasone T, Kimura Y, Mimura H, et al. Nodal-based radiomics analysis for identifying cervical lymph node metastasis at levels I and II in patients with oral squamous cell carcinoma using contrast-enhanced computed tomography. Eur Radiol. (2021) 31:7440–9. doi: 10.1007/s00330-021-07758-4

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Tomita H, Yamashiro T, Heianna J, Nakasone T, Kobayashi T, Mishiro S, et al. Deep learning for the preoperative diagnosis of metastatic cervical lymph nodes on contrast-enhanced computed toMography in patients with oral squamous cell carcinoma. Cancers (Basel). (2021) 13:600. doi: 10.3390/cancers13040600

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Yuan Y, Ren J, Tao X. Machine learning-based MRI texture analysis to predict occult lymph node metastasis in early-stage oral tongue squamous cell carcinoma. Eur Radiol. (2021) 31:6429–37. doi: 10.1007/s00330-021-07731-1

PubMed Abstract | CrossRef Full Text | Google Scholar

27. Ariji Y, Kise Y, Fukuda M, Kuwada C, Ariji E. Segmentation of metastatic cervical lymph nodes from CT images of oral cancers using deep-learning technology. Dentomaxillofac Radiol. (2022) 51:20210515. doi: 10.1259/dmfr.20210515

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Committeri U, Fusco R, Di Bernardo E, Abbate V, Salzano G, Maglitto F, et al. Radiomics metrics combined with clinical data in the surgical management of early-stage (cT1-T2 N0) tongue squamous cell carcinomas: A preliminary study. Biol (Basel). (2022) 11:468. doi: 10.3390/biology11030468

CrossRef Full Text | Google Scholar

29. Kubo K, Kawahara D, Murakami Y, Takeuchi Y, Katsuta T, Imano N, et al. Development of a radiomics and machine learning model for predicting occult cervical lymph node metastasis in patients with tongue cancer. Oral Surg Oral Med Oral Pathol Oral Radiol. (2022) 134:93–101. doi: 10.1016/j.oooo.2021.12.122

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Ren J, Yuan Y, Tao X. Histogram analysis of diffusion-weighted imaging and dynamic contrast-enhanced MRI for predicting occult lymph node metastasis in early-stage oral tongue squamous cell carcinoma. Eur Radiol. (2022) 32:2739–47. doi: 10.1007/s00330-021-08310-0

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Wang F, Tan R, Feng K, Hu J, Zhuang Z, Wang C, et al. Magnetic resonance imaging-based radiomics features associated with depth of invasion predicted lymph node metastasis and prognosis in tongue cancer. J Magn Reson Imaging. (2022) 56:196–209. doi: 10.1002/jmri.28019

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Zhong YW, Jiang Y, Dong S, Wu WJ, Wang LX, Zhang J, et al. Tumor radiomics signature for artificial neural network-assisted detection of neck metastasis in patient with tongue cancer. J Neuroradiol. (2022) 49:213–8. doi: 10.1016/j.neurad.2021.07.006

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Chen Z, Yu Y, Liu S, Du W, Hu L, Wang C, et al. A deep learning and radiomics fusion model based on contrast-enhanced computer tomography improves preoperative identification of cervical lymph node metastasis of oral squamous cell carcinoma. Clin Oral Investig. (2023) 28:39. doi: 10.1007/s00784-023-05423-2

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Liu S, Zhang A, Xiong J, Su X, Zhou Y, Li Y, et al. The application of radiomics machine learning models based on multimodal MRI with different sequence combinations in predicting cervical lymph node metastasis in oral tongue squamous cell carcinoma patients. Head Neck. (2023) 46:513–27. doi: 10.1002/hed.27605

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Xu X, Xi L, Wei L, Wu L, Xu Y, Liu B, et al. Deep learning assisted contrast-enhanced CT-based diagnosis of cervical lymph node metastasis of oral cancer: a retrospective study of 1466 cases. Eur Radiol. (2023) 33:4303–12. doi: 10.1007/s00330-022-09355-5

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Vidiri A, Marzi S, Piludu F, Lucchese S, Dolcetti V, Polito E, et al. Magnetic resonance imaging-based prediction models for tumor stage and cervical lymph node metastasis of tongue squamous cell carcinoma. Comput Struct Biotechnol J. (2023) 21:4277–87. doi: 10.1016/j.csbj.2023.08.020

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Sproll KC, Leydag S, Holtmann H, Schorn LK, Aissa J, Kröpil P, et al. Is the prediction of one or two ipsilateral positive lymph nodes by computerized tomography and ultrasound reliable enough to restrict therapeutic neck dissection in oral squamous cell carcinoma (OSCC) patients? J Cancer Res Clin Oncol. (2021) 147:2421–33. doi: 10.1007/s00432-021-03523-8

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Horvath L, Kraft M. Evaluation of ultrasound and fine-needle aspiration in the assessment of head and neck lesions. Eur Arch Otorhinolaryngol. (2019) 276:2903–11. doi: 10.1007/s00405-019-05552-z

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Ren Y, Shi Y, Zhou Z, Yang D, Xie J, Zhang Z, et al. Ultrasonographic and cytological characterization of ultrasound-guided fine-needle aspiration cytology of cervical lymph nodes for false-negative and false-positive diagnosis. Eur Arch Otorhinolaryngol. (2023) 280:2463–70. doi: 10.1007/s00405-022-07802-z

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Tandon S, Shahab R, Benton JI, Ghosh SK, Sheard J, Jones TM. Fine-needle aspiration cytology in a regional head and neck cancer center: comparison with a systematic review and meta-analysis. Head Neck. (2008) 30:1246–52. doi: 10.1002/hed.20849

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Sun J, Li B, Li CJ, Li Y, Su F, Gao QH, et al. Computed tomography versus magnetic resonance imaging for diagnosing cervical lymph node metastasis of head and neck cancer: a systematic review and meta-analysis. Onco Targets Ther. (2015) 8:1291–313. doi: 10.2147/OTT.S73924

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Lipkova J, Chen RJ, Chen B, Lu MY, Barbieri M, Shao D, et al. Artificial intelligence for multimodal data integration in oncology. Cancer Cell. (2022) 40:1095–110. doi: 10.1016/j.ccell.2022.09.012

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Deepak S, Ameer PM. Brain tumor classification using deep CNN features via transfer learning. Comput Biol Med. (2019) 111:103345. doi: 10.1016/j.compbiomed.2019.103345

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Tang Y, Zheng Y, Chen X, Wang W, Guo Q, Shu J, et al. Identifying periampullary regions in MRI images using deep learning. Front Oncol. (2021) 11:674579. doi: 10.3389/fonc.2021.674579

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Bian Y, Zheng Z, Fang X, Jiang H, Zhu M, Yu J, et al. Artificial intelligence to predict lymph node metastasis at CT in pancreatic ductal adenocarcinoma. Radiology. (2023) 306:160–9. doi: 10.1148/radiol.220329

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Chen M, Kong C, Lin G, Chen W, Guo X, Chen Y, et al. Development and validation of convolutional neural network-based model to predict the risk of sentinel or non-sentinel lymph node metastasis in patients with breast cancer: a machine learning study. EClinicalMedicine. (2023) 63:102176. doi: 10.1016/j.eclinm.2023.102176

PubMed Abstract | CrossRef Full Text | Google Scholar

47. Bedrikovetski S, Dudi-Venkata NN, Kroon HM, Seow W, Vather R, Carneiro G, et al. Artificial intelligence for pre-operative lymph node staging in colorectal cancer: a systematic review and meta-analysis. BMC Cancer. (2021) 21:1058. doi: 10.1186/s12885-021-08773-w

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Paul R, Hawkins SH, Balagurunathan Y, Schabath MB, Gillies RJ, Hall LO, et al. Deep feature transfer learning in combination with traditional features predicts survival among patients with lung adenocarcinoma. Tomography. (2016) 2:388–95. doi: 10.18383/j.tom.2016.00211

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Cheng JZ, Ni D, Chou YH, Qin J, Tiu CM, Chang YC, et al. Computer-aided diagnosis with deep learning architecture: applications to breast lesions in US images and pulmonary nodules in CT scans. Sci Rep. (2016) 6:24454. doi: 10.1038/srep24454

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Mao MH, Wang S, Feng ZE, Li JZ, Li H, Qin LZ, et al. Accuracy of magnetic resonance imaging in evaluating the depth of invasion of tongue cancer. A prospective cohort study Oral Oncol. (2019) 91:79–84. doi: 10.1016/j.oraloncology.2019.01.021

CrossRef Full Text | Google Scholar

51. Park JO, Jung SL, Joo YH, Jung CK, Cho KJ, Kim MS. Diagnostic accuracy of magnetic resonance imaging (MRI) in the assessment of tumor invasion depth in oral/oropharyngeal cancer. Oral Oncol. (2011) 47:381–6. doi: 10.1016/j.oraloncology.2011.03.012

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: oral squamous cell carcinoma, artificial intelligence, radiomics, deep learning, lymph node metastasis, meta-analysis

Citation: Deng C, Hu J, Tang P, Xu T, He L, Zeng Z and Sheng J (2024) Application of CT and MRI images based on artificial intelligence to predict lymph node metastases in patients with oral squamous cell carcinoma: a subgroup meta-analysis. Front. Oncol. 14:1395159. doi: 10.3389/fonc.2024.1395159

Received: 03 March 2024; Accepted: 30 May 2024;
Published: 18 June 2024.

Edited by:

Sharon R. Pine, University of Colorado Anschutz Medical Campus, United States

Reviewed by:

Filippo Marchi, San Martino Hospital (IRCCS), Italy
Xue-Feng Leng, University of Electronic Science and Technology of China, China

Copyright © 2024 Deng, Hu, Tang, Xu, He, Zeng and Sheng. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Jianfeng Sheng, 1078388932@qq.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.