Rapid multi-task diagnosis of oral cancer leveraging fiber-optic Raman spectroscopy and deep learning algorithms

Li, Xing; Li, Lianyu; Sun, Qing; Chen, Bo; Zhao, Chenjie; Dong, Yuting; Zhu, Zhihui; Zhao, Ruiqi; Ma, Xinsong; Yu, Mingxin; Zhang, Tao

doi:10.3389/fonc.2023.1272305

ORIGINAL RESEARCH article

Front. Oncol., 10 October 2023

Sec. Head and Neck Cancer

Volume 13 - 2023 | https://doi.org/10.3389/fonc.2023.1272305

Xing Li¹

Lianyu Li²

Qing Sun³

Bo Chen⁴

Chenjie Zhao⁵

Yuting Dong⁵

Zhihui Zhu¹

Ruiqi Zhao¹

Xinsong Ma²

Mingxin Yu²*

Tao Zhang¹*

¹Department of Stomatology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
²Key Laboratory of the Ministry of Education for Optoelectronic Measurement Technology and Instrument, Beijing Information Science and Technology University, Beijing, China
³Department of Plastic Surgery, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
⁴Department of Pathology, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
⁵Plastic Surgery Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China

Introduction: Oral cancer, a predominant malignancy in developing nations, represents a global health challenge with a five-year survival rate below 50%. Nonetheless, substantial reductions in both its incidence and mortality rates can be achieved through early detection and appropriate treatment. Crucial to these treatment plans and prognosis predictions is the identification of the pathological type of oral cancer.

Methods: Toward this end, fiber-optic Raman spectroscopy emerges as an effective tool. This study combines Raman spectroscopy technology with deep learning algorithms to develop a portable intelligent prototype for oral case analysis. We propose, for the first time, a multi-task network (MTN) Raman spectroscopy classification model that utilizes a shared backbone network to simultaneously achieve different clinical staging and histological grading diagnoses.

Results: The developed model demonstrated accuracy rates of 94.88%, 94.57%, and 94.34% for tumor staging, lymph node staging, and histological grading, respectively. Its sensitivity, specificity, and accuracy compare closely with the gold standard: routine histopathological examination.

Discussion: The prototype proposed in this study therefore has great potential for rapid, non-invasive, and label-free pathological diagnosis of oral cancer.

Introduction

According to the Global Cancer Statistics 2020 (GLOBOCAN 2020), both the incidence and mortality of cancer worldwide have been steadily increasing. This increase is indicated by the 377,713 new cases and 177,757 deaths attributed to oral and lip tumors in 2020 (1). The global incidence of oropharyngeal cancer is on the rise (2), and oral tumor prevalence is escalating remarkably in certain developing nations (3). Notably, oral squamous cell carcinoma (OSCC) constitutes over 90% of all oral tumor cases (4). The primary risk factors for oral cancer include smoking, alcohol consumption, betel nut consumption, sun exposure, and HPV infection (5–7). Despite advancements in treatment methods for oral tumors, ranging from surgery and radiation to chemotherapy, immunotherapy, and targeted therapy (8), the five-year survival rate for oral squamous cell carcinoma has remained below 50% for the past three decades (9). However, early-stage tumor patients experience a significant improvement in the five-year survival rate after effective treatment (10). Thus, early diagnosis and treatment are pivotal in enhancing the survival rate of oral tumor patients and minimizing mortality (11). Regrettably, most early-stage oral tumor patients delay treatment because of misdiagnosis: early oral tumors closely resemble benign lesions and are often mistaken for oral ulcers or chronic inflammatory changes (12, 13). Consequently, physicians struggle to accurately distinguish them via visual inspection and palpation (14). Biopsy, though considered the diagnostic gold standard for oral tumors, presents challenges: it is time-consuming, invasive, costly, and demands significant diagnostic skill from pathologists (15). This results in delayed diagnosis and referrals, leading to treatment postponement and reducing survival time for many oral tumor patients (16). Additionally, patients with differing levels of oral cancer pathological differentiation display marked variance in prognosis (17). Research reveals that the five-year survival rate is 89% for patients with well-differentiated oral cancer, compared to 68% and 45% for those with moderately differentiated and poorly differentiated oral cancer, respectively (18). Hence, the development of a new technology for quick, real-time, portable, and non-invasive diagnosis of oral tumors, capable of providing personalized optimal treatment plans and prognostic information, would enhance diagnostic efficiency and increase patient survival rates.

As illustrated in Table 1, Raman spectroscopy technology has become a prevalent tool in the pathological diagnosis research of oral tumors. Micro-Raman spectroscopy, a prominent technique within the realm of Raman spectroscopy, has proven instrumental in distinguishing between benign and malignant formations in oral tumor tissue analyses. This is predominantly achieved through the analysis of hematoxylin and eosin (H&E) tissue sections (19), frozen sections (20), and ex-vivo tissues (21), among other samples. Furthermore, surface-enhanced Raman spectroscopy (SERS) facilitates the diagnosis of oral tumors by analyzing biological specimens such as saliva (22) and serum (23, 24) from patients afflicted with the condition. To date, only a limited number of studies have used Raman spectroscopy for the precise diagnosis of tumor-node-metastasis (TNM) staging and for identifying pathological grades such as well differentiated (Grade I), moderately differentiated (Grade II), and poorly differentiated (Grade III). Numerous research groups have successfully employed Raman spectroscopy in analyzing tissue samples or cell lines from patients suffering from diseases such as breast cancer (29), brain cancer (30), esophageal squamous cell carcinoma (31), and bladder cancer (32), yielding significant findings. These groups have devised various models leveraging specific algorithms to accurately discern tumor stages or pathological classifications. Sharma et al. (21) analyzed the intrinsic molecular changes in tissues at different T-stages of oral cancer patients using micro-Raman spectroscopy and established a diagnostic model for healthy tissues and malignant tumor tissues. However, that study did not develop a multi-task diagnostic model that can simultaneously predict the T-stage, N-stage, and pathological grade of oral tumor patients. Xue et al. (25) utilized SERS to analyze the serum samples of patients with oral squamous cell carcinoma, thereby predicting their tumor stage and pathological status, with an accuracy of only 85%. However, this method also faces many challenges, such as invasiveness, dependence on external reagents, and time-consuming, complex procedures. Fiber-optic Raman spectroscopy technology circumvents these hurdles, promising a notable enhancement in diagnostic precision. Singh et al. (26) previously applied this technique to in-vivo assessments of individuals with oral tumors, successfully facilitating the pathological grading of normal, precancerous, and tumor tissues. Concurrently, Aaboubout et al. (27) analyzed freshly removed tissues, distinguishing between benign and malignant samples with a sensitivity of 85% and a 92% accuracy rate. Our prior research underscored the potential of portable fiber-optic Raman spectroscopy (PFORS), in conjunction with various machine learning algorithms, to differentiate between cancerous and adjacent tissues in patients afflicted with gum and cheek cancers (28). Nevertheless, the integration of this research into clinical applications has been hampered primarily by the limited scale of existing research datasets, the difficulty of ensuring model generalizability, and the protracted duration required for data compilation. Most importantly, before being approved for clinical use, these devices require rigorous clinical trials and must adhere to medical device safety standards.

Table 1 Raman Spectroscopy studies on oral cancers.

This research utilizes a portable fiber-optic Raman spectrometer to investigate ex vivo tissues from oral cancer patients, aiming to determine TNM staging and assess the histologic status. Given the high variability in tumor type, location, and histological grading among oral cancer patients, the tumor specimens exhibit marked heterogeneity, leading to relatively unstable Raman spectroscopic data (33, 34). Consequently, our objective is to provide a comprehensive representation of the spectral characteristics of oral tumors. We accomplish this by expanding the patient sample size, procuring spectral data from diverse anatomical sites, and gathering a substantial dataset of Raman spectroscopic data. In parallel, to unveil hidden features of the Raman spectra and subsequently ascertain the TNM staging and pathologic grading of patients, we have engineered a multi-output deep learning model for spectral data analysis and multi-task network (MTN) classification. Thus, this paper’s primary contributions are as follows: (1) the development of a portable prototype for Raman spectroscopy; (2) the creation of an MTN Raman spectroscopy classification model, capable of concurrently diagnosing tumor stages and pathologic grades from features extracted by a shared backbone network; and (3) the application of the model to perform a comprehensive visualization analysis and molecular feature interpretation of Raman spectroscopy data.

Materials and methods

The portable fiber optic Raman spectrometer prototype

The research employs a portable fiber-optic Raman spectrometer (PFORS) prototype developed by our team for the collection of spectral data from oral cancer tissues, as illustrated in Figure 1. A fiber-coupled 785 nm diode laser serves as the excitation source. The laser is delivered through a handheld fiber Raman probe (HT-PROB-MULTI-785, Emvision, LLC), which is connected to fiber-optic cables for laser excitation and to a standard fiber (NA = 0.22) for signal acquisition, providing a resolution of 6 cm⁻¹. The Raman signal is collected by a fiber spectrometer (QE65 Pro, Ocean Optics, USA) equipped with a charge-coupled device (CCD) detector. Throughout the detection process, the CCD operates at a temperature of -20°C. The spectrometer communicates with the PC through the OceanView interface.

Figure 1 Portable Fiber-optic Raman Spectrometer (PFORS) Prototype. This system includes: ① display; ② manual displacement platform; ③ Edge computing device; ④ diode laser; ⑤ spectrometer; ⑥ fiber-optic Raman spectroscopy probe.

Patient enrollment and sample preparation

A total of 36 patients with oral cancer who met the inclusion criteria received surgical treatment at Peking Union Medical College Hospital between 2022 and 2023. Comprehensive data including age, gender, and diagnosis were meticulously documented prior to surgery. We acquired the test samples from 48 surgically excised tissues of the participating patients. To prevent contamination that could disrupt spectral signals, any blood stains on the surface of the test samples were meticulously rinsed with running water before conducting fiber Raman spectroscopy. Fresh samples were examined through fiber Raman spectroscopy within 30 minutes post-surgery. After completing the detection process, the localized diseased tissues, and adjacent healthy tissues, for which spectra had been detected, were collected. These samples were then fixed in formalin, embedded in paraffin, and stained with hematoxylin and eosin. An experienced clinical pathologist then performed diagnoses on the pathological sections, assessing disease type, stage, and degree of differentiation. In this study, those examining the Raman spectra of the test tissues, as well as the pathologists conducting pathological testing of the tissues, were blinded to the Raman spectra.

Data acquisition

During Raman spectroscopy detection, the fiber-optic Raman probe is held 0.5 mm from the surface of the tissue under test to ensure stable spectral signals. The distance is estimated visually with the aid of a 3D-printed flat ruler 0.5 mm thick. Prior to every measurement, a background spectrum is acquired with the laser off and a one-second integration time, and is subtracted automatically. Subsequently, with the laser on, 90 measurements are conducted on both the tumor and normal tissue surfaces, each with a one-second integration time. The laser power at the probe tip is calibrated to 100 mW cm⁻². The entire detection procedure is completed within 30 minutes of tissue excision.

Multi-task network model

Data pre-processing

Initially, we categorized all the Raman spectroscopy data by type and selected the spectral data within the range of 400–1400 cm⁻¹ for further analysis. To address class imbalance, we averaged groups of spectra within the categories that contained a relatively large number of samples. Specifically, for healthy spectra, we averaged every five spectra into one spectrum; for high-T1-N0 tissue spectra, every three spectra; for high-T2-N0 and high-T2-N1 tissue spectra, every five spectra; and for high-T4-N0 and high-T4-N1 tissue spectra, every six spectra. We also removed Raman spectra that were obviously abnormal. To mitigate the impact of strong fluorescence signals and noise originating from disparate background sources, it was essential to preprocess the spectral data prior to analysis. This preprocessing phase comprised three steps:

(i) A Savitzky-Golay filter is applied to smooth the spectral data, reducing the effect of noise on the spectrum.

(ii) Following signal denoising, a polynomial baseline is fitted by the least squares method and subtracted to eliminate the fluorescence background from the spectrum.

(iii) Lastly, data normalization is executed using minimum-maximum intensity normalization to standardize the intensity of all spectra within the [0,1] range, facilitating comparison among various samples.
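The class-balancing average and the three preprocessing steps above can be sketched as follows. This is a minimal illustration using NumPy and SciPy, not the authors' code: the function names, the Savitzky-Golay window and polynomial order, the baseline polynomial degree, and the choice to drop leftover spectra when averaging are all assumptions, since the text does not report them.

```python
import numpy as np
from scipy.signal import savgol_filter


def average_in_groups(spectra, group_size):
    """Average consecutive groups of spectra (rows) into single spectra.

    Leftover rows that do not fill a complete group are dropped (an assumption).
    """
    spectra = np.asarray(spectra, dtype=float)
    n_groups = len(spectra) // group_size
    trimmed = spectra[: n_groups * group_size]
    return trimmed.reshape(n_groups, group_size, -1).mean(axis=1)


def preprocess_spectrum(shift, intensity, window=11, polyorder=3, baseline_deg=5):
    """Smooth, baseline-correct, and min-max normalize one Raman spectrum.

    window, polyorder, and baseline_deg are illustrative values only.
    """
    shift = np.asarray(shift, dtype=float)
    intensity = np.asarray(intensity, dtype=float)

    # Keep only the 400-1400 cm^-1 region used for analysis.
    mask = (shift >= 400) & (shift <= 1400)
    shift, intensity = shift[mask], intensity[mask]

    # (i) Savitzky-Golay smoothing.
    smoothed = savgol_filter(intensity, window_length=window, polyorder=polyorder)

    # (ii) Least-squares polynomial baseline fit, then subtraction.
    # The axis is rescaled to [-1, 1] so the fit stays well conditioned.
    x = 2 * (shift - shift.min()) / (shift.max() - shift.min()) - 1
    coeffs = np.polyfit(x, smoothed, baseline_deg)
    corrected = smoothed - np.polyval(coeffs, x)

    # (iii) Min-max normalization to the [0, 1] range.
    span = corrected.max() - corrected.min()
    if span == 0:
        return shift, np.zeros_like(corrected)
    return shift, (corrected - corrected.min()) / span
```

A single polynomial fit also absorbs part of the Raman peaks into the baseline; iterative variants exist, but the text does not say which form was used.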

Model architecture

This paper presents the MTN-ResNet50 model, designed to concurrently perform tumor staging, lymph node staging, and histological grading of oral cancer tissues. The architecture of this network, as illustrated in Figure 2, consists of three distinct components: the Backbone, the Neck, and the Head. The Backbone component employs the ResNet50 network structure (35) to extract feature information from the Raman spectra of oral cancer tissues. This backbone comprises 5 convolutional modules, 12 identity blocks, and 4 pooling layers, collectively forming a five-layer network structure. In particular, the initial layer of the network employs 64 convolutional kernels with dimensions of 7×1 and incorporates a 3×1 max pooling operation, resulting in an output feature map of size 519×64. Subsequently, the second through fifth layers are composed of residual modules, generating feature maps with dimensions of 519×256, 260×512, 130×1024, and 65×2048, respectively. The Neck component applies adaptive global average pooling to the extracted features, transforming them into a suitable format for subsequent processing. Lastly, the Head component handles the classification tasks. It uses three classification heads, containing a total of six fully connected layers, to generate classification results for the three tasks simultaneously. For the tumor staging task, we used three fully connected layers to build a seven-class model. Similarly, a five-class model was constructed using two fully connected layers for lymph node staging, and a single fully connected layer was used for histological grading, also yielding a five-class model.
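The feature-map lengths and the three-head layout described above can be checked with a short sketch. It assumes that each of stages three to five halves the length with a stride-2 convolution (kernel 3, padding 1), as in the standard ResNet50. The hidden widths 512 and 128, the ReLU activations, and the random weights are hypothetical; only the layer counts, class counts, and the 2048-channel backbone output come from the text.

```python
import numpy as np


def downsample_length(length, kernel=3, stride=2, padding=1):
    """Output length of a 1-D convolution: floor((L + 2p - k) / s) + 1."""
    return (length + 2 * padding - kernel) // stride + 1


# Stage 2 keeps the length at 519; stages 3 to 5 each halve it.
lengths = [519]
for _ in range(3):
    lengths.append(downsample_length(lengths[-1]))

rng = np.random.default_rng(0)


def make_head(sizes):
    """Stack of fully connected layers with random weights (illustration only)."""
    return [(rng.normal(0.0, 0.01, (n_in, n_out)), np.zeros(n_out))
            for n_in, n_out in zip(sizes[:-1], sizes[1:])]


def run_head(layers, x):
    """Apply the layers with ReLU in between (assumed) and a final softmax."""
    for k, (w, b) in enumerate(layers):
        x = x @ w + b
        if k < len(layers) - 1:
            x = np.maximum(x, 0.0)
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)


heads = {
    "t_stage": make_head([2048, 512, 128, 7]),  # three FC layers, seven classes
    "n_stage": make_head([2048, 512, 5]),       # two FC layers, five classes
    "grade": make_head([2048, 5]),              # one FC layer, five classes
}

# Stand-in for a batch of 4 backbone outputs, stored as (batch, channels, length).
feature_map = rng.normal(size=(4, 2048, lengths[-1]))
pooled = feature_map.mean(axis=-1)  # Neck: global average pooling, shape (4, 2048)
probs = {name: run_head(layers, pooled) for name, layers in heads.items()}
```

Because all three heads read the same pooled vector, a single backbone pass serves every task, which is the property the shared-backbone design relies on.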

Figure 2 Architecture of the MTN-ResNet50 model. The model takes Raman data, comprising 3534 individual spectra, which pass in turn through the Backbone, Neck, and Head modules of the system. The Backbone module is primarily responsible for feature extraction, employing 5 distinct convolution layer modules and 12 Identity Blocks. The Neck module performs global average pooling, whereas the Head module handles the classification tasks with 6 fully connected layers.

Model training

To achieve the final classification, we employed the Softmax function to calculate the output probabilities. The learning rate was set to 0.0001, the batch size to 256, and the number of training epochs to 1000. The stochastic gradient descent algorithm was used for optimization, with a momentum of 0.9 and a weight decay of 0.00002.
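To make these hyperparameters concrete, the sketch below writes out one optimizer update and the softmax cross-entropy for a single sample. The update convention (weight decay added to the gradient before the momentum step) is a common one but is an assumption here, as the text does not state which variant was used. The function names are hypothetical.

```python
import numpy as np

# Hyperparameters reported in the text.
LR, MOMENTUM, WEIGHT_DECAY = 1e-4, 0.9, 2e-5


def sgd_step(w, grad, velocity):
    """One SGD update with momentum and L2 weight decay.

    Assumed convention: g = grad + wd * w; v = mu * v + g; w = w - lr * v.
    """
    g = grad + WEIGHT_DECAY * w
    velocity = MOMENTUM * velocity + g
    return w - LR * velocity, velocity


def softmax_cross_entropy(logits, label):
    """Softmax probabilities and cross-entropy loss for one sample."""
    z = np.asarray(logits, dtype=float)
    z = z - z.max()  # shift for numerical stability
    probs = np.exp(z) / np.exp(z).sum()
    return probs, -np.log(probs[label])
```

For a three-task network, one plausible objective is the sum of the three per-task cross-entropy losses; the text does not specify how the task losses were combined.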

To test the model’s generalizability, we randomly partitioned the dataset into ten subsets. During each training session, seven subsets were used for training, two for validation, and the remaining one was reserved for testing. This process was repeated ten times, each time using a different subset for testing. Finally, the mean of the ten models’ evaluation results served as the model’s performance metric.
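The rotating 7/2/1 partition can be sketched in plain Python. The text does not say which two subsets serve as validation in each round, so taking the two subsets that follow the test subset is an assumption, as are the function name and the fixed seed.

```python
import random


def rotating_splits(n_samples, n_subsets=10, seed=0):
    """Yield (train, val, test) index lists: 7 subsets train, 2 validate, 1 tests.

    Each subset is the test set exactly once. Using the next two subsets
    (cyclically) for validation is an assumed choice.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    subsets = [indices[i::n_subsets] for i in range(n_subsets)]
    for k in range(n_subsets):
        val_ids = {(k + 1) % n_subsets, (k + 2) % n_subsets}
        test = list(subsets[k])
        val = [i for j in sorted(val_ids) for i in subsets[j]]
        train = [i for j in range(n_subsets)
                 if j != k and j not in val_ids for i in subsets[j]]
        yield train, val, test
```

If spectra from the same patient or tissue fall into different subsets, the test estimate can be optimistic; splitting by patient is a stricter alternative that this sketch does not implement.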

To provide an easily interpretable explanation of the MTN-ResNet50 model’s performance in Raman spectroscopy data analysis, we used the Gradient-weighted Class Activation Mapping (Grad-CAM) approach (36). The Grad-CAM analysis enabled us to visualize the Raman spectral regions that the MTN-ResNet50 model was focusing on, thereby facilitating a better understanding of the classification process. Initially, the spectral data was fed into the MTN-ResNet50 model, and the gradients of the target class score with respect to the last convolutional layer’s feature map were calculated. Subsequently, these gradients were global-average-pooled to obtain one weight per channel, and the feature maps from the last convolutional layer were weighted and summed to obtain a heatmap corresponding to the target class.

Lastly, the heatmap and the original spectral image were superimposed on a single graph. This approach provided an intuitive visualization of spectral differences across bands and aided in our comprehension of spectral data characteristics and variations.
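For a one-dimensional spectrum, the Grad-CAM computation reduces to a few array operations. The sketch below assumes the activations and gradients of the last convolutional layer have already been extracted as arrays of shape (channels, length); the function name and the linear interpolation back to the spectrum length are illustrative choices, not details taken from the text.

```python
import numpy as np


def grad_cam_1d(activations, gradients, out_length):
    """Grad-CAM heatmap for a 1-D signal.

    activations, gradients: arrays of shape (channels, length) from the last
    convolutional layer, for one input spectrum and one target class.
    """
    activations = np.asarray(activations, dtype=float)
    gradients = np.asarray(gradients, dtype=float)

    # Channel weights: global average pooling of the gradients.
    weights = gradients.mean(axis=1)

    # Weighted sum of feature maps, then ReLU to keep positive evidence only.
    cam = np.maximum((weights[:, None] * activations).sum(axis=0), 0.0)
    if cam.max() > 0:
        cam = cam / cam.max()

    # Stretch the coarse map to the spectrum length for overlay plotting.
    src = np.linspace(0.0, 1.0, cam.size)
    dst = np.linspace(0.0, 1.0, out_length)
    return np.interp(dst, src, cam)
```

The returned vector can be drawn as a color band behind the spectrum, so that high values mark the Raman shift regions that most support the target class.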

Results

Flowchart

Figure 3 provides a schematic diagram depicting the workflow for tumor staging and histological grading of oral cancer patients, utilizing Raman spectroscopy techniques and deep learning algorithms. This figure was generated using BioRender.com. Initially, oral tumor patients are recruited, and the relevant tissues for examination are harvested during surgery. We then acquire Raman spectroscopy data from both the oral tumor and the tissue adjacent to it. Concurrently, we collect essential demographic information about the enrolled patients and the pathological diagnosis corresponding to the tissues under examination. Subsequently, the spectral data are fed into an MTN model designed for the pathological diagnosis of oral cancer, thereby enabling real-time assessment of the cancer’s stage and histological grade using this device. Ultimately, a potential application of this portable fiber-optic Raman spectrometer lies in identifying intraoperative tumor boundaries, thereby providing surgical guidance.

Figure 3 The flowchart depicts the pathological staging and histological grading diagnosis process for oral cancer tissue, leveraging fiber-optic Raman spectroscopy technology and deep learning algorithms.

Patient information

Table 2 outlines the patient characteristics for this study. Patients with oral cancer are categorized into five groups (Tis, T1, T2, T3, T4) based on tumor size and extent of tumor involvement. The study also groups patients into N0, N1, and N2 based on lymph node metastasis and its features. Patients who fall under the N3 classification or exhibit distant metastasis, for whom further surgical intervention is not recommended, are excluded from this study. Patients’ histological diagnosis is categorized as healthy tissue, benign tumor or dysplasia (BOD), well differentiated (WD), moderately differentiated (MD), or poorly differentiated (PD) tissue, according to the World Health Organization’s histological classification (37).

Table 2 Summaries of the fundamental information for the enrolled oral tumor patients and the corresponding number of collected Raman spectra.

Raman spectroscopy analysis

Results of Raman spectroscopy analysis across various T-staging

The study classifies the 2127 Raman spectra from all oral cancer patients into five groups: Tis, T1, T2, T3, and T4, comprising 270, 300, 432, 180, and 945 spectra, respectively. Given the close correlation between T1 and T2, and between T3 and T4, in clinical practice and disease management, these categories were merged for analysis, resulting in two groups: TI (T1 and T2) and TII (T3 and T4). The comparison of Raman spectra between TI and Tis, as well as between TII and TI, reveals distinct differences, detailed in Table 3. In Figure 4A, an integrated analysis of these differential Raman spectra indicates increased peak values at 484 (glycogen), 525 (proteins), and 1220 cm⁻¹ (Amide III), and decreased spectral peaks at 585 (OH out-of-plane bending) and 858–863 cm⁻¹ (tyrosine, collagen type I). These peak shifts primarily involve components such as sugars, Amide III, and collagen type I (38, 39).

Table 3 Analysis of peak positions and assignment in Raman spectra during TI-Tis and TII-TI progressions.

Figure 4 (A) the Raman Spectra of Oral Cancer Tissues Across Varying T-Stages; (B) the Raman Spectra of Oral Cancer Tissues Across Varying N-Stages; (C) the Raman Spectra of Oral Cancer Tissues Across Varying Histological Grades.

Results of Raman spectroscopy analysis across various N-staging

This study organizes the 2127 Raman spectra from all oral cancer patients into three groups: N0, N1, and N2, comprising 1305, 282, and 540 spectra, respectively. By contrasting these groups, we can delineate the differential Raman spectra between N1 and N0, and between N2 and N1, as listed in Table 4. As shown in Figure 4B, a comprehensive analysis of these differential Raman spectra reveals an increase in peak values at 1174 (phenylalanine), 1195 (nucleic acids), and 1198 cm⁻¹ (tryptophan), and a decrease at 728 (collagen), 717–719 (lipids), and 719 cm⁻¹ (phospholipids). These variations in peak Raman shifts primarily correspond to lipids, tryptophan, phenylalanine, and collagen (38, 39).

Table 4 Analysis of peak positions and assignment in Raman spectra during N1-N0 and N2-N1 progressions.

Results of Raman spectroscopy analysis across various histological grades

In accordance with pathological classifications, this study divided the 3534 spectra obtained from all the tested tissues into five categories: healthy tissue, benign tumor or dysplasia (BOD), well differentiated (WD), moderately differentiated (MD), and poorly differentiated (PD) tissue, comprising 703, 704, 987, 540, and 600 spectra, respectively. The pathological images and average Raman spectra for these categories are depicted in Figures 5A–M. Spectra from tissues with the same disease stage and pathologic grade, albeit from different oral cancer patients, showed remarkable similarity, implying significant homogeneity within identical test tissue types. In contrast, spectral data from patients with varying disease stages and pathological grades displayed notable differences. A comparison of Raman spectral data across different pathologic grades revealed significant variation in Raman peaks within specific regions (Table 5): during the transition from WD to MD, peak intensities at 820, 889, 998, and 1034 cm⁻¹ increased, while those at 501, 1299, and 1332 cm⁻¹ decreased. Similarly, during the transition from MD to PD, peaks at 613, 1090, and 1356 cm⁻¹ increased, whereas those at 534, 1034, 1146, and 1255 cm⁻¹ decreased. In a comprehensive analysis of these differential Raman spectra (Figure 4C), both transitions showed an increase at peaks of 815 (nucleic acid), 820 (structural protein modes of tumors), 970 (proteins and nucleic acids), and 1370 cm⁻¹ (the most pronounced saccharide band); conversely, peaks at 516 (phosphatidylinositol), 1146 (carbohydrates), 1223 (collagen I), and 1318 cm⁻¹ (protein and Amide III) decreased. As indicated by previous studies, these alterations in peak values of the Raman shifts primarily correspond to nucleic acids, structural proteins, and collagen I within tumor cells (38, 39).

Figure 5 Identification of different histopathologic grades of oral tissues, including Health, BOD, WD, MD, and PD tissues. (A–E) Histopathological images; (F–M) Raman spectra of different histopathologic grades of oral tissues.

Table 5 Analysis of peak positions and assignment in Raman spectra during MD-WD and PD-MD progressions.

Results of the multi-task network model

In assessing the performance and reliability of CNN models, accuracy and cross-entropy loss are commonly used metrics. As the learning iterations progress, the accuracy and cross-entropy loss curves of the validation set gradually converge, indicating that the model does not suffer from overfitting. Figures 6A, B present the accuracy curves and cross-entropy loss curves for the three subtasks depicted in this study.

Figure 6 (A, B) Validation Accuracy and Cross-Entropy Loss Curves in Iterative Training of Convolutional Neural Networks for T-staging, N-staging, and Histological Grading; (C) ROC Curve and Corresponding AUC Values of the Test Set; (D) Cumulative Confusion Matrix from Ten-Fold Cross-Validation of the T-Staging Classification Task; (E) Cumulative Confusion Matrix from Ten-Fold Cross-Validation of the N-Staging Classification Task; (F) Cumulative Confusion Matrix from Ten-Fold Cross-Validation for Histological Grading.

For comparison purposes, we opted for VGG16 and Support Vector Machines (SVM) as benchmark models. VGG16 (40), a well-known Convolutional Neural Network (CNN), is recognized for its remarkable feature extraction capability. In order to achieve multi-task learning, we made modifications to the VGG16 network model and created another multi-task network model called MTN-VGG16, aimed at adapting to multiple classification tasks such as tumor staging, lymph node staging, and histological grading. On the other hand, SVM, a conventional machine learning algorithm, is extensively applied in classification tasks. We used a one-versus-all strategy to realize MTN learning with SVM. Specifically, we treated the three classification tasks as independent and processed them by sequentially training and testing three SVM classifiers.

Table 6 lists the performance measures, including accuracy, specificity, and sensitivity, of our MTN-ResNet50 model, the MTN-VGG16 model, and the SVM algorithm on the three classification tasks. A comparative analysis reveals that our MTN-ResNet50 model exhibits superior performance across all tasks. In particular, for the T-stage classification task, our model yielded an accuracy, specificity, and sensitivity of 94.49%, 99.06%, and 94.83%, respectively. For the N-stage classification task, these measures were 94.15%, 98.41%, and 94.31%, respectively. For the pathologic grading classification task, the measures were 94.30%, 98.48%, and 95.25%, respectively.
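For multi-class tasks such as these, accuracy, sensitivity, and specificity can be derived from a confusion matrix by treating each class one-versus-rest. The sketch below macro-averages the per-class values; the text does not state how the per-class figures were aggregated, so the averaging scheme and the function name are assumptions. Rows are taken as true classes and columns as predicted classes.

```python
import numpy as np


def multiclass_metrics(confusion):
    """Accuracy plus macro-averaged sensitivity and specificity.

    confusion[i, j] counts samples of true class i predicted as class j.
    Every class is assumed to occur at least once.
    """
    cm = np.asarray(confusion, dtype=float)
    total = cm.sum()
    tp = np.diag(cm)
    fn = cm.sum(axis=1) - tp  # missed samples of each class
    fp = cm.sum(axis=0) - tp  # samples wrongly assigned to each class
    tn = total - tp - fn - fp

    accuracy = tp.sum() / total
    sensitivity = float(np.mean(tp / (tp + fn)))
    specificity = float(np.mean(tn / (tn + fp)))
    return accuracy, sensitivity, specificity
```

With many classes, the true-negative count dominates each one-versus-rest table, which is one reason specificity values tend to sit well above accuracy and sensitivity.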

Table 6 Performance of MTN-ResNet50, MTN-VGG16, and SVM algorithms in T-staging, N-staging, and histological grading identification.

To quantitatively assess the performance of the MTN-ResNet50 model, we generated the Receiver Operating Characteristic (ROC) curves for the tumor staging, lymph node staging, and pathologic grading classification tasks, and calculated the Area Under the Curve (AUC). The corresponding AUC values were 0.9971, 0.9931, and 0.9969 respectively as shown in Figure 6C. These results corroborate the superior performance of the MTN-ResNet50 model for these tasks. To visually represent the prediction accuracy and error rates of our classification model across different categories, we present the confusion matrices corresponding to the results of the ten-fold cross-validation for the three classification tasks in Figures 6D–F. The visual representation of confusion matrices aids in understanding the prediction behavior of the classification model in each category and provides a clear understanding of the model’s classification performance.
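The AUC of a class can be computed directly as the probability that a randomly chosen positive sample receives a higher score than a randomly chosen negative one. The sketch below uses this rank-based form and averages the one-versus-rest AUCs over classes; whether the reported AUC values were aggregated this way is not stated in the text, so the macro-averaging and the function names are assumptions.

```python
import numpy as np


def binary_auc(scores, is_positive):
    """AUC as P(score of a positive > score of a negative); ties count half."""
    scores = np.asarray(scores, dtype=float)
    is_positive = np.asarray(is_positive, dtype=bool)
    pos, neg = scores[is_positive], scores[~is_positive]
    greater = (pos[:, None] > neg[None, :]).sum()
    ties = (pos[:, None] == neg[None, :]).sum()
    return (greater + 0.5 * ties) / (pos.size * neg.size)


def macro_ovr_auc(probs, labels):
    """Mean one-versus-rest AUC over classes.

    probs: (n_samples, n_classes) predicted probabilities.
    labels: (n_samples,) integer class labels; every class must occur.
    """
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels)
    aucs = [binary_auc(probs[:, c], labels == c) for c in range(probs.shape[1])]
    return float(np.mean(aucs))
```

The pairwise comparison costs memory proportional to the number of positive-negative pairs, which is acceptable for a few thousand spectra.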

Visual analytical approach to Grad-CAM

Utilizing the Grad-CAM tool, we generated visualizations of Raman spectra for three categories: healthy tissue, BOD, and malignant tumor tissue using our MTN-ResNet50 model. To facilitate an intuitive comparison between these categories, Figure 7 presents the average Raman spectra and the Grad-CAM heatmaps. The color intensity in these visualizations indicates the influence that a specific region has on the model’s target category prediction, with redder areas indicating higher influence and lighter areas lower influence. These visualizations make it apparent that the MTN-ResNet50 model emphasizes different Raman shift regions for different tissue categories.

Figure 7 Comparative Raman Spectroscopy Profiles of Healthy, BOD, and Malignant Tissues, Coupled with the Heatmap Visualization of a Grad-CAM Neural Network.

More specifically, the model focuses on the spectral range of 542 cm⁻¹ to 880 cm⁻¹ for healthy tissue, 695 cm⁻¹ to 1020 cm⁻¹ for BOD, and 400 cm⁻¹ to 625 cm⁻¹ and 1170 cm⁻¹ to 1270 cm⁻¹ for malignant tumor tissue. Through these visualizations, we can delineate the variations between different tissue types, thereby enhancing our understanding of Raman spectra classifications. As shown in Table 7, several biomolecules have been reported to correlate strongly with our research, and they reside within the Raman shift regions highlighted by Grad-CAM. These biomolecules, which hold the potential to characterize the biochemical features of various biological tissues, enhance the interpretability of our classifications.

Table 7 Analysis of peak positions and assignment in Raman spectra of healthy, BOD, and malignant tissues.

Discussion

Raman spectroscopy is utilized for the detection and analysis of biochemical components in biological tissues, with primary constituents including proteins, lipids, and nucleic acids (41). Through the measurement of molecular vibrational modes, fiber-optic Raman spectroscopy provides intricate details about the composition and concentrations of these biochemical components (42). The spectral characteristics are principally determined by the biochemical components and histological characteristics of the tested samples (32). This study is the first to extract the biochemical characteristics of oral lesions at different pathological stages and histological grades through fiber-optic Raman spectroscopy combined with deep learning algorithms. A “Spectroscopy-TNM staging-histological grading” model was then established using an MTN learning algorithm to predict the pathological diagnosis of oral tumor patients. Previous research teams have distinguished between benign and malignant human tissues by analyzing Raman spectroscopy data and extracting essential spectral characteristics via various machine learning algorithms (43, 44). The multi-output model utilized in this study can concurrently execute multiple tasks, including TNM staging and histological grading. By extracting shared features, the model can capture associative information among distinct tasks, thereby enhancing its generalizability.

Raman spectroscopy is instrumental in the TNM staging of OSCC patients because its signal correlates with the types and concentrations of biochemical constituents within the tissues (45). Studies show that shifts in the composition and structure of these constituents change the signal intensity at different Raman shifts (46). An analysis of Raman spectra from patients at various T-stages, combined with the key regions indicated by Grad-CAM analysis, shows an increase in glycogen and amide III, as well as a decrease in tyrosine and collagen type I, between Tis and TI, and between TII and TI. These findings align with prior research asserting that cancer cells require an augmented glucose supply for rapid growth and division (47). Moreover, cancer cells can alter their extracellular glycan structure and use these glycans for immune evasion (48). The observed increase in amide III may be attributable to the cancer cells’ stress response to inadequate nutrition and oxygen, as exemplified by the elevated expression of heat shock proteins (49). With the ResNet50 algorithm, the overall accuracy, specificity, and sensitivity across T-stages were 94.88 ± 1.38%, 99.12 ± 0.24%, and 95.23 ± 1.46%, respectively.
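Stage-to-stage comparisons of this kind amount to asking whether the mean intensity inside a band window rises or falls from one group of spectra to another. A minimal sketch is given below; the function name, the window half-width, the band position, and the synthetic spectra are assumptions chosen for illustration and do not reproduce the preprocessing or the data of this study.

```python
import numpy as np

def band_mean_difference(spectra_a, spectra_b, shifts, center, half_width=10.0):
    """Mean intensity of group B minus group A inside a Raman-shift window.

    spectra_a, spectra_b: (n_spectra, n_points) intensity arrays on a common shift axis.
    shifts: (n_points,) Raman shifts in cm^-1.
    A positive result means the band is stronger in group B.
    """
    shifts = np.asarray(shifts, dtype=float)
    window = np.abs(shifts - center) <= half_width
    if not window.any():
        raise ValueError("no spectral points fall inside the requested window")
    mean_a = np.asarray(spectra_a, dtype=float)[:, window].mean()
    mean_b = np.asarray(spectra_b, dtype=float)[:, window].mean()
    return float(mean_b - mean_a)

# Synthetic, illustrative data only: 3 spectra per group on a coarse shift axis.
shifts = np.arange(400, 1801, 10)              # cm^-1
group_a = np.ones((3, shifts.size))            # flat baseline
group_b = np.ones((3, shifts.size))
group_b[:, np.abs(shifts - 490) <= 10] += 0.5  # raise a band near 490 cm^-1 in group B

diff = band_mean_difference(group_a, group_b, shifts, center=490)
print(round(diff, 3))  # 0.5
```

A difference computed this way is only meaningful when both groups have been baseline-corrected and normalized in the same manner, and it says nothing about statistical significance on its own.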

In addition, although N-staging greatly affects the choice of therapeutic approach and patient prognosis, its accurate determination currently requires pathological analysis of surgical specimens (50). Prior studies using Raman spectroscopy have achieved 100% accuracy in differentiating lymph nodes containing metastatic tumors in breast cancer patients (51, 52). However, no study thus far has predicted lymphatic metastasis through direct examination of the tumor. Previous in vitro experiments indicated that Raman spectroscopy can distinguish mouse cancer cell lines with varying metastatic potential and invasiveness (53). In this investigation, Raman spectroscopy revealed a decrease in lipid or fatty acid and phospholipid accumulation and in collagen, alongside an increase in tryptophan and phenylalanine, between stages N1 and N0, as well as between N2 and N1. The literature corroborates that a “low lipid” phenotype in tumor tissues is indicative of enhanced cellular migration in vitro and increased metastatic ability in vivo (54). Research has established that MMP-2 and MMP-9 foster tumor invasion and metastasis through collagen degradation, leading to extracellular matrix disruption and consequent cellular dysfunction (54–56). The increased tryptophan in cancer tissues can be catabolized by indoleamine 2,3-dioxygenase (IDO) into immunosuppressive kynurenine, which facilitates immune evasion (57). One observation, a reduction in the sugar signal at 490 cm^-1 when transitioning from N2 to N0, remains unexplained in biological terms. More generally, the correlation between Raman spectral changes and tumor staging is not constant, primarily because tumor staging does not capture the trend of alterations in all biomolecules. Algorithmic analysis showed that the overall accuracy, specificity, and sensitivity across N-stages were 94.57 ± 1.32%, 98.54 ± 0.35%, and 94.47 ± 1.47%, respectively.

Histopathological identification of OSCC relies on features such as cellular morphology and tissue architecture (58). The pathological type of the tumor plays a crucial role in guiding the choice of treatment regimen and in prognostic assessment (59). Our analysis of Raman spectral variations among well, moderately, and poorly differentiated types, supplemented with Grad-CAM analysis, revealed a trend consistent with the change in pathological state from WD to PD. Specifically, we observed increased structural protein modes, decreased collagen I, and increased nucleic acids at spectral positions of 820, 1223, and 815 cm^-1, respectively. As the malignancy of the tumor escalates, collagen degradation within the tumor increases, a process known to stimulate angiogenesis (60). Additionally, as the tumor progresses, the tumor-associated fibroblasts primarily responsible for collagen I production undergo phenotypic changes, leading to a decrease in collagen I levels. Furthermore, to support rapid cellular proliferation and division, cancer cells display amplified nucleic acid metabolism (61). In our study, Raman spectral data analyzed with the ResNet50 algorithm yielded an overall diagnostic accuracy, specificity, and sensitivity across pathologic grades of 94.34 ± 1.55%, 98.54 ± 0.41%, and 94.96 ± 1.28%, respectively.
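The accuracy, specificity, and sensitivity figures reported for each task can be related to a multi-class confusion matrix as in the sketch below. The averaging scheme is an assumption: each class is treated one-vs-rest and the per-class values are macro-averaged, which may differ from the exact procedure used in this study. The confusion matrix is synthetic and serves only to show the arithmetic.

```python
import numpy as np

def one_vs_rest_metrics(confusion):
    """Overall accuracy plus macro-averaged sensitivity and specificity.

    confusion[i, j] = number of samples of true class i predicted as class j.
    """
    cm = np.asarray(confusion, dtype=float)
    total = cm.sum()
    tp = np.diag(cm)               # correctly predicted samples of each class
    fn = cm.sum(axis=1) - tp       # samples of the class that were missed
    fp = cm.sum(axis=0) - tp       # samples wrongly assigned to the class
    tn = total - tp - fn - fp      # everything else
    return {
        "accuracy": float(tp.sum() / total),
        "sensitivity": float(np.mean(tp / (tp + fn))),  # macro-averaged recall
        "specificity": float(np.mean(tn / (tn + fp))),  # macro-averaged true-negative rate
    }

# Synthetic 3-class confusion matrix (illustrative only).
cm = [[9, 1, 0],
      [1, 8, 1],
      [0, 1, 9]]
metrics = one_vs_rest_metrics(cm)
print(metrics)
```

For this toy matrix the accuracy and sensitivity are both 26/30, while the specificity is 14/15. The same pattern appears in the reported results: in a multi-class setting every class has many true negatives, so one-vs-rest specificity tends to exceed sensitivity and accuracy.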

This research has several limitations. First, the sample sizes are imbalanced; some pathological categories lack sufficient samples. Nonetheless, comparable studies have demonstrated effective results through algorithmic processing (62). Second, the prototype has not been applied in vivo, because the probe could pose challenges for surgical sterility requirements (63). Third, the portable fiber-optic Raman spectrometer has yet to be fully integrated, which impedes true portability; future research by our team will therefore concentrate on component integration to enhance device portability and mobility. Finally, because the enrolled patients were treated only recently, prognostic data could not be collected, so predicting patient prognosis through Raman spectroscopy is not yet possible. Nonetheless, this technology has the potential to help pathologists determine tumor staging and histological grading faster and more accurately, thereby reducing diagnostic variability. Once the system achieves rapid, label-free, non-invasive, and highly accurate pathological diagnosis, it could facilitate intraoperative tumor boundary diagnosis and potentially provide significant guidance for preoperative treatment planning and patient prognosis analysis.

Conclusion

This study demonstrates that fiber-optic Raman spectroscopy can reveal subtle changes in the biochemical composition of oral lesion tissues in real time, offering an advantage over traditional histopathological diagnosis. Combining this technique with machine learning algorithms, we constructed a single MTN model that simultaneously diagnoses the pathologic stage and histological grade of oral cancer by extracting shared features across sub-tasks and assimilating related information. Our findings reveal that Raman spectra vary significantly across pathological stages, reflecting notable changes in the content of glycans, lipids, nucleic acids, and collagen proteins. Raman spectroscopy, as shown in this study, can provide insights into the mechanistic evolution of pathologic grade changes from a biochemical standpoint. Consequently, this technology aids in developing innovative, rapid, non-invasive, and label-free tools for both preoperative and intraoperative pathological diagnosis of oral cancer, which can be applied in outpatient clinics and operating rooms.

Data availability statement

The original contributions presented in the study are included in the article/supplementary material. Further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving humans were approved by the Medical Ethics Committee of Peking Union Medical College Hospital. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.

Author contributions

XL: Data curation, Investigation, Methodology, Project administration, Visualization, Writing – original draft. LL: Data curation, Investigation, Methodology, Visualization, Writing – original draft. QS: Investigation, Methodology, Writing – review & editing. BC: Investigation, Methodology, Writing – review & editing. CZ: Investigation, Methodology, Writing – review & editing. YD: Investigation, Methodology, Writing – review & editing. ZZ: Investigation, Methodology, Writing – review & editing. RZ: Investigation, Methodology, Writing – review & editing. XM: Investigation, Methodology, Writing – review & editing. MY: Conceptualization, Funding acquisition, Resources, Software, Supervision, Validation, Writing – review & editing. TZ: Conceptualization, Funding acquisition, Resources, Supervision, Validation, Writing – review & editing.

Funding

The authors declare that financial support was received for the research, authorship, and/or publication of this article. This research was supported by the Non-profit Central Research Institute Fund of Chinese Academy of Medical Sciences (Grant No. 2022-JKCS-17), the National High Level Hospital Clinical Research Funding (Grant No. 2022-PUMCH-B-036), and the Beijing Natural Science Foundation (Grant No. 4222040).

Acknowledgments

The authors express gratitude to all the patients who provided human specimens for this research.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

PFORS, Portable Fiber-optic Raman Spectrometer; OSCC, Oral Squamous Cell Carcinoma; MTN, Multi-Task Network; TNM, Tumor-Node-Metastasis; CCD, Charge-Coupled Device; BOD, Benign Tumor or Dysplasia; WD, Well Differentiated; MD, Moderately Differentiated; PD, Poorly Differentiated; SVM, Support Vector Machines; CNN, Convolutional Neural Network; Grad-CAM, Gradient-weighted Class Activation Mapping.

References

1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al. Global cancer statistics 2020: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J Clin (2021) 71(3):209–49. doi: 10.3322/caac.21660

2. Zumsteg ZS, Luu M, Rosenberg PS, Elrod JK, Bray F, Vaccarella S, et al. Global epidemiologic patterns of oropharyngeal cancer incidence trends. J Natl Cancer Inst (2023) djad169. doi: 10.1093/jnci/djad169

3. Joshi P, Dutta S, Chaturvedi P, Nair S. Head and neck cancers in developing countries. Rambam Maimonides Med J (2014) 5(2):e0009. doi: 10.5041/RMMJ.10143

4. D'Souza S, Addepalli V. Preventive measures in oral cancer: an overview. BioMed Pharmacother (2018) 107:72–80. doi: 10.1016/j.biopha.2018.07.114

5. Bugshan A, Farooq I. Oral squamous cell carcinoma: metastasis, potentially associated malignant disorders, etiology and recent advancements in diagnosis. F1000Res (2020) 9:229. doi: 10.12688/f1000research.22941.1

6. Muttagi SS, Chaturvedi P, Gaikwad R, Singh B, Pawar P. Head and neck squamous cell carcinoma in chronic areca nut chewing Indian women: case series and review of literature. Indian J Med Paediatr Oncol (2012) 33(1):32–5. doi: 10.4103/0971-5851.96966

7. Nagpal JK, Patnaik S, Das BR. Prevalence of high-risk human papilloma virus types and its association with P53 codon 72 polymorphism in tobacco addicted oral squamous cell carcinoma (OSCC) patients of eastern India. Int J Cancer (2002) 97(5):649–53. doi: 10.1002/ijc.10112

8. Chai AWY, Lim KP, Cheong SC. Translational genomics and recent advances in oral squamous cell carcinoma. Semin Cancer Biol (2020) 61:71–83. doi: 10.1016/j.semcancer.2019.09.011

9. Sasahira T, Kirita T. Hallmarks of cancer-related newly prognostic factors of oral squamous cell carcinoma. Int J Mol Sci (2018) 19(8):2413. doi: 10.3390/ijms19082413

10. Sciubba JJ. Oral cancer. The importance of early diagnosis and treatment. Am J Clin Dermatol (2001) 2(4):239–51. doi: 10.2165/00128071-200102040-00005

11. Abati S, Bramati C, Bondi S, Lissoni A, Trimarchi M. Oral cancer and precancer: A narrative review on the relevance of early diagnosis. Int J Environ Res Public Health (2020) 17(24):9160. doi: 10.3390/ijerph17249160

12. Gonzalez-Ruiz I, Ramos-Garcia P, Ruiz-Avila I, Gonzalez-Moles MA. Early diagnosis of oral cancer: A complex polyhedral problem with a difficult solution. Cancers (Basel) (2023) 15(13):3270. doi: 10.3390/cancers15133270

13. Valente VB, Takamiya AS, Ferreira LL, Felipini RC, Biasoli ER, Miyahara GI, et al. Oral squamous cell carcinoma misdiagnosed as a denture-related traumatic ulcer: A clinical report. J Prosthet Dent (2016) 115(3):259–62. doi: 10.1016/j.prosdent.2015.08.024

14. Bagan J, Sarrion G, Jimenez Y. Oral cancer: clinical features. Oral Oncol (2010) 46(6):414–7. doi: 10.1016/j.oraloncology.2010.03.009

15. Baby NT, Abdullah A, Kannan S. The scope of liquid biopsy in the clinical management of oral cancer. Int J Oral Maxillofac Surg (2022) 51(5):591–601. doi: 10.1016/j.ijom.2021.08.017

16. Stefanuto P, Doucet JC, Robertson C. Delays in treatment of oral cancer: A review of the current literature. Oral Surg Oral Med Oral Pathol Oral Radiol (2014) 117(4):424–9. doi: 10.1016/j.oooo.2013.12.407

17. Woolgar JA, Triantafyllou A. Pitfalls and procedures in the histopathological diagnosis of oral and oropharyngeal squamous cell carcinoma and a review of the role of pathology in prognosis. Oral Oncol (2009) 45(4-5):361–85. doi: 10.1016/j.oraloncology.2008.07.016

18. Rogers SN, Brown JS, Woolgar JA, Lowe D, Magennis P, Shaw RJ, et al. Survival following primary surgery for oral cancer. Oral Oncol (2009) 45(3):201–11. doi: 10.1016/j.oraloncology.2008.05.008

19. Meksiarun P, Ishigaki M, Huck-Pezzei VA, Huck CW, Wongravee K, Sato H, et al. Comparison of multivariate analysis methods for extracting the paraffin component from the paraffin-embedded cancer tissue spectra for Raman imaging. Sci Rep (2017) 7:44890. doi: 10.1038/srep44890

20. Cals FL, Bakker Schut TC, Hardillo JA, Baatenburg de Jong RJ, Koljenovic S, Puppels GJ. Investigation of the potential of Raman spectroscopy for oral cancer detection in surgical margins. Lab Invest (2015) 95(10):1186–96. doi: 10.1038/labinvest.2015.85

21. Sharma M, Li YC, Manjunatha SN, Tsai CL, Lin RM, Huang SF, et al. Identification of healthy tissue from malignant tissue in surgical margin using Raman spectroscopy in oral cancer surgeries. Biomedicines (2023) 11(7):1984. doi: 10.3390/biomedicines11071984

22. Borsa RM, Toma V, Onaciu A, Moldovan CS, Marginean R, Cenariu D, et al. Developing new diagnostic tools based on SERS analysis of filtered salivary samples for oral cancer detection. Int J Mol Sci (2023) 24(15):12125. doi: 10.3390/ijms241512125

23. Amber A, Nawaz H, Bhatti HN, Mushtaq Z. Surface-enhanced Raman spectroscopy for the characterization of different anatomical subtypes of oral cavity cancer. Photodiagnosis Photodyn Ther (2023) 42:103607. doi: 10.1016/j.pdpdt.2023.103607

24. Moisoiu V, Stefancu A, Gulei D, Boitor R, Magdo L, Raduly L, et al. SERS-based differential diagnosis between multiple solid malignancies: breast, colorectal, lung, ovarian and oral cancer. Int J Nanomedicine (2019) 14:6165–78. doi: 10.2147/IJN.S198684

25. Xue L, Yan B, Li Y, Tan Y, Luo X, Wang M. Surface-enhanced Raman spectroscopy of blood serum based on gold nanoparticles for tumor stages detection and histologic grades classification of oral squamous cell carcinoma. Int J Nanomedicine (2018) 13:4977–86. doi: 10.2147/IJN.S167996

26. Singh SP, Deshmukh A, Chaturvedi P, Murali Krishna C. In vivo Raman spectroscopic identification of premalignant lesions in oral buccal mucosa. J BioMed Opt (2012) 17(10):105002. doi: 10.1117/1.JBO.17.10.105002

27. Aaboubout Y, Nunes Soares MR, Bakker Schut TC, Barroso EM, van der Wolf M, Sokolova E, et al. Intraoperative assessment of resection margins by Raman spectroscopy to guide oral cancer surgery. Analyst (2023) 148(17):4116–26. doi: 10.1039/d3an00650f

28. Chang XH, Yu MX, Liu RY, Jing RX, Ding JY, Xia JB, et al. Deep learning methods for oral cancer detection using Raman spectroscopy. Vibrational Spectrosc (2023) 126:103522. doi: 10.1016/j.vibspec.2023.103522

29. Zhang B, Zhang Z, Gao B, Zhang F, Tian L, Zeng H, et al. Raman microspectroscopy based TNM staging and grading of breast cancer. Spectrochim Acta A Mol Biomol Spectrosc (2023) 285:121937. doi: 10.1016/j.saa.2022.121937

30. Aguiar RP, Falcao ET, Pasqualucci CA, Silveira L Jr. Use of Raman spectroscopy to evaluate the biochemical composition of normal and tumoral human brain tissues for diagnosis. Lasers Med Sci (2022) 37(1):121–33. doi: 10.1007/s10103-020-03173-1

31. Huang W, Shang Q, Xiao X, Zhang H, Gu Y, Yang L, et al. Raman spectroscopy and machine learning for the classification of esophageal squamous carcinoma. Spectrochim Acta A Mol Biomol Spectrosc (2022) 281:121654. doi: 10.1016/j.saa.2022.121654

32. Morselli S, Baria E, Cicchi R, Liaci A, Sebastianelli A, Nesi G, et al. The feasibility of multimodal fiber optic spectroscopy analysis in bladder cancer detection, grading, and staging. Urologia (2021) 88(4):306–14. doi: 10.1177/03915603211007018

33. Guze K, Pawluk HC, Short M, Zeng H, Lorch J, Norris C, et al. Pilot study: Raman spectroscopy in differentiating premalignant and malignant oral lesions from normal mucosa and benign lesions in humans. Head Neck (2015) 37(4):511–7. doi: 10.1002/hed.23629

34. Jeng MJ, Sharma M, Sharma L, Huang SF, Chang LB, Wu SL, et al. Novel quantitative analysis using optical imaging (Velscope) and spectroscopy (Raman) techniques for oral cancer detection. Cancers (Basel) (2020) 12(11):3364. doi: 10.3390/cancers12113364

35. He K, Zhang X, Ren S, Sun J. Deep residual learning for image recognition. 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). (2016) 770–8. doi: 10.1109/CVPR.2016.90

36. Selvaraju RR, Cogswell M, Das A, Vedantam R, Parikh D, Batra D. Grad-CAM: visual explanations from deep networks via gradient-based localization. 2017 IEEE International Conference on Computer Vision (ICCV). (2017) 618–26. doi: 10.1109/ICCV.2017.74

37. El-Naggar AK, Chan J, Grandis J, Takata T, Slootweg P. WHO classification of head and neck tumours: international agency. 4th Ed. (2017) 105–11.

38. Movasaghi Z, Rehman S, Rehman I. Raman spectroscopy of biological tissues. Appl Spectrosc Rev. (2007) 42(5). doi: 10.1080/05704920701551530

39. De Gelder J, De Gussem K, Vandenabeele P, Moens L. Reference database of Raman spectra of biological molecules. J Raman Spectrosc (2007) 38(9):1133–47. doi: 10.1002/jrs.1734

40. Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition. Computational and Biological Learning Society (2015) 1–14.

41. Qiu S, Xu Y, Huang L, Zheng W, Huang C, Huang S, et al. Non-invasive detection of nasopharyngeal carcinoma using saliva surface-enhanced Raman spectroscopy. Oncol Lett (2016) 11(1):884–90. doi: 10.3892/ol.2015.3969

42. Sharma A, Sharma S, Zarrow A, Schwartz RA, Lambert WC. Raman spectroscopy: incorporating the chemical dimension into dermatological diagnosis. Indian J Dermatol (2016) 61(1):1–8. doi: 10.4103/0019-5154.173978

43. He C, Zhu S, Wu X, Zhou J, Chen Y, Qian X, et al. Accurate tumor subtype detection with Raman spectroscopy via variational autoencoder and machine learning. ACS Omega (2022) 7(12):10458–68. doi: 10.1021/acsomega.1c07263

44. Blake N, Gaifulina R, Griffin LD, Bell IM, Thomas GMH. Machine learning of Raman spectroscopy data for classifying cancers: A review of the recent literature. Diagnostics (Basel) (2022) 12(6):1491. doi: 10.3390/diagnostics12061491

45. Parveen S, Taneja N, Bathi RJ, Deka AC. Evaluation of circulating immune complexes and serum immunoglobulins in oral cancer patients–a follow up study. Indian J Dent Res (2010) 21(1):10–5. doi: 10.4103/0970-9290.62800

46. Hanna K, Krzoska E, Shaaban AM, Muirhead D, Abu-Eid R, Speirs V. Raman spectroscopy: current applications in breast cancer diagnosis, challenges and future prospects. Br J Cancer (2022) 126(8):1125–39. doi: 10.1038/s41416-021-01659-5

47. Liu Q, Li J, Zhang W, Xiao C, Zhang S, Nian C, et al. Glycogen accumulation and phase separation drives liver tumor initiation. Cell (2021) 184(22):5559–76 e19. doi: 10.1016/j.cell.2021.10.001

48. Rabinovich GA, van Kooyk Y, Cobb BA. Glycobiology of immune responses. Ann N Y Acad Sci (2012) 1253:1–15. doi: 10.1111/j.1749-6632.2012.06492.x

49. Ciocca DR, Calderwood SK. Heat shock proteins in cancer: diagnostic, prognostic, predictive, and treatment implications. Cell Stress Chaperones (2005) 10(2):86–103. doi: 10.1379/csc-99r.1

50. Stenner M, Molls C, Luers JC, Beutner D, Klussmann JP, Huettenbrink KB. Occurrence of lymph node metastasis in early-stage parotid gland cancer. Eur Arch Otorhinolaryngol (2012) 269(2):643–8. doi: 10.1007/s00405-011-1663-2

51. Sattlecker M, Bessant C, Smith J, Stone N. Investigation of support vector machines and Raman spectroscopy for lymph node diagnostics. Analyst (2010) 135(5):895–901. doi: 10.1039/b920229c

52. Horsnell J, Stonelake P, Christie-Brown J, Shetty G, Hutchings J, Kendall C, et al. Raman spectroscopy–a new method for the intra-operative assessment of axillary lymph nodes. Analyst (2010) 135(12):3042–7. doi: 10.1039/c0an00527d

53. Hedegaard M, Krafft C, Ditzel HJ, Johansen LE, Hassing S, Popp J. Discriminating isogenic cancer cells and identifying altered unsaturated fatty acid content as associated with metastasis status, using K-means clustering and partial least squares-discriminant analysis of Raman maps. Anal Chem (2010) 82(7):2797–802. doi: 10.1021/ac902717d

54. Wright HJ, Hou J, Xu B, Cortez M, Potma EO, Tromberg BJ, et al. CDCP1 drives triple-negative breast cancer metastasis through reduction of lipid-droplet abundance and stimulation of fatty acid oxidation. Proc Natl Acad Sci U.S.A. (2017) 114(32):E6556–E65. doi: 10.1073/pnas.1703791114

55. Yoon SO, Park SJ, Yun CH, Chung AS. Roles of matrix metalloproteinases in tumor metastasis and angiogenesis. J Biochem Mol Biol (2003) 36(1):128–37. doi: 10.5483/bmbrep.2003.36.1.128

56. Westermarck J, Kähäri VM. Regulation of matrix metalloproteinase expression in tumor invasion. FASEB J (1999) 13(8):781–92. doi: 10.1096/fasebj.13.8.781

57. Platten M, Wick W, Van den Eynde BJ. Tryptophan catabolism in cancer: beyond IDO and tryptophan depletion. Cancer Res (2012) 72(21):5435–40. doi: 10.1158/0008-5472.CAN-12-0569

58. Bavle RM, Venugopal R, Konda P, Muniswamappa S, Makarla S. Molecular classification of oral squamous cell carcinoma. J Clin Diagn Res (2016) 10(9):ZE18–21. doi: 10.7860/JCDR/2016/19967.8565

59. Boeve K, Melchers LJ, Schuuring E, Roodenburg JL, Halmos GB, van Dijk BA, et al. Addition of tumour infiltration depth and extranodal extension improves the prognostic value of the pathological TNM classification for early-stage oral squamous cell carcinoma. Histopathology (2019) 75(3):329–37. doi: 10.1111/his.13886

60. Tang M, Dai W, Wu H, Xu X, Jiang B, Wei Y, et al. Transcriptome analysis of tongue cancer based on high-throughput sequencing. Oncol Rep (2020) 43(6):2004–16. doi: 10.3892/or.2020.7560

61. Sun L, Suo C, Li ST, Zhang H, Gao P. Metabolic reprogramming for cancer cells and their microenvironment: beyond the Warburg effect. Biochim Biophys Acta Rev Cancer (2018) 1870(1):51–66. doi: 10.1016/j.bbcan.2018.06.005

62. Zhang R, Xie H, Cai S, Hu Y, Gk L, Hong W, et al. Transfer-learning-based Raman spectra identification. J Raman Spectrosc (2020) 51(1):176–86. doi: 10.1002/jrs.5750

63. Daoust F, Tavera H, Dallaire F, Orsini P, Savard K, Bismuth J, et al. A clinical Raman spectroscopy imaging system and safety requirements for in situ intraoperative tissue characterization. Analyst (2023) 148(9):1991–2001. doi: 10.1039/d2an01946a

Keywords: Raman spectroscopy, oral cancer, TNM classification, histological diagnosis, machine learning algorithm

Citation: Li X, Li L, Sun Q, Chen B, Zhao C, Dong Y, Zhu Z, Zhao R, Ma X, Yu M and Zhang T (2023) Rapid multi-task diagnosis of oral cancer leveraging fiber-optic Raman spectroscopy and deep learning algorithms. Front. Oncol. 13:1272305. doi: 10.3389/fonc.2023.1272305

Received: 03 August 2023; Accepted: 18 September 2023;
Published: 10 October 2023.

Edited by:

Toshinori Iwai, Yokohama City University Hospital, Japan

Reviewed by:

Aditi Sahu, Memorial Sloan Kettering Cancer Center, United States
Wei Han, Nanjing University, China

Copyright © 2023 Li, Li, Sun, Chen, Zhao, Dong, Zhu, Zhao, Ma, Yu and Zhang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Tao Zhang, ZHJ0emhhbmdAMTYzLmNvbQ==; Mingxin Yu, eXVtaW5neGluQGJpc3R1LmVkdS5jbg==
