- 1Department of College of Traditional Chinese Medicine, Shanghai University of Traditional Chinese Medicine, Shanghai, China
- 2School of Computer Science, Fudan University, Shanghai, China
Background: This study aimed to characterize the oral and gut microbiota in prediabetes mellitus (Pre-DM) and type 2 diabetes mellitus (T2DM) patients while exploring the association between tongue manifestations and the oral-gut microbiota axis in diabetes progression.
Methods: Participants included 30 Pre-DM patients, 37 individuals with T2DM, and 28 healthy controls. Tongue images and oral/fecal samples were analyzed using image processing and 16S rRNA sequencing. Machine learning techniques, including support vector machine (SVM), random forest, gradient boosting, adaptive boosting, and K-nearest neighbors, were applied to integrate tongue image data with microbiota profiles to construct predictive models for Pre-DM and T2DM classification.
Results: Significant shifts in tongue characteristics were identified during the progression from Pre-DM to T2DM. Elevated Firmicutes levels along the oral-gut axis were associated with white greasy fur, indicative of underlying metabolic changes. An SVM-based predictive model demonstrated an accuracy of 78.9%, with an AUC of 86.9%. Notably, tongue image parameters (TB-a, perALL) and specific microbiota (Escherichia, Porphyromonas-A) emerged as prominent diagnostic markers for Pre-DM and T2DM.
Conclusion: The integration of tongue diagnosis with microbiome analysis reveals distinct tongue features and microbial markers. This approach significantly improves the diagnostic capability for Pre-DM and T2DM.
1 Introduction
Diabetes mellitus (DM) is a multifactorial endocrine and metabolic disorder triggered by a combination of genetic predisposition and environmental influences, which disrupt insulin secretion and impair insulin sensitivity, ultimately resulting in multi-organ dysfunction and potential organ failure (Standl et al., 2019). In China, the prevalence of type 2 diabetes (T2DM) among adults is approximately 10.9%, with pre-diabetes (Pre-DM) affecting around 35.7% of the population (Wang et al., 2017). T2DM, a polygenic condition, arises from a combination of hereditary and environmental factors, with insulin resistance (IR) and beta-cell dysfunction (reduced insulin production) as its hallmark features (Paquette et al., 2023). Pre-DM, defined by impaired fasting glucose (FBG) and/or impaired glucose tolerance, presents a significant risk, as up to 21% of individuals progress to T2DM within three years (Eades et al., 2014). Thus, early intervention during the pre-diabetic phase is one of the most effective strategies for mitigating the onset of T2DM.However, the indicators and underlying mechanisms predisposing to the conversion of Pre-DM individuals to T2DM remain unclear (Takeuchi et al., 2023).
Tongue diagnosis plays a fundamental role in traditional Chinese medicine (TCM) diagnosis, characterized by features such as tongue shape, color, texture, back, coating color, and thickness. TCM relies on these visual characteristics to infer disease progression and type. However, traditional visual observation in TCM lacks the ability to objectively quantify these features (Thirunavukkarasu et al., 2024). Utilizing computer image processing enables the segmentation of various tongue regions, the automatic extraction of spectral parameters, and the identification of distinct features, including region division, color, texture, and shape. These metrics offer a more precise description of the tongue’s properties. In addition, deep learning approaches provide a more comprehensive analysis of tongue images, allowing for an enhanced understanding of the underlying pathology. In previous research, we developed a robust classification system specifically for diabetic tongues (Li et al., 2022), leveraging a deep learning model to evaluate a substantial number of tongue images and establish optical characteristics associated with diabetes (Jiang et al., 2021). Building on these results, a method was devised to correlate tongue image data with diabetes, supporting the potential of tongue features as early biomarkers for diagnosing prediabetes and diabetes (Li et al., 2021a). The tongue, being the initial segment of the digestive tract, is also intimately linked to the oral microbiome, which plays a crucial role in maintaining oral ecological balance and is associated with the onset and progression of systemic diseases (Gao et al., 2018). Recent studies suggest a significant interplay between oral and intestinal microbiota, with disruptions in intestinal microbiota implicated as a risk factor for chronic conditions such as T2DM, gastrointestinal cancers, and neurological disorders (Tuganbaev et al., 2022; Pan et al., 2024).
In light of the significant involvement of tongue-coated microbiota (oral microbiota) in TCM tongue diagnosis, this study aimed to explore the relationship between alterations in TCM tongue patterns and the oral-gut microbiota axis during diabetes progression. The objective was to offer early risk indicators for T2DM, enabling timely interventions, while simultaneously advancing the scientific comprehension of tongue diagnosis within the framework of TCM.
2 Methods
2.1 Study subject recruitment
The study encompassed individuals undergoing physical examinations at the Shanghai Gaohang Community Service Center between 2022 and 2023. Following the application of inclusion and exclusion criteria, 28 healthy controls, 30 individuals with Pre-DM, and 37 T2DM patients were selected. Informed consent, approved by the ethics committee of Shuguang Hospital, affiliated with Shanghai University of TCM, was obtained from all participants.
Participants were considered eligible for the Pre-DM group based on one or more of the following criteria: (1) FBG ranging from 5.6 to 6.9 mmol/L; (2) 2-hour postprandial blood glucose (2hPG) between 7.8 and 11.0 mmol/L; (3) Glycosylated hemoglobin (HbA1C) levels between 5.7% and 6.4%. For T2DM, inclusion required meeting one or more of the following: (1) random blood glucose ≥11.1 mmol/L; (2) FBG ≥7.0 mmol/L; (3) 2hPG ≥11.1 mmol/L; (4) HbA1C ≥6.5%.
The exclusion criteria included: (1) patients diagnosed with type 1 or specific types of DM, or those experiencing acute complications such as ketoacidosis; (2) recent use of antibiotics, probiotics, traditional Chinese medicines, or immunosuppressive drugs within the past two weeks; (3) coexisting oral conditions or complications, including periodontitis, pulpitis, or oral cancer; (4) individuals with severe systemic diseases, such as malignant tumors, immune disorders, or hematological diseases; (5) pregnant or breastfeeding women; (6) individuals unable to provide fully informed consent due to mental health symptoms, behavioral disorders, or cognitive impairments; (7) participants with distinct dietary habits; (8) participants with irregular bowel movements; (9) smokers.
2.2 Data and sample collection
The clinical characteristics of the enrolled patients were examined in this cross-sectional study, with data collected on demographic factors such as age, gender, body mass index (BMI), and waist-to-hip ratio (WHR). Personal lifestyle factors included smoking and drinking history, along with dietary preferences categorized as spicy, sweet, or no specific preference. The laboratory-developed TCM Clinical Diagnosis Record Form documented key symptoms, including dry mouth, bitter taste, constipation, and diarrhea. Blood pressure was assessed according to World Health Organization (WHO) criteria, classifying hypertension into grade 1 (140–159 mmHg systolic/90–99 mmHg diastolic), grade 2 (160–179 mmHg systolic/100–109 mmHg diastolic), and grade 3 (systolic ≥ 180 mmHg/diastolic ≥ 110 mmHg).
The tongue image acquisition process utilized the Tongue Diagnostic Instrument (TFDA-1), a device engineered by the Intelligent Diagnostic Laboratory at Shanghai University of Traditional Chinese Medicine, as depicted in Figure 1. Detailed data collection procedures can be found in our earlier publication (Soper, 2021). Key technical specifications of the device included a manual operating mode, 1/125 shutter speed, F6.3 aperture setting, ISO sensitivity of 200, correlated color temperature ranging from 4500K to 7000K, and illumination at 4800 ± 10% (unit: lx).
Figure 1. Display of TFDA-1 Tongue Diagnostic Instrument. (A) was the front picture, (B) was the side picture, and (C) was the shooting interface picture.
Oral microbiota samples were obtained from the central region of the tongue dorsum using aseptic pharyngeal swabs, with at least ten rotational movements. Stool samples were self-collected by participants in the morning using a sterile fecal sampler, targeting the central portion of the stool. Following cryopreservation, the samples were immediately dispatched to researchers on the same day. Each specimen was sealed in sterile, enzyme-free Eppendorf tubes, kept on ice, and transferred to a −80°C freezer within 30 minutes for preservation prior to sequencing. Participants were instructed to fast before sample collection.
Tongue image features were extracted using the established methods previously developed by our research group. An intelligent quality assessment model was employed to screen all collected tongue images, ensuring they met the required quality standards (Jiang et al., 2021). Feature extraction was performed using the adversarial generation network, Tongue-GAN, as described in our earlier publications (Li et al., 2021b; Jiang et al., 2022).
2.3 DNA extraction and 16S full-length library construction
Bacterial DNA from the tongue dorsum and fecal samples was extracted using a swab genomic DNA extraction kit (CW2654, CwBiotech, Beijing, China) and an intestinal DNA extraction kit (TIANamp Stool DNA Kit, DP328, Tiangen Biotech, Beijing, China), respectively. The 16S rDNA full-length assembly sequencing technology (16S-FAST) enabled species-level classification through analysis of bacterial ribosomal 16S RNA sequences, encompassing nine variable and ten conserved regions (Karst et al., 2018). For both qualitative and quantitative assessment, as well as quality control, 10 ng of DNA was utilized. Splice and link libraries were constructed, followed by data assembly from electrophoresis and Qubit concentration measurements to ensure quality before proceeding with sequencing on the Illumina NovaSeq 6000 platform (Illumina, USA). The methodology has been thoroughly detailed in a previous study by members of the research team (Guo et al., 2023).
2.4 Bioinformatic analysis
A cloud platform (https://www.genescloud.cn/home) was employed for sequence analyses, utilizing QIIME2 (2019.4), R language (v3.2.0), the ggplot2 package, and Python. ASV-level alpha diversity, including Shannon diversity indices, was calculated via the ASV table in QIIME2 and visualized through box plots. To assess the significance of differences, the Kruskal-Wallis rank sum test followed by Dunn’s post-hoc test was applied. Ranked abundance curves at the ASV level were generated to evaluate richness and evenness across samples. Beta diversity analysis, leveraging UniFrac distance metrics, explored microbial community structural variation, visualized through principal coordinate analysis (PCoA) and hierarchical clustering. Microbial structure differentiation among groups was quantified using permutational multivariate analysis of variance (PERMANOVA) in QIIME2. Linear discriminant analysis effect size (LEfSe) analysis identified differentially abundant taxa among groups under default settings. Random forest analysis in QIIME2, with default parameters, was employed to classify samples from distinct groups, utilizing nested stratified k-fold cross-validation for automated hyperparameter optimization and sample prediction. Co-occurrence network analysis was conducted via SparCC, with a pseudo-count set to 106. Correlation coefficient cutoffs were established at 0.70 using random matrix theory-based methods in the R package RMThreshold, with Cytoscape (v3.9.0) constructing the network visualization. The R language facilitated analysis of the network’s topological structure, with key species identified through topological indices and visualized using the ZiPi plot. Phylogenetic Investigation of Communities by Reconstruction of Unobserved States (PICRUSt2) predicted microbial functions based on MetaCyc (https://metacyc.org/).
2.5 Machine learning methods
Logistic regression with backward selection was applied, incorporating L2 regularization, a tolerance of 1e-4, an inverse regularization strength (C) of 1.0, and the lbfgs solver, with a maximum of 100 iterations. Tongue image features, clinical indicators, and microbial data were screened for Pre-DM and T2DM classification, allowing the removal of insignificant variables while addressing multicollinearity (Chicco et al., 2021). Model fit was evaluated via maximum likelihood and the Hosmer-Lemeshow test. A combined tongue-microbiota classification model for Pre-DM and T2DM was subsequently developed and validated. To ensure robustness, 5-fold cross-validation was employed, partitioning data into five subsets and averaging performance metrics such as ROC AUC, accuracy, sensitivity, and specificity (Lin et al., 2020; Soper, 2021). Python 3.10.9 facilitated machine learning techniques aimed at capturing non-linear relationships, utilizing models like support vector machines (SVM), random forests (RF), gradient boosting, adaptive boosting (AdaBoost), and K-nearest neighbors (KNN). Classification results were calculated using the sklearn library (Version 1.3.1).
2.6 Statistical analysis
Data analysis utilized SPSS v. 25.0 (IBM Corp., Armonk, NY, USA). Variable distribution normality and variance homogeneity were assessed using the Shapiro–Wilk and Levene tests, respectively. For data meeting normal distribution and variance homogeneity criteria, a t-test was applied; otherwise, non-parametric methods were employed. Categorical variables were analyzed using Fisher’s exact test, while continuous variables were evaluated via the Wilcoxon rank-sum test. Relationships among independent variables were examined through Spearman’s rank correlation, with p-values adjusted for multiple comparisons using the Bonferroni correction.
3 Results
3.1 Baseline clinical characteristics of the study cohort
Following the screening process, 28 healthy controls, 30 Pre-DM patients, and 37 T2DM patients were included in the study (Figure 2), with baseline clinical characteristics detailed in Table 1. The Pre-DM and T2DM groups exhibited older average ages compared to the control group, although no significant age difference was observed between the Pre-DM and T2DM groups. Hypertension was identified as a risk factor for diabetes, with a markedly higher prevalence of hypertension among T2DM patients. BMI and WHR measurements indicated greater obesity in both the Pre-DM and T2DM groups relative to the control group.
Figure 2. Flow Chart of the Clinical Cohort. A total of 28 Control patients, 30 Pre-DM patients, and 37 T2DM patients were selected from the screened population. Tongue images were captured via TFDA-1, followed by analysis of tongue image characteristics. Additionally, oral and fecal samples were obtained for bioinformatic profiling, utilizing 16S-FAST for subsequent analysis.
3.2 The change of tongue
Following the extraction of tongue image features, three sets of crowd computer tongue image parameters were identified. Notably, a significant increase in preALL was observed in both the Pre-DM and T2DM groups, indicating a marked thickening and greasiness of the tongue coating. The Con* and MEAN* values for both the tongue body and coating in the T2DM group were substantially higher compared to the other groups. Furthermore, the ASM* and ENT* values for the tongue coating displayed significant variation when compared to the Control and Pre-DM groups. These results suggest a progressive transition in the tongue texture from smooth to rough, indicative of aging as diabetes progresses. Analyzing the color parameters of the tongue and coating, a gradual shift towards paler and whiter shades was noted, as shown in Table 2 and Figure 3.
Figure 3. Tongue image features of participants. (A) Three groups of tongue images;(B) tongue coating thickness index; (C) tongue texture; (D) tongue body color; (E) tongue coating color; *Compared with Control group, P < 0.05; ** Compared with Control group, P < 0.01; #Compared with Pre-DM group, P < 0.05; ##Compared with Pre-DM group, P < 0.01.
3.3 Alerted diversity of the oral and gut microbiota of participants
α-diversity analysis revealed a marked reduction in the richness and evenness of gut microbiota as diabetes progressed, contrasted by a significant rise in oral microbiota species abundance in both the pre-DM and T2DM groups (Figures 4A, B). This discrepancy may be attributed to the thickened tongue coating observed in these groups. PCoA analysis further demonstrated partial overlap between the pre-DM and control groups, while the T2DM group exhibited distinct shifts in community composition, highlighting notable alterations in microbial structure (Figures 4C, D).
Figure 4. Alterations in the oral and gut microbiota diversity in Pre-DM and T2DM patients were presented as follows: (A) Oral microbiota α-diversity, assessed via Observed species count and Shannon index; (B) Gut microbiota α-diversity, similarly evaluated with Observed species and Shannon index; (C) PCoA analysis of the oral microbiota; (D) PCoA analysis of gut microbiota across participants.
3.4 Changes in the composition of the oral and gut microbiota in patients with Pre-DM and T2DM
Significant variations were observed in the oral and gut microbiota at both the phylum and genus levels (Figure 5). In the oral microbiota, diabetes progression was associated with a marked increase in Firmicutes-C and a decrease in Fusobacteriota. Notably, the Pre-DM group exhibited a rise in Bacteroidota and a reduction in Actinobacteriota compared to the control and T2DM groups. In the gut microbiota, Bacteroidota levels were significantly elevated, while Firmicutes-A and Actinobacteriota showed substantial reductions in the T2DM group relative to other groups. The microbiota shifts in Pre-DM largely mirrored those in T2DM, though the increase in Firmicutes-C was most prominent in the Pre-DM group. A significant rise in Proteobacteria was observed exclusively in the T2DM group.
Figure 5. Compositional alterations of oral and gut microbiota in participants. (A) Stacked bar plots presenting the relative abundance of oral microbiota at the phylum level among participants; (B) Stacked bar plots illustrating the relative abundance of gut microbiota at the phylum level among participants; (C) Stacked bar plots depicting the relative abundance of oral microbiota at the genus level among participants; (D) Stacked bar plots demonstrating the relative abundance of gut microbiota at the genus level among participants.
At the genus level, Pauljensenia and Veillonella-A exhibited significant enrichment, while Neisseria showed a slight reduction in the T2DM group, aligning with the trends observed in Pre-DM. Additionally, Haemophilus-D and Porphyromonas-A were specifically reduced in the T2DM group. In the gut microbiota, Bacteroides levels increased, whereas Faecalibacterium and Bifidobacterium declined as the disease progressed. Notably, Phocaeicola exhibited the most significant increase in the Pre-DM group. Escherichia was markedly enriched in the T2DM group, where it accounted for the highest relative abundance.
3.5 Oral and gut signature microbiota in Pre-DM and T2DM
LEfSe analysis revealed significant distinctions in tongue coating and fecal microbiota between the Pre-DM and T2DM groups (Figure 6). At the genus level, Porphyromonas, AlloPrevotella, and Staphylococcus within the oral microbiota, alongside Blautia, Lactiplantibacillus, and Romboutsia-B in the gut microbiota, emerged as potential microbial markers for Pre-DM. In contrast, Escherichia, Klebsiella, and AlloPrevotella in the gut microbiota were identified as potential indicators of T2DM.
Figure 6. LEfSe analysis of oral and gut signature microbiota in Pre-DM and T2DM (the LDA threshold was 3). (A) Taxonomic branching map of oral microbiota in Pre-DM group and T2DM group; (B) Bar chart of oral microbiota in Pre-DM group and T2DM group; (C) Taxonomic branching map of gut microbiota in Pre-DM group and T2DM group; (D) Bar chart of gut microbiota in Pre-DM group and T2DM group.
3.6 Association analysis between tongue features and microbiota of the oral-gut axis
Abnormal FBG patients exhibiting thin white fur (TW-fur) and white greasy fur (WG-fur) were selected to investigate the relationship between tongue characteristics and the oral-gut axis microbiota. At the genus level, WG-fur was associated with elevated levels of Veillonella-A and Streptococcus in the oral cavity, while increased Blautia and Prevotella were observed in the gut. The majority of the altered microbiota belonged to the phylum Firmicutes. Pearson correlation analysis revealed significant associations between oral-gut microbiota and tongue parameters. Specifically, perALL showed a positive correlation with Phocaeicola-A and Veillonella-A, while TB-a exhibited the strongest correlation with Blautia-A (correlation coefficient 0.28, P<0.01). Additionally, Escherichia was positively correlated with multiple tongue image parameters, including TB-L, TC-L, and TC-b (Figure 7).
Figure 7. Association analysis between the microbiota of the oral-gut axis in TW-fur and WG-fur. (A) Two distinct groups of tongue images; (B) Stacked bar plots illustrating statistically significant differences in flora at the genus level; (C) Heat maps depicting species composition at the genus level for the two sets, with UPGMA clustering based on Pearson correlation coefficient matrix and ranked by clustering results; (D) Correlation heat map showing the relationship between tongue parameters and oral-gut axis microbiota (correlation coefficients as values); (E) Predicted metabolic pathways of tongue coating microbiota in WG-fur patients (The x-axis represents the average relative abundance of functional pathways, the y-axis lists MetaCyc functional pathways at the second classification level, and the right margin indicates the first-level pathway classification).
To investigate the mechanism underlying greasy fur formation, an abundance analysis of KEGG functional pathways was performed. The findings indicate that the differential alterations in tongue fur were predominantly associated with pathways related to Metabolism, with the highest abundance observed in metabolic cofactors and vitamins (Figure 7). According to TCM, tongue coating results from “disharmony of the viscera and fumigation of the spleen and stomach.” Increased metabolic activity elevates blood flow to the tongue, leading to a thickened coating. The metabolism of cofactors, vitamins, and other nutrients indirectly influences the tongue coating by impacting nutritional status, metabolic function, and visceral health.
3.7 Pre-DM and T2DM diagnostic models based on tongue image and oral and gut flora biomarker fusion
A diagnostic model was developed to assess the predictive value of tongue images and microbiota in identifying Pre-DM and T2DM. Variables included gender, age, BMI, WHR, tongue image parameters (perALL, TB-Con, TC-Con, TB-ASM, TB-ENT, TB-MEAN, TC-ASM, TC-ENT, TC-MEAN, TB-L, TB-a, TB-b, TC-L, TC-a, and TC-b), and key microorganisms from the oral and gut microbiome (Bacteroides-H, Phocaeicola-A, Escherichia, and Porphyromonas-A). Six modelling techniques were applied—Logistic Regression, SVM, Random Forest, Gradient Boosting, AdaBoost, and KNN—to differentiate Pre-DM, T2DM, and healthy controls. Among the three classification models, SVM achieved the highest accuracy, followed by KNN, while Gradient Boosting demonstrated the poorest performance for classifying Pre-DM and T2DM (Figure 8, Table 3). Feature Importance analysis from the Random Forest model highlighted TB-a and perALL from the tongue image parameters, along with Escherichia and Porphyromonas-A, as key contributors to the model’s predictive performance, serving as primary classification indicators for Pre-DM and T2DM diagnosis (Figure 9).
Figure 8. Different modeling method was adopted to establish the Pre-DM and T2DM diagnosis model of Receiver Operating Characteristic (ROC) curve.
4 Discussion
Tongue image diagnosis plays a central role in TCM diagnosis, with recent research indicating a strong correlation between changes in tongue appearance, coating, and the oral microbiome (Han et al., 2016; Wilbert et al., 2020; Lu et al., 2022). As the oral cavity serves as the entry point to the digestive tract, the oral and gastrointestinal microbiota—the two largest microbiomes in the human body—are intricately connected. While gut microbiota has been extensively studied, particularly its role and metabolites in the development and progression of T2DM (Li et al., 2020; Longo et al., 2023), less attention has been given to the impact of oral microorganisms on T2DM. The oral-gut microbiota axis has been identified as a key mechanism through which oral microbiota influence host diseases (Li et al., 2024b; Zhang et al., 2024b). Previous studies from our group have identified shared commensal bacteria between the tongue coating and the intestines, particularly Prevotella (Guo et al., 2022). However, few studies have specifically addressed the role of oral microorganisms in T2DM. This study investigates the tongue image characteristics of Pre-DM and T2DM patients, alongside the dynamic changes in the oral-gut microbiota axis, through a clinical cohort study, emphasizing the relationship between oral microbiota and tongue image characteristics at different stages of diabetes progression. We can identify diabetes-related metabolites and study the link between specific flora and systemic inflammation by analyzing the oral-gut microbiota axis.
Tongue diagnosis, rooted in the TCM theory of the visceral picture, posits that internal organs are connected to the tongue via meridians, either directly or indirectly (Zhao et al., 2013). Consequently, abnormalities in tongue coating signify systemic imbalances, including alterations in the tongue-coating microbiome (Jiang et al., 2012). The physiological state and pathological conditions of internal organs manifest through changes in the tongue. In TCM, the formation of tongue coating is attributed to the disharmony of Zang-Fu organs, with “fumigation of the spleen and stomach” leading to increased metabolism, heightened tongue blood flow, and thickening of the coating. Additionally, nutrient metabolism, particularly involving cofactors and vitamins, influences the nutritional status, metabolic activity, and Zang-Fu functions, indirectly affecting the tongue coating. In TCM, diabetes is classified under “collateral disease,” characterized by symptoms such as excessive thirst, frequent urination, and weight loss, which corresponds to the term “thirst-quenching” (Liu et al., 2024). Diabetes is primarily considered a result of “humidity” and “heat” imbalances in the body (Li et al., 2024a). Tongue coating in Pre-DM and T2DM patients often appears thick and greasy, with a rough texture and aged appearance. Color parameters indicate that both tongue and moss colors progressively become pale and white.
Research has increasingly established a clear association between gut microbiota dysregulation and the onset of T2DM (Sharma and Tripathi, 2019). Diabetic patients exhibit significant alterations in gut microbiota composition compared to healthy individuals, with microbial biodiversity in Pre-DM and T2DM progressively declining (Yang et al., 2021). This observation aligns with the current study’s findings, where both the Observed Species and Shannon index of intestinal flora in Pre-DM and T2DM patients were markedly lower than in healthy controls, indicating a reduction in microbial richness associated with diabetes. In metabolic disorders, the gut microbiome is frequently characterized by dysbiosis, typically manifesting as a reduction in commensal bacteria alongside an increase in pathogenic species. This shift results in an overrepresentation of normally minor bacterial populations, particularly opportunistic pathogens, and a corresponding decline in overall diversity (Scheithauer et al., 2020). Throughout the progression from pre-glucose intolerance to T2DM, bacterial changes often follow a synchronous pattern (Allin et al., 2018), a trend that this study also confirmed. Specifically, gut microbiota composition evolved with diabetes progression, evidenced by an increase in Bacteroides-H and a decrease in Faecalibacterium and Bifidobacterium. The most pronounced increase in Phocaeicola was observed in the Pre-DM group, while Escherichia was significantly enriched in T2DM, holding the highest relative abundance. These trends are consistent with the findings of Li Wang et al. (Wang et al., 2021) and Christian Diener et al (Diener et al., 2021). Additionally, Xiuying Zhang et al. (Zhang et al., 2013) demonstrated that the relative abundance of Bacteroides fluctuates significantly in response to worsening glucose intolerance.
In this study, the analysis integrated oral flora alongside intestinal flora, revealing a notable contrast between the two. The oral microbiota showed a marked increase in species abundance in both Pre-DM and T2DM groups compared to the gut microbiota. This discrepancy may be attributed to the thickened tongue coating observed in patients, in contrast to the thin coating in healthy individuals. As the disease progresses, the accumulation of phlegm and dampness leads to a thicker, greasier tongue coating, corresponding with an increase in oral microbial diversity. Further species composition analysis identified significant enrichment of Pauljensenia and Veillonella-A in the T2DM group, while Neisseria exhibited a slight decrease, consistent with the trends observed in Pre-DM. Additionally, Haemophilus-D and Porphyromonas-A were reduced exclusively in the T2DM group. LEfSe analysis indicated that Porphyromonas may serve as a key marker distinguishing Pre-DM from T2DM, given its association with chronic inflammatory diseases. Insulin resistance, thus, seems secondary to the inflammatory process, where both innate and adaptive immune responses are potentially driven by microbiota-induced inflammation (Cai et al., 2011; Agrawal and Kant, 2014). These results highlight the pivotal role of inflammation in the development and progression of diabetes.
To investigate the association between TCM tongue diagnosis and microbiota, patients with thin white fur and white greasy fur were selected to examine the correlation between tongue characteristics and the oral-gut microbiota axis. Analysis of species composition revealed that the predominant flora in both the oral cavity and intestines of patients with white greasy coating belonged to Firmicutes, a key producer of butyric acid among short-chain fatty acids (SCFAs). SCFAs play a role in fatty acid oxidation, glucose metabolism, and inflammation (Rivera-Chávez et al., 2016), with butyric acid promoting cytokine production in anti-inflammatory regulatory T cells and enhancing lipolysis (Atarashi et al., 2011, 2013; Rumberger et al., 2014). To further understand the mechanism behind greasy fur formation, an abundance analysis of KEGG functional pathways was performed. Results indicated that the differential changes in tongue coating were primarily linked to pathways involved in Metabolism, with the highest abundance observed in metabolic cofactors and vitamins.
Advancements in science and technology have accelerated the application of deep learning models in disease diagnosis and classification (Zhang et al., 2024a). RF analysis and ANN models have been employed to identify key signature genes and develop diagnostic frameworks (Yao et al., 2024). Xiaozhou Lu et al. (Lu et al., 2024) demonstrated that deep learning-based tongue image analysis serves as an effective screening tool for liver fibrosis. In previous research, tongue image analysis was integrated with microbiome technology to develop an early screening model for MAFLD with enhanced accuracy (Dai et al., 2024). This study marks the first instance of combining tongue analysis and microbiome data for the prediction and diagnosis of Pre-DM and T2DM, achieving high precision. Among the three diagnostic models, SVM showed the highest accuracy, reaching 78.9%. Contribution analysis identified TB-a and perALL in tongue image parameters, along with Escherichia and Porphyromonas-A in microbiota, as primary classification indicators for Pre-DM and T2DM diagnosis. Non-laboratory-based risk models offer the potential to identify and prioritize individuals at higher risk, guiding targeted diagnostic testing and preventive measures. Constructing a diagnostic model confirms the importance of tongue image and oral flora in diagnosing diabetes, providing a basis for further exploration of the mechanism.
This experiment led to several key conclusions. First, distinct characteristic markers are present at various stages of diabetes progression. The increased abundance of Porphyromonas in the oral microbiota and Blautia in the gut microbiota not only serve as microbiological indicators for Pre-DM patients but also function as risk predictors for T2DM. Escherichia is significantly elevated in T2DM and represents a potential microbial marker for the condition. Additionally, the study identified alterations in tongue imagery and the oral-gut axis across different stages of diabetes, emphasizing the central role of elevated Firmicutes in the oral-gut axis in the development of a white, greasy coating, closely linked to metabolic processes. A diagnostic and predictive model was also developed, achieving high accuracy through the SVM model. Parameters such as TB-a and perALL in tongue imagery, along with Escherichia and Porphyromonas-A in microbiota, emerged as primary classifiers for Pre-DM and T2DM diagnosis. The findings further underscore the relevance of the oral microbiota present on tongue coatings, supporting the scientific foundation of TCM tongue diagnosis in disease management.
This exploratory research combined tongue diagnosis from TCM with analysis of the oral-gut microbiome axis. However, limitations due to the small sample size led to overfitting in the machine learning model, and validation of the corresponding diagnostic model remains insufficient. Future studies will aim to address these issues by increasing the sample size, controlling for confounding variables, and incorporating additional bioinformatics data to further identify diagnostic markers for Pre-DM and T2DM.
Data availability statement
The datasets presented in this study can be found in online repositories. The names of the repository/repositories and accession number(s) can be found in the article/supplementary material.
Ethics statement
The studies involving humans were approved by Shuguang Hospital, Affiliated with Shanghai University of TCM. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
JD: Writing – original draft, Writing – review & editing. SD: Writing – original draft. SL: Writing – original draft. LT: Methodology, Supervision, Writing – review & editing. JC: Conceptualization, Formal Analysis, Writing – review & editing. XH: Project administration, Supervision, Writing – review & editing. XQ: Methodology, Software, Writing – review & editing. TJ: Conceptualization, Funding acquisition, Methodology, Resources, Supervision, Validation, Writing – review & editing. JX: Funding acquisition, Methodology, Resources, Supervision, Validation, Writing – review & editing.
Funding
The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. The research, authorship, and publication of this article were financially supported by the National Natural Science Foundation of China (reference number: 82104738), the General Project of China Postdoctoral Science Foundation (reference number: 2023M732337), the High-level Key Discipline Construction Project of Traditional Chinese Medicine by the National Administration of Traditional Chinese Medicine (reference number: ZYYZDXK-2023069), and the Shanghai Super Postdoctoral Incentive Program (reference number: 2022509). The funding bodies had no involvement in the study’s design, execution, data collection, management, analysis, interpretation, or manuscript writing.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
Agrawal, N. K., Kant, S. (2014). Targeting inflammation in diabetes: Newer therapeutic options. World J. Diabetes 5, 697–710. doi: 10.4239/wjd.v5.i5.697
Allin, K. H., Tremaroli, V., Caesar, R., Jensen, B. A. H., Damgaard, M. T. F., Bahl, M. I., et al. (2018). Aberrant intestinal microbiota in individuals with prediabetes. Diabetologia 61, 810–820. doi: 10.1007/s00125-018-4550-1
Atarashi, K., Tanoue, T., Oshima, K., Suda, W., Nagano, Y., Nishikawa, H., et al. (2013). Treg induction by a rationally selected mixture of Clostridia strains from the human microbiota. Nature 500, 232–236. doi: 10.1038/nature12331
Atarashi, K., Tanoue, T., Shima, T., Imaoka, A., Kuwahara, T., Momose, Y., et al. (2011). Induction of colonic regulatory T cells by indigenous Clostridium species. Science 331, 337–341. doi: 10.1126/science.1198469
Cai, K., Qi, D., Wang, O., Chen, J., Liu, X., Deng, B., et al. (2011). TNF-α acutely upregulates amylin expression in murine pancreatic beta cells. Diabetologia 54, 617–626. doi: 10.1007/s00125-010-1972-9
Chicco, D., Warrens, M. J., Jurman, G. (2021). The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation. PeerJ Comput. Sci. 7, e623. doi: 10.7717/peerj-cs.623
Dai, S., Guo, X., Liu, S., Tu, L., Hu, X., Cui, J., et al. (2024). Application of intelligent tongue image analysis in Conjunction with microbiomes in the diagnosis of MAFLD. Heliyon 10, e29269. doi: 10.1016/j.heliyon.2024.e29269
Diener, C., Reyes-Escogido, M., de, L., Jimenez-Ceja, L. M., Matus, M., Gomez-Navarro, C. M., et al. (2021). Progressive shifts in the gut microbiome reflect prediabetes and diabetes development in a treatment-naive Mexican cohort. Front. Endocrinol. 11. doi: 10.3389/fendo.2020.602326
Eades, C. E., Leese, G. P., Evans, J. M. M. (2014). Incidence of impaired glucose regulation and progression to type 2 diabetes mellitus in the Tayside region of Scotland. Diabetes Res. Clin. Pract. 104, e16–e19. doi: 10.1016/j.diabres.2014.01.012
Gao, L., Xu, T., Huang, G., Jiang, S., Gu, Y., Chen, F. (2018). Oral microbiomes: more and more importance in oral cavity and whole body. Protein Cell 9, 488–500. doi: 10.1007/s13238-018-0548-1
Guo, X.-J., Dai, S.-X., Lou, J., Ma, X.-X., Hu, X.-J., Tu, L.-P., et al. (2023). Distribution characteristics of oral microbiota and its relationship with intestinal microbiota in patients with type 2 diabetes mellitus. Front. Endocrinol. 14. doi: 10.3389/fendo.2023.1119201
Guo, X.-J., Jiang, T., Ma, X.-X., Hu, X.-J., Huang, J.-B., Cui, L.-T., et al. (2022). Relationships between diurnal changes of tongue coating microbiota and intestinal microbiota. Front. Cell. Infect. Microbiol. 12. doi: 10.3389/fcimb.2022.813790
Han, S., Yang, X., Qi, Q., Pan, Y., Chen, Y., Shen, J., et al. (2016). Potential screening and early diagnosis method for cancer: Tongue diagnosis. Int. J. Oncol. 48, 2257–2264. doi: 10.3892/ijo.2016.3466
Jiang, B., Liang, X., Chen, Y., Ma, T., Liu, L., Li, J., et al. (2012). Integrating next-generation sequencing and traditional tongue diagnosis to determine tongue coating microbiome. Sci. Rep. 2, 936. doi: 10.1038/srep00936
Jiang, T., Hu, X.-J., Yao, X.-H., Tu, L.-P., Huang, J.-B., Ma, X.-X., et al. (2021). Tongue image quality assessment based on a deep convolutional neural network. BMC Med. Inform. Decis. Mak. 21, 147. doi: 10.1186/s12911-021-01508-8
Jiang, T., Lu, Z., Hu, X., Zeng, L., Ma, X., Huang, J., et al. (2022). Deep learning multi-label tongue image analysis and its application in a population undergoing routine medical checkup. Evid.-Based Complement. Altern. Med. ECAM 2022, 3384209. doi: 10.1155/2022/3384209
Karst, S. M., Dueholm, M. S., Mcllroy, S. J., Kirkegaard, R. H., Nielsen, P. H., Albertsen, M., et al. Retrieval of a million high-quality, full-length microbial 16S and 18S rRNA gene sequences without primer bias. Nat Biotechnol. (2018) 36 (2), 190–195. doi: 10.1038/ nbt.4045
Li, J., Chen, Q., Hu, X., Yuan, P., Cui, L., Tu, L., et al. (2021a). Establishment of noninvasive diabetes risk prediction model based on tongue features and machine learning techniques. Int. J. Med. Inf. 149, 104429. doi: 10.1016/j.ijmedinf.2021.104429
Li, J., Guo, X., Huang, Y., Ren, Y., Liang, H., Xie, X., et al. (2024a). Exploring the mechanism of Bushen Huoxue prescription in the treatment of early diabetic retinal edema from the perspective of inner blood-retinal barrier injury. J. Ethnopharmacol. 323, 117658. doi: 10.1016/j.jep.2023.117658
Li, J., Huang, J., Jiang, T., Tu, L., Cui, L., Cui, J., et al. (2022). A multi-step approach for tongue image classification in patients with diabetes. Comput. Biol. Med. 149, 105935. doi: 10.1016/j.compbiomed.2022.105935
Li, L., Li, C., Lv, M., Hu, Q., Guo, L., Xiong, D. (2020). Correlation between alterations of gut microbiota and miR-122-5p expression in patients with type 2 diabetes mellitus. Ann. Transl. Med. 8, 1481. doi: 10.21037/atm-20-6717
Li, Y., Liu, Y., Cui, J., Zhu, M., Wang, W., Chen, K., et al. (2024b). Oral-gut microbial transmission promotes diabetic coronary heart disease. Cardiovasc. Diabetol. 23, 123. doi: 10.1186/s12933-024-02217-y
Li, J., Yuan, P., Hu, X., Huang, J., Cui, L., Cui, J., et al. (2021b). A tongue features fusion approach to predicting prediabetes and diabetes with machine learning. J. Biomed. Inform. 115, 103693. doi: 10.1016/j.jbi.2021.103693
Lin, S., Li, Z., Fu, B., Chen, S., Li, X., Wang, Y., et al. (2020). Feasibility of using deep learning to detect coronary artery disease based on facial photo. Eur. Heart J. 41, 4400–4411. doi: 10.1093/eurheartj/ehaa640
Liu, X.-Y., Zheng, H.-W., Wang, F.-Z., Atia, T.-W., Fan, B., Wang, Q. (2024). Developments in the study of Chinese herbal medicine’s assessment index and action mechanism for diabetes mellitus. Anim. Models Exp. Med. 7 (4), 433–443. doi: 10.1002/ame2.12455
Longo, S., Rizza, S., Federici, M. (2023). Microbiota-gut-brain axis: relationships among the vagus nerve, gut microbiota, obesity, and diabetes. Acta Diabetol. 60, 1007–1017. doi: 10.1007/s00592-023-02088-x
Lu, X.-Z., Hu, H.-T., Li, W., Deng, J.-F., Chen, L., Cheng, M.-Q., et al. (2024). Exploring hepatic fibrosis screening via deep learning analysis of tongue images. J. Tradit. Complement. Med. 14, 544–549. doi: 10.1016/j.jtcme.2024.03.010
Lu, C., Zhu, H., Zhao, D., Zhang, J., Yang, K., Lv, Y., et al. (2022). Oral-gut microbiome analysis in patients with metabolic-associated fatty liver disease having different tongue image feature. Front. Cell. Infect. Microbiol. 12. doi: 10.3389/fcimb.2022.787143
Pan, Y., Bu, T., Deng, X., Jia, J., Yuan, G. (2024). Gut microbiota and type 2 diabetes mellitus: a focus on the gut-brain axis. Endocrine 84, 1–15. doi: 10.1007/s12020-023-03640-z
Paquette, S., Thomas, S. C., Venkataraman, K., Appanna, V. D., Tharmalingam, S. (2023). The effects of oral probiotics on type 2 diabetes mellitus (T2DM): A clinical trial systematic literature review. Nutrients 15, 4690. doi: 10.3390/nu15214690
Rivera-Chávez, F., Zhang, L. F., Faber, F., Lopez, C. A., Byndloss, M. X., Olsan, E. E., et al. (2016). Depletion of butyrate-producing clostridia from the gut microbiota drives an aerobic luminal expansion of salmonella. Cell Host Microbe 19, 443–454. doi: 10.1016/j.chom.2016.03.004
Rumberger, J. M., Arch, J. R. S., Green, A. (2014). Butyrate and other short-chain fatty acids increase the rate of lipolysis in 3T3-L1 adipocytes. PeerJ 2, e611. doi: 10.7717/peerj.611
Scheithauer, T. P. M., Rampanelli, E., Nieuwdorp, M., Vallance, B. A., Verchere, C. B., van Raalte, D. H., et al. (2020). Gut microbiota as a trigger for metabolic inflammation in obesity and type 2 diabetes. Front. Immunol. 11. doi: 10.3389/fimmu.2020.571731
Sharma, S., Tripathi, P. (2019). Gut microbiome and type 2 diabetes: where we are and where to go? J. Nutr. Biochem. 63, 101–108. doi: 10.1016/j.jnutbio.2018.10.003
Soper, D. S. (2021). Greed is good: rapid hyperparameter optimization and model selection using greedy k-fold cross validation. Electronics 10, 1973. doi: 10.3390/electronics10161973
Standl, E., Khunti, K., Hansen, T. B., Schnell, O. (2019). The global epidemics of diabetes in the 21st century: Current situation and perspectives. Eur. J. Prev. Cardiol. 26, 7–14. doi: 10.1177/2047487319881021
Takeuchi, T., Kubota, T., Nakanishi, Y., Tsugawa, H., Suda, W., Kwon, A. T.-J., et al. (2023). Gut microbial carbohydrate metabolism contributes to insulin resistance. Nature 621, 389–395. doi: 10.1038/s41586-023-06466-x
Thirunavukkarasu, U., Umapathy, S., Ravi, V., Alahmadi, T. J. (2024). Tongue image fusion and analysis of thermal and visible images in diabetes mellitus using machine learning techniques. Sci. Rep. 14, 14571. doi: 10.1038/s41598-024-64150-0
Tuganbaev, T., Yoshida, K., Honda, K. (2022). The effects of oral microbiota on health. Science 376, 934–936. doi: 10.1126/science.abn1890
Wang, L., Gao, P., Zhang, M., Huang, Z., Zhang, D., Deng, Q., et al. (2017). Prevalence and ethnic pattern of diabetes and prediabetes in China in 2013. JAMA 317, 2515–2523. doi: 10.1001/jama.2017.7596
Wang, L., Yu, X., Xu, X., Ming, J., Wang, Z., Gao, B., et al. (2021). The fecal microbiota is already altered in normoglycemic individuals who go on to have type 2 diabetes. Front. Cell. Infect. Microbiol. 11. doi: 10.3389/fcimb.2021.598672
Wilbert, S. A., Mark Welch, J. L., Borisy, G. G. (2020). Spatial ecology of the human tongue dorsum microbiome. Cell Rep. 30, 4003–4015.e3. doi: 10.1016/j.celrep.2020.02.097
Yang, G., Wei, J., Liu, P., Zhang, Q., Tian, Y., Hou, G., et al. (2021). Role of the gut microbiota in type 2 diabetes and related diseases. Metabolism. 117, 154712. doi: 10.1016/j.metabol.2021.154712
Yao, N., Hu, H., Chen, K., Huang, H., Zhao, C., Guo, Y., et al. (2024). A robust deep learning method with uncertainty estimation for the pathological classification of renal cell carcinoma based on CT images. J. Imaging Inform. Med. doi: 10.1007/s10278-024-01276-7
Zhang, H., Hua, H., Wang, C., Zhu, C., Xia, Q., Jiang, W., et al. (2024a). Construction of an artificial neural network diagnostic model and investigation of immune cell infiltration characteristics for idiopathic pulmonary fibrosis. BMC Pulm. Med. 24, 458. doi: 10.1186/s12890-024-03249-6
Zhang, H., Lin, Y., Li, S., Bi, J., Zeng, J., Mo, C., et al. (2024b). Effects of bacterial extracellular vesicles derived from oral and gastrointestinal pathogens on systemic diseases. Microbiol. Res. 285, 127788. doi: 10.1016/j.micres.2024.127788
Zhang, X., Shen, D., Fang, Z., Jie, Z., Qiu, X., Zhang, C., et al. (2013). Human gut microbiota changes reveal the progression of glucose intolerance. PLoS One 8, e71108. doi: 10.1371/journal.pone.0071108
Keywords: tongue diagnosis, oral-gut microbiome, prediabetes mellitus, type 2 diabetes mellitus, diagnostic model
Citation: Deng J, Dai S, Liu S, Tu L, Cui J, Hu X, Qiu X, Jiang T and Xu J (2024) Application of tongue image characteristics and oral-gut microbiota in predicting pre-diabetes and type 2 diabetes with machine learning. Front. Cell. Infect. Microbiol. 14:1477638. doi: 10.3389/fcimb.2024.1477638
Received: 08 August 2024; Accepted: 16 October 2024;
Published: 04 November 2024.
Edited by:
Soumyadev Sarkar, Arizona State University, United StatesReviewed by:
Yanlong Shi, Nanjing Medical University, ChinaBenli Su, Second Hospital of Dalian Medical University, Dalian, China
Copyright © 2024 Deng, Dai, Liu, Tu, Cui, Hu, Qiu, Jiang and Xu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Jiatuo Xu, eGp0QGZ1ZGFuLmVkdS5jbg==; Tao Jiang, amlhbmd0YW9Ac2h1dGNtLmVkdS5jbg==; Xipeng Qiu, eHBxaXVAZnVkYW4uZWR1LmNu
†These authors have contributed equally to this work and share first authorship