Identifying diagnostic indicators for type 2 diabetes mellitus from physical examination using interpretable machine learning approach

Lv, Xiang; Luo, Jiesi; Huang, Wei; Guo, Hui; Bai, Xue; Yan, Pijun; Jiang, Zongzhe; Zhang, Yonglin; Jing, Runyu; Chen, Qi; Li, Menglong

doi:10.3389/fendo.2024.1376220

ORIGINAL RESEARCH article

Front. Endocrinol., 18 March 2024

Sec. Clinical Diabetes

Volume 15 - 2024 | https://doi.org/10.3389/fendo.2024.1376220

This article is part of the Research TopicDigital Technology in the Management and Prevention of DiabetesView all 11 articles

Identifying diagnostic indicators for type 2 diabetes mellitus from physical examination using interpretable machine learning approach

Xiang Lv^1†

Jiesi Luo^2†

Wei Huang^3,4†

Hui Guo¹

Xue Bai^3,4

Pijun Yan^3,4

Zongzhe Jiang^3,4

Yonglin Zhang⁵

Runyu Jing^6*

Qi Chen^3,4,7,8*

Menglong Li^1*

¹College of Chemistry, Sichuan University, Chengdu, China
²Basic Medical College, Southwest Medical University, Luzhou, China
³Department of Endocrinology and Metabolism, The Affiliated Hospital of Southwest Medical University, Luzhou, China
⁴Metabolic Vascular Disease Key Laboratoryof Sichuan Province, The Affiliated Hospital of Southwest Medical University, Luzhou, China
⁵Department of Pharmacy, The Affiliated Hospital of North Sichuan Medical College, Nanchong, China
⁶School of Cyber Science and Engineering, Sichuan University, Chengdu, China
⁷Department of Nursing, The Affiliated Hospital of Southwest Medical University, Luzhou, China
⁸School of Nursing, Southwest Medical University, Luzhou, China

Background: Identification of patients at risk for type 2 diabetes mellitus (T2DM) can not only prevent complications and reduce suffering but also ease the health care burden. While routine physical examination can provide useful information for diagnosis, manual exploration of routine physical examination records is not feasible due to the high prevalence of T2DM.

Objectives: We aim to build interpretable machine learning models for T2DM diagnosis and uncover important diagnostic indicators from physical examination, including age- and sex-related indicators.

Methods: In this study, we present three weighted diversity density (WDD)-based algorithms for T2DM screening that use physical examination indicators, the algorithms are highly transparent and interpretable, two of which are missing value tolerant algorithms.

Patients: Regarding the dataset, we collected 43 physical examination indicator data from 11,071 cases of T2DM patients and 126,622 healthy controls at the Affiliated Hospital of Southwest Medical University. After data processing, we used a data matrix containing 16004 EHRs and 43 clinical indicators for modelling.

Results: The indicators were ranked according to their model weights, and the top 25% of indicators were found to be directly or indirectly related to T2DM. We further investigated the clinical characteristics of different age and sex groups, and found that the algorithms can detect relevant indicators specific to these groups. The algorithms performed well in T2DM screening, with the highest area under the receiver operating characteristic curve (AUC) reaching 0.9185.

Conclusion: This work utilized the interpretable WDD-based algorithms to construct T2DM diagnostic models based on physical examination indicators. By modeling data grouped by age and sex, we identified several predictive markers related to age and sex, uncovering characteristic differences among various groups of T2DM patients.

1 Introduction

Type 2 diabetes mellitus (T2DM) is the most common type of diabetes mellitus (DM), whose pathogenesis is that the cells in the body are not sensitive to insulin, meaning they do not respond to insulin (1). A longer disease duration of diabetes often leads to a variety of complications, such as retinopathy (2), cardiovascular disease, stroke (3, 4), and diabetic foot (5). It is estimated that about half of T2DM patients do not know they have diabetes (44.7%) (6). Therefore, screening for T2DM is essential to prevent or delay complications, avoid premature death, and improve quality of life.

Manual review of a large amount of clinical data is time-consuming and laborious, and missed diagnosis will be inevitable (6, 7). Thus, leveraging machine learning for T2DM screening has emerged as a notable approach in auxiliary diagnostics, enhancing both the accuracy and efficiency of diagnoses. Currently, machine learning models such as the random forest (RF) (8–10), support vector machine (SVM) (8, 11), logistic regression (LR) (11–13), and eXtreme gradient boosting (XGBoost) (9, 14) have been developed for constructing accurate system of T2DM prediction. Some studies have also employed machine learning techniques to identify indicators associated with T2DM, such as the white blood cell (WBC) (15), urinary and dietary metal exposure (16) and serum calcium (17). These works demonstrate the effectiveness of machine learning in predicting T2DM and identifying relevant indicator information.

For the construction of T2DM diagnostic models, the existing problems are as follows: (I) The effective extraction of T2DM diagnostic indicators through machine learning often relies on their interpretability (10, 12, 18–23). However, some of the current work lacks evaluation of important indicators, and some rely on third-party tools such as Shapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME) (8, 9, 11), which may bring potential deviation in clinical understanding (24). (II) The clinical indicators and datasets are critical to whether a model can be used in practice. At present, the frequently used diabetes dataset public like PIMA Indian dataset contains only 8 clinical indicators (25), and unconventional indicators using in some works are often difficult to obtain in community hospitals. Therefore, it is valuable to use physical examination indicators for T2DM prediction. (III) The problem of missing values in EHRs is unavoidable during data analysis. Currently, methods based on data imputation often require exhaustive searching (26). How to handle these missing values reasonably and efficiently is a matter that needs consideration.

To this end, we introduced three weighted diversity density (WDD)-based algorithms with a focus on intrinsic interpretability, which two of the algorithms could ‘tolerate’ missing value by adding penalty terms. By applying these algorithms to physical examination data for T2DM, we identified several clinical indicators related to T2DM diagnosis, including age-related markers like glomerular filtration rate (GFR) and triglycerides (TG). Additionally, by analyzing the model’s internal parameters, we can gain a better understanding of the clinical indicators the model relies on for predictions, without the need for third-party interpretability tools.

2 Materials and methods

2.1 Dataset summary

All electronic health record (EHR) data came from the Affiliated Hospital of Southwest Medical University. A total of 16,004 EHRs and 43 usable physical examination indicators were screened out, of which half explicitly contained information about a confirmed T2DM diagnosis (Figures 1, 2; Table 1). In order to capture characteristics of early-stage T2DM, the EHRs of T2DM patients were limited to their first record in the hospital system. The physical examination indicators could be divided into three categories: routine urine indicators (9 indicators), blood cell analysis indicators (24 indicators), and biochemical indicators (10 indicators) (Figures 1, 2), the name and the abbreviation of the indicators were shown in Supplementary Table S1.

Figure 1

Figure 1 Overview of the study design. (A) The workflow of this work. (B) The first two steps in A, 11071 T2DM electronic health records (EHRs) and 126622 physical examination EHRs were collected. After preprocessing, 16004 EHRs were selected to build models for T2DM prediction. (C) The last two steps in A and the basic principle of the weighted diversity density method.

Figure 2

Figure 2 Flowchart of inclusion and exclusion criteria for the study populations of patients with type 2 diabetes mellitus (T2DM) and the physical examination population. We only use the first electronic health records for each patient in the hospital system.

Table 1

Table 1 Characteristics of the study population.

For these datasets divided according to the physical examination items, to facilitate their description here, we chose some standardized abbreviations: Whole physical examination indicators dataset (PEI dataset), Blood cell analysis dataset (BCA dataset), Urinalysis dataset (Uri dataset), Biochemical dataset (BioChem dataset).

2.2 Data preprocessing

On the collected data, three steps were performed before modelling:

(1) First, we retained only the physical examination indicators for exploring the association of these indicators with T2DM diagnosis. In total, 181 features, of which 52 are physical examination features, remained afterwards.

(2) The features were normalized by $X_{i j}^{'} = \frac{(X_{i j} - μ_{j})}{σ_{j}}$ , where $μ_{j}$ and $σ_{j}$ are the average and standard deviation of the $i$ th record, respectively. To avoid the influence from outliers, a featurewise box-plot analysis was performed to remove the features located outside of $[Q 1 - 1.5 I Q R, Q 3 + 1.5 I Q R]$ when calculating $μ_{j}$ and $σ_{j}$ . These outliers were replaced by null values. Any feature with a standard deviation of 0 (that is, the feature value is the same in all samples) was deleted, leaving 43 dimensions of physical examination features.

(3) When using the WDD-KNN algorithm for modelling, we used the K-nearest neighbour (KNN) imputer to impute the missing values of the data (n_neighbours = 15) and then normalized the processed data again in step (2).

The missing rate of each feature is shown in Supplementary Table S3. When using the MVT-WDD-DI and MVT-WDD-BF algorithms for modelling, we kept the missing value of each dimension feature of positive and negative samples consistent to eliminate bias. (Details in the Supplementary Note 4: Biased distribution misleading model using features).

2.3 Preliminary of weighted diversity density algorithm

The WDD algorithm, initially proposed by Maron for multi-instance learning problems (27), utilizes radial basis distance metrics to measure classification probabilities. We developed three new algorithms based on WDD in this work. Its non-linear nature and transparent framework make it particularly suitable for the medical field, where high interpretability in models is essential. Building upon the WDD framework, we have made enhancements to adapt it for medical classification problems, ensuring it can effectively handle missing values. Additionally, another reason for selecting the WDD method is its capability to provide a risk score for each sample, similar to the diagnostic approach of a clinical physician. This is achieved through its internal distance function, rather than merely outputting a categorical label. By decomposing the distance function at the feature level, we are further able to extract the feature-based criteria on which the model relies. This significantly enhances the transparency of the model. Unlike traditional models that offer limited insight into their decision-making process, WDD allows for a deeper understanding of how and why certain diagnostic conclusions are reached. This alignment with clinical practices not only aids in the interpretability of the model but also fosters greater trust and reliability in its application in medical settings. The ability to dissect the model’s reasoning at a feature level offers invaluable insights into the diagnostic criteria, bridging the gap between machine learning outputs and clinical decision-making.

The major principle of the WDD algorithm (Figure 1C) is to find an optimal point $x = {x_{1}, x_{2}, \dots, x_{f}}$ in the data space that maximizes the probability density that positive samples ( $b_{i}^{+}$ = ${b_{i 1}^{+}, b_{i 2}^{+}, \dots, b_{i f}^{+}}$ ) are near this point and minimizes the probability density that negative samples ( $b_{i}^{-}$ = ${b_{i 1}^{-}, b_{i 2}^{-}, \dots, b_{i f}^{-}}$ ) are near it, where $i$ is the index of the sample, $f$ is the number of features. The modified WDD algorithm can be represented as:

\begin{array}{l} \arg \max_{x} (\prod_{i} \Pr (x = t | b_{i}^{+}) \prod_{i} \Pr (x = t | b_{i}^{-})) & (1) \end{array}

$t$ is the target point, $\Pr (x = t | b_{i}^{+})$ is the probability density that positive samples are near this point, and $\Pr (x = t | b_{i}^{-})$ is the probability density of negative samples. The implicit functions could be given as:

\begin{array}{l} \Pr (x = t | b_{i}^{+}) = \exp (- D i s t (b_{i}^{+}, x)) & (2) \end{array}

\begin{array}{l} \Pr (x = t | b_{i}^{-}) = 1 - \exp (- D i s t (b_{i}^{-}, x)) & (3) \end{array}

where distance function (Dist) for both positive and negative samples is defined as the sum of weighted squares:

\begin{array}{l} D i s t (b_{i}, x) = \sum_{k} D i s t_{k} & (4) \end{array}

\begin{array}{l} D i s t_{k} = s_{k} {(b_{i k} - x_{k})}^{2} & (5) \end{array}

where k is the index of the feature.

The formulas yield a function to measure an optimal point based on both positive and negative samples, and the position of the point can be optimized during a deep learning process.

In this work, based on the idea of Maron’s work, we modified the distance function to achieve our predictive goals. One of the algorithms first imputes the data with missing values by k-nearest neighbour (KNN) and then uses WDD for prediction. The other two algorithms, named MVT-WDD-DI and MVT-WDD-BF, do not need to impute data but use penalty mechanisms for missing value tolerance.

2.4 Our proposed modified WDD algorithm

As mentioned above, we modified the WDD algorithm proposed in Maron’s work to improve the algorithms’ ability to handle data containing missing values. The modifications are introduced in the following subsections:

2.4.1 WDD-KNN

The WDD-KNN algorithm uses data imputation by k-nearest neighbor (KNN) as input and then uses the modified WDD algorithm for modelling. Since all the missing values are imputed by KNN, we only modified Equations (2) and (3) by adding a hyperparameter $γ$ to make the calculation more flexible:

\begin{array}{l} \Pr (x = t | b_{i}^{+}) = \exp (- γ D i s t (b_{i}^{+}, x)) & (6) \end{array}

\begin{array}{l} \Pr (x = t | b_{i}^{-}) = 1 - \exp (- γ D i s t (b_{i}^{-}, x)) & (7) \end{array}

Using WDD-KNN, we can classify a dataset containing missing values, but it still needs a step for missing value filling. This step limits the process of prediction. When given a new sample that contains a missing value, we need a dataset for imputing the missing value more than the trained model parameters. Therefore, we developed two missing value tolerant (MVT) algorithms for our predictive goals.

2.4.2 MVT-WDD-DI

We developed MVT-WDD-DI by adding penalty term for Equations (6) and (7) using division (DI) to handle missing values and finally obtain Equations (8) and (9):

\begin{array}{l} \Pr (x = t | b_{i}^{+}) = \exp (- γ (\frac{D i s t (b_{i}^{+}, x)}{{(f - f_{m i s s})}^{δ}})) & (8) \end{array}

\begin{array}{l} \Pr (x = t | b_{i}^{-}) = 1 - \exp (- γ (\frac{D i s t (b_{i}^{-}, x)}{{(f - f_{m i s s})}^{δ}})) & (9) \end{array}

where $f$ is the number of features and $f_{m i s s}$ is the number of missing values. Since the modification will influence the calculation of Equations (6) and (7), we added a rule to Equation (5): $i f b_{i k} i s m i s s i n g, b_{i k} - x_{k} = 0$ . We also added a hyperparameter $δ$ for adjusting the value of the penalty term. The concept behind this modification is to diminish the influence of samples containing missing values. Specifically, when a sample has numerous missing values, indicating lower data quality, we increase the penalty term to reduce its distance metric value. This approach, during the optimization process, results in the target point relying less on these data segments. The penalty term can reduce the diversity density according to the number of missing features and will not influence a sample containing all the features.

2.4.3 MVT-WDD-BF

In addition to using division to penalize missing values, we also tried to design another method by ignoring the missing feature (BF) of a sample. To make the idea work, we modified Equations (2) through (5) of the original WDD algorithm. First, Equations (4) and (5) were modified to

\begin{array}{l} D i s t^{'} (b_{i}, x, γ, λ) = \sum_{k} D i s t_{k}^{'^{λ}} & (10) \end{array}

\begin{array}{l} D i s t_{k}^{'} = \exp (- γ s_{k} | b_{i k} - x_{k} |) & (11) \end{array}

In Equation (11), $i f b_{i k} i s m i s s i n g, \exp (- γ s_{k} | b_{i k} - x_{k} |) = 0$ . This modification allows the algorithm to ‘ignore’ a missing feature when calculating. However, a reduced number of features will increase the diversity density based on the analysis of monotonicity. We modified Equations (2) and (3) to solve this problem:

\begin{array}{l} \Pr (x = t | b_{i}^{+}) = 1 - \exp (- \Pr (x = t | b_{i}^{+})) & (12) \end{array}

\begin{array}{l} \Pr (x = t | b_{i}^{-}) = \exp (- \Pr (x = t | b_{i}^{-})) & (13) \end{array}

In Equations (12) and (13), we changed the monotonicity by removing the minuend Equations ‘1’ in the production term and added Euler’s number as the base for scaling an excessively large negative number to the range $(0, 1)$ after a practical test.

2.5 Model setup

Since the goal of WDD is to find a point to maximize the diversity density, we used a deep learning framework for implementation. We used the Adam optimizer to solve the optimal problem, and the loss function of the three algorithms is shown in Equation (14):

\begin{array}{l} l o s s = - l o g (\prod_{i} \Pr (x = t | b_{i}^{+}) \prod_{i} \Pr (x = t | b_{i}^{-})) & (14) \end{array}

2.6 Model inference

After training, we obtained the optimal position $x_{t a r g e t}$ from the dataset, which optimized Equation (1). Thus, all the samples could be classified by calculating the maximum diversity density from its instances:

\begin{array}{l} D D_{i} = \exp (- Dist (b_{i}, x)) & (15) \end{array}

Using Equation (15) calculated diversity density, a sample could be assigned a label by setting a threshold. In this work, we used the method of analysing the ROC curve from the training set. First, we calculated all the diversity density values of the samples in the training dataset. Then, we calculated the false positive rate (FPR) and true positive rate (TPR), also called recall, under several different cut-off points and selected the best cut-off as the threshold when TPR-FPR reached its maximum The details are given in Equations (16) - (18):

\begin{array}{l} threshold = \arg \max_{c u t o f f} (T P R - F P R) & (16) \end{array}

\begin{array}{l} T P R = \frac{T P}{(T P + F N)} & (17) \end{array}

\begin{array}{l} F P R = \frac{F P}{(T N + F P)} & (18) \end{array}

Similar to many other works, we employed the area under the receiver operating characteristic curve (AUC), accuracy (ACC), precision, recall, and F1 score as the metrics, calculated as Equations (19) - (22):

\begin{array}{l} A C C = \frac{T P + T N}{(T P + T N + F P + F N)} & (19) \end{array}

\begin{array}{l} P r e c i s i o n = \frac{T P}{(T P + F P)} & (20) \end{array}

\begin{array}{l} R e c a l l = \frac{T P}{(T P + F N)} & (21) \end{array}

\begin{array}{l} F 1 s c o r e = 2 * \frac{P r e c i s i o n * R e c a l l}{P r e c i s i o n + R e c a l l} & (22) \end{array}

where TP, TN, FP and FN represent the number of true positives, true negatives, false positives and false negatives, respectively.

Additional details about the method, such as parameter tuning and training process, are provided in Supplemental information.

2.7 Basic workflow

In this study, we optimized the WDD model using gradient descent to identify an optimal point that is close to the distribution center of data from T2DM patients and far from the distribution center of data from normal individuals (Figure 1C). The $D i s t$ function is engineered to directly reflect the T2DM risk score, enabling the model to predict an input sample as T2DM if its risk score exceeds the risk threshold which is learned by the model. Through the analysis of learnable parameters within the $D i s t$ function, we have identified key features. Utilizing this characteristic, we have been able to uncover significant features across different age and gender groups, enhancing our understanding of T2DM risk factors.

3 Result

In the results section, we first introduce the performance scores of the model, confirming the consistency of its performance by repeating the modeling process 1000 times. More importantly, our discussion centers on the model’s transparency and interpretability, aimed at extracting effective clinical information internally. This includes the identification of key diagnostic indicators and the interpretation of the associations between model parameters and prediction result. These sections together demonstrate the model’s transparency and potential clinical utility, contributing useful perspectives for T2DM prediction and diagnosis.

3.1 Performance of the prediction model

To the above four datasets, we applied three weighted diversity density (WDD)-based algorithms to construct diagnostic prediction models. WDD-KNN refers to an algorithm using k-nearest neighbor (KNN) for imputing missing values. The two MVT-WDD algorithms denote missing value tolerant (MVT) algorithms with penalty terms, where ‘DI’ and ‘BF’ represent two different methods of penalization. A total of 12 models were obtained. The models were evaluated by 10-fold cross-validation and independent test with 1000 repetitions (Figure 2, cross-validation set: independent test set = 8:2, the independent test dataset was consistent for each repetition, details in Supplementary Note 1). The results are shown in Table 2, the best AUC achieved 0.9185 ( ± 0.0035) on the whole PEI dataset, which proves the accuracy of the model.

Table 2

Table 2 Performance of algorithms on each dataset: Mean (Standard) of 1000 repetitions.

From the Table 2, we could see that the three algorithms had their own advantages on different sub-datasets. However, it was notable that the WDD-KNN algorithm had a KNN imputation step that the other 2 missing value adaptation algorithms did not have. Generally, imputation is limited by the template dataset. Large template datasets are often owned by a few large institutions and are difficult to share for reasons such as ethical review. To their advantage, the 2 missing value adaptation algorithms can skip this step when preprocessing dataset, and the built model does not need a template dataset for prediction, which will be beneficial in practical situations.

After ensuring the reliability of our modeling results through model scores, we delved deeper into the model’s internal attributes and parameters in the following sections. This deeper analysis allowed us to extract valuable information pertinent to T2DM prediction, further validating the utility and interpretability of our algorithms.

3.2 Model scores provide auxiliary information other than blood glucose

The first aspect of our model’s transparency is reflected in how the distance function illustrates the model and feature contributions to predict T2DM. In physical examinations, clinicians often use blood glucose, sometimes with urine glucose as a reference, to initially assess whether a person may have T2DM. For WDD, every sample was given a risk score (Dist for each algorithm, see Method details) by the models and classified according to a risk threshold. We compared the risk stratification through blood glucose and the risk scores (Dist) from our models (Figures 3A–C).

Figure 3

Figure 3 Model interpretability reflected by model scores. (A-C) Scatter-density heat maps of model score versus blood glucose of 3 models trained by whole PEI dataset. (D-F) Histograms of model score distribution of 3 models using whole PEI dataset. (G-I) The raincloud plots of distance scores ( $D i s t_{k}$ ) of the three models using the whole PEI dataset.

With the PEI dataset (Figures 3A–F), we saw that in the models WDD-KNN, MVT-WDD-DI, and MVT-WDD-BF gave scores of confirmed T2DM patients that clustered in the ranges of 0.5-0.75, 0.4-0.82, and 0.4-3, respectively, while the physical examination population was clustered in the score ranges of 0.3-0.53, 0.1-0.5, and 0-1, respectively. All three models performed well in distinguishing the two populations, with WDD-KNN working best. However, it was difficult to completely distinguish the two groups if they were separated only by the level of blood glucose (Figures 3A–C), and many people with T2DM still had the same blood glucose levels as normal people. Also, we analysed the risk scores on the three sub-datasets in Supplementary Note 6.

To provide more information on the importance of the EHR features to every patient, we also calculated the ${Dist}_{k}$ score (see Method details) for each feature between the patients and normal people (Figures 3G–I; Supplementary Figure S3). As we can see, the selected important features mostly had different scores by each model. For example, in the model built on the PEI dataset using the MVT-WDD-DI algorithm, the ${Dist}_{k}$ scores of selected important features (Figures 3H, 4) such as Bact, BLD, Baso, Baso-R, and MCV were much higher than those of other features. T2DM patients and normal people could be well distinguished by the ${Dist}_{k}$ score of these important features. The result indicates that our model assigns higher significance to features with more pronounced differences, identifying them as important and thus selecting them as effective indicators for T2DM screening.

Figure 4

Figure 4 Important feature weights from different algorithms and datasets. Heat map based on the normalized feature weight values. The summation of a column (an algorithm) is 1. Darker colors represent larger weight values. The top 25% features in every column are framed by black rectangle.

The results show that our model effectively differentiated T2DM patients from healthy individuals in the PEI dataset, as shown in the scatter density heat map. Visualized model scores underscored performance differences, revealing potential for early T2DM detection. Notably, normal blood glucose levels don’t rule out T2DM, highlighting our model’s diagnostic value. Additionally, by analyzing the model’s distance function, we gained a deeper understanding of the mechanism behind the model’s selection of important features, enhancing our comprehension of the model.

3.3 The important indicators for T2DM diagnosis selected from different models

After understanding the scoring mechanism of the model and the mechanism for selecting important features, in this section and Supplementary Notes 3 and 7, we identified crucial diagnostic indicators for T2DM and analyzed the significance of the selected indicators for T2DM combining clinical knowledge.

The feature’s significance is determined by its weight within the models, identifying the indicators most associated with T2DM. Important features were defined as those ranking in the top 25% by weight across the 12 models. We visually represented this distribution of relative feature weights with a histogram and the specific details of these crucial features are detailed in Figure 4. We employed the Mann-Whitney U test to evaluate the level of feature differences between the T2DM and normal groups, finding that the selected important features exhibited significant differences (P value< 0.0001) (Supplementary Table S6). In addition, we compared the important features selected by using the internal weights of WDD with least absolute shrinkage and selection operator (LASSO) regression and SHAP framework. The important features showed certain consistency (Supplementary Note 9, Supplementary Information Figures S14, S15).

When the three algorithms were applied each dataset, the selected important features intersected. For example, on the PEI dataset, the indicators judged as important features by all three models were BASO, BASO-R, and HCT, and the features given high weights by two of the three models were HDL-C, MUCUS, BACT, BLD, LYM-R, MCH, and MCV. In this dataset, the AUCs of all three models were higher than 0.88, so these indicators were selected as having great significance for T2DM prediction (Figure 4). On the BCA dataset, the important features selected by the three algorithms are the same, which further proves the potential diagnostic value of these indicators. Moreover, we analysed the same and different important features extracted by the three algorithms, details are shown in Supplementary Note 7.

We conducted a literature review to integrate our clinical expertise with published research findings and investigate the clinical correlations between these important features and T2DM. Our analysis revealed that most of these important features are shown to have direct or indirect associations with T2DM. For example, urinary tract infections are known to be correlated with diabetes (28) and some indicators associated with urinary tract infections, such as haematuria and bacteria in urine, have been selected as important biomarkers. The details are collated in the Supplementary Note 3.

3.4 Multi-model analysis reveals characteristics among different age and sex groups

Leveraging our model’s transparency and feature extraction capabilities, we conducted group modeling for populations with varying demographic characteristics to unearth the diagnostic value of indicators across different groups. To mitigate the potential model bias introduced by data imbalance, we ensured basic balance in the sample volume of each age and sex category for both T2DM patients and normal individuals, as illustrated in Supplementary Figure S5 and Supplementary Table S2.

In most cases, age and sex will be correlated with T2DM incidence, which indicates that T2DM in different age and sex groups might have different characteristics. To further explore the importance of each feature for T2DM prediction in different age and sex groups, we tried three additional ways to divide the PEI Dataset: i. by age; ii. by sex; and iii. by age and sex. The number and proportion of people in different age and sex groups are shown in Supplementary Figure S6 and Supplementary Table S2. After the division of the datasets, 26 sub-datasets (8 ages + 2 sexes + 8 ages × 2 sexes) were generated, and 78 additional models (26 datasets × 3 models) were built. The performance of each model is shown in Figures 5A, B. The WDD-KNN and MVT-WDD-DI algorithms performed well on each group of datasets, with AUC values mostly above 0.9, while the performance of MVT-WDD-BF was not as good. Therefore, in the subsequent analysis, only the weights of the first two algorithms were taken into consideration.

Figure 5

Figure 5 Model performance of 10-fold cross validation and feature importance in different age and sex groups. (A) The AUC values of the three algorithms when modeling male, female and both sexes of different ages. Error bars were generated by 10-fold cross validation (error bar represents standard deviation). (B) The AUC values of the three algorithms when modeling male and female of all ages. (C) Heat map of normalized feature weight values extracted from the model for male and female of different ages, ‘M’ represents male, ‘F’ represents female.

The results showed that the importance of the clinical indicators varied in different age and sex groups (Figure 5C; Supplementary Figure S6). We not only integrated the results of these models using the WDD-KNN and MVT-WDD-DI algorithms but also analysed the distribution of their measured values (Supplementary Figures S7-13) to explain the various importance of these indicators for T2DM diagnosis in the different groups.

The significance of glomerular filtration rate (GFR) in T2DM diagnosis diminishes with age, showing greater importance in the 5-49 age group (Figures 5C, 6B). Elevated GFR in T2DM patients aged 5-49 distinguishes them from normal groups, particularly in the 5-39 (Figure 6). This aligns with studies linking diabetes and GFR, where early diabetic kidney disease (DKD) phases show increased GFR due to various changes in ultrastructural, vascular, and tubular factors (29). As renal health declines, GFR decreases (29, 30). Our findings suggest that GFR’s diagnostic value for T2DM varies across ages, particularly useful for early screening in younger populations, extending beyond its role in DKD.

Figure 6

Figure 6 Distribution of GFR values in different groups. (A) Different age and sex groups. (B) Different age groups. (C) Different sex groups. All the GFR values were from origin EHRs.

Triglycerides (TG) showed greater significance in the 5-39 age group compared to others (Figure 5C; Supplementary Figure S6), with notable differences in TG distribution between T2DM patients and normal individuals in this age range (Supplementary Figure S7). This variation is attributed to age-related dietary and metabolic differences and a genetic link identified by Saxena, R. et al. (31) High TG levels in T2DM patients are associated with increased cardiovascular risks (32) and metabolic changes (33). Our model emphasizes TG’s importance in T2DM, especially in younger age groups, aligning with current research trends.

Haemoglobin (HGB) was more important in the 55- to 95-year-old group (Supplementary Figures S5C, S6A). In T2DM patients, as age increased, their lower HGB compared to that in normal people became more pronounced (Supplementary Figure S12). Based on our knowledge and experience, anaemia is diagnosed by HGB decline, so the association between anaemia and T2DM might be the use of metformin. There are reports supporting that long-term metformin use in T2DM patients can cause anaemia (34, 35), and our EHR included patients who used metformin since metformin has been a commonly prescribed drug for T2DM patients for decades. Similarly, in diabetic patients with chronic kidney disease (CKD), some factors cause iron-deficiency anaemia, such as low intestinal absorption and gastrointestinal bleeding (36). In addition, erythropoietin deficiency and hyporesponsiveness can lead to anaemia in diabetic patients with CKD (36–38). Nephrotic syndrome, characterized by oedema, hypoalbuminaemia, dyslipidaemia, and increased transferrin catabolism, contributes to anaemia due to iron and erythropoietin deficiency (36, 39, 40). Long-term administration of angiotensin-converting enzyme (ACE) inhibitors and angiotensin receptor antagonists in diabetic patients also leads to a reversible decrease in HGB through a direct blockade of the proerythropoietic effects of angiotensin II on red cell precursors, degradation of physiological inhibitors of haematopoiesis, and suppression of IGF-I (36, 41). Thus, based on many studies and reports, taking HGB as an important feature will be a useful indicator for older patients with longer duration of diabetes, so HGB was selected after modelling the EHR data. In other words, HGB decline might be a marker of T2DM or T2DM-correlated disease, but it might be interfered with by some confounding factors, so its use for early diagnosis might be limited. This limitation is caused by the lack of medication information in our EHR data. Despite our meticulous selection of the patient’s first record within the hospital system, we cannot guarantee that they have not undergone therapeutic interventions at other institutions. To address this shortcoming, cohort studies with long-term follow-up are needed.

The Neutrophils (NEU), neutrophil rate (NEU-R), lymphocyte rate (LYM), and lymphocyte rate (LYM-R) have also been observed to correlate with age or sex, as discussed in Supplementary Note 8.

The result of this section demonstrates that through group modeling and the model’s feature extraction capability, we identified several age and sex-related biomarkers for T2DM prediction. Integrating insights from the results, it’s evident that our model can glean valuable auxiliary diagnostic information from internal parameters like feature weights and distance functions. This effectively showcases the algorithm’s transparency throughout the modeling process, highlighting its capacity to provide interpretable insights crucial for clinical application.

4 Discussion

Here, we developed WDD framework-based models for the prediction of T2DM using physical examination features in EHRs. Using the model parameters, the importance of the features can be measured by the distance between the sample and optimal point. Based on our investigation, the top 25% of indicators were found to be directly or indirectly related to T2DM, offering potential value as diagnostic markers for T2DM.

In our analysis, a variety of white blood cells—neutrophils, basophils, eosinophils, and lymphocytes—emerged as significant features. This aligns with existing literature, which indicates that T2DM patients experiencing concurrent infections may exhibit inflammation, leading to an altered white blood cell count (42, 43). While current research posits that the count or ratio levels of these white cells alone do not suffice as diabetes risk factors, a multifaceted approach is often necessary. For instance, the neutrophil-lymphocyte ratio is recognized as an independent predictor of T2DM (44). The inclusion of these white blood cells as important indicators by our model is consistent with current research insights, further affirming the potential of combining these white blood cell levels for aiding T2DM diagnosis. Moreover, this demonstrates our model’s proficiency in capturing the complex interrelationships among indicators, highlighting its diagnostic relevance.

Indicators related to red blood cells and platelets, such as mean platelet volume, plateletcrit, hematocrit, coefficient of variation of red cell distribution width, mean corpuscular volume, and mean corpuscular haemoglobin, were also identified as significant by our model. In T2DM patients experiencing insulin resistance and metabolic syndrome, the adverse metabolic conditions—including hyperglycemia, hypertension, dyslipidemia, inflammation, and impaired fibrinolysis—elevate the risk of atherosclerosis and lead to microvascular complications like diabetic retinopathy, nephropathy, and neuropathy (45, 46). Additionally, atherosclerosis, which may result from increased platelet adhesion and hypercoagulability in T2DM patients, is a key pathological mechanism behind macrovascular complications (46). These vascular complications can cause abnormalities in red blood cells and platelets. Therefore, the aforementioned indicators are linked to the common microvascular and macrovascular complications in diabetics, suggesting their potential as diagnostic markers for T2DM.

Urinalysis-related indicators, such as haematuria, leukocytes in urine, mucinous filaments, bacteria in urine, epithelial cells in urine, urine pH, and specific gravity, have been selected as significant markers by our model. These indicators are primarily associated with conditions prevalent among individuals with diabetes, such as urinary tract infections (28), which are notably common and can lead to haematuria or abnormal quantities of cells and bacteria in the urine. Moreover, the inflammation caused by these infections may result in an abnormal number of white blood cells, further validating the model’s ability to discern potential relationships between indicators. The combination of increased net acid excretion and reduced use of ammonia buffers in individuals with diabetes leads to lower urine pH (47, 48). A lower urine pH heightens the risk of nephrolithiasis, including uric acid stones (47, 49). Diabetic nephropathy may manifest through abnormal urine specific gravity, where a lower-than-normal urinary specific gravity, along with increased polyuria, signals diabetes insipidus (50). These findings underscore the interconnectivity of urinary markers with diabetes-related infections and complications, emphasizing their potential diagnostic relevance.

While individual physical examination indicators often cannot serve as standalone diagnostic criteria for T2DM, our model successfully integrates multiple indicators to construct a diagnostic model for T2DM. Leveraging the model’s high interpretability, we can determine the importance of each indicator in the diagnosis, enhancing its capability to aid in the auxiliary diagnosis of T2DM. This approach not only harnesses the collective diagnostic potential of various indicators but also provides valuable insights into their diagnostic significance, offering a refined perspective on T2DM diagnosis.

Besides, based on our clinical knowledge, the diagnosis of T2DM often correlates with demographic factors. Therefore, we segmented the data by age and gender, utilizing the feature weights provided by our model. Through this process, combined with a literature search, we identified several biomarkers related to age or sex, such as glomerular filtration rate, triglycerides, and haemoglobin. This further validates our model’s efficacy in extracting medically valuable information and illustrates that different indicators may require attention when diagnosing diabetes in patients of varying ages and sexes. This approach not only enriches the diagnostic model with nuanced clinical insights but also underscores the importance of personalized medicine in the management and treatment of T2DM.

The results show that our algorithm boasts a high degree of internal interpretability, enabling the extraction of key indicators for T2DM diagnosis without the need for third-party tools. Furthermore, by analyzing the model’s parameters (Figure 3), we can comprehend the mechanism behind the selection of important indicators, thereby providing reliable auxiliary diagnostic information. Additionally, our model possesses a distinct advantage as mentioned in the Methods section: benefiting from the transparency of WDD, its internal distance function is easily modifiable. Instead of imputing missing values, our approach involves ‘tolerating’ them by incorporating penalty terms into two of the algorithms. This strategy diminishes the necessity for exhaustive searches for high-quality template data during the imputation process, proving WDD to be a highly transparent and interpretable auxiliary diagnostic algorithm.

This work suggests that machine learning could extend beyond predictive accuracy to include interpretative insights, which might be useful in clinical settings. Such insights have the potential to aid clinicians in understanding the basis of diagnostic suggestions given by the model. This could lead to a more cooperative relationship between machine learning and healthcare professionals. However, the integration of these technologies in clinical practice requires careful consideration and ongoing evaluation.

5 Conclusion

Overall, we developed three WDD-based interpretability algorithms and built T2DM diagnostic models, identifying several relevant diagnostic indicators with potential utility in assisting T2DM diagnosis. However, it is crucial to acknowledge that the mechanisms of interaction among these indicators, as well as their causal connections with T2DM, cannot be directly deduced from the current model information. In our future work, leveraging the transparency of WDD, we plan to incorporate knowledge of causal probabilities to enhance our model further, uncover the complex relationships between indicators and T2DM.

Data availability statement

The code used for modelling in this work is available at: https://github.com/Lvxiang713/WDD_T2DMprediction. Rearchers should contact the corresponding authors for approval to obtain and use the source data. The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving humans were approved by This study protocol was approved by the ethics committee of the Affiliated Hospital of Southwest Medical University, China (KY2022266) and the Chinese Clinical Trial Registry (ChiCTR2200064435). The studies were conducted in accordance with the local legislation and institutional requirements. Written informed consent for participation was not required from the participants or the participants' legal guardians/next of kin in accordance with the national legislation and institutional requirements.

Author contributions

XL: Writing – review & editing, Writing – original draft, Visualization, Methodology, Formal analysis. JL: Writing – original draft, Investigation, Funding acquisition, Formal analysis, Data curation. HW: Writing – review & editing, Investigation, Data curation. HG: Writing – review & editing, Visualization, Formal analysis. XB: Writing – original draft, Investigation, Formal analysis, Data curation. PY: Writing – original draft, Data curation. ZJ: Writing – review & editing, Data curation. YZ: Writing – original draft, Investigation, Formal analysis. RJ: Writing – review & editing, Writing – original draft, Validation, Methodology, Funding acquisition, Data curation, Conceptualization. QC: Writing – original draft, Supervision, Funding acquisition, Data curation, Conceptualization. ML: Writing – review & editing, Supervision, Funding acquisition, Conceptualization.

Funding

The author(s) declare that financial support was received for the research, authorship, and/or publication of this article. This work has been supported by the National Natural Science Foundation of China (No. 22203057, No. 82200904 and No. 22173065), Science & Technology Department of Sichuan province (No.24NSFSC1621, No. 2022YF0617, No.2022YFS0612 and No.2022YFS0617-B2), Joint project of Luzhou Municipal People’s Government and Southwest Medical University (No. 2020LZXNYDJ30 and No. 2020LZXNYDJ39).

Acknowledgments

The numerical calculations in this paper have been done on Hefei advanced computing center.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fendo.2024.1376220/full#supplementary-material

References

1. Donath MY, Shoelson SE. Type 2 diabetes as an inflammatory disease. Nat Rev Immunol. (2011) 11:98–107. doi: 10.1038/nri2925

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Wong TY, Cheung CMG, Larsen M, Sharma DO, Simó R. Diabetic retinopathy. Nat Rev Dis Primers. (2016) 2:1–17. doi: 10.1038/nrdp.2016.12

CrossRef Full Text | Google Scholar

3. Matheus AS de M, Tannus LRM, Cobas RA, Palma CCS, Negrato CA, Gomes M de B. Impact of diabetes on cardiovascular disease: an update. Int J hypertension. (2013) 2013:653789. doi: 10.1155/2013/653789

CrossRef Full Text | Google Scholar

4. Carson AP, Muntner P, Kissela BM, Kleindorfer DO, Howard VJ, Meschia JF, et al. Association of prediabetes and diabetes with stroke symptoms: the REasons for Geographic and Racial Differences in Stroke (REGARDS) study. Diabetes Care. (2012) 35:1845–52. doi: 10.2337/dc11-2140

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Rathur HM, Boulton AJ. The neuropathic diabetic foot. Nat Rev Endocrinol. (2007) 3:14–25. doi: 10.1038/ncpendmet0347

CrossRef Full Text | Google Scholar

6. IDF Diabetes Atlas. Available online at: https://diabetesatlas.org/ (Accessed September 22, 2022).

Google Scholar

7. Cao X, Yang M, Huang XB, Tan XL, Liu Y, Huo N, et al. Prevalence and rates of new diagnosis and missed diagnosis of diabetes mellitus among 35–74-year-old residents in urban communities in Southwest China. Biomed Environ Sci. (2019) 32:704–9. doi: 10.3967/bes2019.089

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Tasin I, Nabil TU, Islam S, Khan R. Diabetes prediction using machine learning and explainable AI techniques. Healthcare Tech Lett. (2023) 10:1–10. doi: 10.1049/htl2.12039

CrossRef Full Text | Google Scholar

9. Ejiyi CJ, Qin Z, Amos J, Ejiyi MB, Nnani A, Ejiyi TU, et al. A robust predictive diagnosis model for diabetes mellitus using Shapley-incorporated machine learning algorithms. Healthcare Analytics. (2023) 3:100166. doi: 10.1016/j.health.2023.100166

CrossRef Full Text | Google Scholar

10. Patel R, Sivaiah B, Patel P, Sahoo B. Supervised Learning Approaches on the Prediction of Diabetic Disease in Healthcare. In: Udgata SK, Sethi S, Gao X-Z, editors. Intelligent Systems. Springer Nature, Singapore (2024). p. 157–68. doi: 10.1007/978-981-99-3932-9_15

CrossRef Full Text | Google Scholar

11. Zhao Y, Chaw JK, Ang MC, Daud MM, Liu L. A Diabetes Prediction Model with Visualized Explainable Artificial Intelligence (XAI) Technology. In: Badioze Zaman H, Robinson P, Smeaton AF, De Oliveira RL, Jørgensen BN, K. Shih T, Abdul Kadir R, Mohamad UH, Ahmad MN, editors. Advances in Visual Informatics. Springer Nature, Singapore (2024). p. 648–61. doi: 10.1007/978-981-99-7339-2_52

CrossRef Full Text | Google Scholar

12. Bhat SS, Selvam V, Ansari GA, Ansari MD, Rahman MH. Prevalence and early prediction of diabetes using machine learning in North Kashmir: A case study of district bandipora. Comput Intell Neurosci. (2022) 2022:e2789760. doi: 10.1155/2022/2789760

CrossRef Full Text | Google Scholar

13. Mahesh TR, Kumar D, Vinoth Kumar V, Asghar J, Mekcha Bazezew B, Natarajan R, et al. Blended ensemble learning prediction model for strengthening diagnosis and treatment of chronic diabetes disease. Comput Intell Neurosci. (2022) 2022:e4451792. doi: 10.1155/2022/4451792

CrossRef Full Text | Google Scholar

14. Ravaut M, Harish V, Sadeghi H, Leung KK, Volkovs M, Kornas K, et al. Development and validation of a machine learning model using administrative health data to predict onset of type 2 diabetes. JAMA Network Open. (2021) 4:e2111315. doi: 10.1001/jamanetworkopen.2021.11315

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Mansoori A, Sahranavard T, Hosseini ZS, Soflaei SS, Emrani N, Nazar E, et al. Prediction of type 2 diabetes mellitus using hematological factors based on machine learning approaches: a cohort study analysis. Sci Rep. (2023) 13:663. doi: 10.1038/s41598-022-27340-2

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Zhao M, Wan J, Qin W, Huang X, Chen G, Zhao X. A machine learning-based diagnosis modelling of type 2 diabetes mellitus with environmental metal exposure. Comput Methods Programs Biomedicine. (2023) 235:107537. doi: 10.1016/j.cmpb.2023.107537

CrossRef Full Text | Google Scholar

17. De Silva K, Jönsson D, Demmer RT. A combined strategy of feature selection and machine learning to identify predictors of prediabetes. J Am Med Inf Assoc. (2020) 27:396–406. doi: 10.1093/jamia/ocz204

CrossRef Full Text | Google Scholar

18. Zhou H, Xin Y, Li S. A diabetes prediction model based on Boruta feature selection and ensemble learning. BMC Bioinf. (2023) 24:224. doi: 10.1186/s12859-023-05300-5

CrossRef Full Text | Google Scholar

19. Patro KK, Allam JP, Sanapala U, Marpu CK, Samee NA, Alabdulhafith M, et al. An effective correlation-based data modeling framework for automatic diabetes prediction using machine and deep learning techniques. BMC Bioinf. (2023) 24:372. doi: 10.1186/s12859-023-05488-6

CrossRef Full Text | Google Scholar

20. Hassan M, Mollick S, Yasmin F. An unsupervised cluster-based feature grouping model for early diabetes detection. Healthcare Analytics. (2022) 2:100112. doi: 10.1016/j.health.2022.100112

CrossRef Full Text | Google Scholar

21. Saxena R, Sharma SK, Gupta M, Sampada GC. A novel approach for feature selection and classification of diabetes mellitus: machine learning methods. Comput Intell Neurosci. (2022) 2022:e3820360. doi: 10.1155/2022/3820360

CrossRef Full Text | Google Scholar

22. Mushtaq Z, Ramzan MF, Ali S, Baseer S, Samad A, Husnain M. Voting classification-based diabetes mellitus prediction using hypertuned machine-learning techniques. Mobile Inf Syst. (2022) 2022:e6521532. doi: 10.1155/2022/6521532

CrossRef Full Text | Google Scholar

23. Zhang M, Wang J, Wang W, Yang G, Peng J. Predicting cell-type specific disease genes of diabetes with the biological network. Comput Biol Med. (2024) 169:107849. doi: 10.1016/j.compbiomed.2023.107849

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Rudin C. Stop explaining black box machine learning models for high stakes decisions and use interpretable models instead. Nat Mach Intell. (2019) 1:206–15. doi: 10.1038/s42256-019-0048-x

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Smith JW, Everhart JE, Dickson WC, Knowler WC, Johannes RS. Using the ADAP learning algorithm to forecast the onset of diabetes mellitus. Proc Annu Symp Comput Appl Med Care. (1988), 261–5.

Google Scholar

26. Arbet J, Brokamp C, Meinzen-Derr J, Trinkley KE, Spratt HM. Lessons and tips for designing a machine learning study using EHR data. J Clin Trans Sci. (2021) 5:e21. doi: 10.1017/cts.2020.513

CrossRef Full Text | Google Scholar

27. Maron O, Lozano-Pérez T. A framework for multiple-instance learning. In: Advances in neural information processing systems, vol. 10. (1997) 570–576.

Google Scholar

28. Muller LMAJ, Gorter KJ, Hak E, Goudzwaard WL, Schellevis FG, Hoepelman AIM, et al. Increased risk of common infections in patients with type 1 and type 2 diabetes mellitus. Clin Infect Dis. (2005) 41:281–8. doi: 10.1086/431587

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Tonneijck L, Muskiet MH, Smits MM, Van Bommel EJ, Heerspink HJ, Van Raalte DH, et al. Glomerular hyperfiltration in diabetes: mechanisms, clinical significance, and treatment. J Am Soc Nephrol. (2017) 28:1023–39. doi: 10.1681/ASN.2016060666

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Moriya T, Tsuchiya A, Okizaki S, Hayashi A, Tanaka K, Shichiri M. Glomerular hyperfiltration and increased glomerular filtration surface are associated with renal function decline in normo- and microalbuminuric type 2 diabetes. Kidney Int. (2012) 81:486–93. doi: 10.1038/ki.2011.404

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Saxena R, Voight BF, Lyssenko V, Burtt NP, de Bakker PIW, Chen H, et al. Genome-wide association analysis identifies loci for type 2 diabetes and triglyceride levels. Science. (2007) 316:1331–6. doi: 10.1126/science.1142358

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Ye X, Kong W, Zafar MI, Chen L-L. Serum triglycerides as a risk factor for cardiovascular diseases in type 2 diabetes mellitus: a systematic review and meta-analysis of prospective studies. Cardiovasc Diabetol. (2019) 18:48. doi: 10.1186/s12933-019-0851-z

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Rijzewijk LJ, Jonker JT, van der MRW, Lubberink M, de JHW, Romijn JA, et al. Effects of hepatic triglyceride content on myocardial metabolism in type 2 diabetes. J Am Coll Cardiol. (2010) 56:225–33. doi: 10.1016/j.jacc.2010.02.049

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Donnelly LA, Dennis JM, Coleman RL, Sattar N, Hattersley AT, Holman RR, et al. Risk of anemia with metformin use in type 2 diabetes: A MASTERMIND study. Diabetes Care. (2020) 43:2493–9. doi: 10.2337/dc20-1104

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Aroda VR, Edelstein SL, Goldberg RB, Knowler WC, Marcovina SM, Orchard TJ, et al. Long-term metformin use and vitamin B12 deficiency in the diabetes prevention program outcomes study. J Clin Endocrinol Metab. (2016) 101:1754–61. doi: 10.1210/jc.2015-3754

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Mehdi U, Toto RD. Anemia, diabetes, and chronic kidney disease. Diabetes Care. (2009) 32:1320–6. doi: 10.2337/dc08-0779

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Thomas MC. Anemia in diabetes: marker or mediator of microvascular disease? Nat Rev Nephrol. (2007) 3:20–30. doi: 10.1038/ncpneph0378

CrossRef Full Text | Google Scholar

38. Erslev AJ, Besarab A. Erythropoietin in the pathogenesis and treatment of the anemia of chronic renal failure. Kidney Int. (1997) 51:622–30. doi: 10.1038/ki.1997.91

PubMed Abstract | CrossRef Full Text | Google Scholar

39. Vaziri ND. Erythropoietin and transferrin metabolism in nephrotic syndrome. Am J Kidney Dis. (2001) 38:1–8. doi: 10.1053/ajkd.2001.25174

PubMed Abstract | CrossRef Full Text | Google Scholar

40. Howard RL, Buddington B, Alfrey AC. Urinary albumin, transferrin and iron excretion in diabetic patients. Kidney Int. (1991) 40:923–6. doi: 10.1038/ki.1991.295

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Marathias KP, Agroyannis B, Mavromoustakos T, Matsoukas J, Vlahakos DV. Hematocrit-lowering effect following inactivation of renin-angiotensin system with angiotensin converting enzyme inhibitors and angiotensin receptor blockers. Curr Top Med Chem. (2004) 4:483–6. doi: 10.2174/1568026043451311

PubMed Abstract | CrossRef Full Text | Google Scholar

42. Vozarova B, Weyer C, Lindsay RS, Pratley RE, Bogardus C, Tataranni PA. High white blood cell count is associated with a worsening of insulin sensitivity and predicts the development of type 2 diabetes. Diabetes. (2002) 51:455–61. doi: 10.2337/diabetes.51.2.455

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Twig G, Afek A, Shamiss A, Derazne E, Tzur D, Gordon B, et al. White blood cells count and incidence of type 2 diabetes in young men. Diabetes Care. (2013) 36:276–82. doi: 10.2337/dc11-2298

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Mangalesh S, Dudani S, Yadav P, Podury S. Evaluation of neutrophil-lymphocyte ratio in diabetes and coronary artery disease: a case control study from India. Am Heart J. (2021) 242:156–7. doi: 10.1016/j.ahj.2021.10.030

CrossRef Full Text | Google Scholar

45. Reusch JEB. Diabetes, microvascular complications, and cardiovascular complications: what is it about glucose? J Clin Invest. (2003) 112:986–8. doi: 10.1172/JCI19902

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Fowler MJ. Microvascular and macrovascular complications of diabetes. Clin Diabetes. (2011) 29:116–22. doi: 10.2337/diaclin.29.3.116

CrossRef Full Text | Google Scholar

47. Maalouf NM, Cameron MA, Moe OW, Sakhaee K. Metabolic basis for low urine pH in type 2 diabetes. Clin J Am Soc Nephrol. (2010) 5:1277–81. doi: 10.2215/CJN.08331109

PubMed Abstract | CrossRef Full Text | Google Scholar

48. Eisner BH, Porten SP, Bechis SK, Stoller ML. Diabetic kidney stone formers excrete more oxalate and have lower urine pH than nondiabetic stone formers. J Urol. (2010) 183:2244–8. doi: 10.1016/j.juro.2010.02.007

PubMed Abstract | CrossRef Full Text | Google Scholar

49. Bell DSH. Beware the low urine pH—the major cause of the increased prevalence of nephrolithiasis in the patient with type 2 diabetes. Diabetes Obes Metab. (2012) 14:299–303. doi: 10.1111/j.1463-1326.2011.01519.x

PubMed Abstract | CrossRef Full Text | Google Scholar

50. Akarsu E, Buyukhatipoglu H, Aktaran S, Geyik R. The value of urine specific gravity in detecting diabetes insipidus in a patient with uncontrolled diabetes mellitus: urine specific gravity in differential diagnosis. J Gen Internal Med. (2006) 21:C1–2. doi: 10.1111/j.1525-1497.2006.00454.x

CrossRef Full Text | Google Scholar

Keywords: diabetes, diabetes diagnosis, diabetic prediction, diagnostic indicator, health informatics, interpretable machine learning

Citation: Lv X, Luo J, Huang W, Guo H, Bai X, Yan P, Jiang Z, Zhang Y, Jing R, Chen Q and Li M (2024) Identifying diagnostic indicators for type 2 diabetes mellitus from physical examination using interpretable machine learning approach. Front. Endocrinol. 15:1376220. doi: 10.3389/fendo.2024.1376220

Received: 25 January 2024; Accepted: 29 February 2024;
Published: 18 March 2024.

Edited by:

Yun Shen, Pennington Biomedical Research Center, United States

Reviewed by:

Md. Mehedi Hassan, Khulna University, Bangladesh
Lixia Zhang, Pennington Biomedical Research Center, United States
Laboni Akter, Khulna University of Engineering & Technology, Bangladesh

Copyright © 2024 Lv, Luo, Huang, Guo, Bai, Yan, Jiang, Zhang, Jing, Chen and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Runyu Jing, amluZ3J5QHNjdS5lZHUuY24=; Qi Chen, cWljaGVuQHN3bXUuZWR1LmNu; Menglong Li, bGltbEBzY3UuZWR1LmNu

^†These authors have contributed equally to this work

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.