Electronic Medical Records as Input to Predict Postoperative Immediate Remission of Cushing’s Disease: Application of Word Embedding

Zhang, Wentai; Li, Dongfang; Feng, Ming; Hu, Baotian; Fan, Yanghua; Chen, Qingcai; Wang, Renzhi

doi:10.3389/fonc.2021.754882

ORIGINAL RESEARCH article

Front. Oncol., 13 October 2021

Sec. Neuro-Oncology and Neurosurgical Oncology

Volume 11 - 2021 | https://doi.org/10.3389/fonc.2021.754882

This article is part of the Research Topic Recent Advances in the Tumorigenic Mechanism and Clinical Management of Pituitary Tumors

Wentai Zhang¹†

Dongfang Li²†

Ming Feng¹

Baotian Hu²

Yanghua Fan³

Qingcai Chen²,⁴*

Renzhi Wang¹*

¹Department of Neurosurgery, Chinese Academy of Medical Sciences and Peking Union Medical College, Peking Union Medical College Hospital, Beijing, China
²School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, China
³Department of Neurosurgery, Beijing Tiantan Hospital, Beijing Neurosurgical Institute, Capital Medical University, Beijing, China
⁴Peng Cheng Laboratory, Shenzhen, China

Background: No existing machine learning (ML)-based models use free text from electronic medical records (EMR) as input to predict immediate remission (IR) of Cushing’s disease (CD) after transsphenoidal surgery.

Purpose: The aim of the present study is to develop an ML-based model that uses EMR that include both structured features and free text as input to preoperatively predict IR after transsphenoidal surgery.

Methods: A total of 419 patients with CD from Peking Union Medical College Hospital were enrolled between January 2014 and August 2020. The EMR of the patients were embedded and transformed into low-dimensional dense vectors that can be included in four ML-based models together with structured features. The area under the curve (AUC) of receiver operating characteristic curves was used to evaluate the performance of the models.

Results: The overall remission rate of the 419 patients was 75.7%. From the results of logistic multivariate analysis, operation (p < 0.001), invasion of cavernous sinus from MRI (p = 0.046), and ACTH (p = 0.024) were strongly correlated with IR. The AUC values for the four ML-based models ranged from 0.686 to 0.793. The highest AUC value (0.793) was for logistic regression when 11 structured features and “individual conclusions of the case by doctor” were included.

Conclusion: An ML-based model was developed using both structured and unstructured features (after being processed using a word embedding method) as input to preoperatively predict postoperative IR.

Introduction

Pituitary corticotroph adenoma is also called Cushing’s disease (CD). It accounts for the majority of Cushing’s syndrome cases (1, 2). Cushing’s syndrome causes various types of symptoms and signs, such as central obesity, supraclavicular fat accumulation, thinned skin, purple striae, proximal muscle weakness, fatigue, high blood pressure, glucose intolerance, acne, hirsutism, and neurological deficits (3). The first-line treatment method is transsphenoidal surgery (TSS) according to a consensus statement (4). Thus, immediate remission (IR) is important for both patients and surgeons. A previous systematic review showed that the overall IR rate was 77% (52.1%–96.6%) (5).

Several studies have been conducted to investigate perioperative risk factors for the prediction of postoperative prognosis using both traditional biostatistical and machine learning (ML) methods (6–9). ML is a computer-based method for data analysis based on the theory that there are patterns hidden in data, and it helps to predict the prognosis of diseases (10). ML enables a computer to construct models by iteratively learning from the patterns in the dataset. Therefore, an ML-based model is formed based on learning from real-world data rather than learning from doctors’ experience, which may be limited (11). In recent years, there have been an increasing number of ML-related studies on pituitary adenoma. For example, Liu et al. used seven ML-based models that incorporated 17 clinical variables to preoperatively predict the recurrence of CD. The model that performed the best was random forest (RF) with an AUC value of 0.781 (8). Fan et al. used six ML-based models that incorporated 12 clinical variables to predict the TSS response. The model with the highest AUC value (0.8555) was the gradient boosting decision tree (GBDT) model.

Features including the preoperative and postoperative serum adrenocorticotropic hormone (ACTH) level, postoperative serum cortisol level, age, and preoperative cavernous sinus invasion on MRI (IOMRI) have been shown to be related to postoperative prognosis (8). All risk factors initially considered in previous studies were selected by clinicians according to their clinical experience and related literature. No existing models use electronic medical records (EMR) as input to predict postoperative IR of patients with CD, even though they may contain a great deal of information that is useful for the prediction of IR. In the present study, EMR is included in the model for the preoperative prediction of postoperative IR of CD.

In recent years, EMR has facilitated data accessibility. There are different types of manifestations in patients with CD because of hypercortisolism that may contain information related to the severity of CD. However, the analysis of diverse and massive EMR data remains challenging because of the complex nature of clinical language and the interpretation process. To address these challenges, this study uses natural language processing techniques, specifically contextualized word embeddings, to make the information in free text accessible and thereby improve predictions. Word embedding is a typical type of natural language processing technique, and it is a suitable method for vectorizing free text so that it can be processed by downstream learning models. Although there has been exponential growth in the number of studies involving radiomics methods, the application of word embedding techniques is still limited (12, 13). In those studies, the text part of EMR was incorporated into the ML model as input, which added a data modality and brought the input closer to real-world data (14, 15).

Postoperative IR is important for clinician–patient communication, and it may influence the treatment strategy. The objective of the present study is to develop an ML model to preoperatively predict postoperative IR using both free text from EMR (after being processed by a word embedding technique) and structured features as input.

Materials and Methods

Study Population

The present study was approved by the ethical review committee of Peking Union Medical College Hospital (PUMCH). A total of 419 patients with CD were enrolled between January 2014 and August 2020. All surgery was performed by MF.

The inclusion criteria were as follows: (1) manifestations of Cushing’s syndrome; (2) positive result on MRI or negative result on MRI, but CD was strongly suspected according to manifestations; (3) ruling out the possibility of ectopic ACTH syndrome; and (4) plasma cortisol level (8:00 a.m.) > 22.3 μg/dl or 24-h UFC level > 103.5 μg.

Diagnosis of Cushing’s Disease

All patients had T1-weighted, T2-weighted, and T1-weighted gadolinium-enhanced MRI. Patients whose T1-weighted gadolinium-enhanced MRI was negative for a pituitary tumor underwent T1-weighted dynamic gadolinium-enhanced MRI. A hypointense region within the pituitary gland on T1-weighted MRI was considered positive for a pituitary adenoma. In cases in which the profiles of the potential tumors were inconspicuous, T1-weighted gadolinium-enhanced MRI was required to outline the tumor. A microadenoma was defined as a tumor whose largest diameter was less than 10 mm and a macroadenoma was defined as a tumor whose largest diameter was ≥10 mm. A total of 392/419 participants had histological confirmation of CD, and the diagnosis of CD was based on synthesized evidence that included MRI results, clinical manifestations, results of the low-dose dexamethasone suppression test (LDDST) and high-dose dexamethasone suppression test (HDDST), and pathological results.

All patients underwent a routine combined LDDST and HDDST to verify hypercortisolism and the location of the tumor. In the LDDST, 0.5 mg of dexamethasone was given to the patient every 6 h for 2 days. The LDDST was considered to be suppressed if 24-h UFC was lower than 12.3 μg/24 h on the second day or plasma cortisol was lower than 1.8 μg/dl in the morning of the third day. In the HDDST, 2 mg of dexamethasone was given to the patient every 6 h for 2 days. HDDST was considered to be suppressed if 24-h UFC on the second day or plasma cortisol in the morning of the third day was >50% lower than the original level. The failure of suppression of the LDDST together with successful suppression of the HDDST indicated CD.
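The suppression rules above can be written as a few small predicates. This is only a minimal sketch of the stated cutoffs, not clinical software; the function and parameter names are hypothetical and do not come from the study.

```python
def lddst_suppressed(ufc_day2_ug_per_24h=None, cortisol_day3_ug_per_dl=None):
    """LDDST counts as suppressed if either available value is below its cutoff."""
    ufc_low = ufc_day2_ug_per_24h is not None and ufc_day2_ug_per_24h < 12.3
    cortisol_low = cortisol_day3_ug_per_dl is not None and cortisol_day3_ug_per_dl < 1.8
    return ufc_low or cortisol_low


def hddst_suppressed(baseline_value, value_after_test):
    """HDDST counts as suppressed if the value fell by more than 50% of baseline."""
    return value_after_test < 0.5 * baseline_value


def pattern_suggests_cd(lddst_is_suppressed, hddst_is_suppressed):
    """Failed LDDST suppression together with successful HDDST suppression."""
    return (not lddst_is_suppressed) and hddst_is_suppressed
```

For example, a patient whose 24-h UFC stays at 50 μg/24 h after the LDDST but falls from 400 to 150 after the HDDST would match the pattern described in the text.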

In cases in which there was no evidence of a tumor in preoperative MRI, bilateral inferior petrosal sinus sampling with a desmopressin stimulation test was implemented to confirm the location of the tumor. During the desmopressin test process, 10 mg of desmopressin was given to the patient to stimulate the secretion of ACTH. A ratio of ACTH concentration in the inferior petrosal sinus to peripheral concentration that was larger than 2 in the basal state or larger than 3 after desmopressin stimulation indicated a diagnosis of CD.

The diagnosis of CD was based on composite evidence, including MRI results, clinical manifestations, results of LDDST and HDDST, and pathological results.

All TSS was performed by one experienced surgeon (MF). The details of the TSS were discussed previously (16). No medical therapy was administered to patients because of a lack of medicine in China.

The resected tissues were examined for pathology and immunohistochemical analysis for ACTH, growth hormone, thyroid-stimulating hormone, luteinizing hormone, follicle-stimulating hormone, prolactin, Ki-67, and P-53.

Postoperative Management and Immediate Remission

In the first 3 days after TSS (7 days if IR was not achieved), the plasma cortisol level was tested each day. If the cortisol level was lower than 5 μg/dl, glucocorticoid replacement therapy was started. Glucocorticoid replacement therapy started with 100 mg of hydrocortisone twice a day for 3 days, followed by 30 mg of hydrocortisone orally once a day. After being discharged from hospital, patients decreased the dose by 2.5 mg per week until it reached 2.5–5 mg per day. The cessation of the drug was decided by clinicians according to the evaluation of the pituitary function.

IR was defined as a plasma cortisol level (8:00 a.m.) lower than 5 μg/dl or 24-h UFC lower than 20 μg/24 h within 7 days after surgery (17).
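As an illustration, the IR definition can be turned into a label function for the prediction target. This is a sketch under the assumption that each patient has a list of measurements taken within the 7-day window; the function and argument names are hypothetical.

```python
def immediate_remission(cortisol_8am_ug_per_dl, ufc_ug_per_24h):
    """Return True if any measurement within 7 days after surgery meets the IR cutoffs.

    Both arguments are lists of values from the 7-day postoperative window;
    either list may be empty if that test was not performed.
    """
    cortisol_met = any(value < 5.0 for value in cortisol_8am_ug_per_dl)
    ufc_met = any(value < 20.0 for value in ufc_ug_per_24h)
    return cortisol_met or ufc_met
```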

Study Design

The data included 11 structured clinical features and 10 unstructured features. Missing values were replaced by average values. The structured data included gender, age, first operation or not, largest tumor diameter, invasion of cavernous sinus on MRI (IOMRI), sellar floor changes (SFC), disease duration, BMI, 24-h UFC, plasma cortisol (8:00 a.m.), and plasma ACTH (8:00 a.m.). The unstructured data included the chief complaint, history of present illness (HPI), past medical history, record of first ward round by superior surgeon, cautions, transferred-out record (from the endocrinology department), transferred-in record (to the neurosurgery department), characteristics of the case, discussions about cases, and individual conclusions of the case by doctor. The 10 unstructured features were routine features of EMR in PUMCH. “Characteristics of the case” were the records of the unique characteristics of an individual patient. “Discussions about cases” were the meeting summaries about all patients’ conditions by all surgeons in the neurosurgery department of PUMCH. “Individual conclusions of the case by doctor” were the records of the summary of patients’ characteristics provided by MF. “Cautions” were the main points that needed to be noticed about treating patients provided by MF. Transferred-out records were the main points that needed to be noticed about treating patients and basic conditions of the patient provided by the endocrinologist. Transferred-in records were the main points that needed to be noticed about treating patients and basic conditions of the patient provided by the neurosurgeon. The unstructured EMR features were vectorized using a word embedding method and could then be analyzed in a similar manner to structured features.
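The following sketch illustrates the two preprocessing steps described above: mean imputation of missing structured values, and turning a tokenized free-text field into a dense vector that is concatenated with the structured features. The text does not state how token vectors were pooled, so mean pooling is an assumption here, and the embedding table in the usage note is a toy stand-in for a trained embedding model.

```python
def impute_mean(column):
    """Replace missing values (None) in one feature column with the column mean."""
    observed = [value for value in column if value is not None]
    mean = sum(observed) / len(observed)
    return [mean if value is None else value for value in column]


def embed_text(tokens, embedding_table, dim):
    """Mean-pool the vectors of known tokens into one dense vector (assumed pooling)."""
    vectors = [embedding_table[token] for token in tokens if token in embedding_table]
    if not vectors:
        return [0.0] * dim  # no known token: fall back to a zero vector
    return [sum(vector[i] for vector in vectors) / len(vectors) for i in range(dim)]


def build_input(structured_row, text_vector):
    """Concatenate structured features and the text vector into one model input."""
    return list(structured_row) + list(text_vector)
```

With a toy table such as `{"a": [1.0, 0.0], "b": [0.0, 1.0]}`, the call `embed_text(["a", "b"], table, 2)` yields `[0.5, 0.5]`, which `build_input` appends to the patient's structured row.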

The F-test was used to rank the structured data. The 11 structured features were sequentially included into each model, and each model output an AUC value for each number of features. The min–max normalization method was used on the data. The highest AUC values of the four algorithms were used as their baseline values. Ten features of the unstructured data were introduced into each model individually, and the importance of each unstructured feature was ranked according to the change of AUC.
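A minimal sketch of the ranking and scaling steps follows. The text names only "the F-test", so the one-way ANOVA F statistic computed between the remission and non-remission groups is an assumption, and all function names are hypothetical. The sketch also assumes that both outcome classes are present.

```python
def min_max(column):
    """Scale one feature column to the range [0, 1]."""
    low, high = min(column), max(column)
    if high == low:
        return [0.0 for _ in column]  # constant column carries no information
    return [(value - low) / (high - low) for value in column]


def f_statistic(values, labels):
    """One-way ANOVA F statistic of one feature across the outcome groups."""
    groups = {}
    for value, label in zip(values, labels):
        groups.setdefault(label, []).append(value)
    n, k = len(values), len(groups)
    grand_mean = sum(values) / n
    means = {label: sum(g) / len(g) for label, g in groups.items()}
    ss_between = sum(len(g) * (means[label] - grand_mean) ** 2
                     for label, g in groups.items())
    ss_within = sum((value - means[label]) ** 2
                    for label, g in groups.items() for value in g)
    if ss_within == 0:
        return float("inf")
    return (ss_between / (k - 1)) / (ss_within / (n - k))


def rank_features(columns, labels):
    """Return feature names ordered from highest to lowest F statistic."""
    scores = {name: f_statistic(column, labels) for name, column in columns.items()}
    return sorted(scores, key=scores.get, reverse=True)
```

Features would then be added to a model one at a time in the returned order, recording the AUC after each addition.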

ML Algorithms

Four ML algorithms were applied: support vector machine (SVM), logistic regression (LR), RF, and multilayer perceptron (MLP). In each ML algorithm, structured data were sequentially introduced into the algorithm according to their rank in the training dataset. Then, in the test dataset, the same process was conducted. In both the training and test datasets, 10-fold cross-validation was performed. Then, a grid search was used to select the best hyperparameters, as discussed elsewhere (7).
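The cross-validation and grid-search steps can be sketched without committing to a particular library. The fold construction below is unshuffled and unstratified for brevity, and `cv_score` is a hypothetical caller-supplied function that trains a model with the given hyperparameters and returns its mean cross-validated AUC; neither detail is specified in the text.

```python
import itertools


def k_fold_indices(n_samples, k=10):
    """Yield (train_indices, validation_indices); fold sizes differ by at most one."""
    indices = list(range(n_samples))
    base, extra = divmod(n_samples, k)
    start = 0
    for fold in range(k):
        size = base + (1 if fold < extra else 0)
        validation = indices[start:start + size]
        train = indices[:start] + indices[start + size:]
        yield train, validation
        start += size


def grid_search(param_grid, cv_score):
    """Try every hyperparameter combination and keep the best-scoring one."""
    names = sorted(param_grid)
    best_params, best_score = None, float("-inf")
    for combo in itertools.product(*(param_grid[name] for name in names)):
        params = dict(zip(names, combo))
        score = cv_score(params)
        if score > best_score:
            best_params, best_score = params, score
    return best_params, best_score
```

In practice, the records would usually be shuffled (and often stratified by outcome) before the folds are formed, so that each fold has a similar remission rate.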

Statistical Analysis

Statistical analysis was performed using RStudio software (1.2.5042), IBM SPSS Statistics 23 (IBM Corporation), and Python. The Shapiro–Wilk test was used to evaluate the normality of continuous variables. Normally distributed variables were displayed as mean ± standard deviation. Non-normally distributed variables were displayed as the interquartile range. The Wilcoxon test was used to compare non-normally distributed continuous variables in the training dataset and test dataset. Categorical variables were analyzed using a chi-squared test or Fisher’s exact test.

Occlusion Tests

“Occlusion tests” were performed to determine the contributions that the symptomatic entities made to the ML-based models. In the “occlusion tests”, CMeKG (http://cmekg.pcl.ac.cn/) was used to select and delete the symptomatic entities to build a new HPI without symptomatic descriptions of CD. Then, the two HPIs were vectorized and merged into LR together with the structured features. The result demonstrated that the model with the input of the original HPI was conspicuously superior to that with the input of the newly built HPI.
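The occlusion procedure can be sketched as follows. The entity list would come from a knowledge graph such as CMeKG; here it is simply passed in, and `evaluate_auc` is a hypothetical caller-supplied function that vectorizes the given HPI texts, fits the model together with the structured features, and returns an AUC.

```python
def occlude_entities(text, entities):
    """Delete every listed symptomatic entity from one HPI text."""
    # Remove longer entities first so a short entity cannot split a longer one.
    for entity in sorted(entities, key=len, reverse=True):
        text = text.replace(entity, "")
    # Collapse whitespace left behind by the deletions.
    return " ".join(text.split())


def occlusion_drop(hpi_texts, entities, evaluate_auc):
    """AUC with the original HPI minus AUC with the occluded HPI."""
    original_auc = evaluate_auc(hpi_texts)
    occluded_auc = evaluate_auc([occlude_entities(text, entities) for text in hpi_texts])
    return original_auc - occluded_auc
```

A positive drop means that the model performed better when the symptomatic entities were present, which is the pattern the study reports.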

Results

Patients’ Characteristics

A total of 419 patients were included in the study between January 2014 and August 2020. Eleven traditionally used predictors were selected in the study: age, gender, first operation (or not), SFC, IOMRI, tumor diameter (microadenoma or macroadenoma), disease duration, BMI, 24-h UFC, morning plasma cortisol level, and morning plasma ACTH level. All the predictors are presented in Table 1. The characteristics of the remission and non-remission groups are presented in Table 2. From the results of logistic univariate analysis, the first operation (p < 0.001), IOMRI (p = 0.010), SFC (p = 0.011), and ACTH (p = 0.009) were strongly correlated with IR. From the results of logistic multivariate analysis, the first operation (p < 0.001), IOMRI (p = 0.046), and ACTH (p = 0.024) were strongly correlated with IR (Table 3).

Table 1 Participants’ characteristics in training and test datasets.

Table 2 Patients’ characteristics in remission and non-remission groups.

Table 3 Logistic univariate and multivariate analysis of the relationship between risk factors and IR.

Predictive Performance of Models

Four ML-based algorithms were used: MLP, SVM, RF, and LR. The performance of each model with different numbers of structured features is shown in Figure 1. The highest AUC values for MLP, SVM, RF, and LR were 0.759, 0.733, 0.678, and 0.699, respectively (Figure 2). Each unstructured feature was sequentially introduced into the model, which had all structured features included. Then, each model output an AUC value (Table 4). Pairs of unstructured features (chief complaint with individual conclusions of the case by doctor, HPI with individual conclusions of the case by doctor, and chief complaint with HPI) were then introduced into each model; however, the AUC values were not higher than when only one unstructured feature was introduced into the model. The highest AUC value (0.793) was achieved by LR when 11 structured features and individual conclusions of the case by doctor were introduced.
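For readers unfamiliar with the metric, the AUC can be computed directly from its pairwise definition: the probability that a randomly chosen remission case receives a higher predicted score than a randomly chosen non-remission case. The sketch below is an illustrative implementation, not the code used in the study.

```python
def roc_auc(labels, scores):
    """AUC as the fraction of positive/negative pairs ranked correctly (ties count 0.5)."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative case")
    wins = 0.0
    for p in positives:
        for q in negatives:
            if p > q:
                wins += 1.0
            elif p == q:
                wins += 0.5
    return wins / (len(positives) * len(negatives))
```

A value of 0.5 corresponds to random ranking and 1.0 to perfect separation, which is the scale on which the reported values of 0.699 and 0.793 should be read.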

Figure 1 AUC values of four models with different numbers of structured features selected. The highest AUC value appeared when MLP with 11 variables came into use (AUC = 0.759).

Figure 2 Performances of models with optimal number of structured features. MLP performed the best.

Table 4 AUC values and 95% confidence intervals of different models with different features.

Unstructured features contain too much redundant information; hence, three or more unstructured features were not combined in this study to extract valid information.

Variable Importance

F-test univariate analysis was used to rank the importance of the 11 variables. Their rank was as follows: first operation, SFC, morning ACTH, IOMRI, 24-h UFC, disease duration, BMI, tumor diameter, gender, plasma cortisol, and age. The rank of the features of unstructured data was evaluated using the change in AUC value after adding a single unstructured feature into the model based only on the structured features. For LR, “individual conclusions of the case by doctor” was ranked first.
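The ranking of unstructured features by AUC change amounts to a simple subtraction against the structured-only baseline. In the sketch below, the function name is hypothetical, and the feature names and values in the usage note are placeholders rather than results from Table 4.

```python
def rank_by_auc_gain(baseline_auc, auc_with_feature):
    """Order unstructured features by AUC change relative to the structured-only model."""
    gains = {name: auc - baseline_auc for name, auc in auc_with_feature.items()}
    return sorted(gains.items(), key=lambda item: item[1], reverse=True)
```

For instance, `rank_by_auc_gain(0.72, {"feature_a": 0.75, "feature_b": 0.70})` places `feature_a` first with a positive gain and `feature_b` second with a negative one.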

Occlusion Tests

The performance of the model with the input of the original HPI was conspicuously better than that with the input of the HPI without symptomatic entities (Table 5). The red Chinese characters indicate the deleted symptomatic entities.

Table 5 Example of Occlusion Test Results.

Discussion

TSS is the first-line treatment method for CD. IR rates are typically between 59% and 96.6% (18). In the present study, the IR rate was 75.7% (317/419), which is almost the same as the result of 76% from a previous study (19). IR may be a strong predictor of long-term remission (20). IR is also important for doctor–patient communication because patients are always concerned about whether clinical manifestations can be eliminated immediately. Thus, it is of great importance to develop an ML-based model for the preoperative prediction of IR.

Various types of manifestations exist in patients with CD because of hypercortisolism, such as abnormal fat distribution, weight gain, osteoporosis, diabetes mellitus, ecchymosis, and hypokalemia. According to our limited experience, the symptoms and signs a patient has are strongly correlated with the patient’s prognosis. Therefore, we speculate that the unstructured data of patients with CD contributes to the ML-based model for the preoperative prediction of IR. The manifestations of patients with CD may be recorded in EMR, which has been ignored by clinicians in quantitative analysis because natural language could not be processed in the past. However, natural language processing techniques can now deal with EMR as the input of ML-based models, which facilitates the full use of multimodal data (structured data and unstructured data in EMR). In the present study, we performed occlusion tests on HPI and the results demonstrated that the performance of the model with the input of the original HPI was better than that with the input of HPI without symptomatic entities. Therefore, we speculated from the occlusion test and our limited experience that symptomatic entities in HPI were strongly related to IR and conducive to the prediction of IR.

In our previous study, we used several ML algorithms to build ML-based models to preoperatively predict IR (7). In that study, we only included structured data in the ML-based model, whereas in the present study, we introduced not only structured data into the models but also unstructured data. Unstructured data may contain information related to the severity of CD in addition to the 11 features of structured data that were summarized by clinicians according to their personal experience. The features included in the final model with the highest AUC (0.743) in our previous study were IOMRI, tumor size, whether it is the first operation, and ACTH level (8:00 a.m.), whereas in the present study, the model with the highest AUC (0.793) was constructed using LR with 11 structured features and “individual conclusions of the case by doctor.” The model performance in the present study was superior to that in the previous study.

The importance of the features of structured data was ranked using the F-test, whereas the importance of the features of unstructured data was evaluated using the change in AUC value after adding a single unstructured feature into the model based only on the structured features. Information such as images and voice is inherently vectorized and continuous. Natural language (as in EMR) is different: it is an expression and abstract summary of objective things. This is an advantage of human thought; however, it makes natural language difficult for computers to process because there is no strong correlation between specific sensory information and natural language. In the past, computers could only perform statistical and logical reasoning through the relationship between symbols, which made it difficult to express the continuity of language. In 2013, Mikolov et al. (21) enabled vocabulary to form the deep model input of continuous real number space in the same manner as images and audio, and the learning efficiency of the model was much higher than that of previous models. Thus, we used a word embedding method in the present study to vectorize EMR. “Individual conclusions of the case by doctor” are routine records in EMR at PUMCH. They are the conclusions of clinicians according to the clinical characteristics of patients, and they may reflect the subjective perception of doctors about the severity of the disease. Therefore, we speculated that key information related to the severity of the disease may be hidden in free text and could contribute to the ML-based model.

Table 4 shows that the AUC values of MLP and SVM did not increase after unstructured features were introduced into the model, whereas the AUC value of LR increased significantly after “individual conclusions of the case by doctor” was introduced into the model. We speculate that these contrasting results have several causes. First, unstructured data text is generally long, with a great deal of useless information, and can easily be overfitted in MLP, which can lead to the decline of AUC values. Similarly, SVM looks for a hyperplane to separate data points, which makes it difficult to determine an appropriate hyperplane to separate them in the case of complex data features. Therefore, the performance of SVM decreases after unstructured data that contain a great deal of redundant information are introduced into the model. The linear model structure of LR enables it to capture quasi-linear characteristics and ignore high-dimensional redundant information; hence, it can capture key information in the unstructured text to obtain a high-grade classification capacity. To summarize, MLP and SVM are more complex than LR, which made the latter even more effective in the present study.

To the best of our knowledge, the present study is the first to use unstructured data from the EMR of patients with CD as the input of ML-based models. In this process, we embedded these unstructured features, and transformed them into relatively low-dimensional dense vectors to facilitate the model construction of ML (22, 23). In previous studies on the ML model construction process, one-hot encoding on discrete characteristics was typically feasible for clinically used binary structured data (e.g., gender). However, features with one-hot encoding may be too high-dimensional and sparse for EMR data, which is not conducive to model training. CD is a type of neuroendocrine tumor that causes not only a mass effect but also various types of endocrine symptoms recorded in EMR that can be fully used by an ML-based model after embedding.
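The dimensionality problem with one-hot encoding can be made concrete with a small sketch. The helper names and the toy vocabulary are hypothetical; the point is only that the encoded vector is as long as the vocabulary and mostly zeros, whereas an embedded record has a fixed, much smaller dimension.

```python
def one_hot(token, vocabulary):
    """Encode one token as a vector with a single 1 at the token's vocabulary position."""
    return [1 if word == token else 0 for word in vocabulary]


def multi_hot(tokens, vocabulary):
    """Encode a whole text as a sparse presence vector over the vocabulary."""
    present = set(tokens)
    return [1 if word in present else 0 for word in vocabulary]


def sparsity(vector):
    """Fraction of zero entries in an encoded vector."""
    return vector.count(0) / len(vector)
```

For binary structured fields such as gender the vocabulary has two entries and one-hot encoding is harmless, but for free text the vocabulary can reach many thousands of words, so each record becomes a long and almost empty vector.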

In the present study, the final model with the highest AUC included all structured features; however, according to the F-test, four structured features were correlated with IR. Their rank is as follows: first operation or not, SFC, ACTH, and IOMRI. If a patient has already undergone at least one operation, there is a higher chance that the tumor is more invasive and aggressive, which may cause postoperative residual (24). SFC was the second-most important predictor of IR in the present study. If the sellar floor of a patient is infiltrated on preoperative MRI, it is likely that the tumor has higher invasiveness that makes it invade the mucosa and bone in the sellar region. In this circumstance, there is a relatively great possibility of postoperative residual. Preoperative ACTH level was the third-most important predictor of IR, which is consistent with our previous study (9). IOMRI was the fourth-most important predictor, which is also consistent with our previous study (9). An intriguing observation is that tumor size was not a predictor of IR, which is inconsistent with previous studies (9, 19, 25, 26). In our previous study (7), tumor size was strongly correlated with IR when two surgeons performed operations over several decades. However, in the present study, only MF performed the operation. We can speculate from the result that with the evolution of surgical skills and personal experience, tumor size is no longer a major predictor of IR.

Strengths and Limitations

The present study has two strengths. First, this is the first study that used deep learning techniques to deal with EMR of patients with CD as input of an ML-based model that improved model performance. EMR contains sufficient information about the patient to reflect real-world information. Second, a relatively large CD cohort was considered. There are also two limitations. First, this was a single-center study. Second, the performance of the ML-based model depended on the quality of EMR.

Conclusions

EMR of patients with CD can be used as input to an ML-based model after being processed to preoperatively predict IR. The model with structured features together with unstructured features conspicuously enhanced the performance of the model compared with the model that used only structured features as input. First operation or not, SFC, ACTH, and IOMRI were the most important predictors of IR of CD.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by the ethical review committee of Peking Union Medical College Hospital. Written informed consent to participate in this study was provided by the participants’ legal guardian/next of kin.

Author Contributions

WZ and DL contributed equally to the present study. Each author contributed to data collection and analysis. RW and QC take final responsibility for this article. All authors contributed to the article and approved the submitted version.

Funding

This work was supported by the CAMS Innovation Fund for Medical Sciences (CIFMS) (2020-I2M-C&T-B-031), the Natural Science Foundation of China (Grant Nos. 61872113 and 62006061), and the Shenzhen Foundational Research Funding (JCYJ20200109113441941).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

We thank Maxine Garcia, PhD, from Liwen Bianji (Edanz) (www.liwenbianji.cn/) for editing the English text of a draft of this manuscript.

Abbreviations

ML, machine learning; TSS, transsphenoidal surgery; CD, Cushing’s disease; ROC, receiver operating characteristic curve; AUC, area under the curve; UFC, urine free cortisol; LDDST, low-dose dexamethasone suppression test; HDDST, high-dose dexamethasone suppression test; MRI, magnetic resonance imaging; BIPSS, bilateral inferior petrosal sinus sampling; IOMRI, invasion of cavernous sinus from MRI; LR, logistic regression; RF, random forest; MLP, multilayer perceptron; SVM, support vector machine.

References

1. Steffensen C, Bak AM, Rubeck KZ, Jorgensen JO. Epidemiology of Cushing’s Syndrome. Neuroendocrinology (2010) 92(Suppl 1):1–5. doi: 10.1159/000314297

2. Pivonello R, De Leo M, Cozzolino A, Colao A. The Treatment of Cushing’s Disease. Endocr Rev (2015) 36(4):385–486. doi: 10.1210/er.2013-1048

3. Arnaldi G, Angeli A, Atkinson AB, Bertagna X, Cavagnini F, Chrousos GP, et al. Diagnosis and Complications of Cushing’s Syndrome: A Consensus Statement. J Clin Endocrinol Metab (2003) 88(12):5593–602. doi: 10.1210/jc.2003-030871

4. Biller BMK, Grossman AB, Stewart PM, Melmed S, Bertagna X, Bertherat J, et al. Treatment of Adrenocorticotropin-Dependent Cushing’s Syndrome: A Consensus Statement. J Clin Endocrinol Metab (2008) 93(7):2454–62. doi: 10.1210/jc.2007-2734

5. Petersenn S, Beckers A, Ferone D, van der Lely A, Bollerslev J, Boscaro M, et al. Therapy of Endocrine Disease: Outcomes in Patients With Cushing’s Disease Undergoing Transsphenoidal Surgery: Systematic Review Assessing Criteria Used to Define Remission and Recurrence. Eur J Endocrinol (2015) 172(6):R227–39. doi: 10.1530/EJE-14-0883

6. Zoli M, Staartjes VE, Guaraldi F, Friso F, Rustici A, Asioli S, et al. Machine Learning-Based Prediction of Outcomes of the Endoscopic Endonasal Approach in Cushing Disease: Is the Future Coming? Neurosurg Focus (2020) 48(6):E5. doi: 10.3171/2020.3.FOCUS2060

7. Zhang W, Sun M, Fan Y, Wang H, Feng M, Zhou S, et al. Machine Learning in Preoperative Prediction of Postoperative Immediate Remission of Histology-Positive Cushing’s Disease. Front Endocrinol (2021) 12:635795. doi: 10.3389/fendo.2021.635795

8. Liu Y, Liu X, Hong X, Liu P, Bao X, Yao Y, et al. Prediction of Recurrence After Transsphenoidal Surgery for Cushing’s Disease: The Use of Machine Learning Algorithms. Neuroendocrinology (2019) 108(3):201–10. doi: 10.1159/000496753

9. Dai C, Fan Y, Liu X, Bao X, Yao Y, Wang R, et al. Predictors of Immediate Remission After Surgery in Cushing’s Disease Patients: A Large Retrospective Study From a Single Center. Neuroendocrinology (2020). doi: 10.1159/000509221

10. Cleophas TJ. Machine Learning in Therapeutic Research: The Hard Work of Outlier Detection in Large Data. Am J Ther (2016) 23(3):e837–43. doi: 10.1097/MJT.0b013e31827ab4a0

11. Rajkomar A, Dean J, Kohane I. Machine Learning in Medicine. Reply. N Engl J Med (2019) 380(26):2589–90. doi: 10.1056/NEJMra1814259

12. Miñarro-Giménez JA, Marín-Alonso O, Samwald M. Exploring the Application of Deep Learning Techniques on Medical Text Corpora. Stud Health Technol Informatics (2014) 205:584–8. doi: 10.3233/978-1-61499-432-9-584

13. Rajkomar A, Oren E, Chen K, Dai AM, Hajaj N, Hardt M, et al. Scalable and Accurate Deep Learning With Electronic Health Records. NPJ Digital Med (2018) 1(1):1–10. doi: 10.1038/s41746-018-0029-1

14. Esteva A, Robicquet A, Ramsundar B, Kuleshov V, DePristo M, Chou K, et al. A Guide to Deep Learning in Healthcare. Nat Med (2019) 25(1):24–9. doi: 10.1038/s41591-018-0316-z

15. Xiao C, Choi E, Sun J. Opportunities and Challenges in Developing Deep Learning Models Using Electronic Health Records Data: A Systematic Review. J Am Med Inf Assoc (2018) 25(10):1419–28. doi: 10.1093/jamia/ocy068

16. Feng M, Liu Z, Liu X, Bao X, Yao Y, Deng K, et al. Diagnosis and Outcomes of 341 Patients With Cushing’s Disease Following Transsphenoid Surgery: A Single-Center Experience. World Neurosurg (2018) 109:e75–80. doi: 10.1016/j.wneu.2017.09.105

17. Nieman LK, Biller BMK, Findling JW, Murad MH, Newell-Price J, Savage MO, et al. Treatment of Cushing’s Syndrome: An Endocrine Society Clinical Practice Guideline. J Clin Endocrinol Metab (2015) 100(8):2807–31. doi: 10.1210/jc.2015-1818

18. Ioachimescu AG. Prognostic Factors of Long-Term Remission After Surgical Treatment of Cushing’s Disease. Endocrinol Metab Clinics North America (2018) 47(2):335–47. doi: 10.1016/j.ecl.2018.02.002

19. Abu Dabrh AMA, Singh Ospina NM, Al Nofal A, Farah WH, Barrionuevo P, Sarigianni M, et al. Predictors of Biochemical Remission and Recurrence After Surgical and Radiation Treatments of Cushing Disease: A Systematic Review and Meta-Analysis. Endocr Pract: Off J Am Coll Endocrinol Am Assoc Clin Endocrinologists (2016) 22(4):466–75. doi: 10.4158/EP15922.RA

20. Ironside N, Chatain G, Asuzu D, Benzo S, Lodish M, Sharma S, et al. Earlier Post-Operative Hypocortisolemia may Predict Durable Remission From Cushing’s Disease. Eur J Endocrinol (2018) 178(3):255–63. doi: 10.1530/EJE-17-0873

21. Mikolov T, Chen K, Corrado G, Dean J. Efficient Estimation of Word Representations in Vector Space. ICLR (workshop poster) (2013).

22. Mikolov T, Sutskever I, Chen K, Corrado GS, Dean J. Distributed Representations of Words and Phrases and Their Compositionality. Advances in Neural Information Processing Systems 26: 27th Annual Conference on Neural Information Processing Systems 2013. Proceedings of a meeting held December 5-8. Lake Tahoe, Nevada, USA (2013) 26:3111–19. doi: 10.5555/2999792.2999959

23. Song Y, Shi S, Li J, Zhang H. Directional Skip-Gram: Explicitly Distinguishing Left and Right Context for Word Embeddings. Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies. New Orleans, Louisiana, USA: NAACL-HLT (2018) 2:175–80. doi: 10.18653/v1/N18-2028

24. Starke RM, Reames DL, Chen C-J, Laws ER, Jane JA. Endoscopic Transsphenoidal Surgery for Cushing Disease: Techniques, Outcomes, and Predictors of Remission. Neurosurgery (2013) 72(2):240–7. doi: 10.1227/NEU.0b013e31827b966a

25. Blevins LS, Christy JH, Khajavi M, Tindall GT. Outcomes of Therapy for Cushing’s Disease Due to Adrenocorticotropin-Secreting Pituitary Macroadenomas. J Clin Endocrinol Metab (1998) 83(1):63–7. doi: 10.1210/JCEM.83.1.4525

26. Chandler WF, Barkan AL, Hollon T, Sakharova A, Sack J, Brahma B, et al. Outcome of Transsphenoidal Surgery for Cushing Disease: A Single-Center Experience Over 32 Years. Neurosurgery (2016) 78(2):216–23. doi: 10.1227/NEU.0000000000001011

Keywords: natural language processing, Cushing’s disease, immediate remission, preoperative prediction, machine learning

Citation: Zhang W, Li D, Feng M, Hu B, Fan Y, Chen Q and Wang R (2021) Electronic Medical Records as Input to Predict Postoperative Immediate Remission of Cushing’s Disease: Application of Word Embedding. Front. Oncol. 11:754882. doi: 10.3389/fonc.2021.754882

Received: 07 August 2021; Accepted: 20 September 2021;
Published: 13 October 2021.

Edited by:

Qun Wu, Zhejiang University, China

Reviewed by:

Qingfang Sun, Shanghai Jiao Tong University, China
Anke Zhang, Shanghai Jiao Tong University, China

Copyright © 2021 Zhang, Li, Feng, Hu, Fan, Chen and Wang. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Renzhi Wang, wangrz@126.com; Qingcai Chen, qingcai.chen@hit.edu.cn

†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.
