Predictive analytical model for ectopic pregnancy diagnosis: Statistics vs. machine learning

Rueangket, Ploywarong; Rittiluechai, Kristsanamon; Prayote, Akara

doi:10.3389/fmed.2022.976829

ORIGINAL RESEARCH article

Front. Med., 23 September 2022

Sec. Obstetrics and Gynecology

Volume 9 - 2022 | https://doi.org/10.3389/fmed.2022.976829

Ploywarong Rueangket¹*

Kristsanamon Rittiluechai¹

Akara Prayote²

¹Department of Obstetrics and Gynecology, Phramongkutklao Hospital, Bangkok, Thailand
²Department of Computer and Information Science, Faculty of Applied Science, King Mongkut’s University of Technology North Bangkok, Bangkok, Thailand

Objective: Ectopic pregnancy (EP) is well known for its critical maternal outcome. Early detection could make the difference between life and death in pregnancy. Our aim was to make a prompt diagnosis before rupture occurs. Thus, predictive analytical models using both conventional statistics and machine learning (ML) methods were studied.

Materials and methods: A retrospective cohort study was conducted on 407 pregnancies of unknown location (PULs): 306 PULs for internal validation and 101 PULs for external validation, randomized with a nested cross-validation technique. A set of 22 study features based on clinical factors, serum markers, and ultrasound findings from electronic medical records was analyzed with neural networks (NNs), decision tree (DT), support vector machines (SVMs), and a statistical logistic regression (LR). Diagnostic performance was compared using the area under the receiver operating characteristic curve (ROC-AUC), together with sensitivity and specificity for decisional use.

Results: Comparing model performance (internal validation) to predict EP, LR ranked first, with a mean ROC-AUC ± SD of 0.879 ± 0.010. In testing data (external validation), NNs ranked first, followed closely by LR, SVMs, and DT with average ROC-AUC ± SD of 0.898 ± 0.027, 0.896 ± 0.034, 0.882 ± 0.029, and 0.856 ± 0.033, respectively. For clinical aid, we report mean sensitivity ± SD for LR: 90.20% ± 3.49%; SVMs: 89.79% ± 3.66%; DT: 89.22% ± 4.53%; and NNs: 86.92% ± 3.24%. However, mean specificity ± SD ranked NNs first, followed by SVMs, LR, and DT, at 82.02% ± 8.34%, 80.37% ± 5.15%, 79.65% ± 6.01%, and 78.97% ± 4.07%, respectively.

Conclusion: Both statistical and ML models could achieve satisfactory predictions for EP. In model learning, the highest ranked model was LR, suggesting that EP prediction might possess a linear or causal data pattern. However, in new testing data, NNs could outperform statistics. This highlights the potency of ML in solving complicated problems with various patterns, while overcoming generalization error of data.

Introduction

Ectopic pregnancy (EP) occurs when a fertilized egg implants outside the uterine cavity, resulting from numerous factors that interrupt the successful migration of the conceptus (1). The incidence rate of EP in Thailand and worldwide was 9.3 and 10–20 per 1,000 pregnancies, respectively (1–4). Unfortunately, the mortality rate was high compared with low numbers of incidence, since EP was reported as a major cause of maternal death in early pregnancy (5, 6). UK’s Healthcare Safety Investigation Branch (HSIB) has uncovered that failure or delay in diagnosis was the main concern (7, 8) and declared early diagnosis of EP to be a life-or-death medical decision (9).

EP is usually diagnosed in the first trimester of pregnancy. Presenting symptoms include vaginal bleeding, abdominal pain, missed menstruation, and fainting. Examination findings include abdominal and/or adnexal tenderness, cervical motion tenderness, or hypotension. Unfortunately, many studies found that history and physical examination do not reliably predict outcomes, because up to 50% of patients revealed no risk factor (10) and 9% reported no pain. Also, the examination was normal in nearly one-third of cases (11). When pregnancy was confirmed by a test and early pregnancy complications were suspected, ultrasound examination was commonly used to confirm the location of pregnancy (12). However, only 73.9% of tubal EPs were visualized by initial transvaginal sonography (TVS) (13), and those with no signs of intra- or extrauterine pregnancy on transvaginal ultrasonography would initially be defined as pregnancy of unknown location (PUL), ranging from 8 to 31% in prevalence. Within this cohort, one-third was found to be early intrauterine pregnancy, and a range of 8.7–42.8% was found to be EP (14–16). Recently, serial measurements of serum hCG have been shown to improve the diagnostic rate (12). Unfortunately, the result could not differentiate those with EP from normal intrauterine pregnancy or miscarriage precisely enough (17). Consequently, a former study reported that clinicians misdiagnosed more than 40% of EPs on the initial emergency department (ED) visit (18). While many clinical protocols have been improved, in addition to modern investigation tools, limitations remain in diagnosis, as observed in the UK: the National Health Service (NHS) uncovered 30 missed EPs leading to “serious harm” in 1 year (2017–2018) (9). This time-critical condition can become life-threatening when the implantation site ruptures, causing immediate intra-abdominal bleeding and eventually leading to hemorrhagic shock.

Attempts to develop models for EP diagnosis have drawn on a variety of domains. First, clinical risk factor models classified elevated-risk groups (19) or combined risk factors with a single serum hCG (20); while the specificity was high, the sensitivity was inconsistent. Second, among serum marker models, a progesterone model with a single cutoff had an AUC of 0.725, and the later, widely known serial hCG models (M1, M4, and M6) were presented with high performance. However, the accuracy was lower in different cohorts, and these models required at least two follow-up examinations up to 48 h (21–23). Finally, an ultrasound score classified patients using ultrasound findings (24), but unavoidable limitations remained in ultrasound user expertise and patients’ confounding factors.

In addition to the limitations mentioned, international consensus remains lacking, and no gold standard tool has been established to identify early EP. Our aim was to develop a model combining all three domains using traditional statistical analysis and machine learning (ML).

ML has dramatically contributed to new knowledge in the medical field in the last two decades. It has defined the evolution of interdisciplinary sciences between statistics, artificial intelligence, and medicine. It can conduct complex tasks automatically, determine hidden patterns that are too complex for humans to observe, and discover rules for behavior while adapting to changes in wording, making ML suitable to predict new EP cases (25).

Numerous EP studies have been based on traditional statistical analysis. Although EP is dangerous and difficult to detect, only a small number of studies have applied ML in this field. One used ML as a decision support model for treatment. Interestingly, another compared different ML methods to predict EP in PUL based on serum hCG and clinical information. To the best of our knowledge, this is the first study to combine all diagnostic feature domains using both statistics and ML methods.

Our research problem applied the classification technique based on a supervised learning method. The widely used ML methods include the decision tree (DT), support vector machine (SVM), neural network (NN), and logistic regression (LR), which is the traditional and most used statistical method. Each model process uses distinctive characteristics of algorithms that are suitable for different sets of data problems. Our study aimed to compare all four models and determine the best model suitable for the stated problem.

Materials and methods

Problem definition and formulation

This retrospective cohort study, briefly summarized in Figure 1, was conducted using electronic medical records of 1,604 pregnant women presenting with first trimester complication symptoms, including abdominal pain and/or abnormal vaginal bleeding, at Phramongkutklao Hospital between October 2010 and March 2022. The criteria for inclusion were suspected PUL with a medical report of clinical history, physical examination, and ultrasound evaluation. Women were included regardless of whether serum hCG had been taken, which depended on medical judgment at that time, such as for those presenting suspicious signs of intrauterine pregnancy or extrauterine pregnancy via ultrasonography at the first visit. Patients presenting with clinically suspicious ruptured EP (clinical instability or sign of intra-abdominal hemorrhage) or showing any evidence of intrauterine gestational content or EP (adnexal mass consisting of fetal pole or fetal heart motion) by ultrasound at the first visit were excluded. EPs were those diagnosed with pathological reports in surgical cases and abnormal serial serum hCG in non-surgical cases. The study was approved by the Royal Thai Army Medical Department Institutional Review Board, reference number R048h/62_Exp. Patient identification was coded before analysis and discussion. We declare that some of the electronic medical data of patients visiting Phramongkutklao Hospital used for model validation in this research are identical to data used in a previous study with different research questions and methods (26).

Figure 1. Study flow diagram based on the foundational method for data science (FMD), IBM (27, 28).

Outcome of measurement

Binary outcome (ectopic pregnancy/non-ectopic pregnancy).

Analysis

For supervised learning analysis, four basic and powerful classification methods were chosen for their unique classifying ability as conceptually demonstrated in Figure 2. Despite the development of a variety of methods, each method provides its own characteristics, and the method capability and model requirement should be matched.

Figure 2. Conceptual overview of four predictive models.

Logistic regression

Logistic regression is a traditional statistical method, introduced by the British statistician David Cox in 1958 (29). It deals with classification problems using a logistic function, whose result always falls between 0 and 1 and whose graph is S-shaped. The regression method has an advantage in its interpretability, which can explain how the model works and, more importantly, lead to an understanding of “why” a given patient was predicted “yes.” Although regression coefficients in LR are not as straightforward to interpret as those in linear regression, they indicate whether the relationship between each feature and the predicted probability is proportional or inversely proportional (30, 31).
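As a minimal sketch of the logistic function and of how a coefficient can be read, the following Python example maps a linear score to a probability and converts a coefficient to an odds ratio. The intercept, coefficient values, and feature names are hypothetical placeholders chosen for illustration; they are not fitted values from this study.

```python
import math

def logistic(z: float) -> float:
    """Map any real-valued score z to a value strictly between 0 and 1."""
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical intercept and coefficients (NOT fitted values from this study).
INTERCEPT = -1.0
COEFFICIENTS = {"adnexal_mass": 2.0, "cervical_tenderness": 1.2}

def predict_probability(patient: dict) -> float:
    """Linear score of binary features, passed through the logistic function."""
    z = INTERCEPT + sum(beta * patient.get(name, 0)
                        for name, beta in COEFFICIENTS.items())
    return logistic(z)

def odds_ratio(coefficient: float) -> float:
    """exp(beta): multiplicative change in the odds when a binary feature is present."""
    return math.exp(coefficient)

# A positive coefficient raises the predicted probability; a negative one lowers it.
p_without = predict_probability({"adnexal_mass": 0, "cervical_tenderness": 0})
p_with = predict_probability({"adnexal_mass": 1, "cervical_tenderness": 0})
print(round(p_without, 3), round(p_with, 3))
```

The sign of each coefficient gives the direction of the relationship, and its exponential gives the odds ratio, which is the usual route to interpreting an LR model.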

Support vector machines

Support vector machines are also used for classification as an alternative to LR. They were devised by Soviet statisticians in 1963 and became feasible with the introduction of kernels and soft margin classifiers in the 1990s (32). Their advantage over simple regression is that linear regression or LR uses all the data points in calculating the line of best fit, while SVMs focus only on the set of points (called “support vectors”) closest to the margin. However, in terms of interpretability, SVMs behave relatively like a black box (31).
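The idea that only points near the margin drive an SVM can be illustrated with the hinge loss of a linear soft-margin classifier, which is zero for every point lying on the correct side and beyond the margin. This is a sketch under stated assumptions: the weights, bias, and data points below are hypothetical, and no kernel is used.

```python
def decision_value(weights, bias, x):
    """Signed score of a linear classifier: w . x + b."""
    return sum(w * xi for w, xi in zip(weights, x)) + bias

def hinge_loss(y, score):
    """Hinge loss for a label y in {-1, +1}; zero once y * score >= 1."""
    return max(0.0, 1.0 - y * score)

# Hypothetical linear separator and toy points (illustration only).
weights, bias = [1.0, -1.0], 0.0
points = [([3.0, 0.0], +1),   # far on the correct side: no loss
          ([0.5, 0.0], +1),   # correct side but inside the margin
          ([0.0, 2.5], -1),   # far on the correct side: no loss
          ([0.2, 0.0], -1)]   # wrong side of the boundary

losses = [hinge_loss(y, decision_value(weights, bias, x)) for x, y in points]
# Only points with a non-zero loss pull on the fitted boundary.
influential = [x for (x, _), loss in zip(points, losses) if loss > 0]
print(losses, influential)
```

Points far from the boundary contribute nothing to the loss, which is why moving them slightly would leave the fitted boundary unchanged.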

Decision trees

Decision trees classify training data by sorting them down a tree from the root to the leaf nodes. Each internal node tests a feature, and a prediction is made at each leaf node. A leaf is a collection of examples that may not be classified any further (33). DTs can handle both discrete and continuous values and can be used even when some training data have unknown values (33). However, practical issues arise in determining how deeply to grow the DT, managing continuous data, choosing an appropriate feature selection measure, and handling training data with missing attribute values.
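The root-to-leaf sorting described above can be sketched as a small traversal routine. The tree below is a hypothetical toy structure written only to show the mechanics; its splits and leaf labels are assumptions and are not the fitted tree from this study.

```python
# Hypothetical toy tree (illustration only, NOT the model fitted in this study).
# Internal nodes test one binary feature; leaves carry the predicted class.
TREE = {
    "feature": "adnexal_mass",
    "yes": {"leaf": "EP"},
    "no": {
        "feature": "cervical_tenderness",
        "yes": {"leaf": "EP"},
        "no": {"leaf": "non-EP"},
    },
}

def classify(node: dict, patient: dict) -> str:
    """Walk from the root to a leaf, following the branch chosen by each test."""
    while "leaf" not in node:
        # Simplifying assumption: a feature absent from the record counts as "no".
        branch = "yes" if patient.get(node["feature"], False) else "no"
        node = node[branch]
    return node["leaf"]

print(classify(TREE, {"adnexal_mass": False, "cervical_tenderness": True}))
```

Because each prediction is a readable path of feature tests, the order of the nodes can be inspected directly, which is the interpretability advantage of a DT.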

Neural networks

Neural networks were named for their simulation of how cellular networks in the brain function and were first used in the 1950s. An NN comprises one or more layers of autonomous computational units, or nodes, that receive input from other nodes (including those within the same layer) and send output, or even feedback to previous inputs, to produce the final output prediction. Although the earliest NNs were used for classification like basic SVMs or linear discriminant analysis, they have become more useful in more complex or non-linear tasks such as handwriting or image recognition, making them competitive in solving real-world problems, including those involving non-linear data. However, one disadvantage of NNs is the longer time required to run the model compared with LR, SVM, and DT on the same category of problems. Second, only the numbers of nodes and layers can be identified. Finally, NNs have no explanatory power to explain “why” a given prediction is made.

Software

RapidMiner Studio 9.9.003, a well-known data analysis tool used especially for predictive analysis and statistical computing, was used (34, 35).

Data gathering

Study population

In total, 407 PUL patients (26) were included in this study.

Features (predictive values)

The 22 categorical features spanned three domains: clinical history (demographic data, risk factor history, clinical manifestations), initial serum hCG levels, and ultrasound results. All factors were extracted and selected from literature reviews as statistically and clinically relevant to our research outcome.

Data preparation

Missing value

Due to the nature of a retrospective study, missing input data are inevitable. After reviewing cases, approximately 10–20% involved missing values across the 22 features, and these were missing at random (mostly insignificant negative findings or history assumed irrelevant at those hospital visits) (36). Our objective was to make use of the data for training rather than delete incomplete cases, which could bias classifier performance (37). Missing value imputation has been shown to improve prediction capability. Thus, Naïve Bayes, a simple probabilistic ML method, was applied for imputation (38).
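One way such an imputation can work is sketched below: the feature with missing entries is treated as the class to predict, a categorical Naïve Bayes model with Laplace smoothing is estimated from the complete rows, and the most probable value is filled in. This is an illustrative assumption about the mechanics, not the exact procedure or settings used in the study software; the function name and feature names are hypothetical.

```python
import math
from collections import Counter, defaultdict

def naive_bayes_impute(rows, target, alpha=1.0):
    """Fill None values of `target` with the most probable category under a
    categorical Naive Bayes model fitted on rows where `target` is observed."""
    observed = [r for r in rows if r[target] is not None]
    classes = sorted({r[target] for r in observed})
    prior = Counter(r[target] for r in observed)
    cond = defaultdict(lambda: defaultdict(Counter))  # feature -> class -> value counts
    seen_values = defaultdict(set)                    # feature -> categories seen
    for r in observed:
        for feature, value in r.items():
            if feature != target and value is not None:
                cond[feature][r[target]][value] += 1
                seen_values[feature].add(value)

    completed = []
    for r in rows:
        if r[target] is not None:
            completed.append(dict(r))
            continue
        best_class, best_score = None, float("-inf")
        for c in classes:
            score = math.log(prior[c] / len(observed))
            for feature, value in r.items():
                if feature == target or value is None:
                    continue  # skip the target itself and other missing predictors
                counts = cond[feature][c]
                n_categories = len(seen_values[feature] | {value})
                score += math.log((counts[value] + alpha)
                                  / (sum(counts.values()) + alpha * n_categories))
            if score > best_score:
                best_class, best_score = c, score
        completed.append({**r, target: best_class})
    return completed

# Hypothetical toy records (feature names are placeholders).
records = [
    {"pain": "yes", "bleeding": "yes", "tender": "yes"},
    {"pain": "yes", "bleeding": "no", "tender": "yes"},
    {"pain": "yes", "bleeding": "yes", "tender": "yes"},
    {"pain": "no", "bleeding": "yes", "tender": "no"},
    {"pain": "no", "bleeding": "no", "tender": "no"},
    {"pain": "no", "bleeding": "yes", "tender": "no"},
    {"pain": "yes", "bleeding": "yes", "tender": None},
]
filled = naive_bayes_impute(records, "tender")
print(filled[-1])
```

Imputing in this way keeps incomplete records in the training data instead of discarding them.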

Remove correlated features

To avoid confusing correlation and causation, features with high or substantial absolute correlation of more than 0.95 were removed (39).
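A correlation filter of this kind can be sketched as follows, assuming Pearson correlation on numerically coded features and a rule that keeps the first feature of each highly correlated pair. The tie-breaking rule, function names, and toy columns are assumptions for illustration.

```python
import math

def pearson(x, y):
    """Pearson correlation of two equal-length numeric sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    if sd_x == 0 or sd_y == 0:
        return 0.0  # a constant column has no defined correlation
    return cov / (sd_x * sd_y)

def drop_correlated(columns, threshold=0.95):
    """Keep a feature only if its absolute correlation with every
    previously kept feature does not exceed the threshold."""
    kept = []
    for name, values in columns.items():
        if all(abs(pearson(values, columns[k])) <= threshold for k in kept):
            kept.append(name)
    return kept

# Hypothetical binary-coded columns (illustration only).
columns = {
    "a": [0, 1, 0, 1, 1, 0],
    "a_copy": [0, 1, 0, 1, 1, 0],      # perfectly correlated with "a"
    "a_inverse": [1, 0, 1, 0, 0, 1],   # perfectly anti-correlated with "a"
    "c": [1, 0, 0, 1, 0, 1],
}
print(drop_correlated(columns))
```

Using the absolute value means that a feature which is simply the inverse coding of another is also treated as redundant.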

Feature selection

To select the attributes most useful for classifying examples, optimized selection using forward/backward stepwise search was applied [n (generations without improvement) = 1].
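The forward direction of such a stepwise search, with a stopping rule of one generation without improvement, can be sketched as below. The scoring function is a hypothetical stand-in for a cross-validated performance estimate, and the function and feature names are assumptions; this is not the implementation used by the study software.

```python
def forward_selection(features, score, patience=1):
    """Greedy forward selection: add the best remaining feature each round and
    stop after `patience` consecutive rounds without improvement."""
    current, remaining = [], list(features)
    best_subset, best_score = [], float("-inf")
    stalled = 0
    while remaining and stalled < patience:
        candidate = max(remaining, key=lambda f: score(current + [f]))
        current = current + [candidate]
        remaining.remove(candidate)
        candidate_score = score(current)
        if candidate_score > best_score:
            best_subset, best_score, stalled = list(current), candidate_score, 0
        else:
            stalled += 1
    return best_subset, best_score

# Hypothetical per-feature gains with a small penalty per feature, standing in
# for a cross-validated performance estimate.
GAIN = {"f1": 0.30, "f2": 0.20, "f3": 0.01}

def toy_score(subset):
    return sum(GAIN[f] for f in subset) - 0.05 * len(subset)

selected, selected_score = forward_selection(GAIN, toy_score)
print(selected, round(selected_score, 2))
```

Backward elimination follows the same pattern in reverse, starting from all features and removing the one whose removal helps most.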

Model analysis

Dataset allocation

To maximize the use of all values, while decreasing generalization error or testing/training dataset variance, the nested cross-validation technique (40) was applied by randomly splitting and selecting a training and testing dataset (306:101) in five different loops. In addition, an inner loop of 10-fold cross-validation on training/validating data was added. Performance is presented as average, minimum, and maximum values across all five outer loops.
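The allocation scheme described above can be sketched as an index generator: five outer random splits of 407 records into 306 training and 101 testing indices, each with an inner 10-fold partition of the training indices. The function name, the seed, and the way folds are cut are assumptions made for illustration.

```python
import random

def nested_cv_splits(n_samples, n_test, outer_loops=5, inner_folds=10, seed=0):
    """Yield (train, test, inner) per outer loop, where `inner` is a list of
    (fit, validate) index lists that partition the training indices."""
    rng = random.Random(seed)
    for _ in range(outer_loops):
        indices = list(range(n_samples))
        rng.shuffle(indices)
        test, train = indices[:n_test], indices[n_test:]
        # Cut the training indices into `inner_folds` nearly equal folds.
        folds = [train[k::inner_folds] for k in range(inner_folds)]
        inner = []
        for k in range(inner_folds):
            validate = folds[k]
            fit = [i for j, fold in enumerate(folds) if j != k for i in fold]
            inner.append((fit, validate))
        yield train, test, inner

splits = list(nested_cv_splits(n_samples=407, n_test=101))
for train, test, inner in splits:
    print(len(train), len(test), len(inner))
```

Model selection would use only the inner folds, so the 101 held-out records in each outer loop remain unseen until the final evaluation.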

Model training and validating (internal validation/n = 306)

All four models were trained on the entire internal validation dataset (n = 306) using 10-fold cross-validation (9:1 training to validating data), to estimate their accuracy and to limit model overfitting (25).

Performance evaluation

To compare the four models, the area under the receiver operating characteristic curve (ROC-AUC) (41) was used, reported as the mean ± SD over the cross-validation process. Accuracy, sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were also reported to gain more insight.
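These measures can be computed from predictions as in the following sketch, which derives the confusion-matrix metrics at a fixed threshold and the ROC-AUC as the probability that a randomly chosen positive case scores higher than a randomly chosen negative case. The labels, scores, and the 0.5 threshold are hypothetical toy values.

```python
def diagnostic_metrics(y_true, y_pred):
    """Sensitivity, specificity, PPV, NPV, and accuracy from binary labels (1 = EP)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)

    def ratio(numerator, denominator):
        return numerator / denominator if denominator else float("nan")

    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "ppv": ratio(tp, tp + fp),
        "npv": ratio(tn, tn + fn),
        "accuracy": ratio(tp + tn, len(y_true)),
    }

def roc_auc(y_true, scores):
    """ROC-AUC by pairwise comparison of positive and negative scores (ties count 0.5)."""
    positives = [s for t, s in zip(y_true, scores) if t == 1]
    negatives = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in positives for n in negatives)
    return wins / (len(positives) * len(negatives))

# Hypothetical toy labels and predicted probabilities.
y_true = [1, 1, 1, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.6, 0.3, 0.2]
y_pred = [1 if s >= 0.5 else 0 for s in scores]
metrics = diagnostic_metrics(y_true, y_pred)
auc = roc_auc(y_true, scores)
print(metrics, auc)
```

The ROC-AUC does not depend on the chosen threshold, whereas sensitivity and specificity do, which is why both kinds of measure are reported.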

Prediction

Model testing/deployment (external validation/n = 101)

All four models were applied to newly separated patients’ data using the nested cross-validation technique, and then, the model performance was compared using ROC-AUC.

Results

Characteristics of study populations

Of the 1,604 pregnant women with first trimester complications, 407 (25.4%) were identified as PUL at the initial visit; the final diagnoses totaled 208 (51.1%) EPs and 199 (48.9%) non-EPs. Among non-EPs, 22 (11.1%) were threatened abortion, 1 (0.5%) blighted ovum, 1 (0.5%) corpus luteal leakage, and the other 175 (87.9%) constituted spontaneous abortion. The mean age was 30 years with 55.3% multiparity. Comparing the demographic data presented in Table 1, no difference was found between the two groups.

Table 1. Descriptive demographics and features of the study population.

Regarding the data preparation process, features selected to be in the model are shown in Table 2. We then ran the model analysis and presented the performance comparison in ROC-AUC in Figure 3.

Table 2. Features selected in the four models.

Figure 3. ROC-AUC (95%CI) performance comparison of the four models using cross-validation (internal validation), created by RapidMiner Studio 9.9.003.

The average ROC-AUC was high in all models (AUC ≥ 0.856, Figures 3, 4), also highlighting that the statistical model (LR) was superior to ML in training validation, while NNs were superior in external testing.

Figure 4. Predictive performance of the four models (external validation).

Of the four models’ performances in the testing population shown in Figure 4, NNs ranked first, followed closely by LR, SVMs, and DT with average ROC-AUC ± SD of 0.898 ± 0.027, 0.896 ± 0.034, 0.882 ± 0.029, and 0.856 ± 0.033, respectively. For clinical aid, we reported mean sensitivity ± SD for LR: 90.20% ± 3.49%, SVMs: 89.79% ± 3.66%, DT: 89.22% ± 4.53%, and NNs: 86.92% ± 3.24%. Furthermore, mean specificity ± SD ranked NNs first, followed by SVMs, LR, and DT, i.e., 82.02% ± 8.34%, 80.37% ± 5.15%, 79.65% ± 6.01%, and 78.97% ± 4.07%, respectively.

Figure 5 gives more insight into how the DT model predicts the outcome, indicating that the prediction process follows a priority order. As the tree grows downward, adnexal mass was the highest-priority decision node, first used to classify patients, indicating it as the main classifying feature, followed by cervical tenderness. In the second pathway, if neither of these two features existed, history of PID, nausea-vomiting symptoms, and current use of emergency pill could provide additional decisional data, as well as serum hCG of more than 1,000 mIU/mL in another branch.

Figure 5. Decision tree model for predictive ectopic pregnancy diagnosis. Adx mass: inhomogeneous adnexal mass, N/V: nausea-vomiting, Cx tender: cervical tenderness, PID: pelvic inflammatory disease, created by RapidMiner Studio 9.9.003.

We then evaluated the models on the new cohort for external validation and found that NN models performed best with an ROC-AUC of 0.898, followed by LR, SVMs, and DT as shown in Table 3. However, only a slight difference in performance was observed between LR and NNs in both internal and external validations.

Table 3. Average ROC-AUC performance comparison of the four models applied to the internal and external validation datasets.

Discussion

Our study reported an incidence of 51.1% EPs among women initially suspected of PUL, whereas other studies reported an incidence ranging from 7 to 31% (14, 15, 42, 43). A similar rate of 43% was observed in one large prospective observational study by Malek-mellouli et al. (44). The differences could be explained by spontaneous resolution of EP in PUL that might constitute a failed diagnosis, because the location remained unknown, while some cases might have been misclassified as missed abortion. Also, the higher number might have resulted from the sensitivity of ultrasound at the initial diagnosis, which subjectively differed between cohorts. Gestational age at diagnosis was also similar in both groups, leading to a challenge for early diagnosis. Following the data science method for model development, the results are discussed in two main steps.

First, model learning and validation

While ML is believed to empower prediction fields, and theoretically its complex algorithms should enable highly accurate models (45), our results showed that LR provided better predictive ability by ROC-AUC. One explanation could be that the model was built from features chosen by reviewing the literature, which likely reflect true causation; this was unavoidable because medical data are reasonably based on fact. Evidently, due to the presence of non-random variation (causal or linear relationship) in the input variables, LR performed the best of the four models in the internal validation process. Interestingly, SVMs also presented the best accuracy, supporting the idea that medical data might possess a linear character, so that the support vectors of the SVM model exhibited a greater fit with the data (31).

Particularly interesting for researchers is the new feature, a serum hCG cutoff at ≥ 1,000 mIU/mL, for predicting EP. Another study found similar associations (46). To the best of our knowledge, no model has yet used this serum cutoff as a feature in prediction. In addition, all four models showed that cervical tenderness, adnexal mass, and serum hCG were chosen by the optimized feature selection process. This could be interpreted through the DT model. While the ultrasound finding of inhomogeneous adnexal mass was prioritized, followed by the physical examination finding of cervical tenderness, the initial level of serum hCG up to 1,000 mIU/mL, the clinical factor of nausea-vomiting, and current use of emergency pills were shown to be useful to classify women with PUL in consecutive order. This was consistent with evidence from observational studies in which these factors correlated with EP with high odds ratios (44, 47). This research might suggest that the four factors were not just related to EP but might offer biological plausibility as well.

Comparing the ML models, the major drawback, especially of NNs and DT, occurs in the training phase: we found that accuracy was highly dependent on the size of the input data (48). The generalization capability and discriminative power of NNs make them perform better than DT, both practically and theoretically. However, DT has advantages in dealing with missing values in training data, which could be more useful in practice.

Second, model deployment, and testing

When deploying the models to unseen data, the average ROC-AUC was slightly higher in all four models, supporting the generalizability of the models and the absence of overfitting. NNs proved superior in classifying EPs, followed closely by LR, highlighting that EP prediction might tend toward linear or causal relationships because medical data are based on fact, for which LR and SVMs proved their capabilities. More importantly, NNs, which have mostly been studied using non-linear data, also proved their potential performance in this prediction. Unfortunately, due to the small sample size, further study is required for more validation. Importantly, introducing the nested cross-validation technique, by randomly splitting and matching training and testing data over five different outer loops for model evaluation, provided an acceptable validation. However, when more new data become available and more features are explored, the model could become more complex, and it could become harder to incorporate all data in a single optimal model, which could constitute a drawback of ML.

For decision-making, ideally, we would prefer a diagnostic test offering both 100% sensitivity and 100% specificity. Unfortunately, this rarely occurs and is usually viewed as a trade-off (49). In practical use, we would like to focus on two circumstances.

Concerning the first patient visit, a model focusing on EP screening might be the most important, because limitations or pitfalls occur in many settings, involving the lack of obvious clinical presentations or ultrasound findings and the lack of a specialist to consult in primary care hospitals (50). To decrease the misdiagnosis rate (ruled in), high sensitivity remains crucial, for which we found LR performed the best. The concern here is that a patient with the disease (EP) might not be identified by a positive test result (51), leading to inappropriate discharge or inadequate follow-up. We also found the fewest false negative cases in the LR model. Furthermore, counseling of patients predicted as non-EP should also draw on the NPV, because sensitivity alone cannot tell how likely it is that a patient with a negative result truly does not have the condition (LR also ranked first in NPV performance).
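The relationship between sensitivity, specificity, and NPV can be made explicit with Bayes' rule, as in the sketch below. The inputs are the mean LR sensitivity and specificity and the cohort EP proportion quoted in this article; the second prevalence of 10% is a hypothetical value added for contrast. The resulting numbers are back-of-envelope illustrations and are not the NPV values reported by the study.

```python
def npv(sensitivity: float, specificity: float, prevalence: float) -> float:
    """Negative predictive value: P(no disease | negative test)."""
    true_negative = specificity * (1.0 - prevalence)
    false_negative = (1.0 - sensitivity) * prevalence
    return true_negative / (true_negative + false_negative)

def ppv(sensitivity: float, specificity: float, prevalence: float) -> float:
    """Positive predictive value: P(disease | positive test)."""
    true_positive = sensitivity * prevalence
    false_positive = (1.0 - specificity) * (1.0 - prevalence)
    return true_positive / (true_positive + false_positive)

# Mean LR sensitivity/specificity and cohort EP proportion as quoted in the text.
SENS, SPEC = 0.9020, 0.7965
npv_cohort = npv(SENS, SPEC, prevalence=0.511)
# Hypothetical lower-prevalence setting, for contrast only.
npv_low_prevalence = npv(SENS, SPEC, prevalence=0.10)
print(round(npv_cohort, 3), round(npv_low_prevalence, 3))
```

With the same sensitivity and specificity, the NPV rises as prevalence falls, which is why a negative prediction should be interpreted in light of the population being screened.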

In the second circumstance, following up the EP group or, in practice, elevated-risk PUL, serial serum hCG and ultrasound would have been followed as a standard protocol until definite cases of EP and intrauterine pregnancy were identified by ultrasound. Here, in counseling for treatment (ruled out), high specificity was more important. As a result, NNs might be chosen, because they achieved the best specificity. Thus, people who indeed did not have the condition would be very unlikely to be categorized as having it, preventing harmful unnecessary treatment of a normal pregnancy.

Therefore, selecting a model for scientific problems can markedly influence predictive performance. Building complex models from some data might create the only model that is sufficiently powerful to predict the outcome, but such a model might become useless for some questions. This is because the more complex the model, the harder the results of a prediction are to explain, so the answer to “why” the model says “yes” might never be obtained. Second, while keeping up with changing patient information in the real world, simple models tend to maintain their performance, but complex ones require up-to-date maintenance. Therefore, an additional key could be to focus on the nature of the data instead of creating complex models. Finally, and more importantly, the decision about which model is best might depend on the nature of the data and on the question of “what is the answer” vs. “why is this the answer.”

Conclusion

Our research highlights the advantage of applying ML in medical settings as an innovative way for disease prediction using its complex algorithms to discover unknown patterns or information inside the black box. The abilities in dealing with missing values, selecting the most optimized features, and analyzing non-parametric data have proved to be ground-breaking methods for clinical use.

The main study limitation was the low incidence of EP in the Thai population; thus, a retrospective design was chosen. It provided sufficient data for statistical power but entailed unavoidable missing data. Second, because the input data type is related to the analysis process, prediction performance was affected by the type of data. Our research mostly used categorical instead of continuous data, which could have limited the performance of NNs and SVMs by their nature.

Furthermore, as healthcare organizations have produced and recorded large volumes of patient information that might never be used, organized electronic data collection could be processed by an ML-based algorithm to serve as a medical alerting system for prediction. With the ML model, knowledge from these data could produce an ultimate benefit in terms of predicting and generating new insights, gaining more benefit from the experiences of previous cases (52).

Data availability statement

The original contributions presented in this study are included in the article/supplementary material; further inquiries can be directed to the corresponding author/s.

Ethics statement

Written informed consent for participation was not required for this study, in accordance with the national legislation and the institutional requirements, owing to its retrospective design.

Author contributions

PR collected the data and performed the data analysis. PR and AP interpreted the data, drafted, and revised the manuscript. PR and KR approved the version of the manuscript to be published. All authors contributed to the conception and design of the study.

Funding

This study was supported by the Phramongkutklao Research Fund.

Acknowledgments

We express our grateful thanks to KR for inspiring and encouraging energy. AP contributed to the study design and hypothesis, opening the door of data science and making this study possible. Finally, we wish to show our appreciation to the Department of Obstetrics and Gynecology, Phramongkutklao College of Medicine, for their support in completing this study. We wish to thank Mr. Thomas Mc Manamon for proofreading the manuscript.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Abbreviations

EP, ectopic pregnancy; PUL, pregnancy of unknown location; ML, machine learning; NNs, neural networks; DT, decision tree; LR, logistic regression; SVM, support vector machines.

References

1. Voedisch AJ, Cahill EP. Early pregnancy loss and ectopic pregnancy. In: Berek JS editor. Berek & Novak’s Gynecology. 16 ed. (Philadelphia, PA: Wolters Kluwer Health/Lippincott Williams & Wilkins) (2019). p. 1912–59.

2. Liampongsabhuddhi P. Epidemiological study of ectopic pregnancy in lampang hospital. ลำปาง เวช สาร (2010) 31:20–7.

3. Leke RJ, Goyaux N, Matsuda T, Thonneau PF. Ectopic pregnancy in Africa: a population-based study. Obstet Gynecol. (2004) 103:692–7. doi: 10.1097/01.AOG.0000120146.48098.f2

4. Suetrakul TL, Piyananjaratsri K, Thadsri S, Amphawa T, Phatthanapisansak C, Silalai S, et al. The Assessment of Emergency Obstetric Care (EMOC) in the Lower 5 Southern Provinces of Thailand [Internet]. Institute of Research and Development for Health of Southern (2006). Available online at: http://kb.psu.ac.th/psukb/handle/2016/10336 (accessed February 22, 2022).

5. Marion LL, Meeks GR. Ectopic pregnancy: history, incidence, epidemiology, and risk factors. Clin Obstet Gynecol. (2012) 55:376–86. doi: 10.1097/GRF.0b013e3182516d7b

6. Drife J, Lewis G. Why Mothers Die 2000–2002 – The Sixth Report of Confidential Enquiries into Maternal Deaths in the United Kingdom. London: Royal College of Obstetricians and Gynaecologists (2004).

7. Awoleke JO, Adanikin AI, Awoleke AO. Ruptured tubal pregnancy: predictors of delays in seeking and obtaining care in a Nigerian population. Int J Womens Health. (2015) 7:141. doi: 10.2147/IJWH.S76837

8. HISP. The Diagnosis of Ectopic Pregnancy: an Independent Report [Internet]. (2020). Available online at: www.hsib.org.uk/investigations-cases/diagnosisectopicpregnancy/final-report (accessed March 29, 2020).

9. Thornton J. Women are at serious risk of harm from late diagnosis of ectopic pregnancy. BMJ. (2020) 368:m924. doi: 10.1136/bmj.m924

10. Portuondo JA, Remacha MJ, Llaguno MR. Ectopic pregnancy early diagnosis limitations. Int J Gynaecol Obstet. (1982) 20:371–8. doi: 10.1016/0020-7292(82)90196-5

11. Kaplan BC, Dart RG, Moskos M, Kuligowska E, Chun B, Adel Hamid M, et al. Ectopic pregnancy: prospective study with improved diagnostic accuracy. Ann Emerg Med. (1996) 28:10–7. doi: 10.1016/S0196-0644(96)70131-2

12. ACOG. Tubal ectopic pregnancy. Obstet Gynecol. (2018) 131:e91–103. doi: 10.1097/AOG.0000000000002560

13. Kirk E, Daemen A, Papageorghiou AT, Bottomley C, Condous G, De Moor B, et al. Why are some ectopic pregnancies characterized as pregnancies of unknown location at the initial transvaginal ultrasound examination? Acta Obstet Gynecol Scand. (2008) 87:1150–4. doi: 10.1080/00016340802443822

14. Condous G, Kirk E, Van Calster B, Van Huffel S, Timmerman D, Bourne T. Failing pregnancies of unknown location: a prospective evaluation of the human chorionic gonadotrophin ratio. BJOG. (2006) 113:521–7. doi: 10.1111/j.1471-0528.2006.00924.x

15. Banerjee S, Aslam N, Woelfer B, Lawrence A, Elson J, Jurkovic D. Expectant management of early pregnancies of unknown location: a prospective evaluation of methods to predict spontaneous resolution of pregnancy. BJOG. (2001) 108:158–63. doi: 10.1111/j.1471-0528.2001.00031.x

16. Hajenius P, Mol B, Ankum W, Van der Veen F, Bossuyt P, Lammes F. Suspected ectopic pregnancy: expectant management in patients with negative sonographic findings and low serum hCG concentrations. Early Pregnancy. (1995) 1:258–62.

17. Barnhart K, Sammel MD, Chung K, Zhou L, Hummel AC, Guo W. Decline of serum human chorionic gonadotropin and spontaneous complete abortion: defining the normal curve. Obstet Gynecol. (2004) 104:975–81. doi: 10.1097/01.AOG.0000142712.80407.fd

18. Abbott J, Emmans LS, Lowenstein SR. Ectopic pregnancy: ten common pitfalls in diagnosis. Am J Emerg Med. (1990) 8:515–22. doi: 10.1016/0735-6757(90)90154-R

19. Buckley RG, King KJ, Disney JD, Gorman JD, Klausen JH. History and physical examination to estimate the risk of ectopic pregnancy: validation of a clinical prediction model. Ann Emerg Med. (1999) 34:589–94. doi: 10.1016/S0196-0644(99)70160-5

20. Barnhart KT, Sammel MD, Takacs P, Chung K, Morse CB, O’Flynn O’Brien K, et al. Validation of a clinical risk scoring system, based solely on clinical presentation, for the management of pregnancy of unknown location. Fertil Steril. (2013) 99:193–8. doi: 10.1016/j.fertnstert.2012.09.012

21. Bobdiwala S, Saso S, Verbakel JY, Al-Memar M, Van Calster B, Timmerman D, et al. Diagnostic protocols for the management of pregnancy of unknown location: a systematic review and meta-analysis. BJOG. (2019) 126:190–8. doi: 10.1111/1471-0528.15442

22. Condous G, Okaro E, Khalid A, Timmerman D, Lu C, Zhou Y, et al. The use of a new logistic regression model for predicting the outcome of pregnancies of unknown location. Hum Reprod. (2004) 19:1900–10. doi: 10.1093/humrep/deh341

23. Condous G, Van Calster B, Kirk E, Haider Z, Timmerman D, Van Huffel S, et al. Prediction of ectopic pregnancy in women with a pregnancy of unknown location. Ultrasound Obstet Gynecol. (2007) 29:680–7. doi: 10.1002/uog.4015

24. Dart R, Howard K. Subclassification of indeterminate pelvic ultrasonograms: stratifying the risk of ectopic pregnancy. Acad Emerg Med. (1998) 5:313–9. doi: 10.1111/j.1553-2712.1998.tb02711.x

25. Mitchell TM. Machine Learning. Burr Ridge, IL: McGraw Hill (1997). p. 870–7.

26. Rueangket P, Rittiluechai K. Predictive analytic model for diagnosis of ectopic pregnancy. Front Med. (2021) 8:646258. doi: 10.3389/fmed.2021.646258

27. Rollins J. Why We Need a Methodology for Data Science [Internet]. (2015). Available online at: https://www-01.ibm.com/common/ssi/cgi-bin/ssialias?htmlfid=IMW14824USEN (accessed March 31, 2022).

28. Foroughi F, Luksch P. Data science methodology for cybersecurity projects. arXiv. (2018) [Preprint]. arXiv:1803.04219. doi: 10.5121/csit.2018.80401

29. Cox DR. The regression analysis of binary sequences. J R Stat Soc. (1958) 20:215–32. doi: 10.1111/j.2517-6161.1958.tb00292.x

30. Zekić-Sušac M, Šarlija N, Has A, Bilandžić A. Predicting company growth using logistic regression and neural networks. Croat Oper Res Rev. (2016) 7:229–48. doi: 10.17535/crorr.2016.0016

31. Nadkarni P. Chapter 4–Core technologies: machine learning and natural language processing. In: Nadkarni P editor. Clinical Research Computing. Cambridge, MA: Academic Press (2016). p. 85–114.

32. Vapnik V. Pattern recognition using generalized portrait method. Autom Remote Control. (1963) 24:774–80.

33. Fan W. On the Optimality of Probability Estimation by Random Decision Trees. Menlo Park, CA: AAAI (2004).

34. KDnuggets. RapidMiner Named a Leader in the 2016 Gartner Magic Quadrant for Advanced Analytics Platforms. (2016). Available online at: https://www.kdnuggets.com/2016/02/rapidminer-leader-2016-gartnermqadvanced-analytics-platforms.html (accessed March 30, 2020).

35. Hemlata PG. Experimental evaluation of open source data mining tools. Int J Eng Technol. (2019) 68:30–5. doi: 10.14445/22315381/IJETT-V68I8P206S

36. Van Buuren S. Flexible Imputation of Missing Data. Boca Raton, FL: CRC press (2018). doi: 10.1201/9780429492259

37. Minhas S, Khanum A, Riaz F, Alvi A, Khan SA. Early Alzheimer’s disease prediction in machine learning setup: empirical analysis with missing value computation. In: Proceedings of the International Conference on Intelligent Data Engineering and Automated Learning. Berlin: Springer (2015). doi: 10.1007/978-3-319-24834-9_49

38. Luengo J, García S, Herrera F. On the choice of the best imputation methods for missing values considering three groups of classification methods. Knowl Inf Syst. (2012) 32:77–108. doi: 10.1007/s10115-011-0424-2

39. Akoglu H. User’s guide to correlation coefficients. Turk J Emerg Med. (2018) 18:91–3. doi: 10.1016/j.tjem.2018.08.001

40. Wainer J, Cawley G. Nested cross-validation when selecting classifiers is overzealous for most practical applications. Expert Syst Appl. (2021) 182:115222. doi: 10.1016/j.eswa.2021.115222

41. Fawcett T. An introduction to ROC analysis. Pattern Recognit Lett. (2006) 27:861–74. doi: 10.1016/j.patrec.2005.10.010

42. Hahlin M, Thorburn J, Bryman I. The expectant management of early pregnancies of uncertain site. Hum Reprod. (1995) 10:1223–7. doi: 10.1093/oxfordjournals.humrep.a136123

43. Banerjee S, Aslam N, Zosmer N, Woelfer B, Jurkovic D. The expectant management of women with early pregnancy of unknown location. Ultrasound Obstet Gynecol. (1999) 14:231–6. doi: 10.1046/j.1469-0705.1999.14040231.x

44. Malek-mellouli M, Oumara M, Ben Amara F, Zouch O, Neji K, Reziga H. Prediction of ectopic pregnancy in early pregnancy of unknown location. Tunis Med. (2013) 91:27–32.

45. Ishibuchi H, Nojima Y. Analysis of interpretability-accuracy tradeoff of fuzzy systems by multiobjective fuzzy genetics-based machine learning. Int J Approx Reason. (2007) 44:4–31. doi: 10.1016/j.ijar.2006.01.004

46. Odeh M, Qasoum A, Tendler R, Kais M, Khamise Farah R, Bornstein J. Pregnancy of unknown location: the value of frozen section analysis and its relation to Beta-hCG Levels and endometrial thickness. Rev Bras Ginecol Obstet. (2019) 41:142–6. doi: 10.1055/s-0038-1676123

47. Mol BW, Hajenius PJ, Engelsbel S, Ankum WM, Van der Veen F, Hemrika DJ, et al. Serum human chorionic gonadotropin measurement in the diagnosis of ectopic pregnancy when transvaginal sonography is inconclusive. Fertil Steril. (1998) 70:972–81. doi: 10.1016/S0015-0282(98)00278-7

48. Cervantes J, Lamont FG, López-Chau A, Mazahua LR, Ruíz JS. Data selection based on decision tree for SVM classification on large data sets. Appl Soft Comput. (2015) 37:787–98. doi: 10.1016/j.asoc.2015.08.048

49. Trevethan R. Sensitivity, specificity, and predictive values: foundations, pliabilities, and pitfalls in research and practice. Front Public Health. (2017) 5:307. doi: 10.3389/fpubh.2017.00307

50. Nzaumvila DK, Govender I, Ogunbanjo GA. An audit of the management of ectopic pregnancies in a district hospital, Gauteng, South Africa. Afr J Prim Health Care Fam Med. (2018) 10:e1–8. doi: 10.4102/phcfm.v10i1.1757

51. Alexander LK, Lopes B, Ricchetti-Masterson K, Yeatts KB. Assessment of Diagnostic and Screening Tests. Chapel Hill, NC: UNC (2015).

52. Linoff GS, Berry MJ. Data Mining Techniques: for Marketing, Sales, and Customer Relationship Management. Hoboken, NJ: John Wiley & Sons (2011).

Keywords: ectopic pregnancy, pregnancy of unknown location, machine learning, neural networks, decision tree, support vector machines

Citation: Rueangket P, Rittiluechai K and Prayote A (2022) Predictive analytical model for ectopic pregnancy diagnosis: Statistics vs. machine learning. Front. Med. 9:976829. doi: 10.3389/fmed.2022.976829

Received: 23 June 2022; Accepted: 25 August 2022;
Published: 23 September 2022.

Edited by:

Simcha Yagel, Hadassah Medical Center, Israel

Reviewed by:

Michal Lipschuetz, Hadassah Medical Center, Israel
Ali Çetin, University of Health Sciences, Turkey

Copyright © 2022 Rueangket, Rittiluechai and Prayote. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ploywarong Rueangket, ploywarong.24@gmail.com

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.