Skip to main content

ORIGINAL RESEARCH article

Front. Public Health, 30 November 2021
Sec. Digital Public Health
This article is part of the Research Topic Extracting Insights from Digital Public Health Data using Artificial Intelligence View all 15 articles

Prediction of PCOS and Mental Health Using Fuzzy Inference and SVM

\nAshwini Kodipalli,
Ashwini Kodipalli1,2*Susheela Devi,Susheela Devi1,2
  • 1Department of Computer Science and Automation, Indian Institute of Science, Bengaluru, India
  • 2Department of Artificial Intelligence and Data Science, Global Academy of Technology, Bengaluru, India

Polycystic ovarian syndrome (PCOS) is a hormonal disorder found in women of reproductive age. There are different methods used for the detection of PCOS, but these methods limitedly support the integration of PCOS and mental health issues. To address these issues, in this paper we present an automated early detection and prediction model which can accurately estimate the likelihood of having PCOS and associated mental health issues. In real-life applications, we often see that people are prompted to answer in linguistic terminologies to express their well-being in response to questions asked by the clinician. To model the inherent linguistic nature of the mapping between symptoms and diagnosis of PCOS a fuzzy approach is used. Therefore, in the present study, the Fuzzy Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) method is evaluated for its performance. Using the local yet specific dataset collected on a spectrum of women, the Fuzzy TOPSIS is compared with the widely used support vector machines (SVM) algorithm. Both the methods are evaluated on the same dataset. An accuracy of 98.20% using the Fuzzy TOPSIS method and 94.01% using SVM was obtained. Along with the improvement in the performance and methodological contribution, the early detection and treatment of PCOS and mental health issues can together aid in taking preventive measures in advance. The psychological well-being of the women was also objectively evaluated and can be brought into the PCOS treatment protocol.

Introduction

Polycystic ovarian syndrome (PCOS) is a hormonal disorder usually found in young adult women and in women of reproductive age. It is a common and pervasive hormonal disorder with multiple phenotypes having different presentations (1). PCOS was first discovered by Stein and Leventhal (2). According to WHO, around 6–26% of women are affected by PCOS. It includes failure to ovulate and infertility, along with mental health complications. PCOS has several effects ranging from acne and obesity to irregular menstruation and infertility, which might lead to imparity in the quality of life (35). PCOS can be detected using the Rotterdam Consensus, where the person under study should meet at least two of these conditions: (a) absence of ovulation, (b) symptoms of hyperandrogenism, and (c) ovaries affected with polycysts (6). PCOS causes several effects such as amenorrhea, obesity, type 2 diabetes, metabolic impairment, and cardiovascular disorders. As per WHO, the prevalence of PCOS is estimated to be between 6 and 26% (7). In India, the prevalence of PCOS ranges between 9.13 and 36% (8). PCOS is one of the major factors for infertility and emotional problems. Approximately up to one-third of females with PCOS have fertility issues. Women with PCOS have more anxiety and depression than women in general. The percentage of depression and anxiety in women with PCOS is 28–39% and 11–25%, respectively (9, 10). International research shows that PCOS has a significant effect on both physical and emotional well-being and, hence, affects the quality of life (11).

Recent research has shown that PCOS can be detected using an ultrasound scan where the doctor counts the size and number of follicles in the ovaries. PCOS can also be detected using biochemical examinations such as blood tests and hormonal examinations, yet it is an expensive investigation. It can also be predicted using clinical parameters such as menstrual cycle length and body mass index (BMI), however, it involves clinical acumen and the subjectivity of the clinician, as well as the intrusion of privacy of the women; also, the process is not automated and the emotional wellness of the women is not considered.

In the literature, various methods are proposed to detect PCOS. They are broadly categorized into two groups.

1. The studies where the association between PCOS and mental health is investigated using various statistical tools such as χ2 and Fisher's exact test. Here, however, the diagnosis of PCOS and its accuracy are not considered.

2. The studies where PCOS is detected using machine learning algorithms and image processing techniques, but psychological and emotional wellness is not considered.

We were motivated by the above limitations of existing works; thus, the current research is designed to include both clinical and psychological parameters to detect and diagnose the likelihood of having PCOS and its associated mental health problems. Since the clinical and psychological data pertaining to PCOS varies from woman to woman and involves inherent uncertainty, imprecision, and ambiguity, the traditional machine learning algorithms will not model the system under consideration. The fuzzy algorithms such as the fuzzy technique for order of preference by similarity to ideal solution (TOPSIS) and fuzzy analytical hierarchal process (AHP) are evaluated for their efficacy and robustness against traditional algorithms such as support vector machines (SVM), K nearest neighbor, and decision tree (D-tree).

The organization of the paper is as follows. Related work describes the related work, the drawbacks of the existing methods, and the problem statement. Proposed method provides the proposed methodology and collection of data. Results explains the results and Discussion includes the comparative study of the results through discussion. Finally, Conclusion concludes the paper.

Related Work

Some of the studies which proved the relationship between PCOS and mental health and the classification algorithms used for the prediction of PCOS are reviewed and presented below:

As per the systematic and exhaustive survey conducted by Himelein et al. (12), it was indicated that PCOS is also associated with psychological/mental health issues such as body dissatisfaction, depression, anxiety, and eating disorders, and thus, reduces the quality of life. Although in his research he did not describe the implicit relationship between PCOS and mental health, he suggested strongly that the effective treatment of PCOS can stabilize mental health issues.

Conte et al. (13) conducted a systematic review on mental health and physical exercise in women with PCOS and reported that introducing physical activity can improve the quality of life and psychological well-being of women. Kerchner et al. (14) established the relationship between PCOS and mental health and identified the risks and predictors for depression in women with PCOS. In his research, a total of 60 women with PCOS were considered and were given Primary Care Evaluation of Mental Disorders Patient Health questionnaires (PRIME-MD PHQ) (15), the Beck Depression Inventory-II (BDI) (16), and the Beck Anxiety Inventory (BAI) (17) to detect mental health disorders. The data were analyzed using χ2 and Fisher's exact test to evaluate the categorical variables. It was found that 40% (24 out of 60) of women are suffering from depression and 16.6% suffer from mood disorders which shows that at least 56.6% of women with PCOS have mental health issues.

Wolf et al. (18) conducted a systematic review based on geographical location and ethnicity. The study found that the prevalence of PCOS was between 6 and 9% across the United States, United Kingdom, Spain, Greece, Australia, Asia, and Mexico based on the National Institutes of Health (NIH) diagnostic criteria. The study also suggested that there is no racial or ethnic influence on the prevalence of PCOS. Naz et al. (19), demonstrated the prevalence of PCOS in the adult population. In his meta-analysis, 12 published articles were considered and Egger and Begg's tests were used to check the publication bias. Using 477 participants and the STATA software, it is proved that PCOS in the adult population was 11.04% based on the Rotterdam criteria, 3.39% based on the NIH criteria, and 8.03% based on the Androgen excess and PCOS society.

Banting et al. (20), in their research along with the association of mental health and PCOS, also investigated physical inactivity, motivators, and support providers for women with and without PCOS. Using a local database and computational statistics, he concluded that a woman with PCOS requires more support compared with women without PCOS. Similarly, Berni et al. (21), investigated Autism Spectrum Disorder (ASD) and Attention Deficit Hyperactivity Disorder (ADHD) and proved that the incidence of ASD and ADHD is higher in children of mothers with PCOS. In a similar development, Chandhari et al. (22) investigated the association of PCOS with psychiatric morbidity and its impact on quality of life. Using binary logistic regression, he could associate PCOS with psychiatric morbidity.

Some studies investigated the detection of PCOS using computational and machine learning algorithms. In a recent development, Sachdeva et al. (23) compared the phenotypes in PCOS by considering their metabolic, clinical, and hormonal profiles, and their differential responses to clomiphene. Using descriptive statistics, the authors concluded that patients with full-blown PCOS (phenotype A) are at a higher risk of metabolic and cardiovascular disorders compared with others, thus concluding that phenotype division can help patients in their accurate diagnoses. In a recent computational study by Denny et al. (24), an early detection and prediction model for PCOS was proposed and it was concluded that the random forest classifier provided an accuracy of 89.02%. He considered 541 women with 23 attributes that consist of both clinical and metabolic parameters. Similarly, another work by Vikas et al. (25) considered 119 women with 18 attributes consisting of clinical parameters that were used to evaluate PCOS and obtained an accuracy of 97.65% using the Naïve Bayes classifier. The study conducted by Anuradha and Priyanka (26) considered 84 women and 13 attributes to detect PCOS and obtained an accuracy of 94% using artificial neural networks. The study by Deshpande and Wakankar (27) considered imagining parameters such as follicles along with the biochemical and clinical parameters such as hormonal levels and BMI for the detection of PCOS. Support vector machine was used for the classification and obtained an accuracy of 95%.

Mishra and Prakash (28) found that, for computing human cognitive capabilities, fuzzy logic is the most effective computing method. The categories of the medical data are clearly explained in this paper. Further, Ansari et al. (29), explained in their research how Fuzzy TOPSIS can be used extensively for trustworthy health care development software.

But none of the studies have considered mental health in their analysis. In all the studies, the data was obtained by interviewing the women and from the laboratory. As the characteristics of the data are uncertain and since the likelihood of having PCOS depends on multiple parameters, fuzzy algorithms are more appropriate than crisp algorithms.

The following lacunae were found from the literature survey:

1. The studies where the association between PCOS and mental health is investigated did not evaluate the performance, efficacy, and robustness of PCOS detection.

2. Among the studies where the PCOS detection is carried out using machine learning algorithms, the psychological well-being parameters are not considered.

3. Integrated studies on PCOS detection and the analysis of its associated mental health were not present in the literature.

4. The existing studies employ statistical tools. However, there are no studies that could implement soft computing approaches like fuzzy sets and multi-criteria decision analysis systems.

5. An effective and early diagnosis system that can screen many people in a time-bound and cost-effective manner does not exist.

6. There are limited computer-assisted tools that help women in the screening of PCOS and its associated mental health privately and comfortably. It only requires the woman who is to be tested to take the questionnaire and get the diagnosis immediately.

In the current study, both physical and psychological parameters are considered for developing an automated PCOS and its associated mental health detection. The clinical data collected from the diverse population is non-deterministic, irregular, imprecise, and uncertain, making soft computing approaches such as fuzzy TOPSIS and fuzzy AHP better candidate methods than traditional methods.

Proposed Method

The methodology proposed in this study is described below:

Study Design

The research is mainly focused on developing an early prediction model. The study is designed to include young adult women of age <25. Women taking treatment for acute illness, pregnant women, and women under substance abuse are excluded from this study. The study protocol was approved by the ethical committee of the institution where the study is carried out. The written consent of the participants (subjects) was collected before involving them in the study. The study involved the participation of two expert gynecologists and one psychologist who designed the questionnaires. The questionnaire included 11 questions related to physical aspects, 10 questions related to anxiety and depression using the K10 tool, 5 questions on social phobia, and 5 questions on body image dissatisfaction. Table 1 provides a brief overview of this questionnaire. The expert panels comprising of two gynecologists and one psychologist interviewed the subjects and filled the questionnaire form. Based on the answers for different questions, the expert panels have mapped the subjects into one of the four categories, viz, A1: subject having both PCOS and mental health issues; A2: subject having only PCOS problem; A3: subject having only mental health issues; and A4: subject being normal. The expert panel filled the questions by interviewing the subjects and also mapped every subject to one of the four predefined categories.

TABLE 1
www.frontiersin.org

Table 1. Brief overview of the questionnaire.

Data Collection

In total, 660 subjects were chosen for the study where 624 have agreed to participate and others have declined to participate due to various reasons. The expert panel completed the questions by interviewing the subjects and also mapped every subject to one of the four pre-defined categories. This data is stored in a comma-separated values (CSV) file and it is used and processed for subsequent analysis by algorithms. Data visualization and data analysis, using matplotlib, were performed to gain more insight into the data. After analysis, the results of the regularity in the menstrual cycles of the subjects are shown in the form of a graph in Figure 1. The pie chart of the answers of the subjects regarding the length of their menstrual cycle is shown in Figure 2.

FIGURE 1
www.frontiersin.org

Figure 1. Non- polycystic ovarian syndrome (PCOS) subjects vs. PCOS subjects (regularity in the cycle).

FIGURE 2
www.frontiersin.org

Figure 2. Non-PCOS subjects vs. PCOS subjects (typical length of menstrual cycle).

Methodology

As noted in the literature survey in the previous section, the majority of research in this area is carried out using traditional methods like the T-square test, Chi-Square test, etc. The current implementation is an approach to apply machine learning and fuzzy systems to the data and perform a comparative study of the two. Figure 3 depicts the block diagram of the proposed method.

FIGURE 3
www.frontiersin.org

Figure 3. Block diagram of the proposed method.

A woman who has agreed to participate in the study is called a subject under study. The data collection for this study includes both physical assessments (menstrual cycle, regularity of the cycle, length of the cycle, duration of the cycle, recent weight gain, hair loss, family history of having diabetes and hypertension, and eating and sleeping habits) and psychological assessments (anxiety, depression, body image dissatisfaction, and social phobia). Based upon the answers given by the subjects to the 31 questions, the panel of doctors decides to put them into one of the four categories. Once the data is collected, it is stored in a CSV file. The data pertaining to each subject corresponds to a row and their answers for each question are noted in the corresponding columns. Thus, the dataset has 31 columns i.e., one column for every question and one last column for the category they fall into. This file is then used to implement classifier algorithms in the machine learning and TOPSIS method in the Fuzzy systems.

In the machine learning module, the acquired data in the CSV file undergoes data processing, which includes checking for missing and erroneous values. As the responses chosen by the subject are categorical, the values are replaced with integers for the representation and processing by machine learning algorithms. Eighty percent of the data is used for training and the remaining 20% of the data is used as test data. After analyzing the training data, it is observed that the data is not balanced as few classes had fewer numbers of samples and the remaining classes had more numbers of samples. Therefore, the data is balanced using the K nearest neighbor algorithm. Some samples are chosen and their K nearest neighbors are found and the mean values of these samples are added to the training data. The first 31 columns are considered as the features and the last column is the target variable. A classifier is used to predict the category of the new instances. The algorithms used here are the D-Tree classifier, K-Nearest Neighbor (KNN) classifier, and the SVM as they are some of the most commonly used classifiers in the domain. The algorithms use the training data to learn. The prediction is performed on the test data. The models are evaluated by checking their accuracy scores. An automated system is also a part of the implementation which is used to take a new instance as input by entering options in the set of questions. This is fed to the classifier to predict the appropriate class of the new instance.

Similarly, the same CSV file is used to implement the TOPSIS method. For the implementation using the fuzzy TOPSIS method, the data in the original form is considered without converting the response (for example, less than a day, more than 7 days, 5–7 days, quite irregular, fairly irregular, sometimes, most of the times, etc.) into numerical form, where the response given by the subjects are uncertain or vague.

Figure 4 depicts the computational model to detect PCOS using physical parameters. As stated in the study design, in total 11 criteria from C11 to C111 are used as input in the computational model involving machine learning algorithms and multi-criteria decision systems such as fuzzy AHP followed by the fuzzy TOPSIS. The output of the computational model is discrete, viz, normal, moderate, and high PCOS problems. Similarly, Figure 5 shows the detection of mental health disorders using anxiety and depression, social phobia, and body dissatisfaction aspects. As depicted in Table 1 of the study design, C21–C45 are the criteria variables given as input to the computational models. In the same lines, the output is normal, moderate, and high mental health issues. The outcome of both physical aspects and mental health disorder are combined using fuzzy rules to provide final inference as abnormal having both PCOS and mental health, abnormal having only PCOS, Abnormal having only mental health issues, or Normal.

FIGURE 4
www.frontiersin.org

Figure 4. The computational model for the detection of PCOS.

FIGURE 5
www.frontiersin.org

Figure 5. The computational model for the detection of mental health.

Fuzzy AHP

In the proposed PCOS detection system, in order to quantify the weights in Fuzzy TOPSIS, Fuzzy AHP (30, 31) is used, which is depicted in Figure 6. The systematic steps are as follows:

Step 1: A criteria pair-wise comparison matrix Mij is constructed. In total, 31 criteria are considered for the detection of PCOS and its associated mental health. Mij is constructed by comparing the criteria in pairs to decide which among each pair is preferred or has a greater amount of quantitative property, or whether or not two of the criteria are identical. For this research work, the Mij is constructed with the linguistic variables and its associated triangular fuzzy numbers as shown in Table 2.

Step 2: Construct the fuzzified geometric mean value. It is calculated as follows,

Mij=(x1×x2×x3×xn)1n×(y1×y2×y3×yn)1n            ×(z1×z2×z3×zn)1n    (1)

where x1xn are the first elements in the fuzzy triangular numbers, y1yn are the second elements in the fuzzy triangular numbers, similarly, z1znare the third elements in the fuzzy triangular numbers, and n is the number of criteria, i.e., n = 31.

Step 3: Calculate the fuzzy weights (w~) using the formula,

wi=ri×(r1+r2+r3rn)-1    (2)

where r1 is the addition of the first column of the fuzzified geometric mean value, similarly, r2 and rn is the addition of second and nth columns of the fuzzified geometric mean value, respectively.

Step 4: Any defuzzification method can be used to calculate the defuzzified weights (wi). For the proposed work, the Center of Area (COA) method is used.

COA=(a+b+c)3    (3)

Where a, b, and c are the 3 sides of the triangle formed by the triangular membership function.

Step 5: From the weights (wi), calculate normalized weights (wj),

wj=wii=1nci    (4)

These normalized weights are used to assign weights for the TOPSIS method.

Fuzzy TOPSIS

The Fuzzy TOPSIS method (32) is used which is described as follows:

Step 1: Initially the committee of decision-makers is formed. For this research work, the decision-makers are expert doctors in the gynecology department and the psychology department. The fuzzy rating of each decision-maker Dk where(k = 1, 2, 3) can be expressed as a triangular fuzzy number p~k where (k = 1, 2, 3) with membership function μ×p~k(x ).

Step 2: Identify the key criteria and sub-criteria for evaluation.

Step 3: Select the suitable linguistic variables for evaluating the parameters such as criteria and alternatives. Tables 3, 4 show the linguistic variables and their associated fuzzy numbers for each criterion and alternatives.

Step 4: Quantify the weights for each criterion based on the values obtained from the AHP method. Using triangular fuzzy numbers, the decision-makers ratings are described as follows,

p~k=(mk,nk,ok), where k = (1, 2, 3), then the total fuzzy rating can be determined as p~ =(m, n, o) where,

m=mink{mk},  n=1kk=1knk,o=maxk{ok}    (5)

If the fuzzy ratings and the weights of kth decision-makers are

ijk=(mijk,nijk,oijk) and w~ijk=(wjk1,wjk2,wjk3)    (6)

where i = 1, 2, 3 … m and j = 1, 2, 3,……n., then the total fuzzy rating (ỹij) of alternative will be

ij=(mij,nij,oij)

where

mij=mink{mijk},nij=1kk=1knijk,oij=maxk{oijk}    (7)

w~ij is the aggregated fuzzy weights of each criterion and is given by w~j= (wj1, wj2, wj3) where

wj1=mink{wjk1},wj2=1kk=1kwjk2,wj3=maxk{wjk3}    (8)

Step 5: The fuzzy decision matrix is constructed as

D~=11121n21222nm1m2mn 

w~=[w~1,w~2,w~n]. Here, ỹij and w~j can be approximated by the positive triangular fuzzy numbers.

Step 6: Using linear scale transformation, the values of the criteria are transformed from a criteria scale into a comparable scale as

   Ã=[ãij]m×n whereãij=(mijcj*,nijcj*,oijcj*),cj*=maxi(cij)    (9)

Step 7: The normalized weighted decision matrix is calculated. The weighted normalized decision matrix (ṽ) is calculated as

                  =[ãij]m×nwhere,ãij=ãij×w~j    (10)

Step 8: Then the fuzzy positive ideal solution (FPIS,B*) and fuzzy negative ideal solution (FNIS, B) are calculated as

             B*=(ã1*,ã2*,ãn*)           B-=(ã1-,ã2-,ãn-)where ãj*=maxi{vij3 }where ãj-=mini{vij1 }    (11)

Step 9: The distance from the alternative to the FPIS and FNIS as

di*=j=1n(ãij,ãj*)×dv where i=1,2,m    (12)
di-=j=1n(ãij,ãj-)×dv where i=1,2,m    (13)

where dv is the distance between 2 fuzzy numbers.

Step 10: The closeness coefficient (CCi) is calculated as

CCi=di-di*+di-    (14)

Step 11: The physical and mental health are obtained in the decreasing order of the closeness coefficient values. The entire flowchart of the TOPSIS method is shown in Figure 7.

FIGURE 6
www.frontiersin.org

Figure 6. Fuzzy AHP.

TABLE 2
www.frontiersin.org

Table 2. Linguistic variables and their fuzzy numbers.

TABLE 3
www.frontiersin.org

Table 3. Linguistic variables for the importance weight of each criterion.

TABLE 4
www.frontiersin.org

Table 4. Linguistic variables for alternatives.

FIGURE 7
www.frontiersin.org

Figure 7. The flow of the Fuzzy TOPSIS method.

Results

The proposed methodology using Fuzzy AHP and Fuzzy TOPSIS is implemented in python (Python Software Foundation. Delaware, United States). For quantifying the weights in fuzzy TOPSIS, fuzzy AHP is used. The normalized weights from the fuzzy AHP are shown in Tables 5AC for a test example. As seen from Table 5A, it is observed that the weights C11, C12, C13, C14, and C16 have higher values compared with C15, C17, C18, C19, C110, and C111. Higher values for the weights indicate the relatively higher importance of those criteria. From Table 5B, it can be seen that C21, C22, C23, C25, and C27 have higher importance since the weights have higher values. In the same lines, as observed from Table 5C, C41 and C42 have higher weights and hence, these criteria have more importance compared with other criteria. Hence it is validated that, C11, C12, C13, C14, C16, C21, C22, C23, C25, C27, C41, and C42 are more important criteria in comparison with other criteria in making decision as given by fuzzy AHP. In the same way, the clinicians were also asked to provide the relative importance of the 31 criteria based on their clinical acumen. Their expert opinion also exactly matched with the outcome of the fuzzy AHP. Hence these weights are used for assigning the values in the fuzzy TOPSIS.

TABLE 5A
www.frontiersin.org

Table 5A. Weights for the criteria from C11 to C111.

TABLE 5B
www.frontiersin.org

Table 5B. Weights for the criteria from C21 to C210.

TABLE 5C
www.frontiersin.org

Table 5C. Weights for the criteria from C31 to C35 and C41 to C45.

From equations 12 and 13, the distance between FPIS and FNIS for different criteria is computed and is given in Tables 68. The values obtained in these tables are used to compute the closeness coefficients. Table 9A provides the closeness coefficients between FPIS and FNIS for PCOS. A higher closeness coefficient value indicates a high probability of having PCOS. It can be seen that CC1 is higher compared with CC3 and CC2. Hence, it is observed that P1 (high PCOS) > P3 (moderate PCOS) > P2 (normal PCOS), therefore it is inferred that the possible condition is high PCOS for this particular test example. Similarly, Table 9B, shows the closeness coefficients between FPIS and FNIS for mental health. It is observed that M1 (high mental health) > M3 (moderate mental health) > M2 (normal mental health), and therefore the inference is high mental health issues for the same test example.

TABLE 6
www.frontiersin.org

Table 6. Distance between P* and Pi (i = 1, 2, 3), P and Pi (i = 1, 2, 3) for the criteria from C11 to C110.

TABLE 7
www.frontiersin.org

Table 7. Distance between M* and Mi (i = 1, 2, 3), M and Mi (i = 1, 2, 3) for the criteria from C21 to C210.

TABLE 8
www.frontiersin.org

Table 8. Distance between M* and Mi (i = 1, 2, 3), M and Mi (i = 1, 2, 3) for the criteria from C31 to C35 and C41. C45.

TABLE 9A
www.frontiersin.org

Table 9A. Closeness coefficient for PCOS for the alternatives P1, P2, and P3.

TABLE 9B
www.frontiersin.org

Table 9B. Closeness coefficient for mental health for the alternatives M1, M2, and M3.

Based on the individual (both physical and psychological) closeness coefficients, the fuzzy rules are framed in consultation with the clinicians. The rules are given in Table 10.

TABLE 10
www.frontiersin.org

Table 10. Fuzzy inference rules.

Similarly, machine learning algorithms, D-tree, KNN, and SVM algorithms are used to predict PCOS and its associated mental health issues independently and the final inference is drawn as per the rules provided in Table 9. In this process, the entire dataset is divided into a training set (80%) and testing set (20%).

The confusion matrix due to the classification using SVM and fuzzy TOPSIS is provided in Table 11 respectively. It can be noted that the Fuzzy TOPSIS method provides the highest performance compared with all other classifiers.

TABLE 11
www.frontiersin.org

Table 11. Confusion matrix during support vector machines (SVM) classification (left) and fuzzy technique for order of preference by similarity to ideal solution (TOPSIS) (right).

The number of classes defines the size of the confusion matrix. In the current study, there are four classes in the data and hence, the confusion matrix is 4 × 4. The true positive (TP), false positive (FP), true negative (TN), and false negative (FN) values are evaluated as follows.

TP of current class = Principal diagonal values of current row/column.

FP of current class = (Sum of current row values–TP of current class).

FN of current class = (Sum of current column values–TP of current class).

TN of current class = Sum of the matrix–TP of current class–Sum of current row values–Sum of current column values.

Tables 12, 13 present the TP, FP, and FN for SVM and the fuzzy TOPSIS classifier, respectively.

TABLE 12
www.frontiersin.org

Table 12. Components of confusion matrix for SVM.

TABLE 13
www.frontiersin.org

Table 13. Components of confusion matrix for fuzzy TOPSIS.

It can be seen from Table 12 that there are 10 FP and 10 FN during SVM classification, whereas during the fuzzy TOPSIS method, as indicated in Table 13, there are only three FP and three FN. It is noted that fuzzy TOPSIS provides less FN as compared with other classifiers. The presence of FN is crucial. It means that the woman actually with PCOS are diagnosed as not having PCOS by the automated system. It also indicates that the particular woman actually with PCOS goes undiagnosed, which is a fatal error. It can be inferred that Fuzzy TOPSIS provides less FN and hence, it is superior compared with all other classifiers. On the other hand, fuzzy TOPSIS has provided less FP compared with other classifiers. False-positive means a woman does not actually have PCOS but the automated system diagnoses her with PCOS. This causes a lot of undue panic and anxiety to the patients. Hence the fuzzy TOPSIS is superior in terms of less FP as well.

The accuracy of classification using KNN, D-tree, SVM, and fuzzy TOPSIS is shown in Figure 8. It can be noted that the SVM provides an accuracy of 94.01%, and D-tree and KNN provide 88.02 and 87.42%, respectively. It is observed that SVM outperforms the D-tree and KNN. The fuzzy TOPSIS provides an accuracy of 98.20%, outperforming all other classifiers.

FIGURE 8
www.frontiersin.org

Figure 8. Model evaluation: accuracies of different classifier.

It is also interesting to assess the mental health issues of a woman who was diagnosed with PCOS. Table 14 provides the proportion of women with PCOS in the testing data, as well as the proportion of women with mental health issues in the test data. It can be seen that the fuzzy TOPSIS provided that 52 women have a high possibility of PCOS and four women have a moderate possibility of PCOS. So in total, 56 women have PCOS.

TABLE 14
www.frontiersin.org

Table 14. The proportion of women having PCOS and mental health issues in the testing data.

From this data, the mental wellness indicator (IM) of PCOS patients is defined as,

IM=n {(P1   P3)   (A1)}n {P1  P3}×100

where ‘n’ represents the count meeting the condition.

Similarly, the physical wellness indicator (IP) of a patient suffering from mental illness is defined as,

IP=n {(M1   M3)   (A1)}n {M1  M3}×100

Figure 9 shows the mental wellness indicator (IM) which resulted in a value of 66.07%, which means the majority of women suffering from PCOS also suffer from mental health issues. This shows that there is a strong association between PCOS and the mental well-being of a woman which needs to be taken care of during the clinical investigation and diagnosis.

FIGURE 9
www.frontiersin.org

Figure 9. Mental wellness indicator (IM).

Figure 10 depicts the physical wellness indicator (IP) which has a value of 44%, suggesting that a considerable proportion of women suffering from mental health also have PCOS. This information can be effectively used during clinical evaluation and appropriate diagnosis can be established.

FIGURE 10
www.frontiersin.org

Figure 10. Physical wellness Indicator (IP).

Discussion

The present study investigates the early prediction of PCOS and its associated mental health issues more objectively by incorporating inherent fuzziness in the decision process. The fuzzy algorithm TOPSIS outperformed the conventional methods. Further, the mental wellness parameters, such as anxiety, depression, social phobia, and body image dissatisfaction, are integrated with clinical parameters to assess the women for the prognosis and diagnosis of PCOS and its associated socio-psychological well-being as 66.07% of women with PCOS have associated mental health issue as per the results of this study.

The computer diagnosis/automation of the early prediction of PCOS can bring a paradigm change in the conventional practices in the management of PCOS. The incorporation of the computer algorithm in the diagnosis can speed up the process and it can help the doctors in delivering the diagnosis quickly and easily. The developed computer diagnosis methodology is faster and more efficient; more women can be screened for PCOS in a short periodof time. Hence this methodology can offer a low-cost and mass screening protocol to detect PCOS. This in turn can help women of low socio-economic conditions by providing a quick, efficient, and low-budget solution.

The regular screening of women for PCOS involves personal interaction, clinical examination, ultra-sound scans, etc. The developed methodology in this study involves a pre-designed questionnaire presented by the computer to the women, which comprises both physical and psychological health questions where the privacy of the women is taken care of. Women can feel free while answering the questions with the computer in their own comfort zone without invading their privacy.

It is interesting to compare our results with previously published research as shown in Table 15. Denny et al. (24), in his study, used multiple parameters such as ultrasound result (size of follicles, number of follicles), blood investigations (TSH, AMH, vitamin D3, etc.), and clinical features (cycle length and regularity), and obtained an accuracy of 89.02% using the random forest classifier. Since hormonal tests are involved in the data collection, the process becomes expensive in terms of cost. Further, Vikas et al. (25), in their study, considered parameters such as lifestyle and food intake habits, and psychological parameters like anxiety and depression, and obtained an accuracy of 97.65% using the Naïve Bayes classifier, however, the questionnaire involved binary (yes and no) responses. Using ultrasound scanned images, Anuradha and Priyanka (26) and Deshpande and Wakankar (27) investigated PCOS and obtained an accuracy of 98% using Artificial Neural Networks and 95% using SVM, respectively. Further, Meena et al. (33), using endometrial biopsies, obtained an accuracy of 83.70% using Artificial Neural Networks. However, in these studies, mental health parameters are not considered. Further, Satish et al. (34), using clinical and biochemical parameters such as BMI, pulse rate, hemoglobin, and hormonal tests (which include FSH, Prolactin, Progesterone), obtained an accuracy of 87.72% using the Naïve Bayes classifier, yet even in this study, mental health parameters are not considered. All these above studies involve complex and expensive parameters such as hormonal investigations, blood investigations, and ultrasound scanned images which are difficult to acquire, take a longer time, and are expensive, and hence, women with low socioeconomic status cannot afford them.

TABLE 15
www.frontiersin.org

Table 15. Comparison of results with the previously published research.

Given the limitations of the above studies, this research involves simple regular parameters (physical and mental health), yet an improved performance of 94.01 and 98.20% with SVM and TOPSIS, respectively, are obtained. The present study provides higher performance than the existing standard and widely used methods. It also provides a new paradigm of PCOS diagnosis using computational intelligence by using the fuzzy TOPSIS method. In addition, the method also provides improved performance when compared with conventional methods. This methodology can also be used for the mass screening of the population by developing countries with large populations and constrained resources. Women can undergo this methodology in their own comfort zone without the intervention of a physician guarding their privacy.

The nature of data comprising of physical and mental health parameters is inherently imprecise, vague, and perceptually subjective. The traditional data analysis methods such as SVM, KNN, and the D-tree can model the crisp data accurately. When the data itself is imprecise, the crisp models cannot address the intricate fuzzy interrelations in the data. Hence the fuzzy set-based models can address the inherent subjectivity in the data and can provide improved performance. The fuzzy TOPSIS can be used to find out the most suitable alternative concerning different selection criteria. Also, the fuzzy distances used in fuzzy TOPSIS provide improved closeness coefficient values compared with other fuzzy measures. Therefore, the selected fuzzy TOPSIS has a higher potential in representing the physical phenomena of the scenario.

In addition to the above contributions, this research work also demonstrates the importance of the mental well-being of a woman during PCOS. The result indicates that the majority of women suffering from PCOS also undergo mental health issues. The outcome of this study recommends having mental health evaluations during the diagnosis of PCOS.

Conclusion

The strength of the proposed automated system involves the inclusion of PCOS and mental health questionnaires which have been validated by expert gynecologists and psychiatrists. Considering anxiety, depression, social phobia, and body image dissatisfaction analysis along with physical aspects adds new knowledge to the literature. In this study, the multi-criteria decision analysis system is developed for the early prediction of PCOS problems and mental health before the progression of the disease which can reduce morbidity. The proposed fuzzy TOPSIS-based automated system provides a high performance of 98.20% in real-time scenarios. The inherent uncertainty and imprecision involved in the data and decision-making process are well-modeled by Fuzzy TOPSIS in comparison with other methods which is evident with the high performance obtained. In addition to methodological contributions and improved performance, the study also indicates that the majority of women suffering from PCOS also have associated mental health issues. The present study proved that 66.07% of women with PCOS have associated mental health issues. This strong association implies that there is a strong need for improved protocols and efficient stress and mental health balancing guidelines in the management of PCOS.

The developed system in this study can help physicians in their clinical practice by acting as an adjunct tool. The proposed method is cost-effective and hence, women with low socio-economic conditions can also avail the screening. As the screening is automated and does not involve the intervention of clinicians, women can use the tool in their own comfort level with privacy. The use of the automated algorithm integrated with minimal intervention from physicians can speed up the screening process, making the screening available for more women in a given time and can bring a change in the conventional practices.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Author Contributions

All authors listed have made a substantial, direct and intellectual contribution to the work and approved it for publication.

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

References

1. Ekramzadeh M, Hajivandi L, Noroozi M, Mostafavi F. Psychological experiences of adolescent girls with polycystic ovary syndrome: a qualitative study. Iran J Nurs Midwifery Res. (2020) 25:341–7. doi: 10.4103/ijnmr.IJNMR_276_19

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Naqvi SMAS, Bhattarai JB, Hui L, Wang XW. Polycystic ovarian syndrome and female infertility. Yangtze Med. (2020) 4:11–27. doi: 10.4236/ym.2020.41002

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Franks S. Polycystic ovary syndrome. N Engl J Med. (1995) 333:853–61. doi: 10.1056/NEJM199509283331307

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Kiddy DS, Sharp PS, White DM, Scanlon MF, Mason HD, Bray CS, et al. Differences in clinical and endocrine features between obese and non-obese subjects with polycystic ovary syndrome: an analysis of 263 consecutive cases. Clin Endocrinol. (1990) 32:213–20. doi: 10.1111/j.1365-2265.1990.tb00857.x

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Lim SS, Clifton PM, Noakes M, Norman RJ. Obesity management in women with polycystic ovary syndrome. Women Health. (2007) 3:73–86. doi: 10.2217/17455057.3.1.73

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Rotterdam ESHRE/ASRM-Sponsored PCOS Consensus Workshop Group. Revised 2003 consensus on diagnostic criteria and long-term health risks related to polycystic ovary syndrome. Fertil Steril. (2004) 81:19–25. doi: 10.1016/j.fertnstert.2003.10.004

PubMed Abstract | CrossRef Full Text

7. Lauritsen MP, Bentzen JG, Pinborg A, Loft A, Forman JL, Thuesen LL, et al. The prevalence of polycystic ovary syndrome in a normal population according to the Rotterdam criteria versus revised criteria including anti-mullerian hormone. Hum Reprod. (2014) 29:791–801. doi: 10.1093/humrep/det469

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Nidhi R, Padmalatha V, Nagarathna R, Amritanshu R. Prevalence of polycystic ovarian syndrome in Indian adolescents. J Pediatr Adolesc Gynecol. (2011) 24:223–7. doi: 10.1016/j.jpag.2011.03.002

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Hussain A, Chandel RK, Ganie MA, Dar MA, Rather YH, Wani ZA, et al. Prevalence of psychiatric disorders in patients with a diagnosis of polycystic ovary syndrome in Kashmir. Indian J Psychol Med. (2015) 37:66–70. doi: 10.4103/0253-7176.150822

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Upadhyaya SK, Sharma A, Agrawal A. Prevalence of anxiety and depression in polycystic ovarian syndrome. Int J Med Sci Public Health. (2016) 5:681–3. doi: 10.5455/ijmsph.2016.30072015110

CrossRef Full Text

11. Fauser BC, Tarlatzis BC, Rebar RW, Legro RS, Balen AH, Lobo R, et al. Consensus on women's health aspects of polycystic ovary syndrome (PCOS): the Amsterdam ESHRE/ASRM-Sponsored 3rd PCOS Consensus Workshop Group. Fertil Steril. (2012) 97:28–38. doi: 10.1016/j.fertnstert.2011.09.024

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Himelein MJ, Thatcher SS. Polycystic ovary syndrome and mental health: a review. Obstet Gynecol Surv. (2006) 61:723–32. doi: 10.1097/01.ogx.0000243772.33357.84

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Conte F, Banting L, Teede HJ, Stepto NK. Mental health and physical activity in women with polycystic ovary syndrome: a brief review. Sports Med. (2015) 45:497–504. doi: 10.1007/s40279-014-0291-6

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Kerchner A, Lester W, Stuart SP, Dokras A. Risk of depression and other mental health disorders in women with polycystic ovary syndrome: a longitudinal study. Fertil Steril. (2009) 91:207–12. doi: 10.1016/j.fertnstert.2007.11.022

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Spitzer RL, Kroenke K, Williams JB. Validation and utility of a self-report version of PRIME-MD: the PHQ primary care study. JAMA. (1999) 282:1737–44. doi: 10.1001/jama.282.18.1737

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Beck AT, Ward CH, Mendelson M, Mock J, Erbaugh J. An inventory for measuring depression. Arch Gen Psychiatry. (1961) 6:561–71. doi: 10.1001/archpsyc.1961.01710120031004

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Beck Anxiety Inventory. ACP PIER: The Physicians' Information and Education Resource Evidence Ratings and Criteria. Philadelphia, PA: American College of Physicians (2007).

18. Wolf WM, Wattick RA, Kinkade ON, Olfert MD. Geographical prevalence of polycystic ovary syndrome as determined by region and race/ethnicity. Int J Environ Res Public Health. (2018) 15:2589–602. doi: 10.3390/ijerph15112589

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Naz MSG, Tehrani FR, Majd HA, Ahmadi F, Ozgoli G, Fakari FR, et al. The prevalence of polycystic ovary syndrome in adolescents: a systematic review and meta-analysis. Int J Reprod Biomed. (2019) 17:533–42. doi: 10.18502/ijrm.v17i8.4818

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Banting LK, Gibson-Helm M, Polman R, Teede HJ, Stepto NK. Physical activity and mental health in women with polycystic ovary syndrome. BMC Women's Health. (2014) 14:51–9. doi: 10.1186/1472-6874-14-51

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Berni TR, Morgan CL, Berni ER, Rees DA. Polycystic ovary syndrome is associated with adverse mental health and neurodevelopmental outcomes. J Clin Endocrinol Metab. (2018) 103:2116–25. doi: 10.1210/jc.2017-02667

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Chaudhari AP, Mazumdar K, Mehta PD. Anxiety, depression, and quality of life in women with polycystic ovarian syndrome. Indian J Psychol Med. (2018) 40:239–46. doi: 10.4103/IJPSYM.IJPSYM_561_17

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Sachdeva G, Gainder S, Suri V, Sachdeva N, Chopra S. Comparison of the different PCOS phenotypes based on clinical metabolic, and hormonal profile, and their response to clomiphene. Indian J Endocrinol Metab. (2020) 23:326–31. doi: 10.4103/ijem.IJEM_30_19

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Denny A, Raj A, Ashok A, Ram CM, George R. i-HOPE: detection and prediction system for Polycystic Ovary Syndrome (PCOS) using machine learning techniques. In TENCON 2019 - 2019 IEEE Region 10 Conference (TENCON). Kochi (2019).

Google Scholar

25. Vikas B, Anuhya BS, Chilla M, Sarangi S. A critical study of Polycystic Ovarian Syndrome (PCOS) classification techniques. Int J Comput Eng Manag. (2018) 4:1–7.

Google Scholar

26. Anuradha DT, Priyanka RL. Genetic clustering for polycystic ovary syndrome detection in women of reproductive age. Int J Eng Adv Technol. (2020) 3:1359–61. doi: 10.35940/ijeat.C5457.029320

PubMed Abstract | CrossRef Full Text

27. Deshpande SS, Wakankar A. Automated detection of polycystic ovarian syndrome using follicle recognition. In IEEE International Conference on Advanced Communications, Control and Computing Technologies. Ramanathapuram (2014).

Google Scholar

28. Mishra S, Prakash M. Study of fuzzy logic in medical data analytics. Int J Pure Appl Math. (2018) 119:16321–42.

Google Scholar

29. Ansari MTJ, Al-Zahrani FA, Pandey D, Agrawal A. A fuzzy TOPSIS based analysis toward selection of effective security requirements engineering approach for trustworthy healthcare software development. BMC Med Inform Decis Mak. (2020) 20:236. doi: 10.1186/s12911-020-01209-8

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Buckley JJ. Fuzzy hierarchical analysis. Fuzzy Sets Syst. (1985) 17:233–47. doi: 10.1016/0165-0114(85)90090-9

CrossRef Full Text | Google Scholar

31. Carlsson C, Full'er R. Fuzzy multiple criteria decision making: Recent developments. Fuzzy Sets Syst. (1996) 78:139–53. doi: 10.1016/0165-0114(95)00165-4

CrossRef Full Text | Google Scholar

32. Chen CT. Extension of the TOPSIS for group decision-making under fuzzy environment. Fuzzy Sets Syst. (2000) 114:1–9. doi: 10.1016/S0165-0114(97)00377-1

CrossRef Full Text | Google Scholar

33. Meena K, Manimekalai M, Rethinavalli S. A novel framework for filtering the PCOS attributes using data mining techniques. Int J Eng Res Technol. (2015) 4:702–6.

34. Satish C, Nandipati R, Chew X, Khaw KW. Polycystic Ovarian Syndrome (PCOS) classification and featur selection by machine learning techniques. Appl Math Comput Intelligence. (2020) 9:65–74.

Google Scholar

Keywords: support vector machines, fuzzy TOPSIS, fuzzy AHP, polycystic ovarian syndrome, mental health issues, machine learning, classifiers, fuzzy logic

Citation: Kodipalli A and Devi S (2021) Prediction of PCOS and Mental Health Using Fuzzy Inference and SVM. Front. Public Health 9:789569. doi: 10.3389/fpubh.2021.789569

Received: 05 October 2021; Accepted: 25 October 2021;
Published: 30 November 2021.

Edited by:

Steven Fernandes, Creighton University, United States

Reviewed by:

Dinesh Ramegowda, Samsung, India
Chandan Hegde, Surana Educational Institutes, India
Duddela Sai Prashanth, Sahyadri College of Engineering and Management, India

Copyright © 2021 Kodipalli and Devi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Ashwini Kodipalli, kashwini@iisc.ac.in

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.