- 1Department of Computer Science, College of Computer Science and Information Technology, King Faisal University, Hofuf, Saudi Arabia
- 2Department of Computer Science & Engineering, Gandhi Institute of Technology and Management (GITAM) (Deemed to be University), Visakhapatnam, Andhra Pradesh, India
- 3Department of Computer Science & Engineering, Anil Neerukonda Institute of Technology and Sciences, Visakhapatnam, Andhra Pradesh, India
- 4School of Science, Engineering and Environment, University of Salford, Manchester, United Kingdom
- 5Department of Electrical and Computer Engineering, Lebanese American University, Byblos, Lebanon
- 6Department of Computer Science, College of Computer and Information Sciences, Princess Nourah Bint Abdulrahman University, Riyadh, Saudi Arabia
- 7Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam Bin Abdulaziz University, Alkharj, Saudi Arabia
- 8School of Computer Science and Engineering, Lovely Professional University, Phagwara, Punjab, India
Introduction: Our research addresses the critical need for accurate segmentation in medical healthcare applications, particularly in lung nodule detection using Computed Tomography (CT). Our investigation focuses on determining the particle composition of lung nodules, a vital aspect of diagnosis and treatment planning.
Methods: Our model was trained and evaluated using several deep learning classifiers on the LUNA-16 dataset, achieving superior performance in terms of the Probabilistic Rand Index (PRI), Variation of Information (VOI), Region of Interest (ROI), Dice Coefficient, and Global Consistency Error (GCE).
Results: The evaluation demonstrated a high accuracy of 91.76% for parameter estimation, confirming the effectiveness of the proposed approach.
Discussion: Our investigation focuses on determining the particle composition of lung nodules, a vital aspect of diagnosis and treatment planning. To achieve this, we proposed a novel segmentation model to identify lung disease from CT scans. We proposed a learning architecture that combines U-Net with a two-parameter logistic distribution for accurate image segmentation; this hybrid model is called U-Net++ and leverages Contrast Limited Adaptive Histogram Equalization (CLAHE) on a set of 5,000 CT scan images.
1 Introduction
Lung cancer begins in the lungs and can spread to the rest of the body (1), including the brain. It is the most common cause of cancer-related mortality worldwide (2). Although lung cancer is far more frequent in smokers, it also occurs in nonsmokers (3), and the risk rises sharply with the extent of smoking; quitting lowers the risk even after many years of smoking. Segmentation is necessary to extract diagnostic information from medical images. Imaging modalities (4), including Magnetic Resonance Imaging (MRI) and Computed Tomography (CT), can be utilized to build Computer-Aided Diagnostic (CAD) (5) models that support diagnosis and treatment in precision medicine. Using a limited quantity of medical image data, we demonstrated the efficacy of our proposed model, which we refer to as U-NET++. The Dice coefficient loss was used to compute the findings of the investigations. This paper also presents an approach to label preprocessing that is consistent with existing methods.
The main novelty of this study is as follows.
• To propose a segmentation model for identifying lung disease from CT scans, trained on a limited set of CT images enhanced with CLAHE.
• To develop a learning architecture that combines U-Net with a two-parameter logistic distribution for image segmentation.
• To train the models using several deep learning classifiers and benchmark their performance on the LUNA-16 dataset using different information-retrieval metrics.
The remainder of this study is organized as follows. Section 2 reviews a considerable amount of important related research. Section 3 provides a comprehensive explanation of U-NET++, which combines the recommended two-parameter model with U-Net-style distribution learning for medical image segmentation. Section 4 presents the criteria used to evaluate the model's performance.
2 Related works
A meta-analysis of the literature was performed. Table 1 shows the literature matrix representation of this meta-analysis and the strong relationships between the authors and their respective works. CT scans were assessed based on image brightness. Different parts of the same region should have the same intensity; hence, segmentation is an effective method for separating objects. Various segmentation procedures were found to be useful in this study, including a three-step segmentation-based strategy for distinguishing lung regions.
First, the lung was segmented using gray-level thresholding. Dynamic programming was then used to divide the lung lobes. Finally, morphology-based smoothing approaches were employed. Region-based segmentation includes enlarging, dividing, and combining areas (17).
A novel convolutional network, U-NET++, was developed to analyze CT images used in the biological sciences. In this study, U-NET++ was used to extract lung fields from CT images. In healthcare applications, U-NET++ is essentially a ConvNet variant combined with various ad hoc data-augmentation methods.
The robustness of the model was compromised because the authors of (6–8) carried out their research using the same data. The traditional U-Net (9–16) is a semantic segmentation network built on a fully convolutional neural network. Although it has relatively few layers, the network still performs well while being less complex than its predecessors. The U-Net consists of two main components: a down-sampling path and an up-sampling path. Down-sampling, the feature-extraction stage, uses convolutional and pooling layers to obtain characteristics from the original image; a deconvolution technique is then employed to enrich the feature maps. This down-sampling/up-sampling structure is also known as an encoder-decoder structure.

During down-sampling, the original image passes through convolutional and pooling layers, generating feature maps that carry different levels of information and exhibit diverse levels of abstraction. Concatenating the down-sampled feature maps makes it possible to recover a larger portion of the abstract detail lost during training, which makes the network more effective at segmentation. During up-sampling, the deconvolution layers systematically increase the dimensions of the feature maps. However, because the lung is three-dimensional, a substantial amount of spatial information is lost during down-sampling. As retrieving all of this data is impractical, up-sampling yields imprecise outcomes and disregards fine visual detail. Beyond these concerns, a deeper network is necessary for further advancement. When U-NET++ was applied to a new dataset, the IoU and Dice coefficients improved, and the test results demonstrate that the U-NET++ architecture improves the efficiency of multiscale conversion and fully connected systems.

The authors in (18) proposed a novel approach for lung CT scan classification. They combined handcrafted features extracted using Q-deformed entropy (QDE), which captures image texture based on intensity variations, with features automatically learned by a Convolutional Neural Network (CNN). This fusion strategy aimed to improve the discrimination of healthy lungs from those affected by conditions such as COVID-19 or pneumonia (18). The approach demonstrated the benefits of combining handcrafted and automatically learned features: segmentation focused the model on relevant lung regions, and an LSTM network effectively utilized the fused features for accurate classification.
3 Materials and methods
3.1 U-NET++ architectural design
This study introduces the U-NET++ hybrid model, which utilizes a two-parameter logistic function to accurately identify lung nodules in CT scans. Lung CT scans were classified as “benign” or “malignant” when used as input to a binary classification system. A unique hybrid model that combines U-Net (19) with a two-parameter logistic distribution was developed to segment and diagnose lung cancer, and it was trained on the LUNA-16 lung CT image dataset. The U-NET++ model is highly regarded as a leading architecture in computer vision, primarily because it builds on established computer vision approaches; when assessed on the ImageNet test dataset, this model achieved a precision rate of 91%. The main architectural improvement over U-NET is the filter size. Figure 1 illustrates the architecture of the proposed model.
In this section, the combination of the two- and three-parameter logistic distribution models is presented in detail. Figure 2 shows the two-parameter U-NET++ logistic-type distribution. In general, pixel intensities are the medium through which image details are quantified over the various regions of an image. The brightness of an image depends on factors such as moisture in the surroundings, illumination, vision, and the surrounding environmental conditions, and it can be measured from the pixel values and pixel intensities. For instance, the intensity of pixel (a, b) is measured by the function z = f(a, b) and treated as a random variable. To better analyze the pixel intensities of various images, the model was designed for both the two-parameter and three-parameter cases. The pdf of the pixel intensity is given by

f(y; \mu, \sigma) = \frac{e^{-(y - \mu)/\sigma}}{\sigma \left(1 + e^{-(y - \mu)/\sigma}\right)^{2}}, \qquad -\infty < y < \infty,

where y is the pixel intensity, \mu is the mean of the pixel intensities, and \sigma is the scale parameter governing the variance of the segmented image's pixels (\mathrm{Var}(y) = \pi^{2}\sigma^{2}/3).
3.2 U-NET++ algorithm
3.2.1 U-Net with two-parameter logistic-type distribution
For updating, we differentiate R(Q, Q′) with respect to each parameter and equate the derivative to zero. The derivative was applied and implemented for both parameter models; for the two-parameter model, the estimation error was 0.001, and the estimation was biased. Equations 1–6 define the segmentation algorithm used in the proposed method, giving the updated parameter equations at the (l + 1)th iteration for the two-parameter case and the corresponding forms for the three-parameter logistic-type distribution.
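Because the closed-form update equations are not reproduced here, the following is only a minimal sketch of the parameter-estimation idea for a single region's pixel intensities: a method-of-moments start refined by maximum likelihood via scipy, with the 0.001 tolerance quoted above playing the role of the stopping criterion. The data below are synthetic stand-ins, not values from the paper.

```python
# Minimal sketch (assumption: standard two-parameter logistic distribution).
import numpy as np
from scipy.stats import logistic

rng = np.random.default_rng(0)
# Stand-in for one image region's pixel intensities.
pixels = rng.logistic(loc=120.0, scale=15.0, size=10_000)

# Method-of-moments initialization: Var[Y] = (pi**2 / 3) * s**2 for a logistic.
mu0 = pixels.mean()
s0 = np.sqrt(3.0 * pixels.var()) / np.pi

# scipy refines (mu, s) by maximizing the logistic log-likelihood; the
# initial guesses are supplied through the loc/scale keywords. In the paper's
# scheme, iteration stops when parameter updates change by less than 0.001.
mu_hat, s_hat = logistic.fit(pixels, loc=mu0, scale=s0)
print(f"estimated mean = {mu_hat:.3f}, scale = {s_hat:.3f}")
```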
3.3 Module design
Figure 3 presents the methodology followed in our proposed work. Contrast-limited adaptive histogram equalization (CLAHE) is a standard image-processing method. Plain adaptive histogram equalization makes smooth regions noisier, and CLAHE can likewise amplify noise in busy regions; CLAHE therefore limits the histogram contribution. Variation is a major issue in deep learning, so two labeling techniques are used to increase variety, and the center is matched to the background to reduce variation. This study employed the Dice coefficient loss function widely used in image segmentation. The experiments suggest that relabeling may outperform the initial marking when data are insufficient. Medical images are hard to classify and localize, and transferring knowledge from small datasets is widely acknowledged to be difficult; semi-supervised learning overcomes the naming issues of automatic labeling. The proposed study successfully locates the lung using ROI segmentation from CT scans, and the results suggest that the ROI segmentation model applied during data processing can find lung tumors.
4 Model parameters and discussions
4.1 LUNA-16 dataset
A total of 5,000 CT scans were obtained from LUNA-16. Four expert radiologists annotated the images in the LIDC/IDRI database over 2 years (20–22). Each radiologist labeled findings as non-nodules, nodules with a diameter of <3 mm, or nodules with a diameter of ≥3 mm (23). This article examines the annotation process in detail. Nodules larger than 3 mm in diameter must be identified by at least three of the four radiologists (24). Non-standard findings (non-nodules, nodules <3 mm, and nodules annotated by only one or two radiologists) were excluded. Table 2 shows various illustrations of nodules in the LUNA-16 dataset.
Table 3 presents various feature extraction values obtained from the LUNA-16 database. A nodule has a wide range of characteristics; malignancy is one example. The coordinates of the nodule's outline are estimated, whereas the area surrounding the nodule is often underestimated. Lobulation refers to the configuration and attributes of a nodule. A nodule's diameter, measured in millimeters, determines its length, and the border of the nodule delimits a transparent region.
Table 4 describes the dataset used in our study. We compiled a custom LUNA-16 dataset by combining annotated lung CT scans from various sources, including the LIDC-IDRI dataset. The dataset comprises 5,000 annotated CT scan slices, each with a resolution of 512 × 512 pixels. The images were annotated by expert radiologists using semi-automated tools, ensuring high-quality labels for training and evaluation.
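As an illustration of the inclusion rule described above, the sketch below assumes the standard LUNA-16 annotations.csv layout (seriesuid, coordX, coordY, coordZ, diameter_mm) and keeps nodules of at least 3 mm; the file path is an assumption.

```python
# Illustrative sketch of filtering LUNA-16 annotations by nodule diameter.
import pandas as pd

ann = pd.read_csv("annotations.csv")  # assumed standard LUNA-16 layout

# Keep nodules >= 3 mm, the class that at least three of the four
# radiologists must agree on in the reference standard.
nodules = ann[ann["diameter_mm"] >= 3.0]
print(f"{len(nodules)} nodules >= 3 mm across "
      f"{nodules['seriesuid'].nunique()} scans")
```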
4.2 Study design
Three categories of data were created: training, validation, and testing. We built a model, tuned it using the validation data, and then tested it. This process is repeated until a firm understanding of how the model behaves in real-world scenarios is obtained. Average pooling is allowed, and the size of the final output is expanded using filter layers. We examined our test data to determine what we could learn from it in order to enhance the model. Because the test dataset is used neither for training nor for tuning, it can be used only once per session. The two-parameter and three-parameter mixtures generate a model using a single test dataset, which significantly reduces the time and effort required. Figure 4 illustrates the study design.
4.3 Split and pre-process data
JPEG serves as the data transport format in our architecture, in the same way as DICOM. The NIfTI format, developed by the Neuroimaging Informatics Technology Initiative (25, 26), originated in neuroimaging but is now commonly used for brain and other medical imaging. By storing the coordinate transform, it is possible to relate voxel indices (i, j, k) to positions (x, y, z) in space. Each data scan may provide three-dimensional medical images comprising 128 × 128 slices of varying thicknesses. Additional RAM is required to store the data in the DICOM format.
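A minimal sketch of this (i, j, k) to (x, y, z) mapping, using the affine stored in a NIfTI header via nibabel; the file name is hypothetical.

```python
# Voxel-to-world mapping via the NIfTI affine.
import numpy as np
import nibabel as nib

img = nib.load("ct_scan.nii.gz")          # hypothetical NIfTI volume
affine = img.affine                        # 4 x 4 voxel-to-world matrix

ijk = np.array([64, 64, 40, 1.0])          # homogeneous voxel index (i, j, k)
xyz = affine @ ijk                         # position (x, y, z) in millimeters
print(xyz[:3])

ijk_back = np.linalg.inv(affine) @ xyz     # inverse recovers the voxel index
```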
Contrast Limited Adaptive Histogram Equalization (CLAHE) contributes to the enhancement of CT scan quality by emphasizing contrast and visual detail. The Hounsfield window settings were a center of −600 with a width of 1,500 for the lung window, and a center of 50 with a width of 400 for soft tissue. The lung window is therefore the Hounsfield range most frequently used for lung image diagnosis. As shown in Table 1, the Hounsfield values of the various body components were dispersed. Following sampling, the objective was to compress each snapshot to preserve memory. Standardization is the next step, reducing computing costs. Subsequently, CLAHE was used to enhance nodule contrast and visibility.
Contrast-limited adaptive histogram equalization (CLAHE) has long been used in image processing. It is used in place of adaptive histogram equalization (AHE) (27) because standard adaptive equalization may amplify noise in ordinarily homogeneous areas of the image, where the histogram tends to concentrate. CLAHE can still enhance noise in locations where the image is almost uniform. In Figure 5, the LUNA-16 dataset is preprocessed using the Wiener filter and CLAHE.
The CLAHE approach can be used to decrease the histogram concentration. When utilizing CLAHE, the concentrated histogram component was retained, while the excess above the clip limit was redistributed equally across all histogram bins. The Wiener filter is an extremely successful technique for visual noise reduction; the PET/CT scans were afflicted with additive noise of constant intensity. Figure 6 shows an example: the original CT scan image, the same image after CLAHE, and the result of CLAHE followed by the Wiener filter.
Figure 6. From left to right, the effect of combining CLAHE with the Wiener filter. (A) Original CT scan image. (B) CT image after CLAHE. (C) CLAHE followed by the Wiener filter.
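A preprocessing sketch matching Figure 6, applying CLAHE and then a Wiener filter; the clip limit, tile size, window size, and file names are illustrative assumptions rather than values reported here.

```python
# CLAHE for local contrast, then a Wiener filter for noise reduction.
import cv2
import numpy as np
from scipy.signal import wiener

ct = cv2.imread("slice.png", cv2.IMREAD_GRAYSCALE)          # hypothetical 8-bit slice

clahe = cv2.createCLAHE(clipLimit=2.0, tileGridSize=(8, 8))  # assumed settings
enhanced = clahe.apply(ct)                                   # contrast-limited AHE

denoised = wiener(enhanced.astype(np.float64), mysize=5)     # adaptive smoothing
denoised = np.clip(denoised, 0, 255).astype(np.uint8)
cv2.imwrite("slice_clahe_wiener.png", denoised)
```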
4.4 Architecture and implementation
The lung segmentation method utilized in this study used 5,000 lung CT scan images and their masks. Each CT scan image has a resolution of 128 × 128 pixels. The images are black and white, and the final output is a segmented lung. The technique begins with the data being loaded into memory and each image being resized to 32 × 32 pixels; shrinking the images accelerates processing. The images were corrected after rescaling. Subsequently, the dataset was partitioned into a 70 percent training set and a 30 percent test set. Rotation was performed to increase the number of training samples, with eight rotated copies generated per training sample. As shown in Table 5, U-NET++ is composed of layer blocks that first contract and then expand. The augmented dataset was used to define the input layer, which is followed by convolution, non-linearity, and down-sampling layers: convolution applies a filter, a non-linearity follows, and max-pooling reduces the image size. The image is concatenated by applying similar layers in contracting and expanding patterns and is then up-sampled to make it larger. The output layer provides a lung segmentation image. After all layers have been trained, the U-Net ConvNet is created (28). For example, using Adam as the optimizer, the dropout was set to 0.5, epochs were set to 10, and steps per epoch were set to 200 (29). Each layer, similar to the model architecture, has its own set of filters. We examined the performance of the U-Net ConvNet on test data.
Table 5 has five columns: the layer name, the number of filters, the filter type/size, the dimension, and the concatenated layers. Eleven convolutional layers were used. The input layer is the first layer, accepting 32 × 32-pixel images. The Con1 layer uses eight 3 × 3 filters, and the image size remains unchanged. Con1 feeds into the subsequent convolutional layers, and ReLU layers follow the convolutional layers. A sketch of the resizing, splitting, and rotation-augmentation pipeline described above follows.
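The sketch below resizes slices to 32 × 32, splits 70/30, and adds eight rotated copies of each training sample. The rotation angles (multiples of 45 degrees) are an assumption, as the paper does not list them; the arrays are synthetic placeholders.

```python
# Data-preparation sketch: resize, split, and rotation augmentation.
import numpy as np
import cv2
from sklearn.model_selection import train_test_split

# Placeholders standing in for the 128 x 128 slices and masks.
images_128 = [np.random.randint(0, 256, (128, 128), dtype=np.uint8) for _ in range(10)]
masks_128 = [np.zeros((128, 128), dtype=np.uint8) for _ in range(10)]

images = [cv2.resize(x, (32, 32)) for x in images_128]
masks = [cv2.resize(m, (32, 32), interpolation=cv2.INTER_NEAREST) for m in masks_128]

def rotations(img, n=8):
    """Return n copies of img rotated in equal 360/n-degree steps."""
    h, w = img.shape[:2]
    return [cv2.warpAffine(img,
                           cv2.getRotationMatrix2D((w / 2, h / 2), k * 360.0 / n, 1.0),
                           (w, h))
            for k in range(n)]

x_tr, x_te, y_tr, y_te = train_test_split(images, masks, test_size=0.3, random_state=0)
x_aug = [r for img in x_tr for r in rotations(img)]   # 8 copies per sample
y_aug = [r for msk in y_tr for r in rotations(msk)]
```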
4.4.1 Simulation settings
To facilitate the replication of our work, we provide a detailed description of the simulation settings and the dataset used. This information includes hardware and software configurations, data preprocessing steps, and hyperparameter settings.
The simulation settings outlined in Table 6 provide comprehensive details on the hardware and software environment used. Our setup included an Intel Core i9-10900K CPU and an NVIDIA GeForce RTX 3090 GPU, ensuring sufficient computational power for training deep learning models. We used Ubuntu 20.04 LTS as the operating system, with Python 3.8 and TensorFlow 2.4 for model development and training.
Table 7 details the hyperparameters and model configuration. We implemented a U-NET++ with 20 layers, using 3 × 3 kernels and 2 × 2 max-pooling layers. The ReLU activation function was used throughout the network, with a sigmoid activation in the output layer for binary segmentation (30). A dropout rate of 0.5 and L2 regularization were applied to prevent overfitting.
These settings and configurations provide a robust framework for replicating our lung cancer segmentation model and can serve as a foundation for further research and development in this domain.
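As a rough illustration of these settings, the following Keras sketch wires a small U-Net-style encoder-decoder with the quoted choices (3 × 3 kernels, 2 × 2 max pooling, ReLU, sigmoid output, dropout 0.5, L2 regularization, Adam). It is a simplified stand-in, not the authors' exact 20-layer U-NET++ topology, and the filter counts and L2 strength are assumptions.

```python
# Simplified U-Net-style sketch consistent with Tables 6-7 (assumptions noted).
import tensorflow as tf
from tensorflow.keras import layers, regularizers

def conv_block(x, filters):
    # Two 3 x 3 convolutions with ReLU and L2 regularization (strength assumed).
    for _ in range(2):
        x = layers.Conv2D(filters, 3, padding="same", activation="relu",
                          kernel_regularizer=regularizers.l2(1e-4))(x)
    return x

inputs = layers.Input((32, 32, 1))
c1 = conv_block(inputs, 8)
p1 = layers.MaxPooling2D(2)(c1)            # 2 x 2 max pooling
c2 = conv_block(p1, 16)
p2 = layers.MaxPooling2D(2)(c2)

b = layers.Dropout(0.5)(conv_block(p2, 32))          # bottleneck with dropout

u2 = layers.UpSampling2D(2)(b)
c3 = conv_block(layers.Concatenate()([u2, c2]), 16)  # skip connection
u1 = layers.UpSampling2D(2)(c3)
c4 = conv_block(layers.Concatenate()([u1, c1]), 8)

outputs = layers.Conv2D(1, 1, activation="sigmoid")(c4)  # binary lung mask
model = tf.keras.Model(inputs, outputs)
model.compile(optimizer="adam", loss="binary_crossentropy")
```

Training with model.fit(data, epochs=10, steps_per_epoch=200) would mirror the settings quoted in Section 4.4.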
4.5 Training process
The loss function is based on the Dice coefficient. The Dice coefficient is frequently used to segment medical images, as shown in Figure 7, and is often used to compare two samples. This experiment generated sufficient compelling evidence to be deemed conclusive.
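For reference, a common form of the Dice coefficient loss for binary masks is sketched below; the smoothing constant is an assumption added to avoid division by zero, and the paper does not state its exact formulation.

```python
# Dice-coefficient loss for binary segmentation (smoothing term is assumed).
import tensorflow as tf

def dice_loss(y_true, y_pred, smooth=1.0):
    y_true = tf.reshape(y_true, [-1])
    y_pred = tf.reshape(y_pred, [-1])
    intersection = tf.reduce_sum(y_true * y_pred)
    dice = (2.0 * intersection + smooth) / (
        tf.reduce_sum(y_true) + tf.reduce_sum(y_pred) + smooth)
    return 1.0 - dice
```

Passing dice_loss to model.compile(optimizer="adam", loss=dice_loss) would plug it into the training setup described above.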
This research is mostly concerned with two-dimensional images, which can substantially reduce cost in the long term. One reason is graphics processing unit (GPU) limitations: owing to memory constraints, most GPUs have difficulty training 3D models. Both 2D and 3D models are available in various formats. We break down our findings into different segmentation strategies, with an emphasis on unbalanced and small datasets. In addition, the model training process converged in 200 epochs. The confusion matrix can be used to evaluate real-world data and calculate metrics such as accuracy, sensitivity, and specificity. As shown in Figure 6, the testing loss is approximately 0.4 without preprocessing, whereas with CLAHE and Wiener filtering it may be as low as 0.1.
5 Results discussion and comparison with other models
The results were enhanced by using the ROI segmentation method, which appears to address the problem of the model's inaccurate label positioning. Consequently, following the recommended methodology may lead to lower losses, although the training sessions were also observed to slow down. Training effectiveness improves, as shown in Figure 8. The same treatment should be applied to both one-dimensional and two-dimensional data; the objective of this strategy is to eliminate labeling errors in both directions. Over time, there was a gradual reduction in the size of each point.
Figure 8. Ground-truth predictions before and during the segmentation procedure in each of these instances.
If the dataset is insufficient, it may be necessary to gather more labels. Overall, there were 159 cancerous tumors, and the standard deviation of the Dice coefficient was 0.2. Although the model had a low mFPI, the DL-based model successfully detected lung tumors from chest X-rays; the results are shown in Figure 8. The evaluations of the proposed models are presented in Table 8.
Table 8. Evaluation report of different lung nodule semantic segmentation methods compared with our proposed algorithm.
TensorFlow was used to evaluate the effectiveness of the U-NET++ approach for segmenting lung tumors, with the assistance of an image segmentation examiner. Images from LUNA-16 were used for the segmentation process. The results for the two-parameter logistic distributions are shown in Table 9; based on this information, the image pixel intensities are presumed to follow a mixture of two-parameter logistic-type distributions.
The pixel intensities in each of the k sectors of the image were assumed to follow a two-parameter logistic distribution with its own parameters, reflecting the fact that a picture is made up of distinct regions. The histogram of pixel intensities was analyzed to estimate the segment count for each CT scan image used in the experiment. Histograms of the pixel intensities observed in the CT scan images are shown in Figure 9.
Figure 9. Pixel intensities derived from CT scan images of benign and malignant lung nodules. (A) Malignant tumor. (B) Benign tumor. (C) Radius mean for benign and malignant tumors.
Typically, malignant tumors have higher average radius values than benign tumors, as seen in the histograms and bar graphs. The average radius of malignant tumors is 20.1020, whereas benign tumors have an average radius of 11.3286. These data indicate the difference in average radius between benign and malignant tumor types.
5.1 Visualization of the model
After examining the data, we found a correspondence, shown in Table 10, between the performance of the proposed method and other representations of the same results. To determine how well the U-NET++ model segmented the LUNA-16 trial dataset, five radiologists were used for comparison as expert readers; the radiologists achieved a segmentation accuracy of 81.26%. The U-NET++ model was also compared with the U-Net model and many other benchmark models, such as the newer ResNet152V2.
Table 10. Comparing the proposed model's quantitative segmentation results to well-established benchmark models.
The number of nodules, the Dice coefficient value index, and their distribution are presented in Figure 10. This allowed the U-NET++ model to be tested on a test set. It is standard to number each nodule and place it within a test-set trial.
Duan et al. (23) employed a U-NET architecture with advanced deep learning techniques, resulting in a Dice coefficient of 0.88. Similarly, Duan et al. (23) utilized V-NET incorporating 3D convolutional layers, achieving a Dice coefficient of 0.90. The method by Petit et al. (25) leveraged transformer networks, while Ali et al. (26) utilized EfficientNet for a more parameter-efficient approach.
Table 11 shows a numerical comparison of the new U-NET++ method with three other deep learning models, U-Net (7), NU-Net (6), and WU-Net (12), using CT images of lung nodules from a publicly available dataset. The proposed method outperforms the average method for segmenting images of lung nodules.
Table 11. Quantitative evaluation of lung cancer segmentation methods based on key performance metrics, model architectures and unique features.
We used Fisher's least significant difference (LSD) method in SPSS to analyze the numerical results and test whether the proposed method (Table 12) is effective. The LSD test shows that the proposed method outperforms standard methods in terms of IoU, recall, precision, and F1-score (p < 0.001).
After preprocessing, Figure 11A shows the clustered image; Figure 11B shows the identified lung tumors; Figure 11C shows the results of segmenting the lung tumors into whole pieces; Figure 11D shows the lung tumor localization findings; Figure 11E shows the effect of segmentation on a specific lung tumor region; and Figure 11F shows a lung tumor that was accurately segmented.
Figure 11. Utilizing the provided approach, we performed visual segmentation of heterogeneous lung nodules. (A) Clustered image. (B) Segmented image. (C) Extracted image. (D) Extracted image with nodules localizations. (E) Nodule capture. (F) Nodule region highlighted.
Our model, built on a U-NET++ architecture, demonstrated a baseline performance with a Dice coefficient of 91.76% and an IoU of 89.78%. Recent methods, such as the Swin Transformer (27), achieved higher performance metrics through the use of advanced architectures and techniques.
The images in Figure 12 show that a DSC value of at least 0.8 can be trusted for most tumors. The Dice index results were compared with the U-NET++ architecture's specific performance to verify the model's results. The Dice similarity coefficient (DSC) for the U-NET++ model was 90.84%, an unusually high level of success. Because it has fewer parameters than the original U-NET design, the U-NET++ model can effectively separate features and divide them into groups.
The ROC curves in Figure 11 demonstrate that radiologists can obtain much greater specificity (i.e., lower false-positive rates) with little impact on sensitivity (31). By tightening the criterion for a positive screen among individuals recommended to undergo repeat computed tomography (CT) scans, it is possible to achieve a specificity of 92.4% while slightly decreasing sensitivity to 86.9%.
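As an illustration of reading such an operating point off an ROC curve, the sketch below picks the threshold closest to 92.4% specificity and reports the corresponding sensitivity; the scores and labels are synthetic, not the study's data.

```python
# Operating-point selection on an ROC curve (synthetic data).
import numpy as np
from sklearn.metrics import roc_curve

rng = np.random.default_rng(1)
labels = rng.integers(0, 2, 1000)
scores = labels * 0.35 + rng.normal(0.4, 0.2, 1000)   # stand-in model outputs

fpr, tpr, thresholds = roc_curve(labels, scores)
idx = np.argmin(np.abs((1 - fpr) - 0.924))            # closest to 92.4% specificity
print(f"threshold={thresholds[idx]:.3f}, sensitivity={tpr[idx]:.3f}")
```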
6 Conclusions and future work
Lung segmentation is necessary for the effective diagnosis and identification of lung disorders. There has been a surge of lung segmentation research in recent years, all aimed at improving accuracy. To identify and categorize lung illnesses, automated analysis of a CT scan must first segment the lung. The precision with which lung segmentation can be performed has been the subject of several studies, applying both deep learning algorithms and basic thresholding approaches. U-NET++ is particularly effective in separating cells and neurons from images acquired using a PET scan. In this study, U-NET++ was used for lung segmentation, achieving an accuracy of 91%. The original purpose of U-NET++ was to segment small images, and the lungs were effectively segmented using CT images that were reduced from 128 × 128 to 32 × 32 pixels. There were 25 convolutional layers in total in this network. Training U-NET++ on the original 128 × 128 image size is considerably more accurate, and the convolutional layers may be enlarged to enhance the accuracy of the filters.
Data availability statement
The dataset used for the findings will be shared by the corresponding authors upon reasonable request.
Author contributions
EA: Investigation, Software, Writing – original draft, Writing – review & editing. EN: Methodology, Software, Supervision, Writing – original draft, Writing – review & editing. NJ: Formal analysis, Project administration, Supervision, Writing – original draft, Writing – review & editing. SB: Project administration, Supervision, Validation, Writing – original draft, Writing – review & editing. HS: Resources, Validation, Writing – original draft, Writing – review & editing. SA: Conceptualization, Methodology, Visualization, Writing – original draft, Writing – review & editing. JN: Resources, Validation, Visualization, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was supported by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2024R135), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia.
Acknowledgments
This work was supported by Princess Nourah bint Abdulrahman University Researchers Supporting Project number (PNURSP2024R135), Princess Nourah bint Abdulrahman University, Riyadh, Saudi Arabia. This work was supported by the Deanship of Scientific Research, Vice President for Graduate Studies and Scientific Research, King Faisal University, Saudi Arabia [Grant A013].
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
References
1. Drokin I, Ericheva E. End-to-end lung nodule detection framework with model-based feature projection block. In: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (2021). p. 91–100. doi: 10.1007/978-3-030-87589-3_10
2. Shirokikh B, Shevtsov A, Dalechina A, Krivov E, Kostjuchenko V, Golanov A, et al. Accelerating 3D medical image segmentation by adaptive small-scale target localization. J Imag. (2021) 7:35. doi: 10.3390/jimaging7020035
3. Chen W, Yang F, Zhang X, Xu X, Qiao X. MAU-Net: multiple attention 3D U-Net for lung cancer segmentation on CT images. Procedia Comput Sci. (2021) 192:543–52. doi: 10.1016/j.procs.2021.08.056
4. Blanc D, Racine V, Khalil A, Deloche M, Broyelle J-A, Hammouamri I, et al. Artificial intelligence solution to classify pulmonary nodules on CT. Diagn Interv Imaging. (2020) 101:803–10. doi: 10.1016/j.diii.2020.10.004
5. Zhao Y, Chen S, Shen F, Long D, Yu T, Wu M, et al. In vitro neutralization of autocrine IL-10 affects Op18/stathmin signaling in non-small cell lung cancer cells. Oncol Rep. (2019) 41:501–11. doi: 10.3892/or.2018.6795
6. Huang W, Hu L. Using a noisy U-Net for detecting lung nodule candidates. IEEE Access. (2019) 7:67905–15. doi: 10.1109/ACCESS.2019.2918224
7. Zhao C, Han J, Jia Y, Gou F. Lung nodule detection via 3D U-Net and contextual convolutional neural network. In: 2018 International Conference on Networking and Network Applications (NaNA), IEEE (2018). p. 356–361. doi: 10.1109/NANA.2018.8648753
8. Chiu T-W, Tsai Y-L, Su S-F. Automatic detection of lung nodes with deep learning in segmentation and imbalanced data labeling. Sci Rep. (2021) 11:11174. doi: 10.1038/s41598-021-90599-4
9. Gao R, Lin Q, Man Z, Cao Y. Automatic lesion segmentation of metastases in SPECT images using U-Net-based model. In: 2nd International Conference on Signal Image Processing and Communication (ICSIPC 2022) (2022). doi: 10.1117/12.2643560
10. Cai Y, Chen R, Liu H, Peng Z, Yuan Y, Lu Y, et al. Lung nodule CT medical image analysis based on deep learning. In: AOPC 2022: AI in Optics and Photonics (2023). doi: 10.1117/12.2652067
11. Banu SF, Sarker MM, Abdel-Nasser M, Rashwan HA, Puig D. WEU-Net: weight excitation U-Net for lung nodule segmentation. In: Artificial Intelligence Research and Development (2021). doi: 10.3233/FAIA210154
12. Xia L. Auxiliary diagnosis of lung cancer with magnetic resonance imaging data under deep learning. Comput Math Methods Med. (2022) 2022:1994082. doi: 10.1155/2022/1994082
13. Chhabra M, Ravulakollu KK, Kumar M, Sharma A, Nayyar A. Improving automated latent fingerprint detection and segmentation using deep convolutional neural network. Neural Comput Applic. (2023) 35:6471–97. doi: 10.1007/s00521-022-07894-y
14. Venkatesh C, Chinna Babu J, Kiran A, Nagaraju CH, Kumar M. A hybrid model for lung cancer prediction using patch processing and deep learning on CT images. Multimed Tools Appl. (2024) 83:43931–52. doi: 10.1007/s11042-023-17349-8
15. Madhu G, Kautish S, Gupta Y, Nagachandrika G, Biju SM, Kumar M. XCovNet: An optimized xception convolutional neural network for classification of COVID-19 from point-of-care lung ultrasound images. Multimed Tools Appl. (2024) 83:33653–74. doi: 10.1007/s11042-023-16944-z
16. Lamba M, Munjal G, Gigras Y, Kumar M. Breast cancer prediction and categorization in the molecular era of histologic grade. Multimed Tools Appl. (2023) 82:29629–48. doi: 10.1007/s11042-023-14918-9
17. Hesamian MH, Jia W, He X, Kennedy P. Deep learning techniques for medical image segmentation: achievements and challenges. J Digit Imaging. (2019) 32:582–96. doi: 10.1007/s10278-019-00227-x
18. Hasan AM, Al-Jawad MM, Jalab HA, Shaiba H, Ibrahim RW, Al-Shamasneh AR. Classification of Covid-19 coronavirus, pneumonia and healthy lungs in CT Scans using Q-deformed entropy and deep learning features. Entropy. (2020) 22:517. doi: 10.3390/e22050517
19. Seo H, Badiei Khuzani M, Vasudevan V, Huang C, Ren H, Xiao R, et al. Machine learning techniques for biomedical image segmentation: an overview of technical aspects and introduction to state-of-art applications. Med Phys. (2020) 47:e148–e167. doi: 10.1002/mp.13649
20. Huang H, Lu RF, Tao JL, Li Y, Zhang JQ. Segmentation of lung nodules in CT images using improved U-Net++. Acta Photonica Sinica. (2021) 50:73–83. doi: 10.3788/gzxb20215002.0210001
21. Jeong EJ, Choi D, Lee DW. U-Net deep-learning-based 3D cell counter for the quality control of 3D cell-based assays through seed cell measurement. SLAS Technol. (2021) 26:468–476. doi: 10.1177/24726303211017532
22. Ji S, Xiao Z. Integrated context and multi-scale features in thoracic organs segmentation. J Image Graph. (2021) 26:2135–2145. doi: 10.32604/cmc.2022.020561
23. Duan X, Xie D, Zhang R, Li X, Sun J, Qian C, et al. A novel robotic bronchoscope system for navigation and biopsy of pulmonary lesions. Cyborg Bionic Syst. (2023) 4:0013. doi: 10.34133/cbsystems.0013
24. Cui Y, Wang X, Lin F, Li W, Zhao Y, Zhu F, et al. MiR-29a-3p improves acute lung injury by reducing alveolar epithelial cell PANoptosis. Aging Dis. (2022) 13:899–909. doi: 10.14336/AD.2021.1023
25. Petit O, Thome N, Rambour C, Themyr L, Collins T, Soler L. U-net transformer: Self and cross attention for medical image segmentation. In: Machine Learning in Medical Imaging: 12th International Workshop, MLMI 2021, Held in Conjunction with MICCAI 2021, Strasbourg, France, September 27, 2021, Proceedings 12, Springer International Publishing (2021). p. 267–276. doi: 10.1007/978-3-030-87589-3_28
26. Ali Z, Irtaza A, Maqsood M. An efficient U-net framework for lung nodule detection using densely connected dilated convolutions. J Supercomput. (2022) 78:1602–23. doi: 10.1007/s11227-021-03845-x
27. Ronneberger O, Fischer P, Brox T. U-Net: convolutional networks for biomedical image segmentation. In: Navab N, Hornegger J, Wells W, Frangi A, editors. Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015. MICCAI 2015. Lecture Notes in Computer Science. Cham: Springer (2015). doi: 10.1007/978-3-319-24574-4_28
28. Zheng W, Lu S, Yang Y, Yin Z, Yin L, Ali H. Lightweight transformer image feature extraction network. PeerJ Comput Sci. (2024) 10:e1755. doi: 10.7717/peerj-cs.1755
29. Yang C, Sheng D, Yang B, Zheng W, Liu C. A dual-domain diffusion model for sparse-view CT reconstruction. In: IEEE Signal Processing Letters (2024). doi: 10.36227/techrxiv.23726703.v2
30. Zheng J, Yue R, Yang R, Wu Q, Wu Y, Huang M, et al. Visualization of zika virus infection via a light-initiated bio-orthogonal cycloaddition labeling strategy. Front Bioeng Biotechnol. (2022) 10:1051. doi: 10.3389/fbioe.2022.940511
Keywords: image segmentation, two-parameter logistic type distribution, performance evaluation, CLAHE, ROI segmentation, lung cancer detection
Citation: Albalawi E, Neal Joshua ES, Joys NM, Bhatia Khan S, Shaiba H, Ahmad S and Nazeer J (2024) Hybrid healthcare unit recommendation system using computational techniques with lung cancer segmentation. Front. Med. 11:1429291. doi: 10.3389/fmed.2024.1429291
Received: 07 May 2024; Accepted: 26 June 2024;
Published: 19 July 2024.
Edited by:
Udhaya Kumar, Baylor College of Medicine, United States
Reviewed by:
Manoj Kumar, University of Wollongong in Dubai, United Arab Emirates
Suci Aulia, Telkom University, Indonesia
Copyright © 2024 Albalawi, Neal Joshua, Joys, Bhatia Khan, Shaiba, Ahmad and Nazeer. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Sultan Ahmad, s.alisher@psau.edu.sa; Eid Albalawi, elbalawi@kfu.edu.sa