Prediction of single pulmonary nodule growth by CT radiomics and clinical features — a one-year follow-up study

Yang, Ran; Hui, Dongming; Li, Xing; Wang, Kun; Li, Caiyong; Li, Zhichao

doi:10.3389/fonc.2022.1034817

ORIGINAL RESEARCH article

Front. Oncol., 28 October 2022

Sec. Cancer Imaging and Image-directed Interventions

Volume 12 - 2022 | https://doi.org/10.3389/fonc.2022.1034817

This article is part of the Research TopicIncorporation of Texture Analysis in Diagnosing and Characterizing CancerView all 12 articles

Prediction of single pulmonary nodule growth by CT radiomics and clinical features — a one-year follow-up study

Ran Yang^1†

Dongming Hui^1†

Xing Li²

Kun Wang²

Caiyong Li^2*

Zhichao Li^1*

¹Department of Radiology, Second People’s Hospital of JiuLongPo District, Chongqing, China
²Department of Radiology, Chongqing Western Hospital, Chongqing, China

Background: With the development of imaging technology, an increasing number of pulmonary nodules have been found. Some pulmonary nodules may gradually grow and develop into lung cancer, while others may remain stable for many years. Accurately predicting the growth of pulmonary nodules in advance is of great clinical significance for early treatment. The purpose of this study was to establish a predictive model using radiomics and to study its value in predicting the growth of pulmonary nodules.

Materials and methods: According to the inclusion and exclusion criteria, 228 pulmonary nodules in 228 subjects were included in the study. During the one-year follow-up, 69 nodules grew larger, and 159 nodules remained stable. All the nodules were randomly divided into the training group and validation group in a proportion of 7:3. For the training data set, the t test, Chi-square test and Fisher exact test were used to analyze the sex, age and nodule location of the growth group and stable group. Two radiologists independently delineated the ROIs of the nodules to extract the radiomics characteristics using Pyradiomics. After dimension reduction by the LASSO algorithm, logistic regression analysis was performed on age and ten selected radiological features, and a prediction model was established and tested in the validation group. SVM, RF, MLP and AdaBoost models were also established, and the prediction effect was evaluated by ROC analysis.

Results: There was a significant difference in age between the growth group and the stable group (P < 0.05), but there was no significant difference in sex or nodule location (P > 0.05). The interclass correlation coefficients between the two observers were > 0.75. After dimension reduction by the LASSO algorithm, ten radiomic features were selected, including two shape-based features, one gray-level-cooccurence-matrix (GLCM), one first-order feature, one gray-level-run-length-matrix (GLRLM), three gray-level-dependence-matrix (GLDM) and two gray-level-size-zone-matrix (GLSZM). The logistic regression model combining age and radiomics features achieved an AUC of 0.87 and an accuracy of 0.82 in the training group and an AUC of 0.82 and an accuracy of 0.84 in the verification group for the prediction of nodule growth. For nonlinear models, in the training group, the AUCs of the SVM, RF, MLP and boost models were 0.95, 1.0, 1.0 and 1.0, respectively. In the validation group, the AUCs of the SVM, RF, MLP and boost models were 0.81, 0.77, 0.81, and 0.71, respectively.

Conclusions: In this study, we established several machine learning models that can successfully predict the growth of pulmonary nodules within one year. The logistic regression model combining age and imaging parameters has the best accuracy and generalization. This model is very helpful for the early treatment of pulmonary nodules and has important clinical significance.

Introduction

According to the glossary of terms proposed by the Fleischner Society, a pulmonary nodule is defined as an approximately rounded opacity with a diameter of less than 3 cm (1). Recently, an increasing number of pulmonary nodules have been found during screening. Studies have shown that approximately 12.0% of the US population has incidental pulmonary nodules (2). Pulmonary nodules may develop into lung cancer. A total of 2.27% of incidental pulmonary nodules developed into lung cancer during a 2-year follow-up (2). According to data from the World Health Organization (3), lung cancer was the leading cause of cancer death, with 1.8 million deaths in 2020. Early diagnosis can greatly help with treatment (4) and improve the prognosis of millions of patients.

However, for single pulmonary nodules, there are many difficulties in the selection of treatment methods and operation time. Several societies, such as The American College of Chest Physicians (5), The British Thoracic Society (6), and The Fleischner Society of the United States (7–9), have developed guidelines for the management of pulmonary nodules. The American College of Radiology has also developed a structured report template (Lung-RADS) based on the needs of diagnostic radiology practice (10). These guidelines provide recommendations for the management of pulmonary nodules according to the classification of risk factors and nodule morphology. For different types of nodules, it is recommended to carry out a second CT test at different intervals, and further treatment is determined according to the dynamic changes of nodules. The practice intervals recommended by these guidelines currently depend solely on the size of the nodules. For example, the Fleischner Society’s 2017 guideline (9) recommends review after 12 months for solid nodules smaller than 6 mm and within 3-6 months for partially solid and ground-glass nodules larger than 6 mm. If the growth of nodules can be predicted in advance, the review interval can be adjusted according to the predicted results and biopsy/surgical pathology can be conducted earlier and improve the prognosis of patients.

Conventional HRCT can reflect the size and general morphology of nodules but cannot provide depth information based on the visual information. Radiomics was proposed by Philippe Lambin in 2011. It refers to an automated and repeatable analysis that uses a high-throughput method to extract a large number of image features from radiographs (11). Since the concept of radiomics emerged, it has been widely used in the identification, grading, efficacy evaluation and prognostics of various tumors (12–15). For example, radiomics has been successfully used to distinguish benign and malignant pulmonary nodules (16, 17). Yu et al. also developed a transfer learning radiomics (TLR) model for the prediction of lymph node metastasis of papillary thyroid carcinoma and achieved high accuracy (18). However, until now, there has been no study to predict the growth of pulmonary nodules in one year using radiomics.

In this study, we intended to collect more than 200 patients with incidental pulmonary nodules and to follow up with them for one year to observe the dynamic changes in the nodules. After that, the correlation between the high-throughput features extracted by radiomics and the growth of pulmonary nodules was then analyzed. On this basis, a model was proposed to predict whether nodules are likely to grow within one year. This model can help doctors operate on dangerous nodules in time and reduce the number of re-examinations for stable nodules.

Materials and methods

Patients

From Jan 2020 to Dec 2021, a total of 314 patients from the Second People’s Hospital of JiuLongPo District and Chongqing Western Hospital were involved, and all of them were followed up for one year. This study was approved by the ethics committees of the two hospitals. As a retrospective analysis, the informed consent requirement was waived.

The inclusion criteria were as follows: (a) patients with high-resolution chest CT images at baseline and at the one-year follow-up. (c) The nodule was solitary, and the baseline diameter of pulmonary nodules was ≥3 mm and ≤20 mm. The exclusion criteria were as follows: (a) the patient’s information was incomplete. (b) The image quality was low, (c) the nodules disappeared during follow-up, and (d) multiple pulmonary nodules were found in the baseline images. An overview of the workflow of this study is shown in Figure 1.

FIGURE 1

Figure 1 Overview workflow of this study (HRCT, high-resolution computed tomography).

The follow-up protocol were as follows: (a) the size of the nodule was 6-8mm, and HRCT of lung scan was performed at 6-12 months. (b) The nodules were 8-20mm in size and HRCT of lung scans were performed every 3 months.

Through the exclusion criteria, 228 of 314 patients for follow-up were finally included. All patients were randomly divided into a training group and a validation group at a ratio of 7:3. The pulmonary nodules were labeled growth or stable according to whether they grew within the one-year follow-up. According to the literature (19), growing nodules were defined as nodules that increased in diameter by more than 1.8 mm in one year. Stable nodules were defined as a change in size of less than 1.8 mm over a year.

CT scanning

The CT images were obtained on a dual source scanner (Siemens SOMATOM Drive, Siemens Healthineers, Germany), a 64-slice detector scanner (Canon Aquilion PRIME TSX-303A, Canon Medical, Japan) and a 16-slice detector scanner (Philips Brilliance 16, Philips Medical, Netherlands). The scanning parameters were as follows:

a. SOMATOM Drive: tube voltage: 120 kV; tube current: automatic; detector collimation = 0.6 mm * 128; pitch: 1.2; rotation time = 0.5 s; reconstruction layer thickness: 1 mm; reconstruction matrix: 512 * 512.

b. Aquilion PRIME: tube voltage: 120 kV; tube current: automatic; detector collimation = 0.5 mm * 64; pitch: 0.824; rotation time = 0.75 s; reconstruction layer thickness: 1 mm; reconstruction matrix: 512*512;

c. Brilliance 16: tube voltage: 120 kV; tube current: 200-300 mAs; detector collimation = 0.75 mm * 16; pitch: 0.938; rotation time = 0.75 s; reconstruction layer thickness: 1 mm; reconstruction matrix: 512*512.

The scan area was from the thoracic entrance to the lung base, covering the whole lung. The scanning was started when the patient held their breath at the end of inhalation.

Region-of-interest segmentation

All images were exported as Dicom files from the scanners. The DICOM images were converted to Nifft format by MRICroGL software (version: 2.1.60). The Nifft format images were imported into 3D-Slicer (an open-source software application for visualization and analysis of medical image computing data sets) (20). The regions of interest (ROIs) were independently segmented by two radiologists with more than 6 years of clinical experience. Two-dimensional ROIs were limned around the boundary of the lesions on each layer of axial CT images. Three-dimensional ROIs (volume of interest) were conducted by the accumulation of all two-dimensional region ROIs.

Radiomics features extraction

Radiomics features were extracted by an open-source python package of Pyradiomics (21). The implementation of all radiomics features followed the Imaging Biomarkers Standardization Initiative recommendations (22). This process worked on the original images, wavelet images and Laplacian of Gaussian images. A total of 1316 features were extracted. The extracted features are listed in Supplementary Table 1. The definitions of the texture parameters are shown on the site of Pyradimics (https://pyradiomics.readthedocs.io/en/latest/features.html). The workflow of this process is shown in Figure 2.

FIGURE 2

Figure 2 The flow chart of radiomic feature extraction and model building.

Prediction model building

The radiomics signature was constructed in 4 steps. In step one, all radiomic feature values were normalized. In step two, the algorithm of the least absolute shrinkage and selection operator (LASSO) method was used to select the features with a nonzero coefficient. In step three, the coefficients of the features from step two were computed using multivariate logistic regression analysis. In step four, the radscore was constructed by linearly combining the coefficients of the features from the third step.

The support vector machine (SVM), random forest (RF), adaptive boosting (Adaboost), and multilayer perceptron (MLP) machine learning algorithms were used to train the model. The algorithm deployment procedure was assessed by stratified 10-fold cross-validation in the training group, which tested each model ten times to maximize the use of data and promote the accuracy of the models (23). The grid search was used to optimize the parameters of the models. The ROC areas under the receiver operating characteristic curve (AUC) and accuracy were calculated to assess the differential ability of the models. The ML algorithms were all programmed using the Python (version 3.8) machine-learning library known as scikit-learn (version 1.1) (24).

A simple threshold screening model was constructed and was compared with the method using nodule size as a basis for the follow-up in the guidelines. The length of the nodule along the X, Y and Z axes was used to calculate the average nodule length, and the average length was used as a screening index. ROC curves were calculated under SPSS using average length. The 1-specificity and sensitivity of different lengths were calculated, and then the Jorden index was calculated to find diagnostic thresholds. The average length of 6 mm from the literature (9) was also used as the threshold for predicting nodular growth. Statistical Analysis

Statistical analyses were performed using IBM SPSS Statistics 25.0. A two-sided p value < 0.05 was considered to indicate a statistically significant difference. The approximate t test was used for the intergroup comparison of continuous variables after the homogeneity test of variance. The chi-square test was used for the intergroup comparison of categorical variables. To meet the requirements of the chi-square test (R*C), the number of nodules in the left inferior lobe anterior basal segment was merged with the posterior basal segment, and the number of nodules in the right inferior lobe anterior basal segment, medial basal segment and posterior basal segment were merged. The radiomics features between the two observers were assessed for reproducibility with intraclass correlation coefficients.

Results

Clinical characteristics of the patients

A total of 228 nodules were finally included in the study. Eighty nodules grew in one year (growth group), and 148 nodules remained stable (stable group). The clinical characteristics of the patients in the two groups are listed in Table 1. The age of the stable group was 52.56 ± 12.14 and that of the growth group was 58.41 ± 14.02. An approximate t test was performed on age, as the square difference between the two groups was even (Levene’s Test F value =2.337 at p value = 0.128). There was a significant difference in age between the two groups (t value =-2.26, p value =0.025, 95% confidence interval (CI): -9.172~-2.522), and the age of the growth group was older than that of the stable group (Supplementary Figure 1). The sex ratios were 83:98 and 41:56 (male:female) for the stable and growth groups, respectively. There was no statistically significant difference between the two groups (χ2 = 0.329 at p value = 0.566, Supplementary Figure 2). The diameters of the nodules in the stable group and the growing group were 5.56 ± 1.19 mm and 7.82 ± 2.58 mm, respectively, showing a significant difference in the T test (t = -9.042 at p value < 0.001, 95% CI -2.75 ~ -1.77, Supplementary Figures 3–5). The chi-square test showed no significant difference in nodule location between the two groups (χ2 = 13.294 at p value = 0.425).

TABLE 1

Table 1 Clinical characteristics of the patients in the training and validation cohort.

Characteristics of the radiomics parameters

A total of 1316 features were extracted from each nodule. A total of 107 features were extracted from the original image, 465 features were extracted from the LOG filtered image, and 744 features were extracted from the wavelet filtered image. With the least absolute shrinkage and selection operator (LASSO), ten features were selected to form a radiomics signature for predicting the growth of nodules. The ten selected features with their contribution coefficients are shown in Figure 3. They included two shape-based features, one gray-level- cooccurrence-matrix (GLCM), one first-order feature, one gray-level-run-length-matrix (GLRLM), three gray-level-dependence-matrix (GLDM) and two gray-level-size-zone-matrix (GLSZM).

FIGURE 3

Figure 3 Employing the least absolute shrinkage and selection operator (LASSO) algorithm to reduce the redundancy feature. (A) Regression coefficient diagram of LASSO. (B) Features selected and their weight.

Linear prediction model

The ten radiomics features selected by LASSO and the clinical signature (age) were combined to establish a classification model by logistic regression. The AUC and accuracy attained by the combined model on the training group and validation group were 0.87 (95% CI: 0.74–0.98), 0.82, 0.82 (95% CI: 0.68–0.95) and 0.84, respectively (Figure 4B). The relationship between the predicted value and the true value is shown in the line chart in the Supplementary Materials (Supplementary Figure 6). The established logistics classification formulation is stated in the Supplementary Material, and the nomogram is described in Figure 5.

FIGURE 4

Figure 4 The receiver operator characteristic (ROC) curves of the linear models for predicting the growth of the nodules within one year. (A) ROC curve of the threshold prediction model (area under the ROC curve (AUC) = 0.73 as threshold at 6 mm, AUC = 0.77 as threshold at 6.3 mm). (B) ROC curve of logistic regression (LR) (AUC = 0.87 in the training group, AUC =0.82 in the validation group).

FIGURE 5

Figure 5 A nomogram was made to predict the one-year growth of single pulmonary nodules.

While using the nodule diameter line length as the screening threshold, in the training group, the diagnostic threshold for mean length was 6.3 mm (sensitivity: 0.778, specificity: 0.771, AUC: 0.81). With 6.3 mm as the threshold, the accuracy and AUC in the validation group were 0.754 and 0.777, respectively, but when 6 mm was used as the threshold to predict growth in all 278 patients, the accuracy and AUC were 0.705 and 0.728, respectively (Figure 4A).

Nonlinear prediction models

In this study, four nonlinear methods were trained to predict the growth of the nodules, including support vector machine (SVM), random forest (RF), adaptive boosting (Adaboost), and multilayer perceptron (MLP). The ROC curves of the four nonlinear models in the training group and validation group are shown in Figure 4, and the classification reports of these models are listed in Table 2.

TABLE 2

Table 2 The classification report of the different models on the validation group.

In the training group, the AUC of the SVM model was 0.95 (95% CI: 0.82-0.99, Figure 6A), the accuracy rate was 0.86, the AUC of the RF model was 1.00 (95% CI: 0.76-1.0, Figure 6B), the accuracy rate was 0.99, the AUC of the MLP model was 1.00 (95% CI: 1.00: 0.79-1.0, Figure 6C), and the accuracy was 1.00. The AUC of the Adaboost model was 1.00 (95% CI: 0.84-1.0, Figure 6D), and the accuracy was 1.00. In the validation group, the AUC of the SVM model was 0.81 (95% CI: 0.64-0.89, Figure 6A), the accuracy rate was 0.81, the AUC of the RF model was 0.77 (95% CI: 0.660-0.83, Figure 6B), the accuracy rate was 0.74, and the AUC of the MLP model was 0.81 (95% CI: 0.69-0.92, Figure 6C). The AUC of the Adaboost model was 0.71 (95% CI: 0.62-0.76, Figure 6D), and the accuracy was 0.78.

FIGURE 6

Figure 6 The receiver operator characteristic (ROC) curves of the nonlinear models for predicting the growth of the nodules within one year. (A) ROC curve of the SVM model (area under the ROC curve (AUC) = 0.95 in the training group, AUC =0.81 in the validation group). (B) ROC curve of the random forest (RF) model (AUC = 1.0 in the training group, AUC =0.77 in the validation group). (C) ROC curve of the multilayer perceptron (MLP) model (AUC = 1.0 in the training group, AUC =0.81 in the validation group). (D) ROC curve of the Adaboost model (AUC = 1.0 in the training group, AUC =0.71 in the validation group).

Discussion

Pulmonary nodules are very common, and it is difficult to accurately predict their growth. Tumor growth kinetics (TGK) have usually been used for the prediction of tumor growth in the past. It is generally considered to have three well-defined phases: the first (lagged phase) is associated with tumor establishment in the host; the second stage (log or exponential) is associated with rapid tumor growth; and the third stage (stationary phase) shows slow growth of the tumor and gradual convergence to the final volume (25). To describe tumor growth, the exponential growth model (26), linear growth model and Gompertzian growth model (27) have been proposed. These models require pathological data of tumors, such as cell lines, cell surface diffusion, and cell proliferation, which cannot be obtained before surgery.

In clinical practice, CT follow-up is of great clinical significance to help manage pulmonary nodules without pathological information. The Fleischner Society of the United States, the American College of Chest Physicians, the British Thoracic Society, and the American College of Radiology have published their guidelines for the management of nodules based on CT findings to help physicians develop an effective follow-up protocol. However, even among the most widely applied Fleischner guidelines, there was considerable heterogeneity in the choice of nodule treatment in clinical practice (8). Additionally, the CT findings adopted by these guidelines were gross morphology, which was limited in information. Previous studies have shown that radiomics features can be used to analyze the biological and pathophysiological information of lung cancer and provide rapid and accurate noninvasive biomarkers for its diagnosis, prognosis and treatment response monitoring (28). This study was the first to use radiomics tools to predict single pulmonary nodule growth within one year. The results showed that our model performs well in both the training group and the validation group. This model could help to develop a follow-up plan for uncertain pulmonary nodules and reduce the over treatment of nodules in clinical practice.

In this study, five different machine learning methods were used to develop prediction models of whether pulmonary nodules would increase within one year. In general, the growth of nodules was related to gender, adhesion, location, size, and characteristics of nodules (29). The size and characteristics (such as solid, subsolid, ground glass, and spiculated) in the guidelines were gross changes, and high-throughput radiomics features could decompose these features into more detailed texture features to determine more nuanced information. These features included size and shape-based features, first-order features of the image gray histogram, second-order features of image voxel relations, such as gray-level cooccurrence matrix (GLCM), run length matrix (RLM), size zone matrix (SZM) and neighborhood gray tone difference matrix (NGTDM), texture features extracted by wavelet and Gaussian Laplacian filter, etc. (22). These high-dimensional data contained information reflecting the underlying pathophysiology (30), which can be revealed by quantitative image analysis (31, 32). In this study, the 1316 radiomics features extracted from the CT images were reduced to ten features with the LASSO algorithm. The ten features and their weights are shown in Figure 3B. Among them, the morphological features LeastAxisLength and MajorAxisLength reflected the nodule size, which corresponded to the nodule diameter adopted in the guidelines (5–7, 9). In a previous study of portal phase expansive versus infiltrative tumor growth front, wavelet_LHH_glrlm_ShortRun-LowGrayLevelEmphasis was considered to be the best predictor of tumor growth (33, 34). The pathological association of textural features derived from gray-level cooccurrence matrices (GLCMs) has been proven and applied to the diagnosis of breast cancer (35). The GLSZM and GLDM features could reflect tumor heterogeneity and homogeneity (36).

Generally, age, sex and nodule location are related to whether a nodule is benign or malignant (7, 9, 37), but whether these factors could predict the growth of a nodule within one year is unclear. In this study, the average age of the patients with enlarged nodules was older than that of the patients with stable nodules at the 1-year follow-up, and the difference was statistically significant. These results indicated that age was an independent predictor of nodule growth (38). There was no significant difference in sex or nodule location between the two groups. This finding was inconsistent with literature reports that women and nodules in the upper lobe of the right lung were risk factors for lung cancer (39). A possible reason was that this study focused on nodular growth rather than benign or malignant nodules, and the growth curves of benign and malignant nodules partially overlapped (40).

In this study, the logistic regression model has the best AUC and accuracy compared to the SVM, RF, MLP and AdaBoost models. It can help doctors predict whether the nodules will grow after one year and has important clinical significance. In previous studies, logistic regression models have been used to predict the malignant degree of solitary pulmonary nodules (41), showing good predictive performance. The nonlinear ML algorithm can deal with multidimensional features and identify some underlying patterns from data that are not linear or polynomial. Previously, Jiang Yuming et al. found that an SVM model can predict the survival rate of gastric cancer patients (42). Mitra Montazeri found that the random forest model is a useful tool for survival prediction and medical decision-making of breast cancer (43). QZ et al. successfully used the AdaBoost model to predict local prostate cancer recurrence (44). MLP models have also been used to predict mortality in elderly patients with hip fractures (45). In this study, the LR model obtained the best AUC and F1 scores in the validation group among the five models, so it was selected to construct the prediction formula and nomogram. The SVM, RF, MLP and AdaBoost models had high AUC and accuracy in the training group but showed low performance in the validation group. Therefore, overfitting may exist and could affect the generalization of the model. According to previous studies, the more complex the model is, the overfit is more likely, the more parameters need to be adjusted, and more samples are needed to learn (46). Therefore, in this study, these models performed worse than the LR models.

In conclusion, in this study, we found that the logistical regression model combining high-resolution CT-derived radiomics and age could accurately predict whether a lung nodule will increase after one year. It has great potential clinical value in helping clinicians develop diagnostic and treatment strategies.

The study has several limitations. First, the sample size was relatively small due to the strict inclusion/exclusion criteria, nearly one-third of the patients were lost to follow-up, and there may have been a potential selection bias. Second, patients with multiple nodules were not included in the analysis. Third, in the model construction, only the imaging features of high-resolution CT plain scans were used, and other imaging data were not considered. In the future, more patients need to be followed up to verify the validity of the model, and different imaging technologies, such as CT enhancement and MRI, should be combined to further improve the prediction efficiency of the model.

Data availability statement

The original contributions presented in the study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding authors.

Ethics statement

The studies involving human participants were reviewed and approved by the ethic committee of the second people’s hospital of JiuLongPo district and the Chongqing western hospital. Written informed consent to participate in this study was provided by the participants’ legal guardian/next of kin.

Author contributions

RY, DH, XL and KW collected the relevant data, and RY and ZL analyzed the data. DH, and ZL wrote this manuscript. ZL put forward the study topic and revised the manuscript. All authors read and approved this manuscript.

Funding

This study was supported by grants from the Science and Health Joint Medicine research project of Chongqing (including traditional Chinese medicine) (grant number 2022MSXM140).

Acknowledgments

The authors thank Siemens Healthcare, Philips Healthcare, and Canon Medical for their kind support.

Conflict of interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher’s note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Supplementary material

The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fonc.2022.1034817/full#supplementary-material

References

1. Hansell DM, Bankier AA, MacMahon H, McLoud TC, Muller NL, Remy J. Fleischner society: glossary of terms for thoracic imaging. Radiology (2008) 246(3):697–722. doi: 10.1148/radiol.2462070712

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Osarogiagbon RU, Miller EA, Faris N, Pinsky PF. Incidental pulmonary nodules, lung cancer screening, and lung cancer in the Medicare population. J Clin Oncol (2022) 40(16_suppl):6536–. doi: 10.1200/JCO.2022.40.16_suppl.6536

CrossRef Full Text | Google Scholar

3. Organization WH. Cancer 2022 . Available at: https://www.who.int/news-room/fact-sheets/detail/cancer.

Google Scholar

4. Henschke C, McCauley D, Yankelevitz DP, Naidich D, McGuinness G, Miettinen OS, et al. Early lung cancer action project: Overall design and findings from baseline screening. Lancet (1999) 354(9173):99–105. doi: 10.1016/S0140-6736(99)06093-6

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Gould MK, Donington J, Lynch WR, Mazzone PJ, Midthun DE, Naidich DP, et al. Evaluation of individuals with pulmonary nodules: When is it lung cancer? diagnosis and management of lung cancer, 3rd ed: American college of chest physicians evidence-based clinical practice guidelines. Chest (2013) 143(5 Suppl):e93S–e120S. doi: 10.1378/chest.12-2351

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Callister ME, Baldwin DR, Akram AR, Barnard S, Cane P, Draffan J, et al. British Thoracic society guidelines for the investigation and management of pulmonary nodules. Thorax (2015) 70(8):794–8. doi: 10.1136/thoraxjnl-2015-207168

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Macmahon H. Guidelines for management of small pulmonary nodules detected on CT scans : a statement from the fleischner society. Radiology (2005) 237(2):395–400. doi: 10.1148/radiol.2372041887

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Mets OM, de Jong PA, Chung K, Lammers JJ, van Ginneken B, Schaefer-Prokop CM, et al. Fleischner recommendations for the management of subsolid pulmonary nodules: High awareness but limited conformance - a survey study. Eur Radiol (2016) 26(11):3840–9. doi: 10.1007/s00330-016-4249-y

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Macmahon H, Naidich DP, Goo JM, Lee KS, Leung A, Mayo JR, et al. Guidelines for management of incidental pulmonary nodules detected on CT images: From the fleischner society 2017. Radiology (2017) 284(1):228–43. doi: 10.1148/radiol.2017161659

PubMed Abstract | CrossRef Full Text | Google Scholar

10. McKee BJ, Regis SM, McKee AB, Flacke S, Wald C. Performance of ACR lung-RADS in a clinical CT lung screening program. J Am Coll Radiol Jacr (2015) 12(3):273–6. doi: 10.1016/j.jacr.2014.08.004

CrossRef Full Text | Google Scholar

11. Lambin P, Rios-Velazquez E, Leijenaar R, Carvalho S, van Stiphout RG, Granton P, et al. Radiomics: Extracting more information from medical images using advanced feature analysis. Eur J Cancer (2012) 48(4):441–6. doi: 10.1016/j.ejca.2011.11.036

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Wong AJ, Kanwar A, Mohamed AS, Fuller CD. Radiomics in head and neck cancer: From exploration to application. Trans Cancer Res (2016) 5(4):371–82. doi: 10.21037/tcr.2016.07.18

CrossRef Full Text | Google Scholar

13. Cameron A, Khalvati F, Haider M, Wong A. MAPS: A quantitative radiomics approach for prostate cancer detection. IEEE Trans Biomed Eng (2015) 63(6):1–. doi: 10.1109/TBME.2015.2485779

CrossRef Full Text | Google Scholar

14. Li H, Zhu Y, Burnside ES, Huang E, Drukker K, Hoadley KA, et al. Quantitative MRI radiomics in the prediction of molecular classifications of breast cancer subtypes in the TCGA/TCIA data set. NPJ Breast Cancer. (2016) 2:16012. doi: 10.1038/npjbcancer.2016.12

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Nie K, Shi L, Chen Q, Hu X, Jabbour SK, Yue N, et al. Rectal cancer: Assessment of neoadjuvant chemoradiation outcome based on radiomics of multiparametric MRI. Clin Cancer Res (2016) 22(21):5256–64. doi: 10.1158/1078-0432.CCR-15-2997

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Tu W-T, Fan L, Liu S-Y. Progress on research of radiomics in lung cancer. Chin J Cancer Prev Treat (2018) 25(8):604–8.Available at: https://www.researchgate.net/publication/330886520_Progress_on_research_of_radiomics_in_lung_cancer.

Google Scholar

17. Kalpathy-Cramer J, Mamomov A, Zhao B, Lu L, Cherezov D, Napel S, et al. Radiomics of lung nodules: A multi-institutional study of robustness and agreement of quantitative imaging features. Tomography J Imaging Res (2016). doi: 10.18383/j.tom.2016.00235

CrossRef Full Text | Google Scholar

18. Yu J, Deng Y, Liu T, Zhou J, Jia X, Xiao T, et al. Lymph node metastasis prediction of papillary thyroid carcinoma based on transfer learning radiomics. Nat Commun (2020) 11(1):4807. doi: 10.1038/s41467-020-18497-3

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Bankier AA, Macmahon H, Goo JM, Rubin GD, Schaefer-Prokop CM, Naidich DP. Recommendations for measuring pulmonary nodules at CT: A statement from the fleischner society. Radiology (2017) 285(2):584–600. doi: 10.1148/radiol.2017162894

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Fedorov A, Beichel R, Kalpathy-Cramer J, Finet J, Fillion-Robin JC, Pujol S, et al. 3D slicer as an image computing platform for the quantitative imaging network. Magn Reson Imaging. (2012) 30(9):1323–41. doi: 10.1016/j.mri.2012.05.001

PubMed Abstract | CrossRef Full Text | Google Scholar

21. van Griethuysen JJM, Fedorov A, Parmar C, Hosny A, Aucoin N, Narayan V, et al. Computational radiomics system to decode the radiographic phenotype. Cancer Res (2017) 77(21):e104–e7. doi: 10.1158/0008-5472.CAN-17-0339

PubMed Abstract | CrossRef Full Text | Google Scholar

22. Zwanenburg A, Vallieres M, Abdalah MA, Aerts H, Andrearczyk V, Apte A, et al. The image biomarker standardization initiative: Standardized quantitative radiomics for high-throughput image-based phenotyping. Radiology (2020) 295(2):328–38. doi: 10.1148/radiol.2020191145

PubMed Abstract | CrossRef Full Text | Google Scholar

23. Molinaro A, Simon R, Pfeiffer R. Prediction error estimation: A comparison of resampling methods. Bioinformatics (2005) 21(15):3301–7. doi: 10.1093/bioinformatics/bti499

PubMed Abstract | CrossRef Full Text | Google Scholar

24. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine learning in Python. J Mach Learn Res (2011) 12(85):2825–30. Available at: http://jmlr.org/papers/v12/pedregosa11a.html.

Google Scholar

25. González M, Joa J, Cabrales L, Pupo A, González GVS. Is cancer a pure growth curve or does it follow a kinetics of dynamical structural transformation? BMC Cancer (2017) 17(1):174. doi: 10.1186/s12885-017-3159-y

PubMed Abstract | CrossRef Full Text | Google Scholar

26. Shackney SE. Tumor growth, cell cycle kinetics, and cancer treatment. Med Oncol (1993), 43–60.

Google Scholar

27. Brú A, Albertos S, Luis Subiza J, García-Asenjo JL, Brú I. The universal dynamics of tumor growth. Biophys J (2003) 85(5):2948–61. doi: 10.1016/S0006-3495(03)74715-8

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Tunali I, Gillies RJ, Schabath MB. Application of radiomics and artificial intelligence for lung cancer precision medicine. Cold Spring Harb Perspect Med (2021) 11(8):a039537. doi: 10.1101/cshperspect.a039537

PubMed Abstract | CrossRef Full Text | Google Scholar

29. Mazzone PJ, Lam L. Evaluating the patient with a pulmonary nodule: A review. Jama (2022) 327(3):264–73. doi: 10.1001/jama.2021.24287

PubMed Abstract | CrossRef Full Text | Google Scholar

30. Kotrotsou A, Zinn PO, Colen RR. Radiomics in brain tumors: An emerging technique for characterization of tumor environment. Magn Reson Imaging Clin N Am (2016) 24(4):719–29. doi: 10.1016/j.mric.2016.06.006

PubMed Abstract | CrossRef Full Text | Google Scholar

31. Yang L, Gu D, Wei J, Yang C, Rao S, Wang W, et al. A radiomics nomogram for preoperative prediction of microvascular invasion in hepatocellular carcinoma. Liver Cancer. (2019) 8(5):373–86. doi: 10.1159/000494099

PubMed Abstract | CrossRef Full Text | Google Scholar

32. Schniering J, Maciukiewicz M, Gabrys HS, Brunner M, Blüthgen C, Meier C, et al. Computed tomography-based radiomics decodes prognostic and molecular differences in interstitial lung disease related to systemic sclerosis. Eur Respir J (2022) 59(5):2004503. doi: 10.1183/13993003.04503-2020

PubMed Abstract | CrossRef Full Text | Google Scholar

33. Granata V, Fusco R, De Muzio F, Cutolo C, Setola SV, Dell' Aversana F, et al. Contrast MR-based radiomics and machine learning analysis to assess clinical outcomes following liver resection in colorectal liver metastases: A preliminary study. Cancers (Basel) (2022) 14(5):1110. doi: 10.3390/cancers14051110

PubMed Abstract | CrossRef Full Text | Google Scholar

34. Granata V, Fusco R, De Muzio F, Cutolo C, Mattace Raso M, Gabelloni M, et al. Radiomics and machine learning analysis based on magnetic resonance imaging in the assessment of colorectal liver metastases growth pattern. Diagnostics (Basel) (2022) 12(5):1115. doi: 10.3390/diagnostics12051115

PubMed Abstract | CrossRef Full Text | Google Scholar

35. Li X, Guindani M, Ng CS, Hobbs BP. Spatial Bayesian modeling of GLCM with application to malignant lesion characterization. J Appl Stat (2018) 46(2):230–46. doi: 10.1080/02664763.2018.1473348

PubMed Abstract | CrossRef Full Text | Google Scholar

36. Mayerhoefer ME, Materka A, Langs G, Häggström I, Szczypiński P, Gibbs P, et al. Introduction to radiomics. J Nucl Med (2020) 61(4):488–95. doi: 10.2967/jnumed.118.222893

PubMed Abstract | CrossRef Full Text | Google Scholar

37. Naidich DP, Bankier AA, Macmahon H, Schaefer-Prokop CM, Pistolesi M, Goo JM, et al. Recommendations for the management of subsolid pulmonary nodules detected at CT: A statement from the fleischner society. Radiology (2013) 266(1):304–17. doi: 10.1148/radiol.12120628

PubMed Abstract | CrossRef Full Text | Google Scholar

38. Xia T, Cai M, Zhuang Y, Ji X, Fu G. Risk factors for the growth of residual nodule in surgical patients with adenocarcinoma presenting as multifocal ground-glass nodules. Eur J Radiology. (2020) 133(5):109332. doi: 10.1016/j.ejrad.2020.109332

CrossRef Full Text | Google Scholar

39. Cruickshank A, Stieler G, Ameer F. Evaluation of the solitary pulmonary nodule. Internal Med J (2019) 49(3):306–15. doi: 10.1111/imj.14219

CrossRef Full Text | Google Scholar

40. Wang X, Han R, Guo F, Li X, Zheng W, Wang Q, et al. Analysis of growth curve type in pulmonary nodules with Different characteristics. Zhongguo Fei Ai Za Zhi. (2017) 20(5):334–40. doi: 10.3779/j.issn.1009-3419.2017.05.06

PubMed Abstract | CrossRef Full Text | Google Scholar

41. Valero MS, Pastor-Valero M, Librero J. Solitary pulmonary nodule malignancy predictive models applicable to routine clinical practice: A systematic review. Systematic Rev (2021) 10(1):308. doi: 10.1186/s13643-021-01856-6

CrossRef Full Text | Google Scholar

42. Jiang Y, Xie J, Huang W, Chen H, Xi S, Han Z, et al. Tumor immune microenvironment and chemosensitivity signature for predicting response to chemotherapy in gastric cancer. Cancer Immunol Res (2019) 7(12):2065–73. doi: 10.1158/2326-6066.CIR-19-0311

PubMed Abstract | CrossRef Full Text | Google Scholar

43. Montazeri M, Montazeri M, Montazeri M, Beigzadeh A. Machine learning models in breast cancer survival prediction. Technol Health Care (2016) 24(1):31–42. doi: 10.3233/THC-151071

PubMed Abstract | CrossRef Full Text | Google Scholar

44. Zhong QZ, Long LH, Liu A, Li CM, Xiu X, Hou XY, et al. Radiomics of multiparametric MRI to predict biochemical recurrence of localized prostate cancer after radiation therapy. Front Oncol (2020) 10:731. doi: 10.3389/fonc.2020.00731

PubMed Abstract | CrossRef Full Text | Google Scholar

45. Cary MP Jr., Zhuang F, Draelos RL, Pan W, Amarasekara S, Douthit BJ, et al. Machine learning algorithms to predict mortality and allocate palliative care for older patients with hip fracture. J Am Med Dir Assoc (2021) 22(2):291–6. doi: 10.1016/j.jamda.2020.09.025

PubMed Abstract | CrossRef Full Text | Google Scholar

46. Ying X. An overview of overfitting and its solutions. J physics: Conf Ser (2019) 1168(2):022. doi: 10.1088/1742-6596/1168/2/022022

CrossRef Full Text | Google Scholar

Keywords: pulmonary nodule, computed tomography, prediction, growth, radiomics, LASSO, logistics regression

Citation: Yang R, Hui D, Li X, Wang K, Li C and Li Z (2022) Prediction of single pulmonary nodule growth by CT radiomics and clinical features — a one-year follow-up study. Front. Oncol. 12:1034817. doi: 10.3389/fonc.2022.1034817

Received: 02 September 2022; Accepted: 05 October 2022;
Published: 28 October 2022.

Edited by:

Chuanming Li, Chongqing University Central Hospital, China

Reviewed by:

Yuwei Xia, Huiying Medical Technology Co., Ltd., China
Tian-wu Chen, Affiliated Hospital of North Sichuan Medical College, China

Copyright © 2022 Yang, Hui, Li, Wang, Li and Li. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Zhichao Li, TGl6YzQ3QGdtYWlsLmNvbQ==; Caiyong Li, MzE5NTY4MjlAcXEuY29t

^†These authors have contributed equally to this work and share first authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

Prediction of single pulmonary nodule growth by CT radiomics and clinical features — a one-year follow-up study

Introduction

Materials and methods

Patients

CT scanning

Region-of-interest segmentation

Radiomics features extraction

Prediction model building

Results

Clinical characteristics of the patients

Characteristics of the radiomics parameters

Linear prediction model

Nonlinear prediction models

Discussion

Data availability statement

Ethics statement

Author contributions

Funding

Acknowledgments

Conflict of interest

Publisher’s note

Supplementary material

References

94% of researchers rate our articles as excellent or good

94% of researchers rate our articles as excellent or good