AUTHOR=Wei Hong , Wang Yichun , Li Jinyao , Wang Yanyan , Lu Longdi , Sun Jiawei , Wang Xiaolei TITLE=Diagnosis of benign and malignant peripheral lung lesions based on a feature model constructed by the random forest algorithm for grayscale and contrast-enhanced ultrasound JOURNAL=Frontiers in Oncology VOLUME=14 YEAR=2024 URL=https://www.frontiersin.org/journals/oncology/articles/10.3389/fonc.2024.1352028 DOI=10.3389/fonc.2024.1352028 ISSN=2234-943X ABSTRACT=Rationale and objectives

To construct a predictive model for benign and malignant peripheral pulmonary lesions (PPLs) using a random forest algorithm based on grayscale ultrasound and ultrasound contrast, and to evaluate its diagnostic value.

Materials and methods

We selected 254 patients with PPLs detected using chest lung computed tomography between October 2021 and July 2023, including 161 malignant and 93 benign lesions. Relevant variables for judging benign and malignant PPLs were screened using logistic regression analysis. A model was constructed using the random forest algorithm, and the test set was verified. Correlations between these relevant variables and the diagnosis of benign and malignant PPLs were evaluated.

Results

Age, lesion shape, size, angle between the lesion border and chest wall, boundary clarity, edge regularity, air bronchogram, vascular signs, enhancement patterns, enhancement intensity, homogeneity of enhancement, number of non-enhancing regions, non-enhancing region type, arrival time (AT) of the lesion, lesion-lung AT difference, AT difference ratio, and time to peak were the relevant variables for judging benign and malignant PPLs. Consequently, a model and receiver operating characteristic curve were constructed with an AUC of 0.92 and an accuracy of 88.2%. The test set results showed that the model had good predictive ability. The index with the highest correlation for judging benign and malignant PPLs was the AT difference ratio. Other important factors were lesion size, patient age, and lesion morphology.

Conclusion

The random forest algorithm model constructed based on clinical data and ultrasound imaging features has clinical application value for predicting benign and malignant PPLs.