Introduction

AUTHOR=Zhang Pinzhi, Swaminathan Alagappan, Uddin Ahmed Abrar

TITLE=Pulmonary disease detection and classification in patient respiratory audio files using long short-term memory neural networks

JOURNAL=Frontiers in Medicine

VOLUME=10

YEAR=2023

URL=https://www.frontiersin.org/journals/medicine/articles/10.3389/fmed.2023.1269784

DOI=10.3389/fmed.2023.1269784

ISSN=2296-858X

ABSTRACT=<sec><title>Introduction</title><p>To improve the diagnostic accuracy of respiratory illnesses, our research introduces a novel methodology for diagnosing a subset of lung diseases from patient respiratory audio recordings. These lung diseases include Chronic Obstructive Pulmonary Disease (COPD), Upper Respiratory Tract Infections (URTI), Bronchiectasis, Pneumonia, and Bronchiolitis.</p></sec><sec><title>Methods</title><p>Our proposed methodology trains four deep learning models on an input dataset of 920 patient respiratory audio files. These audio files were recorded using digital stethoscopes and make up the Respiratory Sound Database. The four models are a Convolutional Neural Network (CNN), a Long Short-Term Memory network (LSTM), a CNN ensembled with a unidirectional LSTM (CNN-LSTM), and a CNN ensembled with a bidirectional LSTM (CNN-BLSTM).</p></sec><sec><title>Results</title><p>The models are evaluated using metrics such as accuracy, precision, recall, and F1-score. The best-performing model, LSTM, achieves an overall accuracy of 98.82% and an F1-score of 0.97.</p></sec><sec><title>Discussion</title><p>The LSTM model's high predictive accuracy can be attributed to its ability to capture sequential patterns in time-series audio data. In summary, this model can ingest patient audio recordings and make precise lung disease predictions in real time.</p></sec>