AUTHOR=Zitu Md Muntasir , Zhang Shijun , Owen Dwight H. , Chiang Chienwei , Li Lang TITLE=Generalizability of machine learning methods in detecting adverse drug events from clinical narratives in electronic medical records JOURNAL=Frontiers in Pharmacology VOLUME=14 YEAR=2023 URL=https://www.frontiersin.org/journals/pharmacology/articles/10.3389/fphar.2023.1218679 DOI=10.3389/fphar.2023.1218679 ISSN=1663-9812 ABSTRACT=
We assessed the generalizability of machine learning methods using natural language processing (NLP) techniques to detect adverse drug events (ADEs) from clinical narratives in electronic medical records (EMRs). We constructed a new corpus correlating drugs with adverse drug events using 1,394 clinical notes of 47 randomly selected patients who received immune checkpoint inhibitors (ICIs) from 2011 to 2018 at The Ohio State University James Cancer Hospital, annotating 189 drug-ADE relations in single sentences within the medical records. We also used data from Harvard’s publicly available 2018 National Clinical Challenge (n2c2), which includes 505 discharge summaries with annotations of 1,355 single-sentence drug-ADE relations. We applied classical machine learning (support vector machine (SVM)), deep learning (convolutional neural network (CNN) and bidirectional long short-term memory (BiLSTM)), and state-of-the-art transformer-based (bidirectional encoder representations from transformers (BERT) and ClinicalBERT) methods trained and tested in the two different corpora and compared performance among them to detect drug–ADE relationships. ClinicalBERT detected drug–ADE relationships better than the other methods when trained using our dataset and tested in n2c2 (ClinicalBERT F-score, 0.78; other methods,