AUTHOR=Sammani Arjan , Jansen Mark , de Vries Nynke M. , de Jonge Nicolaas , Baas Annette F. , te Riele Anneline S. J. M. , Asselbergs Folkert W. , Oerlemans Marish I. F. J. 

TITLE=Automatic Identification of Patients With Unexplained Left Ventricular Hypertrophy in Electronic Health Record Data to Improve Targeted Treatment and Family Screening

JOURNAL=Frontiers in Cardiovascular Medicine

VOLUME=Volume 9 - 2022

YEAR=2022

URL=https://www.frontiersin.org/journals/cardiovascular-medicine/articles/10.3389/fcvm.2022.768847

DOI=10.3389/fcvm.2022.768847

ISSN=2297-055X

ABSTRACT=Background: Unexplained Left Ventricular Hypertrophy (ULVH) may be caused by genetic and non-genetic aetiologies (e.g. sarcomere variants, cardiac amyloid or Anderson-Fabry’s disease). Identification of ULVH patients allows for early targeted treatment and family screening. 
Aim: To automatically identify patients with ULVH in electronic health record (EHR) data using two computer methods: text-mining and machine learning (ML).
Methods: Adults with echocardiographic measurement of interventricular septum thickness (IVSt) were included. A text-mining algorithm was developed to identify patients with ULVH. An ML algorithm including a variety of clinical, ECG and echocardiographic data was trained and tested in an 80%/20% split. Clinical diagnosis of ULVH was considered the gold standard. Misclassifications were reviewed by an experienced cardiologist. Sensitivity, specificity, positive and negative likelihood ratios (LHR+ and LHR-) of both text-mining and ML were reported.
Results: In total, 26954 subjects (median age 61 years, 55% male) were included. ULVH was diagnosed in 204/26954 (0.8%) patients, of which 56 had amyloidosis and two Anderson-Fabry Disease. Text-mining flagged 8192 patients with possible ULVH, of whom 159 were true positives (sensitivity, specificity, LHR+ and LHR-of 0.78, 0.67, 2.36 and 0.33). Machine learning resulted in a sensitivity, specificity, LHR+ and LHR- of 0.32, 0.99, 32 and 0.68 respectively. Pivotal variables included IVSt, systolic blood pressure and age.
Conclusions: Automatic identification of patients with ULVH is possible with both Text-mining and ML. Text-mining may be a comprehensive scaffold but can be less specific than machine learning. Deployment of either method depends on existing infrastructures and clinical applications.