Skip to main content

ORIGINAL RESEARCH article

Front. Digit. Health

Sec. Connected Health

Volume 7 - 2025 | doi: 10.3389/fdgth.2025.1552746

This article is part of the Research Topic Advancing Vocal Biomarkers and Voice AI in Healthcare: Multidisciplinary Focus on Responsible and Effective Development and Use View all articles

Leveraging Large Language Models for Automated Detection of Velopharyngeal Dysfunction in Patients with Cleft Palate

Provisionally accepted
Myranda Uselton Shirk Myranda Uselton Shirk 1Catherine Dang Catherine Dang 1Jaewoo Cho Jaewoo Cho 1Hanlin Chen Hanlin Chen 1Lily Hofstetter Lily Hofstetter 1Jack Bijur Jack Bijur 1Claiborne Lucas Claiborne Lucas 2Andrew James Andrew James 3Ricardo-Torres Guzman Ricardo-Torres Guzman 3Andrea Hiller Andrea Hiller 3Noah Alter Noah Alter 3Amy Stone Amy Stone 4Maria E Powell Maria E Powell 4Matthew Pontell Matthew Pontell 3,5*
  • 1 Data Science Institute, Vanderbilt University, Nashville, Indiana, United States
  • 2 Department of General Surgery, Prisma Health Greenville, Greenville, United States
  • 3 Department of Plastic Surgery, Vanderbilt University Medical Center, Nashville, Indiana, United States
  • 4 Department of Otolaryngology, Vanderbilt University Medical Center, Nashville, United States
  • 5 Monroe Carell Jr. Children's Hospital, Vanderbilt University Medical Center, Nashville, Tennessee, United States

The final, formatted version of the article will be published soon.

    Background: Hypernasality, a hallmark of velopharyngeal insufficiency (VPI), is a speech disorder with significant psychosocial and functional implications. Conventional diagnostic methods rely heavily on specialized expertise and equipment, posing challenges in resourcelimited settings. This study explores the application of OpenAI's Whisper model for automated hypernasality detection, offering a scalable and efficient alternative to traditional approaches.: The Whisper model was adapted for binary classification by replacing its sequence-tosequence decoder with a custom classification head. A dataset of 184 audio recordings, including 96 hypernasal (cases) and 88 non-hypernasal samples (controls), was used for training and evaluation. The Whisper model's performance was compared to traditional machine learning approaches, including support vector machines (SVM) and random forest (RF) classifiers. Results: The Whisper-based model effectively detected hypernasality in speech, achieving a test accuracy of 97% and an F1-score of 0.97. It significantly outperformed SVM and RF classifiers, which achieved accuracies of 88.1% and 85.7%, respectively. Whisper demonstrated robust performance across diverse recording conditions and required minimal training data, showcasing its scalability and efficiency for hypernasality detection. Conclusion: This study demonstrates the effectiveness of the Whisper-based model for hypernasality detection. By providing a reliable pretest probability, the Whisper model can serve as a triaging mechanism to prioritize patients for further evaluation, reducing diagnostic delays and optimizing resource allocation.

    Keywords: velopharyngeal dysfunction (VPD), Hypernasality Detection, Artificial intelligence (AI), Cleft Palate, Machine Learning (ML), Speech Diagnostics

    Received: 29 Dec 2024; Accepted: 10 Mar 2025.

    Copyright: © 2025 Shirk, Dang, Cho, Chen, Hofstetter, Bijur, Lucas, James, Guzman, Hiller, Alter, Stone, Powell and Pontell. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

    * Correspondence: Matthew Pontell, Department of Plastic Surgery, Vanderbilt University Medical Center, Nashville, 37232, Indiana, United States

    Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.

    Research integrity at Frontiers

    Man ultramarathon runner in the mountains he trains at sunset

    94% of researchers rate our articles as excellent or good

    Learn more about the work of our research integrity team to safeguard the quality of each article we publish.


    Find out more