ORIGINAL RESEARCH article

Front. Child Adolesc. Psychiatry

Sec. Autism and Other Neurodevelopmental Disorders

Volume 4 - 2025 | doi: 10.3389/frcha.2025.1519753

Detecting ADHD through Natural Language Processing and Stylometric Analysis of Adolescent Narratives

Provisionally accepted
  • 1Developmental Clinical Psychology Research Unit, Faculty of Psychology and Educational Sciences, University of Geneva, Geneva, Switzerland
  • 2Faculty of Humanities, University of Geneva, Geneva, Switzerland
  • 3Ecole nationale des chartes, PSL, Paris, France
  • 4Research Department of Clinical, Educational and Health Psychology, University College London, London, United Kingdom

The final, formatted version of the article will be published soon.

Attention-Deficit/Hyperactivity Disorder (ADHD) significantly affects adolescents' everyday lives, particularly in emotion regulation and interpersonal relationships. Despite its high prevalence, ADHD remains underdiagnosed, highlighting the need for improved diagnostic tools. This study explores, for the first time, the potential of Natural Language Processing (NLP) and stylometry to identify linguistic markers within Self-Defining Memories (SDMs) of adolescents with ADHD and to evaluate their utility in detecting the disorder. A further novel aspect of this research is the use of SDMs as a linguistic dataset, which reveals meaningful patterns while engaging psychological processes related to identity and memory. Our objectives were to: (1) characterize linguistic features of SDMs in ADHD and control groups; (2) assess the predictive power of stylometry for group classification, distinguishing between controls and individuals with ADHD; and (3) conduct a qualitative analysis of key linguistic markers of each group. Twenty-five adolescents diagnosed with ADHD and forty-one typically developing adolescents participated in the study. The results showed that adolescents with ADHD produced shorter, less diverse, and less cohesive narratives. In addition, stylometric analysis using Support Vector Machine (SVM) classifiers distinguished between ADHD and control groups with up to 100% precision. Distinct linguistic markers were identified, potentially reflecting challenges in emotion regulation. These findings suggest that NLP and stylometry can enhance ADHD diagnostics by providing objective linguistic markers, thereby improving both its understanding and diagnostic procedures. Further research is needed to validate these methods in larger and more diverse populations.

Keywords: ADHD, Self-defining memories, Emotion Regulation, Natural Language Processing, Stylometry ing, computational linguistics, and Speech Recognition

Received: 30 Oct 2024; Accepted: 17 Apr 2025.

Copyright: © 2025 Barrios Rudloff, Poznyak, Samson, Rafi, Gabay, Cafiero and Debbané. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

* Correspondence: Juan Barrios Rudloff, Developmental Clinical Psychology Research Unit, Faculty of Psychology and Educational Sciences, University of Geneva, Geneva, Switzerland

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.