- 1Winterlight Labs, Toronto, ON, Canada
- 2Department of Psychology, University of Toronto, Toronto, ON, Canada
- 3Hurvitz Brain Sciences Program, Sunnybrook Research Institute, Toronto, ON, Canada
- 4School of Public Health Sciences, University of Waterloo, Waterloo, ON, Canada
- 5Ontario Brain Institute, Toronto, ON, Canada
- 6Department of Medicine (Neurology), University of Toronto, Toronto, ON, Canada
Background: The detection and characterization of speech changes may help in the identification and monitoring of neurodegenerative diseases. However, there is limited research validating the relationship between speech changes and clinical symptoms across a wide range of neurodegenerative diseases.
Method: We analyzed speech recordings from 109 patients who were diagnosed with various neurodegenerative diseases, including Alzheimer’s disease, Frontotemporal Dementia, and Vascular Cognitive Impairment, in a cognitive neurology memory clinic. Speech recordings of an open-ended picture description task were processed using the Winterlight speech analysis platform which generates >500 speech features, including the acoustics of speech and linguistic properties of spoken language. We investigated the relationship between the speech features and clinical assessments including the Mini Mental State Examination (MMSE), Mattis Dementia Rating Scale (DRS), Western Aphasia Battery (WAB), and Boston Naming Task (BNT) in a heterogeneous patient population.
Result: Linguistic features including lexical and syntactic features were significantly correlated with clinical assessments in patients, across diagnoses. Lower MMSE and DRS scores were associated with the use of shorter words and fewer prepositional phrases. Increased impairment on WAB and BNT was correlated with the use of fewer nouns but more pronouns. Patients also differed from healthy adults as their speech duration was significantly shorter with more pauses.
Conclusion: Linguistic changes such as the use of simpler vocabularies and syntax were detectable in patients with different neurodegenerative diseases and correlated with cognitive decline. Speech has the potential to be a sensitive measure for detecting cognitive impairments across various neurodegenerative diseases.
1 Introduction
Speech and language impairments are a prominent symptom in many neurodegenerative diseases including Alzheimer’s Disease (AD), Frontotemporal Dementia (FTD) and Vascular Dementia (VD) (1–5). These changes emerge along with the primary set of symptoms such as memory deficits and unusual behaviors associated with each neurodegenerative disease. There have been efforts in differentiating various dementias from healthy older populations based on speech and language assessments (1, 4, 6–10). However, many patients with dementia diagnoses exhibit comorbid conditions. The manifestation of symptoms could differ dramatically from one to another patient despite having received the same diagnosis, making characterization of disease incredibly complicated. Nevertheless, beyond the diagnosis of the patient, each neurodegenerative disease presents multiple symptoms that progress differently in each individual. Thus, understanding how speech and language changes manifest themselves in neurodegenerative diseases and how they relate to clinical symptoms can help add to the clinical picture and improve characterization of neurodegenerative disease. Here, we investigate this question by leveraging a rich, heterogeneous dataset that includes a wide range of neurodegenerative disease diagnoses.
Speech and language changes are an integral part of symptom progression in neurodegenerative diseases such as in AD. Although symptoms of typical AD concern deficits in episodic memory, executive function or reasoning, patients might also experience language impairments, specifically, in semantic abilities, verbal fluency or language comprehension (1, 8, 11, 12). These language deficits manifest themselves as impairments in verbal naming, speech pauses and word finding ability (13–15). Atypical AD, specifically the logopenic variant of primary progressive aphasia (PPA), is associated with impaired lexical access, naming difficulty, dysfluencies (16–19). Recent efforts identified a set of language changes, specifically in lexical retrieval and fluency, that differed between the logopenic variant of PPA and the typical AD although these were not related to clinical outcomes (20). Nevertheless, typical AD and the logopenic variant of PPA share certain language deficits such as production of more adverbs, fewer prepositions and nouns (20). In fact, many neurodegenerative diseases show overlapping speech and language changes, emphasizing the importance of understanding their relation to cognitive decline.
Frontotemporal Dementia (FTD) encompasses distinct cognitive, social and behavioral symptoms that differ from AD, yet speech changes have also been reported across subtypes. Clinically, FTD could present itself as the behavioral variant or PPA, which could be further subdivided into fluent (semantic) and non-fluent variants (21). Patients with the non-fluent variant exhibit deficits in prosody, articulation and local coherence of speech (22–24). Thus, the symptoms include effortful speech, morphological or syntactic deficits, and word finding difficulty that result in simplified language (1). Semantic dementia (fluent variant), on the other hand, is characterized by decline of conceptual knowledge (i.e., semantic memory), primarily involving word comprehension (25). These patients lose the meaning of the words, mostly, nouns, but speech fluency and phonology stays relatively intact (26). As a result, patients with semantic dementia tend to produce fewer but more familiar nouns, and more adverbs than healthy adults (9). Lastly, patients with the logopenic variant of PPA, who tend to show AD pathology at autopsy, exhibit word finding difficulty, phonemic errors, and increased number of pronouns (16, 19, 22). In contrast to fluent and non-fluent variants of FTD, semantic memory, grammar, syntax and production of speech remain relatively unaffected in these patients (16, 19, 22). Lastly, speech and language change in naming, single word or discourse comprehension, and prosody in the behavioral variant of FTD although, historically, this variant was not considered to present such symptoms (27–30). The heterogeneity across different variants of FTD complicates the characterization of distinct speech and language differences for each disease.
Overlapping speech and language deficits across neurodegenerative diseases blur the diagnostic boundaries. Flow of speech as known as fluency could depend on the stage of the disease, in other words, the degree of cognitive decline (31). Fluency changes are common in AD although it remains unclear whether these reflect deficits in semantic knowledge, accessing or retrieving information while producing speech (32, 33). Early onset of AD often exhibits similar speech and language deficits as mild cognitive impairment, primarily in lexico-semantic domain such as access to semantic knowledge or lexical decisions (12, 34, 35). Language changes including difficulty with name generation and single word comprehension in AD are also common in semantic dementia (i.e., fluent FTD). Although single word comprehension is spared in logopenic progressive aphasia, speech rate tends to be slow due to slow word retrieval and frequency pauses to find the right words (17). In fact, logopenic variant aphasia can exhibit AD pathology or progress into dementia caused by AD in later stages (36, 37). Thus, similar speech and language changes may occur across different neurodegenerative diseases depending on the disease stage, social factors, and co-morbid condition.
Heterogeneity in and across neurogenerative diseases necessitates the development of digital speech and language biomarkers that can capture the cognitive decline and clinical symptoms, extending beyond the diagnostic categories. Literature reveals overlapping clinical symptoms and speech changes across neurogenerative diseases. For example, semantic dementia or progressive non-fluent aphasia could present different clinical symptoms, but both patient groups use significantly shorter words than healthy controls (9). However, the word length does not differ between the patients group (9). Similarly, both AD and semantic dementia (i.e., fluent FTD) may exhibit difficulty with name generation or word comprehension (12, 25, 38). Distinct diagnoses may share similar speech and language symptoms although the underlying pathology or affected cognitive domains may differ. Thus, understanding how speech and language changes are linked to cognitive impairment and clinical symptoms beyond the diagnosis labels is as crucial as distinguishing patients from healthy controls. As speech and language changes occur across a variety of neurodegenerative diseases, speech assessments might not be as powerful for differentiating diagnoses, but relate more to cognition, function, and clinical outcomes across diseases. This would mean that speech might be more useful for a cross-diagnosis cognitive assessment, rather than a sole diagnostic tool.
Speech and language features have the potential to be used as a screening or symptom-monitoring tool for neurodegenerative diseases. There is an added benefit of the feasibility of collecting speech samples remotely and with high frequency. We recently demonstrated the importance of high frequency speech assessments in understanding the individual symptom fluctuations relating to depression and cognitive impairment (30, 39, 40). In neurodegenerative diseases, there are no standardized speech and language assessments to be utilized. Thus, short, automated speech assessments which can be administered remotely and with high frequency can be powerful tools in understanding disease progressions. Machine learning and natural language processing models could be used to classify patients with dementia using speech features or differentiate them from healthy controls. Yet, the challenge is to interpret the speech and language changes in terms of the clinical symptoms. Here, our aim is to determine how speech and language changes relate to clinical outcomes in neurodegenerative diseases, beyond the diagnostic categories. Thus, we leveraged a rich, heterogenous patient sample with many neurodegenerative diseases and comorbid diagnoses and investigated the link between clinical outcomes and hundreds of speech features derived through natural language processing. Understanding how potential speech and language biomarkers relate to clinical symptoms will enable early detection or monitoring of cognitive decline at the individual level, as a way of addressing heterogeneity in neurodegenerative diseases.
2 Methods
2.1 Participants
The part of this study related to the patient cohort was approved by the Sunnybrook Research Ethics Board. Patients were recruited from the memory clinics at Sunnybrook Health Sciences Center in Toronto, Ontario, Canada. An informed consent discussion was conducted with all research candidates; all of these candidates agreed to participate, provided a written consent, and were successfully enrolled in the study. For our additional analyses, we also included a healthy older adult sample to compare timing related speech features to patients. The healthy control arm of this study was approved by the Advarra Research Ethics Board. Healthy controls were recruited from the community. Informed consent was collected from all participants. To be eligible for the study, participants needed to be between the ages of 50–95 and fluent English speakers (i.e., either English as their first language or they can speak with conversational proficiency). The exclusion criteria included the following: residing outside of Canada or the United States and having diagnosis of dementia, memory impairment, recent concussion or traumatic brain injury, or uncorrected hearing or visual impairment.
2.2 Clinical assessments
Patients completed a series of cognitive assessments including the Mini Mental State Examination (MMSE) (41), Dementia Rating Scale (DRS) (42), Western Aphasia Battery (WAB) (43), and Boston Naming Test (BNT) (44). MMSE and DRS have been administered to assess cognitive impairment in patients while WAB and BNT have been used to assess speech and language related changes. WAB is helpful in characterizing many different aspects of speech and language including fluency, comprehension, naming, reading and writing. BNT is mostly used to assess retrieval of lexical information while naming an object. WAB and BNT were used to validate our speech features and determine how they relate to specific speech changes. On the other hand, MMSE and DRS were used to assess the cognitive decline and its relationship to speech and language changes detected with our extensive features.
2.3 Acoustic and linguistic speech features
Patients’ speech recordings were collected in the clinic while they were performing a picture description task as part of WAB. The picture that participants described was a line drawing of a picnic scene. Healthy controls completed the picture description task as well, but using an app-based interface and a picture of a family in the kitchen scene. The recordings from healthy controls were collected with the Winterlight Speech App. Because of the significant differences between the 2 stimuli, we only analyzed the timing related speech features for the comparison between the patients and healthy controls since linguistic differences of spoken language could relate to the different picture content.
The patients’ speech recordings were transferred to the Ontario Brain Institute’s “Brain-CODE” informatics platform1 for processing and analysis. “Brain-CODE” was designed to support the collection, integration, sharing, and analysis of patient-level data, while abiding by ethical principles and government legislation (45).
First, a trained transcriptionist transcribed and labeled the speech samples, ensuring the quality of audio content and flagged any recordings with significant issues for removal, such as no audible speech or poor audio quality. Although speech recordings were transcribed for quality purposes, the analyses are based on the recorded spoken speech. To make the distinction between speech and language, here acoustics refer to the auditory features of speech while the linguistics relates to the spoken language. Speech samples were then passed through the Winterlight Lab processing pipeline, which relies on python-based language processing libraries and proprietary custom code. Open source packages include SpaCy for parts-of-speech tagging and morphological features (46), the Stanford NLP parser for syntactic features (47), Praat and Parselmouth for acoustic features (48, 49), and GloVe and FastText models for semantic features (50, 51). The pipeline also uses custom code to compute additional features based on the transcript and audio file, using lexical norms from previous publications (52–55) or previously published models and features (56).
The Winterlight Lab processing pipeline enables us to extract 707 speech features from the audio files and transcripts. These features reflect various aspects of speech: acoustics (e.g., properties of the sound wave, speech rate, number of pauses), lexical content (e.g., rates and types of words used, and their characteristics such as frequency and imageability, which reflect how commonly words are used and how easy they are to picture, respectively), semantics (relating to the meaning of the words, e.g., semantic relatedness of subsequent utterances, semantic relatedness of utterances to the items in the picture) and syntactic (relating to the grammar of the sentences, e.g., syntactic complexity, use of different syntactic constructions) properties of the recordings. While it is not possible to review each feature in detail, we provide more detailed definitions for some of the features of interest in this paper. For instance, average word length and noun or pronoun count fall under the lexical category. Average word length represents the mean number of letters in each word used to describe the picture. Noun and pronoun count are calculated based on the number of nouns or pronouns in a transcript divided by the number of words, respectively. Lexical features also include noun familiarity and frequency, which represent the mean familiarity or frequency of the nouns in the transcript, based on familiarity or frequency norms, respectively (52, 54). In other words, familiarity is a subjective rating of how common the word is, and frequency is how often it appears in a standard corpus of speech. Prepositional phrase count, as an example of syntactic features, is the number of times the phrase occurs in a transcript. Timing related features include mean speech or pause duration. Speech duration is the total number of seconds that participants take to describe the picture while pause duration is calculated by dividing the duration of unvoiced segments of speech by the total number of unvoiced segments. An overview of the feature categories, definitions, numbers, and examples is provided in Table 1.
2.4 Statistical analyses
All analyses were completed on R statistical software, version 4.1.2 (57). We eliminated the speech features that had empty values for at least 20% of participants. 80% of the features eliminated fell under morphological or syntactic speech categories. Specifically, a total of 146 morphological and 94 syntactic speech features had empty values, because of a combination of adverb and prepositional phrases that did not occur in all transcripts, and thus were eliminated from the analyses. Some of the remaining 62 speech features eliminated were related to tags for specific words, for instance, the words with a hyphen (e.g., t-shirt) in them. Most of the remaining speech categories in Table 1 were not affected from this cleaning process, yielding 405 features in total.
We fit separate linear mixed effects models to each speech feature to investigate their unique relationship with the clinical assessments, with covariates of age, sex, and years of education. These analyses were done separately for each of the cognitive assessments; MMSE, DRS, BNT and WAB. Statistical significance was set to an alpha level equal to 0.0001 taking multiple comparisons into consideration through Bonferroni correction (0.05/405 features). Using a series of analysis of variances (ANOVAs), we have compared different diagnoses in terms of the speech feature that we identified in the above analyses. However, these analyses did not yield significant results due to small sample size in each diagnosis.
Lastly, we included a healthy control dataset to be able to compare the speech changes in patients. We only included the timing related features for this comparison. As mentioned above, acoustic and linguistic features might not be appropriate for the comparison because the healthy older adults described a different picture, and recordings were obtained using different devices (digital vs. analog) and recording conditions. We used a Principal Component Analysis (PCA) to reduce the dimensionality of the feature set and distinguish the patients from healthy controls based on the resulting composite variables. This was used as an exploratory cluster analysis. We focused on the first few principal components, that, when combined, were able to explain at least 80% variance. Although there is no straight forward way to determine the number of principal components to retain, the first few components are expected to explain at least 75% of variance or even less in some situations (58–60).
3 Results
3.1 Demographics
This study included 109 patients (52 females, 57 males) with an age range of 51–91 (M = 72.63 ± 8.61). Patient diagnoses included AD, FTD and Vascular dementia. The largest subset of patients was those diagnosed with AD and familial AD, consisting of 34 patients (Supplementary Table S1). All patients completed MMSE, DRS, WAB and BNT as well as the picture description task as part of WAB (Table 2). We also included 74 healthy controls (39 females) with mean age 61.31 ± 7.29. Healthy controls only completed the picture description task, but no additional clinical assessments reported here.
3.2 MMSE and DRS relate to linguistic features
We investigated the relationship of clinical scores on MMSE and DRS to acoustic and linguistic speech features. We conducted separate linear mixed effects models for each speech feature and repeated the analyses for MMSE and DRS separately. We observed that of all the features, average word length was significantly associated with MMSE, R2 = 0.14, F (4, 103) = 5.46, β = 0.01, p = 0.001, 95% CI [0.005, 0.02], and DRS total scores, R2 = 0.20, F (4, 100) = 7.35, β = 0.005, p = 5.82E-05, 95% CI [0.003, 0.008] (Figures 1A,B). Similarly, prepositional phrase count was also significantly correlated with MMSE, R2 = 0.14, F (4, 103) = 5.53, β = 0.002, p = 0.0001, 95% CI [0.001, 0.003], and DRS total scores, R2 = 0.13, F (4, 100) = 5.03, β = 0.0007, p = 0.0002, 95% CI [0.0003, 0.001] (Figures 1C,D). All of these results remained significant following Bonferroni correction.
Figure 1. Clinical impairment measured with MMSE and DRS is correlated with average word length and prepositional phrase usage. (A) Lower MMSE and (B) DRS scores are associated with shorter word length. (C) Lower MMSE and (D) DRS are also correlated with use of fewer prepositional phrases.
Additional results included that the use of familiar nouns was inversely associated with MMSE, R2 = 0.14, F (4, 103) = 5.20, β = −1.60, p = 0.0001, 95% CI [−2.40, −0.80], and DRS total scores, R2 = 0.14, F (4, 100) = 5.42, β = −0.51, p = 0.0001, 95% CI [−0.76, −0.26]. Similarly, using frequent nouns were negatively correlated with MMSE, R2 = 0.14, F(4, 103) = 5.29, β = −0.02, p = 0.00005, 95% CI [−0.03, −0.01], and DRS total scores, R2 = 0.12, F (4, 100) = 4.21, β = −0.006, p = 0.0005, 95% CI [−0.01, −0.003].
3.3 WAB and BNT relate to linguistic features
We investigated the relationship of language deficits reported on WAB and BNT to acoustic and linguistic speech features. We conducted separate linear mixed effects models for each speech feature and repeated the analyses for WAB and BNT separately. We observed that noun count was significantly associated with WAB, R2 = 0.16, F (4, 104) = 6.13, β = 0.001, p = 0.0002, 95% CI [0.0007, 0.002], and BNT total scores, R2 = 0.24, F (4, 98) = 8.95, β = 0.005, p = 1.25E-06, 95% CI [0.002, 0.005] (Figures 2A,B). On the contrary, pronoun count was negatively correlated with WAB, R2 = 0.09, F (4, 104) = 3.63, β = −0.001, p = 0.002, 95% CI [−0.002, −0.0005], and BNT total scores, R2 = 0.11, F (4, 98) = 4.25, β = −0.003, p = 0.0003, 95% CI [−0.004, −0.001] (Figures 2C,D).
Figure 2. Impairment in language (WAB) and semantic processing (BNT) is associated with the amount of noun and pronoun usage. (A) Lower WAB total and (B) BNT scores are correlated with reduced noun usage. Conversely, (C) lower WAB total and (D) BNT scores relate to higher pronoun usage.
3.4 Timing features distinguish patients from healthy older adults
Principal component analysis based on timing related features, including duration of speech and pauses, differentiated patients from healthy older adults (Figure 3A). The first two principal components explained more than 80% of the variance within the data (Figure 3A). Mean pause and total speech duration had two of the highest loadings and were used to show the differences between the patients and healthy controls. According to the student t-test, patients (M = 0.59, SD = 0.16) produced significantly shorter speech than the healthy controls (M = 0.79, SD = 0.12), t (180.41) = −9.63, p = 2.2E-16 (Figure 3B). On the other hand, patients (M = 0.95, SD = 0.49) paused significantly more than the healthy controls (M = 0.75, SD = 0.37), t (182.05) = −3.36, p = 0.001 (Figure 3C).
Figure 3. Timing related speech features distinguish patients with neurodegenerative diseases from healthy older adults. (A) First 2 principal components based on timing features distinguish patients from healthy controls. Patients produced significantly (B) shorter speech durations but (C) longer pauses than healthy controls.
4 Discussion
This study reports that clinical symptoms in a wide range of neurodegenerative diseases are linked to digital linguistic speech features. Many of the speech and language changes that distinguish patients from the healthy older adults might be overlapping between different neurodegenerative diseases (1). We leveraged a rich and heterogeneous patient sample with many diagnostic labels to investigate how digital speech measures relate to cognitive impairment as well as the linguistic deficits measured with traditional assessments. We show that patients with neurodegenerative diseases tend to use simpler vocabulary and syntax; shorter words and fewer prepositional phrases, reflecting cognitive impairment. They utilize fewer nouns and mostly those that are familiar or frequently used in everyday life. In fact, there appears to be a tradeoff between reduced nouns and increased pronouns, which could result in less specific language. While a healthy older adult might describe a picnic scene as “A lady is sitting on the grass and pouring a beverage next to a gentleman by the lake,” a patient with dementia might only say “She is sitting and pouring something. There is a man.” Building on our previous work looking at longitudinal linguistic changes in AD and FTD (30, 40), the link between linguistic changes and cognition across diagnoses highlights the importance of these language properties beyond the neurodegenerative disease categories.
Our results revealed that the cognitive impairment measured with MMSE and DRS was correlated with use of shorter words on average. Shorter word length is reported in many neurodegenerative diseases including AD, semantic dementia, progressive non-fluent aphasia, behavioral variant of FTD (9, 40, 61, 62). Those with semantic dementia and progressive nonfluent aphasia, for example, may differ from healthy controls in terms of word length, yet the 2 patient groups were not previously found to differ from each other (9). Thus, it is necessary to understand how word length is linked to cognitive impairment in addition to distinguishing patients from healthy controls. Our analyses on a heterogenous patient sample revealed a link between word length and cognitive impairment, suggesting the relationship is beyond the diagnostic labels. This is in line with recent findings that shorter word usage in typical AD and the logopenic variant of PPA is correlated with MMSE and BNT (20). Patients with different neurodegenerative diseases may exhibit common linguistic changes, possibly emerging from different underlying deficits. Change in word length might arise from top-down lexical and semantic processing. Unavailability of long, sophisticated words might also result in word finding difficulty, which is commonly reported across diseases (1, 19, 22).
Since the earlier work, naming difficulty has been linked to word frequency and familiarity in dementia (63, 64). Noun frequency and familiarity are two related linguistic features reflecting vocabulary complexity, and have been previously shown to be impacted in semantic dementia and, more generally, in FTD (30, 65). These features could even distinguish semantic dementia from healthy controls and patients with progressive non-fluent aphasia (9). Besides FTD, we report the involvement of noun frequency and familiarity across many neurodegenerative diseases. These linguistic features are also closely linked to cognitive impairment on MMSE and DRS. Similarly, noun frequency in conversation was recently reported to be implicated in AD (66). However, the frequency was reported to decrease with age, suggesting increased use of more rare or complex words with increased age (66). It was speculated that the frequency measure could vary with education (66, 67). Thus, the varying sensitivity of linguistic features also adds to the heterogeneity in dementia. This emphasizes the importance of understanding the link between linguistic changes and cognitive impairment, not specifically at the diagnostic level, but at the individual level.
Syntactic properties might be less sensitive to dementia than lexical features such as word length or noun frequency (68). We report that patients use fewer prepositional phrases – a syntactic feature - as cognitive impairment increases. Overall, prepositional phrases are relatively less studied, but they could be contributing to fragmented sentences in neurodegenerative diseases (68, 69) or a reduced ability to form connections between concepts. Typical AD and the logopenic variant of PPA produce fewer prepositions than healthy controls (20). Similarly, patients with FTD use fewer and fewer prepositional phrases over time (30). Prepositional phrases in AD were associated with performance on BNT, relating to the difficulty in efficiently retrieving semantic information (20). Interestingly, while prepositional phrase count was related to MMSE and DRS scores in our study, we identified distinct linguistic features relating to traditional language assessments such as BNT and WAB.
With increased deficits on WAB and BNT, we observed decreased use of nouns but increased number of pronouns in patients with neurodegenerative diseases. Linguistic impairments measured with WAB were associated with the severity of dementia across diagnoses such as behavioral variant of FTD, primary progressive aphasia and AD (70). Patients with typical AD or the logopenic variant of PPA produce fewer nouns (20), and the use of nouns in AD and FTD decreases over time (30, 40). Semantic dementia also presents with decreased noun use, specifically in connected speech (9). Supporting these findings, we showed that decreased noun use along with increased pronouns across neurodegenerative diseases was associated with language deficits measured with WAB and BNT. This might indicate semantic deficits (65), which might extend to other neurodegenerative diseases beyond semantic dementia as our results suggest. Indeed, BNT relies heavily on semantic memory (71) and is thus more correlated with category fluency such as knowledge of words rather than the rhyming of words or verbal fluency (32, 33). This could be why BNT performance is linked to nouns and pronouns across diseases in the current study as these features are capturing more of the semantic processing.
Characteristics of speech production such as rate and timing are also affected in neurodegenerative diseases. Our results show that patients produced significantly shorter speech and more pauses in speech than the healthy older adults. This might indirectly be an indication of lexico-semantic deficits in these patients and relate to our findings discussed above. Patients with dementia produced shorter speech recordings than healthy controls (20, 72). Similarly, those with logopenic variants of PPA or AD pause more or longer (14, 17, 20, 73). Increased pauses are observable in MCI and early AD as well (74). Between-utterance pauses were also shown to differ between AD and MCI, and was related to episodic memory performance, suggesting its importance in early detection of symptoms (14). Supporting this, we provided evidence that timing features distinguished healthy controls from the heterogenous patient sample that included more than 10 different diagnoses and various comorbid conditions. Yet, we did not observe any associations between number of pauses and MMSE or DRS. This might suggest that timing of speech could be involved in specific cognitive domains rather than overall cognitive decline. This emphasizes the importance of understanding how each speech and language change is linked to a particular clinical outcome beyond the clinical diagnoses, which could be the key in unpacking the heterogeneity and comorbidity seen in neurodegenerative diseases.
We were not able to distinguish the linguistic differences between the diagnostic groups due to small sample size in each group. Although many neurodegenerative diseases show a great overlap in terms of speech changes, future research should investigate the relationship between linguistic speech changes and various clinical outcomes in each disease population separately. These identified language features here can be the first step in understanding how linguistic changes manifest themselves as cognitive impairment progresses in each disease. In addition to linguistic changes, acoustic aspects of speech have recently been part of the efforts in capturing speech changes in neurodegenerative diseases. We generated a wide range of acoustic features including but not limited to the power spectrum of speech signals, speech intensity, and jitter. Yet, acoustic features were not significantly correlated with clinical scores. The speech recordings were collected during clinical interviews on relatively older devices and on a platform other than the Winterlight App. Future research should investigate the acoustic features in high quality speech samples collected with more up-to-date devices.
This study identified several linguistic features that are linked to cognitive impairment in neurodegenerative diseases. It suggested that certain linguistic features may relate to cognition more generally, others to language abilities while timing related features were best suited for broadly distinguishing patients and healthy controls. In particular, it could be that word length and prepositions relate more to cognitive abilities in general, nouns and pronouns relate more specifically to language abilities and speech duration/pausing distinguishes controls from patients. Although these results alone are not enough to make a strong argument, future research should explore this idea that perhaps we could use one assessment to derive different features to inform on different aspects of cognition/language. Gold standard assessments for cognitive impairment can be laborious to conduct as they require expertise and time. Speech assessments, on the other hand, are automated, fast and could be administered in addition to the existing clinical assessments. Nevertheless, many neurodegenerative diseases including various variants of AD and FTD present overlapping speech and language changes (1). There have been efforts in understanding disease specific changes, yet individual differences make this investigation complicated. Each patients’ medical history, severity level, lifestyle, and cognitive resource might be contributing to their phenotype in different ways. A recent review suggests that neurodegenerative diseases should be considered as a multi-faceted condition that involves biology, psychology and social levels to explain the resulting digital phenotypes (75). We can get one step closer to a more objective understanding of cognitive decline in neurodegenerative diseases through development of digital linguistic biomarkers that link to specific cognitive deficits.
Data availability statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics statement
The studies involving humans were approved by 1. Sunnybrook Research Ethics Board or 2. Advarra Research Ethics Board. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
MG: Formal analysis, Visualization, Writing – original draft, Writing – review & editing. MK: Data curation, Writing – review & editing. CMS: Resources, Writing – review & editing. AB: Project administration, Writing – review & editing. JR: Supervision, Writing – review & editing. SEB: Conceptualization, Supervision, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This study was conducted with the support of the Ontario Brain Institute, an independent non-profit corporation, funded partially by the Ontario government. This study was also supported by Canadian Institutes of Health Research (CIHR) Grants (MOP13129 and FDN159910) awarded to SEB.
Acknowledgments
We would like to thank the Ontario Brain Institute for providing access to Brain-CODE, a secure neuroinformatics platform, which was used for the data storage, data sharing and analysis workspace.
Conflict of interest
MG and JR were employed by Winterlight Labs that is part of Cambridge Cognition.
The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fneur.2024.1373341/full#supplementary-material
Footnotes
References
1. Boschi, V, Catricalà, E, Consonni, M, Chesi, C, Moro, A, and Cappa, SF. Connected speech in neurodegenerative language disorders: a review. Front Psychol. (2017) 8:269. doi: 10.3389/fpsyg.2017.00269
2. Cuetos, F, Arango-Lasprilla, JC, Uribe, C, Valencia, C, and Lopera, F. Linguistic changes in verbal expression: a preclinical marker of Alzheimer’s disease. J Int Neuropsychol Soc. (2007) 13:433–9. doi: 10.1017/S1355617707070609
3. Geraudie, A, Battista, P, García, AM, Allen, IE, Miller, ZA, Gorno-Tempini, ML, et al. Speech and language impairments in behavioral variant frontotemporal dementia: a systematic review. Neurosci Biobehav Rev. (2021) 131:1076–95. doi: 10.1016/j.neubiorev.2021.10.015
4. Martínez-Nicolás, I, Llorente, TE, Martínez-Sánchez, F, and Meilán, JJG. Ten years of research on automatic voice and speech analysis of people with Alzheimer’s disease and mild cognitive impairment: a systematic review article. Front Psychol. (2021) 12:620251. doi: 10.3389/fpsyg.2021.620251
5. Martínez-Nicolás, I, Llorente, TE, Martínez-Sánchez, F, and Meilán, JJG. Speech biomarkers of risk factors for vascular dementia in people with mild cognitive impairment. Front Hum Neurosci. (2022) 16:1057578. doi: 10.3389/fnhum.2022.1057578
6. Bertola, L, Mota, NB, Copelli, M, Rivero, T, Diniz, BS, Romano-Silva, MA, et al. Graph analysis of verbal fluency test discriminate between patients with Alzheimer’s disease, mild cognitive impairment and normal elderly controls. Front Aging Neurosci. (2014) 6:185. doi: 10.3389/fnagi.2014.00185
7. Cho, S, Nevler, N, Ash, S, Shellikeri, S, Irwin, DJ, Massimo, L, et al. Automated analysis of lexical features in frontotemporal degeneration. Cortex. (2021) 137:215–31. doi: 10.1016/j.cortex.2021.01.012
8. Forbes-McKay, KE, and Venneri, A. Detecting subtle spontaneous language decline in early Alzheimer’s disease with a picture description task. Neurol Sci. (2005) 26:243–54. doi: 10.1007/s10072-005-0467-9
9. Fraser, KC, Meltzer, JA, Graham, NL, Leonard, C, Hirst, G, Black, SE, et al. Automated classification of primary progressive aphasia subtypes from narrative speech transcripts. Cortex. (2014) 55:43–60. doi: 10.1016/j.cortex.2012.12.006
10. Petti, U, Baker, S, and Korhonen, A. A systematic literature review of automatic Alzheimer’s disease detection from speech and language. J Am Med Inform Assoc. (2020) 27:1784–97. doi: 10.1093/jamia/ocaa174
11. Catricalà, E, Della Rosa, PA, Plebani, V, Perani, D, Garrard, P, and Cappa, SF. Semantic feature degradation and naming performance. Evidence from neurodegenerative disorders. Brain Lang. (2015) 147:58–65. doi: 10.1016/j.bandl.2015.05.007
12. Taler, V, and Phillips, NA. Language performance in Alzheimer’s disease and mild cognitive impairment: a comparative review. J Clin Exp Neuropsychol. (2008) 30:501–56. doi: 10.1080/13803390701550128
13. Fox, NC, Warrington, EK, Seiffer, AL, Agnew, SK, and Rossor, MN. Presymptomatic cognitive deficits in individuals at risk of familial Alzheimer’s disease. A longitudinal prospective study. Brain. (1998) 121:1631–9. doi: 10.1093/brain/121.9.1631
14. Pistono, A, Jucla, M, Barbeau, EJ, Saint-Aubert, L, Lemesle, B, Calvet, B, et al. Pauses during autobiographical discourse reflect episodic memory processes in early Alzheimer’s disease. J Alzheimers Dis. (2016) 50:687–98. doi: 10.3233/JAD-150408
15. Silagi, ML, Bertolucci, PHF, and Ortiz, KZ. Naming ability in patients with mild to moderate Alzheimer’s disease: what changes occur with the evolution of the disease? Clinics (São Paulo). (2015) 70:423–8. doi: 10.6061/clinics/2015(06)07
16. Ash, S, Evans, E, O’Shea, J, Powers, J, Boller, A, Weinberg, D, et al. Differentiating primary progressive aphasias in a brief sample of connected speech. Neurology. (2013) 81:329–36. doi: 10.1212/WNL.0b013e31829c5d0e
17. Gorno-Tempini, ML, Brambati, SM, Ginex, V, Ogar, J, Dronkers, NF, Marcone, A, et al. The logopenic/phonological variant of primary progressive aphasia. Neurology. (2008) 71:1227–34. doi: 10.1212/01.wnl.0000320506.79811.da
18. Gorno-Tempini, ML, Dronkers, NF, Rankin, KP, Ogar, JM, Phengrasamy, L, Rosen, HJ, et al. Cognition and anatomy in three variants of primary progressive aphasia. Ann Neurol. (2004) 55:335–46. doi: 10.1002/ana.10825
19. Wilson, SM, Henry, ML, Besbris, M, Ogar, JM, Dronkers, NF, Jarrold, W, et al. Connected speech production in three variants of primary progressive aphasia. Brain. (2010) 133:2069–88. doi: 10.1093/brain/awq129
20. Cho, S, Cousins, KAQ, Shellikeri, S, Ash, S, Irwin, DJ, Liberman, MY, et al. Lexical and acoustic speech features relating to Alzheimer disease pathology. Neurology. (2022) 99:e313–22. doi: 10.1212/WNL.0000000000200581
21. Boxer, AL, and Miller, BL. Clinical features of frontotemporal dementia. Alzheimer Dis Assoc Disord. (2005) 19:S3–6. doi: 10.1097/01.wad.0000183086.99691.91
22. Ash, S, and Grossman, M. Why study connected speech production? In: RM Willems, editor. Cognitive neuroscience of natural language use. Cambridge: Cambridge University Press (2015). 29–58.
23. Ash, S, Moore, P, Antani, S, McCawley, G, Work, M, and Grossman, M. Trying to tell a tale: discourse impairments in progressive aphasia and frontotemporal dementia. Neurology. (2006) 66:1405–13. doi: 10.1212/01.wnl.0000210435.72614.38
24. Graham, NL, Patterson, K, and Hodges, JR. When more yields less: speaking and writing deficits in nonfluent progressive aphasia. Neurocase. (2004) 10:141–55. doi: 10.1080/13554790409609945
25. Snowden, J, Goulding, PJ, and David, N. Semantic dementia: a form of circumscribed cerebral atrophy. Behav Neurol. (1989) 2:167–82. doi: 10.1155/1989/124043
26. Hodges, JR, Patterson, K, Oxbury, S, and Funnell, E. Semantic dementia. Progressive fluent aphasia with temporal lobe atrophy. Brain. (1992) 115:1783–806. doi: 10.1093/brain/115.6.1783
27. Hardy, CJD, Buckley, AH, Downey, LE, Lehmann, M, Zimmerer, VC, Varley, RA, et al. The language profile of behavioral variant frontotemporal dementia. J Alzheimers Dis. (2016) 50:359–71. doi: 10.3233/JAD-150806
28. Luzzi, S, Baldinelli, S, Ranaldi, V, Fiori, C, Plutino, A, Fringuelli, FM, et al. The neural bases of discourse semantic and pragmatic deficits in patients with frontotemporal dementia and Alzheimer’s disease. Cortex. (2020) 128:174–91. doi: 10.1016/j.cortex.2020.03.012
29. Nevler, N, Ash, S, Jester, C, Irwin, DJ, Liberman, M, and Grossman, M. Automatic measurement of prosody in behavioral variant FTD. Neurology. (2017) 89:650–6. doi: 10.1212/WNL.0000000000004236
30. Robin, J, Xu, M, Kaufman, LD, Simpson, W, McCaughey, S, Tatton, N, et al. Development of a speech-based composite score for remotely quantifying language changes in frontotemporal dementia. Cogn Behav Neurol. (2023) 36:237–48. doi: 10.1097/WNN.0000000000000356
31. Kertesz, A, Davidson, W, McCabe, P, Takagi, K, and Munoz, D. Primary progressive aphasia: diagnosis, varieties, evolution. J Int Neuropsychol Soc. (2003) 9:710–9. doi: 10.1017/S1355617703950041
32. Henry, JD, Crawford, JR, and Phillips, LH. Verbal fluency performance in dementia of the Alzheimer’s type: a meta-analysis. Neuropsychologia. (2004) 42:1212–22. doi: 10.1016/j.neuropsychologia.2004.02.001
33. Weakley, A, and Schmitter-Edgecombe, M. Analysis of verbal fluency ability in Alzheimer’s disease: the role of clustering, switching and semantic proximities. Arch Clin Neuropsychol. (2014) 29:256–68. doi: 10.1093/arclin/acu010
34. Duong, A, Whitehead, V, Hanratty, K, and Chertkow, H. The nature of lexico-semantic processing deficits in mild cognitive impairment. Neuropsychologia. (2006) 44:1928–35. doi: 10.1016/j.neuropsychologia.2006.01.034
35. Tsantali, E, Economidis, D, and Tsolaki, M. Could language deficits really differentiate mild cognitive impairment (MCI) from mild Alzheimer’s disease? Arch Gerontol Geriatr. (2013) 57:263–70. doi: 10.1016/j.archger.2013.03.011
36. Appell, J, Kertesz, A, and Fisman, M. A study of language functioning in Alzheimer patients. Brain Lang. (1982) 17:73–91. doi: 10.1016/0093-934X(82)90006-2
37. Oh, MJ, Kim, S, Park, YH, Suh, J, and Yi, S. Early onset Alzheimer’s disease presenting as Logopenic primary progressive aphasia. Dement Neurocogn Disord. (2018) 17:66–70. doi: 10.12779/dnd.2018.17.2.66
38. Macoir, J, and Turgeon, Y. Dementia and language In: K Brown, editor. Encyclopedia of Language & Linguistics. Oxford: Elsevier (2006). 423–30.
39. Gumus, M, DeSouza, DD, Xu, M, Fidalgo, C, Simpson, W, and Robin, J. Evaluating the utility of daily speech assessments for monitoring depression symptoms. Digital Health. (2023) 9:20552076231180523. doi: 10.1177/20552076231180523
40. Robin, J, Xu, M, Balagopalan, A, Novikova, J, Kahn, L, Oday, A, et al. Automated detection of progressive speech changes in early Alzheimer’s disease. Alz Dem Diag Ass Dis Mo. (2023) 15:e12445. doi: 10.1002/dad2.12445
41. Folstein, MF, Folstein, SE, and McHugh, PR. Mini-mental state. J Psychiatr Res. (1975) 12:189–98. doi: 10.1016/0022-3956(75)90026-6
42. Mattis, S. Dementia rating scale: Professional manual. Lutz, FL: Psychological Assessment Resources, Incorporated (1988).
43. Risser, AH, and Spreen, O. The Western aphasia battery. J Clin Exp Neuropsychol. (1985) 7:463–70. doi: 10.1080/01688638508401277
45. Vaccarino, AL, Dharsee, M, Strother, S, Aldridge, D, Arnott, SR, Behan, B, et al. Brain-CODE: a secure Neuroinformatics platform for management, federation, sharing and analysis of multi-dimensional neuroscience data. Front Neuroinform. (2018) 12:28. doi: 10.3389/fninf.2018.00028
46. Honnibal, M., and Montani, I. (2017). spaCy 2: natural language understanding with bloom embeddings, convolutional neural networks and incremental parsing. Available at: https://sentometrics-research.com/publication/72/
47. Chen, D., and Manning, C. (2014). A fast and accurate dependency parser using neural networks. Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP) (Doha, Qatar: Association for Computational Linguistics).
48. Boersma, P., and Weenink, D. (2010). Praat: Doing phonetics by computer. Available at: http://www.praat.org/
49. Jadoul, Y, Thompson, B, and de Boer, B. Introducing Parselmouth: a Python interface to Praat. J Phon. (2018) 71:1–15. doi: 10.1016/j.wocn.2018.07.001
50. Bojanowski, P, Grave, E, Joulin, A, and Mikolov, T. Enriching word vectors with subword information. Comput Language. (2017). doi: 10.48550/arXiv.1607.04606
51. Pennington, J., Socher, R., and Manning, C. (2014). GloVe: global vectors for word representation. Proceedings of the 2014 conference on empirical methods in natural language processing (EMNLP) (Doha, Qatar: Association for Computational Linguistics).
52. Brysbaert, M, and New, B. Moving beyond Kučera and Francis: a critical evaluation of current word frequency norms and the introduction of a new and improved word frequency measure for American English. Behav Res Methods. (2009) 41:977–90. doi: 10.3758/BRM.41.4.977
53. Kuperman, V, Stadthagen-Gonzalez, H, and Brysbaert, M. Age-of-acquisition ratings for 30,000 English words. Behav Res. (2012) 44:978–90. doi: 10.3758/s13428-012-0210-4
54. Stadthagen-Gonzalez, H, and Davis, CJ. The Bristol norms for age of acquisition, imageability, and familiarity. Behav Res Methods. (2006) 38:598–605. doi: 10.3758/BF03193891
55. Warriner, AB, Kuperman, V, and Brysbaert, M. Norms of valence, arousal, and dominance for 13,915 English lemmas. Behav Res. (2013) 45:1191–207. doi: 10.3758/s13428-012-0314-x
56. Mota, NB, Vasconcelos, NAP, Lemos, N, Pieretti, AC, Kinouchi, O, Cecchi, GA, et al. Speech graphs provide a quantitative measure of thought disorder in psychosis. PLoS One. (2012) 7:e34928. doi: 10.1371/journal.pone.0034928
57. R Core Team (2021). R: A Language and Environment for Statistical Computing. Available at: https://www.R-project.org/
58. Habeck, C, Foster, NL, Perneczky, R, Kurz, A, Alexopoulos, P, Koeppe, RA, et al. Multivariate and univariate neuroimaging biomarkers of Alzheimer’s disease. NeuroImage. (2008) 40:1503–15. doi: 10.1016/j.neuroimage.2008.01.056
60. Stühler, E, Merhof, D, Stühler, E, and Merhof, D. Principal component analysis applied to SPECT and PET data of dementia patients – a review In: P Sanguansat, editor. Principal component analysis-multidisciplinary applications (IntechOpen). England: InTech (2012)
61. Cumming, TB, Patterson, K, Verfaellie, M, and Graham, KS. One bird with two stones: abnormal word length effects in pure alexia and semantic dementia. Cogn Neuropsychol. (2006) 23:1130–61. doi: 10.1080/02643290600674143
62. Ferrante, FJ, Migeot, J, Birba, A, Amoruso, L, Pérez, G, Hesse, E, et al. Multivariate word properties in fluency tasks reveal markers of Alzheimer’s dementia. Alzheimers Dement. (2023) 20:925–40. doi: 10.1002/alz.13472
63. Ralph, MAL, Graham, KS, Ellis, AW, and Hodges, JR. Naming in semantic dementia—what matters? Neuropsychologia. (1998) 36:775–84. doi: 10.1016/S0028-3932(97)00169-3
64. Taylor, R. Effects of age of acquisition, word frequency, and familiarity on object recognition and naming in dementia. Percept Mot Skills. (1998) 87:573–4. doi: 10.2466/pms.1998.87.2.573
65. Bird, H, Ralph, MAL, Patterson, K, and Hodges, JR. The rise and fall of frequency and imageability: noun and verb production in semantic dementia. Brain Lang. (2000) 73:17–49. doi: 10.1006/brln.2000.2293
66. Williams, E, Theys, C, and McAuliffe, M. Lexical-semantic properties of verbs and nouns used in conversation by people with Alzheimer’s disease. PLoS One. (2023) 18:e0288556. doi: 10.1371/journal.pone.0288556
67. Tainturier, MJ, Tremblay, M, and Lecours, AR. Educational level and the word frequency effect: a lexical decision investigation. Brain Lang. (1992) 43:460–74. doi: 10.1016/0093-934x(92)90112-r
68. Hier, DB, Hagenlocker, K, and Shindler, AG. Language disintegration in dementia: effects of etiology and severity. Brain Lang. (1985) 25:117–33. doi: 10.1016/0093-934x(85)90124-5
69. Banovic, S, Zunic, LJ, and Sinanovic, O. Communication difficulties as a result of dementia. Mater Sociomed. (2018) 30:221–4. doi: 10.5455/msm.2018.30.221-224
70. Blair, M, Marczinski, CA, Davis-Faroque, N, and Kertesz, A. A longitudinal study of language decline in Alzheimer’s disease and frontotemporal dementia. J Inter Neuropsych Soc. (2007) 13:237–45. doi: 10.1017/S1355617707070269
71. Ivnik, RJ, Malec, JF, Smith, GE, Tangalos, EG, and Petersen, RC. Neuropsychological tests’ norms above age 55: COWAT, BNT, MAE token, WRAT-R Reading, AMNART, STROOP, TMT, and JLO. Clin Neuropsychol. (1996) 10:262–78. doi: 10.1080/13854049608406689
72. Sluis, RA, Angus, D, Wiles, J, Back, A, Gibson, T, Liddle, J, et al. An automated approach to examining pausing in the speech of people with dementia. Am J Alzheimers Dis Other Dement. (2020) 35:1533317520939773. doi: 10.1177/1533317520939773
73. Davis, BH, and Maclagan, M. Examining pauses in Alzheimer’s discourse. Am J Alzheimers Dis Other Dement. (2009) 24:141–54. doi: 10.1177/1533317508328138
74. Robin, J, Xu, M, Kaufman, LD, and Simpson, W. Using digital speech assessments to detect early signs of cognitive impairment. Front Digit Health. (2021) 3:749758. doi: 10.3389/fdgth.2021.749758
Keywords: speech, linguistic, neurodegenerative diseases, digital health, clinical symptoms
Citation: Gumus M, Koo M, Studzinski CM, Bhan A, Robin J and Black SE (2024) Linguistic changes in neurodegenerative diseases relate to clinical symptoms. Front. Neurol. 15:1373341. doi: 10.3389/fneur.2024.1373341
Edited by:
Hiroko H. Dodge, Massachusetts General Hospital and Harvard Medical School, United StatesReviewed by:
Liu Chen, Massachusetts General Hospital and Harvard Medical School, United StatesCharalambos Themistocleous, University of Oslo, Norway
Copyright © 2024 Gumus, Koo, Studzinski, Bhan, Robin and Black. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Melisa Gumus, melisa.gumus@camcog.com