- 1Department of Speech, Language and Hearing Sciences, Indiana University Bloomington, Bloomington, IN, United States
- 2Program in Neuroscience, Indiana University Bloomington, Bloomington, IN, United States
- 3Department of Communication Sciences and Special Education, University of Georgia, Athens, GA, United States
- 4Department of Communication Sciences and Disorders, University of Delaware, Newark, DE, United States
- 5Delaware Center for Cognitive Aging Research, University of Delaware, Newark, DE, United States
Purpose: Mild language impairments experienced by adults with neurogenic communication disorders are often difficult to detect due to the lack of sensitive traditional performance-based measures. This is problematic since many adults who have mild language deficits experience daily activity and participation limitations that are undetected and not managed. This study evaluates the potential for variables derived through core lexicon analysis to differentiate two clinical groups (latent aphasia, MCI) from each other, and from a cognitively healthy adult group, across three different discourse tasks (Aim 1). Innovatively, it also contrasts the sensitivity with which each task differentiates the groups based on this metric (Aim 2).
Methods: Transcribed connected speech data from TalkBank were analyzed for three discourse tasks (i.e., Sandwich Procedure, Cat Rescue Picture Description, and Cinderella Story) from three participant groups [Mild Cognitive Impairment (MCI) n = 30, stroke-induced latent aphasia n = 29, and Cognitive Healthy Adults (CHA) n = 56]. Aim 1 used one-way ANOVAs (or non-parametric equivalents) to identify differences in lexical variables (total number of core lexical items; proportion of core lexical items out of all words produced; and rate of core lexical items produced per second) between participant groups. Aim 2 used linear discriminant analysis with cross validation to characterize the sensitivity of discourse task in identifying lexical variables differentiating the participant groups.
Results: Univariate analysis revealed significant differences among the three participant groups. During the Cinderella task, the latent aphasia and MCI groups produced significantly fewer core lexical items than CHAs, while their proportion of core lexical items to total tokens was higher than CHAs. The latent aphasia group produced core lexical items more slowly than the MCI group for all three discourse tasks. Finally, individuals with latent aphasia produced significantly fewer core lexical items during the Sandwich task than either the MCI or CHA groups. Aim 2’s sensitivity analysis revealed that number of core lexical items produced during the Cinderella task best differentiated the MCI group from CHAs, number of core lexical items produced during Sandwich best differentiated latent aphasia from CHAs, and core lexical items per second during Cinderella best differentiated latent aphasia from MCI.
Conclusion: Our study suggests that the Cinderella story is more sensitive than a picture description task for demonstrating the subtle lexical-semantic changes in MCI and latent aphasia compared to CHAs. Core lexicon appears to be a sensitive discourse metric to identify linguistic differences between CHAs and individuals with mild cognitive and/or language deficits. These findings further support calls to provide speech/language and cognitive therapy to individuals with MCI and/or latent aphasia.
Introduction
People with very mild language impairments frequently experience significant activity and participation limitations (Cavanaugh and Haley, 2020). These very mild language impairments are often not discernable using traditional performance-based neuropsychological tests (e.g., standardized language and cognitive norm-referenced measures), which lack adequate sensitivity. However, it is important that these individuals receive speech/language services to reduce the negative impacts of their language impairment Referrals cannot happen, and adequate treatment plans cannot be made, unless there are metrics sensitive enough to detect mild impairments. Two clinical groups that are at particular risk of having unmet rehabilitation needs include individuals with mild cognitive impairment (MCI) and individuals with latent aphasia following stroke.
MCI is a clinical syndrome characterized by cognitive decline that is greater than expected for an individual’s age and education level but does not significantly interfere with daily functioning (American Psychiatric Association, 2013). Language impairments in MCI can manifest in various ways such as reduced verbal fluency (Weakley et al., 2013) or errors in confrontation naming (Ahmed et al., 2013). In older adults, language decline may be indicative of underlying neurodegenerative diseases, such as Alzheimer’s disease, and may predict progression to further cognitive impairment earlier than memory decline (Mueller et al., 2018b; Oulhaj et al., 2009). Diagnosing someone early in the disease process is not only difficult due to the lack of sensitive measures (Petersen and Yaffe, 2020) but also the expertise and time needed for conducting a comprehensive evaluation (Alzheimer’s Association, 2022). For example, a recent study found that the Montreal Cognitive Assessment (MoCA; Nasreddine et al., 2005), a common cognitive screening tool used by speech-language pathologists (SLPs) (Lanzi et al., 2023; Roitsch et al., 2021) does not have adequate sensitivity or reliability to detect mild language/communication challenges in adults with MCI (Stagge et al., 2024). Early detection and monitoring of language impairments in MCI is especially crucial for appropriate intervention and management strategies, e.g., early uptake of lifestyle management strategies (Chandler et al., 2019; Clark et al., 2005; Cohen et al., 2021; Ropacki et al., 2017; Sabbagh et al., 2020).
The second group at increased risk of unmet rehabilitation needs are individuals who have experienced a left hemisphere stroke but score above the cut-off for aphasia on standardized language assessments. These individuals have historically received little attention in the research literature, either by being excluded for not having aphasia or by being grouped with individuals with anomic aphasia (e.g., Cruice et al., 2014; Papanicolaou et al., 1988; Sekine and Rose, 2013; Sekine et al., 2013). However, increased attention has been paid to this group recently (Crutch and Warrington, 2003; Cunningham and Haley, 2020; Dalton and Richardson, 2015, 2019; DeDe and Salis, 2020; Fromm et al., 2017; Law et al., 2015; Martzoukou et al., 2023; Richardson et al., 2018, 2021; Salis and DeDe, 2022; Silkes et al., 2020). The terminology used to refer to this group has varied widely, but there appears to be a coalescence in the literature around the term “latent aphasia” (e.g., DeDe and Hoover, 2021; DeDe and Salis, 2020; Martzoukou et al., 2023; Salis and DeDe, 2022; Silkes et al., 2020; Zhang et al., 2024). We prefer this label since (1) it does not refer to a specific clinical assessment (in contrast to another common label, “not aphasic by WAB [Western Aphasia Battery]”); and (2) the language impairments observed in this group are consistent with the definition of latent as a “quality or state existing but not yet manifest” and “lying dormant or hidden until circumstances are suitable for development or manifestation” (Oxford Languages). Individuals with latent aphasia produce discourse that is significantly different from healthy controls who have not experienced a stroke. For example, individuals with latent aphasia demonstrate reduced typicality and informativeness, fewer utterances, lower lexical diversity and lexical entropy, reduced efficiency, longer formulation time, longer silent pauses, and slower speech rate than healthy controls (e.g., Cunningham and Haley, 2020; Dalton et al., 2020a; DeDe and Salis, 2020; Fromm et al., 2017). Continued investigations that focus on improving identification of latent aphasia are warranted given these findings.
Discourse and lexical access
Discourse, or language production beyond the sentence level produced for a specific purpose (Armstrong, 2000), is a fundamental component of communication. Successful discourse production relies upon a complex interplay between cognitive and linguistic processes. It comprises microlinguistic (e.g., syntax, phonology, lexical-semantics) and macrostructural components (e.g., topic management, story grammar) (Dipper et al., 2021), as well as cognitive functions such as semantic memory (Tchakoute et al., 2017) and executive function (Cannizzaro and Coelho, 2013; Dutta et al., 2024). Because of the long history of eliciting these more complex communication behaviors in patients with dementia and stroke, many different terms have been used, including “connected speech,” “discourse,” “language,” or “connected language.” Connected speech is perhaps the most frequently used term and highlights the historical focus on microlinguistic measures of production (such as type/token ratio, words per minute and prosodic factors), to the exclusion of more complex language (e.g., inferencing, organization, and sequencing) or cognitive (e.g., judgment, reasoning, problem-solving) processes. We use the term “discourse” since successful communication at this level requires speech, language, and cognitive processes working in tandem.
Discourse provides complementary and extended information to that collected through typical neuropsychological performance-based language measures (e.g., verbal fluency tasks, confrontation object naming). Research suggests that discourse analysis may be sensitive to early neuropsychological changes during the MCI phase, in subjective cognitive decline, and in subtle presentations of language impairments after brain injury (Ahmed et al., 2013; Berisha et al., 2015; Garrard et al., 2005). Of particular interest to the current study is that discourse analysis may sensitively demonstrate language changes in MCI from those associated with cognitively healthy aging, as well as to differentiate whether language is ‘sufficiently’ impaired post-stroke to warrant a referral for speech-language therapy services.
While discourse can be both interactional and monologic, the predominant focus in clinical practice and research has been on monologic discourse. Within monologs, single picture description tasks are the predominant means of elicitation for both individuals with MCI (Filiou et al., 2020; Mueller et al., 2018a) and those with aphasia (Bryant et al., 2016). Single picture description tasks are included in most standardized assessment batteries for individuals with communication impairments (e.g., Goodglass and Kaplan, 1972; Kertesz, 2007; Wilson et al., 2018). In this type of task, a relatively complex visual scene is presented to the client, and they are asked to talk about it. Importantly, specific elicitation instructions vary (e.g., with some tasks specifying to “speak in full sentences,” and others lacking precise instructions), which previous research shows may have an impact on discourse production (Wright and Capilouto, 2009). In the MCI population, it has been argued that using single picture descriptions minimizes demands on potentially impaired memory systems and more sensitively demonstrates language, rather than memory, difficulties (Mueller et al., 2018a). Giving someone a picture to describe constrains the language that can be produced, therefore requiring individuals to access specific vocabulary to successfully describe the picture. This restriction on the ‘correct’ vocabulary can demonstrate phonological and lexical-semantic impairments, in that persons with stroke or MCI may be unable to produce the relevant vocabulary, produce unrelated or vague vocabulary (e.g., “that one”), or produce various errors (e.g., “dog” instead of “lion”). Despite these strengths, single picture description tasks also have drawbacks. They tend to require little cognitive effort and therefore may elicit language that does not demonstrate subtle clinical changes. For example, in a group of persons with and without aphasia, a single picture description was shown to demonstrate lower propositional idea density (e.g., fewer semantically relevant words) than a narrative task (retelling a well-known story) (Stark, 2019) which poses a problem for documenting changes in MCI. Indeed, propositional idea density has been widely examined in studies of dementia and aging (Kemper et al., 2001; Snowdon, 1996), with individuals at risk for cognitive decline or Alzheimer’s disease showing a decline in propositional idea density and syntactic complexity. Finally, a recent study investigated the utility of discourse variables derived from a single picture description task to differentiate between cognitively healthy adults and individuals with MCI (Mefford et al., 2023). Results indicated that this task had variable sensitivity to group differences by MCI subtype (amnestic vs. non-amnestic) and/or by the proportion of various subtypes in an undifferentiated MCI population (Mefford et al., 2023).
Other common ways of collecting discourse samples include the retelling of fictional, well-known (“familiar”) narratives. The person is typically presented with a wordless picture book (e.g., Cinderella), and asked to retell the story with any information they knew about the story beforehand and what they had just seen in the book (MacWhinney et al., 2010; Saffran et al., 1989). Another common method involves the description of a procedure, where the participant is asked to tell the listener “how to do” something. Procedural narratives are particularly interesting for demonstrating communicative competence via multimodal communication, because these narratives typically produce spatial language that describes the position, relationship, and movement of objects (likely because they draw on implicit, motor memory) and also associate with co-speech, meaningful gesture (Pritchard et al., 2015; Stark and Cofoid, 2022; Stark and Oeding, 2024).
Discourse analysis
Early detection and monitoring of language impairments in MCI is especially crucial for appropriate intervention and management strategies (Chandler et al., 2019; Cohen et al., 2021; Ropacki et al., 2017; Sabbagh et al., 2020). Further, persons with stroke (who may also have latent aphasia) are at a heightened risk for development of MCI (e.g., Sachdev et al., 2009) and dementia (Kuźma et al., 2018). It is therefore extremely important to sensitively monitor for transition from a primarily stable clinical state (e.g., chronic stroke-induced latent aphasia) to a progressive clinical state (e.g., chronic stroke-induced aphasia and dementia diagnosis).
Burgeoning research has demonstrated that individuals with latent aphasia produce discourse distinct from healthy control speakers and speakers with anomic aphasia during narrative discourse. Individuals with latent aphasia make more word errors, speak more slowly, and produce decreased essential content compared to cognitively healthy peers (Fromm et al., 2017). Others have also found increased silent pause duration in individuals with latent aphasia during narrative tasks (DeDe and Salis, 2020). Growing research suggests that there are measurable differences in specific microlinguistic processes (e.g., fluency and semantics) between cognitive healthy adults and clinical syndromes from Alzheimer’s disease (Filiou et al., 2020; Mueller et al., 2018b), though differentiation in discourse-level (e.g., coherence) features of MCI from cognitively healthy aging adults has seen mixed results (Bschor et al., 2001; Toledo et al., 2018). These mixed finding may be the result of the majority of analyses only evaluating a single picture description task that elicits a relatively short language sample (e.g., Cookie Theft) (e.g., Lanzi et al., 2023; Mueller et al., 2018b). Therefore, there is a distinct gap in knowledge about the extent to which different discourse tasks can be leveraged to differentiate persons with clinical diagnoses that present with subtle language impairments (latent aphasia; MCI) from cognitively healthy adults.
While clinicians and researchers recognize the importance of discourse analysis and report a desire to use it (Bryant et al., 2017; Cruice et al., 2020; Stark et al., 2021), it is not standard clinical (or research) practice because a variety of barriers exist, e.g., lack of training, lack of tools, and lack of time (Stark et al., 2021; Stark and Dalton, 2024). Transcript-based analysis is the gold standard for comprehensively analyzing discourse, but transcribing at a detailed level is rarely feasible in clinical settings, and the level of detail contained in research-setting transcripts (e.g., phonetic analysis) may not be necessary for clinical decision making. For this reason, metrics that are easy to extract (either from a quick transcript or perceptually) are ideal in clinical settings (Stark and Dalton, 2024).
Core lexicon analysis
Core lexicon analysis is a discourse metric that evaluates context-specific access to lexical-semantic knowledge (Dalton et al., 2020a,b). For a given discourse task, a core lexicon is comprised of typical lexical items produced by cognitively healthy persons (Dalton et al., 2020a,b; Kim and Wright, 2020). A unique aspect of core lexicon analysis compared to similar measures, such as correct information units (Nicholas and Brookshire, 1993), is the focus on the typicality of vocabulary used. For example, during the Cinderella task, cognitively healthy controls tend to use lexical items such as “prince” and “slipper,” to describe core components of the story (Dalton et al., 2020a,b). Using less specific vocabulary such as “man” or “shoe” in place of these core lexical items would still be broadly informative but leads to a less rich and nuanced discourse production. Indeed, core lexicon measures have been shown to correlate with main concept production, suggesting that single lexical items can reflect broader ‘main ideas’ (sometimes called ‘macropropositions’) constructed during discourse (Dalton and Richardson, 2015). As such, core lexicon analysis provides an interface between linguistic and propositional levels of discourse.
Recent work has suggested that evaluating core lexical items produced during discourse may be a sensitive way to evaluate longitudinal change from acute to chronic stage in post-stroke aphasia (Kim et al., 2022) and has suggested that core lexicon production associates with dementia severity in persons with Alzheimer’s disease (Kintz et al., 2024). Indeed, the subtle change in the ability to access topic-relevant lexical-semantic vocabulary has been documented in persons with Alzheimer’s disease (Mueller et al., 2020) and latent aphasia (Dalton and Richardson, 2015), though not yet systematically characterized across discourse tasks or across clinical groups. More recent work from Chen et al. (2024) suggests that core lexical access, in particular, is the most sensitive in differentiating language between individuals with MCI and cognitively healthy adults. This study evaluated n = 16 English-speaking participants with MCI and n = 16 matched cognitively healthy adults, examining eight lexical-semantic features across four discourse tasks (two picture descriptions, a familiar story narrative [Cinderella], and a procedural narrative) (Chen et al., 2024). Univariate analyses showed inter-group differences in core lexicon and other variables, depending on the task. Multivariate pattern analysis demonstrated that the Cinderella narrative was the only task that discriminated the two groups above chance (65.6%), and when examining the lexical-semantic features that drove this, identified that the MCI group produced fewer core lexical items. The authors suggest that this finding shows that individuals with MCI exhibit a decrease in lexical diversity and richness in a story recall setting.
While both individuals with MCI and individuals with latent aphasia present with lexical-semantic impairments behaviorally, it is important to consider the underlying cognitive changes driving performance between the groups. For individuals with latent aphasia, the lexical-semantic impairment is likely a result of difficulty accessing mental representations of the lexicon, although mental representations remain intact. On the other hand, lexical-semantic impairment in individuals with MCI may arise via several routes. For individuals with amnestic MCI, lexical-semantic impairments may be a result of deficits in working or semantic memory. For individuals with non-amnestic MCI, lexical-semantic impairments may be driven by executive function, memory, or language deficits.
Core lexicon lists have been developed for discourse tasks commonly used to evaluate clinical samples (Dalton et al., 2020a,b). Of particular note and the main impetus of the current study is that burgeoning evidence supports the feasibility and utility of core lexicon analysis in clinical settings (Dalton et al., 2020a,b; Kim and Wright, 2020). Preliminary research suggests that, once clinicians are familiar with the core lexical item checklists for the discourse samples that they typically utilize, scoring can be completed without transcripts and potentially in real time (Dalton et al., 2020a,b). Given this high potential for clinical utilization, additional investigations of core lexicon’s sensitivity to group differences are warranted.
Motivation and study aims
This study evaluates the potential with which variables derived through core lexicon analysis can differentiate two clinical groups (latent aphasia, MCI) from each other, and from a cognitively healthy adult group, across three different discourse tasks (Aim 1). Innovatively, it also contrasts the sensitivity with which each task differentiates the groups based on this metric (Aim 2).
Methods
Methods details were guided by the best practices for publishing on spoken discourse in aphasia (Stark et al., 2022); a table with page numbers highlighting each best practice can be found in the Supplementary material.
Participants
All participants were drawn from the English corpora of the TalkBank consortium, specifically, DementiaBank (Lanzi et al., 2023) and AphasiaBank (MacWhinney et al., 2011). Authors BCS and AL directly contributed data: BCS to the latent aphasia group and cognitively healthy adult group (The NEURAL Research Lab Corpuses in AphasiaBank, ethical approval from Indiana University), and AL to the MCI group and cognitively healthy adult group (The Delaware Corpus in DementiaBank, ethical approval from the University of Delaware). All other data came from the consortium, which were provided by authors who had their own established IRBs.
Participant groups were relatively matched for age, gender, education, and race/ethnicity, as seen in Table 1 which reports participant demographic and neuropsychological information. Thirty individuals with MCI, 27 with Latent Aphasia, and 56 Cognitive Healthy Adults were ultimately included.
Inclusion
For homogeneity of age, and to reflect typical diagnostic ranges of stroke aphasia and MCI, individuals were included if they were aged 50–79 years.
Individuals with latent aphasia—defined as testing above a standard aphasia cut off on a standardized battery [the Western Aphasia Battery-Revised or Western Aphasia Battery Bedside (Kertesz, 2007)]—were identified across all corpora in the AphasiaBank database. All had impacted language as a result of a stroke; some individuals had more than one stroke, but the number of strokes (and location of stroke) was not always reported. Motor speech information was limited for participants.
Individuals with amnestic MCI were identified through the Delaware Corpus of DementiaBank. Amnestic MCI status was documented in the database and was based on the National Institute on Aging and Alzheimer’s Association working group (Albert et al., 2011) and Petersen (2004) criteria. That is, individuals with amnestic MCI produced at least one cognitive test score 1.5 standard deviations below age-and-demographically matched cognitively healthy adults but were functionally independent as measured by the Clinical Dementia Rating Scale obtained from a structured interview with a study partner (Morris, 1993). Since all participants presented with amnestic MCI, the groups’ primary cognitive deficits were with memory function.
Cognitively healthy adults (CHA) were selected from the control samples within DementiaBank and AphasiaBank, across various corpora. Potential participants were selected based on the age and sex distribution of the two clinical groups. All cognitively healthy adults had either a Mini Mental State Exam or Montreal Cognitive Assessment score in the typical range.
Exclusion
Individuals were excluded from the analysis if they did not have language data for the three tasks of interest, described below, or were otherwise missing demographic or neuropsychological testing data.
Elicitation materials and database
Transcribed speech data were already available in AphasiaBank and DementiaBank for participants described above. These transcriptions undergo reliability verification prior to inclusion in the database, though the methodology differs by the contributing lab. The general process is that each contributing lab completes in-house reliability checks, then sends transcription and audio or video data to the TalkBank team, who then double-check the transcriptions for accuracy.
Transcribed speech data were analyzed for three discourse tasks: Cat Rescue description (a single picture description), the Cinderella narrative (a familiar, fictional story retell with no pictorial cues during the retelling), and the Sandwich narrative (a procedural narrative where individuals tell how to make a peanut butter and jelly sandwich with no pictorial cues). Transcriptions were available in AphasiaBank and DementiaBank and were not further checked for the purposes of this study since previous research has reported high fidelity of transcription (Dalton and Richardson, 2015). The transcriptions contained orthographic and phonetic information and were coded using Codes for the Human Analysis of Transcripts (CHAT) (MacWhinney, 2000). For full task instructions and pictures given to participants, see the AphasiaBank and DementiaBank protocols, located on their respective websites. Media and transcripts for all participants are available through consortium membership to AphasiaBank and DementiaBank.
Dependent variable summary
Core lexical analysis is a clinically feasible tool for evaluating lexical-semantic, context-specific knowledge and has been used to evaluate a variety of different spoken discourse prompts and procedures in aphasia (Dalton et al., 2020a,b; Dalton and Richardson, 2015; Kim et al., 2022; Kim and Wright, 2020) and dementia (Kintz et al., 2024). In this study we used previously established core lexicon checklists for the Cat Rescue, Cinderella, and Sandwich discourse tasks (Dalton et al., 2020a,b).
As such, core lexical information was modeled in three ways: (1) raw number of core lexicon items produced, reflecting topic-relevant lexical-semantic access; (2) core lexicon items as a proportion of total tokens (similar to a metric of lexical diversity but with an emphasis on typicality), reflecting the extent to which the information produced was topically informative; and (3) rate of core lexicon items produced, modeled per second of speech, reflecting the extent to which topic-relevant lexical-semantic information was produced in an efficient manner.
These three variable iterations were chosen because of their potential clinical feasibility. Studies have demonstrated that the number of core lexical items is able to be collected in real time for the elicitation materials used in our study (Dalton et al., 2020a,b; Kim and Wright, 2020) and the proportion of core lexical items and core lexical items per second can be easily calculated post hoc by dividing the total core lexical items by total words and total seconds spoken. It is common in clinical and research settings to record spoken discourse in order to do analyses later (Bryant et al., 2017; Stark et al., 2021), so ascertaining these variables is an additional step, but straightforward. Further, the efficiency of discourse production (modeled as variables per second or minute) has been before shown to be a sensitive measure across other variables, like correct information units (Boyle et al., 2022; Doyle et al., 1995; Nicholas and Brookshire, 1993; Stark et al., 2023).
Extracting lexical variables from transcripts using CLAN
While core lexical items can be scored by hand in a clinic setting (Dalton et al., 2020a,b), this analysis opted to automatically extract them from the pre-created transcripts (Dalton et al., 2022). The Computerized Language Analysis Program (CLAN) program was used to automatically extract the dependent variables described above (MacWhinney, 2000). CLAN version 19jul23 on Windows was used. Table 2 describes the list of CLAN commands that were run on the transcripts for all participants.
Analysis
All analyses were run using RStudio 2023.12.1 Build 402 and R version 4.3.2 (2023-10-31 ucrt). Project analyses and de-identified data are available on the Open Science Framework.1
Aim 1: identify differences in lexical variables between participant groups
It was determined that measures did not conform to linear ANOVA assumptions (via Bartlett and Levene’s tests), and thus a Brown-Forsythe one-way ANOVA for unequal variances was used (via R package onewaytests). To run this analysis, an interaction term between Group and Task was created. Then, the interaction term was modeled alongside each dependent variable. If the ANOVA was significant, a post hoc Games Howell Test was employed via rstatix package to examine significant pairwise comparisons. The post hoc Games Howell Test took into account multiple comparisons.
To evaluate the actual core lexical items produced by participants of each group, one-way ANOVAs (or non-parametric equivalents) were run between the core lexical items (the number varied by task) across groups. This enabled us to evaluate the impact of the participant group on the percentage of group members producing an item from each task’s core lexicon at least once. To identify the achieved power of each one-way analysis, a sensitivity analysis was computed for each variable using harmonic means to account for different sample sizes per group per task and the Cohen’s f from each ANOVA.
Aim 2: task sensitivity in identifying lexical variables differentiating the three participant groups
The second primary analysis evaluated the extent to which dependent variables enabled supervised classification into participant groups, by task. Latent discriminant analysis using the MASS and caret packages in R was employed. To avoid overfitting, 50% cross validation was used (i.e., the model was trained on 50% of data and then tested on 50% of the remaining, unseen data). Of interest was the ability of the dependent variables to differentiate between the clinical groups (latent aphasia, MCI), and between the clinical groups and non-clinical group (latent aphasia, CHA; MCI, CHA). Latent discriminant analysis was only conducted for pairwise comparisons that were indicated to be significantly different in Aim 1. As is best practice, prediction accuracy is provided for each classification. The current study was not pre-registered.
Results
Aim 1: identify differences in lexical-semantic measures between subject groups
Brown-Forsythe one-way test (alpha = 0.05) indicated a significant difference across tasks and groups for the number of core lexicon items (F[8, 114.74] = 313.92, p < 0.00001, Cohen’s f = 2.89, achieved β > 0.99), proportion of core lexicon to total tokens (F[8, 162.61] = 5.02, p = 0.0001, Cohen’s f = 0.36, achieved β = 0.999), and core lexicon items per second (F[8, 179.94] = 11.71, p < 0.000001, Cohen’s f = 0.52, achieved β > 0.99). Therefore, Games-Howell post hoc tests with multiple comparison correction were performed for all dependent variable comparisons to evaluate significant pairwise differences between participant groups and for each task, described below. See Table 3 for full descriptive statistics of variables used in the one-way tests, and Table 4 for post hoc analyses. See Figure 1 for a visual comparison of significant variables across groups.
Figure 1. Visual demonstration of three core lexical variables, arranged by participant group and task. (A) Number of core lexicon items produced was most successful at differentiating the MCI group from the CHA group but only for the overlearned narrative Cinderella task, where the peer group produced a higher quantity of lexical-semantic information; (B) Proportion of core lexicon items was most successful in differentiating the MCI and the latent aphasia groups from the CHA group, but only for the overlearned narrative Cinderella task; and (C) Core lexical items produced per second was most successful at differentiating the latent aphasia from the MCI group across all tasks, where the latent aphasia group produced lexical-semantic information less fluently.
By task
Cat rescue
For core lexicon total items, there were no significant differences for any group comparisons: latent aphasia and MCI groups (1.84 [CI −1.41, 5.08], p = 0.66), latent aphasia and CHA groups (−1.89 [−4.97, 1.18], p = 0.55), or MCI and CHA groups (0.06 [−2.42, 2.53], p > 0.99). For the proportion of core lexicon items to total tokens, there was not a significant difference for any group comparisons: latent aphasia and MCI groups (−0.01 [−0.09, 0.06], p > 0.99), latent aphasia and CHA groups (0.036 [−0.04, 0.11], p = 0.799), or MCI and CHA groups (−0.02 [−0.08, 0.03], p = 0.93). For core lexicon items per second, there was a significant difference between latent aphasia and MCI groups (0.24 [0.01, 0.48], p = 0.039), with the latent aphasia group producing fewer core lexicon items per second than the MCI group. There was not a significant difference between latent aphasia and CHA groups (−0.15 [−0.37, 0.06], p = 0.38) or between MCI and CHA groups (−0.09 [−0.32, 0.14], p = 0.94).
Cinderella
For core lexicon total items, there was not a significant difference between the latent aphasia and MCI groups (6.12 [−5.82, 18.06], p = 0.77). There was a significant difference between the latent aphasia and CHA groups (−19.18 [−28.82, −9.54], p < 0.001) and MCI and CHA groups (13.06 [3.26, 22.86], p = 0.002). The CHA group produced more core lexical items than both clinical groups (Table 5). For the proportion of core lexicon items to total tokens, there was not a significant difference between the latent aphasia and MCI groups (−0.02 [−0.097, 0.05], p = 0.991). There was a significant difference between the latent aphasia and CHA groups (0.101 [0.04, 0.17], p = 0.0003) and MCI and CHA groups (−0.08 [−0.14, −0.02], p = 0.002). The CHA group produced a lower proportion of core lexical items to all tokens than either clinical group. For core lexicon items per second, there was a significant difference between the latent aphasia and MCI groups (0.17 [0.02, 0.33], p = 0.02), but no significant differences between the latent aphasia and CHA groups (−0.01 [−0.16, 0.14], p > 0.99) or MCI and CHA groups (−0.16 [−0.32, 0.002], p = 0.055).
Table 5. Core lexicon items produced at least once during the Cinderella story, modeled as a percentage of the subject group (CHA, MCI, Latent) who produced them.
Sandwich
For core lexicon total items, there was not a significant difference between the MCI and CHA groups (0.95 [−1.38, 3.29], p = 0.92), but there was a significant difference between the latent aphasia and CHA groups (−5.26 [−9.33, −1.19], p = 0.004) and latent aphasia and MCI groups (4.31 [0.006, 8.61], p = 0.049). For the proportion of core lexicon items to total tokens, there were no significant differences identified between any groups: latent aphasia and MCI (0.03 [−0.12, 0.17], p = 0.999), latent aphasia and CHA (0.02 [−0.12, 0.15], p > 0.99) or MCI and CHA (−0.04 [−0.14, 0.05], p = 0.84). For core lexicon items per second, there was a significant difference between the latent aphasia and MCI groups (0.44 [0.04, 0.85], p = 0.02), but no significant differences between the latent aphasia and CHA groups (−0.22 [−0.50, 0.06], p = 0.26) or MCI and CHA groups (−0.22 [−0.62, 0.18], p = 0.68).
Core lexicon item-level group analysis
Cat rescue
One-way ANOVA (because data met linear assumptions) was computed to explore the impact of subject group on percentage of group members producing an item from the Cat Rescue Core Lexicon at least once (Supplementary Table 1). There was not a significant impact of group on percentage of the group that tended to produce a core lexical item (F[2, 99] = 1.46, p = 0.24, Cohen’s f = 0.17), suggesting that no group produced a significantly different percentage of core lexical items. This analysis was underpowered (n = 34 core lexicon items; achieved β = 0.31).
Cinderella
A Brown-Forsyth one-way ANOVA (because linear assumptions were not met) was computed to explore the impact of subject group on percentage of group members producing an item from the Cinderella Core Lexicon at least once. A significant main effect of group was identified (F[2,249.45] = 25.08, p < 0.0001, Cohen’s f = 0.42). This analysis was well powered (n = 94 core lexicon items; achieved β > 0.999). Games-Howell post hoc tests identified that this main effect was driven by the pairwise differences between the latent aphasia and CHA groups (0.20 [0.13, 0.27], p < 0.000001) and MCI and CHA groups (0.14 [0.08, 0.20], p < 0.000001). That is, a higher percentage of the CHA group tended to produce at least one core lexicon item at a higher percentage than the latent aphasia and MCI groups. There was not a significant difference between the latent aphasia and MCI groups (0.06 [−0.02, 0.14], p = 0.14).
Sandwich
One-way ANOVA (because data met linear assumptions) was computed to explore the impact of subject group on percentage of group members producing an item from the Sandwich procedural narrative Core Lexicon at least once (Supplementary Table 2). There was not a significant impact of group on percentage of the group that tended to produce a core lexical item (F[2, 72] = 0.81, p = 0.45, Cohen’s f = 0.15), suggesting that no group produced a significantly different percentage of core lexical items. This analysis was underpowered (n = 25 core lexicon items; β = 0.19).
Aim 2: task sensitivity in identifying lexical-semantic measures differentiating the three participant groups
For only the tasks and dependent variables where there was a significant difference between two groups in one-way analyses, above, linear discriminant analysis with cross validation was used to evaluate the extent to which those metrics could classify participants into two groups (see Table 6 for prediction accuracy). The best group differentiators between the latent aphasia and MCI groups were fluency metrics, with similar prediction accuracy for core lexicon per second across all tasks. Cat Rescue (62.07%) and Cinderella (65.52%) had higher accuracy than Sandwich (55.17%). The best group differentiators between MCI and CHA groups were lexical access and diversity metrics, but only for the Cinderella task. Total core lexicon items (67.44%) and core lexicon proportion per total tokens (62.79%) had a similar accuracy for differentiating the clinical groups from the CHA group. The best group differentiators between latent aphasia and CHA groups were core lexicon items in Cinderella (78.57%) and Sandwich (80.95%), and proportion of core lexicon items in Cinderella (73.81%).
Table 6. Prediction accuracy percentage on unseen data (trained on 50% of data, tested on 50% unseen data).
Discussion
Overview of major findings
In the present study, which notably used three iterations of a single clinically accessible metric—core lexicon—showed good prediction accuracy (62–67%) in differentiating cognitively healthy adults from individuals with verified MCI using the Cinderella task (a fictional narrative). Our results also complement the recent results from Chen et al. (2024), which demonstrated the importance of evaluating the number of core lexical items produced during a familiar story narrative in differentiating between a small group of individuals with MCI from cognitively healthy adults. Prior research using ten variables extracted from discourse (a picture sequence description) has demonstrated 77% sensitivity and 80% specificity for predicting cognitive decline in individuals with cardiovascular disease (Roberts et al., 2021). Others have also found that variables derived from spoken discourse outperform data from standardized neuropsychological tests in differentiating individuals with MCI from their cognitively healthy peers (Sanborn et al., 2022). Our results also complement a review of evidence that evaluated performance of persons with MCI during picture description tasks and identified decline in semantic content (Mueller et al., 2018a). The evidence provided in our study suggests that lexical-semantic decline is indeed a noted decline in the MCI group, but that a discourse task that implicates declarative memory and produces more complex language, such as the Cinderella story, will be more sensitive than a picture description task for demonstrating subtle change in lexical-semantic decline compared to a matched cognitively healthy group. This is consistent with the amnestic presentation of the MCI cohort included in the current study.
Of particular interest was the ability of core lexicon to differentiate two groups with shared symptomology, e.g., word finding impairment and lexical-semantic degradation—those with post-stroke, latent aphasia, and those with verified MCI—which, to our knowledge, have not before been systematically differentiated. Prediction accuracy for group assignment was 65% on a procedural narrative for number of core lexicon items, and 65 and 62% on the Cinderella fictional narrative and picture description, respectively, for core lexical items per second. While the prediction accuracy in our study could be improved for differentiating these two groups, it does suggest that this is a fruitful area of future research, especially using a metric that can be ascertained in a clinical setting with relative ease.
The latent group could be differentiated from the cognitively healthy adult group with higher accuracy, for number of lexical items (78.5% on the Cinderella narrative, 81% on the procedural narrative) and proportion of core lexical items (74% on Cinderella narrative). This finding firmly refutes the clinical status quo, which is that individuals scoring above a certain criterion on established aphasia tests are not referred to or are excluded from speech and language services. Our study clearly demonstrates that, on a single metric of lexical-semantic knowledge, individuals with latent aphasia are performing worse than their cognitively healthy peers and suggests that current clinical standards reform their benchmarks in order to make services available to these individuals.
Expanding what is known about language decline in individuals with mild cognitive impairment
Research by Toledo et al. (2018) evaluated topical propositions (e.g., main concept production) in the Cinderella narrative between cognitively healthy adults, adults with amnestic MCI, and adults with Alzheimer’s disease, but did not find a significant difference in number of propositions between the MCI and cognitively healthy control group. Similarly, a study by Drummond et al. (2015) had cognitively healthy adults, adults with amnestic MCI, and adults with Alzheimer’s disease retell a visually presented narrative, and did not find a significant difference in the number of topical propositions between the MCI and cognitively health group. Our results significantly expand upon this prior literature, suggesting that core lexicon analysis may be a more sensitive way to evaluate differences between cognitively healthy adults and adults with MCI, in both raw number of core lexicon items produced as well as the proportion of core lexicon items to total words. Interestingly, Drummond et al. (2015) also had an index – which they called the ‘discourse effectiveness index’—that reflected a similar idea to our core lexicon proportion metric. Their discourse effectiveness index was obtained by dividing the total number of words by the number of topical propositions, and this index did effectively differentiate all three of their groups (cognitively healthy, MCI, AD). This provides added support to our finding that core lexicon proportion also appeared to differentiate both the MCI and the latent aphasia group from the cognitively healthy group during a narrative task. That is, the informativeness or quality of the typical and accurate lexical-semantic information produced during a story retelling narrative has the potential to be a sensitive indicator of subtle language change in clinical groups.
When comparing the current results to prior research in MCI and AD, it is important to consider the impact of task instructions on discourse production. The majority of research in MCI and Alzheimer’s disease has used the “Cookie Theft” picture description task from the Boston Diagnostic Aphasia Examination (BDAE; Goodglass and Kaplan, 1972). Elicitation for this task instructs individuals to “tell me everything you see going on in this picture.” From a linguistic perspective, these instructions are more likely to produce discourse with limited macrostructural organization and lower coherence since there is no impetus to tell a narrative. Instead, the production is characterized by utterances with simple syntactic structures that list people, actions, and objects, with limited links between them (Wright and Capilouto, 2009). From a cognitive perspective, these instructions likely reduce cognitive effort, since development and production of a narrative requires greater contributions of attention, memory, and executive function than producing relatively independent utterances. These instructions may reduce the confound between cognitive impairments and language impairments in individuals with MCI or AD, allowing for a more precise understanding of how neurodegeneration specifically impacts language. However, these insights are limited to primarily microlinguistic features of language, given the limited macrostructural organization.
In this project, we investigated discourse produced in response to the “Cat Rescue” single picture stimulus. Elicitation for this task instructs individuals to, “Look at everything that’s happening and then tell me a story about what you see. Tell me the story with a beginning, a middle, and an end.” These instructions elicit a more complex discourse sample which includes more macrostructural organization, coherence, and cohesive ties (e.g., use of pronoun referents, temporal links, and conjunctions). As such, the individuals with MCI in this study likely experience increased cognitive demands during the picture description task than individuals in studies which use the Cookie Theft stimulus (or other stimulus with the same or similar elicitation instructions). Despite the likely increased cognitive demands of the picture description task used here, our results are consistent with other research in MCI which has found limited sensitivity of picture description tasks to differentiate cognitively healthy controls from individuals with MCI. It may be that access to the visual stimulus throughout picture description tasks provides sufficient support to overcome the mild decrements in cognition and language experienced by individuals with MCI. Indeed, the Cinderella story retell is both the longest, most complex task, and the only narrative task that does not include visual support during the retell (individuals review a wordless picture book of the Cinderella story, but it is removed prior to beginning the retell). As such, the cognitive components (in particular, declarative and working memory) are drawn upon most by the Cinderella task in comparison to the picture description task, where the visual stimulus is present throughout, and the procedural task, which involves declarative memory but which is likely scaffolded by implicit, motor memory.
Expanding what is known about language ability in individuals with latent aphasia
The latent aphasia group was sensitively differentiated from the cognitively healthy group using core lexical item production during both Cinderella (familiar, fictional) and Sandwich (procedural) narratives, but notably, not during the Cat Rescue single picture description. In a study by Fromm et al. (2017), it was also demonstrated that the Cinderella narrative was a sensitive task for demonstrating language impairments in persons with latent aphasia. Specifically, persons with latent aphasia tended to produce fewer utterances, reduced lexical diversity, less fluent speech (words per minute), and fewer main concepts than age-similar cognitively healthy adults. Findings from the Fromm et al. (2017) study were further confirmed by DeDe and Salis (2020), who evaluated the discourse of a different group of persons with latent aphasia during the Cinderella story. They also found that the latent aphasia group differed from controls in total production (number of words) and in speech rate. Similarly, Cunningham and Haley (2020) demonstrated a significant difference in lexical diversity on the Cinderella task between a group with latent aphasia and a cognitively healthy adult group. Finally, Salis and DeDe (2022) reported that individuals with latent aphasia demonstrate longer pauses within utterances than cognitively healthy adults during the Cinderella story (although the syntactic complexity of utterances did not differentially affect pause length within utterances between the two groups), which they interpret as evidence of the mild cognitive and language impairments experienced by individuals with latent aphasia. While these studies all confirmed existing, subtle language impairments in persons with latent aphasia, none attempted to predict or classify the extent to which different discourse tasks adequately separated the latent aphasia group from the cognitively healthy group, especially using variables extracted from a single, clinically feasible measure (core lexicon). The total number of core lexical items on both Cinderella and Sandwich narratives exceeded group prediction of 78%, suggesting real potential of this measure and narrative tasks to sensitively identify subtle impairments in the latent group.
Moving away from evaluating only single picture descriptions clinically and in research
In the current study, the Cat Rescue picture description was least successful at differentiating the three groups in one-way analyses, and subsequent supervised classification demonstrated its limited effectiveness in predicting group membership based on core lexical variables (see Tables 4, 6). Our findings add to a burgeoning literature that demonstrates the limited utility of using single picture descriptions to accurately and sensitively detect subtle language impairments, such as those demonstrated by individuals with MCI and individuals with latent aphasia, and urges researchers and clinicians to additionally collect data using tasks that draw upon macrostructural and cognitive resources, such as narratives. A recent article evaluated a large number of lexical, semantic, and syntactic variables extracted using Natural Language Processing across several monolog discourse genres in n = 25 cognitively healthy adults, n = 25 individuals with MCI, and n = 25 individuals with Alzheimer’s disease (Clarke et al., 2021). Findings complemented our own, in that the Cinderella narrative had the highest accuracy (0.78) and sensitivity (0.75), outperforming picture description (accuracy, 0.76; sensitivity, 0.69), procedural narrative (accuracy, 0.74; sensitivity, 0.78), conversational speech (accuracy, 0.66; sensitivity, 0.62) and novel narrative retelling (accuracy, 0.62; sensitivity, 0.53). This was likewise true when examining cognitively healthy controls versus those with MCI only, with the Cinderella narrative once again achieving the highest balanced accuracy, sensitivity, and specificity. Clarke et al. (2021) used 286 linguistic features derived via automated linguistic analyses, which is not clinically feasible, but does present evidence that using overlearned narratives, which likely draw heavily on cognitive, microlinguistic, and macrolinguistic processes, is warranted for populations with subtle language impairments.
The findings presented in this paper suggest some clinical feasibility, as well. Calculating the total core lexicon items is likely the most feasible of the three dependent variables, as the other two variables (proportion of total words and per second) involve a subsequent step, i.e., transcribing and/or recording the sample and then analyzing the sample. Since the tallying of total core lexicon items can be done “live” (Dalton et al., 2020a,b; Kim et al., 2022), and was shown to be beneficial for differentiating the latent aphasia group from the cognitively healthy adult group for both Cinderella and Sandwich, and the MCI from the cognitively healthy group for Cinderella, practitioners can uptake this practice to improve discourse assessment sensitivity when evaluating for lexical-semantic impairment.
Future directions
A future, longitudinal, prospective design is the next step to establish the efficacy of core lexical variables in being an early identifier of cognitive decline in individuals with cerebrovascular disease and stroke. In addition, the specificity of these tasks to identify only individuals with language or cognitive changes should be determined to ensure over-referral of individuals with intact language and cognition does not occur. Finally, combination of core lexicon metrics (i.e., total, proportion and rate variables modeled together) and combination of core lexicon with other discourse-level metrics [e.g., main concept analysis (Nicholas and Brookshire, 1995)] will be imperative for identifying sensitivity (and specificity) for discourse outcomes for these populations.
Limitations
As with many studies of individuals with aphasia and MCI, the sample included here had restricted racial and ethnic diversity. This reduces the ability to generalize these findings to diverse patients with latent aphasia or MCI. While the field of aphasiology has begun to recognize the importance of including diverse populations in research, systemic barriers continue to persist in the recruitment and retention of minoritized groups. This is particularly evident in database driven research, since the growth of a database relies upon voluntary contributions to amass a sufficiently large sample to allow for investigation.
In addition, the number of individuals with latent aphasia and MCI in the study, while generally consistent with sample sizes seen in the extant literature, may have impacted the sensitivity of the linear discriminant analysis, especially given the overlap between the three groups. Cross validation was used to ameliorate some concern. Given the promising results reported here, further investigation with larger sample sizes is warranted.
Conclusion
In this study we demonstrated the utility of core lexicon analysis to differentiate between groups of cognitively healthy adults and adults with latent aphasia or MCI during production, with the overlearned Cinderella narrative having the most sensitivity in doing so. In sum, core lexicon analysis appears to be a sensitive, and potentially clinically feasible way to identify differences between cognitively healthy adults and individuals with mild cognitive and/or language deficits. This has the potential to increase access to rehabilitation for these individuals, thereby improving participation and quality of life.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary material, further inquiries can be directed to the corresponding author.
Ethics statement
The studies involving humans were approved by the Indiana University IRB and University of Delaware IRB. The studies were conducted in accordance with the local legislation and institutional requirements. The participants provided their written informed consent to participate in this study.
Author contributions
BS: Conceptualization, Data curation, Formal analysis, Methodology, Resources, Visualization, Writing – original draft, Writing – review & editing. SD: Formal analysis, Methodology, Writing – original draft, Writing – review & editing. AL: Data curation, Formal analysis, Methodology, Writing – original draft, Writing – review & editing.
Funding
The author(s) declare financial support was received for the research, authorship, and/or publication of this article. This work was supported by the National Institute of Aging of the National Institutes of Health under Grant 1RF1AG083823 awarded to Alyssa M. Lanzi and Brian MacWhinney and the National Institute on Deafness and Other Communication Disorders under grant R01-DC008524 awarded to Brian MacWhinney, on which Brielle C. Stark is a Co-I.
Acknowledgments
The authors would like to thank Anna Saylor for her assistance with review and copy editing.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnhum.2024.1500735/full#supplementary-material
Footnotes
References
Ahmed, S., Haigh, A.-M. F., de Jager, C. A., and Garrard, P. (2013). Connected speech as a marker of disease progression in autopsy-proven Alzheimer’s disease. Brain J. Neurol. 136, 3727–3737. doi: 10.1093/brain/awt269
Albert, M. S., DeKosky, S. T., Dickson, D., Dubois, B., Feldman, H. H., Fox, N. C., et al. (2011). The diagnosis of mild cognitive impairment due to Alzheimer’s disease: recommendations from the National Institute on Aging-Alzheimer’s association workgroups on diagnostic guidelines for Alzheimer’s disease. Alzheimers Dement. 7, 270–279. doi: 10.1016/j.jalz.2011.03.008
Alzheimer’s Association (2022). 2022 Alzheimer’s disease facts and figures. Alzheimers Dement. 18, 700–789. doi: 10.1002/alz.12638
American Psychiatric Association (Ed.) (2013). Diagnostic and statistical manual of mental disorders. 5th Edn. Washington, D.C.: American Psychiatric Association.
Armstrong, E. (2000). Aphasic discourse analysis: the story so far. Aphasiology 14, 875–892. doi: 10.1080/02687030050127685
Berisha, V., Wang, S., LaCross, A., and Liss, J. (2015). Tracking discourse complexity preceding Alzheimer’s disease diagnosis: a case study comparing the press conferences of presidents Ronald Reagan and George Herbert Walker bush. J. Alzheimer’s Dis. 45, 959–963. doi: 10.3233/JAD-142763
Boyle, M., Akers, C. M., Cavanaugh, R., Hula, W. D., Swiderski, A. M., and Elman, R. J. (2022). Changes in discourse informativeness and efficiency following communication-based group treatment for chronic aphasia. Aphasiology 37, 563–597. doi: 10.1080/02687038.2022.2032586
Bryant, L., Ferguson, A., and Spencer, E. (2016). Linguistic analysis of discourse in aphasia: a review of the literature. Clin. Linguist. Phon. 30, 489–518. doi: 10.3109/02699206.2016.1145740
Bryant, L., Spencer, E., and Ferguson, A. (2017). Clinical use of linguistic discourse analysis for the assessment of language in aphasia. Aphasiology 31, 1105–1126. doi: 10.1080/02687038.2016.1239013
Bschor, T., Kühl, K. P., and Reischies, F. M. (2001). Spontaneous speech of patients with dementia of the Alzheimer type and mild cognitive impairment. Int. Psychogeriatr. 13, 289–298. doi: 10.1017/s1041610201007682
Cannizzaro, M. S., and Coelho, C. A. (2013). Analysis of narrative discourse structure as an ecologically relevant measure of executive function in adults. J. Psycholinguist. Res. 42, 527–549. doi: 10.1007/s10936-012-9231-5
Cavanaugh, R., and Haley, K. L. (2020). Subjective communication difficulties in very mild aphasia. Am. J. Speech Lang. Pathol. 29, 437–448. doi: 10.1044/2019_AJSLP-CAC48-18-0222
Chandler, M. J., Locke, D. E., Crook, J. E., Fields, J. A., Ball, C. T., Phatak, V. S., et al. (2019). Comparative effectiveness of behavioral interventions on quality of life for older adults with mild cognitive impairment: a randomized clinical trial. JAMA Netw. Open 2:e193016. doi: 10.1001/jamanetworkopen.2019.3016
Chen, Y., Hartsuiker, R. J., and Pistono, A. (2024). A comparison of different connected-speech tasks for detecting mild cognitive impairment using multivariate pattern analysis. Aphasiology 1–24, 1–24. doi: 10.1080/02687038.2024.2358556
Clark, P., Kutner, N., Goldstein, F., Peterson-Hazen, S., Garner, V., Zhang, R., et al. (2005). Impediments to timely diagnosis of Alzheimer’s Disease in African Americans. J. Am. Geriatr. Soc. 53, 2012–2017. doi: 10.1111/j.1532-5415.2005.53569.x
Clarke, N., Barrick, T. R., and Garrard, P. (2021). A comparison of connected speech tasks for detecting early Alzheimer’s Disease and mild cognitive impairment using natural language Processing and machine learning. Front. Comput. Sci. 3:634360. doi: 10.3389/fcomp.2021.634360
Cohen, M. L., Ryan, A. C., and Lanzi, A. M. (2021). Prevention of and early intervention for cognitive decline due to Alzheimer’s Disease and related disorders. Delaware J. Public Health 7, 118–122. doi: 10.32481/djph.2021.09.014
Cruice, M., Botting, N., Marshall, J., Boyle, M., Hersh, D., Pritchard, M., et al. (2020). UK speech and language therapists’ views and reported practices of discourse analysis in aphasia rehabilitation. Int. J. Lang. Commun. Dis. 55, 417–442. doi: 10.1111/1460-6984.12528
Cruice, M., Pritchard, M., and Dipper, L. (2014). Verb use in aphasic and non-aphasic personal discourse: what is normal? J. Neurolinguistics 28, 31–47. doi: 10.1016/j.jneuroling.2013.12.002
Crutch, S. J., and Warrington, E. K. (2003). The selective impairment of fruit and vegetable knowledge: amultiple processing channels account of fine-grain category specificity. Cogn. Neuropsychol. 20, 355–372. doi: 10.1080/02643290244000220
Cunningham, K. T., and Haley, K. L. (2020). Measuring lexical diversity for discourse analysis in aphasia: moving-average type–token ratio and word information measure. J. Speech Lang. Hear. Res. 63, 710–721. doi: 10.1044/2019_JSLHR-19-00226
Dalton, S. G. H., Hubbard, H. I., and Richardson, J. D. (2020a). Moving toward non-transcription based discourse analysis in stable and progressive aphasia. Semin. Speech Lang. 41, 032–044. doi: 10.1055/s-0039-3400990
Dalton, S. G. H., Kim, H., Richardson Heather, J. D., and Wright, H. (2020b). A compendium of Core lexicon checklists. Semin. Speech Lang. 41, 45–60. doi: 10.1055/s-0039-3400972
Dalton, S. G., and Richardson, J. D. (2015). Core-lexicon and main-concept production during picture-sequence description in adults without brain damage and adults with aphasia. Am. J. Speech Lang. Pathol. 24, S923–S938. doi: 10.1044/2015_AJSLP-14-0161
Dalton, S. G., and Richardson, J. D. (2019). A large-scale comparison of main concept production between persons with aphasia and persons without brain injury. Am. J. Speech Lang. Pathol. 28, 293–320. doi: 10.1044/2018_AJSLP-17-0166
Dalton, S. G., Stark, B. C., Fromm, D., Apple, K., MacWhinney, B., Rensch, A., et al. (2022). Validation of an automated procedure for calculating core lexicon from transcripts. J. Speech Lang. Hear. Res. 65, 2996–3003. doi: 10.1044/2022_JSLHR-21-00473
DeDe, G., and Hoover, E. (2021). Measuring change at the discourse-level following conversation treatment: examples from mild and severe aphasia. Top. Lang. Disord. 41, 5–26. doi: 10.1097/TLD.0000000000000243
DeDe, G., and Salis, C. (2020). Temporal and episodic analyses of the story of Cinderella in latent aphasia. Am. J. Speech Lang. Pathol. 29, 449–462. doi: 10.1044/2019_ajslp-cac48-18-0210
Dipper, L., Marshall, J., Boyle, M., Hersh, D., Botting, N., and Cruice, M. (2021). Creating a theoretical framework to underpin discourse assessment and intervention in aphasia. Brain Sci. 11:183. doi: 10.3390/brainsci11020183
Doyle, P. J., Goda, A. J., and Spencer, K. A. (1995). The communicative Informativeness and efficiency of connected discourse by adults with aphasia under structured and conversational sampling conditions. Am. J. Speech Lang. Pathol. 4, 130–134. doi: 10.1044/1058-0360.0404.130
Drummond, C., Coutinho, G., Fonseca, R. P., Assunção, N., Teldeschi, A., de Oliveira-Souza, R., et al. (2015). Deficits in narrative discourse elicited by visual stimuli are already present in patients with mild cognitive impairment. Front. Aging Neurosci. 7:96. doi: 10.3389/fnagi.2015.00096
Dutta, M., Murray, L. L., and Stark, B. C. (2024). The relationship between executive functioning and narrative language abilities in aphasia. Am. J. Speech Lang. Pathol. 33, 2500–2523. doi: 10.1044/2024_AJSLP-23-00314
Filiou, R.-P., Bier, N., Slegers, A., Houzé, B., Belchior, P., and Brambati, S. M. (2020). Connected speech assessment in the early detection of Alzheimer’s disease and mild cognitive impairment: a scoping review. Aphasiology 34, 723–755. doi: 10.1080/02687038.2019.1608502
Fromm, D., Forbes, M., Holland, A., Dalton, S. G., Richardson, J., and MacWhinney, B. (2017). Discourse characteristics in aphasia beyond the Western aphasia battery cutoff. Am. J. Speech Lang. Pathol. 26, 762–768. doi: 10.1044/2016_AJSLP-16-0071
Garrard, P., Maloney, L. M., Hodges, J. R., and Patterson, K. (2005). The effects of very early Alzheimer’s disease on the characteristics of writing by a renowned author. Brain J. Neurol. 128, 250–260. doi: 10.1093/brain/awh341
Goodglass, H., and Kaplan, E. (1972). Boston diagnostic aphasia examination. Philadelphia: Lea & Febiger.
Kemper, S., Marquis, J., and Thompson, M. (2001). Longitudinal change in language production: effects of aging and dementia on grammatical complexity and propositional content. Psychol. Aging 16, 600–614. doi: 10.1037//0882-7974.16.4.600
Kertesz, A. (2007). Western aphasia battery—revised. San Antonio, Texas: The Psychological Corporation.
Kim, H., Berube, S., and Hillis, A. E. (2022). Core lexicon in aphasia: a longitudinal study. Aphasiology 37, 1679–1691. doi: 10.1080/02687038.2022.2121598
Kim, H., and Wright, H. H. (2020). Concurrent validity and reliability of the core lexicon measure as a measure of word retrieval ability in aphasia narratives. Am. J. Speech Lang. Pathol. 29, 101–110. doi: 10.1044/2019_AJSLP-19-0063
Kintz, S., Kim, H., and Wright, H. H. (2024). A preliminary investigation on core lexicon analysis in dementia of the Alzheimer’s type. Int. J. Lang. Commun. Disord. 59, 1336–1350. doi: 10.1111/1460-6984.12999
Kuźma, E., Lourida, I., Moore, S. F., Levine, D. A., Ukoumunne, O. C., and Llewellyn, D. J. (2018). Stroke and dementia risk: a systematic review and meta-analysis. Alzheimers Dement. 14, 1416–1426. doi: 10.1016/J.JALZ.2018.06.3061
Lanzi, A. M., Saylor, A. K., Fromm, D., Liu, H., MacWhinney, B., and Cohen, M. L. (2023). DementiaBank: theoretical rationale, protocol, and illustrative analyses. Am. J. Speech Lang. Pathol. 32, 426–438. doi: 10.1044/2022_AJSLP-22-00281
Law, S.-P., Kong, A. P.-H., Lai, L. W.-S., and Lai, C. (2015). Effects of context and word class on lexical retrieval in Chinese speakers with anomic aphasia. Aphasiology 29, 81–100. doi: 10.1080/02687038.2014.951598
MacWhinney, B. (2000). The CHILDES project: tools for analysing talk. 3rd Edn. Mahwah, NJ: Lawrence Erlbaum Associates Inc.
MacWhinney, B., Forbes, M., and Holland, A. (2011). AphasiaBank: methods for studying discourse. Aphasiology 25, 1286–1307. doi: 10.1080/02687038.2011.589893
MacWhinney, B., Fromm, D., Holland, A., Forbes, M., and Wright, H. (2010). Automated analysis of the Cinderella story. Aphasiology 24, 856–868. doi: 10.1080/02687030903452632
Martzoukou, M., Nousia, A., and Nasios, G. (2023). Undetected language deficits in left or right hemisphere post-stroke patients. Appl. Neuropsychol. Adult 1–9. doi: 10.1080/23279095.2023.2195111
Mefford, J. A., Zhao, Z., Heilier, L., Xu, M., Zhou, G., Mace, R., et al. (2023). Varied performance of picture description task as a screening tool across MCI subtypes. PLOS Digital Health 2:e0000197. doi: 10.1371/journal.pdig.0000197
Morris, J. C. (1993). The clinical dementia rating (CDR). Neurology 43:2412. doi: 10.1212/WNL.43.11.2412-a
Mueller, K. D., Hermann, B., Mecollari, J., and Turkstra, L. S. (2018a). Connected speech and language in mild cognitive impairment and Alzheimer’s disease: a review of picture description tasks. J. Clin. Exp. Neuropsychol. 40, 917–939. doi: 10.1080/13803395.2018.1446513
Mueller, K. D., Koscik, R. L., Du, L., Bruno, D., Jonaitis, E. M., Koscik, A. Z., et al. (2020). Proper names from story recall are associated with beta-amyloid in cognitively unimpaired adults at risk for Alzheimer’s disease. Cortex 131, 137–150. doi: 10.1016/j.cortex.2020.07.008
Mueller, K. D., Koscik, R. L., Hermann, B. P., Johnson, S. C., and Turkstra, L. S. (2018b). Declines in connected language are associated with very early mild cognitive impairment: results from the Wisconsin registry for Alzheimer’s prevention. Front. Aging Neurosci. 9:437. doi: 10.3389/fnagi.2017.00437
Nasreddine, Z. S., Phillips, N. A., Bédirian, V., Charbonneau, S., Whitehead, V., Collin, I., et al. (2005). The Montreal cognitive assessment, MoCA: a brief screening tool for mild cognitive impairment. J. Am. Geriatr. Soc. 53, 695–699. doi: 10.1111/j.1532-5415.2005.53221.x
Nicholas, L. E., and Brookshire, R. H. (1993). A system for quantifying the informativeness and efficiency of the connected speech of adults with aphasia. J. Speech Hear. Res. 36, 338–350. doi: 10.1044/jshr.3602.338
Nicholas, L. E., and Brookshire, R. H. (1995). Presence, completeness, and accuracy of main concepts in the connected speech of non-brain-damaged adults and adults with aphasia. J. Speech Hear. Res. 38, 145–156. doi: 10.1044/jshr.3801.145
Oulhaj, A., Wilcock, G. K., Smith, A. D., and de Jager, C. A. (2009). Predicting the time of conversion to MCI in the elderly. Neurology 73, 1436–1442. doi: 10.1212/WNL.0b013e3181c0665f
Papanicolaou, A. C., Moore, B. D., Deutsch, G., Levin, H. S., and Eisenberg, H. M. (1988). Evidence for right-hemisphere involvement in recovery from aphasia. Arch. Neurol. 45, 1025–1029. doi: 10.1001/archneur.1988.00520330117020
Petersen, R. C. (2004). Mild cognitive impairment as a diagnostic entity. J. Intern. Med. 256, 183–194. doi: 10.1111/j.1365-2796.2004.01388.x
Petersen, R. C., and Yaffe, K. (2020). Issues and questions surrounding screening for cognitive impairment in older patients. JAMA 323, 722–724. doi: 10.1001/jama.2019.22527
Pritchard, M., Dipper, L., Morgan, G., and Cocks, N. (2015). Language and iconic gesture use in procedural discourse by speakers with aphasia. Aphasiology 29, 826–844. doi: 10.1080/02687038.2014.993912
Richardson, J. D., Dalton, S. G., Greenslade, K. J., Jacks, A., Haley, K. L., and Adams, J. (2021). Main concept, sequencing, and story grammar analyses of Cinderella narratives in a large sample of persons with aphasia. Brain Sci. 11:110. doi: 10.3390/brainsci11010110
Richardson, J. D., Hudspeth Dalton, S. G., Fromm, D., Forbes, M., Holland, A., and MacWhinney, B. (2018). The relationship between confrontation naming and story gist production in aphasia. Am. J. Speech Lang. Pathol. 27, 406–422. doi: 10.1044/2017_AJSLP-16-0211
Roberts, A., Aveni, K., Basque, S., Orange, J. B., McLaughlin, P., Ramirez, J., et al. (2021). Predicting cognitive impairment in cerebrovascular disease using spoken discourse production. Top. Lang. Disord. 41, 73–98. doi: 10.1097/TLD.0000000000000242
Roitsch, J., Prebor, J., and Raymer, A. M. (2021). Cognitive assessments for patients with neurological conditions: a preliminary survey of speech-language pathology practice patterns. Am. J. Speech Lang. Pathol. 30, 2263–2274. doi: 10.1044/2021_AJSLP-20-00187
Ropacki, M. T., Hannesdottir, K., Hendrix, S., Gordon, M. F., Stephenson, D., Coons, S. J., et al. (2017). Clinically meaningful outcomes in early Alzheimer Disease: a consortia-driven approach to identifying what matters to patients. Ther. Innov. Regul. Sci. 51, 380–390. doi: 10.1177/2168479016689712
Sabbagh, M. N., Boada, M., Borson, S., Chilukuri, M., Doraiswamy, P. M., Dubois, B., et al. (2020). Rationale for early diagnosis of mild cognitive impairment (MCI) supported by emerging digital technologies. J. Prev Alzheimers Dis. 7, 158–164. doi: 10.14283/jpad.2020.19
Sachdev, P. S., Chen, X., Brodaty, H., Thompson, C., Altendorf, A., and Wen, W. (2009). The determinants and longitudinal course of post-stroke mild cognitive impairment. J. Int. Neuropsychol. Soc. 15, 915–923. doi: 10.1017/S1355617709990579
Saffran, E. M., Berndt, R. S., and Schwartz, M. F. (1989). The quantitative analysis of agrammatic production: procedure and data. Brain Lang. 37, 440–479. doi: 10.1016/0093-934X(89)90030-8
Salis, C., and DeDe, G. (2022). Sentence production in a discourse context in latent aphasia: a real-time study. Am. J. Speech Lang. Pathol. 31, 1284–1296. doi: 10.1044/2022_AJSLP-21-00232
Sanborn, V., Ostrand, R., Ciesla, J., and Gunstad, J. (2022). Automated assessment of speech production and prediction of MCI in older adults. Appl. Neuropsychol. Adult 29, 1250–1257. doi: 10.1080/23279095.2020.1864733
Sekine, K., and Rose, M. L. (2013). The relationship of aphasia type and gesture production in people with aphasia. Am. J. Speech Lang. Pathol. 22, 662–672. doi: 10.1044/1058-0360(2013/12-0030)
Sekine, K., Rose, M. L., Foster, A. M., Attard, M. C., and Lanyon, L. E. (2013). Gesture production patterns in aphasic discourse: in-depth description and preliminary predictions. Aphasiology 27, 1031–1049. doi: 10.1080/02687038.2013.803017
Silkes, J. P., Zimmerman, R. M., Greenspan, W., Reinert, L., Kendall, D., and Martin, N. (2020). Identifying verbal short-term memory and working memory impairments in individuals with latent aphasia. Am. J. Speech Lang. Pathol. 30, 391–406. doi: 10.1044/2020_AJSLP-19-00105
Snowdon, D. A. (1996). Linguistic ability in early life and cognitive function and Alzheimer’s disease in late life. Findings from the Nun study. JAMA 275, 528–532. doi: 10.1001/jama.1996.03530310034029
Stagge, F., Lanzi, A. M., Saylor, A. K., and Cohen, M. L. (2024). Montreal cognitive assessment scores do not associate with communication challenges reported by adults with Alzheimer’s Disease or Parkinson’s Disease. Am. J. Speech Lang. Pathol. 33, 1902–1910. doi: 10.1044/2024_AJSLP-23-00248
Stark, B. C. (2019). A comparison of three discourse elicitation methods in aphasia and age-matched adults: implications for language assessment and outcome. Am. J. Speech Lang. Pathol. 28, 1067–1083. doi: 10.1044/2019_AJSLP-18-0265
Stark, B. C., Alexander, J. M., Hittson, A., Doub, A., Igleheart, M., Streander, T., et al. (2023). Test-retest reliability of microlinguistic information derived from spoken discourse in persons with chronic aphasia. J. Speech Lang. Hear. Res. 66, 2316–2345. doi: 10.1044/2023_JSLHR-22-00266
Stark, B. C., Bryant, L., Themistocleous, C., den Ouden, D.-B., and Roberts, A. C. (2022). Best practice guidelines for reporting spoken discourse in aphasia and neurogenic communication disorders. Aphasiology 37, 761–784. doi: 10.1080/02687038.2022.2039372
Stark, B. C., and Cofoid, C. (2022). Task-specific iconic gesturing during spoken discourse in aphasia. Am. J. Speech Lang. Pathol. 31, 30–47. doi: 10.1044/2021_AJSLP-20-00271
Stark, B. C., and Dalton, S. G. (2024). A scoping review of transcription-less practices for analysis of aphasic discourse and implications for future research. Int. J. Lang. Commun. Disord. 59, 1734–1762. doi: 10.1111/1460-6984.13028
Stark, B. C., Dutta, M., Murray, L. L., Fromm, D., Bryant, L., Harmon, T. G., et al. (2021). Spoken discourse assessment and analysis in aphasia: an international survey of current practices. J. Speech Lang. Hear. Res. 64, 4366–4389. doi: 10.1044/2021_JSLHR-20-00708
Stark, B. C., and Oeding, G. (2024). Demographic, neuropsychological, and speech variables that impact iconic and supplementary-to-speech gesturing in aphasia. Gesture 22, 62–93. doi: 10.1075/gest.23019.sta
Tchakoute, C. T., Sainani, K. L., and Henderson, V. W.Raloxifene in Alzheimer’s Disease Investigators (2017). Semantic memory in the clinical progression of Alzheimer Disease. Cogn. Behav. Neurol. 30, 81–89. doi: 10.1097/WNN.0000000000000131
Toledo, C. M., Aluísio, S. M., dos Santos, L. B., Brucki, S. M. D., Trés, E. S., de Oliveira, M. O., et al. (2018). Analysis of macrolinguistic aspects of narratives from individuals with Alzheimer’s disease, mild cognitive impairment, and no cognitive impairment. Alzheimer’s Dementia 10, 31–40. doi: 10.1016/j.dadm.2017.08.005
Weakley, A., Schmitter-Edgecombe, M., and Anderson, J. (2013). Analysis of verbal fluency ability in amnestic and non-amnestic mild cognitive impairment. Arch. Clin. Neuropsychol. 28, 721–731. doi: 10.1093/arclin/act058
Wilson, S. M., Eriksson, D. K., Schneck, S. M., and Lucanie, J. M. (2018). A quick aphasia battery for efficient, reliable, and multidimensional assessment of language function. PLoS One 13:e0192773. doi: 10.1371/journal.pone.0192773
Wright, H. H., and Capilouto, G. J. (2009). Manipulating task instructions to change narrative discourse performance. Aphasiology 23, 1295–1308. doi: 10.1080/02687030902826844
Keywords: aphasia, language, narrative, discourse, mild cognitive impairment
Citation: Stark BC, Dalton SG and Lanzi AM (2025) Access to context-specific lexical-semantic information during discourse tasks differentiates speakers with latent aphasia, mild cognitive impairment, and cognitively healthy adults. Front. Hum. Neurosci. 18:1500735. doi: 10.3389/fnhum.2024.1500735
Edited by:
Anastasia Marie Raymer, Old Dominion University, United StatesReviewed by:
JoAnn Silkes, San Diego State University, United StatesLynn M. Maher, University of Houston, United States
Copyright © 2025 Stark, Dalton and Lanzi. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Brielle C. Stark, YmNzdGFya0BpdS5lZHU=
†ORCID: Brielle C. Stark, orcid.org/0000-0002-7001-8324
Sarah Grace Dalton, orcid.org/0000-0002-1504-8002
Alyssa M. Lanzi, orcid.org/0000-0002-6460-9283