- 1Aphasia Research and Treatment Lab, Department of Speech, Language, and Hearing Sciences, University of Texas at Austin, Austin, TX, United States
- 2SoundBrain Lab, Department of Communication Science and Disorders, University of Pittsburgh, Pittsburgh, PA, United States
- 3Language Neurobiology Laboratory, Department of Neurology, Memory and Aging Center, University of California, San Francisco, San Francisco, CA, United States
- 4Center for Neuroscience, University of Pittsburgh, Pittsburgh, PA, United States
- 5Department of Neurology, Dell Medical School, University of Texas at Austin, Austin, TX, United States
Logopenic variant primary progressive aphasia (lvPPA) is a neurodegenerative language disorder primarily characterized by impaired phonological processing. Sentence repetition and comprehension deficits are observed in lvPPA and linked to impaired phonological working memory, but recent evidence also implicates impaired speech perception. Currently, neural encoding of the speech envelope, which forms the scaffolding for perception, is not clearly understood in lvPPA. We leveraged recent analytical advances in electrophysiology to examine speech envelope encoding in lvPPA. We assessed cortical tracking of the speech envelope and in-task comprehension of two spoken narratives in individuals with lvPPA (n = 10) and age-matched (n = 10) controls. Despite markedly reduced narrative comprehension relative to controls, individuals with lvPPA had increased cortical tracking of the speech envelope in theta oscillations, which track low-level features (e.g., syllables), but not delta oscillations, which track speech units that unfold across a longer time scale (e.g., words, phrases, prosody). This neural signature was highly correlated across narratives. Results indicate an increased reliance on acoustic cues during speech encoding. This may reflect inefficient encoding of bottom-up speech cues, likely as a consequence of dysfunctional temporoparietal cortex.
Introduction
Logopenic variant primary progressive aphasia (lvPPA) is a neurodegenerative language disorder characterized by a gradual dissolution of phonological processing ability. Individuals with lvPPA present with reduced phonological working memory capacity, impaired lexical retrieval and, in many cases, phonemic paraphasias in spontaneous speech. Cortical atrophy is most prominent in left temporoparietal cortex and spreads to additional regions subserving language and memory with disease (usually Alzheimer's) progression (Rohrer et al., 2010; Gorno-Tempini et al., 2011). Although phonological processing deficits are the defining feature of lvPPA, studies deconstructing and characterizing these deficits are lacking, especially in receptive language processing.
Receptive language processing at the single-word level is relatively intact in lvPPA but deteriorates as a function of stimulus length. This pattern has been attributed to phonological working memory deficits rather than semantic or syntactic processing impairments (Lukic et al., 2019). To date, processing of speech beyond single sentences has not been investigated in lvPPA. Further, functional neuroimaging studies investigating language processing in lvPPA have only examined processing of single syllables or words (Hardy et al., 2017). In everyday communication, however, we are rarely presented with single syllables, words or sentences in isolation, limiting the generalizability of previous research (Hamilton and Huth, 2018).
Recent analytical advances allow for online assessment of cortical tracking of ecologically valid, continuous speech stimuli, with the potential to generate findings that can be generalized to real-world situations. This approach involves fitting a temporal response function (TRF) to map speech features derived from naturalistic stimuli (e.g., audiobooks, movies) to temporally-precise neurophysiological data (Crosse et al., 2016). For example, TRF analyses can be used to assess the fidelity of cortical tracking of the speech envelope, which contains quasi-rhythmic amplitude fluctuations that are critical for transforming acoustic input into discrete representations (Ding and Simon, 2014), and the TRF time course can be used to infer the temporal dynamics of speech envelope encoding across the auditory processing hierarchy (Ding and Simon, 2012; Puvvada and Simon, 2017).
Cortical tracking of the speech envelope depends on neural oscillations in the delta (1–4 Hz) and theta (4–8 Hz) range (Ding and Simon, 2014), which have been linked to parsing linguistic structures from heard speech (Giraud and Poeppel, 2012). Delta oscillations are linked to parsing speech at the level of words and phrases as well as processing prosodic cues that are vital for constructing higher-level linguistic structures (Ding and Simon, 2014; Ding et al., 2017; Teoh et al., 2019); theta oscillations track the primary energetic rhythm in speech that is driven by low-level segmental features (syllables) (Ghitza, 2017). TRF analyses examining cortical tracking of the speech envelope by delta and theta oscillations have been applied to studies of continuous speech perception in neurotypical (Di Liberto et al., 2015) and clinical (Di Liberto et al., 2018; Fuglsang et al., 2020) populations.
Here we assess cortical tracking of the speech envelope of naturalistic, continuous speech in individuals with lvPPA (relative to age-matched controls) using EEG. This objective is motivated by converging evidence. First, neurodegeneration in lvPPA is most prominent in left temporoparietal regions, including the superior temporal gyrus and sulcus, middle temporal gyrus, and inferior parietal lobule (Rohrer et al., 2010), regions hypothesized to subserve the parsing of continuous speech input into perceptually-relevant units (Mesgarani et al., 2014). Second, aberrant neural processing of syllables presented in isolation has been observed in individuals with lvPPA (Hardy et al., 2017). Lastly, individuals with lvPPA show resting-state delta-theta hyperactivity in relatively spared frontal cortex as well as hypersynchrony in medial frontal and posterior parietal regions (Ranasinghe et al., 2017). We thus predict aberrant cortical tracking of the speech envelope in individuals with lvPPA relative to controls. This may manifest differently across oscillatory frequencies, so we assess the specificity (or lack thereof) across delta (word/phrase-level) and theta (syllable-level) oscillations in tracking of the speech envelope. Crucially, we evaluate the stability of this neural signature across two independent continuous speech narratives that vary in acoustic and linguistic complexity.
Materials and Methods
Participants
Ten participants with a diagnosis of lvPPA (age M = 68.8, s = 8.8) and ten age-matched healthy controls (HC; age M = 65.9, s = 6.4) participated in the study. All participants were right-handed. Participants with lvPPA underwent comprehensive neurological and neuropsychological assessment and met current diagnostic criteria for lvPPA (Table 1); they had no history of stroke or other psychiatric, neurological, or medical diagnoses that could account for their language symptoms (Gorno-Tempini et al., 2011). Structural MRI scans showed significant cortical thinning in lvPPA participants localized primarily to left temporoparietal cortex, extending anteriorly within temporal cortex (Figure 1A). HC participants had no history of stroke, neurological disorders, or cognitive impairment. Hearing thresholds were assessed in all participants using pure tone audiometry (Figure 1B). All participants provided written informed consent. Study procedures were approved by the Institutional Review Board at the University of Texas at Austin and conformed to the Declaration of Helsinki.
Figure 1. Atrophy profile, hearing thresholds and comprehension question accuracy for individuals with lvPPA relative to healthy controls. (A) Cortical thinning in individuals with lvPPA relative to n = 42 age-matched controls (two-sample t-test, controls > lvPPA, FWE-corrected p < 0.01, k = 133, covariates: age and sex). MRI scans were not available for HC presented in the current study, so a separate group of control data were used to identify significant cortical thinning in individuals with lvPPA. Lighter colors indicate more severe cortical thinning. (B) Pure tone hearing thresholds in healthy controls (HC) and individuals with lvPPA. Bold lines and markers represent group means. Thin lines and markers represent thresholds for each participant. The shaded area represents the standard deviation for the participant groups. Two (between-subjects, participant group [lvPPA, HC]) × 2 (within-subjects, ear [right, left]) × 4 (within-subjects, frequency [500, 1,000, 2,000, 4,000 Hz]) mixed ANOVA on hearing thresholds indicated that the two participant groups did not differ across frequencies or ears (two-way interactions with participant group and three-way interaction p's > 0.25). (C) Multiple-choice question accuracy (percent correct) in HC and individuals with lvPPA for each narrative.
Stimuli and Task
Stimuli consisted of two narratives (continuous speech) of ~15 min each: Alice's Adventures in Wonderland (narrative 1) (Carrol, 1865) and Who Was Albert Einstein? (narrative 2) (Brallier, 2002), the latter of which has been used and validated in stroke-induced aphasia (Wilson et al., 2018). Stimuli differed in acoustic (e.g., speech rate) and linguistic (e.g., clauses/sentence) features (see Supplementary Table 1 and Supplementary Figure 1 for details). The two narratives were segmented into 15 tracks of ~60 s each (narrative 1: 57–65 s; narrative 2: 56–67 s), with each track beginning and ending with a complete sentence, and presented binaurally using insert earphones (ER-3A; Etymotic Research, Elk Grove Village, IL). To assess narrative comprehension, after each track, participants were visually presented with two multiple-choice questions on a ViewPIXX monitor and selected one of four answer choices using a keyboard. A researcher assisted individuals with lvPPA by reading the questions aloud (if needed) and making keyboard responses.
EEG Acquisition and Pre-processing
While participants listened to the narratives, EEG data were acquired using a 32-channel active electrode system (Brain Products, Gilching, Germany). Data were preprocessed using EEGLAB 2019.1 (Delorme and Makeig, 2004) in MATLAB 2016b (MathWorks Inc., Natick, MA, USA). Raw EEG data were downsampled to 128 Hz then filtered from 1 to 15 Hz. Following recommendations from de Cheveigné and Nelken (2019), we present the impulse and frequency response of the filter in Supplementary Figure 2. Channels with activity >3 standard deviations from surrounding channels were rejected and replaced via spherical spline interpolation. Artifact subspace reconstruction (ASR) was used to suppress large artifacts (Mullen et al., 2015). ASR-cleaned data were epoched from −5 to 70 s relative to stimulus onset and referenced to the common average of all channels. Independent component analysis was performed to correct for eye movement, muscle, and electrocardiographic artifacts. Given the proposed roles of delta and theta oscillations in relation to cortical tracking of speech (Ding and Simon, 2014; Ding et al., 2017; Ghitza, 2017; Teoh et al., 2019), the pre-processed signals were further bandpass filtered from 1 to 8 Hz, which we operationally defined as the full band. Lastly, since delta and theta oscillations may process different aspects of an incoming speech stream (i.e., words/phrases vs. syllables, respectively), the ICA-cleaned EEG data (1–15 Hz filtered) were furthered filtered into delta (1–4 Hz) and theta (4–8 Hz) bands to assess their different contributions to tracking of the speech envelope. Specific details regarding the filter parameters and additional analysis of data for different frequency bands are presented in the Supplementary Material.
Cortical Tracking of Speech
To evaluate the extent to which neural responses could be predicted by the speech envelope, the multiband stimulus envelope model was mapped onto preprocessed EEG data using a forward modeling TRF approach (Figure 2) implemented using regularized linear ridge regression (Lalor and Foxe, 2010; Crosse et al., 2016; McHaney et al., 2020; Reetzke et al., in press).
Figure 2. Schematic representation of the multivariate regularized linear ridge regression procedure used to obtain cortical tracking metrics. The stimulus envelope is extracted across sixteen gammatone filters resulting in a multiband stimulus envelope. This is used as the stimulus model and mapped onto recorded EEG using multivariate regularized linear ridge regression. In an iterative process, the temporal response function (TRF) is trained on a set of training data and tested on left-out testing data. Pearson's correlation coefficients (r-values) between the observed (testing) and predicted EEG data are obtained, reflecting the accuracy of the prediction. This is performed using leave-one-out cross-validation. The TRFs and prediction accuracies are estimated for each EEG channel and stimulus segment.
Multiband Speech Envelope Stimulus Model
To derive the multiband speech envelope, the auditory stimuli were first filtered using a bank of 16 gammatone filters uniformly spaced on an ERB scale from 250 to 8,000 Hz (Slaney, 1998). The absolute value of the Hilbert transform in each of the 16 bands comprised the multiband stimulus envelope, which was then raised to a power of 0.6 to mimic the compression characteristics of the inner ear (Vanthornhout et al., 2018). This resulted in 16 band-specific speech envelopes. The auditory cortex has been shown to be more sensitive to acoustic edges than sustained stimulus features (Hamilton et al., 2018), thus we derived the edges in the speech envelope using the first temporal derivative of the 16 band-specific envelopes. These edges signal rapid amplitude changes and predominantly represent onsets and offsets of acoustic events, such as phonemes and syllables.
TRF Estimation
The TRF estimation procedure is illustrated in Figure 2 and was done separately for the full (1–8 Hz), delta (1–4 Hz), and theta (4–8 Hz) bands. Each participant's EEG data were z-scored to the mean of all channels before TRF estimation. The TRF was estimated for each narrative by minimizing the least squares distance between EEG predicted from time-lagged features of the multiband speech envelope (−100–450 ms) and the observed EEG. Ridge regularization was used to smooth the TRFs and reduce overfitting, and the optimal ridge parameter was estimated for each participant. The resulting TRFs are a series of time-domain regression (beta) weights that explain the extent to which the EEG is predicted by the multiband speech envelope at different latencies. Prediction accuracy of the TRF was derived by obtaining Pearson's correlation coefficient (r) between the observed and TRF-predicted EEG. To reduce overfitting, the TRF was validated using leave-one-out (15-fold) cross-validation (Crosse et al., 2016), where the TRFs to n tracks were used to predict the EEG in the kth track. This cross-validation procedure was iterated to obtain the prediction accuracy for each track for each electrode. The prediction accuracies were then averaged across all the tracks and electrodes to get the final prediction accuracy, which we operationally defined as the cortical tracking metric. Lastly, to determine if the TRF is predicting the EEG at an above-chance level, we obtained the 97.5th percentile of the distribution of prediction accuracies for 1,000 permutations of mismatched stimulus track and EEG trials. All TRF analyses were performed using the mTRF_v1.4 Matlab toolbox (https://sourceforge.net/projects/aespa/files/; Crosse et al., 2016) and custom routines in Matlab (Mathworks Inc., Natick, Massachusetts).
Given the potential impact of EEG quality on the cortical tracking metrics, the signal to noise ratios (SNR) of the EEG recordings were obtained and used as a proxy to signal quality. The SNRs were estimated as a ratio of the root mean squares of post-stimulus-onset data (1.6–55 s after stimulus onset) and pre-stimulus-onset data (−2.6 to −1.6 s). The SNRs were then compared between the two groups for both narratives. A 2 × 2 mixed ANOVA (see Statistical Analysis section below) was used to compare SNR between the two narratives and the two groups. The main effects of narrative [F(1,18) = 0.34, p = 0.570, = 0.002] and group [F(1,18) = 1.56, p = 0.228, = 0.074] were not significant, nor was the interaction [F(1,18) = 0.18, p = 0.674, < 0.001]. This ensured that the cortical tracking metrics were not biased by differences in signal quality between the two groups for the two narratives.
Statistical Analysis
Multiple-choice question accuracy and cortical tracking metrics for each frequency band were analyzed via 2 (between-subjects, participant group [lvPPA vs. HC]) × 2 (within-subjects, narrative [narrative 1 vs. narrative 2]) mixed ANOVA using the afex package (version 0.26-0) (Singmann et al., 2020) in R (v3.6.3). Multiple-choice question accuracy (percentage) was adjusted using the rationalized arcsine transformation to mitigate ceiling effects (Studebaker, 1985). The data met assumptions for mixed ANOVA.
Whereas, the prediction accuracies quantify how well the multiband stimulus envelope is represented in the EEG, the TRFs inform the time-course of the neural dynamics underlying observed prediction accuracies. Thus, for frequency bands where significant differences were observed between lvPPA and HC participants, TRFs were compared between participant groups via 1,000 random, cluster-based permutations (Maris and Oostenveld, 2007) using Wilcoxon rank-sum statistics (two-tailed). The range for significant clusters is reported.
Results
For multiple-choice questions, the main effects of participant group [F(1,18) = 25.21, p < 0.0001, 0.53] and narrative [F(1,18) = 22.43, p < 0.001, 0.20] were significant, as was the interaction [F(1,18) = 7.17, p = 0.02, = 0.07]. HC performed similarly on the two narratives whereas individuals with lvPPA performed worse on narrative 1 (Figure 1C), although the effect size for the interaction was small.
All participants had above-chance-level prediction accuracies for full, delta, and theta bands (Figure 3A; see Supplementary Figure 4 for scalp distribution plots for the prediction accuracies). Prediction accuracies for both narratives were highly correlated (Figure 3B), indicating that this cortical tracking metric conveys reliable information about the encoding of the speech envelope irrespective of stimulus. For the full band, there was a significant main effect of narrative [F(1,18) = 5.84, p = 0.03, = 0.02]. The magnitude of cortical tracking was significantly larger for narrative 1, although the effect size was small. The main effect of participant group [F(1,18) = 3.80, p = 0.07, = 0.17] and the interaction [F(1,18) = 1.10, p = 0.31, = 0.004], however, were not significant. For the delta band, the main effect of narrative was significant [F(1,18) = 18.43, p < 0.001, = 0.05]; the magnitude of cortical tracking was significantly larger for narrative 2, although the effect size was small. The main effect of participant group [F(1,18) = 0.04, p = 0.84, = 0.002] and the interaction [F(1,18) = 4.01, p = 0.06, = 0.01] were not significant. For the theta band, the main effects of participant group [F(1,18) = 10.75, p = 0.004, = 0.34] and narrative were significant [F(1,18) = 5.75, p = 0.03, = 0.04], although the effect size for narrative was small. The main effects of participant group and narrative reflected a significantly larger magnitude of cortical tracking in the theta band for individuals with lvPPA and narrative 1, respectively. The interaction between participant group and narrative was not significant [F(1,18) = 1.50, p = 0.24, = 0.01].
Figure 3. Cortical tracking metrics in healthy controls (HC) and individuals with logopenic variant primary progressive aphasia (lvPPA). (A) Prediction accuracies (Pearson's r averaged across all EEG channels) within full, delta, and theta bands for each narrative. The dotted lines show chance levels of prediction accuracies. The asterisk (*) highlights the significant main effect of group. (B) Prediction accuracies within full, delta, and theta bands for narrative 2 plotted as a function of narrative 1. Points lying on the unity line have equal prediction accuracies for the two narratives. The plots show that the prediction accuracies for both the narratives are highly correlated. (C) Frontocentral electrode (Fz) temporal response functions (TRF) for the theta band for each narrative. The gray shaded regions below each TRF mark the time regions that are significantly different between HC and lvPPA (significance derived via cluster-based permutation analyses). The scalp topographic plots below the TRFs show the difference (masked for significance) between the two groups.
For the theta band, we examined the TRFs to characterize the time-course of the neural dynamics underlying observed prediction accuracies. Consistent with prediction accuracy results, individuals with lvPPA had significantly higher beta weights than HC in the theta band (narrative 1: 16.4 ≤ zclust ≤ 71.8, narrative 2: 16.4 ≤ zclust ≤ 21.24), indicating increased cortical tracking. This difference was seen throughout the time-course of the TRF with no specific difference at early or late latencies (Figure 3C).
Discussion
Phonological processing impairment is the hallmark of logopenic variant primary progressive aphasia (lvPPA). This impairment has been examined using various tasks that broadly assess phonological processing (e.g., repetition, confrontation naming, sentence comprehension, phoneme manipulation), but research seeking to deconstruct this deficit and characterize the level(s) of breakdown is lacking. We assessed narrative comprehension and utilized temporal response function (TRF) modeling to examine cortical tracking of the speech envelope for two continuous speech narratives. Relative to healthy controls (HC), we observed reduced comprehension in lvPPA, extending previous findings of impaired comprehension for long sentences (Henry and Gorno-Tempini, 2010; Rohrer et al., 2010) using more ecologically valid, naturalistic, continuous speech stimuli. Cortical tracking metrics differed between lvPPA and HC, with significantly larger cortical tracking within the theta band (4–8 Hz) in lvPPA, but no difference between the two groups in the full (1–8 Hz) or delta bands (1–4 Hz). We also examined the TRF time-course, as latencies of TRFs have been associated with hierarchical cortical processing of auditory objects (i.e., latencies increase along the cortical processing hierarchy) (Ding and Simon, 2012; Puvvada and Simon, 2017). We observed larger regression coefficients (beta weights) for TRFs in the theta band in lvPPA over a broad time region, reflecting increased cortical tracking along the cortical processing hierarchy in both primary auditory and higher-level cortical regions. This neural signature was robust and consistent across two narratives that differed in acoustic and linguistic complexity, despite differences in comprehension for the two narratives. The findings of this study have implications for the characterization of deficits observed in lvPPA.
Temporoparietal cortex, which is significantly atrophic in lvPPA, plays a critical role in speech envelope encoding and the transformation of features represented in the envelope into discrete, perceptually-meaningful units (Mesgarani et al., 2014). Unlike stroke-induced aphasia, PPA is characterized by progressive degeneration of the left hemisphere language network (Rohrer et al., 2010). Longitudinal brain imaging in semantic variant PPA indicates that there is a gradual shifting in network dynamics in the context of atrophic changes (Canu et al., 2020). In fact, there is evidence of altered network connectivity in all three PPA variants (Agosta et al., 2014; Ranasinghe et al., 2017; Mandelli et al., 2018; Tao et al., 2020). Specifically in lvPPA (relative to HC), increased connectivity in right frontoparietal cortex has been observed using resting-state fMRI (Tao et al., 2020) and resting-state MEG has revealed bilateral changes within delta-theta oscillations, comprising both hypersynchrony between medial frontal and temporoparietal cortex and hyperactivity in the frontal cortex (Ranasinghe et al., 2017). Changes in network connectivity have been linked to changes in behavior inside (Wilson et al., 2009; Borghesani et al., 2020; Canu et al., 2020) and outside (Mandelli et al., 2018; Battistella et al., 2019) the MRI scanner in semantic and non-fluent variant PPA. In some cases, the recruitment of additional regions beyond sites of significant atrophy is related to better performance on speech and language tasks (Mandelli et al., 2018; Canu et al., 2020). Alternatively, some studies have found that the shift of function leads to errors (e.g., in the case of surface dyslexia in semantic variant PPA) (Wilson et al., 2009; Borghesani et al., 2020).
One possibility is that speech envelope encoding in individuals with lvPPA shifts away from atrophic temporoparietal cortex to relatively spared cortex that is not ideally suited to process information contained within the envelope. One might predict that this would lead to a reduced magnitude of cortical tracking, rather than an increased magnitude of cortical tracking. However, there are at least two candidate mechanisms underlying this paradoxical finding. First, increased cortical tracking of the envelope has been observed in the aging literature and argued to be a consequence of network degeneration and an imbalance between excitation and inhibition leading to inefficient information processing (Brodbeck et al., 2018). Second, the increased cortical tracking of the envelope may reflect functional hemispheric asymmetries. There is evidence that the cortical oscillations that support speech envelope encoding are lateralized, with faster oscillations being left lateralized (i.e., gamma oscillations) and slower oscillations (i.e., delta-theta oscillations) being right lateralized (Gross et al., 2013). Because left temporoparietal cortex is significantly atrophic in lvPPA, there may be an increased reliance on the right hemisphere for speech envelope encoding, thereby leading to increased cortical tracking by slower oscillatory frequencies, such as the theta band. The spatial resolution of EEG is relatively poor, and future research seeking to disentangle potential mechanisms (“what and where”) underlying the neural signature observed in this study will benefit from the use of tools such as MEG and source-level analyses.
In sum, we posit that increased cortical tracking of the speech envelope in individuals with lvPPA relative to HC is likely a consequence of shifting of function away from atrophic temporoparietal cortex to relatively spared brain regions in both hemispheres. Of note, this does not occur at a global level but rather, selectively enhances tracking of the syllabic rhythm of speech (4–8 Hz), as evidenced by increased cortical tracking in the theta band only. Phonological processing deficits in developmental dyslexia are argued to be a consequence of reduced envelope tracking within the theta range, leading to an increased sensitivity to phonetic features (Goswami, 2011). Our findings in lvPPA may represent the opposite pattern—increased reliance on syllabic structure during envelope encoding as a consequence of impaired processing of phonetic features due to left temporoparietal atrophy. Evidence for aberrant neural processing of phonetic features has been observed in left temporoparietal cortex in lvPPA (Hardy et al., 2017), lending some support to this theory.
Cortical tracking metrics, like those used in the current study, may have potential for use as a neural signature to inform differential diagnosis in PPA. To determine the viability of using such an approach for differential diagnosis, future research will require examination of our findings in lvPPA relative to other PPA variants and related disorders (e.g., Alzheimer's dementia, posterior cortical atrophy) to determine whether this neural signature is specific to lvPPA. Moreover, these metrics may provide valuable insights into perceptual deficits in PPA in naturalistic listening conditions that are not easily assessed using conventional clinical approaches. In conclusion, the current study points to an increased reliance on the speech envelope during continuous speech perception in lvPPA, arising from inefficient processing of acoustic cues within the theta band. The current study thus marks an important step toward more precise characterization of speech processing deficits in lvPPA.
Data Availability Statement
The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.
Ethics Statement
The studies involving human participants were reviewed and approved by the Institutional Review Board at the University of Texas at Austin. The patients/participants provided their written informed consent to participate in this study.
Author Contributions
Data collection, analysis and background literature review were conducted by HD, GNG, and RT. Clinical diagnosis and characterization was conducted by HD, MG-T, and MH. Data interpretation and manuscript preparation were conducted by HD, GNG, RT, MG-T, BC, and MH. All authors contributed to the article and approved the submitted version.
Funding
This work was funded by grants from the National Institutes of Health (NIDCD F32 DC016812, awarded to HD; NIDCD R01 DC016291, awarded to MH; NIDCD R01 DC013315, awarded to BC).
Conflict of Interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
We wish to thank all of our participants with PPA for their participation in the current study. We also thank members of the UT Austin Aphasia Research and Treatment Lab and the University of Pittsburgh SoundBrain Lab who contributed to EEG data collection and patient assessment, especially Ben Zinszer, Zilong Xie, Jacie McHaney, Rachel Reetzke, Kirsten Laursen, Sarah Campbell, Stephanie Grasso, and Lisa Wauters. We acknowledge that this work was supported in part by resources provided through the University of Pittsburgh Center for Research Computing (Moore et al., 2017) and the Texas Advanced Computing Center (TACC) at The University of Texas at Austin (http://www.tacc.utexas.edu).
Supplementary Material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fnhum.2020.597694/full#supplementary-material
References
Agosta, F., Galantucci, S., Valsasina, P., Canu, E., Meani, A., Marcone, A., et al. (2014). Disrupted brain connectome in semantic variant of primary progressive aphasia. Neurobiol. Aging, 35, 2646–2655. doi: 10.1016/j.neurobiolaging.2014.05.017
Battistella, G., Henry, M. L., Gesierich, B., Wilson, S. M., Borghesani, V., Shwe, W., et al. (2019). Differential intrinsic functional connectivity changes in semantic variant primary progressive aphasia. Neuroimage Clin. 22:101797. doi: 10.1016/j.nicl.2019.101797
Borghesani, V., Hinkley, L. B. N., Ranasinghe, K. G., Thompson, M. M. C., Shwe, W., Mizuiri, D., et al. (2020). Taking the sublexical route: brain dynamics of reading in the semantic variant of primary progressive aphasia. Brain 143, 2545–2560. doi: 10.1093/brain/awaa212
Brodbeck, C., Presacco, A., Anderson, S., and Simon, J. Z. (2018). Over-representation of speech in older adults originates from early response in higher order auditory cortex. Acta Acust. United Acust. 104, 774–777. doi: 10.3813/AAA.919221
Canu, E., Bessi, V., Calderaro, D., Simoni, D., Castelnovo, V., Leocadi, M., et al. (2020). Early functional MRI changes in a prodromal semantic variant of primary progressive aphasia: a longitudinal case report. J. Neurol. 267, 3100–3104. doi: 10.1007/s00415-020-10053-9
Carrol, L. (1865). Alice's Adventures in Wonderland. Available online at: Https://librivox.org/alices-adventures-in-wonderland-by-lewis-carroll-5.
Crosse, M. J., Di Liberto, G. M., Bednar, A., and Lalor, E. C. (2016). The multivariate temporal response function (mTRF) toolbox: a MATLAB toolbox for relating neural signals to continuous stimuli. Front. Hum. Neurosci. 10:604. doi: 10.3389/fnhum.2016.00604
de Cheveigné, A., and Nelken, I. (2019). Filters: when, why, and how (not) to use them. Neuron 102, 280–293. doi: 10.1016/j.neuron.2019.02.039
Delorme, A., and Makeig, S. (2004). EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J. Neurosci. Methods 134, 9–21. doi: 10.1016/j.jneumeth.2003.10.009
Di Liberto, G. M., O'Sullivan, J. A., and Lalor, E. C. (2015). Low-frequency cortical entrainment to speech reflects phoneme-level processing. Curr. Biol. 25, 2457–2465. doi: 10.1016/j.cub.2015.08.030
Di Liberto, G. M., Peter, V., Kalashnikova, M., Goswami, U., Burnham, D., and Lalor, E. C. (2018). Atypical cortical entrainment to speech in the right hemisphere underpins phonemic deficits in dyslexia. Neuroimage 175, 70–79. doi: 10.1016/j.neuroimage.2018.03.072
Ding, N., Melloni, L., Yang, A., Wang, Y., Zhang, W., and Poeppel, D. (2017). Characterizing neural entrainment to hierarchical linguistic units using electroencephalography (EEG). Front. Hum. Neurosci. 11:481. doi: 10.3389/fnhum.2017.00481
Ding, N., and Simon, J. Z. (2012). Neural coding of continuous speech in auditory cortex during monaural and dichotic listening. J. Neurophysiol. 107, 78–89. doi: 10.1152/jn.00297.2011
Ding, N., and Simon, J. Z. (2014). Cortical entrainment to continuous speech: functional roles and interpretations. Front. Hum. Neurosci. 8:311. doi: 10.3389/fnhum.2014.00311
Folstein, M. F., Folstein, S. E., and McHugh, P. R. (1975). Mini-mental state. J. Psychiatr. Res. 12, 189–198.
Fuglsang, S. A., Märcher-Rørsted, J., Dau, T., and Hjortkjær, J. (2020). Effects of sensorineural hearing loss on cortical synchronization to competing speech during selective attention. J. Neurosci. 40, 2562–2572. doi: 10.1523/JNEUROSCI.1936-19.2020
Ghitza, O. (2017). Acoustic-driven delta rhythms as prosodic markers. Lang. Cogn. Neurosci. 32, 545–561. doi: 10.1080/23273798.2016.1232419
Giraud, A., and Poeppel, D. (2012). Cortical oscillations and speech processing: emerging computational principles and operations. Nat. Neurosci. 15, 511–517. doi: 10.1038/nn.3063
Gorno-Tempini, M. L., Hillis, A. E., Weintraub, S., Kertesz, A., Mendez, M., Cappa, S. F., et al. (2011). Classification of primary progressive aphasia and its variants. Neurology 76, 1006–1014. doi: 10.1212/WNL.0b013e31821103e6
Goswami, U. (2011). A temporal sampling framework for developmental dyslexia. Trends Cogn. Sci. 15, 3–10. doi: 10.1016/j.tics.2010.10.001
Gross, J., Hoogenboom, N., Thut, G., Schyns, P., Panzeri, S., Belin, P., et al. (2013). Speech rhythms and multiplexed oscillatory sensory coding in the human brain. PLoS Biol. 11:e1001752. doi: 10.1371/journal.pbio.1001752
Hamilton, L. S., Edwards, E., and Chang, E. F. (2018). A spatial map of onset and sustained responses to speech in the human superior temporal gyrus. Curr. Biol. 28, 1860–1871.e4. doi: 10.1016/j.cub.2018.04.033
Hamilton, L. S., and Huth, A. G. (2018). The revolution will not be controlled: natural stimuli in speech neuroscience. Lang. Cogn. Neurosci. 35, 573–582. doi: 10.1080/23273798.2018.1499946
Hardy, C. J. D., Agustus, J. L., Marshall, C. R., Clark, C. N., Russell, L. L., Brotherhood, E. V., et al. (2017). Functional neuroanatomy of speech signal decoding in primary progressive aphasias. Neurobiol. Aging 56, 190–201. doi: 10.1016/j.neurobiolaging.2017.04.026
Henry, M. L., and Gorno-Tempini, M. L. (2010). The logopenic variant of primary progressive aphasia. Curr. Opin. Neurol. 23, 633–637. doi: 10.1097/WCO.0b013e32833fb93e
Henry, M. L., Hubbard, H. I., Grasso, S. M, Dial, H. R., Beeson, P. M., Miller, B. L., et al. Treatment for word retrieval in semantic logopenic variants of primary progressive aphasia: immediate long-term outcomes. J. Speech Lang. Hear. Res. (2019) 62, 2723–2749. doi: 10.1044/2018_JSLHR-L-18-0144.
Kertesz, A., and Raven, J. (2007). WAB-R: Western Aphasia Battery-Revised. San Antonio, TX: PsychCorp.
Kramer, J. H. P., Jurik, J., Sha, S. J. M., Rankin, K. P., Rosen, H. J., Johnson, J. K., et al. (2003). Distinctive neuropsychological patterns in frontotemporal dementia, semantic dementia, and alzheimer disease. Cogn. Behav. Neurol. 16, 211–218. doi: 10.1097/00146965-200312000-00002
Lalor, E. C., and Foxe, J. J. (2010). Neural responses to uninterrupted natural speech can be extracted with precise temporal resolution. Eur. J. Neurosci. 31, 189–193. doi: 10.1111/j.1460-9568.2009.07055.x
Lukic, S., Mandelli, M. L., Welch, A., Jordan, K., Shwe, W., Neuhaus, J., et al. (2019). Neurocognitive basis of repetition deficits in primary progressive aphasia. Brain Lang. 194, 35–45. doi: 10.1016/j.bandl.2019.04.003
Mandelli, M. L., Welch, A. E., Vilaplana, E., Watson, C., Battistella, G., Brown, J. A., et al. (2018). Altered topology of the functional speech production network in non-fluent/agrammatic variant of PPA. Cortex 108, 252–264. doi: 10.1016/j.cortex.2018.08.002
Maris, E., and Oostenveld, R. (2007). Nonparametric statistical testing of EEG- and MEG-data. J. Neurosci. Methods 164, 177–190. doi: 10.1016/j.jneumeth.2007.03.024
McHaney, J. R., Gnanateja, G. N., Smayda, K. E., Zinszer, B. D., and Chandrasekaran, B. (2020). Cortical tracking of speech in delta band relates to individual differences in speech in noise comprehension in older adults. Ear Hear. doi: 10.1097/AUD.0000000000000923. [Epub ahead of print].
Mesgarani, N., Cheung, C., Johnson, K., and Chang, E. F. (2014). Phonetic feature encoding in human superior temporal gyrus. Science 343, 1006–1010. doi: 10.1126/science.1245994
Moore, B. II., Schrader, R. L., Kowalski, K., and Autschbach, J. (2017). Electronic π-to-π* excitations of rhodamine dyes exhibit a time-dependent Kohn–Sham theory “cyanine problem”. ChemistryOpen 6, 385–392. doi: 10.1002/open.201700046
Mullen, T. R., Kothe, C. A. E., Chi, Y. M., Ojeda, A., Kerth, T., Makeig, S., et al. (2015). Real-time neuroimaging and cognitive monitoring using wearable dry EEG. IEEE Trans. Biomed. Eng. 62, 2553–2567. doi: 10.1109/TBME.2015.2481482
Puvvada, K. C., and Simon, J. Z. (2017). Cortical representations of speech in a multi-talker auditory scene. J. Neurosci. 37, 9189–9196. doi: 10.1101/124750
Ranasinghe, K. G., Hinkley, L. B., Beagle, A. J., Mizuiri, D., Honma, S. M., Welch, A. E., et al. (2017). Distinct spatiotemporal patterns of neuronal functional connectivity in primary progressive aphasia variants. Brain 140, 2737–2751. doi: 10.1093/brain/awx217
Reetzke, R., Gnanateja, G. N., and Chandrasekaran, B. (in press). Neural tracking of the speech envelope is differentially modulated by attention language experience. Brain Lang. 213:104891. doi: 10.1016/j.bandl.2020.104891
Rohrer, J. D., Ridgway, G. R., Crutch, S. J., Hailstone, J., Goll, J. C., Clarkson, M. J., et al. (2010). Progressive logopenic/phonological aphasia: erosion of the language network. Neuroimage 49, 984–993. doi: 10.1016/j.neuroimage.2009.08.002
Singmann, H., Bolker, B., Westfall, J., Aust, F., and Ben-Shachar, M. (2020). AFEX: Analysis of Factorial Experiments.
Slaney, M. (1998). Auditory Toolbox (2.0) [Matlab]. Interval Research Corporation. Available online at: https://engineering.purdue.edu/~malcolm/interval/1998-010/
Studebaker, G. A. (1985). A “rationalized” arcsine transform. J. Speech Lang. Hear. Res. 28, 455–462.
Tao, Y., Ficek, B., Rapp, B., and Tsapkini, K. (2020). Different patterns of functional network reorganization across the variants of primary progressive aphasia: a graph-theoretic analysis. Neurobiol. Aging 96, 184–196. doi: 10.1016/j.neurobiolaging.2020.09.007
Teoh, E. S., Cappelloni, M. S., and Lalor, E. C. (2019). Prosodic pitch processing is represented in delta-band EEG and is dissociable from the cortical tracking of other acoustic and phonetic features. Eur. J. Neurosci. 50, 3831–3842. doi: 10.1111/ejn.14510
Vanthornhout, J., Decruy, L., Wouters, J., Simon, J. Z., and Francart, T. (2018). Speech Intelligibility predicted from neural entrainment of the speech envelope. J. Assoc. Res. Otolaryngol. 19, 181–191. doi: 10.1007/s10162-018-0654-z
Weintraub, S., Mesulam, M.-M., Wieneke, C., Rademaker, A., Rogalski, E. J., and Thompson, C. K. (2009). The Northwestern Anagram Test: measuring sentence production in primary progressive aphasia. Am. J. Alzheimers Dis. Other Dement. 24, 408–416. doi: 10.1177/1533317509343104
Wertz, R., LaPointe, L., and Rosenbek, J. (1984). Apraxia of Speech in Adults: The Disorder and Its Management. New York, NY: Grune & Stratton.
Wilson, S. M., Brambati, S. M., Henry, R. G., Handwerker, D. A., Agosta, F., Miller, B. L., et al. (2009). The neural basis of surface dyslexia in semantic dementia. Brain 132, 71–86. doi: 10.1093/brain/awn300
Wilson, S. M., Henry, M. L., Besbris, M., Ogar, J. M., Dronkers, N. F., Jarrold, W., et al. (2010). Connected speech production in three variants of primary progressive aphasia. Brain 133, 2069–2088. doi: 10.1093/brain/awq129
Keywords: logopenic variant, logopenic variant of primary progressive aphasia (lvPPA), cortical tracking of speech, temporal response function (TRF), speech perception, speech envelope, speech envelope tracking
Citation: Dial HR, Gnanateja GN, Tessmer RS, Gorno-Tempini ML, Chandrasekaran B and Henry ML (2021) Cortical Tracking of the Speech Envelope in Logopenic Variant Primary Progressive Aphasia. Front. Hum. Neurosci. 14:597694. doi: 10.3389/fnhum.2020.597694
Received: 21 August 2020; Accepted: 19 November 2020;
Published: 06 January 2021.
Edited by:
Kirrie J. Ballard, The University of Sydney, AustraliaReviewed by:
Roeland Hancock, University of Connecticut, United StatesDavid Jackson Morris, University of Copenhagen, Denmark
Copyright © 2021 Dial, Gnanateja, Tessmer, Gorno-Tempini, Chandrasekaran and Henry. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Bharath Chandrasekaran, b.chandra@pitt.edu; Maya L. Henry, maya.henry@austin.utexas.edu