Skip to main content

ORIGINAL RESEARCH article

Front. Neurol., 25 February 2022
Sec. Neurological Biomarkers

An Automated Tongue Tracker for Quantifying Bulbar Function in ALS

  • 1Institute for Cognitive Neurology and Dementia Research, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
  • 2Department of Neurology, Otto-von-Guericke University Magdeburg, Magdeburg, Germany
  • 3Department of Neurology, Hannover Medical School, Hanover, Germany
  • 4Department of Neurology, Rostock University Medical Centre, Rostock, Germany
  • 5German Center for Neurodegenerative Diseases (DZNE), Rostock, Germany
  • 6German Center for Neurodegenerative Diseases (DZNE), Magdeburg, Germany
  • 7Center for Behavioral Brain Sciences, Magdeburg, Germany

Introduction: Bulbar symptoms, including difficulty swallowing and speaking, are common in amyotrophic lateral sclerosis (ALS) and other neurological disorders, such as stroke. The presence of bulbar symptoms provides important information regarding clinical outcomes, such as survival time after diagnosis. Nevertheless, there are currently no easily accessible, quantitative methods to measure bulbar function in patients.

Methods: We developed an open-source tool called Tongue Tracker (TT) to quantify bulbar function by training a neural network to track kinematic tongue features of short video clips of lateral tongue movements. We tested 16 healthy controls and ten patients with ALS, of whom two patients were clinically diagnosed with bulbar-onset type and eight patients were clinically diagnosed with limb-onset type. Of the limb-onset patients, five patients also showed symptoms of bulbar impairment.

Results: We validated TT by comparing the results with manual delineation of tongue movements in the clips. We demonstrate an early-stage bulbar-onset patient who showed fewer and slower tongue sweeps compared to healthy controls and limb-onset patients and we show that five bulbar-impaired limb-onset patients have a different tongue kinematic profile compared to healthy controls.

Discussion: TT may serve to detect quantitative markers of bulbar dysfunction in ALS and other motor disorders, such as stroke, by identifying signatures of spasticity or muscle weakness that affects tongue movement speed and/or tongue movement topography.

Introduction

Bulbar symptoms, characterized by difficulty swallowing and speaking, are common in amyotrophic lateral sclerosis (ALS) and other neurological disorders, such as stroke (1, 2). Critically, the presence of bulbar symptoms is associated with poorer clinical outcomes in ALS, including shorter survival time (3, 4). In ALS, these symptoms are caused by the degeneration of upper and lower motor neurons that support the bulbar muscles in the tongue and throat. While only one-third of patients with ALS first present with bulbar symptoms (so-called “bulbar-onset ALS”), the majority of patients with lower or upper limb-onset ALS will experience bulbar impairment with disease progression because the disorder spreads through the topographically-organized motor system (57). Difficulty swallowing, clinically termed dysphagia, causes malnutrition and even respiratory infections in patient populations. The presence of bulbar symptoms is, therefore, a critical marker of both the disease onset and disease progression in ALS and in other neurological disorders including stroke (1, 8). Despite the clinical relevance of bulbar impairment, there is currently no standardized and accessible measure to quantify bulbar function in a clinical setting (2). This study introduces a tool named “Tongue Tracker” (TT), a novel and automated tool for quantifying bulbar function based on short video clips of tongue movements, which can be acquired in the clinic by using a laptop or mobile phone. We first demonstrate the usage of TT in a sample of healthy controls and patients with ALS with and without clinical bulbar symptom manifestation.

In clinical practice, the bulbar sub-score of the ALS Functional Rating Scale-Revised (ALSFRS-R) (9) is the most commonly used measure of bulbar symptoms. While this score correlates with disease progression in ALS (10), it relies on limited, qualitative self-reports of speech intelligibility, salivation levels and swallowing. Even though the Center for Neurologic Study-Bulbar Function Scale (CNS-BFS) (11) is more comprehensive and specific, it is also dependent on subjective self-reports and, therefore, not objective. Alternatively, speech-based measures can identify bulbar symptoms undetected by the ALSFRS-R and clinicians' speech ratings (12). While promising, measures of speech intelligibility are unlikely to support early detection of bulbar impairment since such changes present later in the disease course (13). Dysphagia presents earlier in bulbar impairment, yet advanced measures of swallowing function have not been validated in ALS and require invasive and expensive methods unsuitable in a routine clinical investigation (8).

Interestingly, kinematic features of the articulatory sub-system, which includes the jaw, lip and tongue, provide sensitive markers of bulbar dysfunction (14). Measures of the extent and speed of lip and jaw movements predict a decline in speech intelligibility after 3 months in patients with ALS, while movement duration increased with disease progression (15, 16). In addition, measures of tongue movement duration and tongue movement speed differentiated between patients with advanced bulbar symptoms and healthy controls, while speed and size decreased with disease progression in early-stage bulbar patients (17). While these studies highlight the sensitivity of tongue kinematic features for the detection and tracking of bulbar impairment, the application of methods requires new and expensive equipment that is unavailable in most standard clinical settings. Therefore, these methods have not been adopted in standard clinical practice.

Building on this knowledge, this study aimed to develop a fast, automated and objective tool for quantifying bulbar kinematic features in patients by using equipment that is usually available in a clinical setting. We developed TT, a software that extracts tongue kinematic features from short, 5-s video clips of lateral tongue movements where the mouth is half-open. To use TT, it is only required to record tongue movements with a built-in camera by using a laptop, computer or mobile phone. TT is an open-source tool (link provided in the Methods section), which is platform-independent and can be optimized for different patient populations (e.g., stroke) or outcome variables (e.g., decreased movement performance over time). Here, we present and test our tool on ten patients with ALS and 16 age- and education-matched healthy controls. Two of the patients were clinically diagnosed with bulbar-onset type and eight patients were clinically diagnosed with (upper or lower) limb-onset type. In addition, we defined five limb-onset patients as bulbar-impaired by using cut-off scores on standard clinical assessments of bulbar function. We investigated whether the TT could differentiate between (i) bulbar-onset patients and healthy controls, (ii) bulbar-onset patients and limb-onset patients, (iii) bulbar-impaired limb-onset patients and healthy controls, and (iv) patients with ALS, regardless of onset type (bulbar and limb) and healthy controls. If differentiation was successful, our results would support the clinical relevance of tongue kinematic features for detecting and quantifying bulbar symptoms in patients with ALS with and without bulbar-onset type and would indicate that TT can serve as a useful tool to pursue that goal. Such measures could support the detection of bulbar symptoms as a marker of disease onset and progression for use in clinical trials.

Materials and Methods

Participants

Tongue movement videos were collected from patients with ALS (n = 17) and healthy control participants (n = 23). Several videos had to be excluded after quality inspection (see below for more details), leaving data of patients with ALS (n = 10) and healthy control participants (n = 16) for analysis. The demographic information for all the participants is shown in Table 1 and additional demographic information for patients is shown in Table 2. Patients with ALS were recruited from the Clinic for Neurology of the University Hospital of the Otto-von-Guericke University Magdeburg (SS), Hannover Medical School (SP) and Rostock University Medical Center (JP) in Germany. Criteria for selection of participants as healthy controls included no history of any neurological or psychiatric disease and the absence of sensorimotor impairments. The control group was age- and education-matched to the patient with ALS group at the group level (i.e., no significant difference between the groups in age and years of education, p-value below 0.05). Since all the patients and healthy controls also participated in a 7T-MRI study, they additionally had to meet 7T inclusion criteria, such as intact hearing (no tinnitus) and no tattoos or metal implants. All the participants provided informed consent and were paid seven euros per hour. This study was approved by the Ethics Committee of the University Hospital of the Otto-von-Guericke Universität Magdeburg (Ethics number: 16/17).

TABLE 1
www.frontiersin.org

Table 1. Demographic variables for patients with amyotrophic lateral sclerosis (ALS) and healthy controls.

TABLE 2
www.frontiersin.org

Table 2. Demographic information for patients with ALS.

Clinical Assessment

Table 3 provides an overview of the clinical information for the patients included in this study. Patients were classified according to the revised El-Escorial criteria (18) into one of the following categories: definite ALS, probable ALS or possible ALS by an experienced neurologist (SV) of the University Hospital Magdeburg, who also performed all the clinical assessments detailed below. Patients were also classified regarding onset type into either bulbar-onset type (n = 2) or (upper or lower) limb-onset type (n = 8) categories. Patients underwent a series of standard clinical tests including the ALSFRS-R (9), the CNS-BFS (11) and the Penn Upper Motor Neuron (UMN-Penn) score (19). Neither patients nor controls were tested for cognitive impairment.

TABLE 3
www.frontiersin.org

Table 3. Clinical scores of patients with ALS.

We defined patients as bulbar-impaired if they had the ALSFRS-R score equal to or below 11 or the CNS-BFS score above 43, as recommended for clinical use (11). Patients (n = 5) with limb-onset ALS were, thus, defined as bulbar-impaired and grouped separately from the two clinically diagnosed bulbar-onset patients. Disease duration was defined as the number of months between the onset of ALS symptoms and the time point of tongue measurement. Disease progression rate (DPR) was calculated according to the formula: DPR = (48-ALSFRS-R)/Disease duration (20).

Experimental Task

Participants were seated in front of a laptop with a built-in camera. Three different models of Apple MacBook laptops with 720p cameras were used (MacBook Pro, 13-inch, 2017; MacBook Air, 13-inch, Mid 2012; MacBook Air, 13-inch, 2020), with which one of three total experimenters performed the test. Participants were given specific verbal instructions for the tongue movement task. Each participant was asked to open his/her mouth and move his/her tongue repeatedly between the two mouth commissures as fast as possible, while keeping the mouth half-open and while keeping the tongue visible, as given in Figure 1B. The movements were video recorded for a minimum of 5 s of correct movement. The task was completed twice to include one training run and one test run, resulting in two videos per participant. The second video was taken for further analysis, unless the video quality was rated as poor during pre-processing, in which case the first video was taken (this occurred for n = 5 participants). Such a simple experimental setting (i.e., without a second software to indicate onset and offset times or to give instructions) was chosen to make this task suitable for clinical routine. Healthy controls were tested at the German Center for Neurodegenerative Diseases (DZNE) in Magdeburg, whereas patients were tested at the University Clinic for Neurology within the University Hospital of the Otto-von-Guericke Universität Magdeburg.

FIGURE 1
www.frontiersin.org

Figure 1. Overview of Tongue Tracker (TT). The videos were first pre-processed via steps detailed in the text (see pre-processing section). (A) The face and mouth were detected and extracted from each frame of the pre-processed videos. (B) The tongue was then detected by using a trained neural network, resulting in a binary image showing the location of the tongue in each video frame. (C) A record of the location of the tongue, either in the left or right portion of the frame, was recorded and saved for extracting kinematic features of the tongue. The code can be accessed here: https://github.com/BudhaTronix/Automated-Video-Analysis-Tool-for-Quantifying-Bulbar-Function.git.

Video Quality Inspection

The inclusion criterion for the video clips was that the full face (including the mouth and the tongue) was visible in all the frames. This was important since TT first performs face detection and segmentation before it classifies tongue movements (see Figure 1). This criterion was ensured by two raters. Videos in which parts of the face or the eyes were not visible due to changes in camera angle or distance during the clip were, therefore, excluded from further analysis. Based on this criterion, healthy controls (n = 8) and patients with ALS (n = 7) had to be excluded from analysis. After exclusions, data from patients with ALS (n = 10) and healthy controls (n = 16) remained. To ensure that the full face is covered in the video clip, it is, therefore, recommended that the experimenter/clinician checks the video immediately after recording.

Tongue Tracker

We developed a tongue tracking algorithm called TT, which uses the software Python to extract and quantify tongue movement features from video clips. Figure 1 provides an overview of each stage after pre-processing, including feature extraction, tongue movement detection and localization. The code is organized into separate modules (modules 1–5) for each processing stage and can be openly accessed here: https://github.com/BudhaTronix/Automated-Video-Analysis-Tool-for-Quantifying-Bulbar-Function.git. TT was developed on an HP Laptop with a Windows 10 operating system by using Pycharm with Python version 3.7 (21). It is an open-source tool and can be run on any operating system where Python 3.7 is installed. It is accessible through the command line but also via an easy-to-use graphical user interface (GUI). The user has an option to select whether the code should be run in single-mode (i.e., for one video) or in multi-mode (i.e., for all the videos in the local folder).

Pre-processing

The videos underwent pre-processing including trimming, stabilization and compression by using module 1 and module 2 of TT. First, the videos were trimmed to include the first 5 s only, to standardize the length of the videos across participants. The video stabilization module then minimized the effects of camera movement by tracking and estimating the motion of the features between two frames with the help of the Lucas–Kanade optical flow method in OpenCV (22). The videos were then compressed and resized to a standard aspect ratio by using the Moviepy package in the Python environment. Finally, the video files were converted from their original MOV format to standard MPEG format.

Face and Mouth Detection

The pre-processed videos then underwent feature extraction by using module 3 and module 4 of TT (see Figure 1A). The face was detected on a frame-by-frame basis by using Facenet, a pre-trained neural network in PyTorch (23). To extract the face in all the frames of the video, a custom function was created to add a bounding box around the detected face in each frame of the video. The coordinates of the bounding box in each frame were stored and the most extreme coordinates were used as a bounding box across all the frames, to ensure that the face was accurately extracted from every frame. The face-extracted videos were then passed through Facenet again, to detect and extract the mouth with a bounding box in each frame, as previously outlined. The mouth-extracted videos were then taken for further processing.

Neural Network

Since, to the best of our knowledge, there are no pre-trained models for tongue detection available online, we created a dataset to train a neural network for segmentation and detection of the tongue in each frame of the video (see Figure 1B). The dataset was created from the mouth-extracted videos. The videos were converted from a series of frames to images and the tongue masks were manually drawn by using ImageJ software for the purpose of training the neural network (24). The dataset, including the image and ground truth (i.e., binary image masks of the tongue segmented region), was then used to train the model. MobileNetV2 architecture (25) was used for tongue detection, since it is optimized for video analysis and supports faster detection and segmentation of the target. The neural network was pre-trained on ImageNet and then fine-tuned on the tongue dataset. After the network was trained, the tongue was detected automatically. Therefore, no manual intervention is required for using the final version of TT.

Extracting Tongue Movement Features

Using module 5 of TT, each frame of the video was passed through the new trained neural network to create a binary image showing the location of the tongue in each frame, as shown in Figure 1B. Using the midline of the frame, the location of the tongue in either the left or right side of the frame was recorded (see Figure 1C). This resulted in a location record of the tongue showing “L” for left or “R” for right, for each frame of the video.

Using the recorded locations of the tongue in each frame, we extracted the following tongue movement features. First, we calculated the total number of sweeps performed within the 5-s video clip. The lateral movement of the tongue back and forth between the corners of the mouth resulted in a typical location profile of multiple “R”s followed by multiple “L”s. A single sweep was defined as a tongue movement starting from halfway through a sequence of “R”s to halfway through a sequence of “L”s, depending on the location of the tongue at the start of the video. We took half of the sequence since the tongue moved from the center of the mouth to the corner before the next sweep began. We next calculated the average duration of a single sweep for each video by first calculating the number of frames taken for each sweep and then converting frames to seconds based on the known frame rate. Finally, we identified and recorded the number of errors in the tongue movement profiles. To achieve this, we first calculated the average number of frames in which the tongue was on the left and right sides of the frame. From this, we identified outliers where the tongue spent a longer than average time on one side of the frame, which could indicate an error made by the participant (i.e., lateral movement paused due to confusion regarding the direction of movement) or spasm due to involuntary muscle contractions.

Validation

To validate the outputs of TT, an independent expert rater (MW) manually counted the number of sweeps performed within the analyzed video sequence without knowledge of the results of TT (i.e., first, the number of sweeps were counted manually before the automated numbers via TT were revealed).

Statistical Analysis

To test the accuracy of TT in each group, we calculated correlations between the total number of sweeps identified by using TT and those identified by the independent expert rater for each group separately, using Pearson's correlation coefficient with a significance level of 5%. To test our hypotheses (see Introduction), we tested for group differences in tongue movement features (number of sweeps, average sweep duration, and number of errors) between each of the five bulbar-impaired limb-onset patients with healthy controls (one sample) using two-tailed, one-sample t-tests with a significance level of 5%. Additionally, we compared each of the two bulbar-onset patients with healthy controls (one sample) and limb-onset patients (one sample) by using two-tailed, one-sample t-tests with a significance level of 5%. Finally, we compared all the patients with healthy controls by using two-tailed, two-samples t-tests with a significance level of 5%. We report which analyses survived correction for multiple comparisons by using the Bonferroni correction.

Results

Validation Results

We correlated the total number of sweeps detected through manual counting with those from TT. The two measures were significantly correlated in both the patient group (R = 0.95**, p = 2.1 × 10−5) and the control group (R = 0.88**, p = 1 × 10−5), as shown in Figure 2.

FIGURE 2
www.frontiersin.org

Figure 2. Validation results. Relationship between manual and automated (TT) methods for quantifying bulbar function based on the same video clips. The total number of sweeps are plotted separately for patients (N = 10) and healthy controls (N = 16). Correlations marked ** are significant at the 5% level.

Tongue Kinematic Feature Results

We tested for group differences in tongue kinematic features, for which detailed statistics are shown in Table 4. With respect to our first hypothesis [(i) differentiation between clinically diagnosed bulbar-onset patients with ALS (i.e., patient nine and patient ten) and healthy controls], as expected, patient nine performed significantly fewer sweeps with a significantly longer sweep duration compared to healthy controls, whereas, unexpectedly, patient ten did not show any significant differences in tongue kinematic features when compared to healthy controls. Patient nine also showed a trend toward more sweep errors compared to healthy controls (see Table 4 for statistics), as shown in Figure 3. With respect to our second hypothesis [(ii) differentiation between bulbar-onset patients with ALS (i.e., patient nine and patient ten) and limb-onset patients with ALS], patient nine again performed significantly fewer sweeps with a significantly longer duration compared to limb-onset patients, while patient ten showed a trend toward shorter sweep duration compared to limb-onset patients (see Table 4 for statistics), as shown in Figure 3.

TABLE 4
www.frontiersin.org

Table 4. Overview over tongue kinematic features.

FIGURE 3
www.frontiersin.org

Figure 3. Group differences in tongue movement features. Shown are mean number of sweeps, mean sweep duration and mean number of errors. Top row: Box plots show patients with amyotrophic lateral sclerosis (ALS) (N = 10, circles, red) and healthy controls (N = 16, squares, blue). Middle row: Box plots show bulbar-onset patients (N = 2, triangles) and healthy controls (N = 16, squares, blue). Bottom row: Box plots show bulbar-onset patients (N = 2, triangles) and limb-onset patients (N = 16, circles, yellow), where limb-onset patients with bulbar impairment are shaded circles (Individual patients marked * are significant when compared to the healthy controls or limb-onset patients).

With respect to our third hypothesis [(iii) differentiation between bulbar-impaired limb-onset patients with ALS (i.e., patients one, four, five, six and eight) and healthy controls], as expected, patients one, five, six and eight performed significantly fewer sweeps compared to healthy controls (see Table 4 for statistics), while patient four did not show a significant difference compared to healthy controls. Patients one, five and eight performed significantly slower tongue sweeps compared to healthy controls, while patients four and six did not show a significant difference when compared to healthy controls. With respect to the number of errors, patient six showed significantly more errors compared to healthy controls, while patient five did not show a significant difference in number of errors compared to healthy controls. Patients four and eight showed a trend toward more errors compared to healthy controls, while patient one showed a trend toward fewer errors compared to healthy controls (see Table 4 for statistics).

With respect to our fourth hypothesis [(iv) differentiation between patients with ALS regardless of onset type and healthy controls], the group of patients with ALS showed a statistical trend toward completing fewer sweeps compared to the group of controls, while there were no significant differences and no trend toward a significant difference in sweep duration or in sweep errors in the patient group compared to healthy controls (see Table 4 for statistics), as shown in Figure 3.

Discussion

We present a novel and open-access tool called TT that automatically quantifies tongue kinematic features from video clips of lateral tongue movements. We tracked tongue kinematic features, namely, the number of tongue sweeps, tongue movement duration, and tongue movement errors, in healthy controls and in a heterogeneous group of patients with ALS, some of whom presented with bulbar symptoms and some of whom showed mainly limb-associated symptoms. One of the two patients with bulbar-onset ALS had a significantly different tongue kinematic profile compared to both the group of healthy controls and compared to the limb-onset patients with respect to the number of tongue sweeps and average sweep duration. The second bulbar-onset patient, who suffered from ALS for many years without rapid disease progression, did not show such a distinct profile. In the group of limb-onset patients with bulbar impairment, four patients showed significantly different tongue kinematic profiles when compared to healthy controls with respect to a number of tongue sweeps. Overall, the group of patients with ALS showed a statistical trend toward fewer and slower tongue movements compared to healthy controls. Our open-source tool provides a new, quantitative measure of bulbar dysfunction for use in ALS and possibly other neurological disorders presenting with dysphagia that can be used to identify dysfunction and to track bulbar function over time.

We first investigated how TT characterizes the tongue movements of patients with a clinical diagnosis of bulbar-onset type ALS. We, therefore, compared each of the two cases of bulbar-onset patients with ALS with both the healthy controls and the limb-onset patients. Patient nine showed a tongue kinematic profile in line with reduced bulbar function (i.e., significantly fewer and slower tongue movements), whereas patient ten did not show such a profile. There is a clear difference in the disease etiology of these two patients. Patient nine was in the early disease stage at only 3 months post-diagnosis, whereas patient ten had been diagnosed 33 months prior to this study. Despite this long disease duration, patient 10 was able to complete 1 h of MRI scanning, suggesting a slow disease progression. This difference in disease stage is reflected by the patients' clinical scores on the CNS-BFS, the ALSFRS-R and the ALSFRS-R bulbar sub-score, in which patient 10 showed greater impairment on all of the scales. Research has highlighted a potential compensatory mechanism in which patients in the advanced stages of ALS show increased jaw movements to compensate for decreased tongue movement speed and range in speech, before they then present with more severe symptoms when compensation fails (17, 26). This compensatory mechanism in advanced bulbar ALS, which may particularly occur when the disease progresses slowly, may account for the trend toward faster tongue movements when patient ten was compared to limb-onset patients. In this respect, it would be interesting in future studies to also record and analyse jaw movements in the videos.

While bulbar function is supported bilaterally in the healthy brain, disease onset in ALS is typically lateralised to one hemisphere and seems to spread differently through the lower motor neurons in comparison to the primary motor cortex (6). We speculate that in bulbar-onset patients with greater UMN involvement, tongue kinematic features may be most impacted in the early disease stage. With disease progression, a slower decline or even improvement in performance may occur due to compensation from both the non-affected hemisphere and non-affected jaw muscles. At the more advanced disease stages, we would expect tongue kinematic features to decline again as compensation is inhibited by more severe and bilateral bulbar impairment. The two bulbar-onset patients in this study differed in their motor neuron involvement, where patient 9 has a lower motor neuron dominant phenotype and patient 10 has an UMN dominant phenotype. This difference in phenotype may explain why the tongue kinematic profile of patient 10 could not be distinguished from healthy controls, where patient 10 may have been in the stage of compensation as mentioned above.

Next, we investigated the sensitivity of TT to detect bulbar symptoms in limb-onset patients. We, therefore, compared each of the five bulbar-impaired limb-onset patients with ALS to the healthy controls. Four out of five bulbar-impaired limb-onset patients showed fewer tongue movements compared to controls and three out of five bulbar-impaired limb-onset patients also showed significantly slower tongue movements compared to controls. Out of the two patients who did not show significant differences in movement duration compared to controls, patient six showed significantly more errors, while patient four showed a trend toward more errors in tongue movement compared to healthy controls. This may reflect a speed-accuracy trade-off in both the patients. Since a similar tongue kinematic profile was reflected in the bulbar-onset patients (see above), we think that, in particular, the number of sweeps and sweep duration, rather than the number of errors, are sensitive markers to quantify bulbar impairment in patients with ALS. Such features likely reflect the signature spasticity of the tongue associated with bulbar impairment caused by UMN degeneration, which has been shown to independently predict survival time in patients with ALS (27). This is also supported by previous studies in which speech-based measures identified early bulbar symptoms undetected by both the patient self-reports and clinician speech ratings (12) and where lip and jaw kinematic features predicted a decline in speech intelligibility by 3 months (15).

Next, we also compared the full, heterogeneous group of patients with ALS to healthy controls to see if TT could differentiate between patients and controls. Here, TT detected a trend toward patients with ALS completing fewer tongue sweeps compared to healthy controls. Our sample of patients with ALS was highly heterogeneous in regards to disease duration and disease progression rate and also with respect to the involvement of bulbar impairments. Even though this trend may be significant when testing the larger patient groups, TT may be more suitable for identifying bulbar impairments in the patient groups rather than differentiating between patients with ALS and controls.

We think that TT may also be particularly suitable to track changes in tongue kinematic features in limb-onset patients as they appear when the disease spreads to the bulbar area of the primary motor cortex (M1) over time (6). To test for these assumptions, one could perform longitudinal sampling with our tool to track tongue kinematic features with disease progression. Accurately quantifying bulbar dysfunction is further necessary to identify corresponding brain changes to the bulbar area of M1 by using neuroimaging methods (7, 28). In addition, our measure could be used to track tongue movement training success to improve outcomes in dysphagia, as previously demonstrated with specialized equipment to measure tongue pressure in patients following acquired brain injury and stroke (4). While we did not directly compare the patients with bulbar symptoms to a disease control group (e.g., stroke patients with bulbar symptoms), we expect a differentiation because the performance of patients with ALS should worsen over time, even after a possible compensation period as the disease progresses further due to continuous neurodegeneration.

The current diagnosis of bulbar dysfunction in ALS is clinical and, thus, requires assessment by an expert neurologist to identify subtle signs, particularly in the early stages of the disease. A key advantage of TT is that it requires neither new equipment nor expert medical knowledge to extract information regarding the kinematic features of the tongue. Instead of using expensive sensors to estimate camera motion, we pre-processed the videos to reduce motion and background noise. We think that TT is also relevant for clinicians with expertise in ALS, since it can quantify bulbar impairment reliably over time with different examiners. Along with our open-access code, we also provide all the tongue video clips to enable other researchers to fine-tune the model as per their needs. For example, one could extend the code to track the jaw movement, which could be used to test the compensation hypothesis in ALS (see above).

One limitation of this study is the relatively small sample size of patients with ALS. Given the low prevalence of ALS in the general population, availability of patients is generally poor. With our relatively low sample size, the investigation of sub-groups was limited and some statistical evaluations, such as disease classification or prediction, could not be calculated based on our dataset. Future studies may apply our tool in large-cohort analyses to reveal more information about sub-group classification. In addition, this study did not directly compare the results of TT with invasive, quantitative measures of tongue movement or swallowing function. It would be important to compare the reliability of the different tools to quantify bulbar impairment. Future studies may, therefore, investigate a multitude of parallel measures to provide recommendations as to which tool is most suitable in different disease stages.

Given the signature spasticity symptoms of UMN degeneration, our measure likely largely reflects UMN involvement in bulbar impairment. We suggest that the addition of a still tongue task along with modifications to the motion tracking code could allow for measurement of tongue fasciculations, which may provide important insight into lower motor neuron involvement. Since we had a high exclusion rate of videos, we recommend that future users of TT position the camera directly in front of the participant and not below, to ensure that the full face, including eyes and forehead, is visible. This is necessary for successful face detection and tracking during pre-processing. While we did not test the impact of cognitive impairment on TT, we suggest that TT would be equally successful with additional training time and/or when using visual instructions via photos or short instruction videos. Since we did not have the disease control group, for example, a group of patients with non-ALS with bulbar symptoms (e.g., patients with stroke or patients with Parkinson's disease), we cannot make any conclusions about the utility of TT for the other patient groups.

In conclusion, we here introduce TT, a fast, quantitative and open-access tool for quantifying bulbar function and provide first clinical data in a small sample of patients with ALS and healthy control participants. We found that the tongue kinematic features, in particular the number and duration of tongue movements, can identify some patients with bulbar symptoms with or without a diagnosis of bulbar-onset type. Such features likely reflect the signature spasticity and weakness of the tongue associated with bulbar impairment, caused by UMNs. Our open-source tool may serve as a quantitative marker for symptoms of bulbar dysfunction in other disorders beyond ALS, such as stroke.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors, without undue reservation.

Ethics Statement

The studies involving human participants were reviewed and approved by Ethics Committee of the University Clinic Magdeburg. Number: 16/17. The patients/participants provided their written informed consent to participate in this study. Written informed consent was obtained from the individual(s) for the publication of any potentially identifiable images or data included in this article.

Author Contributions

This study was designed by EK, MW, and SS. Patients were recruited by SS, SP, and JP. Data collection was performed by MW and AN. Clinical assessment was performed by SV. Formal analysis was completed by BM and AN. Data investigation and statistical analysis were completed by AN. The first draft of the manuscript was written by AN. The manuscript was reviewed and edited by MW, EK, SS, and BM. This study was supervised by EK and SS. All authors contributed to the article and approved the submitted version.

Funding

This project was funded by Else Kröner-Fresenius-Stiftung (EKFS) (Grant Number 2019-A03).

Conflict of Interest

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Publisher's Note

All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.

Acknowledgments

The authors would like to thank all the participants for their involvement in this study. Additionally, we thank Lilith-Sophie Lange for data collection.

References

1. Arnold M, Liesirova K, Broeg-Morvay A, Meisterernst J, Schlager M, Mono ML, et al. Dysphagia in acute stroke: incidence, burden and impact on clinical outcome. PloS ONE. (2016) 11:e0148424. doi: 10.1371/journal.pone.0148424

PubMed Abstract | CrossRef Full Text | Google Scholar

2. Green JR, Yunusova Y, Kuruvilla MS, Wang J, Pattee, GL, et al. Bulbar and speech motor assessment in ALS: challenges and future directions. Amyotroph Lateral Scler Frontotemporal Degener. (2013) 14:494–500. doi: 10.3109/21678421.2013.817585

PubMed Abstract | CrossRef Full Text | Google Scholar

3. Elamin M, Bede P, Montuschi A, Pender N, Chio A, Hardiman, et al. Predicting prognosis in amyotrophic lateral sclerosis: a simple tool. J Neurol. (2015) 262:1447–54. doi: 10.1007/s00415-015-7731-6

PubMed Abstract | CrossRef Full Text | Google Scholar

4. Steele CM, Bailey GL, Polacco REC, Hori SF, Molfenter SM, et al. Outcomes of tongue-pressure strength and accuracy training for dysphagia following acquired brain injury. Int J Speech Lang Pathol. (2013) 15:492–502. doi: 10.3109/17549507.2012.752864

PubMed Abstract | CrossRef Full Text | Google Scholar

5. Bede P, Bokde A, Elamin M, Byrne S, McLaughlin RL, Jordan N, et al. Grey matter correlates of clinical variables in amyotrophic lateral sclerosis (ALS): a neuroimaging study of ALS motor phenotype heterogeneity and cortical focality. J. Neurol. (2013) 84:766–73. doi: 10.1136/jnnp-2012-302674

PubMed Abstract | CrossRef Full Text | Google Scholar

6. Ravits JM, La Spada AR. ALS motor phenotype heterogeneity, focality, and spread: deconstructing motor neuron degeneration. Neurology. (2009) 73:805–11. doi: 10.1212/WNL.0b013e3181b6bbbd

PubMed Abstract | CrossRef Full Text | Google Scholar

7. Schreiber S, Northall A, Weber M, Vielhaber S, Kuehn E. Topographical layer imaging as a tool to track neurodegenerative disease spread in M1. Nat Rev Neurosci. (2021) 22:68–9. doi: 10.1038/s41583-020-00404-w

PubMed Abstract | CrossRef Full Text | Google Scholar

8. Kühnlein P, Gdynia HJ, Sperfeld AD, Lindner-Pfleghar B, Ludolph AC, Prosiegel M, et al. Diagnosis and treatment of bulbar symptoms in amyotrophic lateral sclerosis. Nat Clin Pract Neurol. (2008) 4:366–74. doi: 10.1038/ncpneuro0853

PubMed Abstract | CrossRef Full Text | Google Scholar

9. Cedarbaum JM, Stambler N, Malta E, Fuller C, Hilt D, Thurmond B, et al. The ALSFRS-R: a revised ALS functional rating scale that incorporates assessments of respiratory function. J Neurol Sci. (1999) 169:13–21. doi: 10.1016/S0022-510X(99)00210-5

PubMed Abstract | CrossRef Full Text | Google Scholar

10. Kollewe K, Mauss U, Krampfl K, Petri S, Dengler R, Mohammadi B. ALSFRS-R score and its ratio: a useful predictor for ALS-progression. J Neurol Sci. (2008) 275:69–73. doi: 10.1016/j.jns.2008.07.016

PubMed Abstract | CrossRef Full Text | Google Scholar

11. Smith RA, Macklin EA, Myers KJ, Pattee GL, Goslin KL, Meekins GD, et al. Assessment of bulbar function in amyotrophic lateral sclerosis: validation of a self-report scale (Center for Neurologic Study Bulbar Function Scale). Eur J Neurol. (2018) 25:907–e66. doi: 10.1111/ene.13638

PubMed Abstract | CrossRef Full Text | Google Scholar

12. Allison KM, Yunusova Y, Campbell TF, Wang J, Berry JD, Green JR. The diagnostic utility of patient-report and speech-language pathologists' ratings for detecting the early onset of bulbar symptoms due to ALS. Amyotroph Lateral Scler. (2017) 18:358–66. doi: 10.1080/21678421.2017.1303515

PubMed Abstract | CrossRef Full Text | Google Scholar

13. Mefferd AS, Green JR, Pattee G. A novel fixed-target task to determine articulatory speed constraints in persons with amyotrophic lateral sclerosis. J Commun Disord. (2012) 45:35–45. doi: 10.1016/j.jcomdis.2011.09.002

PubMed Abstract | CrossRef Full Text | Google Scholar

14. Wilson EM, Kulkarni M, Simione M, Rong P, Green JR, Yunusova Y. Detecting bulbar motor involvement in als: comparing speech and chewing tasks. Int J Speech Lang Pathol. (2019) 21:564–71. doi: 10.1080/17549507.2018.1557254

PubMed Abstract | CrossRef Full Text | Google Scholar

15. Yunusova Y, Green JR, Lindstrom MJ, Ball LJ, Pattee GL, Zinman L. Kinematics of disease progression in bulbar ALS. J Commun Disord. (2010) 43:6–20. doi: 10.1016/j.jcomdis.2009.07.003

PubMed Abstract | CrossRef Full Text | Google Scholar

16. Yunusova Y, Green JR, Wang J, Pattee G, Zinman L. A protocol for comprehensive assessment of bulbar dysfunction in amyotrophic lateral sclerosis (ALS). J Vis Exp. (2011) 48:e2422. doi: 10.3791/2422

PubMed Abstract | CrossRef Full Text | Google Scholar

17. Shellikeri S, Green JR, Kulkarni M, Rong P, Martino R, Zinman L, et al. Speech movement measures as markers of bulbar disease in amyotrophic lateral sclerosis. J Speech Lang Hear Res. (2016) 59:887–99. doi: 10.1044/2016_JSLHR-S-15-0238

PubMed Abstract | CrossRef Full Text | Google Scholar

18. Brooks BR, Miller RG, Swash M, Munsat TL. El Escorial revisited: revised criteria for the diagnosis of amyotrophic lateral sclerosis. Amyotroph Lateral Scler Other Motor Neuron Disord. (2000) 1:293–9. doi: 10.1080/146608200300079536

PubMed Abstract | CrossRef Full Text | Google Scholar

19. Woo JH, Wang S, Melhem ER, Gee JC, Cucchiara A, McCluskey L, et al. Linear associations between clinically assessed upper motor neuron disease and diffusion tensor imaging metrics in amyotrophic lateral sclerosis. PloS ONE. (2014) 9:e105753. doi: 10.1371/journal.pone.0105753

PubMed Abstract | CrossRef Full Text | Google Scholar

20. Walhout R, Westeneng HJ, Verstraete E, Hendrikse J, Veldink JH, Van Den Heuvel MP, et al. Cortical thickness in ALS: towards a marker for upper motor neuron involvement. J Neurol Neurosurg Psychiatry. (2015) 86:288–94. doi: 10.1136/jnnp-2013-306839

PubMed Abstract | CrossRef Full Text | Google Scholar

21. Van Rossum G, Drake FL. Python Reference Manual. The Netherlands: Centrum voor Wiskunde en Informatica Amsterdam (1995).

Google Scholar

22. Bradski G. The openCV library. Dobbs J. (2000) 25:120–3. Available online at: https://www.elibrary.ru/item.asp?id=4934581, https://opencv.org/releases/

Google Scholar

23. Schroff F, Kalenichenko D, Philbin J. Facenet: a unified embedding for face recognition and clustering [Paper presentation]. In: Proceedings of the IEEE conference on computer vision and pattern recognition 2015. Boston, MA (2015). p. 815–23.

Google Scholar

24. Schneider CA, Rasband QS, Eliceiri KW. NIH Image to ImageJ: 25 years of image analysis. Nat Methods. (2012) 9:671–5. doi: 10.1038/nmeth.2089

PubMed Abstract | CrossRef Full Text | Google Scholar

25. Sandler M, Howard A, Zhu M, Zhmoginov A, Chen LC. Mobilenetv2: inverted residuals and linear bottlenecks [Paper presentation]. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2018. Salt Lake City, UT (2018). 4510–20.

Google Scholar

26. Shellikeri S. Articulatory Compensation in Amyotrophic Lateral Sclerosis: Tongue and Jaw in Speech (dissertation). Toronto, Ontario: University of Toronto (2014).

PubMed Abstract | Google Scholar

27. Weikamp JG, Schelhaas HJ, Hendriks JCM, de Swart BJM, Geurts ACH. Prognostic value of decreased tongue strength on survival time in patients with amyotrophic lateral sclerosis. J Neurol. (2012) 259:2360–5. doi: 10.1007/s00415-012-6503-9

PubMed Abstract | CrossRef Full Text | Google Scholar

28. Donatelli G, Ienco EC, Costagli M, Migaleddu G, Cecchi P, Siciliano G, et al. MRI cortical feature of bulbar impairment in patients with amyotrophic lateral sclerosis. NeuroImage Clin. (2019) 24:101934. doi: 10.1016/j.nicl.2019.101934

PubMed Abstract | CrossRef Full Text | Google Scholar

Keywords: tongue, bulbar, amyotrophic lateral sclerosis, neural network, quantification

Citation: Northall A, Mukhopadhyay B, Weber M, Petri S, Prudlo J, Vielhaber S, Schreiber S and Kuehn E (2022) An Automated Tongue Tracker for Quantifying Bulbar Function in ALS. Front. Neurol. 13:838191. doi: 10.3389/fneur.2022.838191

Received: 17 December 2021; Accepted: 24 January 2022;
Published: 25 February 2022.

Edited by:

Cristina Moglia, University of Turin, Italy

Reviewed by:

Enrica Bersano, IRCCS Carlo Besta Neurological Institute Foundation, Italy
Giorgia Querin, INSERM U1146 Laboratoire d'Imagerie Biomédicale, France

Copyright © 2022 Northall, Mukhopadhyay, Weber, Petri, Prudlo, Vielhaber, Schreiber and Kuehn. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

*Correspondence: Alicia Northall, YWxpY2lhLm5vcnRoYWxsJiN4MDAwNDA7bWVkLm92Z3UuZGU=

These authors have contributed equally to this work and share first authorship

These authors have contributed equally to this work and share last authorship

Disclaimer: All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article or claim that may be made by its manufacturer is not guaranteed or endorsed by the publisher.