- Voice Research Laboratory, Faculty of Education, The University of Hong Kong, Pokfulam, Hong Kong SAR, China
Purpose: This study set out to determine the contributions of the suprahyoid and sternocleidomastoid (SCM) muscles in changing pitch and loudness during phonation among vocally healthy populations.
Method: Thirty-nine participants were first recruited, and twenty-nine of them who passed the screening test (Voice Handicap Index [VHI]-10 score ≤11, auditory-perceptual voice rating score ≤2) were finally selected (mean age = 28.2 years). All participants were measured for their surface electromyographic (sEMG) activity collected from the bilateral suprahyoid and SCM muscles when producing the vowel /a/, /i/, and /u/ in natural (baseline) and at different pitch (+3, +6, -3, -6 semitones) and loudness (+5, +10, −5 dB) levels. Linear mixed-effects models were performed to determine the influencing factors on the root-mean-square percentage of maximal voluntary contraction (RMS %MVC) value of the sEMG signals.
Results: Compared with the baseline, a significant decrease of RMS %MVC was found in the suprahyoid muscles during overall phonations of lower pitches (−3 and −6 semitones) and loudness (−5 dB). However, no significant change was detected when producing speech at higher pitch (+3 and +6 semitones) and loudness (+5 and +10 dB) levels. Among the three vowels, /i/ demonstrated significantly higher RMS %MVC than those of /a/ and /u/. The SCM muscles, however, did not show any significant change in the RMS %MVC values among different vowels in relation to the pitch and loudness changes. When the muscles were compared across the two sides, significantly higher RMS %MVC was found in the right side of the suprahyoid (in pitch and loudness control) and SCM (in pitch control) when compared to the left side.
Conclusions: The suprahyoid muscle activities were significantly decreased when producing lower pitches and intensities compared to the natural baselines. The production of sustained /i/ required significantly more suprahyoid muscle activities than those of /a/ and /u/. The SCM muscles did not show much sEMG activity in any of the pitch and loudness levels, which could be used potentially as the calibration or normalization of peri-laryngeal sEMG measurement. The findings also showed a tendency for bilateral asymmetry in the use of suprahyoid and SCM muscles.
1 Introduction
Phonation plays an important role in human communication, which is a complex system relying on the laryngeal and peri-laryngeal muscles to change the pitch and loudness at all times (Vilkman et al., 1996). The intrinsic laryngeal muscles, which connect different anatomical parts of the larynx, were considered to control the tension of the vocal folds, hence determining the subglottic pressure, and the airflow (Hirano et al., 1969). The extrinsic laryngeal muscles bounded by the hyoid bone and divided into the suprahyoid and infrahyoid muscle groups, were more directly related to the position of the larynx (Honda et al., 1999). The suprahyoid muscles (e.g., the geniohyoid, mylohyoid, stylohyoid, and digastricus muscles) serve as the elevator that helps draw the hyoid bone upwards and lift the larynx, while the infrahyoid muscles (e.g., the sternothyroid, thyrohyoid, and the sternohyoid muscles) play the role of a depressor that pulls down the hyoid bone and lowers the larynx (Marchal, 2009).
Although in theory, an elevated hyoid bone or larynx usually correlates with an increase in pitch production (and vice versa, e.g., see Honda et al., 1999), researchers in practice, however, have no consensus on how different extrinsic laryngeal muscles participate or respond to changes of pitch. In the 1990s, Vilkman et al. (1996) reviewed 15 studies that investigated the contributions of extrinsic laryngeal muscles to pitch variations. They found that 1) the infrahyoid muscles received more attention (15 studies) than the suprahyoid muscles (5 studies); 2) none of the muscles showed a consistent contribution for pitch variations among different studies, or even different subjects in a certain study. It should be noted that the phonation tasks (e.g., singing, speech, intonation) and pitch ranges (e.g., extreme pitch range, two octaves) varied among these 15 studies, which made it difficult to compare and extract consistent evidence on how extrinsic laryngeal muscles contribute to pitch manipulations.
Fewer studies focused specifically on the correlations between the extrinsic laryngeal muscles and control of loudness. Hirano et al. (1967) studied specifically the function of the sternohyoid muscle (which belongs to the infrahyoid muscle group) and found an increased electrical activity following the increasing loudness at different pitch levels. However, they also noted that these muscle activation patterns differed among subjects depending on individual vocal habits or experience of vocal training. Dietrich and Verdolini Abbott (2012) indicated that the infrahyoid muscle activities were increased along with the self-perceived vocal effort under stressful conditions, while McKenna et al. (2019) reported stronger activation of the suprahyoid muscles with an increased self-perception of vocal effort.
Pitch and loudness control is generally associated with vocal effort. Hence, they were commonly used as vocal loading tasks (Fujiki and Sivasankar, 2017). The excessive vocal effort might result in an increased extrinsic laryngeal muscle tension, which was so often recognized as a possible cause and/or typical symptom of voice disorders, specifically in muscle tension dysphonia and vocal hyperfunction (Van Houtte et al., 2011). An increasing number of studies, therefore, attempted to investigate the clinical values of the extrinsic laryngeal muscles for the diagnosis of voice disorders (Khoddami et al., 2013) using surface electromyography (sEMG). Surface EMG has the advantage of being non-invasive in measuring the activities of the relatively superficial extrinsic laryngeal muscles in the peri-laryngeal area (Stepp, 2012). One would expect that the sEMG might be a useful clinical tool in identifying typical voice disorders which involve laryngeal muscle activities. However, recent reviews (Wang and Yiu, 2021; Desjardins et al., 2022) showed that this is far from the truth. There is still no consensus on how to reliably identify voice disorders using sEMG. Wang and Yiu (2021) pointed out that there is little agreement on what phonation tasks should be used in sEMG measurement for the peri-laryngeal muscles. This is probably due to little knowledge of how sEMG measurement can be affected differently by different degrees of task design, such as changes in pitch and loudness. It is therefore necessary to investigate how the control of pitch and loudness in sustained phonation tasks affects the measurement of sEMG activity in the peri-laryngeal muscles.
The sternocleidomastoid (SCM) muscles are often classified as respiratory muscles in the peri-laryngeal area that are not so important for speech production (Stepp, 2012). However, it has been found that the SCM muscles were also activated during speech (Pettersen et al., 2005), and varied with the change in loudness (Arifin et al., 2014) and pitch (Yu et al., 2016). A recent study also found a fatigue-related change in the sEMG signals of the SCM muscles after a vocal loading task (Yiu et al., 2023). Compared with other peri-laryngeal muscles, the SCM muscles have relatively large sizes suitable for sEMG measures, but whether or how they worked during pitch and loudness control in sustained phonation remains unclear so far.
A better understanding of the role that the suprahyoid and SCM muscles play in phonation would not only help to understand the physiological mechanism of the neuromuscular modulation during speech and voice production, but also shed light on the clinical pathology of voice disorders. The present study, therefore, aimed to determine the contributions of the suprahyoid and SCM muscles to pitch and loudness control. We hypothesized that the target muscles would participate in pitch and loudness control and might show different activities in changing pitch and loudness levels.
2 Materials and methods
2.1 Participants
Thirty-nine healthy adults (21 females, 18 males) aged between 20 and 78 years (M = 30.4 years, SD = 11.1 years) were recruited. All participants were current Hong Kong Chinese residents without a reported history of language, hearing, or voice disorders. None of the participants reported a prior experience with professional singing training. The present study was approved by the Human Research Ethics Committee (HREC), The University of Hong Kong (EA2001023(A)). Participants had enough time to fully read all the related information and ask any questions about this study. They had to sign a written consent before they formally took part in this study.
2.2 Procedure
2.2.1 Participant screening
All participants, although reported no voice disorders by themselves, were further ascertained for their vocal conditions. First, each participant was asked to fill in a Chinese Voice Handicap Index-10 (VHI-10) questionnaire (Lam et al., 2006) to determine their voice conditions and how their voices affected their lives. Using the cut-off point as suggested by Arffa et al. (2012), only participants who got a VHI-10 score of 11 or less were included in the study.
These participants were then asked to produce sustained vowels /a/, /i/, and /u/ (each for 3–5 s) and read a Chinese poetry Jingye Si at a comfortable pitch and loudness. The Chinese poetry Jingye Si contained different phonemes that required diverse laryngeal behaviors (see the complementary materials) and was used to measure speech, voice, and respiratory signals (Li, 2015). Samples were recorded using the Visi-Pitch (KAY3950c, PENTAX, Montvale, NJ) and high-quality microphone (SHURE SM48, Niles, IL) with a microphone-to-mouth distance of 10 cm. Two female certified speech pathologists, who were current PhD students in voice and swallowing disorders and had experiences with voice-disordered patients, undertook an auditory-perceptual rating for each of the recordings independently. These two judges were blinded to the VHI-10 scores of each participant and made an overall judgment on each participant’s voice using an equal-appearing interval scale from 0 (no voice problem) to 10 (severe voice problem). Based on the findings of auditory-perceptual voice evaluation studies using a visual analog scale (VAS), a cut-off value of 2/10 points (usually more than 30/100 points in previous literature, e.g., see Lee et al., 2018; Simberg et al., 2000) was used to select the healthy participants to be included in the final study.
2.2.2 Surface EMG (sEMG) setup
Each participant was required to sit upright on a backrest chair with four surface electrodes (single-differential Trigno™ Mini sensor, Delsys, Natick) taped on the skin surface of the neck area. To avoid a high skin-electrode impedance, all participants were informed in advance to shave the anterior neck area (if applicable) the day before the study. In addition, a 70% isopropyl alcohol wipe was used to clean the skin of the neck area before electrode placement (Stepp, 2012).
The placement of the sEMG electrodes was demonstrated in Figure 1, mainly targeting the bilateral suprahyoid and SCM muscles. Sensor 1 and 2 were placed symmetrically in the submandibular region, each was about 1 cm away from the midline. These two electrodes were used to detect the activation of the suprahyoid muscles involving the anterior belly of the digastric muscle, the anterior mylohyoid muscle, and the geniohyoid muscles. The potential activities of the platysma muscle, a superficial layer overlaying the suprahyoid area, were also unavoidably collected (Stepp, 2012). The associated ground sensor A and B were placed on the ipsilateral mastoid process. Sensor 3 and 4 were placed approximately at the mid-point of the left and right SCM muscles, with the corresponding ground sensor C and D on the mid-point of the left and right clavicle. The clavicle and ipsilateral mastoid process were superficial and appropriate in size for the ground sensor placement. To minimize the “cross-talk” between electrode sites, every two electrodes were confirmed to keep a distance of no less than 2 cm (Stepp, 2012).
FIGURE 1. Surface electromyographic (sEMG) sensor placement (sensor 1, 2, 3, 4 and corresponding ground sensor A, B, C, D) on the region of suprahyoid muscles and sternocleidomastoid (SCM) muscles.
2.2.3 Recordings of the sEMG signals
Participants were first required to complete two maximal voluntary contraction (MVC) tasks. The sEMG data collected during the MVC tasks would serve as a reference for data normalizations that would ensure comparisons among individuals under different conditions (Netto and Burnett, 2006; Stepp, 2012). For the suprahyoid muscles, participants were asked to keep seated and put their chins on a stable platform (the height of the platform was adjusted to be flush with each participant’s chin) and then pressed their chins down against the platform by exerting a maximum force for about 3 s (Redenbaugh and Reich, 1989; Stepp et al., 2011). Participants were required to keep a vertically downward force without any other movement of the head and shoulders throughout the process. For the SCM muscles, participants sat with a stable body and resting arms and then performed an isometric neck flexion 90° to the left (for the MVC of the right SCM) and then to the right (for the MVC of the left SCM), each with their maximum force for about 3 s (Barbero et al.,2012).
Each participant was then asked to produce a prolongation of each of the vowels /a/, /i/, and /u/ respectively for 4 seconds at a comfortable pitch and loudness level. These three vowels were selected as previous literature found that the extrinsic laryngeal muscle activities varied depending on different vowels (Faaborg-Andersen and Vennard, 1964). Real-time visual feedback (including the instantaneous values and value changes over time) of the pitch and loudness (as shown in Figure 2) was provided to each participant using a Vocal Pitch Monitor application (Tadao, 2019) and online Youlean loudness meter (Nikolic, 2020) on a Microsoft Surface Pro Pad (Microsoft, model 1796) which was set up about 20 cm away from the participant. The baseline levels of pitch and loudness for a natural phonation of /a/, /i/, and /u/ were recorded. Participants were then asked to produce the vowel /a/, /i/, and /u/ at four different pitch levels (+3 semitones, +6 semitones, −3 semitones, −6 semitones, compared with the baseline) and three different loudness levels (+5 dB, +10dB, −5dB, compared with the baseline) respectively with the assistance of the real-time visual feedback. They were given a few practices by specifically producing different pitches without producing great variations of the loudness, and vice versa. After the practices, participants then produced the vowel /a/, /i/, and /u/ respectively for 4 seconds at four different pitch levels and three different loudness levels in a randomized order for sEMG recording. The real-time visual feedback of the pitch and loudness levels was always shown to each participant. An examiner who was also exposed to the real-time feedback kept monitoring the pitch and loudness levels to ensure each participant had properly completed each task. All sEMG signals from the four sensors were recorded to four separate channels using the Delsys Trigno™ Wireless System (16-channel, Delsys, Natick).
FIGURE 2. Visual feedback interfaces demonstrating real-time loudness (left, displayed with the online loudness and sound meter by Youlean) and pitch (right, displayed with the Vocal Pitch Monitor Application).
2.3 Data analysis
All sEMG signals were analyzed using the EMGworks Analysis software (Delsys, Natick). To minimize the effects of the noise signals, a band-pass Butterworth filter between 20 and 500 Hz was used to filter all sEMG signals. The middle 2 s of each phonation (audio and sEMG signal) was extracted. A randomly selected 30% of the audio signals were perceptually evaluated by a certificated speech-language pathologist (SLP) to assess the stability of voice production (202 of the 209 tokens were evaluated as stable phonation), and all the extracted sEMG signals were analyzed for the root-mean-square (RMS). The RMS has been considered a reliable estimator that reflects the overall amplitude of sEMG, which would increase generally in relation to stronger muscle activation and force (De Luca, 1997; Stepp, 2012). The RMS value of each sEMG signal was normalized by the MVC of the corresponding sensor to minimize the effects of differences in submental fat among individuals (Netto and Burnett, 2006), resulting in an RMS %MVC value for each sEMG signal as the main outcome measure of this study.
To investigate the influencing factors on the RMS %MVC values of different muscles, the linear mixed-effects models were applied for calculating both fixed effects and random effects (Baayen et al., 2008) respectively in pitch manipulation and loudness manipulation tasks. In each condition, the gender (female, male), symmetry (left muscles, right muscles), vowel (/a/, /i/, and /u/), muscle (suprahyoid, SCM), and task (for pitch manipulation: baseline, +3, +6, −3, −6 semitones; for loudness manipulation: baseline, +5, +10, −5 dB) were set as fixed effects. Different participants were considered as the random effect. Analyses of the linear mixed-effects models were performed in R using the lme4 package (Bates et al., 2015). The post hoc comparisons were performed in R with the lsmeans package (Lenth, 2016). Different models were compared using likelihood ratio tests and the maximum likelihood identification predicted by the lowest Akaike Information Criterion (AIC, see Akaike, 1973). The goodness-of-fit of a certain model was represented by the R-square value (Johnson, 2014).
3 Results
There were 10 participants (5 males, 5 females) who failed to pass the screening criteria (VHI >11: six subjects; average auditory-perceptual rating score >2: three subjects; both criteria: one subject). As a result, 29 participants (16 females, 13 males) aged between 20 and 53 years (M = 28.2 years, SD = 7.7 years) were finally selected as qualified participants for this study. Table 1 listed the demographic and voice-related characteristics of the 29 participants. No significant gender difference was found for the VHI-10 (Mann-Whitney U = 0.728, p = 0.475) or the average auditory-perceptual rating score (Mann-Whitney U = 0.229, p = 0.846).
3.1 Muscle activities in pitch control
The mean and standard deviation of the RMS %MVC of sEMG signals at different pitch levels were listed in Table 2. Compared with the null model with only random intercepts of the subject, an addition of the by-subject random slopes for different muscles could significantly improve the model (χ2 (2) = 1148.5, p < 0.001, AIC = 12,110). Random intercepts of the subject accounted for different baselines of the RMS %MVC among individuals. The by-subject random slopes for different muscles indicated that the effects of the fixed explanatory variables on the RMS %MVC values of the suprahyoid and SCM muscles were inconsistent among different participants.
The best-fitted model (AIC = 11,654.9, R2 = 0.75) contained muscle (suprahyoid, SCM), symmetry (left muscles, right muscles), vowel (/a/, /i/, and /u/), and task (baseline, +3, +6, −3, −6 semitones) as fixed factors, as well as the interactions of muscle*symmetry, muscle*vowel, and muscle*task. Given the interactions in the best-fitted model, post hoc comparisons of the fixed factors were performed to compare the contrasts in different conditions (see Table 3; Figure 3). A significant bilateral asymmetry in muscle activation at different pitch levels was found in both the suprahyoid and SCM muscles, with the right side demonstrating a higher RMS %MVC than the left side (β = 15.50, p < 0.001). The vowel /i/ had a significantly higher RMS for the suprahyoid muscles than the vowel /a/ (β = 4.81, p < 0.001) and /u/ (β = 5.67, p < 0.001), while /a/ and /u/ showed no difference in the use of suprahyoid muscles. Compared with the normal pitch baseline, the higher pitch changes, regardless of an increase by either 3 or 6 semitones, did not result in a significant change of the RMS %MVC for the suprahyoid muscles. Instead, the suprahyoid muscles demonstrated a significant decrease in RMS %MVC when lowering the pitch by 3 semitones (β = 5.76, p < 0.001) and 6 semitones (β = 7.20, p < 0.001; no significant difference between −3 semitones and −6 semitones). The SCM muscles, however, were not sensitive to different vowels and did not show any significant difference in RMS %MVC at different pitch levels.
TABLE 3. Post-hoc comparisons of the fixed factors in the fitted Linear mixed-effects model (in pitch control).
FIGURE 3. RMS %MVC value of the sEMG signals collected from the bilateral suprahyoid and SCM muscles during phonations at different pitch levels. SCM, sternocleidomastoid, st, semitones, “***”, post hoc contrast with p-values less than 0.001.
3.2 Muscle activities in loudness control
Table 4 demonstrated the mean and standard deviation of the RMS %MVC of sEMG signals at different loudness levels. Results of the Linear mixed-effects models also supported that adding the by-subject random slopes for different muscles would improve the null model with only random intercepts of the subject (χ2 (2) = 679.75, p < 0.001, AIC = 10,059).
The fixed factors in the best-fitted model (AIC = 9694.5, R2 = 0.67) also included muscle (suprahyoid, SCM), symmetry (left muscles, right muscles), vowel (/a/, /i/, and /u/), task (baseline, +5, +10, −5 dB), along with the interactions of muscle*symmetry, muscle*vowel, and muscle*task. The post hoc comparisons of the fixed factors were shown in Table 5; Figure 4. A significantly higher RMS %MVC for the right suprahyoid muscles (β = 16.33, p < 0.001) was also found during loudness manipulation, but the SCM muscles showed a bilaterally symmetric performance. The vowel /i/ had the highest RMS %MVC than /a/ (β = 6.35, p < 0.001) and /u/ (β = 9.26, p < 0.001) for the suprahyoid muscles, and /u/ elicited an even lower RMS %MVC amplitude than the vowel /a/ (β = 2.91, p = 0.0447). The suprahyoid muscles had a significantly declined RMS %MVC amplitude during a decreased (−5 dB) loudness manipulation (β = 9.44, p < 0.001), but no significant change was found during the increase of loudness. Consistent with the cases during pitch manipulations, the SCM muscle did not present a significant variation among different vowels or different loudness levels.
TABLE 5. Post-hoc comparisons of the fixed factors in the fitted Linear mixed-effects model (in loudness control).
FIGURE 4. RMS %MVC value of the sEMG signals collected from the bilateral suprahyoid and SCM muscles during phonations at different loudness levels. SCM, sternocleidomastoid, “***”, post hoc contrast with p-values less than 0.001.
4 Discussion
This study sought to explore the contributions of the suprahyoid and SCM muscles to pitch and loudness control during phonation. The findings showed that when compared with the normal baseline, the suprahyoid muscles had significantly lower RMS %MVC amplitude during the production of lower pitches (−3 and −6 semitones) and loudness (−5 dB), which suggested relatively lower activations of the suprahyoid muscles in response to the lowering of pitch and loudness. However, no significant change was found with reference to the normal baseline in the activities of suprahyoid muscles when phonating with higher pitch (+3 and +6 semitones) and loudness (+5 and +10 dB). The SCM muscles were not contributing to pitch and loudness control during phonation.
It should be noted that the suprahyoid muscles did not show significantly different activity at the higher pitch levels (+3 and +6 semitones) when compared to the baseline, which seems to be inconsistent with what has been known that the suprahyoid muscles could serve as the elevator to help lift the larynx and result in a rise in pitch (e.g., Honda et al., 1999). As mentioned earlier, the 15 studies reviewed by Vilkman et al. (1996) also showed varied activation patterns of both the suprahyoid and infrahyoid muscles in response to pitch manipulations. The findings from these studies and that of the present study undoubtedly support the hypothesis of redundancy in the control of speech production, in which the human muscle system allows a high degree of freedom or flexibility in the muscle activation patterns which ultimately achieves the same target by using different muscles or different degrees of activation (Sorokin, 2010). In particular, the findings from the present study supported the hypothesis that the activation of the suprahyoid muscles is a sufficient but unnecessary condition for pitch rising, i.e., the pitch might rise following the activation of the suprahyoid muscles, but a higher pitch (within 6 semitones above the baseline, according to the present findings of this study) does not necessarily need more activation of the suprahyoid muscles. On the other hand, a lower activation of the suprahyoid muscles at the lower pitch levels (within 6 semitones below the baseline) was significant, which was consistent with the findings by Andersen and Sonninen (1960) that the mylohyoid muscles (suprahyoid muscle) showed less activation when singing in low pitch. Our findings suggest that the control of lower pitch does not necessarily rely on the suprahyoid muscles.
The suprahyoid muscles played a similar role in loudness control. Hirano et al. (1969) pointed out that changes in loudness were assisted by the tension and compression of the vocal cords, and the airflow through the glottis with different subglottic pressure. These locations are further away from the suprahyoid muscles than the infrahyoid muscles which have been discovered to activate during a stronger loudness (Hirano et al., 1967). It is therefore reasonable to see that the suprahyoid muscles did not show a different activation at the increased loudness levels. The suprahyoid muscles were even less activated at a weaker loudness level, which might indicate more uses of other muscles. This needs to be further investigated in future studies.
The present findings also revealed different activity levels of the suprahyoid muscles for different vowels. The largest activity level of the suprahyoid muscles was observed during the phonation of the vowel /i/ at different pitch and loudness levels. This was expected because the production of the vowel /i/ requires a forward pull of the tongue while drawing the hyoid bone upwards and forwards, which no doubt requires the assistance of the suprahyoid muscles such as the genioglossus and hyoglossus muscles, leading to the activity level of the suprahyoid muscles. It is therefore necessary to take caution when comparing the sEMG signals under different vowel phonation tasks.
This study found the SCM muscles insensitive to different vowels at all pitch and loudness levels. Although there were studies claiming that the SCM muscles might contribute to human speech and voice (Pettersen et al., 2005; Yu et al., 2016), some confounding factors in these previous studies might have affected the activity of the SCM muscles. Pettersen et al. (2005) focused on the SCM activation during respiratory phasing, while Yu et al. (2016) used natural Cantonese materials which could be affected by the laryngeal manipulations in meaningful speech. Given the fact that the SCM muscles are in the peri-laryngeal area but neither of their two joints is connected to a certain laryngeal structure, there is little justification to hypothesize that the SCM muscles would play an important role in pitch and loudness control. Although the SCM muscles showed relatively low level of activities as the tasks required non-extraneous respiratory effort, the influence of the adjacent scalene muscle, especially during the respiratory tasks (Chiti et al., 2008), could not be completely excluded. Hence, future studies should take cautions in conducting sEMG measurements of the SCM muscles using tasks that require respiratory effort. Considering the adequate size and location of the SCM muscles, we would like to propose that the SCM muscles could be used as a reference site for calibration or normalization of the sEMG signals collected from the anterior neck area during phonation tasks without excessive respiratory effort.
An interesting finding of the present study was the asymmetry of the bilateral sEMG signals for the suprahyoid muscles and the SCM muscles. Significantly higher RMS %MVC values were found for the suprahyoid (during pitch and loudness manipulations) and SCM (during pitch manipulations) muscles. It is intuitive to expect that the vocally healthy population would have a bilaterally symmetrical activity of the peri-laryngeal muscles. Indeed, some studies have reported a symmetrical activity of the bilateral suprahyoid and SCM muscles (Van Houtte et al., 2013; Lu et al., 2021). However, Hocevar-Boltezar et al. (1998) reported that three of the five subjects in their study showed different left and right EMG levels. Another interesting finding about the symmetry of muscle activity came from the study by Van Houtte et al. (2013). Although they did not find a bilateral difference in the RMS %MVC of the suprahyoid, infrahyoid, and SCM muscles in either the control group or patients with muscle tension dysphonia (MTD), a consistent right-dominant tendency existed in the increase of muscle activity during different phonation tasks. Compared to the reference of the resting condition, the right suprahyoid and SCM muscles had consistently higher activities than their left counterpart. Such right-dominant difference, surprisingly, was not consistently present in all the subjects in the MTD group. As the muscle asymmetry was not uncommon in populations with normal muscle function (e.g., the paraspinal muscle, see Niemeläinen et al., 2011), and all the participants in the present study were right-handed, this study would argue a possibility of the asymmetry of the bilateral suprahyoid and SCM muscles in the vocally-healthy group. It should be noted that personal variations were found in sEMG activity of the bilateral suprahyoid and SCM muscles (see Figures 3, 4). Furthermore, the sEMG amplitude does not typically show a linear relationship with the muscle activation and force (Disselhorst-Klug et al., 2009). Therefore, the identification of bilateral asymmetry of the muscle activities cannot be determined merely by the average RMS %MVC value presented in Tables 2, 4. Further determination of bilateral asymmetry would benefit from more advanced sEMG technology such as high-density sEMG (Zhu et al., 2019), and more studies are in need to explore the asymmetry of bilateral perilaryngeal muscles in the normal and voice-disordered groups.
5 Conclusion
The suprahyoid muscles demonstrated significantly fewer activities at the lower pitch (−3 and −6 semitones) and loudness (−5 dB) levels. The vowel /i/ was associated with relatively higher activities of the suprahyoid muscles than /a/ and /u/. The SCM muscles showed relatively little changes in producing different vowels and did not play a significant role in pitch and loudness control. Therefore, it is proposed that the SCM muscle could have the potential of being used as a reference for calibration and normalization of sEMG signals for the peri-laryngeal muscle involving phonatory tasks. A right-dominant bilateral asymmetry was found in the suprahyoid and SCM muscles, which would require further corroboration with more evidence.
Data availability statement
The original contributions presented in the study are included in the article/Supplementary Material, further inquiries can be directed to the corresponding author.
Ethics statement
The studies involving human participants were reviewed and approved by the Human Research Ethics Committee (HREC), The University of Hong Kong (EA2001023(A)). The patients/participants provided their written informed consent to participate in this study.
Author contributions
FW: conceptualization, methodology, investigation, formal analysis, writing- original draft preparation, review and editing. EM: supervision, conceptualization, methodology, writing—review and editing.
Acknowledgments
Publication made possible in part by support from the HKU Libraries Open Access Author Fund sponsored by the HKU Libraries. The authors would like to thank Yu-Fu Chien and Liang Ma of Fudan University for their statistical advice. The assistance of Miss Junyan Wang in drawing the figure of human laryngeal muscles is also acknowledged with appreciation.
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher’s note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Supplementary material
The Supplementary Material for this article can be found online at: https://www.frontiersin.org/articles/10.3389/fphys.2023.1147795/full#supplementary-material
References
Akaike, H. (1973). Maximum likelihood identification of Gaussian autoregressive moving average models. Biometrika 60, 255–265. doi:10.1093/biomet/60.2.255
Andersen, K. F., and Sonninen, A. (1960). The function of the extrinsic laryngeal muscles at different pitch: An electromyographic and roentgenologic investigation. Acta Otolaryngol. 51, 89–93. doi:10.3109/00016486009124468
Arffa, R. E., Krishna, P., Gartner-Schmidt, J., and Rosen, C. A. (2012). Normative values for the voice Handicap index-10. J. Voice 26, 462–465. doi:10.1016/j.jvoice.2011.04.006
Arifin, F., Sardjono, T. A., and Purnomo, M. H. (2014). The relationship between electromyography signal of neck muscle and human voice signal for controlling loudness of electrolarynx. Biomed. Eng. - Appl. Basis Commun. 26, 1450054–1450057. doi:10.4015/S1016237214500549
Baayen, R. H., Davidson, D. J., and Bates, D. M. (2008). Mixed-effects modeling with crossed random effects for subjects and items. J. Mem. Lang. 59, 390–412. doi:10.1016/j.jml.2007.12.005
Barbero, M., Merletti, R., and Rainoldi, A. (2012). in Milano. Atlas of Muscle Innervation Zones. Milano: Springer. doi:10.1007/978-88-470-2463-2
Bates, D., Mächler, M., Bolker, B. M., and Walker, S. C. (2015). Finding patients before they crash: The next major opportunity to improve patient safety. J. Stat. Softw. 67, 1–3. doi:10.1136/bmjqs-2014-003499
Chiti, L., Biondi, G., Morelot-Panzini, C., Raux, M., Similowski, T., and Hug, F. (2008). Scalene muscle activity during progressive inspiratory loading under pressure support ventilation in normal humans. Respir. Physiol. Neurobiol. 164, 441–448. doi:10.1016/j.resp.2008.09.010
De Luca, C. J. (1997). The use of surface electromyography in biomechanics. J. Appl. Biomech. 13, 135–163. doi:10.1123/jab.13.2.135
Desjardins, M., Apfelbach, C., Rubino, M., and Verdolini Abbott, K. (2022). Integrative review and framework of suggested mechanisms in primary muscle tension dysphonia. J. Speech, Lang. Hear. Res. 65, 1867–1893. doi:10.1044/2022_JSLHR-21-00575
Dietrich, M., and Verdolini Abbott, K. (2012). Vocal function in introverts and extraverts during a psychological stress reactivity protocol. J. Speech, Lang. Hear. Res. 55, 973–987. doi:10.1044/1092-4388(2011/10-0344
Disselhorst-Klug, C., Schmitz-Rode, T., and Rau, G. (2009). Surface electromyography and muscle force: Limits in sEMG-force relationship and new approaches for applications. Clin. Biomech. 24, 225–235. doi:10.1016/j.clinbiomech.2008.08.003
Faaborg-Andersen, K., and Vennard, W. (1964). Electromyography of extrinsic laryngeal muscles during phonation of different vowels. Ann. Otol. Rhinol. Laryngol. 73, 248–254. doi:10.1177/000348946407300127
Fujiki, R. B., and Sivasankar, M. P. (2017). A review of vocal loading tasks in the voice literature. J. Voice 31, 388.e33–388. doi:10.1016/j.jvoice.2016.09.019
Hirano, M., Koike, Y., and Leden, H. (1967). The sternohyoid muscle during phonation: Electromyographic studies. Acta Otolaryngol. 64, 500–507. doi:10.3109/00016486709139135
Hirano, M., Ohala, J., and Vennard, W. (1969). The function of laryngeal muscles in regulating fundamental frequency and intensity of phonation. J. Speech Hear. Res. 12, 616–628. doi:10.1044/jshr.1203.616
Hocevar-Boltezar, I., Janko, M., and Zargi, M. (1998). Role of surface EMG in diagnostics and treatment of muscle tension dysphonia. Acta Otolaryngol. 118, 739–743. doi:10.1080/00016489850183287
Honda, K., Hirai, H., Masaki, S., and Shimada, Y. (1999). Role of vertical larynx movement and cervical lordosis in F0 control. Lang. Speech 42, 401–411. doi:10.1177/00238309990420040301
Johnson, P. C. D. (2014). Extension of Nakagawa and Schielzeth’s R 2 GLMM to random slopes models. Methods Ecol. Evol. 5, 944–946. doi:10.1111/2041-210X.12225
Khoddami, S. M., Nakhostin Ansari, N., Izadi, F., and Talebian Moghadam, S. (2013). The assessment methods of laryngeal muscle activity in muscle tension dysphonia: A review. Sci. World J. 2013, 507397–507406. doi:10.1155/2013/507397
Lam, P. K. Y., Chan, K. M., Ho, W., Kwong, E., Yiu, E. M., and Wei, W. I. (2006). Cross-cultural adaptation and validation of the Chinese voice Handicap index-10. Laryngoscope 116, 1192–1198. doi:10.1097/01.mlg.0000224539.41003.93
Lee, Y. W., Kim, G. H., Bae, I. H., Park, H. J., Wang, S. G., and Kwon, S. B. (2018). The cut-off analysis using visual analogue scale and cepstral assessments on severity of voice disorder. Logop. Phoniatr. Vocology 43, 175–180. doi:10.1080/14015439.2018.1461925
Lenth, R. V. (2016). Least-squares means: The R package lsmeans. J. Stat. Softw. 69. doi:10.18637/jss.v069.i01
Li, Y., Zhou, X., Li, Z., Li, J., Chen, S., Guo, C., et al. (2015). Linear modeling of metrical poetry respiratory signal. Proc. First Int. Conf. Inf. Sci. Mach. Mater. Energy 31, 1728–1740. doi:10.2991/icismme-15.2015.356
Lu, J., Fang, Q., Cheng, L., and Xu, W. (2021). Exploring the characteristics of functional dysphonia by multimodal methods. J. Voice. 37, 291.e1–291291.e9. doi:10.1016/j.jvoice.2020.12.020
McKenna, V. S., Diaz-Cadiz, M. E., Shembel, A. C., Enos, N. M., and Stepp, C. E. (2019). The relationship between physiological mechanisms and the self-perception of vocal effort. J. Speech, Lang. Hear. Res. 62, 815–834. doi:10.1044/2018_JSLHR-S-18-0205
Netto, K. J., and Burnett, A. F. (2006). Reliability of normalisation methods for EMG analysis of neck muscles. Work 26, 123–130.
Niemeläinen, R., Briand, M.-M., and Battié, M. C. (2011). Substantial asymmetry in paraspinal muscle cross-sectional area in healthy adults questions its value as a marker of low back pain and pathology. Spine (Phila. pa. 36, 2152–2157. doi:10.1097/BRS.0b013e318204b05a
Nikolic, J. (2020). Youlean online loudness meter. Available at: https://youlean.co/online-loudness-meter/.
Pettersen, V., Bjørkøy, K., Torp, H., and Westgaard, R. H. (2005). Neck and shoulder muscle activity and thorax movement in singing and speaking tasks with variation in vocal loudness and pitch. J. Voice 19, 623–634. doi:10.1016/j.jvoice.2004.08.007
Redenbaugh, M. A., and Reich, A. R. (1989). Surface EMG and related measures in normal and vocally hyperfunctional speakers. J. Speech Hear. Disord. 54, 68–73. doi:10.1044/jshd.5401.68
Simberg, S., Laine, A., Sala, E., and Rönnemaa, A.-M. (2000). Prevalence of voice disorders among future teachers. J. Voice 14, 231–235. doi:10.1016/S0892-1997(00)80030-2
Sorokin, V. N. (2010). Redundancy of the control of speech production. J. Commun. Technol. Electron. 55, 1442–1455. doi:10.1134/S106422691012017X
Stepp, C. E., Heaton, J. T., Stadelman-cohen, T. K., Braden, M. N., Jetté, M. E., and Hillman, R. E. (2011). Characteristics of phonatory function in singers and non-singers with vocal fold nodules. J. Voice 25, 714–724. doi:10.1016/j.jvoice.2010.06.003
Stepp, C. E. (2012). Surface electromyography for speech and swallowing systems: Measurement, analysis, and interpretation. J. Speech, Lang. Hear. Res. 55, 1232–1246. doi:10.1044/1092-4388(2011/11-0214
Tadao, Y. (2019). Vocal pitch monitor. Available at: https://play.google.com/store/apps/details?id=com.tadaoyamaoka.vocalpitchmonitor.
Van Houtte, E., Claeys, S., D’haeseleer, E., Wuyts, F., and Van Lierde, K. (2013). An examination of surface EMG for the assessment of muscle tension dysphonia. J. Voice 27, 177–186. doi:10.1016/j.jvoice.2011.06.006
Van Houtte, E., Van Lierde, K., and Claeys, S. (2011). Pathophysiology and treatment of muscle tension dysphonia: A review of the current knowledge. J. Voice 25, 202–207. doi:10.1016/j.jvoice.2009.10.009
Vilkman, E., Sonninen, A., Hurme, P., and Körkkö, P. (1996). External laryngeal frame function in voice production revisited: A review. J. Voice 10, 78–92. doi:10.1016/S0892-1997(96)80021-X
Wang, F., and Yiu, E. M. (2021). Is surface electromyography (sEMG) a useful tool in identifying muscle tension dysphonia? An integrative review of the current evidence. J. Voice. 10. doi:10.1016/j.jvoice.2021.10.006
Yiu, E. M.-L., Lau, G. W. H., and Wang, F. (2023). Fatigue-related change in surface electromyographic activities of the perilaryngeal muscles. J. Speech, Lang. Hear. Res. 66, 98–109. doi:10.1044/2022_JSLHR-22-00283
Yu, S., Lee, T., and Ng, M. L. (2016). Surface electromyographic activity of extrinsic laryngeal muscles in Cantonese tone production. J. Signal Process. Syst. 82, 287–294. doi:10.1007/s11265-015-1022-4
Zhu, M., Lu, L., Yang, Z., Wang, X., Liu, Z., Wei, W., et al. (2019). “Contraction patterns of neck muscles during phonating by high-density surface electromyography,” in Proceedings of the 2018 IEEE International Conference on Cyborg and Bionic Systems(CBS), Shenzhen, China, June 2019, 572–575. doi:10.1109/CBS.2018.8612181
Keywords: surface electromyography, laryngeal muscles, speech, pitch, loudness
Citation: Wang F and Yiu EM-L (2023) Surface electromyographic (sEMG) activity of the suprahyoid and sternocleidomastoid muscles in pitch and loudness control. Front. Physiol. 14:1147795. doi: 10.3389/fphys.2023.1147795
Received: 19 January 2023; Accepted: 20 April 2023;
Published: 04 May 2023.
Edited by:
Valentina Di Felice, University of Palermo, ItalyReviewed by:
Julien Frere, UMR5216 Grenoble Images Parole Signal Automatique (GIPSA-lab), FranceMarie-Cécile Niérat, INSERM U1158 Neurophysiologie Respiratoire Expérimentale et Clinique, France
Copyright © 2023 Wang and Yiu. This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
*Correspondence: Feifan Wang, ZmVpZmFuLndhbmdAaGt1Lmhr