The Influence of the Pinyin and Zhuyin Writing Systems on the Acquisition of Mandarin Word Forms by Native English Speakers

Hayes-Harb, Rachel; Cheng, Hui-Wen

doi:10.3389/fpsyg.2016.00785

ORIGINAL RESEARCH article

Front. Psychol. , 03 June 2016

Sec. Psychology of Language

Volume 7 - 2016 | https://doi.org/10.3389/fpsyg.2016.00785

This article is part of the Research Topic Phonology in the bilingual and bidialectal lexicon View all 13 articles

The Influence of the Pinyin and Zhuyin Writing Systems on the Acquisition of Mandarin Word Forms by Native English Speakers

$\r\nRachel Hayes-Harb*$ Rachel Hayes-Harb^*

Hui-Wen Cheng

Department of Linguistics, University of Utah, Salt Lake City, UT, USA

The role of written input in second language (L2) phonological and lexical acquisition has received increased attention in recent years. Here we investigated the influence of two factors that may moderate the influence of orthography on L2 word form learning: (i) whether the writing system is shared by the native language and the L2, and (ii) if the writing system is shared, whether the relevant grapheme-phoneme correspondences are also shared. The acquisition of Mandarin via the Pinyin and Zhuyin writing systems provides an ecologically valid opportunity to explore these factors. We first asked whether there is a difference in native English speakers' ability to learn Pinyin and Zhuyin grapheme-phoneme correspondences. In Experiment 1, native English speakers assigned to either Pinyin or Zhuyin groups were exposed to Mandarin words belonging to one of two conditions: in the “congruent” condition, the Pinyin forms are possible English spellings for the auditory words (e.g., < nai> for [nai]); in the “incongruent” condition, the Pinyin forms involve a familiar grapheme representing a novel phoneme (e.g., < xiu> for [ɕiou]). At test, participants were asked to indicate whether auditory and written forms matched; in the crucial trials, the written forms from training (e.g., < xiu>) were paired with possible English pronunciations of the Pinyin written forms (e.g., [ziou]). Experiment 2 was identical to Experiment 1 except that participants additionally saw pictures depicting word meanings during the exposure phase, and at test were asked to match auditory forms with the pictures. In both experiments the Zhuyin group outperformed the Pinyin group due to the Pinyin group's difficulty with “incongruent” items. A third experiment confirmed that the groups did not differ in their ability to perceptually distinguish the relevant Mandarin consonants (e.g., [ɕ]) from the foils (e.g., [z]), suggesting that the findings of Experiments 1 and 2 can be attributed to the effects of orthographic input. We thus conclude that despite the familiarity of Pinyin graphemes to native English speakers, the need to suppress native language grapheme-phoneme correspondences in favor of new ones can lead to less target-like knowledge of newly learned words' forms than does learning Zhuyin's entirely novel graphemes.

Introduction

Adult second language (L2) learners can exploit the availability of orthographic input in learning the phonological forms of L2 words (e.g., Escudero et al., 2008). However, we have also seen that there are limits to the utility of orthographic input in supporting learners' target-like acquisition of words' forms—the literature provides cases where written input either had no beneficial effect (Simon et al., 2010; Hayes-Harb and Hacking, 2015; Showalter and Hayes-Harb, 2015) or in fact interfered with the target-like acquisition of L2 word forms (e.g., Hayes-Harb et al., 2010; Young-Scholten and Langer, 2015). Two factors that have emerged as possibly associated with whether or not orthographic input supports or interferes with word form learning are (i) whether the writing system is shared by the native language and the L2, and (ii) if the writing system is shared, whether the relevant grapheme-phoneme correspondences are also shared. The case of native English speakers learning Mandarin via the Zhuyin and Pinyin writing systems provides an ecologically valid opportunity to explore the relative impact of these two factors on L2 word form learning. Pinyin uses the Roman alphabet, shared with English, while Zhuyin uses an entirely different set of graphemes. Each writing system poses its own set of challenges to native English learners: Zhuyin requires learners to acquire an entirely novel grapheme set; Pinyin, on the other hand, involves only familiar graphemes, but learners must suppress a number of English grapheme-phoneme correspondences in favor of new ones (e.g., in Mandarin, Pinyin < x>¹ maps to /ɕ/). In the present study we explored the consequences of these characteristics of Pinyin and Zhuyin for native English speakers' ability to learn the phonological forms of a set of Mandarin words, with the goal of elucidating the relative difficulty associated with each writing system.

Orthographic Input and L2 Phonological Acquisition

The role of orthographic input in L2 phonological and lexical acquisition has received increased attention in recent years (Bassetti, 2008; Bassetti et al., 2015). While a number of studies have demonstrated a facilitative effect of orthographic input for L2 learners (e.g., Escudero et al., 2008; Showalter and Hayes-Harb, 2013), others have found limited or no effect of orthographic input (e.g., Simon et al., 2010; Pytlyk, 2011; Escudero, 2015; Hayes-Harb and Hacking, 2015; Showalter and Hayes-Harb, 2015). Indeed, there are also circumstances where orthographic input can interfere with L2 phonological and lexical acquisition (Bassetti, 2006; Escudero and Wanrooij, 2010; Hayes-Harb et al., 2010; Mathieu, 2016). We begin by reviewing studies on the influence of orthographic input in L2 word form learning, followed by a discussion of the small number of studies that have considered the influence of Zhuyin and Pinyin input on L2 Mandarin phonological and lexical acquisition.

Orthographic Input and L2 Word Form Learning

In cases where orthographic input facilitates L2 word form learning, learners may benefit from familiarity of the graphemes in addition to familiarity of the grapheme-phoneme correspondences. For example, native Dutch speakers who saw written forms during an English word learning task (e.g., < tandek> and < tenzer>) were more likely to have established lexical representations that distinguish between English /æ/ and /ɛ/ (corresponding to the letters < a> and < e>) than those who did not have access to written forms (Escudero et al., 2008). In this case, the L2 English graphemes were familiar to the native Dutch learners, and additionally, while the particular grapheme-vowel mappings differ between Dutch and English, the graphemes < a> and < e> capture a phonological contrast in both languages, presumably allowing participants to infer the English phonological contrast from the differential spellings.

More recent studies, however, have provided evidence of the limitations of written input in facilitating second language word learning. For example, a number of studies have found no effect of orthographic input in some cases where the graphemes and/or grapheme-phoneme correspondences are unfamiliar (e.g., Simon et al., 2010; Showalter and Hayes-Harb, 2013). Others have even found detrimental effects when the grapheme-phoneme correspondences of the L1 and L2 differ (Young-Scholten, 2002; Hayes-Harb et al., 2010; Hayes-Harb et al., submitted), or when the orthography is entirely unfamiliar (e.g., Mathieu, 2016). For example, Hayes-Harb et al. (2010) demonstrated that using a familiar orthography with unfamiliar grapheme-phoneme correspondences can lead learners to misremember the phonological forms of newly learned words. In this study, native English speakers were taught a set of auditory English non-words along with pictured meanings, and were later tested on their ability to match auditory forms to the pictures. In the “congruent” condition, participants always saw written forms (presented immediately below the picture) that were spelled according to English grapheme-phoneme correspondences (e.g., the auditory form [faza] was accompanied by the written form < faza>). In the “incongruent” condition, participants saw some written forms did not conform to English grapheme-phoneme correspondences (e.g., the auditory form [faza] was accompanied by < fasha>). In the control condition, participants saw < xxx> instead of written forms. At test, participants in the incongruent condition were more likely than participants in the other two conditions to misremember the phonological forms of the words in ways that reflected the (incongruent) spellings (e.g., accept [fa∫a] as a possible pronunciation of the word [faza]). In this way, the incongruent spellings of the newly learned words appear to have interfered with participants' ability to correctly remember the words' phonological forms at test. Hayes-Harb et al. (submitted), following up on earlier studies such as those of Young-Scholten (2002) and Young-Scholten and Langer (2015), demonstrated that access to spelled forms in the L2 input can interfere with native English speakers' acquisition of German final obstruent devoicing. Hayes-Harb et al. (submitted) taught native English speakers German nonwords in two conditions: in one condition, participants saw spelled forms (e.g., hear [krɑt]; see < krad>); in the other condition, participants did not see spelled forms. At test, participants who had seen < krad> during the word learning phase were more likely than those in the no spelled forms condition to pronounce it as [krɑd]. They conclude that in cases where auditory forms and written forms conflict, inferences about the pronunciation of words from written input may override the auditory input. Escudero et al. (2014) provide additional evidence that “congruency” between the grapheme-phoneme correspondences of the L1 and L2 influence the effect of written input on L2 word form learning. They taught native Spanish speakers auditory Dutch nonwords and pictured meanings in two conditions (one with and one without written forms), and later tested them on their ability to distinguish between minimal pairs of test words. In this study, “congruency” was defined somewhat differently than in other studies mentioned here in that it related to whether or not a graphemic contrast signals a phonemic contrast in both the L1 and L2, not to the grapheme-phoneme correspondences themselves. Some pairs of test words were “congruent” in the sense that the corresponding orthographic forms signal a phonemic contrast in both Spanish and English (e.g., Dutch − < uu> = /I/ − /y/ and Spanish − = /i/ − /u/), while others were “incongruent” in that the orthographic forms signal a phonemic contrast in Dutch but not in Spanish (e.g., Dutch: − < uu> = /Y/ − /y/; Spanish: = /u/). The native Spanish participants performed more accurately at test on congruent than incongruent items. Escudero et al. (2014) thus found further evidence that L2 learners experience a benefit associated with congruency between the L1 and L2 writing systems when learning new words.

Why should auditory and orthographic input interact in these ways in second language word learning? The influence of orthography on spoken word recognition is well documented. For example, Ziegler and Ferrand (1998) demonstrated that native French speakers respond faster in an auditory lexical decision task to words whose rimes have a single possible spelling (e.g., < age> for for the rime /ɑʒ/) than to words whose rimes can be spelled variously (e.g., < omb> or < om> for the rime /om/). In addition, the effect of orthography on phonological processing begins in childhood along with early literacy. For example, Racine et al. (2014) found that native French readers (9–10 years old) show evidence for the influence of words' spelled forms on their auditory processing of French production variants resulting from schwa deletion, while native French pre-readers (5–6 year olds) do not.

As noted by Veivo and Jarvikivi (2013), a “consequence of many L2 learners being literate is that the teaching and the learning of L2 are often based on written language to a significant degree” (p. 866). Thus L2 learners' (alphabetic) literacy, presumably including their knowledge of specific grapheme-phoneme correspondences and/or the expectation that written input will provide phonologically relevant information about the forms of L2 words, may exert an influence beginning with their earliest exposure to L2 words. In light of the vast literature documenting learners' propensity for transferring aspects of their L1 into L2 acquisition (see, e.g., Eckman and Iverson, 2013), it may be unsurprising that learners appear to transfer their L1 grapheme-phoneme correspondences to L2 learning.

Relative to the number of studies that have considered the impact of orthographic congruency on L2 word form learning, very few have investigated the effect of unfamiliar orthographies. Hayes-Harb and Hacking (2015) investigated the influence of diacritic stress marks on Russian written words on native English speakers' ability to learn Russian lexical stress. They secondarily asked whether the effect of stress marks differed depending on whether the words were written in Cyrillic or Roman letters. They found no beneficial effect of the diacritic stress marks, and no difference in performance associated with the Cyrillic vs. Roman letter condition, suggesting at a minimum that the familiarity of the graphemes did not influence word form learning. Showalter and Hayes-Harb (2015) similarly did not find a difference in word learning performance between groups of naïve native English speakers exposed to Arabic vs. Roman written forms when learning Arabic words minimally distinguished by the difficult velar /k/—uvular /q/ contrast. While these two studies do not indicate a word learning disadvantage associated with novel orthographies vs. familiar ones, it is worth noting that the measure of learning in both of these studies involved perceptually discriminating difficult novel phonological contrasts. In the present study, we focus not on the role of orthographic input in learners' ability to differentiate words containing difficult novel contrasts, but rather on the issues of orthographic congruency and familiarity and their effects on L2 word form learning.

In summary, the growing literature on the influence of orthographic input in L2 word form learning has highlighted two factors that may be associated with whether written input supports or interferes with word form learning: (1) whether the writing system is shared by the native language and the L2, and (2) if the writing system is shared, whether the relevant grapheme-phoneme correspondences are shared by the two languages. The acquisition of L2 Mandarin provides an opportunity to explore these factors, given that the Pinyin writing system involves familiar graphemes with a number of novel grapheme-phoneme correspondences, and Zhuyin involves an entirely new set of graphemes. The following section reviews the small number of studies that have considered the influence of these two writing systems on L2 Mandarin acquisition.

Orthographic Input and the Acquisition of L2 Mandarin

Chinese characters are known for their opacity in terms of grapheme-phoneme correspondences. Indeed, the phonetic component of a Chinese character provides reliable cues to the pronunciation of the character < 30% of the time (Cheng, 2012). To facilitate the learning of Chinese characters, a phonetic script that transparently presents the phonological forms of Chinese words is usually introduced to beginning learners (including both L1 and L2 learners). Pinyin and Zhuyin are the scripts that are most commonly used for this purpose. Pinyin (formally known as Hanyu Pinyin) is a Romanization system used in China and Singapore, and has been adopted by the International Organization for Standardization for the Romanization of Chinese ISO (2015). Zhuyin (also called Zhuyin fuhao or Bopomofo) consists of components of ancient Chinese characters, and is widely used in Taiwan. Crucially for the present purposes, while both are transparent phonographic writing systems, Pinyin and Zhuyin differ from one another in the graphemes they employ. There are also organizational differences between Pinyin, which is an alphabet, and Zhuyin, which is a semi-syllabary, or a combination of an alphabet and a syllabary; (Taylor and Taylor, 2014). Here we explore the differential effects of the two writing systems on the acquisition of Mandarin word forms by native English speakers. In particular, we ask whether the orthographic differences between Pinyin and Zhuyin influence Mandarin word learning. This question is particularly intriguing in the context of adult L2 Mandarin acquisition, because these learners are equipped with the knowledge of their L1 writing system, which may interact with the characteristics of Pinyin and Zhuyin. For example, native English-speaking learners, whose L1 employs the Roman alphabet, may find Pinyin less difficult than Zhuyin initially given the familiarity of the Pinyin symbols, which form a subset of the English alphabet. However, for a subset of Pinyin graphemes, the grapheme-phoneme correspondences differ from those of English. For example, in Chinese the Pinyin grapheme < x> maps to the voiceless alveopalatal fricative /ɕ/, a phoneme that does not exist in English; the same grapheme maps to /ks/ as in “tax” or /z/ as in “xylophone” in English. Thus native English speakers learning Mandarin who are exposed to Pinyin may benefit from the familiarity of the graphemes but experience difficulty learning novel grapheme-phoneme correspondences. In other words, native English speakers may show evidence of the negative transfer of English grapheme-phoneme correspondences when learning Mandarin with Pinyin.

On the other hand, no such opportunity for negative transfer is associated with Zhuyin, whose graphemes do not overlap with English graphemes. For instance, the voiceless alveolar affricate /ts/, which is written < z> in Pinyin, is written <ㄗ> in Zhuyin. Table 1 provides example Zhuyin and Pinyin graphemes, along with their corresponding phonemes. Zhuyin, however, presents its own challenge for native English speakers—that of learning a new set of graphemes. At present we are interested in the relative difficulty associated with learning new graphemes vs. learning new grapheme-phoneme correspondences on native English speakers' ability to learn the phonological forms of new Mandarin words.

TABLE 1

Table 1. Example Pinyin and Zhuyin graphemes and their corresponding Mandarin phonemes.

A small number of studies have specifically investigated the influence of orthographic input on the acquisition of Mandarin by native English speakers (Bassetti, 2006; Pytlyk, 2011; Showalter and Hayes-Harb, 2013). Bassetti (2006) and Pytlyk (2011) specifically explored the acquisition of Mandarin by native English speakers via the written medium of Pinyin, focusing on the potential for interference due to the negative transfer of native English grapheme-phoneme correspondences. Bassetti (2006) investigated whether Pinyin spelling conventions for rimes influences native English speakers' Mandarin phonological representations, focusing on the confusion they may cause for native English speakers with respect to the number of segments contained in the rimes. In Pinyin, rimes may be spelled differently depending on their context, in particular with respect to the inclusion of a letter representing what is called the “main vowel.” For example, following a consonantal onset, the rime /uei/ is spelled < ui> (without a letter corresponding to the main vowel /e/), as in < kui>. The same rime is spelled < wei> (with the letter < e> representing the main vowel) in onsetless syllables. Bassetti (2006) asked native English speakers who were beginning learners of Mandarin to perform two phonological tasks. In the phoneme counting task, participants were asked to read (logographic) Chinese characters and to count the number of “sounds” in each. In the phoneme segmentation task, participants were asked to pronounce the characters' sounds one-by-one. Bassetti found that for syllables where the Pinyin spellings do not represent the main vowel, participants counted one fewer vowel in the rime than when the Pinyin spellings represent the main vowel. The segmentation task confirmed that the vowel omitted by learners was indeed the main vowel, or the one that is not represented in Pinyin spellings. Bassetti concluded that the native English speakers' phonological representations for Chinese syllables was affected by the Pinyin spelling conventions with respect to main vowels.

Pytlyk (2011) investigated whether exposure to Pinyin, in particular in cases where English and Mandarin have different grapheme-phoneme correspondences, negatively influences native English speakers' ability to perceive Mandarin consonants. Pytlyk predicted that while native English speakers may benefit from the positive transfer of knowledge of the Roman alphabet in learning Mandarin via Pinyin (a “shared” orthography), they may experience difficulty where the grapheme-phoneme correspondences of Pinyin and the English alphabet differ. Specifically, the prediction was that “learners who learn Mandarin via Pinyin…will tend to equate a similar Mandarin (L2) phoneme with its English counterpart because the shared orthographic symbols would make perceiving the differences between the similar sounds even more difficult” (p. 545). In contrast, it was predicted that learner groups who were exposed to Zhuyin or to no written forms at all would outperform the Pinyin learners in Mandarin consonant perception because neither of these groups would experience the orthographic interference associated with Pinyin. Native English speakers with no previous Chinese language experience participated in a language training phase followed by a perception test. During the language training phase, they were taught the Mandarin phoneme inventory via Pinyin, Zhuyin, or no written input. At test, participants performed an odditiy discrimination task, in which they heard three stimuli and were asked to determine which one differed from the other two. There were no significant differences in test performance among the participants trained via Pinyin, Zhuyin, or no written input. While Pytlyk (2011) did not find the predicted differences in perception performance, this study nonetheless highlights the utility of Mandarin and its Pinyin and Zhuyin writing systems for addressing questions concerning the role of orthographic transfer in second language phonological learning.

Research Questions

The Bassetti (2006) and Pytlyk (2011) studies investigated the influence of orthographic input on phonological representations of Mandarin syllables and on the ability of learners to perceive Mandarin phonological contrasts, respectively. In focus in the present work is the influence of orthographic input in early lexical-phonological development—specifically, the influence of Pinyin and Zhuyin on native English speakers' ability to accurately remember the phonological forms of newly learned Mandarin words. The broadest question guiding our research is thus: Is there a difference in the difficulty associated with learning the grapheme-phoneme correspondences for novel graphemes (as in Zhuyin) and learning new grapheme-phoneme correspondences for familiar graphemes (as in Pinyin)? The first research question that this study is designed to answer is whether there is a difference in native English speakers' ability to learn Pinyin vs. Zhuyin grapheme-phoneme correspondences, specifically whether native English speakers exposed to Pinyin experience particular difficulty with “incongruent” grapheme-phoneme correspondences. This question is addressed in Experiment 1. Our second research question is whether there is a difference in native English speakers' ability to learn the phonological forms of new words when exposed to Pinyin vs. Zhuyin, specifically whether native English speakers exposed to Pinyin experience particular difficulty learning the phonological forms of words with “incongruent” spellings (Experiment 2). Our final research question, addressed in Experiment 3, is whether participants exposed to Pinyin vs. Zhuyin differ in their ability to perceive Mandarin consonant contrasts.

Experiments

This study was carried out with approval from the University of Utah Institutional Review Board and with written informed consent from all participants.

Participants

Thirty monolingual native English speakers were recruited from the University of Utah community and received course credit for participating in the study. All participated in all three experiments in the same order. A background questionnaire confirmed that none of the participants had previously studied Chinese, and none reported speech, language, hearing, or neurological disorders. The participants were randomly assigned to the Pinyin group or the Zhuyin group (n = 15 each). Each group consisted of 5 males and 10 females. The mean age in the Pinyin group was 23.7 years old (SD = 4.7), and the mean age in the Zhuyin group was 25.7 years old (SD = 8.7). Participants assigned to the Pinyin group reported experience with Spanish (8 participants), Japanese (2), French (2), and one each with Arabic, Latin, Korean, German, Modern Greek, Samoan, Turkish, and Swahili; two participants reported no second language experience. Participants in the Zhuyin group reported experience with Spanish (12), French (3), and one each had experience with Russian, Armenian, ASL, German, or Italian; two reported no second language experience.

Materials

For the purposes of the study, we developed a set of 16 Mandarin syllables (“words”), along with their written forms in Pinyin and Zhuyin and randomly-assigned line-drawing visual referents (i.e., the words' “meanings”). The words belonged to two conditions: congruent and incongruent. In the congruent condition, the Pinyin forms are possible English spellings for the auditory words (e.g., < nai> for [nai]); in the incongruent condition, the Pinyin forms involve a familiar (English) grapheme representing a novel (Mandarin) consonant (e.g., the < x> in < xiu> for [ɕiou]). It is important to note that words are categorized as congruent and incongruent on the basis of their Pinyin spellings only—the novel Zhuyin graphemes are neither congruent nor incongruent from the point of view of participants. To determine the use of Pinyin graphemes in the congruent vs. incongruent word conditions, we first conducted a norming study. In this study, 10 native English speakers (who did not participate in the three experiments) were asked to use English graphemes to transcribe the initial consonants in 105 aurally-presented Mandarin CV syllables. The syllables were produced by a male Mandarin-English bilingual speaker reading from Pinyin transcriptions. Following a brief practice session using English nonwords to familiarize them with the task, the native English speakers were asked to respond to the entire block of 105 syllables, presented twice and in a different random order each time.

We calculated the percentage of participants' English letter responses that matched the Pinyin letters used to transcribe the initial consonants in Mandarin. For example, the auditory syllable /lin/, which is spelled with an initial < l> in Pinyin, was always transcribed by the native English participants with an initial < l>, and thus received a “match” score of 100%. On the other hand, the initial consonant in [tɕ^hie], transcribed as < q> in Pinyin, was transcribed by the native English participants as < ch, C, sh, t, ts>, but never as < q>, and thus received a match score of 0%. The four graphemes that received the highest match scores were selected for use in the congruent condition: < l> (100%), < m> (100%), < s> (98%), and < n> (96%). The four receiving the lowest match scores were selected for use in the incongruent condition: < c> (0%), < q> (0%), < x> (0%), and < z> (13%). [Note: Although Pinyin < zh> also had a low match score (5%), its corresponding Mandarin consonant phoneme had a similar response profiled to < q>, indicating that the Mandarin phonemes represented by < zh> and < q> are potentially confusable by native English speakers. For this reason, we excluded < zh> from the study materials].

We next created 16 Mandarin syllables using the Mandarin phonemes represented by the congruent and incongruent graphemes that were selected via the norming study. To control for lexical tone, all word stimuli were produced in Tone 4 (high-falling); in this tone, some of the words were actual words in Mandarin and others were nonwords; all are referred to here as “words” since our participants were unfamiliar with Mandarin. Due to restrictions on vowel distributions in Mandarin, words with initial < z, c, n, s> (/ts, ts^h, n, s/, respectively) contained the rimes < ai> or < ao> (/aI/ or /au/), and those with initial < q, x, l, m> (/tɕ^h, ɕ, l, m/) contained the rimes < ie> or < iu> (/iε/ or /iou/). Each of the eight initial consonants was combined with its two corresponding rimes to create the 16 Mandarin words; a full list of the words is provided in Table A1 in Appendix. These words served as the stimuli presented in the exposure, criterion, and test phases of the three experiments described below.

In addition, we created a set of 16 foil words for use in the test phases. For the incongruent condition, we chose the phonemes that the incongruent Pinyin graphemes usually represent in English to serve as foils. The foil phoneme for < z> and < x> is thus /z/, and the foil phoneme for < c> and < q> is thus /k/ (Note: as there is no /z/ phoneme in Mandarin, some of the words used in the study are in fact impossible in Mandarin; as a whole, the stimulus set is thus quasi-Mandarin). The foil phonemes for the congruent graphemes were selected randomly: /d/ for < n> and < l>, and /ɵ/ for < s> and < m>. Table 2 summarizes the construction of the Mandarin words and their foils. Words are categorized as congruent and incongruent on the basis of their Pinyin spellings only, given that the native English speakers who participated in the present experiments do not have existing grapheme-phoneme correspondences for the (unfamiliar) Zhuyin graphemes.

TABLE 2

Table 2. The Pinyin and Zhuyin graphemes used in the study, along with the foil phonemes assigned to each grapheme and the vowels added to create the Mandarin word stimuli.

Each of the 16 words was randomly assigned a “meaning” from among a set of nonobject line drawings; the word-meaning pairings were the same for all participants. The words were produced by a male Mandarin-English bilingual speaker reading from Pinyin transcriptions.

Experiment 1 (Grapheme-Phoneme Correspondence Learning) Procedures

In Experiment 1, we exposed participants to the set of auditory Mandarin words and their written forms, and later tested them on their ability to accurately determine whether the auditory and written forms were correctly matched. The experiment involved three phases: exposure, criterion, and test. All three experiments were conducted in a sound-attenuated booth; the entire session lasted ~ 1 h, with brief participant-controlled breaks between experiments.

Exposure Phase

Participants were asked to learn the 16 words. in each exposure trial, a written form was presented on the computer screen while the auditory word was played over headphones at a comfortable listening level. The written form remained on the screen for 2 s, followed by the next trial. The 16 words constituted one block, and there were four blocks in the exposure phase. see Table 3 for example exposure phase trials.

TABLE 3

Table 3. Experiment 1 example stimuli, by phase.

Criterion Phase

The criterion phase consisted of 16 matched and 16 mismatched trials. Participants were asked to indicate whether a written word matched the auditory word by pressing “yes” or “no” buttons on the keyboard. Table 3 illustrates example criterion phase trials. Congruent-matched and congruent-mismatched trials were expected to be easy for participants (e.g., see pinyin/zhuyin written form for [nai] and hear [nai] (matched) or [ts^hai] (mismatched)). In incongruent-matched trials, participants saw a written form and heard its corresponding auditory word (e.g., see the pinyin/zhuyin written form for the word [ɕiou] and hear [ɕiou]). In the incongruent-mismatched trials, as in the congruent-mismatched trials, participants saw a written form and heard a word beginning with an entirely different consonant that was also not the foil (e.g., see the pinyin/zhuyin written form for the word [ɕiou] but hear [miou]). In this way, no criterion phase trials were designed to be difficult for participants in either exposure condition; rather, the criterion phase was used to ensure that all participants achieved a similar level of ability to distinguish learned forms from quite different foils before continuing to the test phase. Participants repeated the exposure and criterion phases until they reached 90% accuracy on the criterion test.

Test Phase

The test phase was identical to the criterion phase except that the test phase was designed to determine whether participants experienced confusion due to differences between Pinyin and English grapheme-phoneme correspondences. Congruent-matched and congruent-mismatched trials were again expected to be easy for participants (e.g., see Pinyin/Zhuyin written form for [nai] and hear [nai] (matched) or [dai] (mismatched)), as were the incongruent-matched trials (e.g., see the Pinyin/Zhuyin written form for the word [ɕiou] and hear [ɕiou]). However, the incongruent-mismatched trials were designed to be difficult for participants in the Pinyin condition if they experienced interference from English grapheme-phoneme correspondences. In these trials, participants saw a written form and heard a word beginning with a consonant reflecting English grapheme-phoneme correspondences (e.g., see the Pinyin/Zhuyin written form for the word [ɕiou], which is spelled < xiu> in Pinyin, but hear [ziou], a possible English pronunciation of the Pinyin written form). See Table 3 for an illustration of test phase trials.

Results

The first analysis concerns the number of exposure-criterion phase cycles that participants required to reach the criterion necessary to continue to the final test. Participants in the Pinyin group (mean = 1.6 cycles; SD = 0.632) required significantly fewer cycles than did participants in the Zhuyin group [mean = 3.47; SD = 1.807; F_{(1, 28)} = 14.255, p = 0.001, partial η² = 0.337].

We converted the final test phase accuracy data (see Table 4) to d-primes using Signal Detection Theory (see Figure 1; for more information about d-prime, please see MacMillan and Creelman, 2004). The d-primes were submitted to ANOVA with exposure condition (two levels: Pinyin, Zhuyin) as a between-subjects variable and item condition (congruent, incongruent) as a within-subjects variable. There was a main effect of exposure group, with participants in the Zhuyin group performing more accurately than participants in the Pinyin group overall [F_{(1, 28)} = 4.275, p = 0.048, partial η² = 0.132], a main effect of item type, with higher d-primes on congruent than incongruent items [F_{(1, 28)} = 32.027, p < 0.0005, partial η² = 0.534], and an interaction of the two [F_{(1, 28)} = 5.991, p = 0.021, partial η² = 0.176]. Following up on the interaction, we looked at the effect of exposure condition in the two item conditions separately. On congruent items, there was no effect of exposure condition [F_{(1, 28)} = 0.284, p = 0.598, partial η² = 0.010]. However, on incongruent items, the effect of exposure condition was significant [F_{(1, 28)} = 6.277, p = 0.018, partial η² = 0.183], with participants in the Zhuyin condition outperforming those in the Pinyin condition².

TABLE 4

Table 4. Experiment 1 test accuracy (proportion correct responses; 95% confidence intervals in parentheses), by exposure condition and item condition.

FIGURE 1

Figure 1. Experiment 1 mean test d-primes (whiskers represent 95% confidence intervals), by exposure condition and item condition.

Experiment 2 (Word Learning) Procedures

Our second research question concerned whether there is a difference in native English speakers' ability to learn the phonological forms of new words when exposed to Pinyin vs. Zhuyin. Experiment 2 was identical to Experiment 1 except that participants additionally saw line drawings depicting word meanings during the exposure phase, and at test were asked to match auditory forms with the line drawings.