Yao, He, Chen, Zhu (2024) A Meta-Analysis of Second Language Phonetic Training

brannenkathleen
Jul 3, 2025
5 min read

Updated: Jan 24

A synopsis of the article:

Phonetic training

This meta-analysis seeks to answer these questions:

What is the effect of L2 phonetic training?
What factors moderate the effectiveness of L2 phonetic training? (education, proficiency, training approach, training stimuli, mode of delivery, phonetic subcompetence).

The meta-analysis looked at studies that met the following criteria. The study had to:

be an empirical phonetic training study;
look at L2 segments, not suprasegmentals;
report on accuracy and/or response time of participants' perception and/or production;
look at second or foreign language phonetic training;
have participants without speech, language, or hearing impairments.

Based on these criteria, the meta-analysis included 65 studies.

Data analyses were performed using Comprehensive Meta-analysis Version 3.0 (Biostat Inc.) Cohen's d was used to compare effect sizes treatment groups with control groups. To assess publication bias, Fail-Safe N and Trim and Fill analyses were conducted.

Results

The effect sizes between groups with phonetic training versus groups without was large (Cohen's d = 0.762), indicating that there is a larger difference between the groups relative to the variability in the data.

Moderator analyses were conducted to explore potential factors influencing the effects of L2 phonetic training. Results found highly significant differences (p < .001) in educational levels (university, language institute, high school, pre-middle school, unspecified), training approach (perceptual, production, combined), mode of delivery (auditory, visual, audiovisual), outcome measure ( identification, discrimination, both identification and discrimination, subjective perception judgement, objective acoustic measurement), and phonetic sub-competence (perception, production). Significant differences (p < .05) were found for training stimuli (natural, synthetic, combined). Language proficiency (advanced, intermediate, novice, unspecified) was not significant.

Educational levels: The largest effects were observed at the high school level.

Training approach: Perceptual training yielded the largest effect size.

Mode of delivery: The audiovisual mode yielded the largest effect size.

Outcome measure: Identification tasks generated the largest effect size.

Phonetic sub-competence: Perception yielded the largest effect size.

Training stimuli: Synthetic stimuli yielded the largest effect size.

Outcome measure: Identification tasks yielded the largest effect.

Phonetic Subcompetence: Perception yielded larger gains than production.

Generalizaton of Phonetic Training: Production training yielded a larger effect size than perceptual training.

Discussion

Phonetic training is helpful.
High school L2 learners performed better than university L2 learners. The authors suggest this may be to the younger brain being more plastic. However, pre-middle school learners performed worse than the high school learners. The authors suggest this may be due to task difficulties related to immature phonemic awareness.
Perceptual training transferred to production more than production training transferred to perception. This supports the hypothesis that perception is a precursor to production.
A combination of discrimination and identification tasks is more effective than each on its own.

Questions raised by this article

What type of phonetic training was used in the studies used in this meta-analysis?

Some examples of phonetic training in studies examined in this meta-analysis:

*Bradlow, A. R., Akahane-Yamada, R., Pisoni, D. B., & Tohkura, Y. (1999). Training Japanese listeners to identify English /r/ and /l/: Long-term retention of learning in perception and production. Perception and Psychophysics, 61, 977–985. https://doi.org/10.3758/BF03206911

minimal-pair identification task
- to encourage classification into broad phonetic categories rather than emphasizing the discrimination of fine-grained within-category acoustic differences
stimuli were naturally produced in a variety of phonetic environments
- previous studies with synthetic stimuli not very successful in training /r/-/l/ distinction
stimuli were produced by multiple talkers (high-variability) of American English

*Carlet, A., & Cebrian, J. (2022). The roles of task, segment type, and attention in L2 perceptual training. Applied Psycholinguistics, 43(2), 271–299. https://doi.org/10.1017/S0142716421000515

Participants: Spanish & Catalan learners of English
Tasks:
- High-variability phonetic training (HVPT) using
  - forced-choice identification task
  - AX discrimination task
Stimuli: CVC nonwords V = /æ ʌ ɪ i: ɜ:/ C = /p t k b d g/
- e.g., vap, vup, vab, vub, deedge, teedge, vik, vig, parsh, barsh

*Earle, F. S., & Myers, E. B. (2015). Overnight consolidation promotes generalization across talkers in the identification of nonnative speech sounds. The Journal of the Acoustical Society of America, 137(1), EL91–EL97. https://doi.org/10.1121/1.4903918

*Flege, J. E. (1995b). Two procedures for training a novel second language phonetic contrast. Applied Psycholinguist, 16(4), 425–442. https://doi.org/10.1017/S0142716400066029

*Georgiou, G. P. (2021a). Effects of phonetic training on the discrimination of second language sounds by learners with naturalistic access to the second language. Journal of Psycholinguistic Research, 50, 707–721. https://doi.org/10.1007/s10936-021-09774-3

*Herd, W., Jongman, A., & Sereno, J. (2013). Perceptual and production training of intervocalic /d, ɾ, r/ in American English learners of Spanish. The Journal of the Acoustical Society of America, 133(6), 4247–4255. https://doi.org/10.1121/1.4802902

*Inceoglu, S. (2015). Effects of perceptual training on second language vowel perception and production. Applied Psycholinguistics, 37(5), 1175–1199. https://doi.org/10.1017/s0142716415000533

*Kartushina, N., Hervais-Adelman, A., Frauenfelder, U. H., & Golestani, N. (2015). The effect of phonetic production training with visual feedback on the perception and production of foreign speech sounds. The Journal of the Acoustical Society of America, 138(2), 817–832. https://doi.org/10.1121/1.4926561

*Lambacher, S. G., Martens, W. l., Kakehi, K., Marasinghe, C. A., & Molholt, G. (2005). The effects of identification training on the identification and production of American English vowels by native speakers of Japanese. Applied Psycholinguistics, 26(2), 227–247. https://doi.org/10.1017/S0142716405050150

*Martin, I. A., & Sippel, L. (2021). Is giving better than receiving? The effects of peer and teacher feedback on L2 pronunciation skills. Journal of Second Language Pronunciation, 7(1), 62–88. https://doi.org/10.1075/jslp.20001.mar

*Nishi, K., & Kewley-Port, D. (2007). Training Japanese listeners to perceive American English vowels: Influence of training sets. Journal of Speech, Language, and Hearing Research, 50(6), 1496–1509. https://doi.org/10.1044/1092-4388(2007/103)

*Piguet, T. E. (2001). Effect of phonetic/phonics instruction on reading pronunciation [Doctoral dissertation, Florida International University].

*Qian, M. (2018). An adaptive computational system for automated, learner-customized segmental perception training in words and sentences: Design, implementation, assessment [Doctoral dissertation, Iowa State University]. Iowa State University Digital Repository. https://dr.lib.iastate.edu/handle/20.500.12876/31476

*Rahimi, M. (2016). Second language articulatory training and computer-generated feedback in L2 pronunciation improvement. International Journal of Applied Linguistics, 167(2),190–209. https://doi.org/10.1075/itl.167.2.04rah

*Saito, K. (2012). Reexamining effects of form-focused instruction on L2 pronunciation development: The role of explicit phonetic information. Studies in Second Language Acquisition, 35(1), 1–29. https://doi.org/10.1017/s0272263112000666

*Tricomi, E., Delgado, M. R., McCandliss, B. D., McClelland, J. L., & Fiez, J. A. (2006). Performance feedback drives caudate activation in a phonological learning task. Journal of Cognitive Neuroscience, 18(6), 1029–1043. https://doi.org/10.1162/jocn.2006.18.6.1029

*Vlahou, E., Seitz, A. R., & Kopčo, N. (2019). Nonnative implicit phonetic training in multiple reverberant environments. Attention, Perception, and Psychophysics, 81(4), 935–947. https://doi.org/10.3758/s13414-019-01680-0

*Wei, P. (2016). Cross-linguistic perception and learning of Mandarin Chinese sounds by Japanese adult learners [Doctoral dissertation, University of Oregon]. Scholars’ Bank. https://hdl.handle.net/1794/22279

*Ylinen, S., Uther, M., Latvala, A., Vepsäläinen, S., Iverson, P., Akahane-Yamada, R., & Näätänen, R. (2010). Training the brain to weight speech cues differently: A study of Finnish second-language users of English. Journal of Cognitive Neuroscience, 22(6), 1319–1332. https://doi.org/ 10.1162/jocn.2009.21272

*Zaltz, Y., & Segal, O. (2022). The perception of Arabic vowel duration by L1 Hebrew speakers: Can a short training remold the effect of the native phonological system? Studies in Second Language Acquisition, 44(1), 143–163. https://doi.org/10.1017/S0272263120000728

Questions raised by this article

Comments