Artificial Intelligence Technology for EAP Speaking Skills: Student Perceptions of Opportunities and Challenges

  • Bin ZouEmail author
  • Sara Liviero
  • Mengyuan Hao
  • Chaoyang Wei
Part of the New Language Learning and Teaching Environments book series (NLLTE)


This study explores university students’ attitudes regarding the potential of artificial intelligence (AI)-assisted mobile applications (apps) to support the development of speaking skills in English for academic purposes (EAP) courses in higher education. Analysis of the data shows students expressing a preference to use AI tools for speaking development due to limited teacher feedback, and although they were generally satisfied practising their English using the AI technologies, the findings also point to certain limitations of the current AI apps, such as lack of applicable feedback and few model examples. In addition, students held strong views discouraging any notion that AI could replace actual language teachers. In conclusion, students suggest the need for more AI resources, especially apps that accommodate a variety of English accents.



This research is supported by KSF-E-16 in XJTLU.


  1. Al-Fahad, F. N. (2009). Students’ attitudes and perceptions towards the effectiveness of Mobile learning in King Saud University, Saudi Arabia. Turkish Online Journal of Educational Technology, 8(2), 1–9.Google Scholar
  2. Apple Inc. (2018). Siri [Software]. Retrieved from
  3. Baicizhan. (2018). Baichizhan [Software]. Retrieved from
  4. BALEAP. (2018). BALEAP Can Do Framework. Retrieved from
  5. Beaven, A., & Neuhoff, A. (2012, January 1). Assessing oral proficiency for intercultural professional communication: The CEFcult project. European Association for Computer-Assisted Language Learning (EUROCALL). Retrieved from
  6. Bernstein, J., Cohen, M., Murveit, H., Rtischev, D., & Weintraub, M. (1990). Automatic evaluation and training in English pronunciation. Paper presented at Conference. ISCA, Kobe, Japan, pp. 1185–1188. Retrieved from
  7. Boersma. P., & Weenink, D. (2018). Praat: Doing Phonetics by Computer [Software]. Retrieved from:
  8. Bordonaro, K. (2003). Perceptions of technology and manifestations of language learner autonomy. CALL-EJ Online, 5(1). Retrieved from
  9. Bruce, I. (2011). Theory and concepts of English for academic purposes. London: Palgrave Macmillan.Google Scholar
  10. Cargill, M., & O’Connor, P. (2011). Writing scientific research articles: Strategy and steps. London: Wiley-Blackwell.Google Scholar
  11. Celce-Murcia, M., Brinton, D. M., Goodwin, J. M., & Griner, B. (2010). Teaching pronunciation. Hardback with audio CDs (2): A course book and reference guide. Cambridge: Cambridge University Press.Google Scholar
  12. Charmaz, K. (2003). Qualitative interviewing and grounded theory analysis. In J. F. Gubrium & J. A. Holstein (Eds.), Handbook of interview research: Context & method (pp. 675–694). London: Sage.Google Scholar
  13. Chavan, K., & Gawande, U. (2015). Speech recognition in noisy environment, issues and challenges: A review. In Proceedings of 2015 International Conference on Soft-Computing and Networks Security (ICSNS). Coimbatore.Google Scholar
  14. Chen, H. H.-J. (2011). Developing and evaluating an oral skills training website supported by automatic speech recognition technology. ReCALL, 23(1), 59–78.CrossRefGoogle Scholar
  15. Chiu, T.-L., Liou, H.-C., & Yeh, Y. (2007). A study of web-based oral activities enhanced by automatic speech recognition for EFL college learning. Computer Assisted Language Learning, 20(3), 209–233.CrossRefGoogle Scholar
  16. Chivox, Ltd. (2018a). CHIVOX-kami English system [computer software]. Suzhou: Chivox Ltd.Google Scholar
  17. Chivox, Ltd. (2018b). CHIVOX-Pioneer in intelligent speech analysis technology [computer software]. Suzhou: Chivox Ltd.Google Scholar
  18. Council of Europe. (2018, April 16). Common european framework of reference for languages: Learning, teaching, assessment (CEFR). Retrieved from
  19. Creswell, J. W. (2013). Research design: Qualitative, quantitative, and mixed methods approaches. Thousand Oaks, CA: SAGE Publications.Google Scholar
  20. de Jong, J., & Benigno, V. (2018, April 16). The CEFR in higher education: Developing descriptors of academic English. Retrieved from
  21. Demouy, V., & Kukulska-Hulme, A. (2010). On the spot: Using mobile devices for listening and speaking practice on a French language programme. Open Learning: The Journal of Open, Distance and e-Learning, 25(3), 217–232.CrossRefGoogle Scholar
  22. Deng, Q., & Trainin, G. (2015). Learning vocabulary with apps: From theory to practice. The Nebraska Educator, 2, 49–69.Google Scholar
  23. Derwing, T. M., Munro, M. J., & Carbonaro, M. (2012). Does popular speech recognition software work with ESL speech? TESOL Quarterly, 34(3), 592–603.CrossRefGoogle Scholar
  24. Dlaska, A., & Krekeler, C. (2008). Self-assessment of pronunciation. System, 36(4), 506–516.CrossRefGoogle Scholar
  25. Dörnyei, Z. (2007). Research methods in applied linguistics: Quantitative, qualitative, and mixed methodologies. Oxford: Oxford Applied Linguistics.Google Scholar
  26. Douma, P., Anderson, G., Akahane, M., & Mizikovsky, S. (1996). Methods and apparatus for training and operating voice recognition systems. In US5583965A Documentation. Retrieved from
  27. Duolingo. (2018). Duolingo: Learn Spanish, French and other languages for free [computer software]. Cheshire: Duolinguo.Google Scholar
  28. Gardner, R. C., & Lambert, W. E. (1972). Attitudes and motivation in second language learning. Rowley, MA: Newbury House.Google Scholar
  29. Gilakjani, A. P. (2011). A study on the situation of pronunciation instruction in ESL/EFL classrooms. Journal of Studies in Education, 1(1:E4), 1–15.Google Scholar
  30. Gilakjani, A. P., & Sabouri, N. B. (2016). How can EFL teachers help EFL learners improve their English pronunciation? Journal of Language Teaching and Research, 7(5), 967–972.CrossRefGoogle Scholar
  31. Glasman-Deal, H. (2010). Science research writing for non-native speakers of English. London: Imperial College Press.Google Scholar
  32. Google Cloud. (2018). Cloud Speech-to-Text API [computer software]. Retrieved from
  33. Hincks, R. (2005). Measures and perceptions of liveliness in student oral presentation speech: A proposal for an automatic feedback mechanism. System, 33(4), 575–591.CrossRefGoogle Scholar
  34. IBM SPSS. (2018). IBM SPSS. Statistics package for the social sciences (Version 22) [Software]. Retrieved from
  35. IELTS. (2018). IELTS. Australia: British Council. Retrieved from
  36. iFlytek Co., Ltd. (2018). iFlytek [Software]. Retrieved from
  37. Jenkins, J. (2014). English as a lingua Franca in the international university: The politics of academic English language policy. Oxon: Routledge.Google Scholar
  38. Jenkins, J. (2017). Mobility and English language policies and practices in higher education. Oxon: Routledge.CrossRefGoogle Scholar
  39. Jia, J. (2009). An AI framework to each English as a foreign language: CSIEC. AI Magazine, 30(2), 59–71.CrossRefGoogle Scholar
  40. K12 Inc. (2018). K12 online education programs & schooling. Retrieved from
  41. Kan, Q., & Tang, J. L. (2018). Researching mobile-assisted English language learning among adult distance learners in China: Emerging practices and learner perception of teacher role. International Journal of Computer-Assisted Language Learning and Teaching, 8(3), 1–28.CrossRefGoogle Scholar
  42. Kang, O., Thomson, R., & Moran, M. (2018). The effects of international accents and shared first language on listening comprehension tests. TESOL Quarterly, 53(1), 56–81.CrossRefGoogle Scholar
  43. Kessler, G., Bikowski, D., & Boggs, J. (2012). Collaborative writing among second language learners in academic web-based projects. Language Learning and Technology, 16(1), 91–109.Google Scholar
  44. Kim, I.-S. (2006). Automatic speech recognition: Reliability and pedagogical implications for teaching pronunciation. Journal of Educational Technology & Society, 9(1), 322–334.Google Scholar
  45. Kim, Y., Soyata, T., & Behnagh, R. F. (2018). Towards emotionally aware AI smart classroom: Current issues and directions for engineering and education. IEEE Access, 6, 5308–5331.CrossRefGoogle Scholar
  46. Knight, W. (2017). China’s AI awakening. The West should stop worrying about China’s AI revolution. Retrieved from
  47. Köse, U., & Arslan, A. (2014). Design and development of a chaos-based image encryption system. In S. Banerjee & Ş. Ş. Erçetin (Eds.), Chaos, complexity and leadership 2012 (pp. 23–28). Dordrecht: Springer.CrossRefGoogle Scholar
  48. Liakin, D., Cardoso, W., & Liakina, N. (2015). Learning L2 pronunciation with a mobile speech recognizer: French /y/. CALICO Journal, 32(1), 1–25.CrossRefGoogle Scholar
  49. Liulishuo. (2017). Liulishuo—Your personal AI English teacher [Software]. Retrieved from
  50. Mauranen, A. (2012). Exploring ELF: Academic English shaped by non-native speakers. Cambridge: Cambridge University Press.Google Scholar
  51. McCrocklin, S. M. (2016). Pronunciation learner autonomy: The potential of automatic speech recognition. System, 57, 25–42.CrossRefGoogle Scholar
  52. Meisam, R., & Tavakoli, M. (2015). The effectiveness of CALL in helping Persian L2 learners produce the English vowel /ɒ/. GEMA Online Journal of Language Studies, 15(3), 17–30.Google Scholar
  53. Murphy, J. M. (2014). Intelligible, comprehensible, non-native models in ESL/EFL pronunciation teaching. System, 42, 258–269.CrossRefGoogle Scholar
  54. Neri, A., Cucchiarini, C., & Strik, W. (2003). Automatic speech recognition for second language learning: How and why it actually works. Paper presented at 15th ICPhS Barcelona, Spain.Google Scholar
  55. Nuance Communications. (2018). Dragon naturally speaking [Software]. Available from
  56. Oppenheim, A. N. (1992). Questionnaire design, interviewing and attitude measurement. London: Continuum.Google Scholar
  57. Pallant, J. (2013). SPSS survival manual: A step by step guide to data analysis using IBM SPSS. Maidenhead, Berkshire: McGraw-Hill Education.Google Scholar
  58. Park, M., & Slater, T. (2014). A typology of tasks for mobile-assisted language learning: Recommendations from a small-scale needs analysis. TESL Canada Journal, 31(SI8), 93–115.Google Scholar
  59. Reinders, H., & Darasawang, P. (2012). Diversity in learner support. In G. Stockwell (Ed.), Computer-assisted language learning: Diversity in research and practice (pp. 49–70). Cambridge: Cambridge University Press.CrossRefGoogle Scholar
  60. Sapsford, R. (1999). Survey research. London: SAGE Publications.Google Scholar
  61. Setter, J., & Jenkins, J. (2005). State-of-the-art review article. Language Teaching, 38(1), 1–17.CrossRefGoogle Scholar
  62. Sun, C., Branum-Martin, L., Peng, P., & Tao, S. (2018). Phonology, orthography, and decoding skills within and across English and Chinese. Scientific Studies of Reading, 22(5), 401–419.CrossRefGoogle Scholar
  63. Thomas, D. R. (2006). A general inductive approach for analyzing qualitative evaluation data. American Journal of Evaluation, 27(2), 237–246.CrossRefGoogle Scholar
  64. Wang, Y.-H., & Young, S. S.-C. (2014). A study of the design and implementation of the ASR-based iCASL system with corrective feedback to facilitate English learning. Journal of Educational Technology & Society, 17(2), 219–233.Google Scholar
  65. Xu, Q., & Peng, H. (2017). Investigating mobile-assisted oral feedback in teaching Chinese as a second language. Computer Assisted Language Learning, 30(3–4), 173–182.CrossRefGoogle Scholar
  66. Young, V., & Mihailidis, A. (2010). Difficulties in automatic speech recognition of dysarthric speakers and implications for speech-based applications used by the elderly: A literature review. Assistive Technology, 22(2), 99–112.CrossRefGoogle Scholar
  67. Zhang, H., Song, W., & Huang, R. (2014). Business English vocabulary learning with mobile phone: A Chinese students’ perspective. International Journal of Computer-Assisted Language Learning and Teaching, 4(2), 46–63.CrossRefGoogle Scholar
  68. Zou, B., Li, H., & Li, J. (2018). Exploring a curriculum app and a social communication app for EFL learning. Computer Assisted Language Learning., 31(7), 694–713.CrossRefGoogle Scholar
  69. Zou, B., Wang, D. S., & Xing, M. J. (2016). Collaborative tasks in wiki-based environment in EFL learning. Computer Assisted Language Learning, 29(5), 1000–1016.CrossRefGoogle Scholar

Copyright information

© The Author(s) 2020

Authors and Affiliations

  • Bin Zou
    • 1
    Email author
  • Sara Liviero
    • 1
  • Mengyuan Hao
    • 1
  • Chaoyang Wei
    • 2
  1. 1.Xi’an Jiaotong-Liverpool UniversitySuzhouChina
  2. 2.University of LiverpoolLiverpoolUK

Personalised recommendations