Mispronunciation Detection and Diagnosis for Young Arabic Learners Using Transfer Learning
Author(s) -
Taha Fanoush,
Wasfi G. Al-Khatib,
Mohammad Amro,
Abdulkareem Alzahrani,
Moustafa Elshafei
Publication year - 2025
Publication title -
ieee access
Language(s) - English
Resource type - Magazines
SCImago Journal Rank - 0.587
H-Index - 127
eISSN - 2169-3536
DOI - 10.1109/access.2025.3616335
Subject(s) - aerospace , bioengineering , communication, networking and broadcast technologies , components, circuits, devices and systems , computing and processing , engineered materials, dielectrics and plasmas , engineering profession , fields, waves and electromagnetics , general topics for engineers , geoscience , nuclear engineering , photonics and electrooptics , power, energy and industry applications , robotics and control systems , signal processing and analysis , transportation
Improving primary school students’ reading skills supports their academic growth and communication abilities. Pronunciation accuracy is central to reading, especially in Arabic, where small diacritic changes can alter meaning. This is complicated by Arabic’s low-resource nature. This study developed a Mispronunciation Detection and Diagnosis (MDD) system for Arabic learners, allowing teachers and learners to use Computer-Assisted Pronunciation Training (CAPT) for improved instruction and assessment. A pretrained self-supervised learning (SSL) model was fine-tuned to detect phoneme-level pronunciation errors in Modern Standard Arabic using a unique dataset of primary school learner speech from Saudi Arabia. The data were structured, preprocessed, normalized, and aligned to phoneme sequences. The system showed improved phoneme recognition and performance approaching that of a human expert with an F1 score of 71.4%.
Accelerating Research
Robert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom
Address
John Eccles HouseRobert Robinson Avenue,
Oxford Science Park, Oxford
OX4 4GP, United Kingdom