Wang, Y.-B.Y.-B.WangLIN-SHAN LEE2020-06-112020-06-11201315206149https://scholars.lib.ntu.edu.tw/handle/123456789/498586https://www.scopus.com/inward/record.uri?eid=2-s2.0-84890507052&doi=10.1109%2fICASSP.2013.6639270&partnerID=40&md5=eab572e36bda2aae49e06b6b430a8f3fIn Computer-Aided Pronunciation Training, we hope to specify the type of mispronunciation, or Error Pattern (EP), the language learner has made as a more effective feedback. But derivation of EPs usually requires expert knowledge and pedagogical experiences, which is not easy to obtain for each pair of target and native languages. In this paper we propose a preliminary framework toward unsupervised discovery of EPs from a corpus of learners' recordings. We use Universal Phoneme Posteriorgram, derived from Multi-Layer Perceptron trained with a corpus of mixed languages, as features to bring supervised knowledge into the unsupervised task. We also use Hierarchical Agglomerative Clustering algorithm to explore sub-segmental variation of phoneme segments for distinguishing EPs. We tested K-means (assuming known number of EPs) and Gaussian Mixture Model with minimum description length principle (estimating unknown number of EPs) for EP discovery. Preliminary experimental results illustrated the effectiveness of the proposed framework, although there is still a long way to go compared to human annotators. © 2013 IEEE.GMM-MDL; HAC; K-means; Pronunciation Error Pattern Discovery; Rand Index; Universal Phoneme PosteriorgramError patterns; GMM-MDL; HAC; K-means; Posteriorgram; Rand index; Electrical engineering; Signal processingToward unsupervised discovery of pronunciation error patterns using universal phoneme posteriorgram for computer-assisted language learningconference paper10.1109/ICASSP.2013.66392702-s2.0-84890507052