Authors: Hung-Chieh Fang; Nai-Xuan Ye; Yi-Jen Shih; Puyuan Peng; Hsuan-Fu Wang; Layne Berry; Hung-Yi Lee; David Harwath
Dates: 2024-10-14; 2024-10-14; 2024-04-14
URL: https://scholars.lib.ntu.edu.tw/handle/123456789/722039
Title: Integrating Self-Supervised Speech Model with Pseudo Word-Level Targets from Visually-Grounded Speech Model
Type: conference paper
DOI: 10.1109/icasspw62465.2024.10625802