Title: Seeing and hearing too: Audio representation for video captioning
Authors: Chuang, S.-P.; Wan, C.-H.; Huang, P.-C.; Yang, C.-Y.; HUNG-YI LEE
Type: conference paper
Issued: 2018
Record date: 2020-06-11
DOI: 10.1109/ASRU.2017.8268961
Scopus ID: 2-s2.0-85050515076
Repository URI: https://scholars.lib.ntu.edu.tw/handle/123456789/498365
Scopus URL: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85050515076&doi=10.1109%2fASRU.2017.8268961&partnerID=40&md5=dc4a0e841d3815819e0f6a819e683dea