Publication Date | Title | Authors | Source Publication | Scopus | WOS | Full Text |
2021 | S2VC: A framework for any-to-any voice conversion with self-supervised pretrained representations | Lin J.-H; Lin Y.Y; Chien C.-M; HUNG-YI LEE | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | | | |
2018 | Seeing and hearing too: Audio representation for video captioning | Chuang, S.-P.; Wan, C.-H.; Huang, P.-C.; Yang, C.-Y.; HUNG-YI LEE | 2017 IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 2017 - Proceedings | | | |
2018 | Segmental Audio Word2Vec: Representing Utterances as Sequences of Vectors with Applications in Spoken Term Detection | Wang, Y.-H.; Lee, H.-Y.; HUNG-YI LEE ; LIN-SHAN LEE | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings | 36 | 0 | |
2020 | Self-Supervised Deep Learning for Fisheye Image Rectification | Chao, C.-H.; Hsu, P.-L.; YU-CHIANG WANG ; HUNG-YI LEE | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings | 19 | 0 | |
2022 | Self-supervised Representation Learning for Speech Processing | HUNG-YI LEE ; Mohamed, Abdelrahman; Watanabe, Shinji; Sainath, Tara; Livescu, Karen; Li, Shang Wen; Yang, Shu Wen; Kirchhoff, Katrin | NAACL 2022 - 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Tutorial Abstracts | 2 | | |
2022 | Self-Supervised Speech Representation Learning: A Review | Mohamed, Abdelrahman; HUNG-YI LEE ; Borgholt, Lasse; Havtorn, Jakob D.; Edin, Joakim; Igel, Christian; Kirchhoff, Katrin; Li, Shang Wen; Livescu, Karen; Maaløe, Lars; Sainath, Tara N.; Watanabe, Shinji | IEEE Journal on Selected Topics in Signal Processing | 57 | 13 | |
2012 | Semantic query expansion and context-based discriminative term modeling for spoken document retrieval | Tu, T.-W.; Lee, H.-Y.; Chou, Y.-Y.; HUNG-YI LEE ; LIN-SHAN LEE | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing | 4 | 0 | |
2014 | Semantic retrieval of personal photos using matrix factorization and two-layer random walk fusing sparse speech annotations with visual features | Liou, Y.-M.; Fu, Y.-S.; Lee, H.-Y.; Lee, L.-S.; HUNG-YI LEE | Annual Conference of the International Speech Communication Association, INTERSPEECH | | | |
2020 | Semi-supervised learning for multi-speaker text-to-speech synthesis using discrete speech representation | Tu T; Chen Y.-J; Liu A.H; HUNG-YI LEE | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | | | |
2021 | Semi-supervised spoken language understanding via self-supervised speech and language model pretraining | Lai C.-I; Chuang Y.-S; Li S.-W; Glass J.; HUNG-YI LEE | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings | 26 | 0 | |
2020 | Sequence-to-Sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding | Liu, A.H.; Sung, T.-W.; Chuang, S.-P.; HUNG-YI LEE ; LIN-SHAN LEE | ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings | 8 | 0 | |
2023 | SLUE Phase-2: A Benchmark Suite of Diverse Spoken Language Understanding Tasks | Shon, Suwon; Arora, Siddhant; Lin, Chyi Jiunn; Pasad, Ankita; Wu, Felix; Sharma, Roshan; Wu, Wei Lun; HUNG-YI LEE ; Livescu, Karen; Watanabe, Shinji | Proceedings of the Annual Meeting of the Association for Computational Linguistics | 0 | | |
2013 | Speaking Rate Normalization with Lattice-based Context-dependent Phoneme Duration Modeling for Personalized Speech Recognizers on Mobile Devices | Ching-Feng Yeh; HUNG-YI LEE ; LIN-SHAN LEE | Interspeech | 2 | | |
2020 | SpeechBERT: An audio-and-text jointly learned language model for end-to-end spoken question answering | Chuang Y.-S; Liu C.-L; Lee H.-Y; HUNG-YI LEE ; LIN-SHAN LEE | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | 26 | 0 | |
2015 | Spoken Content Retrieval - Beyond Cascading Speech Recognition with Text Retrieval | Lin-shan Lee; James Glass; Hung-yi Lee; HUNG-YI LEE ; LIN-SHAN LEE ; JIUN-HAW LEE | IEEE/ACM Transactions on Audio, Speech, and Language Processing | 78 | 68 | |
2014 | Spoken knowledge organization by semantic structuring and a prototype course lecture system for personalized learning | Lee, H.-Y.; Shiang, S.-R.; Yeh, C.-F.; Chen, Y.-N. ; Huang, Y.; Kong, S.-Y.; HUNG-YI LEE ; LIN-SHAN LEE ; YUN-NUNG CHEN ; JIUN-HAW LEE | IEEE Transactions on Audio, Speech and Language Processing | 16 | 15 | |
2014 | Spoken question answering using tree-structured conditional random fields and two-layer random walk | Shiang, S.-R.; Lee, H.-Y.; Lee, L.-S.; HUNG-YI LEE | Annual Conference of the International Speech Communication Association, INTERSPEECH | | | |
2018 | Spoken SQuAD: A Study of Mitigating the Impact of Speech Recognition Errors on Listening Comprehension | Chia-Hsuan Li; Szu-Lin Wu; Chi-Liang Liu; Hung-yi Lee; HUNG-YI LEE | INTERSPEECH | 26 | 0 | |
2009 | Spoken term detection from bilingual spontaneous speech using code-switched lattice-based structures for words and subword units | Lee, H.-Y.; Tang, Y.-L.; Tang, H.; HUNG-YI LEE ; LIN-SHAN LEE | Proceedings of the 2009 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009 | 8 | 0 | |
2022 | Spoofing-Aware Speaker Verification by Multi-Level Fusion | Wu, Haibin; Meng, Lingwei; Kang, Jiawen; Li, Jinchao; LI XU; Wu, Xixin; HUNG-YI LEE ; Meng, Helen | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH | 2 | 0 | |