https://scholars.lib.ntu.edu.tw/handle/123456789/577591
Title: | MatDC: A Multi-turn Multi-domain Annotated Task-oriented Dialogue Dataset in Chinese | Authors: | Tseng Y.-H; SHU-KAI HSIEH; Lian R; Chiang C.-Y; Chang Y.-L; LI-PING CHANG; Hsieh J.-L. |
Keywords: | Artificial intelligence; Semantics; Back-end database; Building model; Conversational agents; Different domains; Multi domains; Pipeline architecture; Semantic annotations; Testing grounds; Statistical tests | Publication date: | 2020 | Pages: | 165-170 | Source: | Proceedings - 25th International Conference on Technologies and Applications of Artificial Intelligence, TAAI 2020 | Abstract: | Modern conversational agents require high-quality datasets, which are often the bottleneck when building models. This paper introduces MatDC, an entirely human-produced Chinese dialogue dataset with full semantic annotations. The dataset features linguistic variations given users' intents and fully annotated semantic slots. The MatDC dataset was completely human-edited, and its curation comprises two stages. First, in the template design stage, domain editors construct schemas and compose ten dialogues between the agents and the users based on the back-end database. Second, in the dialogue rewrite stage, rewriters generate sentential variations for each template, under the constraint that the normalized slot values are kept unchanged. The underlying methodology of MatDC is more open to extension and more adaptable to different domains. To demonstrate the applicability of the dataset, we build a dialogue agent with a conventional pipeline architecture. We expect the MatDC dataset to provide additional training data and a testing ground for dialogue agent studies. © 2020 IEEE. |
URI: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85103812885&doi=10.1109%2fTAAI51410.2020.00038&partnerID=40&md5=b498430830968cd4ec16a5014d5719ab https://scholars.lib.ntu.edu.tw/handle/123456789/577591 |
DOI: | 10.1109/TAAI51410.2020.00038 |
Appears in: | Graduate Institute of Linguistics |
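The constraint named in the abstract's dialogue rewrite stage (a rewrite may vary the wording but must keep the normalized slot values unchanged) can be illustrated with a small check. This is only a minimal sketch under stated assumptions: the `Utterance` layout, the slot names `party_size` and `time`, the `slots_preserved` helper, and the English example sentences are all hypothetical and are not taken from the MatDC dataset or its tooling.

```python
from dataclasses import dataclass


@dataclass
class Utterance:
    """One dialogue turn with its annotated slots (hypothetical layout)."""
    speaker: str
    text: str
    slots: dict  # slot name -> normalized slot value


def slots_preserved(template: Utterance, rewrite: Utterance) -> bool:
    """Return True if the rewrite keeps the speaker and every normalized slot value."""
    return template.speaker == rewrite.speaker and template.slots == rewrite.slots


# A template turn and two candidate rewrites (invented examples, not MatDC data).
template = Utterance(
    "user",
    "I want to book a table for two at 7 pm.",
    {"party_size": "2", "time": "19:00"},
)
good = Utterance(
    "user",
    "Could you get me a table for a couple of people at seven tonight?",
    {"party_size": "2", "time": "19:00"},
)
bad = Utterance(
    "user",
    "Could you get me a table for three at seven tonight?",
    {"party_size": "3", "time": "19:00"},
)

# Keep only the rewrites whose normalized slot values match the template.
accepted = [r for r in (good, bad) if slots_preserved(template, r)]
```

Comparing normalized values rather than surface strings is what lets a rewriter say "seven tonight" in place of "7 pm" while the annotation stays `19:00`.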