智慧型知識擷取技術與應用研究─子計畫一:語料庫之設計與製作--語料庫與資訊檢索標竿測試集設計之研究(III)
Date Issued
1999-07-31
Date
1999-07-31
Author(s)
DOI
882213E002035
Abstract
The research and development of information retrieval
(IR) has made much progress recently. However,
there’s not any applicable mechanism for system
evaluation in the Chinese research society. This project
aims at the design and the implementation for Chinese
information retrieval benchmark. Generally speaking,
a benchmark consists of a set of documents, a set of
topics, and a set of relevance between documents and
topics. Accordingly, our task is also separated into
three parts. The document set is downloaded from
various electronic news sites, and totally 132,207
documents are collected. To build the topics, we
investigate the real user information needs by using a
questionnaire, and then modify them to be the formal
topics. As to relevance judgment, we first set up a pool
of candidate documents for each topic, and then invite
three persons to judge the relevance. Finally, we
combine the judgments and offer a relevance measure for each document in the pool. The result of our
research shows that the benchmark possesses a
complete structure and medium scale, and we may
further expand and improve it based on existing
framework in the future.
Subjects
Benchmark
Information Retrieval
Relevance Judgment
Topic
Publisher
臺北市:國立臺灣大學圖書資訊學系暨研究所
Type
report
File(s)![Thumbnail Image]()
Loading...
Name
882213E002035.pdf
Size
93.95 KB
Format
Adobe PDF
Checksum
(MD5):c33f44c63a7fb3aea6e1975cbfd6d254
