智慧型知識擷取技術與應用研究─子計畫一：語料庫之設計與製作--語料庫與資訊檢索標竿測試集設計之研究(III)

陳光華

智慧型知識擷取技術與應用研究─子計畫一：語料庫之設計與製作--語料庫與資訊檢索標竿測試集設計之研究(III)

Date Issued

1999-07-31

Date

1999-07-31

Author(s)

陳光華

DOI

882213E002035

URI

http://ntur.lib.ntu.edu.tw//handle/246246/20395

Abstract

The research and development of information retrieval (IR) has made much progress recently. However, there’s not any applicable mechanism for system evaluation in the Chinese research society. This project aims at the design and the implementation for Chinese information retrieval benchmark. Generally speaking, a benchmark consists of a set of documents, a set of topics, and a set of relevance between documents and topics. Accordingly, our task is also separated into three parts. The document set is downloaded from various electronic news sites, and totally 132,207 documents are collected. To build the topics, we investigate the real user information needs by using a questionnaire, and then modify them to be the formal topics. As to relevance judgment, we first set up a pool of candidate documents for each topic, and then invite three persons to judge the relevance. Finally, we combine the judgments and offer a relevance measure for each document in the pool. The result of our research shows that the benchmark possesses a complete structure and medium scale, and we may further expand and improve it based on existing framework in the future.

Subjects

Benchmark

Information Retrieval

Relevance Judgment

Topic

Publisher

臺北市：國立臺灣大學圖書資訊學系暨研究所

Type

report

File(s)

Name

882213E002035.pdf

Size

93.95 KB

Format

Adobe PDF

Checksum

(MD5):c33f44c63a7fb3aea6e1975cbfd6d254

智慧型知識擷取技術與應用研究─子計畫一：語料庫之設計與製作--語料庫與資訊檢索標竿測試集設計之研究(III)

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)