Repository logo
  • English
  • 中文
Log In
Have you forgotten your password?
  1. Home
  2. College of Electrical Engineering and Computer Science / 電機資訊學院
  3. Computer Science and Information Engineering / 資訊工程學系
  4. Improved Approaches of Spoken Document Retrieval – Subword-based Techniques and User/System Interaction
 
  • Details

Improved Approaches of Spoken Document Retrieval – Subword-based Techniques and User/System Interaction

Date Issued
2008
Date
2008
Author(s)
Pan, Yi-Cheng
URI
http://ntur.lib.ntu.edu.tw//handle/246246/184804
Abstract
This thesis consists of two parts. In the first part, we propose two new subword-based approaches for Spoken Document Retrieval (SDR), including Subword-based Position Specific Posterior Lattices (S-PSPL) and Subword-based Confusion Network (S-CN). These approaches are motivated by the PSPL and CN, respectively, but based on subword units instead of words.e introduce S-PSPL first.n the S-PSPL approach we encode the posterior probabilities and proximity information of subword units in a word lattice. critical issue in S-PSPL is to calculate the subword posterior probabilities (SPP) in a word lattice, which can not be carried out directly by simple dynamic programming.e make solve the problem by a simple approximation. To verify that this subword posterior probability (SPP) approximation procedure is accurate enough, we bring Subword-based Confusion Network (S-CN) onto stage.s the original goal of Confusion Network (CN) is to construct a decoding structure to meet the minimum word error rate criterion, S-CN can be used for minimum subword error rate.e embed the SPP approximation in the S-CN structure and achieved significant improvement in subword error rate reduction. This implicitly verifies the feasibility of the SPP approximation. Moreover, though introduced as a decoding structure, S-CN can be used as an efficient and compact indexing structure. This is the second subword-based approach for SDR proposed in this thesis.xtensive evaluations are then made on S-PSPL and S-CN to verify their superiorities. Further discussion and analysis are also given to compare the two very similar data structures PSPL/S-PSPL and CN/S-CN.n the evaluation and analysis S-PSPL is proved to be very attractive and even better than S-CN since it requires less or fairly equal resources while offers better accuracies under most circumstances.here are some possibilities to improve S-PSPL/S-CN system. In the thesis we propose an algorithm, Lexicon Adaptation with Reduced Character Errors (LARCE), to adapt the lexicon in the LVCSR system to improve the character recognition accuracy. In the evaluation, LARCE gives significant improvements in terms of character accuracy. It can be expected that with the improved subword recognition, S-PSPL/S-CN can be improved respectively.n the second part, we present a formulation and a framework for a new type of dialogue systems, referred to as the extit{type-II dialogue systems}, which evolves from the SDR systems but with a whole new definition and formulation. extit{Type-II dialogue systems} are proposed for the difficulties which can not be solved by traditional SDR systems. The new definition and formulation emphasize the interactions between the user and the system and this carries the term extit{dialogue systems}. However, it is significantly different from the conventional spoken dialogue systems and this is why we refer to it as extit{type-II}.he distinct feature of such dialogue systemss their tasks of information access from unstructured knowledge sources, or the lack of a well-organized back-end database offering the information for the user.ypical example tasks of this type of dialogue systems include information retrieval/browsing and question answering.he functionalities of each module in such extit{type-II dialogue systems} are analyzed, presented, and compared with the respective modules in extit{type-I dialogue systems}. series of novel technologies helpful in constructing extit{type-II dialogue systems} are then proposed in the thesis. In addition to the new SDR technologies already presented in part one, Named Entity Recognition (NER) from text and spoken documents, topic hierarchy construction for spoken documents, and dialogue modelling for information access are discussed here.or the NER, two novel approaches are proposed for text and spoken documents, respectively. For text documents we introduce to use global information in addition to local information (internal and external information) widely used in the NER community. For spoken documents, we propose to utilize the relevant documents retrieved from internet to augment the new NEs into the recognized lattice to compensate for the defects of the ASR system since many NEs are Out-of-Vocabulary words (OOVs).or the topic hierarchy construction, a novel approach HAC+P proposed recently cite{ChuangTOIS05} is used. We use the NEs extracted from the spoken documents to construct the balanced tree structures by HAC+P, to be used as a convenient system output for user interaction.or the dialogue modelling, a Markov Decision Process (MDP) based method is proposed to learn the best path to guide the user during the retrieval process. In many cases, the user''s initial query leads to too many retrieval results and the way for the system to guide the user is through the query expansion to specify user''s information need more clearly. In the proposed approach, the system learns to predict the user''s information need so as to be able to recommend the most discriminative and informative terms for query expansion with an MDP-based method.here is still a long way to go in the research and development of SDR technologies. It is hoped that the works in this thesis will be helpful in this research topic.
Subjects
Dialogue System
Spoken Document Retrieval
Indexing Structure
Type
thesis
File(s)
Loading...
Thumbnail Image
Name

ntu-97-D91922006-1.pdf

Size

23.32 KB

Format

Adobe PDF

Checksum

(MD5):ddb73befd3609b15b39b03f037b90067

臺大位居世界頂尖大學之列,為永久珍藏及向國際展現本校豐碩的研究成果及學術能量,圖書館整合機構典藏(NTUR)與學術庫(AH)不同功能平台,成為臺大學術典藏NTU scholars。期能整合研究能量、促進交流合作、保存學術產出、推廣研究成果。

To permanently archive and promote researcher profiles and scholarly works, Library integrates the services of “NTU Repository” with “Academic Hub” to form NTU Scholars.

總館學科館員 (Main Library)
醫學圖書館學科館員 (Medical Library)
社會科學院辜振甫紀念圖書館學科館員 (Social Sciences Library)

開放取用是從使用者角度提升資訊取用性的社會運動,應用在學術研究上是透過將研究著作公開供使用者自由取閱,以促進學術傳播及因應期刊訂購費用逐年攀升。同時可加速研究發展、提升研究影響力,NTU Scholars即為本校的開放取用典藏(OA Archive)平台。(點選深入了解OA)

  • 請確認所上傳的全文是原創的內容,若該文件包含部分內容的版權非匯入者所有,或由第三方贊助與合作完成,請確認該版權所有者及第三方同意提供此授權。
    Please represent that the submission is your original work, and that you have the right to grant the rights to upload.
  • 若欲上傳已出版的全文電子檔,可使用Open policy finder網站查詢,以確認出版單位之版權政策。
    Please use Open policy finder to find a summary of permissions that are normally given as part of each publisher's copyright transfer agreement.
  • 網站簡介 (Quickstart Guide)
  • 使用手冊 (Instruction Manual)
  • 線上預約服務 (Booking Service)
  • 方案一:臺灣大學計算機中心帳號登入
    (With C&INC Email Account)
  • 方案二:ORCID帳號登入 (With ORCID)
  • 方案一:定期更新ORCID者,以ID匯入 (Search for identifier (ORCID))
  • 方案二:自行建檔 (Default mode Submission)
  • 方案三:學科館員協助匯入 (Email worklist to subject librarians)

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science