I-vector based language modeling for spoken document retrieval

Chen K.-Y.; Lee H.-S.; Wang H.-M.; Chen B.; HSIN-HSI CHEN; Chen K.-Y.;Lee H.-S.;Wang H.-M.;Chen B.;Chen H.-H.

doi:10.1109/ICASSP.2014.6854974

I-vector based language modeling for spoken document retrieval

Journal

IEEE International Conference on Acoustics, Speech and Signal Processing

Pages

7083-7087

ISBN

9781479928927

Date Issued

2014

Author(s)

Chen K.-Y.

Lee H.-S.

Wang H.-M.

Chen B.

HSIN-HSI CHEN

DOI

10.1109/ICASSP.2014.6854974

URI

https://www.scopus.com/inward/record.uri?eid=2-s2.0-84905232829&doi=10.1109%2fICASSP.2014.6854974&partnerID=40&md5=d675ba85e4b8cd6b3c1affe8dac5f1f6

Abstract

Since more and more multimedia data associated with spoken documents have been made available to the public, spoken document retrieval (SDR) has become an important research subject in the past two decades. The i-vector based framework has been proposed and introduced to language identification (LID) and speaker recognition (SR) tasks recently. The major contribution of the i-vector framework is to reduce a series of acoustic feature vectors of a speech utterance to a low-dimensional vector representation, and then numbers of well-developed postprocessing techniques (such as probabilistic linear discriminative analysis, PLDA) can be readily and effectively used. However, to our best knowledge, there is no research up to date on applying the i-vector framework for SDR or information retrieval (IR). In this paper, we make a step forward to formulate an i-vector based language modeling (IVLM) framework for SDR. Furthermore, we evaluate the proposed IVLM framework with both inductive and transductive learning strategies. We also exploit multi-levels of index features, including word- and subword-level units, in concert with the proposed framework. The results of SDR experiments conducted on the TDT-2 (Topic Detection and Tracking) collection demonstrate the performance merits of our proposed framework when compared to several existing approaches. ? 2014 IEEE.

Subjects

i-vector

inductive

language modeling

Spoken document retrieval

transductive

SDGs

[SDGs]SDG10

Type

conference paper

I-vector based language modeling for spoken document retrieval

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)