Extractive broadcast news summarization leveraging recurrent neural network language modeling techniques

Chen K.-Y.; Liu S.-H.; Chen B.; Wang H.-M.; Jan E.-E.; Hsu W.-L.; Chen H.-H.; Chen H.-H.;Hsu W.-L.;Jan E.-E.;Wang H.-M.;Chen B.;Liu S.-H.;Chen K.-Y.

doi:10.1109/TASLP.2015.2432578

Extractive broadcast news summarization leveraging recurrent neural network language modeling techniques

Journal

IEEE Transactions on Audio, Speech and Language Processing

Journal Volume

23

Journal Issue

8

Pages

1322-1334

Date Issued

2015

Author(s)

Chen K.-Y.

Liu S.-H.

Chen B.

Wang H.-M.

Jan E.-E.

Hsu W.-L.

Chen H.-H.

DOI

10.1109/TASLP.2015.2432578

URI

https://scholars.lib.ntu.edu.tw/handle/123456789/413105

URL

https://www.scopus.com/inward/record.uri?eid=2-s2.0-84930943980&doi=10.1109%2fTASLP.2015.2432578&partnerID=40&md5=282ef2e7a12ea5d63de7d8a7163a7bd9

Abstract

Extractive text or speech summarization manages to select a set of salient sentences from an original document and concatenate them to form a summary, enabling users to better browse through and understand the content of the document. A recent stream of research on extractive summarization is to employ the language modeling (LM) approach for important sentence selection, which has proven to be effective for performing speech summarization in an unsupervised fashion. However, one of the major challenges facing the LM approach is how to formulate the sentence models and accurately estimate their parameters for each sentence in the document to be summarized. In view of this, our work in this paper explores a novel use of recurrent neural network language modeling (RNNLM) framework for extractive broadcast news summarization. On top of such a framework, the deduced sentence models are able to render not only word usage cues but also long-span structural information of word co-occurrence relationships within broadcast news documents, getting around the need for the strict bag-of-words assumption. Furthermore, different model complexities and combinations are extensively analyzed and compared. Experimental results demonstrate the performance merits of our summarization methods when compared to several well-studied state-of-the-art unsupervised methods. ? 2015 IEEE.

Subjects

Language modeling

long-span structural information

recurrent neural network

speech summarization

SDGs

[SDGs]SDG4

Type

journal article

Extractive broadcast news summarization leveraging recurrent neural network language modeling techniques

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)