Spoken document understanding and organization
Journal
IEEE Signal Processing Magazine
Journal Volume
22
Journal Issue
5
Pages
42-60
Date Issued
2005
Author(s)
Chen, Berlin
Abstract
Spoken documents associated with the network content are critical for retrieval and browsing. An overview is given of various technology areas reaching towards this goal in a unified context. Technology areas covered include named-entity (NE) extraction, segmentation, and information extraction for the spoken documents as well as automatic summarization, title generation, and topic analysis and organization. The relevant problems and issues, general principles, and basic approaches for each area are briefly reviewed. A framework for properly integrating all these different technology areas is proposed, in which four different levels of processes are defined and bottom-up and top-down relationships are discussed. An initial prototype system for such purposes has been developed by National Taiwan University. The resultant system used broadcast news in Mandarin Chinese as example spoken documents.
SDGs
Other Subjects
Indexing (of information); Information retrieval; Mathematical models; Matrix algebra; Multimedia systems; Probability distributions; Self organizing maps; Speech processing; Speech synthesis; Statistical methods; User interfaces; Wireless telecommunication systems; Audio indexing; Browsing; Multimedia content; Singular value decomposition (SVD); Speech communication
Type
journal article
