Automatic Tagging for Disaster-related Presentation Files
Date Issued
2015
Date
2015
Author(s)
Yang, Shiang-Wen
Abstract
Disaster management depends on information. In recent years, experts in disaster response have created presentation files and used them to deliver critical information during disasters. Sufficient and rapid information delivery is key to government decision-making; a sound information delivery system lets users acquire suitable information efficiently. However, among the several types of disaster data, multimedia data such as presentation files has received little discussion and use. Without proper information extraction (IE) and information retrieval (IR) processes, this information cannot be applied in further applications or converted into knowledge. This research develops an automatic tagging method for disaster-related presentation files that consists of four processes: text extraction, word segmentation, feature tagging, and reasoning tagging. We also propose a logical operation that links tags to a database, and we implement the method using a rule-based, document-centered approach. After the data has passed through text extraction and word segmentation, it runs through the creation rules for each kind of tag, producing three types of tags: feature tags, direct reasoning tags, and complex reasoning tags. Direct reasoning tags and complex reasoning tags are produced by connecting one or two existing tags with the database. Through this interaction between tags and the database, disaster-related presentation files can be categorized, searched, and utilized. We conducted an efficiency test and a performance test. For the efficiency test, we selected 15 presentation files (253 pages in total) and measured the computing time of each process; producing one tag takes about one second on average. For the performance test, we selected 10 slides from the same data set by systematic sampling.
We invited two experts in the disaster prevention domain to tag those 10 slides and compared their results with the tags created by our process. Expert A produced 53 tags and expert B produced 150 tags, each in about 10 minutes. Our tagging process produced 29 tags within 1 minute. Of the experts' tags, 30 of expert A's and 122 of expert B's fall under the limitations of our process; setting those aside leaves 23 tags for expert A and 28 for expert B. In short, our process achieves results close to those of humans with professional knowledge, and does so much faster. This research develops information extraction and information retrieval processes for presentation files. The technology can help users acquire suitable information during a disaster and make presentation files more valuable in disaster management.
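The tagging stages described in the abstract can be illustrated with a minimal sketch. It is not the thesis implementation: the rule table, the database contents, the tag names, and every function name are hypothetical, text extraction is assumed to have already produced plain slide text, and a simple regular expression stands in for the word segmentation process. The sketch also assumes that a direct reasoning tag is derived from one existing tag and a complex reasoning tag from a pair of existing tags, which is one reading of "one or two existing tags".

```python
import re
from itertools import combinations

# Hypothetical creation rules: a segmented word maps to a feature tag.
FEATURE_RULES = {
    "typhoon": "hazard:typhoon",
    "flood": "hazard:flood",
    "taipei": "place:Taipei",
}

# Hypothetical database: one existing tag leads to a direct reasoning tag.
DIRECT_DB = {"place:Taipei": "region:northern Taiwan"}

# Hypothetical database: a pair of existing tags leads to a complex reasoning tag.
COMPLEX_DB = {
    frozenset({"hazard:flood", "place:Taipei"}): "plan:Taipei flood response",
}


def segment(text):
    """Stand-in for word segmentation: lowercase alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def feature_tags(tokens):
    """Apply the rule table to every token that has a rule."""
    return {FEATURE_RULES[t] for t in tokens if t in FEATURE_RULES}


def direct_reasoning_tags(tags):
    """Look up each existing tag in the database."""
    return {DIRECT_DB[t] for t in tags if t in DIRECT_DB}


def complex_reasoning_tags(tags):
    """Look up each unordered pair of existing tags in the database."""
    pairs = (frozenset(p) for p in combinations(sorted(tags), 2))
    return {COMPLEX_DB[p] for p in pairs if p in COMPLEX_DB}


def tag_slide(text):
    """Run already-extracted slide text through all tagging stages."""
    features = feature_tags(segment(text))
    return {
        "feature": features,
        "direct": direct_reasoning_tags(features),
        "complex": complex_reasoning_tags(features),
    }


result = tag_slide("Flood warning for Taipei after the typhoon")
```

Keeping the rules and the database as plain lookup tables mirrors the rule-based, document-centered idea: new tags can be added by extending the tables without changing the tagging functions.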
Subjects
Disaster Management
Information Delivery
Presentation Tool
Tag
Automatic
Type
thesis
File(s)
Name
ntu-104-R02521602-1.pdf
Size
23.32 KB
Format
Adobe PDF
Checksum
(MD5):95db71131d40a69c1e45f3124e02f184
