On Detecting Duplications in a Database of Taiwanese Old Photographs
Date Issued
2009
Date
2009
Author(s)
Chu, Kuo-Yen
Abstract
In 2003, the National Taiwan University Library produced a digital collection of old photographs of Taiwan. They cover the period from 1895 to 1945, when Taiwan was occupied by Japan. The photos, 38,653 in total, were selected from over 2,000 books published by the Japanese Colonial Government during that time, and cover a wide range of subjects. They were made into a digital library, with images and metadata records, and is the most extensive database of its kind in existence.e observed that there are duplications of photos in the database. They were either because certain photos were included in different books, or because some books were scanned twice.he purpose of the research reported in this thesis is to find duplication of images in the database. We adopted methods in content-based image retrieval and developed a system to identify pairs that might have come from the same photo. The pairs were then checked manually to see if they are indeed duplicates.mong the photographs in the database, our system identified 308,286 pairs, of which 3,270 were duplicated photo pairs. Since some photos appeared more than twice (9 being the most), there are 2,621 photo groups altogether. We estimate that the recall rate is over 90%.
Subjects
edge detecting
duplicate photos
content-based image retrieval
old photographs
Type
thesis
File(s)![Thumbnail Image]()
Loading...
Name
ntu-98-R94944020-1.pdf
Size
23.32 KB
Format
Adobe PDF
Checksum
(MD5):1c3df0a04f579f41a19d0a91d86ac4b9
