ScannerNet: A Deep Network for Scanner-Quality Document Images under Complex Illumination

Hsu, Chih Jou; Wu, Yu Ting; MING-SUI LEE; YUNG-YU CHUANG

ScannerNet: A Deep Network for Scanner-Quality Document Images under Complex Illumination

Journal

BMVC 2022 - 33rd British Machine Vision Conference Proceedings

Date Issued

2022-01-01

Author(s)

Hsu, Chih Jou

Wu, Yu Ting

MING-SUI LEE

YUNG-YU CHUANG

URI

https://scholars.lib.ntu.edu.tw/handle/123456789/637181

URL

https://api.elsevier.com/content/abstract/scopus_id/85174676555

Abstract

Document images captured by smartphones and digital cameras are often subject to photometric distortions, including shadows, non-uniform shading, and color shift due to the imperfect white balance of sensors. Readers are confused by an indistinguishable background and content, which significantly reduces legibility and visual quality. Despite the fact that real photographs often contain a mixture of these distortions, the majority of existing approaches to document illumination correction concentrate on only a small subset of these distortions. This paper presents ScannerNet, a comprehensive method that can eliminate complex photometric distortions using deep learning. In order to exploit the different characteristics of shadow and shading, our model consists of a sub-network for shadow removal followed by a sub-network for shading correction. To train our model, we also devise a data synthesis method to efficiently construct a large-scale document dataset with a great deal of variation. Our extensive experiments demonstrate that our method significantly enhances visual quality by removing shadows and shading, preserving figure colors, and improving legibility.

Type

conference paper

ScannerNet: A Deep Network for Scanner-Quality Document Images under Complex Illumination

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)