https://scholars.lib.ntu.edu.tw/handle/123456789/638636
Title: | Consistent and Multi-Scale Scene Graph Transformer for Semantic-Guided Image Outpainting | Authors: | Yang, Chiao An Wu, Meng Lin Yeh, Raymond A. YU-CHIANG WANG |
Keywords: | Image outpainting | Scene-graph | Issue Date: | 1-Jan-2023 | Source: | Proceedings - International Conference on Image Processing, ICIP | Abstract: | The task of image outpainting extends an image beyond its boundaries with semantically plausible content. Recently, Scene Graph Transformer (SGT) introduced a transformer architecture to leverage scene graph guidance for image outpainting. Despite its success, we identified two shortcomings: (a) SGT uses a positional encoding that was originally proposed for 1D signal; (b) SGT uses a scene graph attention layer that propagates information between neighboring nodes which limited the model to learning local graph features. To address these issues, we propose incorporating Laplacian positional encoding and introducing a multiscale scene graph attention into SGT. Extensive results on MS-COCO and Visual Genome show that our proposed approach generates more plausible outpainted images with higher quality. |
URI: | https://scholars.lib.ntu.edu.tw/handle/123456789/638636 | ISBN: | 9781728198354 | ISSN: | 15224880 | DOI: | 10.1109/ICIP49359.2023.10222500 |
Appears in Collections: | 電機工程學系 |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.