Residual Graph Attention Network and Expression-Respect Data Augmentation Aided Visual Grounding
Journal
Proceedings - International Conference on Image Processing, ICIP
End Page
330
ISBN
9781665496209
Date Issued
2022-01-01
Author(s)
Abstract
Visual grounding aims to localize a target object in an image based on a given text description. Due to the innate complexity of language, it remains challenging to reason over complex expressions and to infer the underlying relationship between an expression and the corresponding object in an image. To address these issues, we propose a residual graph attention network for visual grounding. The proposed approach first builds an expression-guided relation graph, then performs multi-step reasoning over it, and finally matches the target object. Its residual structure allows deeper layers than other graph network approaches, enabling better visual grounding for complex expressions. Moreover, to increase the diversity of the training data, we perform an expression-respect data augmentation based on copy-paste operations between pairs of source and target images. In extensive experiments, the proposed approach outperforms other state-of-the-art graph network-based approaches, demonstrating its effectiveness.
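The abstract does not give implementation details, but the stated idea of stacking expression-conditioned graph attention steps with residual connections can be illustrated with a minimal sketch. The code below is an assumption-laden illustration, not the authors' architecture: node features, the pooled expression embedding, the adjacency matrix of the relation graph, and all layer names and dimensions are hypothetical.

```python
import torch
import torch.nn as nn


class ResidualGraphAttentionLayer(nn.Module):
    """One reasoning step: expression-conditioned attention over object nodes,
    messages passed along relation-graph edges, plus a residual connection."""

    def __init__(self, dim):
        super().__init__()
        self.query = nn.Linear(dim, dim)   # expression -> query
        self.key = nn.Linear(dim, dim)     # node features -> keys
        self.value = nn.Linear(dim, dim)   # node features -> values
        self.norm = nn.LayerNorm(dim)

    def forward(self, nodes, expr, adj):
        # nodes: (N, D) object features; expr: (D,) expression embedding;
        # adj: (N, N) expression-guided relation graph (hard or soft edges)
        q = self.query(expr)                           # (D,)
        k = self.key(nodes)                            # (N, D)
        v = self.value(nodes)                          # (N, D)
        scores = (k @ q) / k.shape[-1] ** 0.5          # (N,) node relevance
        attn = torch.softmax(scores, dim=0)            # (N,)
        messages = adj @ (attn.unsqueeze(-1) * v)      # propagate along edges
        # residual connection lets many reasoning steps be stacked stably
        return self.norm(nodes + messages)


class ResidualGraphAttentionNetwork(nn.Module):
    """Stack several reasoning steps, then score each node against the expression."""

    def __init__(self, dim, num_steps=4):
        super().__init__()
        self.steps = nn.ModuleList(
            ResidualGraphAttentionLayer(dim) for _ in range(num_steps)
        )
        self.score = nn.Linear(dim, 1)

    def forward(self, nodes, expr, adj):
        for step in self.steps:
            nodes = step(nodes, expr, adj)
        logits = self.score(nodes * expr).squeeze(-1)  # (N,) matching scores
        return logits


if __name__ == "__main__":
    net = ResidualGraphAttentionNetwork(dim=256, num_steps=4)
    nodes = torch.randn(10, 256)                # 10 candidate objects
    expr = torch.randn(256)                     # pooled expression embedding
    adj = (torch.rand(10, 10) > 0.5).float()    # hypothetical relation graph
    print(net(nodes, expr, adj).shape)          # torch.Size([10])
```

The expression-respect copy-paste augmentation can likewise be sketched under assumptions: the region described by the expression is cropped from a source image and pasted into a target image, so the expression still refers to a valid object in the augmented image. The function below is hypothetical and ignores details such as blending, scaling, and occlusion handling.

```python
import random
from PIL import Image


def copy_paste_augment(src_img, src_box, tgt_img, margin=10):
    """Paste the object crop described by the expression into a target image.

    src_box is (x1, y1, x2, y2) around the referred object; returns the
    augmented target image and the box of the pasted object in it.
    """
    x1, y1, x2, y2 = src_box
    patch = src_img.crop((x1, y1, x2, y2))
    w, h = patch.size
    W, H = tgt_img.size
    # random location; assumes the patch roughly fits inside the target image
    px = random.randint(margin, max(margin, W - w - margin))
    py = random.randint(margin, max(margin, H - h - margin))
    out = tgt_img.copy()
    out.paste(patch, (px, py))
    return out, (px, py, px + w, py + h)
```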
Subjects
Expression-respect data augmentation | Residual graph attention network | Visual grounding
Type
conference paper