Algorithm and Architecture Design for Stereo video depth generation

LAI,  YEN-CHIEH

Algorithm and Architecture Design for Stereo video depth generation

Date Issued

2011

Date

2011

Author(s)

LAI, YEN-CHIEH

URI

http://ntur.lib.ntu.edu.tw//handle/246246/256773

Abstract

Digital video technology has played an important role in our daily life. With the evolution of the display technologies, display system can provide higher visual quality to enrich human life. As 3D display technology matures, human aspires to experience more reality. The 3D video signal processing has become an active topic in the visual processing field. The depth generation from content is one of the important parts in 3D video processing. One typical way is the 2D-to-3D conversion focusing on extracting the depth information from the 2D image. The other topic focuses on the depth generation system from stereo-views sequence. Compared to the 2D-to-3D algorithms, the depth from stereo-view sequence can provide the better depth quality and is more suitable to reconstruct the virtual view. The depth generation from stereo-views system can be applied to the multi-view 3D displays and the free viewpoint display system. In depth generation from stereo-views system is based on the stereo matching algorithm. Stereo matching can be formulated as an energy minimization problem on a 2D Markov Random Filed(MRF). Among many MRF global optimization method, belief propagation gives high quality and has highly potential to achieve real-time processing. However, because of costly iterative operations and high memory and bandwidth demand, the original belief propagation is computationally expensive for real-time system implementation. In this thesis, we focus on the algorithm and hardware architecture design of stereo matching and depth generation from the stereo vision. In first, we analyze the hardware cost in the stereo matching and belief propagation system, and indicate the challenge and bottleneck in the memory and bandwidth resource requirement. Secondly, we propose tile-based belief propagation and message reduction algorithm to greatly reduce the memory and bandwidth cost and provide similar performance compared to the original belief propagation. Moreover, we design the fast message computation PE for belief propagation to reduce the complexity of message construction. Third, we propose the trilateral-filter-based depth post processing to correct the error in the occlusion region and overcome the matching constraint in the stereo vision. We Finally, an efficient VLSI architecture of real-time, high-performance stereo depth generation system is presented. The design combines the fast message computation method, the tile-based BP, message reduction ,with the trilateral-filter-based post processing to create a parallel and flexible architecture. These techniques include a 4-stage pipeline, fully-parallel processing elements for message update, and a data reuse scheme. When operating at 227 MHz, the architecture can generate HDTV720p disparity maps at 30 fps.

Subjects

STEREO MATCHING

BELIEF PROPOGATION

Type

thesis

File(s)

Name

ntu-100-R98943001-1.pdf

Size

23.32 KB

Format

Adobe PDF

Checksum

(MD5):527533734c4954935055de694baba8a6

Algorithm and Architecture Design for Stereo video depth generation

關於 (About)

聯絡資訊 (Contact Us)

相關網站 (Useful Links)

關於開放取用 (Open Access, OA)

出版社期刊論文授權政策 (Copyright)

使用說明 (Instructions)

登入說明 (Sign-in)

匯入著作 (Submission)