Algorithm and Architecture Design for Stereo video depth generation
Date Issued
2011
Date
2011
Author(s)
LAI, YEN-CHIEH
Abstract
Digital video technology has played an important role in our daily life. With the evolution of the display technologies, display system can provide higher visual quality to enrich human life. As 3D display technology matures, human aspires to experience more reality. The 3D video signal processing has become an active topic in the visual processing field. The depth generation from content is one of the important parts in 3D video processing. One typical way is the 2D-to-3D conversion focusing on extracting the depth information from the 2D image. The other topic focuses on the depth generation system from stereo-views sequence. Compared to the 2D-to-3D algorithms, the depth from stereo-view sequence can provide the better depth quality and is more suitable to reconstruct the virtual view. The depth generation from stereo-views system can be applied to the multi-view 3D displays and the free viewpoint display system.
In depth generation from stereo-views system is based on the stereo matching algorithm. Stereo matching can be formulated as an energy minimization problem on a 2D Markov Random Filed(MRF). Among many MRF global optimization method, belief propagation gives high quality and has highly potential to achieve real-time processing. However, because of costly iterative operations and high memory and bandwidth demand, the original belief propagation is computationally expensive for real-time system implementation.
In this thesis, we focus on the algorithm and hardware architecture design of stereo matching and depth generation from the stereo vision. In first, we analyze the hardware cost in the stereo matching and belief propagation system, and indicate the challenge and bottleneck in the memory and bandwidth resource requirement. Secondly, we propose tile-based belief propagation and message reduction algorithm to greatly reduce the memory and bandwidth cost and provide similar performance compared to the original belief propagation. Moreover, we design the fast message computation PE for belief propagation to reduce the complexity of message construction. Third, we propose the trilateral-filter-based depth post processing to correct the error in the occlusion region and overcome the matching constraint in the stereo vision. We Finally, an efficient VLSI architecture of real-time, high-performance stereo depth generation system is presented. The design combines the fast message computation method, the tile-based BP, message reduction ,with the trilateral-filter-based post processing to create a parallel and flexible architecture. These techniques include a 4-stage pipeline, fully-parallel processing elements for message update, and a data reuse scheme. When operating at 227 MHz, the architecture can generate HDTV720p disparity maps at 30 fps.
Subjects
STEREO MATCHING
BELIEF PROPOGATION
Type
thesis
File(s)![Thumbnail Image]()
Loading...
Name
ntu-100-R98943001-1.pdf
Size
23.32 KB
Format
Adobe PDF
Checksum
(MD5):527533734c4954935055de694baba8a6
