https://scholars.lib.ntu.edu.tw/handle/123456789/114983
Title: | 前景切割與人體姿態估測 (Foreground Segmentation and Human Pose Estimation) |
Author: | 黃世勳 Huang, Shih-Shinh |
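Keywords: | Human Pose Analysis; Foreground Segmentation; Pose Estimation; Bayesian Network; Reversible Jump Markov Chain Monte Carlo (RJMCMC); Data-Driven Strategy |
Issue Date: | 2007 |
Abstract: | Human motion analysis plays an important and indispensable role in human-computer interaction applications such as virtual reality, intelligent surveillance, and intelligent user interfaces, and vision-based human motion analysis has drawn wide discussion in academia. In general it comprises four parts: foreground segmentation, pedestrian detection, pose estimation, and pose tracking. This thesis focuses on two of these topics, foreground segmentation and pose estimation, reviews the current literature, and proposes an effective solution for each.
For foreground segmentation, we propose a probabilistic method that automatically segments the foreground from video. To reduce computational complexity and overcome noise, we propose an edge-based change detection algorithm that identifies image regions likely to be foreground. Then, through a Bayesian network, we integrate motion information to further classify the detected regions as foreground or background, which filters out shadow effects, noise, and uncovered background. The foreground segmentation problem can be stated as follows: given two consecutive images and the foreground segmentation result from the previous time step, we define the joint conditional probability of the motion displacement field and the foreground segmentation result through geometric motion constraints and a background observation model. An optimization algorithm then solves for the motion displacement field and the foreground segmentation simultaneously.
Human pose provides useful information as a basis for sensing and analyzing human behavior. We therefore propose a probabilistic framework for estimating human pose in images. In this study, the main cue used to evaluate observations is the human silhouette. First, the likelihood is defined through the difference between the foreground silhouette and the predicted human body model; the prior is defined by introducing geometric constraints among body parts; finally, the RJMCMC approximation algorithm searches the pose parameter space for likely solutions. To speed up convergence, we use a data-driven strategy to design effective proposal functions.
Human dynamics analysis is currently one of the most active research areas in computer vision because it is an important and fundamental component of many human-computer interaction applications, such as virtual reality, smart surveillance, and intelligent user interfaces. In this thesis, we address two issues in human dynamics analysis: foreground segmentation and pose estimation. We present a probabilistic approach for automatically segmenting foreground objects from a video sequence. To perform foreground segmentation at a more semantic, region-based level, we propose an edge-based change detection algorithm that automatically identifies regions with potential appearance variation due to object motion. We then incorporate motion information to perform foreground segmentation under a Bayesian framework. Given two consecutive images, the joint probability density function of the motion vector field and the foreground segmentation mask is defined from constraints that include the observation likelihood and a spatiotemporal constraint. Maximizing this function yields the foreground segmentation and the motion estimate simultaneously, in a mutually beneficial manner.
Human pose is a natural cue from which a computer system can understand human intention. We propose a new statistical framework for estimating human pose using a reversible jump Markov chain Monte Carlo (RJMCMC) approach, which tries to recover the human body configuration from its silhouette. The problem is formulated as computing the maximum a posteriori (MAP) estimate of the probability density function of the pose configuration given the currently observed image. Equivalently, pose inference can be viewed as traversing the different subspaces of the solution space. Using a data-driven mechanism, RJMCMC can explore this solution space much more efficiently. |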
URI: | http://ntur.lib.ntu.edu.tw//handle/246246/53646 |
Other Identifiers: | en-US |
Appears in Collections: | Department of Computer Science and Information Engineering |
File | Description | Size | Format | |
---|---|---|---|---|
ntu-96-D89922013-1.pdf | | 23.31 kB | Adobe PDF | View/Open |
Items in the IR system are protected by copyright, with all rights reserved, unless their copyright terms are specifically indicated otherwise.
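
The two inference problems summarized in the abstract can be written compactly. What follows is only a notational sketch. The symbols $I_t$ (image at time $t$), $S_t$ (foreground segmentation mask), $D_t$ (motion displacement field), $\theta$ (pose parameters), $Z$ (observed silhouette), and $q$ (proposal distribution) are assumptions introduced here for illustration and are not taken from the thesis. The acceptance rule shown is the generic Metropolis-Hastings form, not the specific moves designed in the thesis.

```latex
\begin{align}
  % Joint foreground segmentation and motion estimation
  (\hat{D}_t, \hat{S}_t)
    &= \arg\max_{D_t,\, S_t} \; p\!\left(D_t, S_t \mid I_{t-1}, I_t, S_{t-1}\right) \\
  % MAP pose estimate: silhouette likelihood times geometric prior
  \hat{\theta}
    &= \arg\max_{\theta} \; p(\theta \mid Z)
     = \arg\max_{\theta} \; p(Z \mid \theta)\, p(\theta) \\
  % Generic acceptance probability for a proposed move \theta \to \theta'
  \alpha(\theta \to \theta')
    &= \min\!\left\{ 1,\;
       \frac{p(Z \mid \theta')\, p(\theta')\, q(\theta \mid \theta')}
            {p(Z \mid \theta)\, p(\theta)\, q(\theta' \mid \theta)} \right\}
\end{align}
```

For moves that change the dimension of $\theta$, reversible jump MCMC additionally multiplies this ratio by the Jacobian determinant of the mapping between the two parameter spaces. A data-driven proposal, as described in the abstract, would correspond to choosing $q$ so that it depends on image evidence rather than on the current state alone.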