簡易檢索 / 詳目顯示

研究生: 簡名彥
Ming-yen Chien
論文名稱: 基於LBP特徵空間之線上多姿態人臉建模及其於即時臉部追蹤與辨識之應用
LBP-Based On-line Multi-Pose Face Model Learning and the Application in Real-time Face Tracking and Recognition
指導教授: 李敏凡
Min-Fan Ricky Lee
口試委員: 林其禹
Chyi-Yeu Lin
邱士軒
Shih-Hsuan Chiu
學位類別: 碩士
Master
系所名稱: 工程學院 - 自動化及控制研究所
Graduate Institute of Automation and Control
論文出版年: 2012
畢業學年度: 100
語文別: 中文
論文頁數: 101
中文關鍵詞: LBP特徵空間多姿態人臉追蹤線上人臉識別
外文關鍵詞: LBP feature space, Multi-pose Face Tracking, On-line Face Recognition
相關次數: 點閱:320下載:8
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報

由於人臉並非剛體,表情及姿態的改變會導致影像有很大的變異,並且還有許多外在因素如光照、遮蔽等影響。因此要達成穩定的追蹤需要有多個不同姿態並且穩健的人臉模型以進行量測。另外,當人臉在追蹤過程可能因遮蔽區域過大而導致丟失,為了能夠尋回追蹤的對象,個人的特徵需事先被學習。然而,如何克服追蹤過程中的干擾,並線上建構出多姿態的個人人臉特徵模型是個挑戰。而線上人臉識別的應用中,如果能夠收集愈充份的資訊,則能夠提升人臉識別的準確率,然而資訊的收集也會因為追蹤過程的穩定性所影響,因此如何收集足夠且正確的資訊也是一個難題。
在本篇論文中,我們提出了一個整合泛用人臉模型及個人人臉模型的追蹤演算法,並利用人臉色彩核長條圖的輔助,以達成穩定的人臉追蹤。其中泛用人臉模型可以協助個人人臉模型的線上學習以建構出多姿態的個人人臉模型。經由多姿態的個人人臉模型,除了可以在追蹤過程中進行目標的對應,也能夠在目標因特殊狀況而丟失時,透過丟失尋回的機制將丟失的目標尋回。由於線上學習的個人人臉模型是利用LBP特徵空間所建構的區域紋理特徵,可以抵抗局部遮蔽,更進一步的,其所學習的資訊可被直接使用於人臉識別。
經由我們的實驗証實,透過整合泛用人臉模型及個人人臉模型的追蹤演算法所建構的多姿態個人人臉模型,可以在複雜背景、光度變化、局部遮蔽、姿態改變等不同因素干擾下完成目標的追蹤,並在人臉追蹤器丟失時能夠正確的找回對應的人臉。而其個人人臉模型所包含的資訊能夠直接被應用於線上人臉識別,並且透過多個人臉模型的整合可以有效提升其人臉識別的準確性。經由測試,其辨識的正確率可達到70%以上。


Because human face is not a rigid object, the changes of face expression or poses would cause huge variation in the image. Furthermore, there are other disturbances such as the varying of illumination and partial occlusions. Therefore, it is necessary to have a robust multi-pose measurement model to achieve stable tracking. In addition, tracked face could be lost when the occluded region is too large. To recover the tracking, specific features are needed to learn previously. However, how to overcome the disturbance and construct the multi-pose specific feature model is a challenge. Regarding to on-line face recognition, the more personal information we collect, the more accurate result we’ll get. The collection of such information would also be affected by the instability of tracking. How to obtain correct information for on-line face recognition is a problem.
In this thesis, we propose an integrated tracking algorithm combining generic face model and specific face model. We use the color kernel histogram of human face to assistant the integration of combining two models, and use generic face model to help construct multi-pose specific face model. Via the specific face model, even if the tracking target is lost in some situations, the model can help to find the losing target. Because the specific face model is constructed by LBP texture feature, it can achieve robust tracking, including partial occlusion. And the learned information of specific face model can be used in face recognition.
In our experiment, the purposed method can achieve good tracking result in the condition of complex background, varying of illumination, partial occlusion, changes of poses and so on. The target losing during tracking can be recovered correctly as well. For face recognition, the multi-pose specific face model can provide sufficient information to achieve acceptable accuracy rate. Through the experiment the accuracy rate is above 70%.

摘 要 I Abstract II 誌 謝 III 圖目錄 VII 表目錄 XI Chapter 1 緒論 1 1.1. 研究背景 1 1.2. 文獻探討 2 1.3. 研究動機與目的 6 1.4. 論文架構 6 Chapter 2 分析 8 2.1. 人臉追蹤演算法 8 2.2. 量測模型的建構 10 2.3. 遮蔽問題處理 12 2.4. 人臉識別 13 Chapter 3 方法 16 3.1. 泛用人臉模型的建立 17 3.1.1. Local Binary Pattern的操作原理 18 3.1.2. Adaboost 演算法 20 3.1.3. 結合MB-LBP特徵空間的弱分類器 22 3.1.4. Cascaded Adaboost分類器 23 3.1.5. 多姿態的人臉評估器 24 3.2. 個人人臉模型的建構 25 3.3. Condensation 演算法的結合 29 3.3.1. Condensation演算法的回顧 29 3.3.2. Condensation演算法與量測模型的結合 31 3.4. 泛用人臉模型與個人人臉模型的整合 33 3.5. 色彩核長條圖的建構與量測 36 3.5.1. 機率密度函數 37 3.5.2. 顏色特徵 38 3.5.3. 核函數 38 3.5.4. 相似度函數 39 3.6. 線上更新 40 3.7. 人臉追蹤的丟失與尋回 41 3.8. 人臉識別 43 Chapter 4 實驗結果 45 4.1. 測試環境介紹 45 4.2. 泛用人臉模型的離線訓練 45 4.3. 泛用人臉模型的追蹤效能 47 4.4. 個人人臉模型的線上建構結果 48 4.4.1. 人臉樣版的建構 48 4.4.2. 各別姿態的高斯機率模型 49 4.4.3. 各個高斯模型的量測機率 50 4.4.4. 個人人臉模型的機率輸出 54 4.5. 多姿態人臉追蹤器演算法的實現 58 4.5.1. 常規實驗 60 4.5.2. 追蹤過程的遮蔽影響實驗 64 4.5.3. 追蹤過程中複雜背景的實驗 68 4.5.4. 追蹤過程中光照變化的實驗 73 4.5.5. 追蹤過程中姿態改變實驗 77 4.5.6. 追蹤過程中多人臉交互影響實驗 81 4.5.7. 追蹤過程中丟失尋回實驗 85 4.5.8. 追蹤過程中尺度變化及線上更新實驗 86 4.5.9. 演算法處理速度評估 89 4.6. 線上人臉識別的實現 90 4.6.1. 未進行人臉對齊 90 4.6.2. 進行人臉對齊 91 Chapter 5 討論與總結 93 5.1. 討論 93 5.2. 總結 95 5.3. 未來展望 96 參考文獻 97 Biography 103

[1] K. K. Sung and T. Poggio, "Example-based learning for view-based human face detection," IEEE Transactions on Pattern Analysis and Machine Intelligence,vol. 20, pp. 39-51, 1998.
[2] H. A. Rowley, S. Baluja, and T. Kanade, "Neural network-based face detection," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 20, pp. 23-38, 1998.
[3] P. Viola and M. J. Jones, "Rapid object detection using a boosted cascade of simple features," In Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. I-511-I-518 vol.1, 2001.
[4] P. Viola and M. J. Jones, "Robust real-time face detection," In Proceedings. Eighth IEEE International Conference on Computer Vision, pp. 747-747, 2001.
[5] P. Viola and M. J. Jones, "Fast multi-view face detection," TR2003-96, Mitsubishi Electric Research Laboratories, August 2003.
[6] J. Friedman, T. Hastie, and R. Tibshirani, "Additive logistic regression: a statitical view of boosting," Ann, Statist., vol. 28, no. 2, pp.337-374,2000.
[7] 謝元澄, "使用聚類來加速 Adaboost 並實現噪聲數據探測," Journal of Software, vol. 21, pp. 1889-1897, 2010.
[8] A. Hadid, J. Y. Heikkila, O. Silven, and M. Pietikainen, "Face and eye detection for person authentication in mobile phones," In First ACM/IEEE International Conference on Distributed Smart Cameras, pp. 101-108, 2007.
[9] C. Demirkir and B. Sankur, "Face detection in cluttered scenes using look-up table based boosting algorithm," In Proceedings of the IEEE 13th Signal Processing and Communications Applications Conference , pp. 460-463, 2005.
[10] M. Isard and A. Blake, "Condensation: conditional density propagation for visual tracking," International Journal of Computer Vision, vol. 29, pp. 5-28, 1998.
[11] M. S. Arulampalam, S. Maskell, N. Gordon, and T. Clapp, "A tutorial on particle filters for online nonlinear/non-gaussian bayesian tracking," IEEE Transactions on Signal Processing, vol. 50, pp. 174-188, 2002.
[12] W. Peng and J. Qiang, "Robust face tracking via collaboration of generic and specific models," IEEE Transactions on Image Processing, vol. 17, pp. 1189-1199, 2008.
[13] Y. Utsumi and Y. Iwai, "Face tracking and recognition by using omnidirectional sensor network," In Third ACM/IEEE International Conference on Distributed Smart Cameras, pp. 1-8, 2009.
[14] R. C. Verma, C. Schmid, and K. Mikolajczyk, "Face detection and tracking in a video by propagating detection probabilities," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, pp. 1215-1228, 2003.
[15] Z. Wenlong and S. M. Bhandarkar, "A boosted adaptive particle filter for face detection and tracking," In 2006 IEEE International Conference on Image Processing, pp. 2821-2824, 2006.
[16] L. Wang, T. Tan, and W. Hu, "Face tracking using motion-guided dynamic template matching," presented at the The 5th Asian Conference on Computer Vision Melbourne, Australia, 2002.
[17] D. Comaniciu, V. Ramesh, and P. Meer, "Real-time tracking of non-rigid objects using mean shift," In Proceedings. IEEE Conference on Computer Vision and Pattern Recognition, pp. 142-149 vol.2, 2000.
[18] L. Kuang-Chih and D. Kriegman, "Online learning of probabilistic appearance manifolds for video-based recognition and tracking," In IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. 852-859 vol. 1, 2005.
[19] L. Yongmin, G. Shaogang, and H. Liddell, "Support vector regression and classification based multi-view face detection and recognition," In Proceedings. Fourth IEEE International Conference on Automatic Face and Gesture Recognition, pp. 300-305, 2000.
[20] J. Myung-Ho and K. Hang-Bong, "Illumination invariant face tracking and recognition," In 5th European Conference on Visual Media Production (CVMP 2008), pp. 1-6,2008.
[21] R. C. Luo, C. T. Liao, and Y. J. Chen, "Robot - human face tracking and recognition using relative affine structure," In IEEE Workshop on Advanced robotics and Its Social Impacts, pp. 1-6, 2008.
[22] G. R. Bradski, "Computer vision face tracking for use in a perceptual user interface," Intel Technology Journal, vol. 2, pp. 13-27, 1998.
[23] Z. H. Khan, I. Y. Gu, and A. G. Backhouse, "Robust visual object tracking using multi-mode anisotropic mean shift and particle filters," IEEE Transactions on Circuits and Systems for Video Technology , vol. 21, pp. 74-87, 2011.
[24] W. Xian, L. Lihong, L. Jianhuang, and H. Jian, "A framework of face tracking with classification using CAMShift-C and LBP," In Fifth International Conference on Image and Graphics, pp. 217-222, 2009.
[25] X. Du, C. Liu, and Y. Yu, "Analysis of detection and track on partially occluded face," International Forum on Information Technology and Applications, pp. 158-161, 2009.
[26] J. Harguess, H. Changbo, and J. K. Aggarwal, "Occlusion robust multi-camera face tracking," In IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops, pp. 31-38, 2011.
[27] W. Ying, Y. Ting, and H. Gang, "Tracking appearances with occlusions," In Proceedings. 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, pp. I-789-I-795 vol.1, 2003.
[28] B. Peng, Q. Hong, W. Anhua, and L. Yu, "Person-tracking with occlusion using appearance filters," In 2006 IEEE/RSJ International Conference on Intelligent Robots and Systems, pp. 1805-1810,2006.
[29] V. Ngoc-Son and A. Caplier, "Efficient statistical face recognition across pose using local binary patterns and gabor wavelets," In IEEE 3rd International Conference on Biometrics: Theory, Applications, and Systems, pp. 1-5, 2009.
[30] S. Liao, X. Zhu, Z. Lei, L. Zhang, and S. Z. Li, "Learning multi-scale block local binary patterns for face recognition," presented at the International Conference on Biometrics (ICB), 2007.
[31] W. Yunlong, X. Mei, S. Rui, and L. Tao, "Face location with LBP scale transform," In 2010 International Conference on Communications, Circuits and Systems (ICCCAS), pp. 347-350, 2010.
[32] T. Ojala, M. Pietikainen, and T. Maenpaa, "Multiresolution gray-scale and rotation invariant texture classification with local binary patterns," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, pp. 971-987, 2002.
[33] T. Ahonen, A. Hadid, and M. Pietikainen, "Face description with local binary patterns: application to face recognition," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 28, pp. 2037-2041, 2006.
[34] W. Bo, A. Haizhou, H. Chang, and L. Shihong, "Fast rotation invariant multi-view face detection based on real adaboost," In Proceedings. Sixth IEEE International Conference on Automatic Face and Gesture Recognition, pp. 79-84, 2004.
[35] Y. Freund and R. E. Schapire., "A decision-theoretic generalization of on-line learning and an application to boosting," In Computational Learning Theory: Eurocolt '95, pp. 23-37, 1995.
[36] R. E. Schapire and Y. Singer, Improved boosting algorithms using confidence-rated predictions vol. 37: Kluwer Academic, 1999.
[37] L. Zhang, R. Chu, S. Xiang, S. Liao, and S. Z. Li, "Face detection based on multi-block LBP representation," In Proceedings of IAPR/IEEE International Conference on Biometrics, (ICB-2007). Seoul, Korea, August 2007.
[38] D. Comaniciu, V. Ramesh, and P. Meer, "Kernel-based object tracking," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, pp. 564-577, 2003.
[39] M. K. Osman, M. Y. Mashor, H. Jaafar, R. A. A. Raof, and N. H. Harun, "Performance comparison between RGB and HSI linear stretching for tuberculosis bacilli detection in Ziehl-Neelsen tissue slide images," In 2009 IEEE International Conference on Signal and Image Processing Applications (ICSIPA),pp. 357-362,2009.
[40] D. Comaniciu and V. Ramesh, "Mean shift and optimal prediction for efficient object tracking," In Proceedings. 2000 International Conference on Image Processing, pp. 70-73 vol.3, 2000.
[41] A. Djouadi, O. Snorrason, and F. D. Garber, "The quality of training sample estimates of the Bhattacharyya coefficient," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 12, pp. 92-97, 1990.
[42] M. Castrill’on-Santana, O. D’eniz-Su’arez, C. G. Guerra, and M. Hernandez, "Real-time detection of multiple faces at different resolutions in video streams," Journal of Visual Communication and Image Representation, vol. 18, pp. 130-140, 2007.
[43] H. E. Lee, "Target tracking of autonomous mobile robots," 國立臺灣科技大學自動化及控制研究所碩士論文, 2010.

QR CODE