簡易檢索 / 詳目顯示

研究生: 吳怡明
Yi-Ming Wu
論文名稱: 手勢辨識應用於遙控音樂播放系統
The Implementation of Gesture Recognition for Media Player System
指導教授: 蔡超人
Chau-Ren Tsai
口試委員: 蘇順豐
Shun-Feng Su
陳建中
Jiann-Jone Chen
王乃堅
Nai-Jian Wang
學位類別: 碩士
Master
系所名稱: 電資學院 - 電機工程系
Department of Electrical Engineering
論文出版年: 2009
畢業學年度: 97
語文別: 中文
論文頁數: 93
中文關鍵詞: 手識辨識哈克轉換雷登轉換
外文關鍵詞: Gesture Recognition, Hough Transform, Radon Transform
相關次數: 點閱:336下載:11
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 近幾年,居家環境的功能化、數位化已成為人們對居住環境更新的要求,也帶動了數位化家庭概念。但提出的數位化家庭概念卻鮮少與個人電腦做結合,應用於居家環境的數位化與自動化控制之中。所以在本論文提供一個新的應用,使用現今家庭最常見的個人電腦與數位攝影機建立一套可藉由手勢來遙控居家音樂播放的音樂播放系統。在影像輸入部分,我們使用數位攝影機來讀入影像;接著將讀入的影像使用背景相減法取得目標物體,擷取出左手臂影像的特徵影像。隨後使用八方位尋點法尋找左手臂特徵影像的直線特徵點,來解決Hough Transform和Radon Transform偵測直線的耗時問題。最後,我們分別將攝影機的原始影像、左手臂特徵影像和音樂播放的指令的執行結果展示在音樂播放系統的程式介面上。


    In recent years, the concept of Digital Domicile has is taken, and more and more people accept the functionalization and digitizing of house. But it still is uncommon to combine Personal Computer and concept of Digital Domicile. Therefore, in this paper, we will provide one new implement of gesture recognition for media player system. At first, in the part of input image, we capture image from the camera, then the system separate the left-arm part and the background part of image from the input images by using image processing methods, and it will generate one processed image-Arm Feature Image in the system. Afterward we find the feature point of straight line form the Arm Feature Image for reducing the time spending of Hough Transform and Radon Transform Algorithm. Finally we will display the identifying result of gesture on the application program interface and save the information as a text file.

    中文摘要 ……………………………………………………………I 英文摘要 ……………………………………………………………II 誌 謝 ……………………………………………………………III 圖 索 引 ……………………………………………………………VII 表 索 引 ……………………………………………………………XII 第一章 緒論…………………………………………………………1 1.1 研究動機與目的……………………………………………1 1.2 研究方法……………………………………………………2 1.3 論文架構……………………………………………………3 第二章 手勢遙控音樂播放系統架構………………………………4 2.1 影像輸入部份………………………………………………5 2.2 影像辨識部份………………………………………………5 2.3 影像輸出部份………………………………………………8 2.4 相關硬體使用與規格………………………………………9 2.4.1 網路攝影機初始設定………………………………………10 2.4.2 系統程式介面………………………………………………12 2.4.3 個人電腦規格………………………………………………13 2.4.4 系統操作說明………………………………………………13 第三章 影像前置處理………………………………………………14 3.1 背景相減法…………………………………………………18 3.2 小雜訊去除與平滑化處理…………………………………21 3.2.1 擴張運算……………………………………………………22 3.2.2 侵蝕運算……………………………………………………22 3.2.3 封閉運算……………………………………………………23 3.2.4 開放運算……………………………………………………24 3.2.5 雜訊濾除過程………………………………………………24 3.3 區域填充……………………………………………………25 3.3.1 物件連通法…………………………………………………25 3.3.2 區域填充流程………………………………………………29 3.4 左手臂影像擷取……………………………………………32 3.5 邊緣化………………………………………………………33 第四章 手勢辨識……………………………………………………35 4.1 Hough Transform………………………………………… 36 4.2 Radon Transform………………………………………… 40 4.3 直線偵測演算法的改善─八方位尋點法…………………44 4.4 影像座標系統………………………………………………52 4.5 直線路徑的估測……………………………………………54 4.6 直線的判斷…………………………………………………57 4.7 手勢辨識的流程……………………………………………58 第五章 系統實現與效能……………………………………………60 5.1 系統實現……………………………………………………60 5.2 效能測試說明………………………………………………67 5.2.1 定義六種音樂播放手勢……………………………………67 5.2.2 手勢辨識效能測試功能鍵…………………………………70 5.3 系統效能測試………………………………………………74 5.3.1 影像前置處理單元效能評估………………………………74 5.3.2 手勢辨識效能之評估………………………………………75 5.3.3 影像前置處理的效果………………………………………81 第六章 結論…………………………………………………………86 6.1 研究成果……………………………………………………87 6.2 發展方向……………………………………………………90 參考文獻………………………………………………………………91

    [1] M. Singh, M. Mandal and A. Basu, “Visual Gesture Recognition for Ground Air Traffic Control using the Radon Transform,” Intelligent Robots and Systems, pp. 2586-2591, 2005.
    [2] R. O. Duda and P. E. Hart, “Use of the Hough Transformation to detect lines and curves in pictures,” Communications of the ACM, Vol. 15, No. 1, pp. 11-15, 1972.
    [3] M. Seul, L. O’Gorman and M. J. Sammon, Practical Algorithms for Image Analysis: Description, Examples, and Code, Cambridge University Press, pp. 139, 2000.
    [4] C. R. Wren, A. Azarbayejani, T. Darrell and A. Pentland, “Pfinder: Realtime Tracking of the Human Body,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 19, pp. 780-785, 1997.
    [5] I. Haritaoglu, D. Harwood and L. S. Davis, “Hydra: Multiple People Detection and Tracking Using Silhouettes,” Proceedings of the Second IEEE Workshop on Visual Surveillance, pp. 6-13, June 1999.
    [6] 鍾國亮,影像處理與電腦視覺,第二版,東華書局出版,pp. 22-30,2003。
    [7] R. C. Gonzalez and R. E. Woods, Digital Image Processing 2/e, Prentice Hall, pp. 555-575, 2002.
    [8] J. Serra, Image Analysis and Mathematical Morphology, Acadwmic Press, pp. 115-130, 1982.

    [9] T. C. Chen and K. L. Cheng, “A New Randomized Algorithm for Detecting Line,” Real-Time Imaging, Vol. 7, No. 6, pp. 473-482, 2001.
    [10] H. K. Yuen, J. Illingworth and J. Kittler, “Detecting Partially Occluded Ellipses Using the Hough Transform,” Image and Vision Computing, Vol. 7, No. 1, pp. 31-37, 1989.
    [11] P. Rattey and A. Lindgren, “Sampling the 2-D Radon Transform,” IEEE Transactions on Speech and Signal Processing, Vol. 29, Issue. 5, pp. 994-1002, Oct. 1996.
    [12] S. Tabbone and L. Wendling, “Technical Symbols Recognition Using the Two-Dimensional Radon Transform,” Proceedings of IEEE International Conference on Pattern Recognition, Vol. 3, pp. 200-203, Aug. 2002.
    [13] K. V. Hansen and P. A. Toft, “Fast Curve Estimation Using Preconditioned Generalized Radon Transform,” IEEE Transactions on Image Processing, Vol. 5, Issue 12, pp. 1651-1661, Dec. 1996.
    [14] T. T. Nguyen, X. D. Pham, D. Kim and J. W. Jeon, “An Improvement of the Standard Hough Transform to Detect Line Segments,” IEEE International Conference on Industrial Technology, pp. 1-6, April 2008.
    [15] N. Aggarwal and W. C. Karl, “Line Detection in Images through Regularized Hough Transform,” Proceedings of IEEE International Conference on Image Processing, Vol. 3, pp. 873-876, Sept. 2000.
    [16] Y. C. Cheng, “The Probabilistic Hough Transform with Localized Search Guided by Evidence Clusters,” IEEE Southwest Symposium on Image Analysis and Interpretation, pp. 16-20, 2006.
    [17] T. W. Chian, S. S. Kok, S. G. Lee, Gerald and O. K. Wei, “Gesture Based Control of Mobile Robots,” IEEE Conference on Innovative Technologies in Intelligent Systems and Industrial Applications, pp. 20-25, July 2008.
    [18] C. Hu, M. Q. Meng, P. X. Liu and X. Wang, “Visual Gesture Recognition for Human-Machine Interface of Robot Teleoperation,” IEEE/RSJ International Conference on Intelligent Robots and Systems, Vol. 2, pp. 1560-1565, Oct. 2003.
    [19] Y. Sriboonruang, P. Kumhom and K. Chamnongthai, “Visual Hand Gesture Interface for Computer Board Game Control,” IEEE Tenth International Symposium on Consumer Electronics, pp. 1-5, 2006.
    [20] D. J. Kang, J. W. Choi and I. S. Kweon, “Finding and Tracking Road Lanes Using Line-Snakes,” Proceedings of IEEE on Intelligent Vehicles Symposium, pp. 189-194, Sept. 1996.

    QR CODE