簡易檢索 / 詳目顯示

研究生: 李慶銘
Ching-Ming Li
論文名稱: 即時影音教學傳播系統的實現
The Implementation of the Real-Time Video-Audio Teaching Communication System
指導教授: 蔡超人
Chau-Ren Tsai
口試委員: 蘇順豐
Shun-Feng Su
郭景明
Jing-ming Guo
王乃堅
Nai-Jian Wang
學位類別: 碩士
Master
系所名稱: 電資學院 - 電機工程系
Department of Electrical Engineering
論文出版年: 2011
畢業學年度: 99
語文別: 中文
論文頁數: 139
中文關鍵詞: 數位信號處理器雙攝影機架構PTZ攝影機姿態分析
外文關鍵詞: Digital Signal Processor, Dual-Camera Module, PTZ Camera, Posture Analysis
相關次數: 點閱:265下載:2
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 在一般學校或補習班教室等演講場合,常會使用攝影機在旁做側錄,以供當時未參與的學員有二次學習的機會,側錄通常是以人工的方式去操作攝影機,因此本計畫研究以DSP實現即時影音教學傳播系統為目的,希望能全自動錄影以取代人力資源。由於數位訊號處理器(DSP:Digital Signal Processor)具有執行速度較快,整體效能佳、特殊硬體及指令設計可達成真正的即時運作的優點,因此本系統實現在德州儀器(Texas Instrument)之TMS320DM642 EVM模組上並搭配雙攝影機架構進行協同追蹤,其中場景攝影機做目標物偵測及姿態分析,並透過座標轉換引導PTZ(Pan/Tilt/Zoom)攝影機將講師置於畫面並對人物特殊動作特寫,適時的縮放畫面使呈現更清晰的教學畫面。另外在教學聲音則由無線麥克風接收模組擷取,最後再透過網路將影像與聲音傳送至遠端電腦並錄製成教學影片,如此可建構出即時影音教學傳播系統。


    In the ordinary school, cram school and other lecture occasions, we use the camera to record the program contents in order to let the audience who was absent can learn the program contents. The record works usually do by someone. Accordingly, the purpose of this research is to implement a real-time audio-video teaching communication system to replace it. The Digital Signal Processor (DSP) has many benefits such as high performance, small size and standalone for this reason. This system will combine TMS320DM642 Evaluation Module with dual-camera Module that can track the target in coordination, one of the cameras for target detection and posture analysis is field camera, via coordinates transformation it can guide PTZ camera to make the speaker stay in the frame to enlarge the frame, and zoom screen timely to present clearer teaching images. Beside, the voice in teaching is captured by wireless microphone receiver module. Finally, the image and voice will be transmitted to the remote computer and record as teaching film. Hereby develop a real-time video-audio teaching communication system.

    摘 要I AbstractII 誌 謝III 目 錄IV 圖 索 引VIII 表 索 引XV 第一章 緒論1 1.1 研究動機與目的1 1.2 研究方法2 1.3 論文架構3 第二章 系統架構5 2.1 目標物偵測程序6 2.2 PTZ攝影機曝光控制程序7 2.3 背景更新程序9 2.4 目標物姿態分析程序10 2.5 PTZ攝影機追蹤程序12 2.6 網路傳輸程序13 2.7 硬體規格與配置14 第三章 目標物偵測與背景更新20 3.1 前景物件萃取與陰影偵測20 3.1.1 前景區塊萃取導論20 3.1.2 背景相減法22 3.1.3 陰影濾除24 3.1.4 型態學處理26 3.1.5 移動目標物選取的連通標記法27 3.2 PTZ攝影機曝光控制30 3.2.1 曝光參數對影像亮度之影響31 3.2.2 曝光參數調整方法37 3.3 背景更新38 3.3.1 常用背景更新方法38 3.3.2 物件基準背景更新41 3.4 前景萃取使用線性組合語言實現43 3.4.1 TMS320DM642程式效益改善方法43 3.4.2 影像相減44 3.4.3 位元編碼應用於型態學47 第四章 姿態分析與PTZ攝影機追蹤49 4.1 建立上半身骨架50 4.1.1 頭部端點定位50 4.1.2 質心點計算51 4.1.3 邊緣偵測與外圍輪廓建立54 4.1.4 特徵點建立56 4.1.15 手部端點定位58 4.2 姿態分析方法60 4.2.1 碼本分類與向量量化(Vector Quantization)[34]60 4.2.2 姿態碼判別64 4.2.3 寫字姿態之判定條件64 4.3 PTZ攝影機之追蹤65 4.3.1 雙攝影機之架構66 4.3.2 目標物追蹤模式70 4.3.3 手部追蹤模式72 4.3.4 雙目標物追蹤條件之判定74 第五章 DM642音訊擷取程序76 5.1AIC23晶片設定76 5.2音訊擷取實現82 第六章 DM642燒錄程序85 6.1 二階導入之啟動程序85 6.2 FLASH燒錄程序91 6.2.1 FLASH操作方式91 6.2.2 FLASH燒錄程序實現92 第七章 系統實現與效能測試109 7.1 系統軟體架構109 7.2 網路傳輸架構112 7.2.1 網路資料傳輸流程114 7.2.2 遠端人機介面介紹119 7.3 系統實現與效能測試123 第八章 結論128 8.1 研究成果128 8.2 未來發展131 參 考 文 獻133

    [1]C. R. Wren, A. Azarbayejani, T. Darrell and A. Pentland. “Pfinder: Real-Time Tracking of the Human Body,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 19, pp. 780-785, 1997.
    [2]I. Haritaoglu, D. Harwood and L. S. Davis, “W4: Real-Time Surveillance of People and Their Activities,”IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, pp. 809-830, 2000.
    [3]R. C. Gonzalez and R. E. Woods, Digital Image Processing, Prentice-Hall, New Jersey, pp. 116-123, 2002.
    [4]Y. H. Liang, S. Guo, Z. Y. Wang, X. W. Xu and X. Y. Cao, “A Robust and Fast Motion Segmentation Method for Video Sequences,” Proceedings of the IEEE International Conference on Automation and Logistics, ICAL 2007, pp. 2952-2957, 2007.
    [5]L. D. Stefano and A. Bulgarelli, “A Simple and Efficient Connected Components Labeling Algorithm,” Proceedings of 10th International Conference on Image Analysis, pp. 322-327, 1999.
    [6]H. S. Chen, H. T. Chen, Y. W. Chen and S. Y. Lee, “Human Action Recognition Using Star Skeleton,” Proceedings of the ACM International Multimedia Conference and Exhibition, pp. 171-178, 2006.
    [7]Texas Instruments Inc., TMS320DM642 Evaluation Module with TVP Video Decoders Technical Reference, 2004.
    [8]S. Y. Chien, S. Y. Ma and L. G. Chen, “Efficient Moving Object Segmentation Algorithm Using Background Registration Technique,” IEEE Transactions on Circuits and Systems for Video Technology, Vol. 12, No. 7, pp. 577-586, 2002.
    [9]Texas Instruments Inc., TMS320C6000 TCP/IP Network Developer’s Kit User’s Guide, June 2003.
    [10]Texas Instruments Inc., TLV320AIC23B Data Manual, Feb. 2004.
    [11]W. C. Kao, “Robust Brightness Measurement and Exposure Control in Real-Time Video Recording,”IEEE Transactions on Instrumentation and Measurement, Vol. 6, pp. 1206-1216, 2011.
    [12]A. J. Lipton, H. Fujiyoshi and R. S. Patil, “Moving Target Classification and Tracking from Real-Time Video,” Proc. IEEE Workshop Applications of Computer Vision, pp. 8-14, 1998.
    [13]C. Kim and J. N. Hwang, “Fast and Automatic Video Object Segmentation and Tracking for Content-Based Applications,”IEEE Transactions on Circuits and Systems for Video Technology, Vol. 12, No. 2, pp. 122-129, 2002.
    [14]陳俊偉, “混合行人與汽機車於入口之影像式偵測研究系統,”國立交通大學電機與控制工程研究所碩士論文, pp. 8-12, 2009.
    [15]J. Barron, D. Fleet and S. Beauchemin, “Performance of Optical Flow Techniques,” International Journal of Computer Vision, Vol. 12, No. 1, pp. 42-77, 1994.
    [16]J. W. Hsieh, W. F. Hu, C. J. Chang and Y. S. Chen, “Shadow Elimination for Effective Moving Object Detection by Gaussian Shadow Modeling,” Image Vision and Computing Journal, Vol. 21, pp. 505-516, 2003.
    [17]G. Szwoch, P. Dalka, “Identification of Regions of Interest in Video for a Traffic Monitoring System,” Proceedings of the 2008 1st International Conference on Information Technology, pp. 19-21, May 2008.
    [18]J. Stauder, R. Mech and J. Ostermann, “Detection of Moving Cast Shadows for Object Segmentation,” IEEE Transactions on Multimedia, Vol. 1, No. 1, pp. 65-76, 1999.
    [19]C. Kim and J. N. Hwang, “Fast and Automatic Video Object Segmentation and Tracking for Content-Based Applications,”IEEE Transactions on Circuits and Systems for Video Technology, Vol. 12, No. 2, pp. 122-129, 2002.
    [20]J. P. Serra, Image Analysis and Mathematical Morphology, Academic Press, pp. 115-130, 1982.
    [21]Sony Corp., Color Video Camera Technical Manual, pp. 8-10, 2003
    [22]C. K. Liang, “Analysis and Compensation of Rolling Shutter Effect,”IEEE Transactions on Image Process, Vol. 17, No. 8, pp. 1323-1330, 2008.
    [23]B. S. Chen and Y. Q. Lei, “Indoor and Outdoor People Detection and Shadow Suppression by Exploiting HSV Color Information,” Proceedings of the Fourth International Conference on Computer and Information Technology (CIT’04), pp. 137-142, 2004.
    [24]S. Liu, J. Y. Dong, S. K. Wang and G. J. Chen, “Road Junction Background Reconstruction Based on Median Estimation and Support Vector Machines,” Proceedings of the Fifth International Conference on Machine Learning and Cybernetics, pp. 4200-4205, 2006.
    [25]Y. Fan, “A Real-Time Algorithm of Dynamic Background Extraction in Image Sequence,” Proceedings of the Fourth International Conference on Machine Learning and Cybernetics, pp. 4997-5000, 2005.
    [26]A. Bevilacqua, L. D. Stefano and P. Azzari, “An Effective Real-Time Mosaicing Algorithm Apt to Detect Motion Through Background Substraction Using a PTZ Camera,” IEEE Conference on Advanced Video and Signal Based Surveillance, pp. 511-516, Sept. 2005.
    [27]Texas Instruments Inc., TMS320C6000 Programmer’s Guide, pp. 46-60, April. 2010.
    [28]林育如,“MPEG-4系統之即時實現及視訊介面設計,” 國立成功大學電機工程學系碩士論文, pp. 18-20, 2002.
    [29]G. Borgefors, “Distance Transformations in Digital Images,” Computer Vision, Graphics, & Image Processing, Vol. 34, No. 3, pp.344-371, 1986.
    [30]A. Rosenfield and M. Thurston, “Edge and Curve Detection for Visual Scene Analysis,” IEEE Transactions Computation, Vol. 20, pp. 562-569, 1971.
    [31]J. Canny, “Computational Approach to Edge Detection,” IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-8, No. 6, pp. 679-698, 1986.
    [32]Texas Instruments Inc., TMS320C6000 DSP/BIOS Application Programming Interface (API) Reference Guide, 2004.
    [33]K. Takahashi and M. Naemura, “Remarks on Real-Time Human Posture Estimation from Silhouette Image Using Neural Network,” IEEE International Conference on Systems, Man and Cybernetics, pp. 370-375, 2004.
    [34]H. S. Chen, H. T. Chen, Y. W. Chen and S. Y. Lee, “Human Action Recognition Using Star Skeleton,” Proceedings of the ACM International Multimedia Conference and Exhibition, pp. 171-178, 2006.
    [35]P. J. Burt and E. H. Adelson, “The Laplacian Pyramid as a Compact Image Code,” IEEE Transactions on Communications, Vol. 31, No. 4, pp. 532-540, 1983.
    [36]Advanced Micro Devices Inc., Am29LV033C Data Sheet, June 2005
    [37]Texas Instruments Inc., Creating a Second-Level Bootloader for FLASH Bootloading on TMS320C6000 Platform With Code Composer Studio, May 2006.
    [38]Texas Instruments Inc., TMS320C6000 DSP External Memory Interface Reference Guide, pp. 94-134, April 2008.
    [39]Texas Instruments Inc., TMS320DM642 EVM OSD FPGA User’s Guide, June 2003.
    [40]Texas Instruments Inc., Reference Framework for eXpressDSP Software : RF5, An Extensive, High-Density System, April 2003.
    [41]郭士銘,“以DSP實現遠端同步教學傳播系統,” 國立台灣科技大學電機工程系碩士學位論文, 2010.
    [42]K. Kim, T. H. Chalidabhongse, D. Harwood and L. Davis, “Real-Time Forground-Background Segmentation Using Codebook Model,” Real-Time Imaging, Vol. 11, No. 3, pp. 172-185, 2005.
    [43]J. M. Guo and C. S. Hsu, “Hierachical Method for Forground Detection Using Codebook Model,” International Conference on Image Processing, pp. 3441-3444, 2010.

    QR CODE