研究生: |
李慶銘 Ching-Ming Li |
---|---|
論文名稱: |
即時影音教學傳播系統的實現 The Implementation of the Real-Time Video-Audio Teaching Communication System |
指導教授: |
蔡超人
Chau-Ren Tsai |
口試委員: |
蘇順豐
Shun-Feng Su 郭景明 Jing-ming Guo 王乃堅 Nai-Jian Wang |
學位類別: |
碩士 Master |
系所名稱: |
電資學院 - 電機工程系 Department of Electrical Engineering |
論文出版年: | 2011 |
畢業學年度: | 99 |
語文別: | 中文 |
論文頁數: | 139 |
中文關鍵詞: | 數位信號處理器 、雙攝影機架構 、PTZ攝影機 、姿態分析 |
外文關鍵詞: | Digital Signal Processor, Dual-Camera Module, PTZ Camera, Posture Analysis |
相關次數: | 點閱:265 下載:2 |
分享至: |
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報 |
在一般學校或補習班教室等演講場合,常會使用攝影機在旁做側錄,以供當時未參與的學員有二次學習的機會,側錄通常是以人工的方式去操作攝影機,因此本計畫研究以DSP實現即時影音教學傳播系統為目的,希望能全自動錄影以取代人力資源。由於數位訊號處理器(DSP:Digital Signal Processor)具有執行速度較快,整體效能佳、特殊硬體及指令設計可達成真正的即時運作的優點,因此本系統實現在德州儀器(Texas Instrument)之TMS320DM642 EVM模組上並搭配雙攝影機架構進行協同追蹤,其中場景攝影機做目標物偵測及姿態分析,並透過座標轉換引導PTZ(Pan/Tilt/Zoom)攝影機將講師置於畫面並對人物特殊動作特寫,適時的縮放畫面使呈現更清晰的教學畫面。另外在教學聲音則由無線麥克風接收模組擷取,最後再透過網路將影像與聲音傳送至遠端電腦並錄製成教學影片,如此可建構出即時影音教學傳播系統。
In the ordinary school, cram school and other lecture occasions, we use the camera to record the program contents in order to let the audience who was absent can learn the program contents. The record works usually do by someone. Accordingly, the purpose of this research is to implement a real-time audio-video teaching communication system to replace it. The Digital Signal Processor (DSP) has many benefits such as high performance, small size and standalone for this reason. This system will combine TMS320DM642 Evaluation Module with dual-camera Module that can track the target in coordination, one of the cameras for target detection and posture analysis is field camera, via coordinates transformation it can guide PTZ camera to make the speaker stay in the frame to enlarge the frame, and zoom screen timely to present clearer teaching images. Beside, the voice in teaching is captured by wireless microphone receiver module. Finally, the image and voice will be transmitted to the remote computer and record as teaching film. Hereby develop a real-time video-audio teaching communication system.
[1]C. R. Wren, A. Azarbayejani, T. Darrell and A. Pentland. “Pfinder: Real-Time Tracking of the Human Body,” IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 19, pp. 780-785, 1997.
[2]I. Haritaoglu, D. Harwood and L. S. Davis, “W4: Real-Time Surveillance of People and Their Activities,”IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, pp. 809-830, 2000.
[3]R. C. Gonzalez and R. E. Woods, Digital Image Processing, Prentice-Hall, New Jersey, pp. 116-123, 2002.
[4]Y. H. Liang, S. Guo, Z. Y. Wang, X. W. Xu and X. Y. Cao, “A Robust and Fast Motion Segmentation Method for Video Sequences,” Proceedings of the IEEE International Conference on Automation and Logistics, ICAL 2007, pp. 2952-2957, 2007.
[5]L. D. Stefano and A. Bulgarelli, “A Simple and Efficient Connected Components Labeling Algorithm,” Proceedings of 10th International Conference on Image Analysis, pp. 322-327, 1999.
[6]H. S. Chen, H. T. Chen, Y. W. Chen and S. Y. Lee, “Human Action Recognition Using Star Skeleton,” Proceedings of the ACM International Multimedia Conference and Exhibition, pp. 171-178, 2006.
[7]Texas Instruments Inc., TMS320DM642 Evaluation Module with TVP Video Decoders Technical Reference, 2004.
[8]S. Y. Chien, S. Y. Ma and L. G. Chen, “Efficient Moving Object Segmentation Algorithm Using Background Registration Technique,” IEEE Transactions on Circuits and Systems for Video Technology, Vol. 12, No. 7, pp. 577-586, 2002.
[9]Texas Instruments Inc., TMS320C6000 TCP/IP Network Developer’s Kit User’s Guide, June 2003.
[10]Texas Instruments Inc., TLV320AIC23B Data Manual, Feb. 2004.
[11]W. C. Kao, “Robust Brightness Measurement and Exposure Control in Real-Time Video Recording,”IEEE Transactions on Instrumentation and Measurement, Vol. 6, pp. 1206-1216, 2011.
[12]A. J. Lipton, H. Fujiyoshi and R. S. Patil, “Moving Target Classification and Tracking from Real-Time Video,” Proc. IEEE Workshop Applications of Computer Vision, pp. 8-14, 1998.
[13]C. Kim and J. N. Hwang, “Fast and Automatic Video Object Segmentation and Tracking for Content-Based Applications,”IEEE Transactions on Circuits and Systems for Video Technology, Vol. 12, No. 2, pp. 122-129, 2002.
[14]陳俊偉, “混合行人與汽機車於入口之影像式偵測研究系統,”國立交通大學電機與控制工程研究所碩士論文, pp. 8-12, 2009.
[15]J. Barron, D. Fleet and S. Beauchemin, “Performance of Optical Flow Techniques,” International Journal of Computer Vision, Vol. 12, No. 1, pp. 42-77, 1994.
[16]J. W. Hsieh, W. F. Hu, C. J. Chang and Y. S. Chen, “Shadow Elimination for Effective Moving Object Detection by Gaussian Shadow Modeling,” Image Vision and Computing Journal, Vol. 21, pp. 505-516, 2003.
[17]G. Szwoch, P. Dalka, “Identification of Regions of Interest in Video for a Traffic Monitoring System,” Proceedings of the 2008 1st International Conference on Information Technology, pp. 19-21, May 2008.
[18]J. Stauder, R. Mech and J. Ostermann, “Detection of Moving Cast Shadows for Object Segmentation,” IEEE Transactions on Multimedia, Vol. 1, No. 1, pp. 65-76, 1999.
[19]C. Kim and J. N. Hwang, “Fast and Automatic Video Object Segmentation and Tracking for Content-Based Applications,”IEEE Transactions on Circuits and Systems for Video Technology, Vol. 12, No. 2, pp. 122-129, 2002.
[20]J. P. Serra, Image Analysis and Mathematical Morphology, Academic Press, pp. 115-130, 1982.
[21]Sony Corp., Color Video Camera Technical Manual, pp. 8-10, 2003
[22]C. K. Liang, “Analysis and Compensation of Rolling Shutter Effect,”IEEE Transactions on Image Process, Vol. 17, No. 8, pp. 1323-1330, 2008.
[23]B. S. Chen and Y. Q. Lei, “Indoor and Outdoor People Detection and Shadow Suppression by Exploiting HSV Color Information,” Proceedings of the Fourth International Conference on Computer and Information Technology (CIT’04), pp. 137-142, 2004.
[24]S. Liu, J. Y. Dong, S. K. Wang and G. J. Chen, “Road Junction Background Reconstruction Based on Median Estimation and Support Vector Machines,” Proceedings of the Fifth International Conference on Machine Learning and Cybernetics, pp. 4200-4205, 2006.
[25]Y. Fan, “A Real-Time Algorithm of Dynamic Background Extraction in Image Sequence,” Proceedings of the Fourth International Conference on Machine Learning and Cybernetics, pp. 4997-5000, 2005.
[26]A. Bevilacqua, L. D. Stefano and P. Azzari, “An Effective Real-Time Mosaicing Algorithm Apt to Detect Motion Through Background Substraction Using a PTZ Camera,” IEEE Conference on Advanced Video and Signal Based Surveillance, pp. 511-516, Sept. 2005.
[27]Texas Instruments Inc., TMS320C6000 Programmer’s Guide, pp. 46-60, April. 2010.
[28]林育如,“MPEG-4系統之即時實現及視訊介面設計,” 國立成功大學電機工程學系碩士論文, pp. 18-20, 2002.
[29]G. Borgefors, “Distance Transformations in Digital Images,” Computer Vision, Graphics, & Image Processing, Vol. 34, No. 3, pp.344-371, 1986.
[30]A. Rosenfield and M. Thurston, “Edge and Curve Detection for Visual Scene Analysis,” IEEE Transactions Computation, Vol. 20, pp. 562-569, 1971.
[31]J. Canny, “Computational Approach to Edge Detection,” IEEE Transactions on Pattern Analysis and Machine Intelligence, PAMI-8, No. 6, pp. 679-698, 1986.
[32]Texas Instruments Inc., TMS320C6000 DSP/BIOS Application Programming Interface (API) Reference Guide, 2004.
[33]K. Takahashi and M. Naemura, “Remarks on Real-Time Human Posture Estimation from Silhouette Image Using Neural Network,” IEEE International Conference on Systems, Man and Cybernetics, pp. 370-375, 2004.
[34]H. S. Chen, H. T. Chen, Y. W. Chen and S. Y. Lee, “Human Action Recognition Using Star Skeleton,” Proceedings of the ACM International Multimedia Conference and Exhibition, pp. 171-178, 2006.
[35]P. J. Burt and E. H. Adelson, “The Laplacian Pyramid as a Compact Image Code,” IEEE Transactions on Communications, Vol. 31, No. 4, pp. 532-540, 1983.
[36]Advanced Micro Devices Inc., Am29LV033C Data Sheet, June 2005
[37]Texas Instruments Inc., Creating a Second-Level Bootloader for FLASH Bootloading on TMS320C6000 Platform With Code Composer Studio, May 2006.
[38]Texas Instruments Inc., TMS320C6000 DSP External Memory Interface Reference Guide, pp. 94-134, April 2008.
[39]Texas Instruments Inc., TMS320DM642 EVM OSD FPGA User’s Guide, June 2003.
[40]Texas Instruments Inc., Reference Framework for eXpressDSP Software : RF5, An Extensive, High-Density System, April 2003.
[41]郭士銘,“以DSP實現遠端同步教學傳播系統,” 國立台灣科技大學電機工程系碩士學位論文, 2010.
[42]K. Kim, T. H. Chalidabhongse, D. Harwood and L. Davis, “Real-Time Forground-Background Segmentation Using Codebook Model,” Real-Time Imaging, Vol. 11, No. 3, pp. 172-185, 2005.
[43]J. M. Guo and C. S. Hsu, “Hierachical Method for Forground Detection Using Codebook Model,” International Conference on Image Processing, pp. 3441-3444, 2010.