簡易檢索 / 詳目顯示

研究生: 陳慶樺
Ching-hua Chen
論文名稱: 整合多視角與分散式視訊編碼之改良式區塊比對預測演算法
An improved block matching and prediction algorithm for multi-view video with distributed video coder
指導教授: 陳建中
Jiann-jone, Chen
口試委員: 林大衛
none
郭天穎
none
劉俊麟
none
鍾國亮
none
學位類別: 碩士
Master
系所名稱: 電資學院 - 電機工程系
Department of Electrical Engineering
論文出版年: 2010
畢業學年度: 98
語文別: 中文
論文頁數: 75
中文關鍵詞: 多視角分散式視訊編碼區塊比對預測打洞式渦輪碼
外文關鍵詞: multi-view distributed video coding, block matching and prediction, RCPT
相關次數: 點閱:277下載:2
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報

隨著視訊技術不斷地進步,多媒體服務平台逐漸能夠提供使用者搭配偏光眼鏡或不需配帶眼鏡即可觀看三維電視(Three-dimensional Television, 3D TV)呈現立體場景的效果,將不同視角畫面的關聯性做有效的多視角視訊編碼(Multi-view Video Coding, MVC),然而該架構並不適用於行動通信裝置與無線感知監測網路的應用,因而使用分散式視訊編碼(Distributed Video Coding, DVC)技術來處理多視角視訊影像傳輸的問題,將編碼端的複雜度轉移至解碼端,稱為多視角分散式視訊編碼(Multi-view Distributed Video Coding, MDVC)。本論文提出改進MDVC架構的方法:(1)在解碼端提出利用轉換域係數分佈的特性,設計優先權速率控制的渦輪解碼器。(2)提出一個基於尺度特徵不變轉換(Scale Invariant Feature Transform, SIFT)的區塊比對預測(Block Matching and Prediction, BMP)演算法,簡稱為SIFT-BMP,產生更精確的輔助資訊(Side Information, SI)影像,提供DVC重建較佳的影像品質。與文獻中的方法相比,本論文所提的方法能提供較低的運算複雜度與時間消耗,並得到較佳的重建視訊品質。實驗結果顯示提出的SIFT-BMP方法相較於混合型的多視角運動估計(Hybrid Multi-view Motion Estimation, H-MVME)方法可降低運算量1.29~2.56倍,而解碼的PSNR效能與H.264/AVC畫面內(intra)編碼模式相比,提升平均PSNR約0.9至3dB。


With the advance of video coding technology, the multimedia platform can provide Three-dimensional Television (3D TV) display for users with or without glasses. The inter-view video correlation has been exploited to perform efficient multi-view video coding. For the extension applications on mobile devices and wireless networks, the distributed video codec (DVC) can be utilized to shift encoder complexity to decoder under the multi-view video coding framework, denoted as Multi-view Distributed Video Coding (MDVC). Several coding control methods have been proposed to improve the MDVC codec performance: (1) The block transform coefficients are utilized to design the priority rate control for the turbo coder at decoder. (2) A Block Matching and Prediction algorithm, based on Scale Invariant Feature Transform, abbreviated as SIFT-BMP, has been proposed to yield more accurate side information for better DVC reconstructed images. In comparisons with other view estimation method, the proposed method can provide lower computational complexity and time consumption, while yielding better reconstructed video quality. Simulations show that the proposed SIFT-BMP method can reduce the time complexity of motion estimation to 1.29~2.56 times smaller, as compared to previous Hybrid Multi-View Motion Estimation (H-MVME) methods, while the PSNR of decoded video can be improved 0.9~3dB, as compared to H.264/AVC intra coding.

摘要 I Abstract II 誌謝 III 目錄 IV 圖目錄 VI 表目錄 IX 第一章 緒論 1 1.1 研究動機與目的 1 1.2 問題描述 2 1.3 研究方法 3 1.4 論文組織 5 第二章 背景知識與相關研究 6 2.1 分散式視訊編碼之多視角視訊編碼 6 2.1.1 多視角視訊編碼源起 6 2.1.2 分散式視訊編碼理論 7 2.1.3 整合分散式視訊編碼與多視角視訊編碼架構 9 2.2 多視角輔助資訊重建 10 2.2.1 運動補償時間域內插 10 2.2.2 透視轉換模型 11 2.2.3 多視角運動估計 13 2.3 相關模擬工具 14 2.3.1 H.264視訊編碼器 14 2.3.2 RCPT之渦輪編碼器 17 2.3.3 SIFT之特徵比對法 19 第三章 分散式架構下改良之多視角視訊編碼系統 23 3.1 MDVC系統 23 3.2 速率控制演算法 30 3.3 輔助資訊重建之區塊比對演算法 35 3.4 輔助資訊重建之複雜度分析 40 第四章 模擬結果與比較 43 4.1 實驗參數設定 43 4.2 實驗數據比較 45 4.2.1 重建輔助資訊影像之品質 46 4.2.2 重建輔助資訊影像與編解碼之時間複雜度 49 4.2.3 解碼影像之PSNR編碼效能 51 4.3 實驗結果展示 59 4.3.1 重建輔助資訊影像 59 4.3.2 解碼之WZ影像 65 第五章 結論與未來展望 71 5.1 結論 71 5.2 未來展望 72 參考文獻 73

[1] Information technology-coding of audio-visual objects-part 10: advanced video coding, ISO/IEC Std 14496-10, 2003.
[2] “Joint draft 8.0 on multi-view video coding,” JVT-AB204, Hannover, Germany, Jul. 2008.
[3] D. Slepian and J. Wolf, “Noiseless coding of correlated information sources,” IEEE Trans. Inf. Theory, vol. 19, no. 4, pp. 471-480, Jul. 1973.
[4] A. Wyner and J. Ziv, “The rate-distortion function for source coding with side information at the decoder,” IEEE Trans. Inf. Theory, vol. 22, no. 1, pp. 1-10, Jan. 1976.
[5] A. Aaron and B. Girod, “Compression with side information using turbo codes,” in Proc. DCC, pp. 252-261, Aug. 2002.
[6] R. Puri and K. Ramchandran, “PRISM: A new robust video coding architecture based on distributed compression principles,” in Proc. Communion, Control, and Computing, Allerton, IL, Oct. 2002.
[7] B. Girod, A. Aaron, S. Rane, and D. Rebollo-Monedero, “Distributed video coding,” in Proc. IEEE Special Issue on Advances in Video Coding and Delivery, vol. 93, no. 1, pp. 71-83, Jan. 2005.
[8] X. Artigas, J. Ascenso, M. Dalai, S. Klomp, D. Kubasov, and M. Ouaret, “The DISCOVER codec: architecture, techniques and evaluation,” in Proc. PCS, Lisbon, Portugal, Nov. 2007.
[9] J. Lu, X. Zhang, and L. Wu, “Distributed video coding technology based on H.264 and turbo code,” in Proc. CISP, vol. 1, pp. 516-520, May 2008.
[10] N. Anantrasirichai, D. Agrafiotis, and D. Bull, “Enhanced spatially interleaved DVC using diversity and selective feedback,” in Proc. IEEE ICASSP, pp. 717-720, Apr. 2009.
[11] J. Slowack, J. Škorupa, S. Mys, P. Lambert, C. Grecos, and R. Van de Walle, “Flexible distribution of complexity by hybrid predictive-distributed video coding,” EURASIP Journal on Signal Processing: Image Communication, vol. 25, no. 2, pp. 94-110, Feb. 2010.
[12] A. Smolic and P. Kauff, “Interactive 3-D video representation and coding technologies,” in Proc. IEEE, vol.93, no. 1, pp. 89-110, 2005.
[13] “Free Viewpoint Television (FTV),” http://www.tanimoto.nuee.nagoya-u.ac.jp/study/FTV
[14] M. Flierl and B. Girod, “Coding of multi-view image sequences with video sensors,” in Proc. IEEE ICIP, pp. 609-612, Oct. 2006.
[15] X. Guo, Y. Lu, F. Wu, D. Zhao, and W. Gao, “Wyner-Ziv-based multiview video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 18, no. 6, pp. 713-724, Jun. 2008.
[16] M. Ouaret, F. Dufaux, and T. Ebrahimi, “Iterative multiview side information for enhanced reconstruction in distributed video coding,” EURASIP Journal on Image Video Process., pp. 1-17, Jan. 2009.
[17] X. Artigas, E. Angeli, and L. Torres, “Side information generation for multiview distributed video coding using a fusion approach,” in Proc. NORSIG, Reykjavik, Iceland, pp. 250-253, Jun. 2007.
[18] J. Ascenso, C. Brites, and F. Pereira, “Content adaptive Wyner-Ziv video coding driven by motion activity,” in Proc. IEEE ICIP, pp. 605-608, Oct. 2006.
[19] M. Ouaret, F. Dufaux, and T. Ebrahimi, “Fusion-based multiview distributed video coding,” in Proc. ACM VSSN, pp. 139-144, Oct. 2006.
[20] R. Hartley and A. Zisserman, Multiple view geometry in computer vision, 2nd Ed., Cambridge University Press, ISBN: 0-521-54051-8, 2004.
[21] X. Artigas, E. Angeli, and L. Torres, “Comparison of different side information generation methods for multiview distributed video coding,” in Proc. SIGMAP, Barcelona, Spain, pp. 450-455, Jul. 2007.
[22] The official website of visual studio: http://www.microsoft.com/visualstudio
[23] The official website of OpenCV: http://opencv.willowgarage.com
[24] The official website of IT++: http://sourceforge.net/apps/wordpress/itpp/
[25] C. Brites and F. Pereira, “Correlation noise modeling for efficient pixel and transform domain Wyner-Ziv video coding,” IEEE Trans. Circuits Syst. Video Technol., vol. 18, no. 9, pp. 1177-1190, Sep. 2008.
[26] D. Kubasov, K. Lajnef, and C. Guillemot, “A hybrid encoder/decoder rate control for Wyner-Ziv video coding with a feedback channel,” IEEE Workshop on MMSP, pp. 251-254, Oct. 2007.
[27] A. Aaron, R. Zhang, and B. Girod, “Wyner-Ziv coding of motion video,” (Invited Paper) in Proc. Asilomar Conference on Signals and Systems, Pacific Grove, CA, Nov. 2002.
[28] D. Marpe, T. Wiegard, and G. J. Sullivan, “The H.264/MPEG4 advanced video coding standard and its applications,” IEEE Communication Magazine, vol. 44, no. 8, pp. 134-143, Aug. 2006.
[29] Iain E. G. Richardson, H.264 and MPEG-4 video compression: video coding for next-generation multimedia, John Wiley & Sons, Ltd. ISBN: 0-470-84837-5, 2003.
[30] D. N. Rowitch and L. B. Milstein, “On the performance of hybrid FEC/ARQ system using rate compatible punctured turbo (PCPT) codes,” IEEE Trans. Communication, vol. 48, no. 6, pp. 948-959, Jun. 2000.
[31] L. R. Bahl, J. Cocke, F. Jelinek, and J. Racic, “Optimal decoding of linear codes for minimizing symbol error rate,” IEEE Trans. Inf. Theory, vol. 20, no. 2, pp. 284-287, Mar. 1974.
[32] G. Berrou, A. Glavieuc, and P. Thitmajshima, “Near Shannon limit error-correcting coding: turbo codes,” in Proc. IEEE ICC, pp. 1064-1070, May 1993.
[33] P. Robertson, E. Villebrn, and P. Hoeher, “A comparison of optimal and sub-optimal MAP decoding algorithms operating in the log domain,” in Proc. IEEE ICC, vol. 2, pp. 1009-1013, Jun. 1995.
[34] D. G. Lowe, “Distinctive image features from scale-invariant keypoints,” International Journal of Computer Vision, vol. 60, no. 2, pp. 91-110, Nov. 2004.
[35] R. C. Bolles and M. A. Fischler, “Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography,” Communication ACM, vol. 24, no. 6, pp. 381-395, Jun. 1981.
[36] Call for proposals on multi-view video coding, ISO/IEC JTC1/SC29/WG11, N7327, Jul. 2005.
[37] Joint Video Team (JVT) of ISO/IEC MPEG & ITU-T VCEG (ISO/IEC JTC1/SC29/WG11 and ITU-T SG16 Q.6), “JSVM Software Manual,” JSVM 9.18, 19 Jun. 2009.

QR CODE