
Author: Adhatus Solichah Ahmadiyah
Thesis title: New Algorithms for Perceptual-based Image Compression and Anaglyph Stereo Video Compression
Advisor: Kai-Lung Hua (花凱龍)
Committee members: Gee-Sern Hsu, Wen-Huang Cheng
Degree: Master
Department: College of Electrical Engineering and Computer Science, Department of Computer Science and Information Engineering
Publication year: 2013
Academic year of graduation: 101
Language: English
Number of pages: 38
Keywords (Chinese field): multitree, perceptual, anaglyph, stereo, compression
Keywords (English field): multitree, perceptual, Bayer pattern

  • Compression is important in digital applications because it reduces storage
    size and minimizes the bandwidth used for file transfer over networks. This
    document discusses two compression topics. The first topic presents a
    novel block-based image coding algorithm that integrates a tree-structured multitree
    dictionary with a perceptual-based rate-distortion optimization scheme. The
    multitree dictionary supports a very large number of different tilings, while the
    perceptual-based rate-distortion optimization uses the SSIM metric, instead of the
    popular MSE metric, to allocate bit rate according to the human visual system. Experimental
    results show that the proposed method outperforms many existing techniques
    in both subjective and objective image quality measures. The second topic
    presents two novel end-to-end stereo video compression pipelines consisting of
    single-sensor digital camera pairs, legacy consumer-grade video decoders,
    and anaglyph displays. Because 3D videos contain a large amount of data, efficient
    compression methods are needed to distribute streams over the current communication
    infrastructure. We propose two methods that transmit a single
    encoded stream containing only the data required to create anaglyph video from
    single-sensor camera pairs. The experimental results show that the
    proposed methods outperform the traditional one, achieving up to
    4.66 dB improvement in Composite PSNR.
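
The abstract reports its gain in Composite PSNR on anaglyph video. As a rough illustration only, the sketch below composes a red-cyan anaglyph and scores a degraded copy. It rests on two assumptions that the record does not confirm: that the anaglyph takes its red channel from the left view and its green and blue channels from the right view, and that Composite PSNR pools the squared error over all pixels and all three color channels. The function names `make_anaglyph` and `composite_psnr` are hypothetical, and the thesis's own pipelines (which work from single-sensor Bayer-pattern data) may differ.

```python
import math

def make_anaglyph(left, right):
    """Assumed red-cyan layout: red from the left view, green and blue from the right.

    Each image is a list of rows; each row is a list of (r, g, b) tuples.
    """
    return [
        [(lp[0], rp[1], rp[2]) for lp, rp in zip(lrow, rrow)]
        for lrow, rrow in zip(left, right)
    ]

def composite_psnr(reference, test, peak=255.0):
    """PSNR from the squared error pooled over all pixels and all three channels."""
    total, count = 0.0, 0
    for ref_row, test_row in zip(reference, test):
        for ref_px, test_px in zip(ref_row, test_row):
            for a, b in zip(ref_px, test_px):
                total += (a - b) ** 2
                count += 1
    mse = total / count
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(peak * peak / mse)

# Tiny 1x2 example views (made-up pixel values, for illustration only).
left = [[(200, 10, 20), (180, 30, 40)]]
right = [[(100, 50, 60), (90, 70, 80)]]
anaglyph = make_anaglyph(left, right)

# A hypothetical decoded anaglyph with small errors in three samples.
degraded = [[(198, 52, 60), (180, 70, 77)]]
print(composite_psnr(anaglyph, degraded))
```

Pooling the error across channels before taking the logarithm gives a single score per frame, which is why a figure such as "up to 4.66 dB" can summarize a color result in one number.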

    ABSTRACT
    ACKNOWLEDGMENTS
    TABLE OF CONTENTS
    LIST OF FIGURES
    1. PERCEPTUAL-BASED IMAGE COMPRESSION
      1.1 Introduction
      1.2 The proposed multitree image coder with perceptual-based RDO
      1.3 Experimental results
      1.4 Summary
    2. ANAGLYPH STEREO VIDEO COMPRESSION
      2.1 Introduction
      2.2 Proposed Anaglyph Stereo Video Compression
      2.3 Experimental results
      2.4 Summary
    3. CONCLUSIONS
    REFERENCES

