簡易檢索 / 詳目顯示

研究生: 顏碩甫
Shuo-Fu Yen
論文名稱: 基於權重反向索引分類與快速篩選演算法之雲端巨量影像資料庫檢索系統
A Fast Cloud Large-Scale Image Retrieval System Using Weighted-Inverted Index and Database Filtering Algorithm
指導教授: 陳建中
Jiann-Jone Chen
口試委員: 唐政元
Cheng-Yuan Tang
蔡耀弘
Yao-Hong Tsai
吳怡樂
Yi-Leh Wu
學位類別: 碩士
Master
系所名稱: 電資學院 - 電機工程系
Department of Electrical Engineering
論文出版年: 2016
畢業學年度: 104
語文別: 中文
論文頁數: 85
中文關鍵詞: 影像檢索巨量資料庫反向索引雲端運算
外文關鍵詞: CBIR, Inverted index, Cloud computing, Large-scale database
相關次數: 點閱:277下載:4
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報

隨著網路與多媒體技術的進步,網路多媒體應用已經成為日常生活中主要的資訊傳遞方式,隨之而來產生巨量資料 ( Big data ) 使得網路資料量呈現爆炸性成長,透過網路連結成為一個巨量影像資料庫 ( Large-scale image database)。另外雲端運算平台平行處理的技術進步,結合多媒體資料與雲端運算的應用成為當前的重要發展趨勢。目前的影像檢索技術以基於內容之影像檢索 ( Content-Based Image Retrieval, CBIR ) 最為常見,大多的CBIR系統以單一種類特徵值進行比對,無法有效反應影像內容的特性,因而檢索精確度有限,而在巨量影像資料庫的情況下,使用單一硬體伺服器進行影像檢索,因為檢索比對所需要的運算量過大,需要較長的檢索時間。因此,如何有效運用雲端運算技術有效地提昇巨量影像資料庫檢索的速度以及精確度效能,對於雲端巨量資料運算的應用相當重要。
本論文提出一套雲端巨量影像資料庫檢索系統,使用不同功能之影像特徵,分別為比對用特徵值以及分類用特徵值,藉以解決使用單一種類特徵值精確度不佳的問題;我們提出基於權重反向索引之資料庫分類演算法 ( Database-Categorizing based on Weighted-Inverted Index, DCWII ) 以及資料庫篩選演算法 ( Database-Filtering Algorithm, DFA ),利用權重反向索引 ( Weighted-Inverted Index ) 將影像資料庫分類,再根據檢索影像之特徵權重,設立一個門檻值進行資料庫事前篩選 ( Pre-filtering ),只選擇正相關影像資料庫進一步詳細檢索;我們將提出的方法結合雲端運算架構,加快資料庫比對的速度,降低單一硬體伺服器之比對運算量。實驗結果顯示,本論文之雲端巨量影像資料庫檢索系統在mAP效能上維持在0.678的表現,而在PR曲線效能上優於其他文獻方法,同時能夠比其他文獻的方法減少約55%至70%的檢索時間,所提方法和系統能有效地改善巨量影像資料庫檢索系統之效能。


With the advance of multimedia technology and communications, images and videos become the major streaming information through the Internet. How to fast retrieve desired similar images precisely from the Internet scale image/video databases (Big Data) is the most important retrieval control target. In this paper, a cloud based content-based image retrieval (CBIR) scheme is presented. To speed up the features matching process for large scale CBIR, we proposed to perform Database-Categorizing based on Weighted-Inverted Index (DCWII) and Database Filtering Algorithm (DFA). In the DCWII, it assigns weights to DCT coefficients histograms and categorizes the database by weighted features. In addition, the DFA filters out irrelevant image in the database to reduce unnecessary computation loading for features matching. Experiments showed that the proposed CBIR scheme outperforms previous works in the Precision-Recall performance and maintains mean average precision (mAP) about 0.678 in the large-scale database comprising one mega images. Our scheme also can reduce about 55%~70% retrieval time by pre-filtering the database, which helps to improve efficiency of retrieval system.

摘要 I Abstract II 誌謝 III 目錄 V 圖目錄 VIII 表目錄 XI 第一章 緒論 1 1.1 研究背景 1 1.2 研究動機與目的 2 1.3 研究項目與方法概述 3 1.4 論文架構 6 第二章 背景知識與相關研究 7 2.1 影像檢索技術簡介 7 2.1.1 基於文字之影像檢索 8 2.1.2 基於內容之影像檢索 9 2.1.2.1 區域性特徵 10 2.1.2.2 全域性特徵 11 2.1.2.3 MPEG-7標準 17 2.1.3 基於語意之影像檢索 18 2.2 壓縮域影像檢索相關知識 19 2.2.1 JPEG影像壓縮標準 19 2.2.1.1 色彩空間轉換 20 2.2.1.2 縮減取樣 20 2.2.1.3 離散餘弦轉換 21 2.2.1.4 量化 22 2.2.1.5 熵編碼 22 2.2.2 壓縮域影像檢索之相關文獻 25 2.3 反向索引相關知識 27 第三章 提出方法 29 3.1 本論文之系統架構 29 3.1.1 問題描述 29 3.1.2 本論文之雲端巨量影像資料庫檢索系統架構 31 3.2 影像多特徵值擷取 33 3.2.1 比對用特徵值 33 3.2.1.1 色彩分布敘述子 33 3.2.1.2 邊緣直方圖敘述子 35 3.2.2 分類用特徵值 37 3.2.2.1 碼書建立 37 3.2.2.2 DCT係數直方圖擷取 38 3.3 本論文之資料庫分類與篩選演算法 39 3.3.1 基於權重反向索引之資料庫分類演算法 39 3.3.1.1 DCT係數直方圖量化 39 3.3.1.2 基於權重之反向索引 42 3.3.2 資料庫篩選演算法 45 3.4 本論文之雲端系統應用 47 3.4.1 MapReduce平行處理模型 47 3.4.2 影像特徵值比對之權重分配 48 第四章 實驗結果 49 4.1 實驗環境與影像資料庫 49 4.2 實驗效能指標與比較的文獻 50 4.2.1 實驗效能指標 50 4.2.2 實驗之比較文獻方法 54 4.3 檢索系統效能與實驗結果 56 4.3.1 本論文系統參數配置 56 4.3.1.1 DCT係數直方圖之CodebookK值 56 4.3.1.2 多特徵值比對權重配置 59 4.3.1.3 DFA資料庫篩選門檻值 61 4.3.2 檢索系統效能評量 65 4.3.2.1 小型資料庫之檢索系統效能比較 65 4.3.2.2 巨量資料庫之檢索系統效能之比較 67 4.3.2.3 基於雲端運算架構之巨量資料庫檢索系統效能比較 69 4.3.3 事前資料庫分類於儲存硬體之時間效能測試 72 4.4 實驗結果探討與分析 73 第五章 結論與未來展望 77 5.1 結論 77 5.2 未來展望 78 5.3 研究建議 79 參考文獻 82

[1] N. Naphade et al, “Large-scale concept ontology for multimedia,” IEEE Multimedia, vol. 13, no. 3, pp. 86-91, 2006.
[2] Q. Zhang et al, “Cloud computing: state-of-the-art and research challenges,” J. Internet Services and Applications, vol. 1, no. 1, 2010.
[3] Google Image Search ( https://images.google.com ).
[4] Yahoo! Image Search ( https://images.search.yahoo.com ).
[5] TinEye Image Search ( https://www.tineye.com ).
[6] 百度識圖 ( http://image.baidu.com ).
[7] M. Chen et al, “Big data: A survey,” Mobile Networks and Applications, vol. 19, no. 2, pp. 171-209, 2014.
[8] P. Poursistani et al, “Image indexing and retrieval in JPEG compressed domain based on vector quantization,” Mathematical and Computer Modelling, vol. 57, no. 5, pp. 1005-1017, 2013.
[9] F. Rajam et al, “A survey on content based image retrieval,” Life Science Journal, vol. 10, no. 2, pp. 2475-2487, 2013.
[10] S. Gandhani et al, “Content based image retrieval: survey and comparison of CBIR system based on combined features,” Int. Journal of Signal Processing, vol. 8, no. 10, pp.155-162, 2015
[11] S. S. Mukati, and N. Rastogi, “A survey on image retrieval techniques with feature extraction,” IRJET, vol. 3, no. 1, pp. 323-327, 2016.
[12] T. Tuytelaars et al, “Local invariant feature detectors: a survey,” Foundations and Trends® in Computer Graphics and Vision, vol.3, no. 3,pp. 177-280, 2008.
[13] K. Grauman, and B. Leibe, “Visual object recognition,” Synthesis Lectures Artif. Intell. Mach. Learn., vol. 5, no. 2, pp. 1-181, 2011.
[14] F. Bellavia et al, “Improving Harris corner selection strategy,” IET Computer Vision, vol. 5, no. 2, pp. 87-96, 2011.
[15] C. Harris, and M. Stephens, “A combined corner and edge detector,” Alvey vision Conf., vol. 15, pp. 147-151, 1988.
[16] D. G. Lowe, “Object recognition from local scale-invariant features,” in Proc. IEEE Int. Conf. on Computer vision, vol. 2, pp. 1150-1157, 1999.
[17] M. J. Swain et al, “Color indexing,” Int. J. Computer Vision, vol. 7, no. 1, pp. 11-32, 1991.
[18] 鍾國亮, “影像處理與電腦視覺”,五版,pp. 471-472,2012.
[19] M. A. Stricker, and A. Dimai, “Color indexing with weak spatial constraints,” Electronic Imaging: Science & Technology. Int. Society for Optics and Photonics, pp. 29-40, 1996.
[20] 余秋忠,“以內容為基礎之影像與視訊處理技術於鑑識科學之應用”,國立中央警察大學鑑識科學研究所博士學位論文,2012。
[21] G. Raghuwanshi, and V. Tyagi, “A survey on texture image retrieval,” Springer Proc. 2nd Int. Conf. Computer & Communication Technologies, pp. 427-435, 2015.
[22] Z. Zhao, et al, “Content based image retrieval scheme using color, texture and shape features,” Int. J. Signal Processing, Image Processing and Pattern Recognition, vol. 9, no. 1, pp. 203-212, 2016.
[23] S. Agarwal et al, “Content based Image retrieval using color edge detection and discrete wavelet transform,” ICICT, pp.368-372, 2014.
[24] E. Gupta, and R.S. Kushwa, “Combination of local, global and K-Mean using wavelet transform for content base image retrieval,” Int. Journal of Computer Applications, vol. 116, no. 14, 2015.
[25] J. Canny, “A computational approach to edge detection,” IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 679-698, 1986.
[26] C. Iakovidou et al, “Searching images with MPEG-7 (& mpeg-7-like) powered localized descriptors: the SIMPLE answer to effective content based image retrieval.” CBMI, IEEE 12th Int. Workshop, pp. 1-6, 2014.
[27] A. Alzu’bi et al, “Semantic content-based image retrieval: a comprehensive study,” J. Visual Communication and Image Representation, vol. 32, pp. 20-54, 2015.
[28] X. Y. Wang et al, “A new SVM-based active feedback scheme for image retrieval,” Engineering Applications of Artificial Intelligence, vol. 37, pp. 43-53, 2015.
[29] S.Y. Irianto, “Content based Image Retrieval in the Compressed Domain,” Int. J. Computer Applications, vol. 99, no. 13, pp. 18-23, 2014.
[30] Z. M. Lu et al, “A content-based image retrieval scheme in JPEG compressed domain,” Int. J. Innovative Computing, Info. & Control, vol. 2, no. 4, pp. 831-839, 2006.
[31] D. Edmundson, and G. Schaefer, “Fast JPEG image retrieval using optimised Huffman tables,” IEEE ICPR, pp. 3188-3191, 2012.
[32] L. Zheng et al, “Fast image retrieval: query pruning and early termination,” IEEE Transactions on Multimedia, vol. 19, no. 5, pp. 648-659, 2015.
[33] Y. Zhang et al, “Image retrieval with geometry-preserving visual phrases,” IEEE CVPR, pp. 809-816, 2011.
[34] Color layout descriptor - Wikipedia
( http://en.wikipedia.org/wiki/Color_layout_descriptor ).
[35] Corel Image ( http://corel.brothersoft.com/corel-image-database-dvd.html ).
[36] Caltech 101 ( http://www.vision.caltech.edu/Image_Datasets/Caltech101/ ).
[37] MIR-Flickr 1M ( http://press.liacs.nl/mirflickr/ ).
[38] M. J. Huiskes et al, “New trends and ideas in visual concept detection: the MIR flickr retrieval evaluation initiative,” ACM Proceedings of the Int. Conf. on Multimedia information retrieval, pp. 527-536, 2010.
[39] Flickr ( https://www.flickr.com ).
[40] K. Kishida, “Property of average precision and its generalization: An examination of evaluation indicator for information retrieval experiments,” Tokyo, Japan: National Institute of Informatics, pp. 1-19, 2005.

QR CODE