簡易檢索 / 詳目顯示

研究生: 賴沅壕
Yuan-hao Lai
論文名稱: 基於移動物體軌跡與外觀之影片搜尋
Video Object Retrieval by Trajectory and Appearance
指導教授: 楊傳凱
Chuan-kai Yang
口試委員: 李育杰
Yuh-jye Lee
鮑興國
Hsing-kuo Pao
學位類別: 碩士
Master
系所名稱: 管理學院 - 資訊管理系
Department of Information Management
論文出版年: 2013
畢業學年度: 101
語文別: 中文
論文頁數: 34
中文關鍵詞: 影片搜尋物體追蹤物體再識別
外文關鍵詞: Video Retrieval, Object Tracking, Object Re-identification
相關次數: 點閱:170下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報

隨著網際網路、監視器與手持裝置的發展,大量的影片資料每天被人們製作產生及分享。因此如何管理,並從大量的影片資料中快速地找到想要的影片,成為了一個重要的課題。在傳統影片搜尋系統中,使用者多半是透過輸入文字,來從影片的標題資訊、註解甚至是影片被嵌入網頁中的文字資訊尋找資料。並依據字詞相關度、交互點閱次數來進行排序。有時這些文字資料並不能代表影片內容中所含有的全部資訊。
因此我們建置了一個能讓使用者透過指定影片中物體移動的軌跡、物體外觀等資訊進行更直觀的搜尋。進而提供保全與軍事系統之入侵行為分析、未來將可應用於攜帶型裝置。


The prevalence of video recording capability, either on surveillance or mobile devices, has contributed to the popularity of video data. As a result, video management has become relatively more important than before, and in particular, video retrieval has been one of the main issues in this regard. Traditional video retrieval systems take texts as the inputs to look for similar information from the title, annotation or embedded textual data of a video, in a way that is very similar to the keyword search adopted by a common search engine. However, the lack of visual information specification during a search often makes the result rather inaccurate or even useless. For this reason, video retrieval systems with inputs being images or videos have also been proposed; nevertheless, the associated ambiguity and complexity have made the implementation of such systems relatively difficult, and thus not as successful as desired. To address this, in this thesis, we propose to perform a video retrieval of a desired object through the inputs of its trajectory and/or appearance, together with the help of a 3D graphical user interface for more intuitive interactions, more satisfactory results can be achieved. And we firmly believe that such a framework could serve as the foundation for behavior analysis to be used in many surveillance systems.

1. 緒論 1 1.1 研究動機 1 1.2 系統架構 1 1.3 系統貢獻 3 2. 文獻探討 4 2.1影片搜尋 4 2.2物體追蹤 4 2.3軌跡比對 5 2.4物體外觀比對 5 3.移動物體抽出 6 3.1 背景分離法(Background Subtraction) 6 3.2 去除陰影 11 3.3 物體追蹤 12 4. 軌跡處理 13 4.1曲線耦合重建(Curve Fitting Reconstruction) 13 4.2曲線比對 15 4.3軌跡分群 18 5. 物體外觀特徵比對 19 5.1 SURF(Speeded Up Robust Features)特徵 19 5.2 Bag-of-words法與TF-IDF係數 19 6. 實驗結果 21 6.1 實驗系統環境 21 6.2 測試資料 22 6.3 系統畫面 25 7. 結論與未來展望 28 參考文獻 29

[1] Han, M., Xu, W., & Gong, Y. (2006, October). Video object segmentation by motion-based sequential feature clustering. Proceedings of the 14th annual ACM international conference on Multimedia (pp. 773-782). ACM.
[2] Liu, F., & Gleicher, M. (2009, June). Learning color and locality cues for moving object detection and segmentation. CVPR 2009. Conference on Computer Vision and Pattern Recognition (pp. 320-327). IEEE.
[3] Wei, Y., Sun, J., Tang, X., & Shum, H. Y. (2007, October). Interactive offline tracking for color objects. ICCV 2007. 11th International Conference on Computer Vision (pp. 1-8). IEEE.
[4] Kalal, Z., Matas, J., & Mikolajczyk, K. (2009, September). Online learning of robust object detectors during unstable tracking. ICCV Workshops. 2009 12th International Conference on Computer Vision Workshops (pp. 1417-1424). IEEE.
[5] de la Torre Frade, F., Martubez M., E., Santamria P, M. E., Moran M, J. A (1997). Moving object detection and tracking system: a real-time implementation. In 16° Colloque sur le traitement du signal et des images, FRA, 1997. GRETSI, Groupe d’Etudes du Traitement du Signal et des Images.
[6] Yilmaz, A. (2007, June). Object tracking by asymmetric kernel mean shift with automatic scale and orientation selection. CVPR'07. Conference on Computer Vision and Pattern Recognition (pp. 1-6). IEEE.
[7] Jacques, J. C. S., Jung, C. R., & Musse, S. R. (2005, October). Background subtraction and shadow detection in grayscale video sequences. SIBGRAPI 2005. 18th Brazilian Symposium on Computer Graphics and Image Processing (pp. 189-196). IEEE.
[8] Sanin, A., Sanderson, C., & Lovell, B. C. (2010, August). Improved shadow removal for robust person tracking in surveillance scenarios. 2010 20th International Conference on Pattern Recognition (pp. 141-144). IEEE.
[9] Chang, S. F., Chen, W., Meng, H. J., Sundaram, H., & Zhong, D. (1998). A fully automated content-based video search engine supporting spatiotemporal queries. IEEE Transactions on Circuits and Systems for Video Technology, 8(5), 602-615.
[10] Chen, W., & Chang, S. F. (1999, December). Motion trajectory matching of video objects. In Electronic Imaging (pp. 544-553). International Society for Optics and Photonics.
[11] Zheng, J. B., Feng, D. D., & Zhao, R. C. (2005, October). Trajectory matching and classification of video moving objects. 2005 7th Workshop on Multimedia Signal Processing (pp. 1-4). IEEE.
[12] Junejo, I. N., Javed, O., & Shah, M. (2004, August). Multi feature path modeling for video surveillance. ,ICPR 2004. Proceedings of the 17th International Conference on Pattern Recognition (Vol. 2, pp. 716-719). IEEE.
[13] Dornaika, F., & Chakik, F. (2010, August). Efficient Object Detection and Matching using Feature Classification. (ICPR), 2010 20th International Conference on Pattern Recognition (pp. 3073-3076). IEEE.
[14] Alahi, A., Marimon, D., Bierlaire, M., & Kunt, M. (2008, October). A master-slave approach for object detection and matching with fixed and mobile cameras. ICIP 2008. 15th International Conference on Image Processing (pp. 1712-1715). IEEE..
[15] Bassiouny, M., & El-Saban, M. (2011, January). Object matching using feature aggregation over a frame sequence, 2011 Workshop on Applications of Computer Vision (pp. 95-102). IEEE.
[16] Teixeira, L. F., & Corte-Real, L. (2009). Video object matching across multiple independent views using local descriptors and adaptive learning. Pattern Recognition Letters, 30(2), 157-167.
[17] Kang, H., Efros, A. A., Hebert, M., & Kanade, T. (2009, June). Image matching in large scale indoor environment. CVPR Workshops 2009. Computer Society Conference on Computer Vision and Pattern Recognition (pp. 33-40). IEEE.
[18] Bastan, M., Cam, H., Gudukbay, U., & Ulusoy, O. (2010). BilVideo-7: an MPEG-7-compatible video indexing and retrieval system. Multimedia, IEEE, 17(3), 62-73..
[19] DeMenthon, D., & Doermann, D. (2003). Video retrieval using spatio-temporal descriptors. In Proceedings of the eleventh ACM international conference on Multimedia (pp. 508-517). ACM.
[20] Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. International journal of computer vision, 60(2), 91-110.
[21] Sillito, R. R., & Fisher, R. B. (2008, October). Semi-supervised learning for anomalous trajectory detection. In Proc. BMVC (pp. 1035-1044).
[22] Hsieh, J. W., Yu, S. L., & Chen, Y. S. (2006). Motion-based video retrieval by trajectory matching. Circuits and Systems for Video Technology, IEEE Transactions on, 16(3), 396-409..
[23] Jeannin, S., & Divakaran, A. (2001). MPEG-7 visual motion descriptors. IEEE Transactions on Circuits and Systems for Video Technology, 11(6), 720-724.
[24] Friedman, N., & Russell, S. (1997, August). Image segmentation in video sequences: A probabilistic approach. Proceedings of the Thirteenth conference on Uncertainty in artificial intelligence (pp. 175-181). Morgan Kaufmann Publishers Inc..
[25] Zivkovic, Z. (2004, August). Improved adaptive Gaussian mixture model for background subtraction. ICPR 2004. Proceedings of the 17th International Conference on Pattern Recognition (Vol. 2, pp. 28-31). IEEE..
[26] Pham, V., Vo, P., & Hung, V. T. (2010, November). GPU implementation of extended gaussian mixture model for background subtraction. Research, Innovation, and Vision for the Future (RIVF), 2010 RIVF International Conference on Computing and Communication Technologies (pp. 1-4). IEEE.
[27] Lucas, B. D., & Kanade, T. (1981, April). An iterative image registration technique with an application to stereo vision. Proceedings of the 7th international joint conference on Artificial intelligence.
[28] Stauffer, C., & Grimson, W. E. L. (2000). Learning patterns of activity using real-time tracking. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 22(8), 747-757.
[29] Gotoh, T.. and Miyoshi, T. and Takaya, Y. and Edamitsu T.., (1994), B-Spline Curve Fitting onto Measured Point Data under Consideration of Curvature, Journal of the International Societies for Precision Engineering and Nanotechnology, vol.60, no. 7, pp.964-968
[30] Dee, H. M., & Hogg, D. C. (2005, September). On the feasibility of using a cognitive model to filter surveillance data. AVSS 2005. Conference on Advanced Video and Signal Based Surveillance (pp. 34-39). IEEE.
[31] Salton, G., & Buckley, C. (1988). Term-weighting approaches in automatic text retrieval. Information processing & management, 24(5), 513-523.
[32] Freeman, H. (1961). On the encoding of arbitrary geometric configurations. Electronic Computers, IRE Transactions on, (2), 260-268.
[33] Bay, H., Ess, A., Tuytelaars, T., & Van Gool, L. (2008). Speeded-up robust features (SURF). Computer vision and image understanding, 110(3), 346-359.
[34] Agarwal, S., Awan, A., & Roth, D. (2004). Learning to detect objects in images via a sparse, part-based representation. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(11), 1475-1490.
[35] Yu, T., Wu, Y., Krahnstoever, N. O., & Tu, P. H. (2008, June). Distributed data association and filtering for multiple target tracking. CVPR 2008. Conference on Computer Vision and Pattern Recognition (pp. 1-8). IEEE.
[36] Roth, P. M., Grabner, H., Skocaj, D., Bischol, H., & Leonardis, A. (2005, October). On-line conservative learning for person detection, 2nd Joint International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance (pp. 223-230). IEEE.
[37] Grabner, H., Roth, P. M., & Bischof, H. (2007, October). Is pedestrian detection really a hard task. Proc. IEEE Intern. Workshop on Performance Evaluation of Tracking and Surveillance (pp. 1-8)
[38] Chen, X., Schonfeld, D., & Khokhar, A. (2008, June). Robust null space representation and sampling for view-invariant motion trajectory analysis. CVPR 2008. IEEE Conference on Computer Vision and Pattern Recognition (pp. 1-6). IEEE.
[39] Hunter, R. S. (1958). Photoelectric color difference meter. Josa, 48(12), 985-993.
[40] Li, C., Han, Z., Ye, Q., & Jiao, J. (2011, August). Abnormal Behavior Detection via Sparse Reconstruction Analysis of Trajectory. (ICIG), 2011 Sixth International Conference on Image and Graphics (pp. 807-810). IEEE.
[41] Freund, Y., & Schapire, R. E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of computer and system sciences, 55(1), 119-139.
[42] Hamdoun, O., Moutarde, F., Stanciulescu, B., & Steux, B. (2008, September). Person re-identification in multi-camera system by signature based on interest point descriptors collected on short video sequences. ICDSC 2008. Second ACM/IEEE International Conference on Distributed Smart Cameras (pp. 1-6). IEEE.
[43] Bak, S., Corvee, E., Bremond, F., & Thonnat, M. (2010, August). Person re-identification using haar-based and dcd-based signature. AVSS 2010 Seventh IEEE International Conference on Advanced Video and Signal Based Surveillance (pp. 1-8). IEEE.
[44] Cheng, D. S., Cristani, M., Stoppa, M., Bazzani, L., & Murino, V. (2011). Custom pictorial structures for re-identification. In British Machine Vision Conference (pp. 68-1)
[45] Bazzani, L., Cristani, M., Perina, A., & Murino, V. (2012). Multiple-shot person re-identification by chromatic and epitomic analyses. Pattern Recognition Letters, 33(7), 898-903.
[46] Cheng, D. S., Cristani, M., Stoppa, M., Bazzani, L., & Murino, V. (2011). Custom pictorial structures for re-identification. In British Machine Vision Conference (pp. 68-1).

無法下載圖示 全文公開日期 2018/07/18 (校內網路)
全文公開日期 本全文未授權公開 (校外網路)
全文公開日期 本全文未授權公開 (國家圖書館:臺灣博碩士論文系統)
QR CODE