簡易檢索 / 詳目顯示

研究生: 古佳閔
JIA-MIN GU
論文名稱: 基於自動化相片事件分類在個人相簿之研究
A Study of Personal Photo Organization Based on Automatic Photo Event Classification
指導教授: 吳怡樂
Yi-Leh Wu
口試委員: 陳建中
Jiann-Jone Chen
何瑁鎧
Maw-Kae Hor
唐政元
Cheng-Yuan Tang
學位類別: 碩士
Master
系所名稱: 電資學院 - 資訊工程系
Department of Computer Science and Information Engineering
論文出版年: 2013
畢業學年度: 101
語文別: 英文
論文頁數: 40
中文關鍵詞: 個人相簿可交換圖像文件臉部偵測相片事件分類支持向量機
外文關鍵詞: Personal photo collection, exif, face detection, event classification, SVM
相關次數: 點閱:335下載:3
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報

相片事件在個人相簿管理是很重要的議題。因為使用者會為特定事件去拍照,所以相片事件容易被使用者記住。然而,相片事件標註目前有些問題需要改善。第一是缺乏有用的偵測模組來偵測相片資訊,第二為缺乏和相片事件有高度關聯的資訊。在本文中提出「自動化相片事件分類架構」來解決上述問題。本文主要貢獻是利用簡單的時間資訊像是月份、平日/假日、白天/晚上、場景資訊和相片事件的關聯性、以及有人/無人和室內/室外與事件之間的關係來自動分類相片事件。「自動化相片事件標註架構」由四個主要部分組成:包含時間資訊偵測、場景偵測模組、其他偵測模組以及相片事件偵測模組。這些偵測模組均可自動偵測相片資訊以做相片事件分類和標記。實驗結果均顯示「自動化相片事件標註架構」之方法,對於相簿事件標註是有幫助的。相片事件內容有球賽、海灘娛樂、生日、聖誕節、用餐、家庭時光、畢業、滑雪、城市旅行、結婚、郊外健行。


Photo events in personal photo collection management are important because people take photos for the specific events. Event is one of important cues to retrieve photos which are well remembered by people. However, event annotation work is still has some problems. The first problem is lack of detectors to detect the cues of photos automatically. The second problem is lack of useful cues which are highly related to photo events. In this work, an automatic photo event annotation framework is proposed to address these problems. Our main contribution is to annotate photo events automatically using simple temporal cues such as month, weekday/weekends, day/night, scenes, and people/non-people and indoor/outdoor. The framework was composed by four main components: the temporal detector, the scene detector, the other detector and event detector. The experimental results show that the methods in the framework are useful to detect the event of personal photos. According to the contents of events, we detect ballgames, beachfun, birthday, Christmas, dining, family time, graduation, skiing, urbantour, wedding, and yardprk.

論文摘要 I Abstract II Contents III List of Figures IV List of Tables V Chapter 1. Introduction 1 Chapter 2. Temporal Detector 4 2.1 Exif 4 2.2 Extracted Temporal Information 5 Chapter 3. Scene Detector 6 3.1 Support Vector Machine (SVM) 6 3.2 Visual Feature (GIST) 9 3.3 Scenes 11 Chapter 4. Other Detector 14 4.1 Indoor/Outdoor 14 4.2 People/Non-people 15 Chapter 5. Event Detector 17 5.1 Events 17 5.2 Results of Event Classification 19 Chapter 6. Experiments 21 6.1 Datasets 21 6.2 Evaluation on Scene Detector 22 6.3 Evaluation on Other Detector 24 6.4 Evaluation on Event Detector 25 Chapter 7. Conclusions and Future Works 28 References 29 Appendix A –The Confusion Matrices of Detectors 32 Appendix B – The Confusion Matrices of Event Detector 33 Appendix C - The Confusion Matrices of Photo Collections (5 photos detection) 34 Appendix D - The Confusion Matrices of Photo Collections 35

[1] K. Rodden, and K. R. Wood, “How Do People Manage Their Digital Photographs?,” Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, 2003.
[2] M. Naaman, S. Harada, Q. Wang, H. Garcia-Molina, and A. Paepcke, “Context data in geo-referenced digital photo collections,” Proceedings of the 12th annual ACM international conference on Multimedia, 2004.
[3] P. Sandhaus and S. Boll, “Semantic analysis and retrieval in personal and social photo collections,” Multimedia Tools and Applications, vol. 51, no. 1, 2010.
[4] L. Hollink, G. Schreiber, B. Wielinga, and J. Wielemaker, “Semantic annotation of image collections,” Workshop on Knowledge Markup and Semantic Annotation, 2003.
[5] B. C. Russell, A. Torralba, K. P. Murphy, and W. T. Freeman, “LabelMe: A Database and Web-Based Tool for Image Annotation,” International Journal of Computer Vision, 2007.
[6] A. Hanbury, “A survey of methods for image annotation,” Journal of Visual Languages & Computing, 2008.
[7] R. Pagare and A. Shinde, “A Study on Image Annotation Techniques,” International Journal of Computer Applications, vol. 37, no. 6, 2012.
[8] A. C. Loui and A. Savakis, “Automated event clustering and quality screening of consumer pictures for digital albuming,” IEEE Transactions on Multimedia, 2003.
[9] M. Cooper, J. Foote, and L. Wilcox “Temporal event clustering for digital photo collections,” ACM Trans. Multimedia Comput. Commun. Appl. 1, 2005.
[10] L.-J. Li and F.-F. Li, “What, where and who? Classifying events by scene and object recognition,” IEEE 11th International Conference on Computer Vision, 2007.
[11] M. Das and A. C. Loui, “Event Classification in Personal Image Collections,” IEEE International Conference on Multimedia and Expo, 2009.
[12] C. S. Firan, M. Georgescu, W. Nejdl, and R. Paiu, “Bringing Order to Your Photos: Event-Driven Classification of Flickr Images Based on Social Knowledge Categories and Subject Descriptors,” Proceedings of the 19th ACM international conference on Information and knowledge management (CIKM’10), 2010.
[13] F. Tang, D. R. Tretter, and C. Willis, “Event classification for personal photo collections,” IEEE International Conference on Acoustics, Speech and Signal Processing, 2011.
[14] I. Tankoyeu, J. Paniagua, J. Stöttinger, and F. Giunchiglia, “Event Detection and Scene Attraction by Very Simple Contextual Cues,” Proceedings of the 2011 joint ACM workshop on Modeling and representing events, 2011.
[15] M. Naaman, Y. J. Song, A. Paepcke, and H. Garcia-Molina, “Automatic Organization for Digital Photographs with Geographic Coordinates,” in Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries, 2004.
[16] P. Serdyukov, V. Murdock, and R. V. Zwol. “Placing flickr photos on a map,” Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, 2009.
[17] L. Cao, J. Luo, H. Kautz, and T. S. Huang, “Image annotation within the context of personal photo collections using hierarchical event and scene models,” IEEE Transactions on Multimedia, vol. 11, no. 2, 2009.
[18] C. Cusano, G. Ciocca, and R. Schettini, “Image annotation using SVM,” Proceedings of SPIE 5304, Internet Imaging V, 2003.
[19] G. Anthony, H. Gregg, and M. Tshilidzi, “Image classification using SVMs: one-against-one v.s. one-against-all,” CoRR, 2007.
[20] A. Oliva and A. Torralba, “Modeling the Shape of the Scene: A Holistic Representation of the Spatial Envelope,” International Journal of Computer Vision, vol. 42, no. 3, 2001.
[21] A. Oliva,“Gist of a scene,” The Encyclopedia of Neurobiology of Attention. L. Itti, G. Rees, and J.K. Tsotsos (Eds.), Elsevier, 2005.
[22] M. Douze, H. Jégou, H. Sandhawalia, L. Amsaleg, and C. Schmid. “Evaluation of GIST descriptors for web-scale image search,” Proceedings of the ACM International Conference on Image and Video Retrieval (CIVR '09), 2009.
[23] F.-F. Li and P. Perona, “A Bayesian Hierarchical Model for Learning Natural Scene Categories,” IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005.
[24] N. Serrano, A. Savakis, and J. Luo, “A computationally efficient approach to indoor/outdoor scene classification,” Proceedings of 16th International Conference on Pattern Recognition, 2002.
[25] A. Payne and S. Singh, “Indoor vs. outdoor scene classification in digital photographs,” Pattern Recognition, 2005.
[26] C. Pavlopoulou, C. Hill, and S. X. Yu, “Indoor-Outdoor Classification with Human Accuracies: Image or Edge Gist?,” IEEE Conference on Computer Vision and Pattern Recognition, 2009.
[27] A. Quattoni and A. Torralba, “Recognizing Indoor Scenes,” IEEE Conference on Computer Vision and Pattern Recognition, 2009.
[28] P. Zhou, Y. Zheng, Z. Li, M. Li, and G. Shen, “IODetector: A Generic Service for Indoor Outdoor Detection,” Proceedings of the 10th ACM Conference on Embedded Network Sensor Systems, 2012.
[29] P. Viola and M. Jones, “Robust real-time object detection,” in International Journal of Computer Vision, 2001.
[30] L. Cao, J. Luo, H. Kautz, and T. S. Huang “Annotating collections of photos using hierarchical event and scene models,” IEEE Conference on Computer Vision and Pattern Recognition, 2008.

QR CODE