
Graduate student: 楊承翰
Cheng-Han Yang
Thesis title: 基於字幕與使用者回饋之影片推薦與回顧
Video Recall and Recommendation based on Caption and User Feedback
Advisor: 楊傳凱
Chuan-kai Yang
Oral defense committee: 楊立偉
Li-wei Yang
林柏慎
Bor-shen Lin
Degree: Master
Department: College of Management - Department of Information Management
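Year of publication: 2015
Academic year of graduation: 103 (ROC calendar)
Language: Chinese
Number of pages: 59
Chinese keywords: 光學文字辨識, 關鍵影格提取, 關鍵詞提取, 回憶, 推薦
English keywords: OCR, Extract keyframe, Extract keyword, Recall, Recommendation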

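The demand for image processing keeps growing as digital video becomes more widespread, and effective methods are needed to browse and view it.
In the field of movie content summarization and categorized recommendation, users are usually required to label and extract the key content themselves. Our system aims to automatically read the videos a user has watched and analyze their content. Some time after watching a video, a user typically retains only a vague memory of it, and we want to let users draw on that vague memory to find the video's information again.
Using keyframes and keywords taken from the video's captions, the system helps users recall the content and plot of videos they have watched. Without any manual processing, users can quickly recall information about a video from the extracted browsing content, learn which categories they prefer, and receive recommendations for related videos.
Besides helping users recall the plot, the system also makes simple recommendations of related videos based on the user's preferences.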


As more and more digital videos become available, we need powerful ways to handle, browse, and view them. For movies in particular, we provide summarization, classification, and recommendation.
Users usually need to label and extract the content themselves. Our system automatically loads and analyzes the videos that a user has already watched.
Some time after watching a movie, people typically have only vague memories of its content; we use those vague memories to recall information about the video.
The system lets a user browse keyframes and captions to recall the video content and other related information, and it can also recommend videos based on the user's feedback.

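1. Introduction
2. Literature Review
2.1. Caption Extraction
2.2. Typo Detection
2.3. Keyframe Extraction
2.4. Video Search and Recommendation
3. System Implementation
3.1 System Flowchart
3.2 Preprocessing
    1. Video Title and Category
    2. Typo Checking
    3. OCR Training
    4. Reading Historical Data
3.3 Keyframe Extraction
    1. Clustering with Incremental Clustering
    2. Filtering Keyframes
    3. Output
3.4 Caption Extraction
    1. Obtaining Caption Information
    2. Caption Cropping and Filtering
    3. OCR Recognition
    4. Output
4. Presentation of System Results
4.1. Preview
    1. Summary Preview
    2. Advanced Preview
4.2. Finding Similar Videos and Recommendation
    1. Basis for Finding Similar Videos
    2. Recommendation
4.3. User Information
    1. Videos and Captions
    2. Charts
5. System Environment and Evaluation
5.1. System Environment
5.2. Comparison
    1. Keyframe Extraction
    2. Caption Extraction
    3. Video Search and Visual Presentation
6. Conclusion and Future Work
7. References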

