簡易檢索 / 詳目顯示

研究生: 林庭佑
Ting-You Lin
論文名稱: 一個基於卷積神經網路及層次序走訪之攝影構圖指引方法
A Photographic Composition Guidance Method Based on Convolutional Neural Network and Level-order Traversal
指導教授: 范欽雄
Chin-Shyurng Fahn
口試委員: 范欽雄
Chin-Shyurng Fahn
王聖智
Shen-Jyh Wang
謝君偉
Jun-Wei Hsieh
吳怡樂
Yi-Leh Wu
學位類別: 碩士
Master
系所名稱: 電資學院 - 資訊工程系
Department of Computer Science and Information Engineering
論文出版年: 2018
畢業學年度: 106
語文別: 英文
論文頁數: 63
中文關鍵詞: 攝影構圖卷積神經網路層次序走訪構圖指引
外文關鍵詞: Photography composition, Convolutional neural network, Level-order traversal, Composition guidance
相關次數: 點閱:219下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 隨著科技的進步,數位相機與智慧型拍照手機逐漸普及化,拍照成為人們生活中不可或缺的一部分。該如何拍攝出美麗的照片是一門很重要的學問,需要花費一番苦心去學習,但並不是每個人都有時間去學習攝影技巧。
    構圖是攝影中非常重要的一部分,對於照片的美學有一定的關連性。一個很好的構圖可以讓我們專注於攝影師想要在攝影主題中傳達的內容。為了讓幫助攝影的初學者學習如何拍攝構圖,在本篇論文中,我們研究了一套攝影構圖的指引方法。攝影的初學者能夠透過這個方法縮減學習拍攝構圖的時間。
    在本篇論文中,我們使用卷積神經網路的分類系統並且加上邊緣特徵輔助預測結果。透過這個方法來做構圖的分類並將預測機率當作構圖的評分。最後利用層次序走訪的方法結合上述的構圖評分原則來找出指引的區域。
    實驗中使用七種類型的構圖當作指引的對象。本文提出的方法可以有效地對影像的攝影構圖類型進行分類,召回率達95.0%以及精確率達94.8%。從實驗結果可以得知,被指引後的影像會更趨近於被指引的構圖類型。


    With the progress of science and technology, digital cameras and smart phone gradually popularized, photography has become an indispensable part of people's lives. How to take a beautiful picture is a very important knowledge, and it takes a lot of time to learn, but not everyone has time to learn photography skills.
    Composition is a very important part of photography and it has a certain relevance to the photography aesthetics. A good composition allows viewers to focus on the subject of the photography which the photographer wants to convey. In order to support photography beginners to learn how to take pictures with a good composition, we study a photographic composition guidance method in this paper. Photography beginners can shorten the time to learn photography composition.
    In this paper, we use a convolutional neural network classification system and add edge features to support composition prediction. We use the training model to classify photography composition and the probability of prediction is used as the score of the composition. Finally, the level-order traversal method is used to find out the guidance region with the above scoring method.
    Seven types of composition are used as guidelines in this experiment. The method proposed in this paper can effectively classify the photographic composition of images. The recall rate is 95.0% and the precision rate is 94.8%. It can be known from the experimental results that the guidance image will be closer to the type of guidance composition.

    中文摘要 i Abstract ii 致謝 iii Contents iv List of Figures vii List of Tables ix Chapter 1 Introduction 1 1.1 Overview 1 1.2 Motivation 2 1.3 Type of photography composition 3 1.3.1 Central Composition 3 1.3.2 Diagonal Composition 3 1.3.3 Horizontal Composition 4 1.3.4 Perspective Composition 4 1.3.5 Rule of Thirds Composition 5 1.3.6 Symmetry Composition 5 1.3.7 Vertical Composition 6 1.4 System Description 6 1.5 Thesis Organization 9 Chapter 2 Related Work 10 2.1 Image characteristics 10 2.1.1 Colorfulness 10 2.1.2 Contrast 11 2.1.3 Sharpness 11 2.1.4 Saliency 11 2.1.5 Composition 11 2.2 Reviews of Photographic Composition Classification 12 2.3 Composition Improvement and Photo Guidance 14 Chapter 3 Composition Guidance 16 3.1 Features for Composition Guidance 16 3.1.1 Sobel operator 16 3.1.2 Otsu’s method 19 3.2 Convolutional Neural Network 21 3.2.1 Convolutional Layer 22 3.2.2 Pooling Layer 24 3.2.3 Neuron Dropout 26 3.2.4 Fully Connected Layer 27 3.2.5 Activation Function 27 3.2.6 Our CNN Model 29 3.3 Mask Scanning by Level-order Traversal 30 3.3.1 Composition scoring mechanism 30 3.3.2 Level-order Traversal 30 Chapter 4 Experimental Results and Discussions 32 4.1 Experimental Setup 32 4.2 Results of Classified Compositions 33 4.3 Results of Composition Guidance 39 4.3.1 Results of Scanning Method 39 4.3.2 Guidance Results of Central Composition 39 4.3.3 Guidance Results of Diagonal Composition 40 4.3.4 Guidance Results of Horizontal Composition 41 4.3.5 Guidance Results of Perspective Composition 42 4.3.6 Guidance Results of Rule of Thirds Composition 43 4.3.7 Guidance Results of Symmetry Composition 44 4.3.8 Guidance Results of Vertical Composition 45 4.3.9 Guidance Results that Cannot Be Improved 46 Chapter 5 Conclusions and Future Work 47 5.1 Conclusions 47 5.2 Future Work 48 References 49

    [1] G. Peters, “Aesthetic primitives of images for visualization,” in Proceedings of the 11th International Conference Information Visualization, Zurich, Switzerland, pp. 316–325, Jul 2007.
    [2] R. Datta, D. Joshi, J. Li, and J. Z. Wang, "Studying aesthetics in photographic images using a computational approach," in Proceedings of the European Conference on Computer Vision, Graz, Austria, pp. 288-301, May 2006.
    [3] H. H. Su, T. W. Chen, C. C. Kao, W. H. Hsu, and S. Y. Chien, "Scenic photo quality assessment with bag of aesthetics-preserving features," in Proceedings of the 19th ACM International Conference on Multimedia, Scottsdale, Arizona, pp. 1213-1216, Nov. 2011.
    [4] S. Dhar, V. Ordonez, and T. L. Berg, "High level describable attributes for predicting aesthetics and interestingness," in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Colorado Springs, CO, pp. 1657-1664, Jun 2011.
    [5] K. Yan, T. Xiaoou, and J. Feng, "The design of high-level features for photo Quality Assessment," in Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, New York, NY, pp. 419-426, Jun 2006.
    [6] E. Mavridaki and V. Mezaris, "A comprehensive aesthetic quality assessment method for natural images using basic rules of photography," in Proceedings of the IEEE International Conference on Image Processing, Quebec City, Canada, pp. 887-891, Sep 2015.
    [7] M. Nishiyama, T. Okabe, I. Sato, and Y. Sato, “Aesthetic quality classification of photographs based on color harmony,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Colorado Springs, CO, pp. 33–40, Jun 2011.
    [8] C Song, B Zhou, and W Guo “Aesthetic quality assessment of photographic images,” in Proceedings of the IEEE 12th World Congress on In Intelligent Control and Automation, Guilin, China, pp. 3088-3093, Jun 2016.
    [9] X. Hou and L. Zhang, “Saliency detection: A spectral residual approach,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Minneapolis, MN, pp. 1-8, Jun 2007.
    [10] M. M. Cheng, N. J. Mitra, X. Huang, P. H. S. Torr, and S. M. Hu, “Global Contrast Based Salient Region Detection,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 37, no. 3, pp. 569-582, Mar 2015.
    [11] L. Mai, H. Le, Y. Niu, and F. Liu, “Rule of Thirds Detection from Photograph,” in Proceedings of the IEEE International Symposium on Multimedia, Dana Point, CA, pp. 91-96, Dec 2011.
    [12] L. Bai, X. Wang and Y. Chen, “Landscape Image Composition Analysis Based on Image Processing,” in Proceedings of the IEEE International Conference on Computer Science and Automation Engineering, Zhangjiajie, China, Vol. 2, pp. 787-790, May 2012.
    [13] J. H. Huang, “A fuzzy logic approach for recognition of photographic compositions,” M.S. Thesis, Dept. Mathematical Sciences, National Chengchi University, Taipei, Taiwan, 2007.
    [14] Y. T. Lin, “A photo composition classification system based on supervised learning,” M.S. Thesis, Dept. Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan, 2014.
    [15] J. H. Liu. “An Automatic Aesthetic Photo Composition Classification System Based on a Random Forest” M.S. Thesis, Dept. Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan, 2017.
    [16] X. Tang, W. Luo, and X. Wang, "Content-Based Photo Quality Assessment," IEEE Transactions on Multimedia, vol. 15, no. 8, pp. 1930-1943, Dec 2013.
    [17] L. Liu, R. Chen, L. Wolf and D. Cohen-Or, “Optimizing Photo Composition,” Computer Graphics Forum, vol. 29, no. 2, pp. 469-478, Jun 2010.
    [18] S. Ma, Y. Fan, and C. W. Chen, “Finding your spot: a photography suggestion system for placing human in the scene,” in Proceedings of the IEEE International Conference on Image Processing, Paris, France, pp556-560, Oct 2014.
    [19] S. Ma, Y. Fan, and C. W. Chen, “Pose maker: A pose recommendation system for person in the landscape photographing,” in Proceedings of the 22nd ACM international conference on Multimedia, New York, NY, pp. 1053-1056, Nov 2014.
    [20] Y. Zhang, Q. Huang, L. Qin, S. Zhao, X. Lu, X. Sun, and H. Yao “Strategy for aesthetic photography recommendation via collaborative composition model,” IET Computer Vision, vol. 9, no.5, pp. 691-698, Sep 2015.
    [21] S. Bhattacharya, R. Sukthankar, and M. Shah, “A framework for photoquality assessment and enhancement based on visual aesthetics,” in Proceedings of the ACM International Conference on Multimedia, Firenze, Italy, pp. 271–280. Oct 2010
    [22] K. Y. Lo, K. H. Liu, and C. S. Chen, “Assessment of photo aesthetics with efficiency,” in Proceedings of the IEEE International Conference on Pattern Recognition, Tsukuba, Japan, pp. 2186–2189, Nov 2012.
    [23] L. Mai, H. Jin, and F. Liu, “Composition-preserving deep photo aesthetics assessment,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, pp. 497-506, Jun 2016.
    [24] L. Zhang, M. Song, Q. Zhao, X. Liu, J. Bu, and C. Chen, “Probabilistic graphlet transfer for photo cropping,” IEEE Transactions on Image Processing, vol. 22, no. 2, pp. 802–815, Oct 2012.
    [25] L. K. Wong and K. L. Low, “Saliency-enhanced image aesthetics class prediction,” in Proceedings of the IEEE International Conference on Image Processing, Cairo, Egypt, pp. 997-1000, Nov 2009.
    [26] P. Obrador, L. Schmidt-Hackenberg, and N. Oliver, “The role of image composition in image aesthetics,” in Proceedings of the IEEE International Conference on Image Processing, Hong Kong, China, pp. 3185-3188, Sep 2010.

    無法下載圖示 全文公開日期 2023/07/25 (校內網路)
    全文公開日期 2028/07/25 (校外網路)
    全文公開日期 2023/07/25 (國家圖書館:臺灣博碩士論文系統)
    QR CODE