
Graduate student: Tai-cheng Chen (陳泰丞)
Thesis title: Music Genre Classification Based on Clustered Melody Patterns (基於旋律片段分群之曲風分類方法)
Advisor: Bor-shen Lin (林伯慎)
Oral defense committee: Hong-yen Gu (古鴻炎), Chuan-kai Yang
Degree: Master's
Department: College of Management - Department of Information Management
Publication year: 2012
Graduation academic year: 100 (ROC calendar)
Language: Chinese
Pages: 46
Keywords: musical genre classification, N-gram, conditional smoothing, hierarchical clustering
  • This thesis proposes a method for automatic music genre classification based on melodic patterns. Three statistical classifiers are compared: a correlation-based classifier, an artificial neural network (ANN), and a k-nearest-neighbor (KNN) classifier. In the baseline experiment the ANN achieves the highest accuracy, 67.5%, while the correlation-based classifier proposed in this thesis reaches 66.2%.
    The correlation-based classifier is then improved by clustering the melodic patterns. When data are sparse, the patterns within a cluster share their probability estimates, and this smoothing raises the accuracy to 70.0%. Finally, a conditional smoothing scheme is proposed that uses the amount of training data available as the criterion for deciding whether to rely on the cluster-level correlation statistics. In the experiments, treating the melodic patterns ranked in the top 20% by occurrence count as having sufficient statistics raises the accuracy further to 70.67%, which indicates that conditional smoothing helps.
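The conditional smoothing idea summarized in the abstract can be illustrated with a small sketch. It is not the thesis's implementation: the abstract gives no formulas, so the sketch substitutes a per-genre relative frequency for the correlation statistic and sums these values over a song's patterns. The names `train`, `classify`, `cluster_of`, and `top_fraction`, and the toy data, are all hypothetical. Patterns ranked in the top fraction by occurrence count use their own statistics, and all other patterns fall back to the pooled statistics of their cluster.

```python
from collections import Counter, defaultdict


def train(songs, cluster_of, top_fraction=0.2):
    """songs: list of (patterns, genre); cluster_of: pattern -> cluster id (assumed given)."""
    pattern_genre = defaultdict(Counter)  # pattern -> genre -> count
    cluster_genre = defaultdict(Counter)  # cluster -> genre -> count
    for patterns, genre in songs:
        for p in patterns:
            pattern_genre[p][genre] += 1
            cluster_genre[cluster_of[p]][genre] += 1

    # Rank patterns by total occurrence count (ties broken alphabetically);
    # the top fraction is treated as having enough data to be trusted.
    ranked = sorted(pattern_genre, key=lambda p: (-sum(pattern_genre[p].values()), p))
    n_trusted = max(1, int(len(ranked) * top_fraction))
    return pattern_genre, cluster_genre, set(ranked[:n_trusted])


def relative(counter):
    """Turn genre counts into relative frequencies (stand-in for a correlation score)."""
    total = sum(counter.values())
    return {genre: count / total for genre, count in counter.items()}


def classify(patterns, model, cluster_of, genres):
    pattern_genre, cluster_genre, trusted = model
    scores = dict.fromkeys(genres, 0.0)
    for p in patterns:
        if p in trusted:
            stats = relative(pattern_genre[p])  # well observed: own statistics
        elif p in cluster_of and cluster_of[p] in cluster_genre:
            stats = relative(cluster_genre[cluster_of[p]])  # sparse: cluster statistics
        else:
            continue  # pattern with no usable statistics
        for genre, value in stats.items():
            scores[genre] += value
    return max(genres, key=lambda g: scores[g])


# Toy data: patterns are short note strings, clusters are assigned by hand.
songs = [
    (["AAB", "ABC", "AAB"], "jazz"),
    (["AAB", "BCA"], "jazz"),
    (["CAA", "CAB", "CAA"], "rock"),
    (["CAA", "CBA"], "rock"),
]
cluster_of = {"AAB": 0, "ABC": 0, "BCA": 0, "ABD": 0,
              "CAA": 1, "CAB": 1, "CBA": 1, "CAC": 1}
model = train(songs, cluster_of, top_fraction=0.2)
print(classify(["ABD"], model, cluster_of, ["jazz", "rock"]))
```

In this toy run the pattern `ABD` never occurs in training, so it is scored with the statistics pooled over its cluster. That is the sparse-data case the smoothing is meant to cover.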

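    Table of Contents
    Chapter 1 Introduction
      1.1 Research Motivation
      1.2 Research Background
      1.3 Research Objectives and Summary of Results
      1.4 Thesis Organization and Structure
    Chapter 2 Literature and Background Techniques
      2.1 Music Information Retrieval
        2.1.1 Main Melody Extraction
        2.1.2 Main Melody Normalization
        2.1.3 Similarity Measurement
      2.2 Repeating Pattern Extraction
      2.3 Hierarchical Clustering
      2.4 K-Nearest Neighbor (KNN)
      2.5 Prefix Tree
      2.6 Dynamic Time Warping
      2.7 Chapter Summary
    Chapter 3 Melodic Pattern Extraction
      3.1 Melody Extraction Workflow
      3.2 Note Encoding
      3.3 Building a Prefix Tree from Melodic Patterns
      3.4 Melodic Pattern Filtering
      3.5 Melodic Pattern Ranking
      3.6 Chapter Summary
    Chapter 4 Music Genre Identification Methods
      4.1 Correlation-Based Classification
      4.2 Cluster Similarity Computation
      4.3 Correlation Coefficient Computation
      4.4 Correlation Coefficients between Songs and Labels
      4.5 Genre Identification with a Neural Network
      4.6 Genre Identification with K-Nearest Neighbor
      4.7 Genre Identification with Correlation Aided by Clustering
      4.8 Experimental Results
      4.9 Effect of the DTW Warping Range
    Chapter 5 Selective Clustering
      5.1 Selective Clustering Mechanism
      5.2 Experimental Results
      5.3 Comparison with Other Approaches
    Chapter 6 Conclusion
    References

    List of Figures
    Figure 1.1 Changes in the types of information retrieval used by Internet users
    Figure 1.2 Flowchart of the research framework
    Figure 2.1 Main components of music information retrieval
    Figure 2.2 Semitones and whole tones on the piano keyboard
    Figure 2.3 Staff notation of an excerpt from 小蜜蜂
    Figure 2.4 Tree structure of hierarchical clustering
    Figure 2.5 Example of a prefix tree
    Figure 2.6 Illustration of dynamic time warping
    Figure 3.1 Flowchart for building the melodic pattern dictionary
    Figure 3.2 Note encoding example using the children's song 小毛驢
    Figure 3.3(A) Combinations from {AABCAAA} with melodic pattern length limited to five
    Figure 3.3(B) Prefix tree built from {AABCAAA}
    Figure 4.1 Correlation-based classification method
    Figure 4.2 Flowchart of melodic pattern clustering
    Figure 4.3 DTW computation example
    Figure 4.4 Flowchart of correlation coefficient computation
    Figure 4.5 Conceptual illustration of the correlation coefficient
    Figure 4.6 Correlation coefficient applied to music genres
    Figure 4.7 Correlation coefficients between songs and genre labels
    Figure 4.8 Distance between two clusters
    Figure 4.9 Tree structure with an adjustable threshold
    Figure 4.10 Architecture of neural-network genre classification
    Figure 4.11 Architecture of K-nearest-neighbor classification
    Figure 4.12 Architecture of classification based on correlation aided by clustering
    Figure 4.13 Experimental results on the effect of clustering on performance
    Figure 4.14 Warping range between two melodic patterns
    Figure 5.1 Architecture of correlation + clustering + conditional smoothing

    List of Tables
    Table 1.1 Feature types used and examples
    Table 2.1 Overview of the three main categories of MIR
    Table 2.1.1 Analysis of digital music structure types
    Table 4.6 Number of songs in each of the five genres and training proportion per genre
    Table 4.7 Preliminary experimental results for the three methods
    Table 4.8 Per-genre results with 7000 clusters and without clustering
    Table 4.9 DTW warping range experiment
    Table 5.1 Accuracy with different proportions for selective clustering
    Table 5.3 Musical features selected for MIDI music
    Table 5.4 Per-genre accuracy using statistical methods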
