簡易檢索 / 詳目顯示

研究生: 黃昇郁
Sheng-yu Huang
論文名稱: 洞簫聲合成系統之初步研究
An Initial Study on Vertical Bamboo Flute Sound Synthesis System
指導教授: 古鴻炎
Hung-Yan Gu
口試委員: 鄭士康
none
黃紹華
none
陳秋華
Chyou-hwa Chen
學位類別: 碩士
Master
系所名稱: 電資學院 - 資訊工程系
Department of Computer Science and Information Engineering
論文出版年: 2010
畢業學年度: 98
語文別: 中文
論文頁數: 57
中文關鍵詞: 洞簫合成
外文關鍵詞: VBF, Vertical Bamboo Flute
相關次數: 點閱:91下載:1
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報

本論文的研究目標是,以HNM信號模型為基礎,加強原始HNM模型未考慮到的地方,以便發展一個能夠作洞簫聲合成的方法。在HNM參數分析方面,我們首先研究簫聲的聲譜特性,接著研究基頻的偵測的方法及諧波参數的求取方法。在簫聲合成階段,為了提升合成之洞簫聲的自然度,我們提出一種基於ADSR的方法來作時間長度的伸長、縮短,此外為了模擬洞簫吹奏的技巧,我們也研究了轉音和抖音的處理。目前已完成一個線上的洞簫聲合成系統,當拿合成音檔去作聽測實驗,聽測之分數顯示所合成之簫聲只是略像真人吹奏之簫聲,所以仍有很大的改進空間。


The goal of this thesis is to develop a method that can synthesize vertical bamboo flute (VBF) sounds. Here, HNM (harmonic plus noise model) is based and enhanced for places that the original HNM model does not considered. In the analysis of HNM parameters, we study the spectrogram characteristics of VBF sounds first. Then, a method to estimate the fundmental frequently of a signal frame and a method to analyze the harmonic parameters are investigated. In the synthesis of VBF sounds, in order to improve the naturalness level of synthetic VBF sounds, we propose an ADSR(attack,decay,sustain,release) based method to lengthen or shorten a note’s duration. In addition, to simulate the skills for playing a VBF, we have studied methods for synthesizing the sound effects of pitch gliding and pitch and amplitude vibrato. Currently, an on-line system for synthesizing VBF sounds has been built. Synthetic sound files are then took to conduct listening tests. The results of the tests show that the synthetic VBF sounds are just slightly like the VBF sounds played by a person. Therefore, the space to be improved for synthetic VBF sounds is still large.

摘要 ABSTRACT 誌謝 目錄 圖索引…….. 第一章 導論 1.1研究動機 1.2文獻回顧 1.3研究方法 1.4章節簡介 第二章 洞簫聲分析 2.1 洞簫聲之聲譜(spectrogram)特性 2.2 HNM模型介紹 2.3 基頻偵測 2.4 諧波参數分析 2.5 雜訊参數分析 2.6 儲存格式 第三章 洞簫聲合成 3.1 樂譜分析 3.2 ADSR音長伸縮 3.3 音高未變之HNM参數 3.4轉音、抖音之基週軌跡計算 3.5 音高改變之HNM参數 3.6 振幅抖動控制 3.7 信號樣本產生 3.6.1 諧波信號合成 3.6.2 雜訊信號合成 第四章 系統實作與聽測實驗 4.1 分析階段 (1)基頻偵測: (2)諧波参數分析方式: 4.2 合成階段 (1)轉音: (2)音色: 4.3 合成音符實例 4.4 聽測實驗 第五章 結論 參考文獻

[1]網絡孔子學院,洞簫《紅豆曲》,http://music.chinese.cn/article/2009-12/15/content_26418.htm
[2]梁伯達, 洞簫音色之Hilbert-Huang Transform(HHT)分析,碩士論文,臺灣大學電信工程學研究所,台北(2007)。
[3]H. Valbret , E. Moulines and J.P. Tubach, “Voice transformation using PSOLA technique”, Acoustics, Speech, and Signal Processing, 1992. ICASSP-92, 1992 IEEE International Conference on Volume: 1 , 1992 , Page(s): 145 -148 vol.1
[4]T. Dan, B. Mukherjee, and A. Datta, “Temporal approach for synthesis of singing,” In Proceedings of the Stockholm Music Acoustics Conference,pp.282-287, 1993.
[5]S.G. Chen and G.J. Lin, “High Quality and Low Complexity Pitch Modification of Acoustic Signals,” Proceedings of the 1995 IEEE International Conference on Acoustic, Speech, and Signal Processing, May, Detroit, USA, 1995, p2987-2990.
[6]F. Charpentier and Moulines, “Pitch-synchronous Waveform Processing Technique for Text-to-Speech Synthesis Using Diphones,” European Conf. On Speech Communication and Technology, pp.13-19, Paris, 1989.
[7]G.S. Ying , L.H. Jamieson and C.D. Michell, “A probabilistic approach to AMDF pitch detection”, Spoken Language, 1996. ICSLP 96. Proceedings., Fourth International Conference on Volume: 2 , 1996 , Page(s): 1201-1204 vol.2
[8]M. Edgington and A. Lowry, “Residual-based speech modification algorithms for text-to-speech synthesis”, Spoken Language, 1996. ICSLP 96. Proceedings , Fourth International Conference on Volume: 3 , 1996 , Page(s): 1425 -1428 vol.3
[9]C. Hamon ,E. Mouline and F. Charpentier , “A diphone synthesis system based on time-domain prosodic modifications of speech”, Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on , 1989 , Page(s): 238 -241 vol.1
[10]L.Rabiner and B. Juang. “Fundamentals of speech recognition.” Prentice Hall, 1993, p97-117
[11]F. R. Moore, Elements of Computer Music, Prentice-Hall, Englewood Cliffs, NJ, 1990.
[12]Yannis Stylianou, Harmonic plus Noise Models for Speech, Combined with Statistical Methods, for Speech and Speaker Modification, Ph.D. thesis, Ecole Nationale Supèrieure des Télécommunications, Paris, France, 1996
[13]C. Dodge and T. A. Jerse, Computer Music: Synthesis, Composition, and Performance,second edition, Schirmer Books, New York, 1997.
[14]M. Russ, Sound Synthesis and Sampling, Boston: Focal Press, 1996.
[15]古鴻炎、廖皇量,「用於國語歌聲合成之諧波加噪音模型的改進研究」,WOCMAT 2006 國際電腦音樂與音訊技術研討會,台北,session 2 (音訊處理I), 2006。
[16]古鴻炎、張小芬、吳俊欣,「仿趙氏音高尺度之基週軌跡正規化方法及其應用」,第十六屆自然語言與語音處理研討會(ROCLING XVI),台北,第325-334 頁,2004。
[17]Kim, H. Y., et al., “Pitch Detection with Average Magnitude Difference Function Using Adaptive Threshold Algorithm for Estimating Shimmer and Jitter”, Proc. of the 20th Annual International Conference of the IEEE, Engineering in Medicine and Biology Society, Vol. 6, pp. 3162 -3164, 1998.
[18]R.C. Maher and J.W. Beauchamp, "Fundamental frequency estimation of musical signals using a two-way mismatch procedure", Journal of the Acoustic Society of America, Vol. 95, page 2254-2263, 1993.
[19]王小川,語音訊號處理(修訂二版),全華圖書公司,台北,2009。
[20]Quatieri, T. F., Discrete-Time Speech Signal Processing, Prentice-Hall, NJ, USA, 2002.
[21]周彥佐,基於HNM 之國語音節信號的合成方法,碩士論文,國立台灣科技大學資訊工程研究所,台北(2007)。
[22]詹詩涵,基於音高調節之歌聲合成系統,碩士論文,國立清華大學資訊系統與應用研究所,新竹(2006)。
[23]古鴻炎、林正甫,「國語歌聲抖音參數之分析」,WOCMAT 2007 國際電腦音樂與音訊技術研討會(新竹),Session III: Audio Signal Processing,2007。

QR CODE