Graduate student: 蔡哲彰 (Che-chang Tsai)
Thesis title: 國語合成歌聲流暢度改進之研究 (Fluency Improving for Mandarin Singing Voice Synthesis)
Advisor: 古鴻炎 (Hung-yan Gu)
Oral examination committee: 王新民 (Hsin-min Wang), Ming-shing Yu, Shi-jinn Horng, Bor-shen Lin
Degree: Master's
Department: College of Electrical Engineering and Computer Science - Department of Computer Science and Information Engineering
Year of publication: 2009
Academic year of graduation: 97
Language: Chinese
Number of pages: 72
Chinese keywords: 國語歌聲合成, 頻譜演進, 共振峰軌跡連接
English keywords: Mandarin singing voice synthesis, spectrum progression, formant trace connecting
This thesis aims to synthesize fluent singing voice from a small number of synthesis units. To connect formant traces smoothly across the boundary between adjacent syllables, we propose a spectrum interpolation method based on reflection coefficients. To improve fluency within a synthetic syllable, we adapt the concept of spectrum progression, originally proposed for speech synthesis, and construct a spectrum progression model suited to singing voice synthesis. Because both fluency-improving methods must be realized at the signal synthesis stage, we modify and correct an HNM (harmonic-plus-noise model) synthesis program developed by others. In addition, we train the ANN vibrato parameter models on a larger corpus to increase the naturalness of the synthetic singing voice. In listening tests, singing voice synthesized with the spectrum progression model and formant trace connection processing scored higher than singing voice synthesized without them.
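The reflection-coefficient interpolation mentioned in the abstract can be illustrated with a minimal sketch. This record does not describe the thesis's actual procedure, so everything here is an assumption made for illustration: the function names `interpolate_reflection` and `reflection_to_lpc`, the use of plain linear interpolation, the number of transition frames, and the sign convention A(z) = 1 + a1 z^-1 + ... for the step-up recursion. The sketch only shows why reflection coefficients are a convenient interpolation domain: if every coefficient of both boundary frames lies strictly between -1 and 1, so does every linearly interpolated value, and the resulting all-pole filter stays stable.

```python
def interpolate_reflection(k_left, k_right, num_frames):
    """Linearly interpolate reflection coefficients between two boundary frames.

    k_left / k_right: reflection coefficients of the frames on each side of a
    syllable boundary (hypothetical inputs). Returns num_frames intermediate
    coefficient vectors, excluding the two endpoints.
    """
    if len(k_left) != len(k_right):
        raise ValueError("both frames must have the same model order")
    frames = []
    for i in range(1, num_frames + 1):
        w = i / (num_frames + 1)  # interpolation weight, strictly between 0 and 1
        frames.append([(1.0 - w) * a + w * b for a, b in zip(k_left, k_right)])
    return frames


def reflection_to_lpc(k):
    """Step-up recursion: reflection coefficients -> polynomial [1, a1, ..., ap].

    Assumed convention: a_m[j] = a_{m-1}[j] + k_m * a_{m-1}[m - j].
    """
    a = [1.0]
    for km in k:
        prev = a + [0.0]          # pad the previous polynomial to order m
        m = len(prev) - 1
        a = [prev[j] + km * prev[m - j] for j in range(m + 1)]
    return a


if __name__ == "__main__":
    left = [0.8, -0.4]    # made-up coefficients, for illustration only
    right = [0.2, 0.4]
    for k in interpolate_reflection(left, right, 3):
        print(k, reflection_to_lpc(k))
```

Interpolating LPC polynomial coefficients directly gives no such stability guarantee, which is one common reason to interpolate in the reflection-coefficient domain instead.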

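Abstract (Chinese)
Abstract (English)
Acknowledgments
Table of Contents
List of Figures
List of Tables
Chapter 1 Introduction
  1.1 Research Motivation
  1.2 Literature Review
  1.3 Research Method
  1.4 Thesis Organization
Chapter 2 Analysis of Spectrum Progression Paths and Vibrato Parameters
  2.1 Corpus Preparation
  2.2 Introduction to Spectrum Progression
  2.3 DTW-Based Spectrum Progression Path Analysis
  2.4 Program for Obtaining Spectrum Progression Paths
  2.5 Program for Obtaining Vibrato Parameters
Chapter 3 Artificial Neural Network Models
  3.1 Introduction to Artificial Neural Networks
  3.2 Artificial Neural Network Structure
  3.3 Input and Output Parameters of the Artificial Neural Networks
  3.4 Experiments on the Number of Units
  3.5 Comparison of Training Errors of the MLP Models
Chapter 4 Formant Trace Connection Processing
  4.1 Spectrum Connection at Syllable Boundaries
  4.2 Spectral Envelope Estimation
  4.3 Conversion Between Spectral Envelopes and Reflection Coefficients
  4.4 Interpolation Effects of Different Coefficients
  4.5 Reflection Coefficient Interpolation Experiments
Chapter 5 System Implementation and Listening Tests
  5.1 Time-Axis Mapping for Spectrum Progression
  5.2 Connection Processing of Formant Traces
  5.3 MVF in HNM Analysis
  5.4 Parameters for HNM Synthesis
  5.5 System Interface
  5.6 Listening Tests
Chapter 6 Conclusion
References
About the Author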

