簡易檢索 / 詳目顯示

研究生: 華堃
Kun - Hua
論文名稱: 歌唱聲以及樂器聲合成改進之研究
Singing Voice with Instruments Synthesis System
指導教授: 洪西進
Shi-Jinn Horng
口試委員: 古鴻炎
none
鍾國亮
none
蘇民揚
none
高宗萬
none
學位類別: 碩士
Master
系所名稱: 電資學院 - 資訊工程系
Department of Computer Science and Information Engineering
論文出版年: 2011
畢業學年度: 99
語文別: 英文
論文頁數: 45
中文關鍵詞: ADSR模型快速傅立葉轉換加法合成線性預估法編碼
外文關鍵詞: ADSR model, Fast Fourier Transform, additive-synthesis, Linear Predictive Coding
相關次數: 點閱:251下載:1
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報

這篇PAPER實現了讓歌唱聲可以隨著MIDI樂器來歌唱,要達到這個目的,我們必須要去合成人聲以及樂器聲。我們在人聲部分是用加法合成,並用傅立葉轉換從時域轉換到頻域,為了找出更精確的值,我們使用Cubic-Hermite-Spline內插法來實做。在合成樂器聲方面,我們也是用加法合成來合出樂器聲,至於弦樂的部分,我們採用LPC法。最後我們將兩個聲音相加存入WAVE檔,就完成了人聲配合樂器聲歌唱。


This paper implements an application model enabling people to enjoy
music by filling lyrics and choosing corresponding MIDI. To implement
this interesting application model, there are several methods which have
been proposed.
There are two parts — voice synthesis and instruments
synthesis — in this thesis. The former uses additive-synthesis method
based on sinusoidal model to implement our application model with the
purpose to generate a natural and smooth sound. In the analysis stage, we
use FFT (Fast Fourier Transform) to get spectrum by computing each two
adjacent frame respectively, and utilize to get more
accurate frequency, amplitude and phase. In the synthesis stage, we apply
ADSR model to calculate the number of frames of synthesized sound, and
then adjust the pitch curve of synthesized sound, finally, utilize
Cubic-Hermite-Spline to get amplitude of synthesized sound, again. In
the instrument synthesis part, it also adopts additive-synthesis method and
uses this method to build a model to play instruments. It uses LPC
(Linear Predictive Coding) to get better sound when some instruments
have to exert strength on them continuously like sax and violin.
In order to evaluate the synthesized sound produced by our system,
we have conducted experiments on real-time listening evaluation, and the
result is compared to those of previous existing systems. Our
experimental results show that the method implemented in this thesis can
improve both naturalness and smoothness of the synthesized sound.

I. Introduction II. Related Works III. Synthesis Based on Sinusoidal Model IV. Implementation of Singing Voice Synthesis V. Synthesis of instrument VI. Experiments and Result Discussions VII. Conclusion

[1] Birkholz, P., ─Articulatory Synthesis of Singing,∥ Singing Synthesis
Challenge 2007 at the Interspeech‘07, Antwerp, pp.4001-4004
[2] Cambell, N., ─Conversational Speech Synthesis and Need for Some
Laughter,∥ IEEE Transactions on Audio, Speech, and Language
Processing, VOL. 14, NO. 4, July 2006, pp.1171-1178
[3] Cubic Hermite Spline,
http://en.wikipedia.org/wiki/Cubic_Hermite_spline
[4] Equal temperament, http://en.wikipedia.org/wiki/Equal_temperament
[5] Hermite Curve Interpolation, http://cubic.org/docs/hermite.htm
[6] Kim, Y. E., ─A Framework for Parametric Singing Voice
Analysis/Synthesis,∥ 2003 IEEE Workshop on Applications of Signal
Processing to Audio and Acoustics, October 19-22, 2003, New Paltz,
NY, pp.123-126
[7] Klatt, D. H. and L. C. Klatt, ─Analysis, Synthesis, and Perception of
Voice Quality Variations among Female and Male Talkers,∥ J. Acoust.
Soc. Am., Vol. 87, No. 2, February 1990, pp.820-857
[8] Lai, Wen-Hsing, ─F0 Control Model for Mandarin Singing Voice
Synthesis,∥ Second International Conference on Digital
Telecommunications (ICDT‗07)
[9] Lee, M. E. and M. J. T. Smith, ─Digital Singing Voice Synthesis
Using a New Alternating Reflection Model,∥ Circuits and Systems,
2002. ISCAS 2002. IEEE International Symposium on, pp.863-866
[10]Macon, M. W., L. Jensen-Link, J. Oliverio, M. A. Clements, and E. B.
George, ─A Singing Voice Synthesis System Based on Sinusoidal
39
Modeling,∥ Acoustics, Speech, and Signal Processing,
1997.ICASSP-97., 1997 IEEE International Conference on,
pp.435-438
[11]McClellan, J. H., R. W. Schafer, and M. A. Yoder, ─Signal Processing
First,∥ Pearson Prentice Hall, 2003
[12]Meron, Y. and K. Hirose, ─Synthesis of Vibrato Singing,∥ Acoustics,
Speech, and Signal Processing, 2000. ICASSP ‘00. Proceedings.
2000 IEEE International Conference on, pp.745-748
[13]Nordstrom, K. I., G. A. Rutledge, and P. F. Driessen, ─Using Voice
Conversion as a Paradigm for Analyzing Breathy Singing Voices,∥
Communications, Computers and Signal Processing, 2005. PACRIM.
2005 IEEE Pacific Rim Conference on, pp.428-431
[14]O‘Shaughnessy, D., ─Speech Communications: Human and Machine
2nd edition,∥ IEEE Press, 2000
[15]Saitou, T., M. Goto, M. Unoki, and M. Akagi, ─Speech-to-Singing
Synthesis:Converting Speaking Voices to Singing Voices by
Controlling Acoustic Features Unique to Singing Voices,∥ IEEE
Workshop on Applications of Signal Processing to Audio and
Acoustics, October 21-24, 2007, New Paltz, NY, pp.215-218
[16]Siivola, V., ─A Survey of Methods for the Synthesis of the Singing
Voice,∥ Presentation for the course S-89.155, Sound Synthesis,
November 19, 2002
[17]Stylianou, Y., ─A Simple and Fast Way of Generating a Harmonic
Signal,∥ IEEE Signal Processing Letters, VOL. 7, NO. 5, May 2000,
pp.111-113
[18]Zen, H., T. Nose, J. Yamagishi, S. Sako, T. Masuko, A.W. Black, and
K. Tokuda, ─The HMM-based speech synthesis system version 2.0,∥
Proc. of ISCA SSW6, Bonn, Germany, Aug. 2007, pp.294-299
[19]王小川,“語音信號處理”,全華科技圖書股份有限公司,台北,
2009
[20]詹朋翰,“基於FPGA 之可變長度快速傅立葉轉換處理器設
計”,碩士論文,國立臺灣科技大學電子工程研究所,2005
[21]盛思豪,“即時歌唱聲合成系統與音樂合成系統之整合”,碩士
論文,國立臺灣科技大學電機工程研究所,2002
[22]陳安璿,“整合MIDI 伴奏之歌唱聲合成系統”,碩士論文,國
立臺灣科技大學資訊工程研究所,2004
[23]詹詩涵,“基於音高調節之歌聲合成系統”,碩士論文,國立清
華大學資訊系統與應用研究所,2006
[24]廖皇量,“國語歌聲合成信號品質改進之研究”,碩士論文,國
立臺灣科技大學資訊工程研究所,2006
[25]林正甫,“使用ANN 抖音參數模型之國語歌聲合成”,碩士論
文,國立臺灣科技大學資訊工程研究所,2008
[26]呂元傑, ─以加法合成為基礎的寅月合成之研究∥ ,碩士論文,
國立台灣科技大學電機工程研究所,1999
[27]JFugue http://www.jfugue.org/

QR CODE