簡易檢索 / 詳目顯示

研究生: 莊世昌
Shih-chang Chuang
論文名稱: 用於網路電話之多頻帶聲學迴音消除研究
A Study on Multiband Acoustic-Echo-Cancellation for Voice over Internet Protocol
指導教授: 古鴻炎
Hung-yan Gu
口試委員: 王新民
Hsin-min Wang
馮輝文
Huei-wen Ferng
學位類別: 碩士
Master
系所名稱: 電資學院 - 資訊工程系
Department of Computer Science and Information Engineering
論文出版年: 2014
畢業學年度: 102
語文別: 中文
論文頁數: 75
中文關鍵詞: 網路電話迴音消除雜訊刪減多速率信號處理非均勻子頻帶自適應濾波均勻子頻帶自適應濾波
外文關鍵詞: voice over IP, multirate signal processing, nonuniform subband adaptive filter, uniform subband adaptive filter
相關次數: 點閱:285下載:3
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 本論文研究多頻帶聲學迴音消除的方法,並且製作成軟體模組,然後把先人的雜訊刪減模組及我們的迴音消除模組整合到Linphone網路電話軟體裡,用以提升網路電話的語音品質。關於多頻帶迴音消除的方法,我們基於多速率信號處理的理論,研究了樹狀式分級串連之非均勻頻帶的濾波方法與均勻頻帶之DFT濾波方法,然後拿這二個方法和正規化最小均方(NLMS)準則之單頻帶濾波方法作效能比較。依據實驗量測到的誤差收斂曲線與主觀聽測的平均評分,顯示我們研究的多頻帶聲學迴音消除方法比起NLSM單頻帶濾波方法,讓迴音衰減得更多一些,而使網路電話語音品質獲得進一步的提升。


    In this thesis, the problem of acoustic echo cancellation is studied, and multiband based methods are adopted and programmed into software modules. Then, the noise reduction module developed by others and the echo cancellation modules by us are integrated into the internet telephony software package, Linphone, in order to improve the voice quality over IP network. As to multiband based acoustic echo cancellation, we first review the theory of multirate signal processing. Then, we study tree structured nonuniform subband adaptive filter and uniform DFT subband adaptive filter. Next, we compare the two filtering methods with normalized least mean square (NLMS) single-band filtering method in their performances. According to the error convergence curves measured and the average scores obtained from subjective perception tests, one of the multiband based acoustic echo cancellation methods is better than NLMS method. Thus, the quality of transmitted voice can be increased.

    摘要 IV 誌謝 VI 目錄 i 圖表索引 iii 第1章 緒論 1 1.1 研究動機與目的 1 1.2 文獻回顧 3 1.3 研究方法 6 1.4 論文架構 8 第2章 噪音刪減 9 2.1 語音活動偵測 9 2.2 改良式頻譜刪減法 10 2.2.1 非線性頻譜刪減法 10 2.2.2 溫尼濾波器 13 2.2.3 結合非線性頻譜刪減加上溫尼濾波器 15 2.3 雜訊刪減實驗 16 第3章 聲學迴音消除 19 3.1 迴音消除架構 19 3.2 雙向通話偵測 20 3.3 自適應演算法 21 3.4 非線性處理 23 第4章 自適應濾波器組 24 4.1 多速率信號處理簡介 24 4.1.1 取樣頻率轉換元件 24 4.1.2 完美重建之兩通道正交鏡像濾波組 31 4.1.3 樹狀式頻帶分解之完美重建正交鏡像濾波組 32 4.1.4 M通道均勻頻帶濾波器組 33 4.2 子頻帶自適應濾波器組 34 4.3 自適應濾波器之效能比較 40 4.4 自行實驗之效能比較 43 第5章 系統整合及實驗 49 5.1 系統實作 49 5.2 固定式話機之迴音消除實驗 54 5.3 移動式話機之迴音消除實驗 60 5.4 聽測實驗 66 第6章 結論 69 參考文獻 72 圖1.1聲訊傳播示意圖[17] 1 圖1.2網路電話之系統區塊 7 圖2.1語音加入白雜訊之波形 17 圖2.2頻譜亂度曲線 17 圖2.3語音訊號經改良式頻譜刪減後語音之波形 17 圖2.4語音加工廠噪訊信號圖 17 圖2.5頻譜亂度圖 18 圖2.6語音訊號經改良式頻譜刪減圖後語音圖 18 圖3.1迴音消除之架構 19 圖4.1降取樣器[39] 25 圖4.2加上濾波器之降取樣器[39] 25 圖4.3升取樣器[39] 26 圖4.4原始信號頻譜 26 圖4.5兩倍升取樣後產生鏡像示意 26 圖4.6加上濾波器之升取樣器[39] 27 圖4.7第一種Noble恆等式[39] 27 圖4.8第四種Noble 恆等式[39] 27 圖4.9第一型多相分解用於降取樣處理[39] 30 圖4.10第二型多相分解用於升取樣處理[39] 30 圖4.11雙通道濾波器組[39] 31 圖4.12四通道非均勻頻帶濾波器組[39] 33 圖4.13 DFT濾波器組中各濾波器頻率響應的頻率平移特性[39] 34 圖4.14子頻帶自適應濾波器組之架構[38] 35 圖4.15四通道非均勻頻帶分析濾波器 [39] 37 圖4.16四通道非均勻頻帶分析濾波器之頻譜響應 37 圖4.17四通道非均勻頻帶合成濾波器[39] 37 圖4.18四通道非均勻頻帶合成濾波器之頻譜響應 38 圖4.19均勻頻帶分析濾波器 38 圖4.20四通道均勻頻帶分析濾波器之頻譜響應[38] 39 圖4.21均勻頻帶合成濾波器 39 圖4.22四通道均勻頻帶合成濾波器之頻譜響應[38] 39 圖4.23房間的脈衝響應波形[38] 45 圖4.24自適應濾波器之誤差收斂曲線 47 圖4.25自適應濾波器之誤差收斂曲線 48 圖5.1 Linphone軟體之架構 49 圖5.2 Linphone之聲訊處理程序 50 圖5.3 speex資料結構 51 圖5.4噪音刪減和聲學迴音消除的主流程 52 圖5.5 Linphone播話畫面 53 圖5.6 Linphone通話畫面 53 圖5.7遠端錄音及傳送之信號波形 56 圖5.8近端播放後再錄音之信號波形 56 圖5.9近端錄音信號經噪音刪減後之波形 57 圖5.10經正規化最小均方法作迴音消除後之波形 57 圖5.11經均勻子頻帶自適應濾波作迴音消除後之波形(降取樣數4) 58 圖5.12經均勻子頻帶自適應濾波作迴音消除後之波形(降取樣數1) 58 圖5.13經正規化最小均方法作迴音消除之MSE 59 圖5.14經均勻子頻帶自適應濾波作迴音消除之MSE(降取樣數為4) 59 圖5.15經均勻子頻帶自適應濾波作迴音消除之MSE(降取樣數為1) 60 圖5.16遠端錄音信號之波形 62 圖5.17近端播放後再錄音之信號波形 62 圖5.18近端錄音信號經噪音刪減後之波形 63 圖5.19經正規化最小均方法作迴音消除後之波形 63 圖5.20經均勻子頻帶自適應濾波作迴音消除後之波形(降取樣數4) 64 圖5.21經均勻子頻帶自適應濾波作迴音消除後之波形(降取樣數1) 64 圖5.22經正規化最小均方法作迴音消除之MSE 65 圖5.23經均勻子頻帶自適應濾波作迴音消除之MSE(降取樣數為4) 65 圖5.24經均勻子頻帶自適應濾波作迴音消除之MSE(降取樣數為1) 66 表4.1正規化最小均方法之計算複雜度[38] 41 表4.2均勻子頻帶自適應濾波之計算複雜度[38] 42 表4.3自適應濾波之計算複雜度比較 42 表5.1 固定式話機之迴音消除聽測評分 68 表5.2移動式話機之迴音消除聽測評分 68

    [1] A. Varga and H. J. Steeneken, "Assessment for automatic speech recognition: Ii.noisex-92: A database and an experiment to study the effect of additive noise on speech recognition systems," Speech Communication, Vol. 12, No. 3, pp. 247–251, 1993.
    [2] C. Breining , P. Dreiscitel , E. Hansler, A. Mader , et al, "Acoustic Echo Control, " IEEE Signal Processing Magazine, Vol. 16, pp. 42-69, 1999.
    [3] C. E. Shannon, "A mathematical theory of communication," Bell System Technical Journal, Vol. 27, pp. 379-423 and 623-656, 1984.
    [4] C. Gritton, D. Lin, "Echo cancellation algorithms," IEEE ASSP Magazine, Vol. 1, pp. 30-38, 1984.
    [5] C. K. Chui, Wavelets: A Tutorial in Theory and Applications, Boston, Academic Press, San Diego, New York ,1992.
    [6] D. L. Duttweiler, "A twelve-channel digital echo canceler," IEEE Trans. Commun., Vol. COM-26, pp. 647-653, May 1978.
    ELECTRONICS LETTERS 21st, Vol. 26, No.13, Jun. 1990.
    [7] G. Oetken, T. W. Parks and H. W. Schussler, "New results in the design of digital interpolators," IEEE Trans. Acoustics, Speech, and Signal Processing. Vol. ASSP-23, No. 3, pp.301-309, Jun. 1975.
    [8] G. Zoia, A. Sturzenegger, and O. Hochreutiner, "Audio Quality and Acoustic Echo Issues for VOIP on Portable Devices," IEEE Int. Conf. on Portable Information Devices, Orlando, FL, pp. 1-5, 2007.
    [9] H. J. Nussbaumer, "Pseudo QMF filter bank," IBM Tech. Disclosure Bull. 24, pp. 3081-3087, Nov. 1981.
    [10] H. S. Malvar, "Modulated QMF filter banks with perfect reconstruction," ELECTRONICS LETTERS 21st, Vol. 26, No.13, Jun. 1990.
    [11] I. Abdallah, S. Montresor and M. Baudry, "Speech signal detection in noisy environment using a local entropic criterion," in Eurospeech, Rhodes, Greece, Sep. 1997.
    [12] J. D. Johnston, "A filter family designed for use in quadrature mirror filter banks," IEEE International Conf. on Acoustics, Speech, and Signal Processing, Denver, CO, 1980.
    [13] J. S. Soo and K. K. Pang, "Multidelay block frequency domain adaptive filter," IEEE Trans. Acoustics, Speech, Signal Processing, Vol. 38, No. 2, pp. 373–376, Feb. 1990.
    [14] K. Lee, W. Gan, S.M. Kuo, Subband adaptive filtering:Theory and Implementation, John Wiley and Sons, United Kingdom, 2009.
    [15] L. Milic, Multirate Filtering for Digital Signal Processing: MATLAB Applications, Information Science Reference, 2009.
    [16] Linphone,http://www.linphone.org/。
    [17] M. J. T. Smith and T. P. Barnwell, "Exact reconstruction techniques for tree-structured Subband coders," IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. 34, pp. 434-441, 1986.
    [18] M. M. Sondhi, "An adaptive echo canceler," Bell System Techn.J, Vol. XLVI, No. 3, pp. 497-510, 1967.
    [19] P. N. Heller, T. Karp and T. Q. Nguyen, "A general formulation of modulated filter banks," IEEE Trans. on Signal Processing, Vol. 47, Issue 4, pp. 986-1002, Apr. 1999.
    [20] P. P. Vaidyanathan, "On power-complementary FIR filters," IEEE Trans. Circuits Syst., Vol. CAS-32, pp. 1308 -1310, 1985.
    [21] P. P. Vaidyanathan, "Theory and design of M-channel maximally decimated quadrature mirror filters with arbitrary M, having perfect reconstruction property," IEEE Trans. on Acoustics, Speech, and Signal Processing, Vol. 35, No. 4, pp. 476-492, Apr. 1987.
    [22] P. P. Vaidyanathan and P. Q. Hoang, "Lattice structures for optimal design and robust implementation of two-channel perfect-reconstruction QMF banks," IEEE Trans. Acoustics, Speech, and Signal Processing, Vol. 36, No. 1, pp. 81-94, Jan. 1988.
    [23] P. P. Vaidyanathan, Multirate Systems and Filter Banks, Englewood Cliffs, New Jersey: Prentice Hall, 1993.
    [24] R. D. Koilpillai and P. P. Vaidyanathan, "Cosine Modulated FIR Filter Banks Satisfying Perfect Reconstruction," IEEE Trans. Signal Processing, Vol. 40, No. 4, pp. 770-783, Apr. 1992.
    [25] R. Meyer and C. Burrus, "A unified analysis of multirate and periodically time-varying digital filters," IEEE Trans., Circuits and Systems, Vol. 22, pp. 162-168, 1975.
    [26] R. Storn, "Echo Cancellation Techniques for Multimedia Applications – a Survey," International Computer Science Institute, Berkeley, Nov. 1996.
    [27] R. W. Schafer and L R. Rabiner, "A digital signal processing approach to interpolation," Proc. IEEE, Vol. 61, No. 6, pp. 692-702, Jun. 1973.
    [28] S. F. Boll, "Suppression of Acoustic Noise in Speech Using Spectral Subtraction," IEEE Trans. Acoustics, Speech, Signal Processing, Vol. 27, No. 2, pp. 113-120, 1979.
    [29] SPEEX,http://www.speex.org。
    [30] Y. Lu, R. Fowler, W. Tian, and L. Thompson, "Enhancing echo cancellation via estimation of delay," IEEE Trans. on Signal Processing, Vol. 53, No. 11, Nov. 2005.
    [31] 王小川,語音訊號處理,全華出版社,2004。
    [32] 朱楠群,用於網路電話之雜訊刪減與迴音消除之研究,台灣科技大學,資訊工程研究所,2008。
    [33] 洪維廷、陳弘啟、陳軍廷,"麥克風陣列技術應用於語音加強及語音辨識之研究",全國電信研討會,2008。
    [34] 郭振甫,應用數位訊號處理器於聲學迴音消除,國立成功大學航空太空工程研究所,2002。
    [35] 陳慶森,應用聯合迴音與噪音刪減架構於免持設備之研究,逢甲大學資訊電機工程碩士在職專班,2008。
    [36] 陶然、張惠云、王越,多抽樣率數字號處理理論與其應用,清華大學出版社,2007。
    [37] 葉國楨,免持式語音通訊之聲學迴音與雜訊消除,大葉大學,通訊工程研究所,2009。
    [38] 鄭聖穎,聯合聲學迴音消除與雙向通話偵測演算法及其於定點式 DSP之實現,國立雲林科技大學電子工程研究所,2005。
    [39] 賴辰瑋,強健性語音辨識之研究:語音前端端點偵測與語音強化法,國立暨南國際大學電機工程研究所,2004。

    QR CODE