簡易檢索 / 詳目顯示

研究生: 張添旻
Tien-Min Chang
論文名稱: 可調適高處理速度之奇異值分解處理器電路架構設計
The VLSI Architecture Design of a Configurable and High-Throughput Singular Value Decomposition Processor
指導教授: 沈中安
Chung-An Shen
口試委員: 沈中安
Chung-An Shen
黃琴雅
Chin-Ya Huang
吳晉賢
Chin-Hsien Wu
林昌鴻
Chang-Hong Lin
學位類別: 碩士
Master
系所名稱: 電資學院 - 電子工程系
Department of Electronic and Computer Engineering
論文出版年: 2023
畢業學年度: 111
語文別: 英文
論文頁數: 44
中文關鍵詞: 矩陣分解高處理速度奇異值分解
外文關鍵詞: Matrix Decomposition, High Throughput, Singular Value Decomposition
相關次數: 點閱:220下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報
  • 奇異值分解在各種訊號處理的應用領域中扮演著非常重要的角色,因此多年來諸多學者提出各種高效能的演算法與電路架構進行奇異值分解運算。然而,絕大多數文獻中提出的奇異質分解演算法或電路架構皆假設矩陣為一方陣。由於許多應用都需要對任意大小矩陣進行奇異值分解運算,假設矩陣為一方陣極大限制了這些運算方法的實用性。本論文提出了一種高處理速度高硬體利用率之奇異值分解處理器,並且能夠對任意大小矩陣進行奇異值分解運算,以滿足對任意大小矩陣分解的需求。具體而言,本論文提出了一種新的奇異質分解演算法,以降低傳統演算法的迭代次數,使得處理速度得到顯著提升。同時,本論文設計的電路架構是基於分時多工的運算流程方法,以達到提升了計算單元的硬體使用率,更進一步提升處理速度與效率。本文所提出之奇異值分解處理器是基於積體電路設計流程並採用CMOS 90 奈米製程來實現,其效能評估採用自動化布局與繞線後的結果。實驗結果表示,我的設計的電路複雜度以NAND 邏輯閘來估計約為210.64k 個,而處理效能達到每秒15384.6k 個矩陣運算的處理速度。相較於先前文獻的設計,本文所提出的奇異值分解處理器在方正矩陣的情況下,效能提升了約25.3%,而在非方正矩陣的情況下,效能提升了約33.1%。


    Singular value decomposition (SVD) plays an essential role in the field of signal processing applications. Therefore, many scholars over the years propose various high-efficiency algorithms and circuit architectures for singular value decomposition operations. However, most of the literature proposes both the singular value decomposition algorithm and the circuit structure for a square matrix. Since many applications require singular value decomposition operation of the matrix with an arbitrary size, assuming that the matrix is square greatly limits the practicability of these approaches. This thesis presents a singular value decomposition processor with high throughput and high hardware utilization. Furthermore, the design proposed in this work can decompose matrices of any size and meet the needs of arbitrary matrix decomposition. Specifically, a novel algorithm is presented in this thesis where the number of iterations is greatly reduced so that the processing throughput can be significantly enhanced. Moreover, the utilization rate of the computing hardware units is improved by means of the carefully designed processing flow. This further improves the throughput rate and efficiency. The proposed SVD processor is designed and implemented using the design process of application-specific integrated circuit (ASIC) with CMOS 90nm technology. The performance is estimated by the results of automatic placement and routing. The experimental results demonstrate that the complexity of the proposed design is 210.64k represented by logic NAND gates. It is also shown in this thesis that the maximum throughput is up to 15384.6k matrices per second. Compared with the previous literature on singular value decomposition, the efficiency of square matrices and non-square matrices of SVD is higher about 25.3% and 33.1%.

    Abstract in Chinese iii Abstract in English iv Contents v List of Figures vii List of Tables viii List of Algorithms ix 1 Introduction 1 2 Background and Related Work 4 2.1 The Concept of SVD 4 2.2 The Two-Sided Jacobi SVD Algorithm 4 2.3 The CORDIC Algorithm 6 2.4 The Applications of SVD 8 2.4.1 The SVD-Based Precoder in MIMO System Model 8 2.4.2 The Independent Component Analysis 9 2.5 The Related Work of SVD Unit 10 3 The Proposed SVD Processor 12 3.1 Analysis of Related Work 12 3.1.1 The Review of Algorithms 12 3.1.2 The Review of Architectures 14 3.1.3 The Timing Analysis 15 3.2 The Proposed Algorithm 17 3.3 The Proposed Architecture 19 3.4 The Proposed Processing Flow 22 3.5 The Analysis for Configurability 24 4 Experiment Results 27 4.1 Implementation Results 27 4.2 Comparisons with Prior Designs 28 5 Conclusion 31 References 32

    [1] C.-Z. Zhan, Y.-L. Chen, and A.-Y. Wu, “Iterative superlinear-convergence svd beamforming algorithm
    and vlsi architecture for mimo-ofdm systems,” IEEE Transactions on Signal Processing, vol. 60, no. 6,
    pp. 3264–3277, 2012.
    [2] Y.-T. Hwang, K.-T. Chen, and C.-K. Wu, “A high throughput unified svd/qrd precoder design for
    mimo ofdm systems,” in 2015 IEEE International Conference on Digital Signal Processing (DSP),
    pp. 1148–1151, 2015.
    [3] D. Guenther, R. Leupers, and G. Ascheid, “A scalable, multimode svd precoding asic based on the
    cyclic jacobi method,” IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 63, no. 8,
    pp. 1283–1294, 2016.
    [4] C.-H. Wu, C.-Y. Liu, and P.-Y. Tsai, “Design of an svd engine for 8× 8 mimo precoding systems,” in
    2017 IEEE International Symposium on Circuits and Systems (ISCAS), pp. 1–4, IEEE.
    [5] P.-Y. Tsai, Y. Chang, and J.-L. Li, “Fast-convergence singular value decomposition for tracking timevarying
    channels in massive mimo systems,” in 2018 IEEE International Conference on Acoustics,
    Speech and Signal Processing (ICASSP), pp. 1085–1089, IEEE.
    [6] D. Zhang, P. Pan, R. You, and H. Wang, “Svd-based low-complexity hybrid precoding for millimeterwave
    mimo systems,” IEEE Communications Letters, vol. 22, no. 10, pp. 2176–2179, 2018.
    [7] C.-H. Wu and P.-Y. Tsai, “An svd processor based on golub–reinsch algorithm for mimo precoding
    with adjustable precision,” IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 66,
    no. 7, pp. 2572–2583, 2019.
    [8] W.-J. Chen, Y.-A. Lai, and C.-A. Shen, “The vlsi architecture and implementation of a low complexity
    and highly efficient configurable svd processor for mimo communication systems,” Circuits, Systems,
    and Signal Processing, vol. 39, no. 12, pp. 6231–6246, 2020.
    [9] K.-J. Huang, W.-Y. Shih, J.-C. Liao, and W.-C. Fang, “A vlsi design of singular value decomposition
    processor used in real-time ica computation for multi-channel eeg system,” in 2013 IEEE International
    Symposium on Circuits and Systems (ISCAS), pp. 413–416, IEEE.
    [10] W.-C. Fang, J.-C. Chang, K.-J. Huang, C.-W. Feng, and C.-C. Chou, “An efficient vlsi implementation
    of svd processor of on-line recursive ica for real-time eeg system,” in 2014 IEEE Biomedical Circuits
    and Systems Conference (BioCAS) Proceedings, pp. 73–76, IEEE.
    [11] A. M. Judith, S. B. Priya, and R. K. Mahendran, “Artifact removal from eeg signals using regenerative
    multi-dimensional singular value decomposition and independent component analysis,” Biomedical
    Signal Processing and Control, vol. 74, p. 103452, 2022.
    [12] K. Mishra, S. K. Singh, and P. Nagabhushan, “An improved svd based image compression,” in 2018
    Conference on Information and Communication Technology (CICT), pp. 1–5, IEEE.
    [13] F. Yeganegi, V. Hassanzade, and S. Ahadi, “Comparative performance evaluation of svd-based image
    compression,” in Electrical Engineering (ICEE), Iranian Conference on, pp. 464–469, IEEE, 2018.
    [14] J. C. S. de Souza, T. M. L. Assis, and B. C. Pal, “Data compression in smart distribution systems via
    singular value decomposition,” IEEE Transactions on Smart Grid, vol. 8, no. 1, pp. 275–284, 2015.
    [15] S. Padhy, L. Sharma, and S. Dandapat, “Multilead ecg data compression using svd in multiresolution
    domain,” Biomedical signal processing and control, vol. 23, pp. 10–18, 2016.
    [16] S. K. Mukhopadhyay, M. O. Ahmad, and M. Swamy, “Svd and ascii character encoding-based compression
    of multiple biosignals for remote healthcare systems,” IEEE Transactions on Biomedical
    Circuits and Systems, vol. 12, no. 1, pp. 137–150, 2018.
    [17] Q. Guo, C. Zhang, Y. Zhang, and H. Liu, “An efficient svd-based method for image denoising,” IEEE
    transactions on Circuits and Systems for Video Technology, vol. 26, no. 5, pp. 868–880, 2015.
    [18] S. Li, W. Ye, H. Liang, X. Pan, X. Lou, and X. Zhao, “K-svd based denoising algorithm for dofp
    polarization image sensors,” in 2018 IEEE International Symposium on Circuits and Systems (ISCAS),
    pp. 1–5, IEEE.
    [19] T. Bhuyan, V. K. Srivastava, and F. Thakkar, “Shuffled svd based robust and secure digital image
    watermarking,” in 2016 International Conference on Electrical, Electronics, and Optimization Techniques
    (ICEEOT), pp. 1229–1233, IEEE.
    [20] C. A. Sari, E. H. Rachmawanto, and D. R. I. M. Setiadi, “Robust and imperceptible image watermarking
    by dc coefficients using singular value decomposition,” in 2017 4th International Conference on
    Electrical Engineering, Computer Science and Informatics (EECSI), pp. 1–5, 2017.
    [21] Y. He and Y. Hu, “A proposed digital image watermarking based on dwt-dct-svd,” in 2018 2nd IEEE
    Advanced Information Management, Communicates, Electronic and Automation Control Conference
    (IMCEC), pp. 1214–1218, IEEE.
    [22] S. B. B. Ahmadi, G. Zhang, and S. Wei, “Robust and hybrid svd-based image watermarking schemes,”
    Multimedia Tools and Applications, vol. 79, no. 1, pp. 1075–1117, 2020.
    [23] H.-H. Lu, C.-A. Shen, M. E. Fouda, and A. M. Eltawil, “Configurable independent component analysis
    preprocessing accelerator,” IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2022.
    [24] S. M. R. Shahshahani and H. R. Mahdiani, “A high-performance scalable shared-memory svd processor
    architecture based on jacobi algorithm and batcher’s sorting network,” IEEE Transactions on
    Circuits and Systems I: Regular Papers, vol. 67, no. 6, pp. 1912–1924, 2020.
    [25] J. E. Volder, “The cordic trigonometric computing technique,” IRE Transactions on electronic computers,
    no. 3, pp. 330–334, 1959.
    [26] P. K. Meher, J. Valls, T.-B. Juang, K. Sridharan, and K. Maharatna, “50 years of cordic: Algorithms, architectures,
    and applications,” IEEE Transactions on Circuits and Systems I: Regular Papers, vol. 56,
    no. 9, pp. 1893–1907, 2009.
    34

    無法下載圖示 全文公開日期 2025/01/17 (校內網路)
    全文公開日期 2025/01/17 (校外網路)
    全文公開日期 2025/01/17 (國家圖書館:臺灣博碩士論文系統)
    QR CODE