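Graduate student: 鄭重文 (Chung-wen Cheng)
Thesis title: CUDA平行處理與OFDM上的應用 (OFDM Application using CUDA Parallel Processing)
Advisor: 林昌鴻 (Chang-hong Lin)
Oral examination committee: 阮聖彰 (Shanq-jang Ruan), 王煥宗 (Huan-chun Wang)
Degree: Master
Department: 電資學院 - 電子工程系 (Department of Electronic and Computer Engineering)
Year of publication: 2013
Academic year of graduation: 101 (ROC calendar)
Language: Chinese
Pages: 75
Chinese keywords: 平行處理 (parallel processing)
Foreign-language keywords: CUDA, OFDM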

In recent years, CPU architectures have moved from single-core multithreading to multi-core multithreading, and the concept of parallel processing has become increasingly widespread. Cooperative processing across multiple computers, once achievable only on supercomputer systems, can now be realized on personal computers. After integrating the formerly separate vertex processing units and texture processing units, graphics chip vendors developed general-purpose GPUs (GPGPU). The multi-core architecture of graphics processors has begun to change a supercomputing landscape long dominated by CPUs. CUDA, the multi-core processing architecture introduced by the graphics chip vendor NVIDIA, exploits this multi-core advantage to perform parallel processing. Beyond its original graphics processing applications, CUDA is widely used in fields such as scientific computing, weather simulation, commercial risk assessment, telecommunications, and medicine.

This research examines the latest CUDA architecture and parallel processing theory, and shows how CUDA concepts can be used to parallelize one of the tasks in OFDM communications. This application is then used to discuss the advantages and limitations of CUDA parallel processing.

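Abstract (Chinese)
Abstract (English)
Acknowledgments
Table of Contents
List of Figures
List of Tables
1. Introduction
2. Introduction to Parallel Processing
2.1 Parallel Processing Concepts
2.2 Amdahl's Law
2.3 Supercomputers and the Rise of the GPU
3. Introduction to NVIDIA CUDA
3.1 NVIDIA GPU
3.2 Overview of the Fermi and Kepler Architectures
3.2.1 Fermi Architecture
3.2.2 Kepler Architecture
3.3 CUDA Syntax Concepts
3.3.1 CUDA Hardware Definitions and Software Flow
3.3.2 CUDA and C
3.4 Differences in Memory Access Methods
3.5 Basic CUDA Processing Flow
4. Introduction to OFDM Communication Applications
4.1 What Is OFDM
4.2 OFDM Principles and Wireless Transmission Applications
4.3 Concept and Motivation for Parallelizing OFDM
4.4 The Parallel Processing Portion
5. OFDM Parallelization Experiments Using CUDA
5.1 Preparation and Environment Configuration
5.1.1 Hardware Configuration
5.1.2 Software Configuration
5.1.3 Environment Setup Process
5.2 Parallelizable Parts of OFDM
5.2.1 Main Simulation Part to Be Parallelized
5.2.2 Program Rewriting
5.2.3 Architecture Rewriting
5.2.4 Condition Settings
5.2.5 Timing Functions
5.3 CUDA Experiments
5.3.1 Experimental Method 1
5.3.2 Experimental Method 2
5.3.3 Experimental Method 3
5.3.4 Comparison of Procedure Execution Times
5.4 Experimental Results and Comparison of Total Execution Time
6. Conclusion
7. References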

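Full-text release date: 2016/06/04 (campus network)
Full-text release date: full text not authorized for public release (off-campus network)
Full-text release date: full text not authorized for public release (National Central Library: Taiwan theses and dissertations system)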