簡易檢索 / 詳目顯示

研究生: 陳勁銘
Jing-Ming Chen
論文名稱: 以分散式及共享式記憶體架構下執行高速平行計算求解納維爾-斯托克斯方程式研究
Study of High-Performance Computing for Solving Navier-Stokes Equation in Architectures of Distributed and Shared Memory.
指導教授: 陳明志
Ming-Jyh Chern
口試委員: 謝佑明
Yo-Ming Hsieh
王謹誠
Chin-Cheng Wang
學位類別: 碩士
Master
系所名稱: 工程學院 - 機械工程系
Department of Mechanical Engineering
論文出版年: 2023
畢業學年度: 111
語文別: 英文
論文頁數: 72
中文關鍵詞: 直接施力邊界沈浸法高效能運算計算流體力學普松方程式物件導向程式設計
外文關鍵詞: Direct-forcing Immersed Boundart (DFIB), High-Performance Computing (HPC), Computational Fluid Dynamics (CFD), Poisson's Equation, Objective-oriented Programming Design
相關次數: 點閱:159下載:0
分享至:
查詢本校圖書館目錄 查詢臺灣博碩士論文知識加值系統 勘誤回報

  • Chinese Abstract................................... i Abstract........................................... ii Acknowledgements .................................. iv Contents ........................................... v Nomenclatures ....................................viii ListofTables....................................... x ListofFigures .................................... xi 1 INTRODUCTION 1 1.1 Motivation.................................... 1 1.2 LiteratureReview................................ 2 1.2.1 Computational Fluid Dynamics and Direct-Forcing ImmersedBoundaryMethod ..................... 2 1.2.2 Prediction-correction Process on DFIB method . . . . . . . . . . . 4 1.2.3 N-SSolverwithParallelComputing ................. 4 1.2.4 Iterative Methods for the Solution of Large Linear System . . . . . 6 1.2.5 SIMD .................................. 6 1.2.6 ArithmeticIntensity .......................... 7 1.3 Synopsis..................................... 7 2 METHODOLOGY 9 2.1 LESGoverningEquationandModels..................... 9 2.1.1 LESGoverningEquation ....................... 9 2.1.2 Prediction-correction Model for FSI Problem . . . . . . . . . . . . 11 2.2 CompilerOptimization............................. 13 2.3 ParallelComputing............................... 14 2.4 ComputationalEnvironment,Taiwania3................... 15 2.4.1 DataLocalityinCC-NUMA...................... 16 2.5 PerformanceTool................................ 17 2.6 VisualizationandPLOT3DFileFormat ................... 18 2.7 ProgramArchitecture ............................. 18 3 PARALLEL COMPUTATIONAL STRATEGIES FOR SOLVING POISSON’S EQUATION 21 3.1 DiscretizationofPoisson’sEquation...................... 21 3.2 ConvergenceConditionfortheIterativeMethod . . . . . . . . . . . . . . . 23 3.3 BiCGSTAB................................... 23 3.4 BiCGSTABwithMPI ............................. 25 3.5 Performance Evaluation for Inner Product Computation . . . . . . . . . . 26 3.6 SparseMatrixFormat ............................. 27 3.7 ELLSparseMatrixformat........................... 28 3.7.1 Insert Method and he Method for Extraction CSR Format . . . . 28 3.7.2 ELL SpMV Computational Optimization and Analysis Method . . 32 3.8 SPESparseMatrixformat........................... 33 3.9 SPE-2SparseMatrixformat.......................... 34 3.10 TheDataStructuresinCSR,ELL,andSPEformat. . . . . . . . . . . . . 35 3.11 PerformanceEvaluationsforSpMVComputations. . . . . . . . . . . . . . 37 4 RESULTS AND DISCUSSION 39 4.1 ComparisonwithinELLandCSRformat .................. 39 4.1.1 Introductiontothe2DConductionProblem . . . . . . . . . . . . . 39 4.1.2 Introduction to the System Matrix Provided COMSOL FEM Solver 40 4.1.3 PerformanceEvaluationsandDiscussion . . . . . . . . . . . . . . . 42 4.2 FlowPastCylinderatRe=40 ........................ 43 4.2.1 The Relationship Between the ?∗ and Parallel Performance . . . . . 46 4.2.2 A Comparison Based Algorithms for Solving Poisson’s Equation . . 48 5 CONCLUSIONS AND FUTURE WORKS 50 5.1 Conclusions................................... 50 5.2 FutureWorks.................................. 51 BIBLIOGRAPHY.................................51

    [1] J. Reinders. VTune Performance Analyzer Essentials, volume 9. Intel Press Santa Clara, 2005.
    [2] A. C. De Melo. The new linux ’perf’ tools. In Slides from Linux Kongress, volume 18, pages 1–42, 2010.
    [3] H. K. Versteeg and W. Malalasekera. An introdution to computational fluid dynamics. :The Finite Volume Method, Prentice Hall, 2, 1995.
    [4] E. A. Fadlun, R. Verzicco, P. Orlandi, and J. Mohd-Yusof. Combined immersed-boundary finite-difference methods for three-dimensional complex flow simulations. Journal of Computational Physics, 161(1):pages 35–60, 2000.
    [5] C. S. Peskin. Flow patterns around heart valves: a numerical method. Journal of Computational Physics, 10(2):pages 252–271, 1972.
    [6] M. J. Chern, E. A. Odhiambo, T. L. Horng, and A. G. L. Borthwick. Numerical simulation of vibration of horizontal cylinder induced by progressive waves. Fluid Dynamics Research, 48(1):pages 015508, 2016.
    [7] D. Z. Noor, M. J. Chern, and T. L. Horng. An immersed boundary method to solve fluid-solid interaction problems. Computational Mechanics, 44(4):pages 447–453, 2009.
    [8] M. J. Chern, D. Z. Noor, C. B. Liao, and T. L. Horng. Direct-forcing immersed boundary method for mixed heat transfer. Communications in Computational Physics, 18(4):pages 1072–1094, 2015.
    [9] T. L. Horng, P. W. Hsieh, S. Y. Yang, and C. S. You. A simple direct-forcing immersed boundary projection method with prediction-correction for fluid-solid interaction problems. Computers & Fluids, 176:pages 135–152, 2018.
    [10] C. S. You. On two Immersed Boundary Methods for Simulating the Dynamics of Fluid-structure Interaction Problems. Ph.d. thesis, National Central University, Taoyuan City, Taiwan, 2016.
    [11] H. Choi and P. Moin. Effects of the computational time step on numerical solutions of turbulent flow. Journal of Computational Physics, 113(1):pages 1–4, 1994.
    [12] W. Gropp, T. Hoefler, R. Thakur, and E. Lusk. Using Advanced MPI: Modern Features of the Message-passing Interface. MIT Press, 2014.
    [13] S. Cools and W. Vanroose. The communication-hiding pipelined bicgstab method for the parallel solution of large unsymmetric linear systems. Parallel Computing, 65:pages 1–20, 2017.
    [14] T. Ye, R. Mittal, H. Udaykumar, and W. Shyy. An accurate cartesian grid method for viscous incompressible flows with complex immersed boundaries. Journal of Computational Physics, 156(2):pages 209–240, 1999.
    [15] P. R. Eller, T. Hoefler, and W. Gropp. Using performance models to understand scalable krylov solver performance at scale for structured grid problems. In Proceedings of the ACM International Conference on Supercomputing, pages 138–149, 2019.
    [16] Y. M. Hsieh. Parallel Computation in Efficient Non-linear Finite Element Analysis with Applications to Soft-ground Tunneling Project. Ph.d. thesis, Massachusetts Institute of Technology, Cambridge, MA, 2004.
    [17] B. Wicaksono, M. Tolubaeva, and B. Chapman. Detecting false sharing in openmp applications using the darwin framework. In International Workshop on Languages and Compilers for Parallel Computing, pages 283–297, Fort Collins, CO, USA, 2011. Springer.
    [18] S. P. Frankel. Convergence rates of iterative treatments of partial differential
    equations. Mathematics of Computation, 4(30):65–75, 1950.
    [19] D. Young. Iterative methods for solving partial difference equations of elliptic type.
    Transactions of the American Mathematical Society, 76(1):pages 92–111, 1954.
    [20] M. Klemm, A. Duran, X. Tian, H. Saito, D. Caballero, and X. Martorell. Extending OpenMp∗ with vector constructs for modern multicore simd architectures. In International workshop on OpenMP, pages 59–72. Springer, 2012.
    [21] S. W. Williams. Auto-tuning Performance on Multicore Computers. University of California, Berkeley, 2008.
    [22] J. Smagorinsky. General circulation experiments with the primitive equations: I. the basic experiment. Monthly Weather Review, 91(3):pages 99–164, 1963.
    [23] Y. Zhiyin. Large-eddy simulation: Past, present and the future. Chinese Journal of Aeronautics, 28(1):pages 11–24, 2015.
    [24] D. K. Lilly. The representation of small-scale turbulence in numerical simulation experiments. IBM Form, pages pages 195–210, 1967.
    [25] B. P. Leonard. A stable and accurate convective modelling procedure based on quadratic upstream interpolation. Computer Methods in Applied Mechanics and Engineering, 19(1):pages 59–98, 1979.
    [26] F. Broquedis, J. Clet-Ortega, S. Moreaud, N. Furmento, B. Goglin, G. Mercier,
    S. Thibault, and R. Namyst. hwloc: A generic framework for managing hardware affinities in hpc applications. In 2010 18th Euromicro Conference on Parallel and Distributed and Network-based Processing, pages 180–186, Pisa, Italy, 2010. IEEE.
    [27] P. P. Walatka. PLOT3D User’s Manual, volume 101067. NASA, 1990.
    [28] S. A. Raza. Simulation of Vortex-induced Vibration of 3-D Structures in Turbulent Flow Using Direct-forcing Immersed Boundary Method. Ph.d. thesis, National Taiwan University of Science and Technology, Taiwan, 2021.
[29] S. A. Raza, Y. H. Irawan, and M. J. Chern. Effect of boundary conditions and domain size on the turbulent flow characteristics over a circular cylinder. Journal of the Chinese Institute of Engineers, 44(7):pages 659–672, 2021.
    [30] W. Kahan. Ieee standard 754 for binary floating-point arithmetic. Lecture Notes on the Status of IEEE, 754(94720-1776):pages 11, 1996.
    [31] U. Borštnik, J. VandeVondele, V. Weber, and J. Hutter. Sparse matrix multiplication: The distributed block-compressed sparse row library. Parallel Computing, 40(5-6):pages 47–58, 2014.
    [32] E. Wang, Q. Zhang, B. Shen, G. Zhang, X. Lu, Q. Wu, and Y. Wang. High-performance computing on the Intel® xeon phi. Springer, 5:pages 2, 2014.
    [33] G. Guennebaud and B. Jacob. Eigen: A c++ linear algebra library. URL http://eigen. tuxfamily. org, Accessed, 22, 2014.
    [34] A. Ekambaram and E. Montagne. An alternative compressed storage format for sparse matrices. In International Symposium on Computer and Information Sciences, pages 196–203, Antalya, Turkey, 2003. Springer.
    [35] D. Demidov. Amgcl: An efficient, flexible, and extensible algebraic multigrid implementation. Lobachevskii Journal of Mathematics, 40(5):pages 535–546, 2019.
    [36] D. Demidov. Amgcl documentation, 2017.
    [37] C. Sanderson and R. Curtin. A user-friendly hybrid sparse matrix class in c++. In International Congress on Mathematical Software, pages 422–430, South Bend, IN, USA. Springer.
    [38] D. Calhoun. A cartesian grid method for solving the two-dimensional streamfunction-vorticity equations in irregular regions. Journal of Computational Physics, 176(2):231–275, 2002.
    [39] M. N. Linnick and H. F. Fasel. A high-order immersed interface method for simulating unsteady incompressible flows on irregular domains. Journal of Computational Physics, 204(1):pages 157–192, 2005.
    [40] S. W. Su, M. C. Lai, and C. A. Lin. An immersed boundary technique for simulating complex flows with rigid boundary. Computers & Fluids, 36(2):pages 313–324, 2007.
    [41] K. Taira and T. Colonius. The immersed boundary method: a projection approach. Journal of Computational Physics, 225(2):pages 2118–2137, 2007.
    [42] B. M. Sumer. Hydrodynamics around Cylindrical Strucures, volume 26. World Scientific, 2006.

    無法下載圖示 全文公開日期 2026/01/13 (校內網路)
    全文公開日期 2026/01/13 (校外網路)
    全文公開日期 2026/01/13 (國家圖書館:臺灣博碩士論文系統)
    QR CODE