Authors | S. Alireza Zolfaghari,Hossein Mahmoodi Darian,Hamid Saadatfar |
---|---|
Journal | Computational and Applied Mathematics |
Page number | 1-35 |
Serial number | 39 |
Volume number | 305 |
Paper Type | Full Paper |
Published At | 2020 |
Journal Grade | ISI |
Journal Type | Electronic |
Journal Country | Albania |
Journal Index | ISI،JCR،Scopus |
Abstract
In the present study, an efficient GPU-based corrected explicit–implicit domain decomposition scheme is proposed to accelerate fractional steps solvers. Implicit time advancement in fractional steps solvers leads to several independent tri-diagonal systems. In the present method, by decomposing the domain and predicting the solution at the interface points, the original tri-diagonal systems are decomposed to several independent systems; this allows partitioning the workloads. After solving the systems, correction is performed to stabilize the solution. The method is implemented using different strategies and memory coalescing, and cache throttling techniques are employed to improve the performance. Numerical experiments are conducted for two- and three-dimensional reaction–diffusion problems to measure the accuracy, stability, and efficiency of the method. The new efficient prediction and correction schemes presented in this study preserve the accuracy and the stability of the solver even for a large number of sub-domains. Therefore, the method provides many independent tri-diagonal systems and creates a large number of threads to keep the GPU occupied. The partitioning procedure is well adapted for GPU-computing; thus, the method effectively accelerates the solution and outperforms the previous methods in terms of computational time.
tags: CFD · High-performance computing · GPU · CUDA · Corrected explicit–implicit domain decomposition · Reaction–diffusion equation