arrow
Volume 7, Issue 1
Implementation of Multi-GPU Based Lattice Boltzmann Method for Flow Through Porous Media

Changsheng Huang, Baochang Shi, Nanzhong He & Zhenhua Chai

Adv. Appl. Math. Mech., 7 (2015), pp. 1-12.

Published online: 2018-03

Export citation
  • Abstract

The lattice Boltzmann method (LBM) can gain a great amount of performance benefit by taking advantage of graphics processing unit (GPU) computing, and thus, the GPU, or multi-GPU based LBM can be considered as a promising and competent candidate in the study of large-scale fluid flows. However, the multi-GPU based lattice Boltzmann algorithm has not been studied extensively, especially for simulations of flow in complex geometries. In this paper, through coupling with the message passing interface (MPI) technique, we present an implementation of multi-GPU based LBM for fluid flow through porous media as well as some optimization strategies based on the data structure and layout, which can apparently reduce memory access and completely hide the communication time consumption. Then the performance of the algorithm is tested on a one-node cluster equipped with four Tesla C1060 GPU cards where up to 1732 MFLUPS is achieved for the Poiseuille flow and a nearly linear speedup with the number of GPUs is also observed.

  • Keywords

  • AMS Subject Headings

  • Copyright

COPYRIGHT: © Global Science Press

  • Email address
  • BibTex
  • RIS
  • TXT
@Article{AAMM-7-1, author = {Huang , ChangshengShi , BaochangHe , Nanzhong and Chai , Zhenhua}, title = {Implementation of Multi-GPU Based Lattice Boltzmann Method for Flow Through Porous Media}, journal = {Advances in Applied Mathematics and Mechanics}, year = {2018}, volume = {7}, number = {1}, pages = {1--12}, abstract = {

The lattice Boltzmann method (LBM) can gain a great amount of performance benefit by taking advantage of graphics processing unit (GPU) computing, and thus, the GPU, or multi-GPU based LBM can be considered as a promising and competent candidate in the study of large-scale fluid flows. However, the multi-GPU based lattice Boltzmann algorithm has not been studied extensively, especially for simulations of flow in complex geometries. In this paper, through coupling with the message passing interface (MPI) technique, we present an implementation of multi-GPU based LBM for fluid flow through porous media as well as some optimization strategies based on the data structure and layout, which can apparently reduce memory access and completely hide the communication time consumption. Then the performance of the algorithm is tested on a one-node cluster equipped with four Tesla C1060 GPU cards where up to 1732 MFLUPS is achieved for the Poiseuille flow and a nearly linear speedup with the number of GPUs is also observed.

}, issn = {2075-1354}, doi = {https://doi.org/10.4208/aamm.2014.m468}, url = {http://global-sci.org/intro/article_detail/aamm/10940.html} }
TY - JOUR T1 - Implementation of Multi-GPU Based Lattice Boltzmann Method for Flow Through Porous Media AU - Huang , Changsheng AU - Shi , Baochang AU - He , Nanzhong AU - Chai , Zhenhua JO - Advances in Applied Mathematics and Mechanics VL - 1 SP - 1 EP - 12 PY - 2018 DA - 2018/03 SN - 7 DO - http://doi.org/10.4208/aamm.2014.m468 UR - https://global-sci.org/intro/article_detail/aamm/10940.html KW - AB -

The lattice Boltzmann method (LBM) can gain a great amount of performance benefit by taking advantage of graphics processing unit (GPU) computing, and thus, the GPU, or multi-GPU based LBM can be considered as a promising and competent candidate in the study of large-scale fluid flows. However, the multi-GPU based lattice Boltzmann algorithm has not been studied extensively, especially for simulations of flow in complex geometries. In this paper, through coupling with the message passing interface (MPI) technique, we present an implementation of multi-GPU based LBM for fluid flow through porous media as well as some optimization strategies based on the data structure and layout, which can apparently reduce memory access and completely hide the communication time consumption. Then the performance of the algorithm is tested on a one-node cluster equipped with four Tesla C1060 GPU cards where up to 1732 MFLUPS is achieved for the Poiseuille flow and a nearly linear speedup with the number of GPUs is also observed.

Huang , ChangshengShi , BaochangHe , Nanzhong and Chai , Zhenhua. (2018). Implementation of Multi-GPU Based Lattice Boltzmann Method for Flow Through Porous Media. Advances in Applied Mathematics and Mechanics. 7 (1). 1-12. doi:10.4208/aamm.2014.m468
Copy to clipboard
The citation has been copied to your clipboard