Blog
About

  • Record: found
  • Abstract: found
  • Article: found
Is Open Access

Chip-level and multi-node analysis of energy-optimized lattice-Boltzmann CFD simulations

Preprint

Read this article at

Bookmark
      There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

      Abstract

      Memory-bound algorithms show complex performance and energy consumption behavior on multicore processors. We choose the lattice-Boltzmann method (LBM) on an Intel Sandy Bridge cluster as a prototype scenario to investigate if and how single-chip performance and power characteristics can be generalized to the highly parallel case. First we perform an analysis of a sparse-lattice LBM implementation for complex geometries. Using a single-core performance model, we predict the intra-chip saturation characteristics and the optimal operating point in terms of energy to solution as a function of implementation details, clock frequency, vectorization, and number of active cores per chip. We show that high single-core performance and a correct choice of the number of active cores per chip are the essential optimizations for lowest energy to solution at minimal performance degradation. Then we extrapolate to the MPI-parallel level and quantify the energy-saving potential of various optimizations and execution modes, where we find these guidelines to be even more important, especially when communication overhead is non-negligible. In our setup we could achieve energy savings of 35% in this case, compared to a naive approach. We also demonstrate that a simple non-reflective reduction of the clock speed leaves most of the energy saving potential unused.

      Related collections

      Most cited references 23

      • Record: found
      • Abstract: not found
      • Article: not found

      KMCLib A general framework for lattice kinetic Monte Carlo (KMC) simulations

        Bookmark
        • Record: found
        • Abstract: not found
        • Article: not found

        Power-Management Architecture of the Intel Microarchitecture Code-Named Sandy Bridge

          Bookmark
          • Record: found
          • Abstract: not found
          • Article: not found

          A high-performance lattice Boltzmann implementation to model flow in porous media

            Bookmark

            Author and article information

            Journal
            2013-04-29
            2015-05-22
            1304.7664 10.1002/cpe.3489

            http://arxiv.org/licenses/nonexclusive-distrib/1.0/

            Custom metadata
            23 pages, 13 figures; post-peer-review version
            cs.PF cs.DC

            Comments

            Comment on this article