
      Chip-level and multi-node analysis of energy-optimized lattice-Boltzmann CFD simulations

      Preprint


          Abstract

          Memory-bound algorithms show complex performance and energy consumption behavior on multicore processors. We choose the lattice-Boltzmann method (LBM) on an Intel Sandy Bridge cluster as a prototype scenario to investigate if and how single-chip performance and power characteristics can be generalized to the highly parallel case. First we perform an analysis of a sparse-lattice LBM implementation for complex geometries. Using a single-core performance model, we predict the intra-chip saturation characteristics and the optimal operating point in terms of energy to solution as a function of implementation details, clock frequency, vectorization, and number of active cores per chip. We show that high single-core performance and a correct choice of the number of active cores per chip are the essential optimizations for lowest energy to solution at minimal performance degradation. Then we extrapolate to the MPI-parallel level and quantify the energy-saving potential of various optimizations and execution modes, where we find these guidelines to be even more important, especially when communication overhead is non-negligible. In our setup we could achieve energy savings of 35% in this case, compared to a naive approach. We also demonstrate that a simple non-reflective reduction of the clock speed leaves most of the energy saving potential unused.
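The trade-off described in the abstract, between the number of active cores, the clock frequency, and energy to solution, can be illustrated with a toy model. The sketch below is not the model from the paper: it assumes performance that scales linearly with core count and clock frequency until it hits a memory-bandwidth ceiling, and chip power made up of a baseline term plus a per-core term that is quadratic in frequency. All function names and parameter values (`p0`, `f0`, `p_sat`, `w_base`, `w1`, `w2`) are hypothetical and chosen only for illustration.

```python
# Toy model of energy to solution on a multicore chip.
# All parameter values are hypothetical; they are not taken from the paper.

def performance(n, f, p0=1.0, f0=2.7, p_sat=4.0):
    """Work per second with n active cores at clock frequency f (GHz).

    Assumes linear scaling in n and f up to a bandwidth ceiling p_sat.
    """
    return min(n * p0 * f / f0, p_sat)


def power(n, f, w_base=25.0, w1=1.5, w2=1.0):
    """Chip power (W): baseline plus a per-core part that grows with f."""
    return w_base + n * (w1 * f + w2 * f * f)


def energy_to_solution(n, f, work=1.0):
    """Energy for a fixed amount of work: power times runtime."""
    return power(n, f) * work / performance(n, f)


def best_core_count(f, max_cores=8):
    """Core count that minimizes energy to solution at frequency f."""
    return min(range(1, max_cores + 1),
               key=lambda n: energy_to_solution(n, f))


if __name__ == "__main__":
    for f in (1.35, 2.0, 2.7):
        n = best_core_count(f)
        print(f"f = {f} GHz: best n = {n}, "
              f"energy = {energy_to_solution(n, f):.2f}")
```

In this toy setting, cores added beyond the saturation point raise power without raising performance, so energy to solution grows again. That matches the qualitative point of the abstract that the number of active cores per chip has to be chosen deliberately rather than set to the maximum.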


                Author and article information

                Journal
                2013-04-29
                2015-05-22
                Article
                DOI: 10.1002/cpe.3489
                arXiv: 1304.7664

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                23 pages, 13 figures; post-peer-review version
                cs.PF cs.DC

                Performance, Systems & Control,Networking & Internet architecture
