8
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Scaling Structured Multigrid to 500K+ Cores through Coarse-Grid Redistribution

      Preprint
      , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The efficient solution of sparse, linear systems resulting from the discretization of partial differential equations is crucial to the performance of many physics-based simulations. The algorithmic optimality of multilevel approaches for common discretizations makes them a good candidate for an efficient parallel solver. Yet, modern architectures for high-performance computing systems continue to challenge the parallel scalability of multilevel solvers. While algebraic multigrid methods are robust for solving a variety of problems, the increasing importance of data locality and cost of data movement in modern architectures motivates the need to carefully exploit structure in the problem. Robust logically structured variational multigrid methods, such as Black Box Multigrid (BoxMG), maintain structure throughout the multigrid hierarchy. This avoids indirection and increased coarse-grid communication costs typical in parallel algebraic multigrid. Nevertheless, the parallel scalability of structured multigrid is challenged by coarse-grid problems where the overhead in communication dominates computation. In this paper, an algorithm is introduced for redistributing coarse-grid problems through incremental agglomeration. Guided by a predictive performance model, this algorithm provides robust redistribution decisions for structured multilevel solvers. A two-dimensional diffusion problem is used to demonstrate the significant gain in performance of this algorithm over the previous approach that used agglomeration to one processor. In addition, the parallel scalability of this approach is demonstrated on two large-scale computing systems, with solves on up to 500K+ cores.

          Related collections

          Most cited references15

          • Record: found
          • Abstract: not found
          • Article: not found

          Optimization of Collective Communication Operations in MPICH

            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            The Multi-Grid Method for the Diffusion Equation with Strongly Discontinuous Coefficients

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Matrix-dependent prolongations and restrictions in a blackbox multigrid solver

                Bookmark

                Author and article information

                Journal
                06 March 2018
                Article
                1803.02481
                474ac43f-b502-4228-bd57-08ab255bd2d2

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                Los Alamos Report LA-UR-17-22886
                21 pages
                cs.MS cs.NA cs.PF physics.comp-ph

                Comments

                Comment on this article