23
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      High Performance Computing for gravitational lens modeling: single vs double precision on GPUs and CPUs

      Preprint
      , , ,

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Strong gravitational lensing is a powerful probe of cosmology and the dark matter distribution. Efficient lensing software is already a necessity to fully use its potential and the performance demands will only increase with the upcoming generation of telescopes. In this paper, we study the possible impact of High Performance Computing techniques on a performance-critical part of the widely used lens modeling software LENSTOOL. We implement the algorithm once as a highly optimized CPU version and once with graphics card acceleration for a simple parametric lens model. In addition, we study the impact of finite machine precision on the lensing algorithm. While double precision is the default choice for scientific applications, we find that single precision can be sufficiently accurate for our purposes and lead to a big speedup. Therefore we develop and present a mixed precision algorithm which only uses double precision when necessary. We measure the performance of the different implementations and find that the use of High Performance Computing Techniques dramatically improves the code performance both on CPUs and GPUs. Compared to the current LENSTOOL implementation on 12 CPU cores, we obtain speedup factors of up to 170. We achieve this optimal performance by using our mixed precision algorithm on a high-end GPU which is common in modern supercomputers. We also show that these techniques reduce the energy consumption by up to 98%. Furthermore, we demonstrate that a highly competitive speedup can be reached with consumer GPUs. While they are an order of magnitude cheaper than the high-end graphics cards, they are rarely used for scientific computations due to their low double precision performance. Our mixed precision algorithm unlocks their full potential. The consumer GPU delivers a speedup which is only a factor of four lower than the best speedup achieved by a high-end GPU.

          Related collections

          Most cited references25

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          A Universal Density Profile from Hierarchical Clustering

          We use high-resolution N-body simulations to study the equilibrium density profiles of dark matter halos in hierarchically clustering universes. We find that all such profiles have the same shape, independent of halo mass, of initial density fluctuation spectrum, and of the values of the cosmological parameters. Spherically averaged equilibrium profiles are well fit over two decades in radius by a simple formula originally proposed to describe the structure of galaxy clusters in a cold dark matter universe. In any particular cosmology the two scale parameters of the fit, the halo mass and its characteristic density, are strongly correlated. Low-mass halos are significantly denser than more massive systems, a correlation which reflects the higher collapse redshift of small halos. The characteristic density of an equilibrium halo is proportional to the density of the universe at the time it was assembled. A suitable definition of this assembly time allows the same proportionality constant to be used for all the cosmologies that we have tested. We compare our results to previous work on halo density profiles and show that there is good agreement. We also provide a step-by-step analytic procedure, based on the Press-Schechter formalism, which allows accurate equilibrium profiles to be calculated as a function of mass in any hierarchical model.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            The Structure of Cold Dark Matter Halos

            We use N-body simulations to investigate the structure of dark halos in the standard Cold Dark Matter cosmogony. Halos are excised from simulations of cosmologically representative regions and are resimulated individually at high resolution. We study objects with masses ranging from those of dwarf galaxy halos to those of rich galaxy clusters. The spherically averaged density profiles of all our halos can be fit over two decades in radius by scaling a simple ``universal'' profile. The characteristic overdensity of a halo, or equivalently its concentration, correlates strongly with halo mass in a way which reflects the mass dependence of the epoch of halo formation. Halo profiles are approximately isothermal over a large range in radii, but are significantly shallower than \(r^{-2}\) near the center and steeper than \(r^{-2}\) near the virial radius. Matching the observed rotation curves of disk galaxies requires disk mass-to-light ratios to increase systematically with luminosity. Further, it suggests that the halos of bright galaxies depend only weakly on galaxy luminosity and have circular velocities significantly lower than the disk rotation speed. This may explain why luminosity and dynamics are uncorrelated in observed samples of binary galaxies and of satellite/spiral systems. For galaxy clusters, our halo models are consistent both with the presence of giant arcs and with the observed structure of the intracluster medium, and they suggest a simple explanation for the disparate estimates of cluster core radii found by previous authors. Our results also highlight two shortcomings of the CDM model. CDM halos are too concentrated to be consistent with the halo parameters inferred for dwarf irregulars, and the predicted abundance of galaxy halos is larger than the observed abundance of galaxies.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Constraints on the Self‐Interaction Cross Section of Dark Matter from Numerical Simulations of the Merging Galaxy Cluster 1E 0657−56

                Bookmark

                Author and article information

                Journal
                08 February 2019
                Article
                1902.03252
                e71cd7ee-ed7c-4ead-9de6-38e4e176955a

                http://arxiv.org/licenses/nonexclusive-distrib/1.0/

                History
                Custom metadata
                28 pages, submitted to Astronomy & Computing
                astro-ph.IM astro-ph.CO

                Cosmology & Extragalactic astrophysics,Instrumentation & Methods for astrophysics

                Comments

                Comment on this article