Parallel Tensor Compression for Large-Scale Scientific Data

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

As parallel computing trends towards the exascale, scientific data produced by high-fidelity simulations are growing increasingly massive. For instance, a simulation on a three-dimensional spatial grid with 512 points per dimension that tracks 64 variables per grid point for 128 time steps yields 8~TB of data, assuming double precision. By viewing the data as a dense five-way tensor, we can compute a Tucker decomposition to find inherent low-dimensional multilinear structure, achieving compression ratios of up to 5000 on real-world data sets with negligible loss in accuracy. So that we can operate on such massive data, we present the first-ever distributed-memory parallel implementation for the Tucker decomposition, whose key computations correspond to parallel linear algebra operations, albeit with nonstandard data layouts. Our approach specifies a data distribution for tensors that avoids any tensor data redistribution, either locally or in parallel. We provide accompanying analysis of the computation and communication costs of the algorithms. To demonstrate the compression and accuracy of the method, we apply our approach to real-world data sets from combustion science simulations. We also provide detailed performance results, including parallel performance in both weak and strong scaling experiments.

Related collections

Author and article information

Journal

Publication date Created: 2015-10-22

Publication date Updated: 2016-02-23

Article

ArXiV ID: 1510.06689

SO-VID: a8e12495-605d-4631-b799-6703404dbcf4

License:

http://arxiv.org/licenses/nonexclusive-distrib/1.0/

History

Custom metadata

Categories cs.NA cs.DC

ScienceOpen disciplines: Numerical & Computational mathematics,Networking & Internet architecture

Data availability:

ScienceOpen disciplines: Numerical & Computational mathematics, Networking & Internet architecture

Parallel Tensor Compression for Large-Scale Scientific Data

Read this article at

Abstract

Related collections

Data-Driven Civil Engineering

Author and article information

Journal

Article

History

Custom metadata

Comments

Comment on this article

Similar content 97