Download

Abstract

In the era of big data, effectively compressing large datasets while performing complex mathematical operations is crucial. Tensor-based decomposition methods have shown superior compression capabilities with minimal loss of accuracy compared to traditional matrix methods. Under the star-M tensor framework, tensors can be decomposed in a matrix-mimetic way, including using the star-M SVD. This tensor SVD has optimality guarantees and has shown exceptional performance on specific types of data, but software implementations have been mostly limited to productivity-oriented languages. In this work, we present our development of a shared-memory parallel, high-performance solution designed to efficiently implement the underlying algorithms. This software will enable optimal compression of extensive scientific datasets, paving the way for enhanced data analysis and insights.


Figure 6: Strong scaling of the star-M SVD on the ncep-air-6 dataset


Citation

Md Taufique Hussain, Grey Ballard, Aditya Devarakonda, Srinivas Eswar, Naman Pesricha and Vishwas Rao, “High-Performance Star-M SVD for Big Data Compression”, arXiv:2605.16058, 2026. https://arxiv.org/abs/2605.16058

@misc{hussain2026highperformance,
      title={High-Performance Star-M SVD for Big Data Compression},
      author={Md Taufique Hussain and Grey Ballard and Aditya Devarakonda and Srinivas Eswar and Naman Pesricha and Vishwas Rao},
      year={2026},
      eprint={2605.16058},
      archivePrefix={arXiv},
      primaryClass={cs.DC},
      url={https://arxiv.org/abs/2605.16058},
}