GALE: Leveraging Heterogeneous Systems for Efficient Unstructured Mesh Data Analysis
Quick Links
Media
- arXiv Posting
- Self-Hosted Publication
© Thomas Randall 2025. This is the author’s version of the work. It is posted here for your personal use. Not for redistribution.
- Recorded Presentation
Publication Details
Appears in IEEE Transactions on Visualization and Computer Graphics by authors: Guoxi Liu, Thomas Randall, Rong Ge, Federico Iuricich
Abstract
Unstructured meshes present challenges in scientific data analysis due to irregular distribution and complex connectivity. Computing and storing connectivity information is a major bottleneck for visualization algorithms, affecting both time and memory performance. Recent task-parallel data structures address this by precomputing connectivity information at runtime while the analysis algorithm executes, effectively hiding computation costs and improving performance. However, existing approaches are CPU-bound, forcing the data structure and analysis algorithm to compete for the same computational resources, limiting potential speedups. To overcome this limitation, we introduce a novel task-parallel approach optimized for heterogeneous CPU-GPU systems. Specifically, we offload the computation of mesh connectivity information to GPU threads, enabling CPU threads to focus on executing the visualization algorithm. Following this paradigm, we propose GALE (GPU-Aided Localized data structurE), the first open-source CUDA-based data structure designed for heterogeneous task parallelism. Experiments on two 20-core CPUs and an NVIDIA V100 GPU show that GALE achieves up to 2.7x speedup over state-of-the-art localized data structures while maintaining memory efficiency.