Automatic Data Enumeration for Fast Collections
Data collections provide a powerful abstraction to organize data, simplifying development and maintenance. Choosing an implementation for each collection is a critical decision, with performance, memory and energy tradeoffs that need to be balanced for each use case. Specialized implementations offer significant benefits over their general-purpose counterparts, but also require certain properties of the data they store, such as uniqueness or ordering. To employ them, developers must either possess domain knowledge or transform their data to exhibit the desired property, which is a tedious, manual process. One such transformation—commonly used in data mining and program analysis—is data enumeration, where data items are assigned unique identifiers to enable fast equality checks and compact memory layout. In this paper, we present an automated approach to data enumeration, eliminating the need for manual developer effort. Our implementation in the MEMOIR compiler achieves speedups of 2.16× on average (up to 8.72×) and reduces peak memory consumption by 5.6% on average (up to 50.7%). This work shows that automated techniques can manufacture data properties to unlock specialized collection implementations, pushing the envelope of collection-oriented optimization.
Mon 2 FebDisplayed time zone: Hobart change
14:10 - 15:30 | |||
14:10 20mTalk | Flow-Graph-Aware Tiling and Rescheduling for Memory-Efficient On-Device Inference Main Conference Pre-print | ||
14:30 20mTalk | VFlatten: Selective Value-Object Flattening using Hybrid Static and Dynamic Analysis Main Conference Arjun H. Kumar IIT Mandi, Bhavya Hirani SVNIT, Surat, Hang Shao IBM, Tobi Ajila IBM, Vijay Sundaresan IBM Canada, Daryl Maier IBM Canada, Manas Thakur IIT Bombay Pre-print Media Attached | ||
14:50 20mTalk | FRUGAL: Pushing GPU Applications beyond Memory Limits Main Conference Lingqi Zhang RIKEN RCCS, Tengfei Wang Google Cloud, Jiajun Huang University of California, Riverside, Chen Zhuang Tokyo Institute of Technology, Riken Center for Computational Science, Ivan Ivanov Institute of Science Tokyo, Peng Chen RIKEN RCCS, Toshio Endo , Mohamed Wahib RIKEN Center for Computational Science Pre-print | ||
15:10 20mTalk | Automatic Data Enumeration for Fast Collections Main Conference Pre-print Media Attached | ||