CGO 2026
Sat 31 January - Wed 4 February 2026 Sydney, Australia
co-located with HPCA/CGO/PPoPP/CC 2026
Sun 1 Feb 2026 18:00 - 20:00 at Pyrmont Foyer - Posters

In our work, we explore tiling for a novel DL accelerator cluster called the Snitch Cluster. Unlike systolic-array-based accelerators, Snitch compute cores combine streaming registers, hardware loops, and pipelining to achieve high FPU utilization on loop-intensive computations such as matrix multiplication. Schedulers for DL workloads must account for the low-level scheduling details of their target hardware (custom RISC-V instructions, in the case of Snitch) to make informed decisions. Currently, no cost models are readily available to guide scheduling on Snitch.

We present Myrtle, a tiling cost model for an 8-core Snitch Cluster, parameterized by three categories of input: application, hardware, and low-level scheduling details. We combine memory footprint calculation, streaming register configuration counts, identification of streaming versus regular register loads, and pruning heuristics to generate a promising search space and automatically select a near-optimal tile size. Building upon a Snitch-specific tile layout, we aim to exploit the regularity of the Snitch architecture to develop a highly interpretable cost model trained with Support Vector Regression (SVR) and Generalized Additive Models (GAMs).
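
To make the footprint-and-pruning idea concrete, here is a minimal Python sketch of how a tile search space for a matrix multiplication might be generated and ranked. It is not Myrtle's implementation. The 128 KiB local memory size, double buffering, 8-byte elements, the "rows split evenly across 8 cores" pruning rule, and the data-movement proxy cost are all illustrative assumptions, and every name (`Tile`, `footprint_bytes`, `candidate_tiles`, `proxy_cost`, `select_tile`) is hypothetical. The abstract's learned SVR/GAM model is replaced here by a hand-written proxy.

```python
from dataclasses import dataclass
from itertools import product


@dataclass(frozen=True)
class Tile:
    """Tile sizes for C[m, n] += A[m, k] @ B[k, n] (hypothetical type)."""
    m: int
    n: int
    k: int


def divisors(x):
    """All positive divisors of x, so tiles divide the problem evenly."""
    return [d for d in range(1, x + 1) if x % d == 0]


def footprint_bytes(tile, elem_bytes=8, double_buffered=True):
    """Local-memory bytes for one A, B and C tile (assumed layout)."""
    elems = tile.m * tile.k + tile.k * tile.n + tile.m * tile.n
    factor = 2 if double_buffered else 1  # assumption: double buffering
    return factor * elems * elem_bytes


def candidate_tiles(M, N, K, l1_bytes, num_cores=8, elem_bytes=8):
    """Enumerate tiles, pruning those that cannot fit or balance."""
    kept = []
    for m, n, k in product(divisors(M), divisors(N), divisors(K)):
        tile = Tile(m, n, k)
        # Assumed pruning heuristic: each core gets the same number of rows.
        if m % num_cores != 0:
            continue
        # Memory footprint check against the local scratchpad.
        if footprint_bytes(tile, elem_bytes) > l1_bytes:
            continue
        kept.append(tile)
    return kept


def proxy_cost(tile, M, N, K, elem_bytes=8):
    """Toy cost: (bytes of A and B loaded, number of tiles).

    The tile count is a tie-breaker standing in for per-tile setup work.
    """
    num_tiles = (M // tile.m) * (N // tile.n) * (K // tile.k)
    loaded = num_tiles * (tile.m * tile.k + tile.k * tile.n) * elem_bytes
    return (loaded, num_tiles)


def select_tile(M, N, K, l1_bytes=128 * 1024):
    """Pick the candidate with the lowest proxy cost, or None if none fit."""
    tiles = candidate_tiles(M, N, K, l1_bytes)
    if not tiles:
        return None
    return min(tiles, key=lambda t: proxy_cost(t, M, N, K))


if __name__ == "__main__":
    print(select_tile(64, 64, 64))
```

A learned model would take the place of `proxy_cost`: the footprint and pruning steps narrow the search space, and the regression model ranks what remains.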


Sun 1 Feb

Displayed time zone: Hobart

18:00 - 20:00
18:00
2h
Poster
Tensor Abstraction Enabling Explicit Layout Optimization in Homomorphic Encryption
Student Research Competition
Seongho Kim Yonsei University, Hanjun Kim Yonsei University
18:00
2h
Poster
UniCon: Unified Controllers for the Quantum Computers
Student Research Competition
Ercüment Kaya Technical University of Munich and Leibniz Supercomputing Centre, Hossam Ahmed Technical University of Munich and Leibniz Supercomputing Centre, Martin Schulz Technical University of Munich
18:00
2h
Poster
MDH-DSL: Reduction-Aware Data Parallelism via Multi-Dimensional Homomorphisms
Student Research Competition
Richard Schulze University of Muenster, Sergei Gorlatch University of Muenster
18:00
2h
Poster
Effective Tiling for the Snitch Cluster
Student Research Competition
Emily Sillars University of Murcia, Spain, Alexandra Jimborean University of Murcia
18:00
2h
Poster
Automated Adversarial Test Generation for Debugging Neural Compiler Optimizations
Student Research Competition
Vasu Jindal Columbia University
18:00
2h
Poster
Unlocking Vectorization Scope: Extensible Vectorization via Unified Dependence Semantics
Student Research Competition
Shihan Fang Shanghai Jiao Tong University, Wenxin Zheng Shanghai Jiao Tong University
18:00
2h
Poster
Unifying Medium Sparse Processing Frameworks
Student Research Competition
Meisam Tarabkhah University of Edinburgh, Amir Shaikhha University of Edinburgh
18:00
2h
Poster
Bridging Linalg Dialect with Gemmini Backend
Student Research Competition
Jaemin Kim Yonsei University, Hanjun Kim Yonsei University
18:00
2h
Poster
Leveraging Alias Analysis Without Porting
Student Research Competition
Ravikiran Ravindranath Reddy University of Murcia, Alberto Ros University of Murcia, Alexandra Jimborean University of Murcia