CGO 2026
Sat 31 January - Wed 4 February 2026 Sydney, Australia
co-located with HPCA/CGO/PPoPP/CC 2026
VenueInternational Convention Centre Sydney
Room nameBronte
Floor2
Room numberC2.2-3
Capacity171
Room InformationNo extra information available
Program

This program is tentative and subject to change.

You're viewing the program in a time zone which is different from your device's time zone change time zone

Mon 2 Feb

Displayed time zone: Hobart change

09:50 - 11:10
Compiling for ML 1Main Conference at Bronte
09:50
20m
Talk
Enabling Spill-Free Compilation via Affine-Based Live Range Reduction Optimization
Main Conference
10:10
20m
Talk
GRANII: Selection and Ordering of Primitives in GRAph Neural Networks using Input Inspection
Main Conference
Damitha Lenadora University of Illinois at Urbana-Champaign, Vimarsh Sathia University of Illinois Urbana Champaign, Gerasimos Gerogiannis University of Illinois at Urbana-Champaign, Serif Yesil NVIDIA, Josep Torrellas University of Illinois at Urbana-Champaign, Charith Mendis University of Illinois at Urbana-Champaign
10:30
20m
Talk
Fast Autoscheduling for Sparse ML Frameworks
Main Conference
Bobby Yan Stanford University, Alexander J Root Stanford University, Trevor Gale Stanford University, David Broman KTH Royal Institute of Technology, Fredrik Kjolstad Stanford University
10:50
20m
Talk
Eliminating Redundancy: Ultra-compact Code Generation for Programmable Dataflow Accelerators
Main Conference
Prasanth Chatarasi IBM Research, Alex Gatea IBM, Bardia Mahjour IBM, Jintao Zhang Unaffiliated, Alberto Mannari IBM, Chris Bowler IBM, Shubham Jain IBM Research, Masoud Ataei Jaliseh IBM, Nicole Khoun IBM, Kamlesh Kumar Unaffiliated, Viji Srinivasan IBM Research, Swagath Venkataramani IBM Research
11:30 - 12:50
AbstractionsMain Conference at Bronte
11:30
20m
Talk
Partial-Evaluation Templates: Accelerating Partial Evaluation with Pre-compiled Templates
Main Conference
Florian Huemer JKU Linz, Aleksandar Prokopec Oracle Labs, David Leopoldseder Oracle Labs, Raphael Mosaner Oracle Labs, Hanspeter Mössenböck JKU Linz
11:50
20m
Talk
Pyls: Enabling Python Hardware Synthesis with Dynamic Polymorphism via LCRS Encoding
Main Conference
Bolei Tong Wuhan University, Yongyan Fang Wuhan University, Wang Chaorui Wuhan University, Qingan Li Wuhan University, China, Jingling Xue University of New South Wales, YUAN Mengting School of Computer Science, Wuhan University, Wuhan, China
12:10
20m
Talk
SkeleShare: Algorithmic Skeletons and Equality Saturation for Hardware Resource Sharing
Main Conference
Jonathan Van der Cruysse McGill University, Tzung-Han Juang McGill University, Shakiba Bolbolian Khah McGill University, Christophe Dubach McGill University
12:30
20m
Talk
Ember: A Compiler for Embedding Operations on Decoupled Access-Execute Architectures
Main Conference
Marco Siracusa Barcelona Supercomputing Center; Universitat Politècnica de Catalunya, Olivia Hsu Stanford University, Víctor Soria-Pardos Barcelona Supercomputing Center, Joshua Randall Arm, Arnaud Grasset Arm, Eric Biscondi Arm, Douglas J. Joseph Arm, Randy Allen Barcelona Supercomputing Center, Fredrik Kjolstad Stanford University, Miquel Moreto Technical Univeristy of Catalonia, Adrià Armejach Sanosa Barcelona Supercomputing Center & Universitat Politècnica de Catalunya
14:10 - 15:30
14:10
20m
Talk
FORTE: Online DataFrame Query Optimizer
Main Conference
Yoonho Choi POSTECH, Kyoungtae Lee Seoul National University, Minji Kim Ewha Womans University, Hyungsoo Jung Seoul National University, Hyojin Sung Seoul National University
14:30
20m
Talk
LEGO: A Layout Expression Language for Code Generation of Hierarchical Mapping
Main Conference
Amir Mohammad Tavakkoli University of Utah, Cosmin E. Oancea University of Copenhagen, Denmark, Mary Hall University of Utah
14:50
20m
Talk
Pushing Tensor Accelerators beyond MatMul in a User-Schedulable Language
Main Conference
Yihong Zhang University of Washington, Derek Gerstmann Adobe, Andrew Adams Adobe Research, Maaz Bin Safeer Ahmad University of Washington, Seattle
15:10
20m
Talk
Tawa: Automatic Warp Specialization for Modern GPUs with Asynchronous References
Main Conference
Hongzheng Chen Cornell University, Bin Fan Nvidia, Alexander Collins NVIDIA, Bastian Hagedorn NVIDIA, Evghenii Gaburov NVIDIA, Masahiro Masuda NVIDIA, Matthew Brookhart NVIDIA, Chris Sullivan NVIDIA, Jason Knight NVIDIA, Zhiru Zhang Cornell University, USA, Vinod Grover NVIDIA
15:50 - 17:10
Parallelization / VectorizationMain Conference at Bronte
15:50
20m
Talk
Enabling Automatic Compiler-Driven Vectorization of Transformers
Main Conference
Shreya Alladi University of Murcia, Alberto Ros University of Murcia, Alexandra Jimborean University of Murcia
16:10
20m
Talk
Unlocking Python Multithreading Capabilities using OpenMP-Based Programming with OMP4Py
Main Conference
César Piñeiro University of Santiago de Compostela, Juan C. Pichel University of Santiago de Compostela
16:30
20m
Talk
The Parallel-Semantics Program Dependence Graph for Parallel Optimization
Main Conference
Yian Su Northwestern University, Brian Homerding Northwestern University, Haocheng Gao Northwestern University, Federico Sossai Northwestern University, Yebin Chon Princeton University, David I. August Princeton University, Simone Campanoni Google / Northwestern University
16:50
20m
Talk
From Threads to Tiles: T2T, a Compiler for CUDA-to-NPU Translation via 2D Vectorization
Main Conference
Shuaijiang Li Institute of Computing Technology at Chinese Academy of Sciences, Jiacheng Zhao Institute of Computing Technology at Chinese Academy of Sciences; University of Chinese Academy of Sciences; Zhongguancun Laboratory, Ying Liu Institute of Computing Technology, Chinese Academy of Sciences, Shuoming Zhang Institute of Computing Technology at Chinese Academy of Sciences, Lei Chen University of Chinese Academy of Sciences, Yijin Li Institute of Computing Technology at Chinese Academy of Sciences, Yangyu Zhang Institute of Computing Technology,Chinese Academy of Sciences, lizhicheng Institute of Computing Technology at Chinese Academy of Sciences, Runyu Zhou Institute of Computing Technology at Chinese Academy of Sciences, Xiyu Shi Institute of Computing Technology at Chinese Academy of Sciences, Chunwei Xia University of Leeds, Yuan Wen University of Aberdeen, Xiaobing Feng ICT CAS, Huimin Cui Institute of Computing Technology, Chinese Academy of Sciences
17:30 - 19:00
Business MeetingMain Conference at Bronte
17:30
90m
Meeting
Business Meeting
Main Conference

Tue 3 Feb

Displayed time zone: Hobart change

09:50 - 11:10
Code GenerationMain Conference at Bronte
09:50
20m
Talk
TPDE: A Fast Adaptable Compiler Back-End Framework
Main Conference
Tobias Schwarz TU Munich, Tobias Kamm TU Munich, Alexis Engelke TU Munich
10:10
20m
Talk
Synthesizing Instruction Selection Back-Ends from ISA Specifications Made Practical
Main Conference
Florian Drescher Technical University of Munich, Alexis Engelke TU Munich
10:30
20m
Talk
SparseX: Synergizing GPU Libraries for Sparse Matrix Multiplication on Heterogeneous Processors
Main Conference
Ruifeng Zhang North Carolina State University, Xiangwei Wang North Carolina State University, Ang Li Pacific Northwest National Laboratory, Xipeng Shen North Carolina State University
10:50
20m
Talk
Compilation of Generalized Matrix Chains with Symbolic Sizes
Main Conference
Francisco López Umeå University, Lars Karlsson Umeå University, Paolo Bientinesi Umeå University
11:30 - 12:50
Profiling / InstrumentationMain Conference at Bronte
11:30
20m
Talk
TRACE4J: A Lightweight, Flexible, and Insightful Performance Tracing Tool for Java
Main Conference
Haide He UC Merced, Pengfei Su University of California, Merced
11:50
20m
Talk
Proton: Towards Multi-level, Adaptive Profiling for Triton
Main Conference
Keren Zhou George Mason University, Tianle Zhong University of Virginia, Hao Wu George Mason University, Jihyeong Lee George Mason University, Yue Guan University of California at San Diego, Yufei Ding University of California at Santa Barbara, Corbin Robeck Meta, Yuanwei Fang Meta, Jeff Niu OpenAI, Philippe Tillet OpenAI
12:10
20m
Talk
On the Precision of Dynamic Program Fingerprints Based on Performance Counters
Main Conference
Anderson Faustino da Silva State University of Maringá, Sergio Queiroz de Medeiros Universidade Federal do Rio Grande do Norte, Marcelo Borges Nogueira Federal University of Rio Grande do Norte, Jeronimo Castrillon TU Dresden, Germany, Fernando Magno Quintão Pereira Federal University of Minas Gerais
12:30
20m
Talk
PASTA: A Modular Program Analysis Tool Framework for Accelerators
Main Conference
Mao Lin University of California Merced, Hyeran Jeon University of California, Merced, Keren Zhou George Mason University
14:10 - 15:30
14:10
20m
Talk
PIP: Making Andersen’s Points-to Analysis Sound and Practical for Incomplete C Programs
Main Conference
Håvard Rognebakke Krogstie NTNU, Helge Bahmann Independent Researcher, Magnus Själander Norwegian University of Science and Technology (NTNU), Nico Reissmann Independent Researcher
14:30
20m
Talk
Thinking Fast and Correct: Automated Rewriting of Numerical Code through Compiler Augmentation
Main Conference
Siyuan Brant Qian University of Illinois at Urbana-Champaign, Vimarsh Sathia University of Illinois Urbana Champaign, Ivan Ivanov Institute of Science Tokyo, Jan Hueckelheim Argonne National Laboratory, Paul Hovland Argonne National Laboratory, William S. Moses University of Illinois Urbana-Champaign
14:50
20m
Talk
PolyUFC: Polyhedral Compilation Meets Roofline Analysis for Uncore Frequency Capping
Main Conference
Nilesh Rajendra Shah Indian Institute of Technology Hyderabad, India, M V V S Manoj Kumar IIT Hyderabad, Dhairya Baxi IIT Hyderabad, Ramakrishna Upadrasta IIT Hyderabad
15:10
20m
Talk
Accelerating App Recompilation across Android System Updates by Code Reusing
Main Conference
Hongtao Wu Wuhan University, Yu Chen Chuzhou University, Mengfei Xie Wuhan University, Futeng Yang Guangdong OPPO Mobile Telecommunications, Jun Yan Guangdong OPPO Mobile Telecommunications, Jiang Ma OPPO Electronics Corp., Jianming Fu Wuhan University, Jason Xue MBZUAI, Qingan Li Wuhan University, China
15:50 - 17:10
Compiling for ML 2Main Conference at Bronte
15:50
20m
Talk
QIGen: A Kernel Generator for Inference on Nonuniformly Quantized Large Language Models
Main Conference
Tommaso Pegolotti ETH Zürich, Dan Alistarh IST Austria, Markus Püschel ETH Zurich
16:10
20m
Talk
DyPARS: Dynamic-Shape DNN Optimization via Pareto-Aware MCTS for Graph Variants
Main Conference
Hao Qian University of New South Wales, Guangli Li Institute of Computing Technology, Chinese Academy of Sciences, Qiuchu Yu Institute of Computing Technology at Chinese Academy of Sciences, Xueying Wang Beijing University of Posts and Telecommunications, Jingling Xue University of New South Wales
16:30
20m
Talk
Compiler-Runtime Co-operative Chain of Verification for LLM-Based Code Optimization
Main Conference
Hyunho Kwon Yonsei University, Sanggyu Shin SAIT, Ju Min Lee Yonsei University, Hoyun Youm Yonsei University, Seungbin Song SAIT, Seongho Kim Yonsei University, Hanwoong Jung Samsung Advanced Institute of Technology, Seungwon Lee Samsung Advanced Institute of Technology, Hanjun Kim Yonsei University
16:50
20m
Talk
Hexcute: A Compiler Framework for Automating Layout Synthesis in GPU Programs
Main Conference
Xiao Zhang University of Toronto; NVIDIA, Yaoyao Ding University of Toronto; Vector Institute; NVIDIA, Bolin Sun University of Toronto; NVIDIA, Yang Hu NVIDIA, Tatiana Shpeisman Google, Gennady Pekhimenko University of Toronto / Vector Institute
17:15 - 18:15

Wed 4 Feb

Displayed time zone: Hobart change

09:50 - 11:10
Tensor OptimizationMain Conference at Bronte
09:50
20m
Talk
Multidirectional Propagation of Sparsity Information across Tensor Slices
Main Conference
Kaio Henrique Andrade Ananias Universidade Federal de Minas Gerais, Danila Seliayeu University of Alberta, Jose Nelson Amaral University of Alberta, Fernando Magno Quintão Pereira Federal University of Minas Gerais
10:10
20m
Talk
Synthesizing Specialized Sparse Tensor Accelerators for FPGAs via High-Level Functional Abstractions
Main Conference
Hamza Javed McGill University, Canada, Christophe Dubach McGill University
10:30
20m
Talk
Progressive Low-Precision Approximation of Tensor Operators on GPUs: Enabling Greater Trade-Offs between Performance and Accuracy
Main Conference
Fan Luo Institute of Computing Technology at Chinese Academy of Sciences, Guangli Li Institute of Computing Technology, Chinese Academy of Sciences, Zhaoyang Hao Institute of Computing Technology at Chinese Academy of Sciences, Xueying Wang Beijing University of Posts and Telecommunications, Xiaobing Feng ICT CAS, Huimin Cui Institute of Computing Technology, Chinese Academy of Sciences, Jingling Xue University of New South Wales
10:50
20m
Talk
Tensor Program Superoptimization through Cost-Guided Symbolic Program Synthesis
Main Conference
Alexander Brauckmann University of Edinburgh, Aarsh Chaube University of Edinburgh, José Wesley De Souza Magalhães University of Edinburgh, Elizabeth Polgreen University of Edinburgh, Michael F. P. O'Boyle University of Edinburgh
11:30 - 12:50
OptimizationMain Conference at Bronte
11:30
20m
Talk
A Reinforcement Learning Environment for Automatic Code Optimization in the MLIR Compiler
Main Conference
Mohammed Tirichine New York University Abu Dhabi; Ecole nationale Supérieure d'Informatique, Nassim Ameur NYU Abu Dhabi; École Nationale Supérieure d’Informatique, Nazim Bendib NYU Abu Dhabi; École Nationale Supérieure d’Informatique, Iheb Nassim Aouadj NYU Abu Dhabi, Djad Bouchama NYU Abu Dhabi; University of Science and Technology Houari Boumediene, Rafik Bouloudene NYU Abu Dhabi; University of Science and Technology Houari Boumediene, Riyadh Baghdadi New York University Abu Dhabi
11:50
20m
Talk
Towards Threading the Needle of Debuggable Optimized Binaries
Main Conference
Cristian Assaiante Sapienza University of Rome, Simone Di Biasio Sapienza University of Rome, Snehasish Kumar Google LLC, Giuseppe Antonio Di Luna Sapienza University of Rome, Daniele Cono D'Elia Sapienza University of Rome, Leonardo Querzoni Sapienza University Rome
12:10
20m
Talk
Compiler-Assisted Instruction Fusion
Main Conference
Ravikiran Ravindranath Reddy University of Murcia, Sawan Singh AMD, Arthur Perais CNRS, Alberto Ros University of Murcia, Alexandra Jimborean University of Murcia
12:30
20m
Talk
LLM-VeriOpt: Verification-Guided Reinforcement Learning for LLM-Based Compiler Optimization
Main Conference
Xiangxin Fang Queen Mary University of London; University of Edinburgh, Jiaqin Kang Queen Mary University of London, Rodrigo C. O. Rocha University of Edinburgh, Sam Ainsworth University of Edinburgh, Lev Mukhanov IMEC (Cambridge); Queen Mary University London
12:50 - 13:20

Wed 4 Feb

Displayed time zone: Hobart change

Mon 2 Feb

Displayed time zone: Hobart change

Room9:0015304510:0015304511:0015304512:0015304513:0015304514:0015304515:0015304516:0015304517:0015304518:00153045
Bronte

Tue 3 Feb

Displayed time zone: Hobart change

Room9:0015304510:0015304511:0015304512:0015304513:0015304514:0015304515:0015304516:0015304517:00153045
Bronte