FHEFusion: Enabling Operator Fusion in FHE Compilers for Depth-Efficient DNN Inference
Operator fusion is essential for accelerating FHE-based DNN inference because it reduces multiplicative depth and, in turn, lowers the cost of ciphertext operations by keeping them at lower ciphertext levels. Existing approaches either rely on manual optimizations, which miss cross-operator opportunities, or on compiler pattern matching, which lacks generality. Standard DNN graphs omit FHE-specific behaviors, while fully lowering to primitive FHE operations introduces excessive granularity and obstructs effective optimization.
We present FHEFusion, a compiler framework for the CKKS scheme that enables fusion through a new IR. This IR preserves high-level DNN semantics while introducing FHE-aware operators—masking and compaction ($\mathsf{Strided\_Slice}$)—that are central to CKKS, thereby exposing broader fusion opportunities. Guided by algebraic rules and an FHE-aware cost model, FHEFusion reduces multiplicative depth and identifies profitable fusions. Integrated into ANT-ACE, a state-of-the-art FHE compiler, FHEFusion outperforms nGraph, the only framework with graph-level fusion, achieving up to $3.02\times$ (average $1.40\times$) speedup across seven DNNs (13 variants from different ReLU approximations) on CPUs, while maintaining inference accuracy.
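To illustrate the kind of algebraic fusion rule the abstract describes, the sketch below simulates CKKS slot masking in plain Python (no actual FHE library; the function names `mask` and `fuse_masks` are hypothetical, not FHEFusion's API). In CKKS, each ciphertext-plaintext multiplication consumes one level of multiplicative depth, so rewriting two consecutive masks into one fused mask saves a level:

```python
# Plaintext simulation of CKKS slot masking (illustrative only; no real FHE).
# Each call to mask() models one ciphertext-plaintext multiplication,
# which in CKKS consumes one level of multiplicative depth.

def mask(vec, m):
    """Slot-wise multiply, modelling one ciphertext-plaintext mult."""
    return [v * b for v, b in zip(vec, m)]

def fuse_masks(m1, m2):
    """Algebraic rule: mask(mask(x, m1), m2) == mask(x, m1 * m2)."""
    return [a * b for a, b in zip(m1, m2)]

x  = [1.0, 2.0, 3.0, 4.0]
m1 = [1, 1, 0, 0]   # keep the first half of the slots
m2 = [1, 0, 1, 0]   # keep the even-indexed slots

unfused = mask(mask(x, m1), m2)        # two mults -> depth 2
fused   = mask(x, fuse_masks(m1, m2))  # one mult  -> depth 1
assert unfused == fused
```

The fused form computes the same result with half the depth, which is exactly why depth-aware fusion lowers ciphertext levels and, in turn, per-operation cost.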