PriTran: Privacy-Preserving Inference for Transformer-Based Language Models under Fully Homomorphic Encryption
Transformer-based language models power many cloud services, but running inference on sensitive data raises confidentiality concerns. Fully Homomorphic Encryption (FHE) enables computation directly on encrypted inputs, yet its high computational cost makes Transformers difficult to deploy. This paper presents PriTran, an efficient CKKS-based library for privacy-preserving Transformer inference on CPUs. Going beyond the only prior work, RoLe, which supports only BERT-Tiny (2 encoders), PriTran introduces two novel algorithms with optimized data layouts that accelerate ciphertext–plaintext (CP) and ciphertext–ciphertext (CC) matrix multiplications (MMs) across all BERT models by reducing costly homomorphic rotations and multiplications. On the MNLI dataset, RoLe fails on inputs longer than 36 tokens within a 5-hour per-token budget, whereas PriTran achieves average speedups of 29.3% for CP-MMs, 22.2% for CC-MMs, and 24.1% end-to-end. We further evaluate PriTran on scaled BERT-Tiny variants with additional encoders and on BERT-Mini (4 encoders), demonstrating correctness and scalability beyond RoLe's limits. Together, these gains and RoLe's failure on longer inputs underscore PriTran's promise as a practical approach to FHE-based Transformer inference within current FHE constraints.
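To make the cost model concrete, the sketch below simulates, in plain Python, the classic diagonal (Halevi–Shoup) method that CKKS libraries commonly use for ciphertext–plaintext matrix–vector products. This is an illustrative assumption, not PriTran's actual algorithm or data layout: it only shows why slot rotations and plaintext multiplications dominate CP-MM cost, which are exactly the operations the abstract says PriTran reduces.

```python
# Illustrative only: a plain-Python stand-in for a CKKS CP matrix-vector
# product via the diagonal (Halevi-Shoup) method. The list `x` plays the
# role of an encrypted slot vector; `rotate` plays the role of a
# homomorphic Rotate; the per-diagonal multiply-add plays the role of a
# plaintext multiplication plus addition on ciphertext slots.
# This is NOT PriTran's optimized layout, just the baseline cost pattern.

def rotate(v, k):
    """Cyclic left rotation of the slot vector by k positions."""
    return v[k:] + v[:k]

def diag_matvec(A, x):
    """Compute A @ x with n rotations and n plaintext multiplications,
    where A is an n x n plaintext matrix and x is the (simulated)
    encrypted slot vector."""
    n = len(x)
    acc = [0.0] * n
    for d in range(n):
        # d-th generalized diagonal of A: diag_d[i] = A[i][(i + d) % n]
        diag_d = [A[i][(i + d) % n] for i in range(n)]
        rot = rotate(x, d)                                      # 1 rotation
        acc = [a + p * r for a, p, r in zip(acc, diag_d, rot)]  # 1 CP mult + add
    return acc
```

Under this baseline, an n x n CP-MM costs n rotations and n plaintext multiplications per slot vector; layout optimizations of the kind the paper describes aim to shrink precisely these counts.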