Publications

See my Semantic Scholar profile page for more details.

Reports

Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities
Gemini Team, Google

Imagen 3
Imagen Team

Conference and Journal Papers

EM Distillation for One-step Diffusion Models
Sirui Xie, Zhisheng Xiao, Diederik P Kingma, Tingbo Hou, Ying Nian Wu, Kevin Patrick Murphy, Tim Salimans, Ben Poole, Ruiqi Gao
Conference on Neural Information Processing Systems (NeurIPS), 2024

MobileDiffusion: Subsecond Text-to-Image Generation on Mobile Devices
Yang Zhao, Yanwu Xu, Zhisheng Xiao, Tingbo Hou
European Conference on Computer Vision (ECCV), 2024

UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs
Yanwu Xu, Yang Zhao, Zhisheng Xiao, Tingbo Hou
IEEE / CVF Computer Vision and Pattern Recognition Conference (CVPR), 2024 (Highlight)

Adaptive Multi-stage Density Ratio Estimation for Learning Latent Space Energy-based Model
Zhisheng Xiao, Tian Han
Conference on Neural Information Processing Systems (NeurIPS), 2022 (Oral)

Tackling the Generative Learning Trilemma with Denoising Diffusion GANs
Code | Project page
Zhisheng Xiao, Karsten Kreis, and Arash Vahdat
International Conference on Learning Representations (ICLR), 2022 (Spotlight)

Two Symmetrized Coordinate Descent Methods Can Be O(n^2) Times Slower Than the Randomized Version
Peijun Xiao, Zhisheng Xiao, and Ruoyu Sun
SIAM Journal on Optimization, 2021

ControlVAE: Tuning, Analytical Properties, and Performance Analysis
Huajie Shao, Zhisheng Xiao, Shuochao Yao, Aston Zhang, Shengzhong Liu, and Tarek Abdelzaher
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021

VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models
Code
Zhisheng Xiao, Karsten Kreis, Jan Kautz, and Arash Vahdat
International Conference on Learning Representations (ICLR), 2021 (Spotlight)

Likelihood Regret: An Out-of-Distribution Detection Score For Variational Auto-encoder
Code
Zhisheng Xiao, Qing Yan, and Yali Amit
Conference on Neural Information Processing Systems (NeurIPS), 2020

Workshop Papers

Do We Really Need to Learn Representations from In-domain Data for Outlier Detection?
Zhisheng Xiao, Qing Yan, and Yali Amit
ICML Workshop on Uncertainty & Robustness in Deep Learning, 2021

EBMs Trained with Maximum Likelihood are Generator Models Trained with a Self-adverserial Loss
Zhisheng Xiao, Qing Yan, and Yali Amit
Energy Based Models Workshop - ICLR, 2021

Improving Sample Quality by Training and Sampling from Latent Energy
Zhisheng Xiao, Qing Yan, and Yali Amit
ICML Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models, 2020

Likelihood Regret: An Out-of-Distribution Detection Score For Variational Auto-encoder
Zhisheng Xiao, Qing Yan, and Yali Amit
ICML Workshop on Uncertainty & Robustness in Deep Learning, 2020 (Spotlight)

Preprints

DreamInpainter: Text-Guided Subject-Driven Image Inpainting with Diffusion Models
Shaoan Xie, Yang Zhao, Zhisheng Xiao, Kelvin C.K. Chan, Yandong Li, Yanwu Xu, Kun Zhang, Tingbo Hou

HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models
Zhonghao Wang, Wei Wei, Yang Zhao, Zhisheng Xiao, Mark Hasegawa-Johnson, Humphrey Shi, Tingbo Hou

A Method to Model Conditional Distributions with Normalizing Flows
Zhisheng Xiao, Qing Yan, and Yali Amit

Generative Latent Flow
Zhisheng Xiao, Qing Yan, and Yali Amit

Thesis

Designing Deep Generative Models with Symbiotic Composition
Zhisheng Xiao