Publications

SkyServe: Serving AI Models across Regions and Clouds with Spot Instances
Ziming Mao*, Tian Xia*, Zhanghao Wu, Wei-Lin Chiang, Tyler Griggs, Romil Bhardwaj, Zongheng Yang, Scott Shenker, Ion Stoica
EuroSys 2025
[Paper] [Code]

Locality-aware Fair Scheduling in LLM Serving
Shiyi Cao, Yichuan Wang, Ziming Mao, Pin-Lun Hsu, Liangsheng Yin, Tian Xia, Dacheng Li, Shu Liu, Yineng Zhang, Yang Zhou, Ying Sheng, Joseph Gonzalez, Ion Stoica
arXiv preprint
[Paper]

Revisiting Cache Freshness for Emerging Real-Time Applications
Ziming Mao, Rishabh Iyer, Scott Shenker, Ion Stoica
HotNets 2024
[Paper]

The Streaming Batch Model for Efficient and Fault-Tolerant Heterogeneous Execution
Frank Sifei Luan, Ziming Mao, Ron Yifeng Wang, Charlotte Lin, Amog Kamsetty, Hao Chen, Cheng Su, Balaji Veeramani, Scott Lee, SangBin Cho, Clark Zinzow, Eric Liang, Ion Stoica, Stephanie Wang
arXiv preprint
[Paper] [Code]

Trinity: A Fast and Space-efficient Multi-attribute Data Store
Ziming Mao, Anurag Khandelwal, Kiran Srinivasan
EuroSys 2024, Best Student Paper Award
[Paper] [Code]

Can’t Be Late: Optimizing Spot Instance Savings under Deadlines
Zhanghao Wu, Wei-Lin Chiang, Ziming Mao, Zongheng Yang, Eric Friedman, Scott Shenker, Ion Stoica
NSDI 2024, Outstanding Paper Award
[Paper] [Code]

GL-Cache: Group-level Learning for Efficient and High-Performance Caching
Juncheng Yang, Ziming Mao, Rashmi Vinayak, Yao Yue
FAST 2023
[Paper] [Code]

Pie: Pooling CPU Memory for LLM Inference.
Yi Xu, Ziming Mao, Xiangxi Mo, Shu Liu, Ion Stoica
arXiv preprint
[Paper]

DYLE: Dynamic Latent Extraction for Abstractive Long-Input Summarization
Ziming Mao*, Chen Henry Wu*, Ansong Ni, Yusen Zhang, Rui Zhang, Tao Yu, Budhaditya Deb, Chenguang Zhu, Ahmed Hassan Awadallah, Dragomir R. Radev
ACL 2022
[Paper] [Code]

SummN: A Multi-Stage Summarization Framework for Long Input Dialogues and Documents
Yusen Zhang, Ansong Ni, Ziming Mao, Chen Henry Wu, Chenguang Zhu, Budhaditya Deb, Ahmed Hassan Awadallah, Dragomir R. Radev, Rui Zhang
ACL 2022
[Paper] [Code]

FeTaQA: Free-form Table Question Answering
Linyong Nan, Chiachun Hsieh, Ziming Mao, Xi Victoria Lin, Neha Verma, Rui Zhang, Wojciech Kryściński, Nick Schoelkopf, Riley Kong, Xiangru Tang, Murori Mutuma, Ben Rosand, Isabel Trindade, Renusree Bandaru, Jacob Cunningham, Caiming Xiong, Dragomir Radev
TACL 2022
[Paper] [Code]

Investigating Crowdsourcing Protocols for Evaluating the Factual Consistency of Summaries
Xiangru Tang, Alexander Fabbri, Haoran Li, Ziming Mao, Griffin Adams, Borui Wang, Asli Celikyilmaz, Yashar Mehdad, Dragomir Radev
NAACL 2022
[Paper]

Enhancing fouling mitigation of submerged flat-sheet membranes by vibrating 3D-spacers
Yong Zen Tan, Ziming Mao, Yanjun Zhang, Wen See Tan, Tzyy Haur Chong, Bing Wu, Jia Wei Chew
Sep. Purif. Technol. 2019
[Paper]

Spacer vibration for fouling control of submerged flat sheet membranes
Bing Wu, Yanjun Zhang, Ziming Mao, Wen See Tan, Yong Zen Tan, Jia Wei Chew, Tzyy Haur Chong, Anthony G Fane
Sep. Purif. Technol. 2019
[Paper]