Ao Wang (王骜)

Technical Expert at Alibaba Cloud Function Compute

I'm a Technical Expert on the Alibaba Cloud Function Compute team, leading work on Serverless GPU and ultra-efficient inference services. Previously, I was a Ph.D. candidate in the Computer Science Department at George Mason University, where I worked with Dr. Yue Cheng at LeapLab and DS2 Lab.

I earned my M.S. in Computer Science from George Washington University.

My research interests include Serverless AI (serverless GPU and inference), efficient LLM serving, FaaS, and cloud storage.

Education

George Mason University
Ph.D. Candidate, Computer Science
George Washington University
M.S., Computer Science

Experience

Alibaba Cloud
Technical Expert, Function Compute team
Alibaba Cloud
Research Intern, Function Compute team
George Mason University
Research/Teaching Assistant

Selected Publications

Serverless AI

FaaScale: Unlocking Fast LLM Scaling for Serverless Inference
Minchen Yu, Rui Yang, Chaobo Jia, Zhaoyuan Su, Sheng Yao, Tingfeng Lan, Yuchen Yang, Yue Cheng, Wei Wang, Ao Wang, Ruichuan Chen
MLSys 2026 [Paper]
Torpor: GPU-Enabled Serverless Computing for Low-Latency, Resource-Efficient Inference
Minchen Yu, Ao Wang, Dong Chen, Haoxuan Yu, Xiaonan Luo, Zhuohao Li, Wei Wang, Ruichuan Chen, Dapeng Nie, Haoran Yang
USENIX ATC 2025 [Paper]
Enabling Low-Latency, GPU-Efficient Serverless Inference with Model Swapping
Minchen Yu, Ao Wang, Dong Chen, Haoxuan Yu, Xiaonan Luo, Zhuohao Li, Wei Wang, Ruichuan Chen, Dapeng Nie, Haoran Yang
ACM Transactions on Architecture and Code Optimization (TACO)

Serverless and FaaS

Concurrency-Informed Orchestration for Serverless Functions
Qichang Liu, Yue Cheng, Haiying Shen, Ao Wang, Bharathan Balaji
ASPLOS 2025 [Paper]
InfiniStore: Elastic Serverless Cloud Storage
Jingyuan Zhang, Ao Wang, Xiaolong Ma, Benjamin Carver, Nicholas John Newman, Ali Anwar, Lukas Rupprecht, Dimitrios Skourtis, Vasily Tarasov, Feng Yan, Yue Cheng
VLDB 2023 [Paper]
Owl: Performance-Aware Scheduling for Resource-Efficient Function-as-a-Service Cloud
Huangshi Tian, Suyi Li, Ao Wang, Wei Wang, Tianlong Wu, Haoran Yang
SoCC 2022 [Paper]
FaaSNet: Scalable and Fast Provisioning of Custom Serverless Container Runtimes at Alibaba Cloud Function Compute
Ao Wang, Shuai Chang, Huangshi Tian, Hongqi Wang, Haoran Yang, Huiba Li, Rui Du, Yue Cheng
USENIX ATC 2021 [Paper]
Wukong: A Scalable and Locality-Enhanced Framework for Serverless Parallel Computing
Benjamin Carver, Jingyuan Zhang, Ao Wang, Ali Anwar, Panruo Wu, Yue Cheng
SoCC 2020 [Paper]
InfiniCache: Exploiting Ephemeral Serverless Functions to Build a Cost-Effective Memory Cache
Ao Wang, Jingyuan Zhang, Xiaolong Ma, Ali Anwar, Lukas Rupprecht, Dimitrios Skourtis, Vasily Tarasov, Feng Yan, Yue Cheng
USENIX FAST 2020 [Paper]

Patents

Image Acceleration System, Method, and Apparatus (镜像加速系统、方法及装置)
Patent No. CN115499449B [PDF]
Container Lifecycle Management and Function Compute Method, Device, and Storage Medium (容器的生命周期管理、函数计算方法、设备及存储介质)
Patent No. CN114489947B [PDF]

Professional Activities

  • EuroSys 2020 Shadow Program Committee

Selected Awards

  • Student Travel Grant, USENIX FAST 2020
  • Student Travel Grant, USENIX NSDI 2019