Ao Wang (王骜)
I’m a Technical Expert on the Alibaba Cloud Function Compute team, leading work on Serverless GPU and ultra-efficient inference service.
I was a Ph.D. candidate from the Computer Science Department at George Mason University, where I worked with Dr. Yue Cheng at LeapLab and DS2 Lab.
I earned an M.S. degree from the Computer Science Department of George Washington University.
My research interest focuses on Serverless AI (serverless GPU and inference), efficient LLM serving, FaaS, and cloud storage.
Publications
- Torpor: GPU-Enabled Serverless Computing for Low-Latency, Resource-Efficient Inference
- Minchen Yu, Ao Wang, Dong Chen, Haoxuan Yu, Xiaonan Luo, Zhuohao Li, Wei Wang, Ruichuan Chen, Dapeng Nie, Haoran Yang
- USENIX ATC 2025
- λScale: Enabling Fast Scaling for Serverless Large Language Model Inference
- Minchen Yu, Rui Yang, Chaobo Jia, Zhaoyuan Su, Sheng Yao, Tingfeng Lan, Yuchen Yang, Yue Cheng, Wei Wang, Ao Wang, Ruichuan Chen
- Arxiv preprint, 2025
- Concurrency-Informed Orchestration for Serverless Functions
- Qichang Liu, Yue Cheng, Haiying Shen, Ao Wang, Bharathan Balaji
- ASPLOS 2025
- InfiniStore: Elastic Serverless Cloud Storage
- Jingyuan Zhang, Ao Wang, Xiaolong Ma, Benjamin Carver, Nicholas John Newman, Ali Anwar, Lukas Rupprecht, Dimitrios Skourtis, Vasily Tarasov, Feng Yan, Yue Cheng
- VLDB 2023
- Owl: Performance-Aware Scheduling for Resource-Efficient Function-as-a-Service Cloud
- Huangshi Tian, Suyi Li, Ao Wang, Wei Wang, Tianlong Wu, Haoran Yang
- SoCC 2022
- FaaSNet: Scalable and Fast Provisioning of Custom Serverless Container Runtimes at Alibaba Cloud Function Compute
- Ao Wang, Shuai Chang, Huangshi Tian, Hongqi Wang, Haoran Yang, Huiba Li, and Rui Du, Yue Cheng
- USENIX ATC 2021
- Wukong: A Scalable and Locality-Enhanced Framework for Serverless Parallel Computing
- Benjamin Carver, Jingyuan Zhang, Ao Wang, Ali Anwar, Panruo Wu, Yue Cheng
- SoCC 2020
- InfiniCache: Exploiting Ephemeral Serverless Functions to Build a Cost-Effective Memory Cache
- Ao Wang, Jingyuan Zhang, Xiaolong Ma, Ali Anwar, Lukas Rupprecht, Dimitrios Skourtis, Vasily Tarasov, Feng Yan, Yue Cheng
- USENIX FAST 2020
- IEEE Spectrum: Cloud Services Tool Lets You Pay for Data You Use - Not Data You Store
- In Search of a Fast and Efficient Serverless DAG Engine
- Benjamin Carver, Jingyuan Zhang, Ao Wang, Yue Cheng
- PDSW 2019
- HyperFaaS: A Truly Elastic Serverless Computing Framework
- Jingyuan Zhang, Ao Wang, Min Li, Yuan Chen, Yue Cheng
- USENIX NSDI 2019 (poster)
Working Experience
- 10/2021 - present, Alibaba Cloud, Function Compute team
- 08/2020 - 10/2021, Research Intern, Alibaba Cloud, Function Compute team
- 05/2019 - 08/2020, Research Assistant, George Mason University
- 08/2018 - 05/2019, Teaching Assistant, George Mason University
Professional Activities
- EuroSys 2020 shadow PC
Selected Award
- Student Travel Grant, USENIX FAST 2020
- Student Travel Grant, USENIX NSDI 2019