Publications
2026
- [EuroSys] Taming Latency-Memory Trade-Off in MoE-Based LLM Serving via Fine-Grained Expert Offloading. In ACM European Conference on Computer Systems, 2026
2025
- [TPDS] Accelerating ML Inference via Opportunistic Pre-Loading on Serverless Clusters. IEEE Transactions on Parallel and Distributed Systems, 2025
- [SoCC] Multi-Agent Reinforcement Learning with Serverless Computing. In ACM Symposium on Cloud Computing, 2025
- [VLDB] Nitro: Boosting Distributed Reinforcement Learning with Serverless Computing. The VLDB Endowment, 2025
2024
- [SoCC] Pre-Warming is Not Enough: Accelerating Serverless Inference With Opportunistic Pre-Loading. In ACM Symposium on Cloud Computing, 2024
- [TPDS] Freyr+: Harvesting Idle Resources in Serverless Computing via Deep Reinforcement Learning. IEEE Transactions on Parallel and Distributed Systems, 2024
- [SC] Stellaris: Staleness-Aware Distributed Reinforcement Learning with Serverless Computing. In ACM/IEEE International Conference for High Performance Computing, Networking, Storage, and Analysis, 2024
- [AAAI] Cheaper and Faster: Distributed Deep Reinforcement Learning with Serverless Computing. In The Thirty-Eighth AAAI Conference on Artificial Intelligence, 2024
- [ASPLOS] RainbowCake: Mitigating Cold-starts in Serverless with Layer-wise Container Caching and Sharing. In ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 1, 2024
2023
- [HPDC] Libra: Harvesting Idle Resources Safely and Timely in Serverless Clusters. In ACM International Symposium on High-Performance Parallel and Distributed Computing, 2023
2022
- [WWW] Accelerating Serverless Computing by Harvesting Idle Resources. In ACM Web Conference 2022, 2022
2021
- [ACSOS] FaaSRank: Learning to Schedule Functions in Serverless Platforms. In IEEE International Conference on Autonomic Computing and Self-Organizing Systems (ACSOS), 2021
- [ICPE] Enhancing Observability of Serverless Computing with the Serverless Application Analytics Framework. In Companion of the ACM/SPEC International Conference on Performance Engineering, 2021
- [WoSC] The Serverless Application Analytics Framework: Enabling Design Trade-off Evaluation for Serverless Software. In The 2020 Sixth International Workshop on Serverless Computing, 2021
2020
- [arXiv] Leveraging GPT-2 for Classifying Spam Reviews with Limited Labeled Data via Adversarial Training. arXiv preprint arXiv:2012.13400, 2020
- [CBDCom] Implications of Programming Language Selection for Serverless Data Processing Pipelines. In IEEE International Conference on Cloud and Big Data Computing, 2020