Publications

ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models. OSDI 2024.
