Publications
MICA: An End-to-End Compiler Stack for Mesh Accelerators
SOSP 2026 · Under Review
2026
Swarmpilot: A Scheduler Agent Framework for Large Agentic Workflow Clusters
EuroSys 2026 · Under Review
2026
BARSA: An Adaptive Test-Time Scaling Strategy for Mathematical Reasoning under Global Compute Budgets
ICML 2026 Workshop
2026
Ryze: Evidence-Enriched Data Synthesis from Biomedical Papers
ACL 2026 Demo
2026
ContextPilot: Efficient Retrieval-Augmented Generation with Accuracy-Preserving Context Reuse
MLSys 2026
2025
LLM-Monitor: Efficient Privacy Violation Monitoring for LLMs
ACL 2025 Demo · Under Review
2025
MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems
NeurIPS 2025
2024
ServerlessLLM: Locality-Enhanced Serverless Inference for Large Language Models
OSDI 2024
2024
Symplectic Structure-Preserving Particle-in-Cell Whole-Volume Simulation of Tokamak Plasmas
SC 2021
2021