Exploring CXL-based KV Cache Storage for LLM Serving

Publication
To appear (NeurIPS 2024 ML for Systems Workshop)
Yupeng Tang
Research Scientist @ Meta

My research interests include distributed systems, software-hardware co-design and hardware accelerators.