Exploring CXL-based KV Cache Storage for LLM Serving

Publication
To appear (NeurIPS 2024 ML for Systems Workshop)
Yupeng Tang
Final-year PhD student @ Yale University

My research interests include distributed systems, memory disaggregation and hardware accelerators.