News

Jan 05, 2026 Our technical report “RelayGR: Scaling Long-Sequence Generative Recommendation via Cross-Stage Relay-Race Inference” is now available on arXiv!
Jun 16, 2025 Our technical report “Serving Large Language Models on Huawei CloudMatrix384” is now available on arXiv!
Dec 09, 2024 Our paper “AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference” was accepted by AAAI 2025. Congratulations to Zhuomin and Yizhen!
May 05, 2024 Our two papers “Aceso: Achieving Efficient Fault Tolerance in Memory-Disaggregated Key-Value Stores” and “CHIME: A Cache-Efficient and High-Performance Hybrid Index on Disaggregated Memory” were accepted by SOSP’2024. Congratulations to Zhisheng and Xuchuan!
May 05, 2024 Our paper “Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention” was accepted by USENIX ATC’2024. Congratulations to Bin Gao!
Jul 15, 2023 Our paper “Ditto: An Elastic and Adaptive Memory-Disaggregated Caching System” was accepted by SOSP’2023. Congratulations to Jiacheng!
Mar 23, 2023 Our paper “SMART: A High-Performance Adaptive Radix Tree for Disaggregated Memory” was accepted by OSDI’2023. Congratulations to Xuchuan!
Feb 12, 2023 Our paper “ROLEX: A Scalable RDMA-oriented Learned Key-Value Store for Disaggregated Memory Systems” received the Best Paper Award at FAST 2023!
Dec 10, 2022 Our two papers “FUSEE: A Fully Memory-Disaggregated Key-Value Store” and “ROLEX: A Scalable RDMA-oriented Learned Key-Value Store for Disaggregated Memory Systems” were accepted by FAST 2023. Congratulations to Jiacheng and Pengfei!
May 08, 2022 Our paper “uKharon: A Membership Service for Microsecond Applications” was accepted by USENIX ATC 2022!
Jan 17, 2022 Our paper “RACE: One-Sided RDMA-Conscious Extendible Hashing” was accepted by ACM Transactions on Storage as an invited paper!
Dec 17, 2021 Our paper “FORD: Fast One-sided RDMA-based Distributed Transactions for Disaggregated Persistent Memory” was accepted by FAST 2022. Congratulations to Ming!
Oct 29, 2021 Our paper “FINEdex: A Fine-grained Learned Index Scheme for Scalable and Concurrent Memory Systems” was accepted by VLDB 2022. Congratulations to Pengfei!
Aug 29, 2021 I was awarded the 2020 ACM China Doctoral Dissertation Award!
Apr 29, 2021 Our paper “One-sided RDMA-Conscious Extendible Hashing for Disaggregated Memory” was accepted by USENIX ATC 2021!