Pengfei Zuo
I currently serve as the chief architect of AI-native storage at Huawei Cloud. I lead the EMS (Elastic Memory Service) team to build a disaggregated memory service layer in the cloud, upgrading Huawei Cloud’s two-tier infrastructure architecture that disaggregates storage and computing into a three-tier architecture that disaggregates computing, memory, and storage. I received my Ph.D. degree (advised by Prof. Yu Hua) in Computer Science from Huazhong University of Science and Technology (HUST) in 2019. I was a visiting Ph.D. student (advised by Prof. Yuan Xie) at the University of California, Santa Barbara (UCSB) during 2018-2019. I received a B.E. degree in Computer Science from HUST in 2014. My research interests span cloud infrastructure, machine learning systems, storage systems, and distributed systems. I have published 40+ refereed papers in major conferences and journals in the field of computer systems and architecture, including SOSP, OSDI, MICRO, ASPLOS, FAST, USENIX ATC, VLDB, DAC, etc. I obtained the 2020 ACM China Doctoral Dissertation Award (only two awardees among all computer disciplines across China every year) and the Best Paper Award in FAST 2023. The open-source codes of our research on AI systems and disaggregated data centers are available at ASISys and dmemsys, respectively. Email: pfzuo.cs@gmail.com, pengfei.zuo@huawei.com |
![]() |
I am seeking motivated interns and postdocs in AI systems. If you’re passionate about tackling key industry challenges and publishing impactful research, feel free to reach out!
News
-
Our paper “AdaSkip: Adaptive Sublayer Skipping for Accelerating Long-Context LLM Inference” was accepted by AAAI 2025. Congratulations to Zhuomin and Yizhen!
-
Our two papers “Aceso: Achieving Efficient Fault Tolerance in Memory-Disaggregated Key-Value Stores” and “CHIME: A Cache-Efficient and High-Performance Hybrid Index on Disaggregated Memory” were accepted by SOSP’2024. Congratulations to Zhisheng and Xuchuan!
-
Our paper “Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention” was accepted by USENIX ATC’2024. Congratulations to Bin Gao!
-
Our paper “Ditto: An Elastic and Adaptive Memory-Disaggregated Caching System” was accepted by SOSP’2023. Congratulations to Jiacheng!
-
Our paper “SMART: A High-Performance Apative Radix Tree for Disaggregated Memory” was accepted by OSDI’2023. Congratulations to Xuchuan!
-
Our paper “ROLEX: A Scalable RDMA-oriented Learned Key-Value Store for Disaggregated Memory Systems” received the Best Paper Award of FAST 2023!
-
Our two papers “FUSEE: A Fully Memory-Disaggregated Key-Value Store” and “ROLEX: A Scalable RDMA-oriented Learned Key-Value Store for Disaggregated Memory Systems” were accepted by FAST 2023. Congratulations to Jiacheng and Pengfei!
-
Our paper “uKharon: A Membership Service for Microsecond Applications” was accepted by USENIX ATC 2022!
-
Our paper “RACE: One-Sided RDMA-Conscious Extendible Hashing” was accepted by ACM Transactions on Storage as an invited paper!
-
Our paper “FORD: Fast One-sided RDMA-based Distributed Transactions for Disaggregated Persistent Memory” was accepted by FAST 2022. Congratulations to Ming!
-
Our paper “FINEdex: A Fine-grained Learned Index Scheme for Scalable and Concurrent Memory Systems” was accepted by VLDB 2022. Congratulations to Pengfei!
-
I was awarded with 2020 ACM China Doctoral Dissertation Award!
-
Our paper “One-sided RDMA-Conscious Extendible Hashing for Disaggregated Memory” was accepted by USENIX ATC 2021!
Selected Publications
-
[SOSP] Zhisheng Hu, Pengfei Zuo, Yizou Chen, Chao Wang, Junliang Hu, Ming-Chang Yang, “Aceso: Achieving Efficient Fault Tolerance in Memory-Disaggregated Key-Value Stores”, Proceedings of the 30th ACM Symposium on Operating Systems Principles (SOSP), 2024.
-
[SOSP] Xuchuan Luo, Jiacheng Shen, Pengfei Zuo, Xin Wang, Michael R. Lyu, Yangfan Zhou, “CHIME: A Cache-Efficient and High-Performance Hybrid Index on Disaggregated Memory”, Proceedings of the 30th ACM Symposium on Operating Systems Principles (SOSP), 2024.
-
[USENIX ATC] Bin Gao, Zhuomin He, Puru Sharma, Qingxuan Kang, Djordje Jevdjic, Junbo Deng, Xingkun Yang, Zhou Yu, Pengfei Zuo, “Cost-Efficient Large Language Model Serving for Multi-turn Conversations with CachedAttention”, Proceedings of the 2024 USENIX Annual Technical Conference (USENIX ATC), 2024.
-
[SOSP] Jiacheng Shen, Pengfei Zuo, Xuchuan Luo, Yuxin Su, Jiazhen Gu, Hao Feng, Yangfan Zhou, Michael Lyu, “Ditto: An Elastic and Adaptive Memory-Disaggregated Caching System”, Proceedings of the 29th ACM Symposium on Operating Systems Principles (SOSP), 2023.
-
[OSDI] Xuchuan Luo, Pengfei Zuo, Jiacheng Shen, Jiazhen Gu, Xin Wang, Michael Lyu, Yangfan Zhou, “SMART: A High-Performance Apative Radix Tree for Disaggregated Memory”, Proceedings of the 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2023.
(Recommended for fast-track publication in ACM Transactions on Storage) -
[FAST] Pengfei Li, Yu Hua, Pengfei Zuo, Zhangyu Chen, Jiajie Sheng, “ROLEX: A Scalable RDMA-oriented Learned Key-Value Store for Disaggregated Memory Systems”, Proceedings of the 21st USENIX Conference on File and Storage Technologies (FAST), 2023.
(Recommended for fast-track publication in ACM Transactions on Storage)
(Best Paper Award) -
[FAST] Jiacheng Shen, Pengfei Zuo, Xuchuan Luo, Tianyi Yang, Yuxin Su, Yangfan Zhou, Michael Lyu, “FUSEE: A Fully Memory-Disaggregated Key-Value Store”, Proceedings of the 21st USENIX Conference on File and Storage Technologies (FAST), 2023.
-
[USENIX ATC] Rachid Guerraoui, Antoine Murat, Javier Picorel, Athanasios Xygkis, Huabing Yan, Pengfei Zuo, “uKharon: A Membership Service for Microsecond Applications”, Proceedings of the USENIX Annual Technical Conference (USENIX ATC), 2022.
-
[FAST] Ming Zhang, Yu Hua, Pengfei Zuo, Lurong Liu, “FORD: Fast One-sided RDMA-based Distributed Transactions for Disaggregated Persistent Memory”, Proceedings of the 20th USENIX Conference on File and Storage Technologies (FAST), 2022.
-
[VLDB] Pengfei Li, Yu Hua, Jingnan Jia, Pengfei Zuo, “FINEdex: A Fine-grained Learned Index Scheme for Scalable and Concurrent Memory Systems”, Proceedings of the 48th International Conference on Very Large Data Bases (VLDB), 2022.
-
[USENIX ATC] Pengfei Zuo, Jiazhao Sun, Liu Yang, Shuangwu Zhang, Yu Hua, “One-sided RDMA-Conscious Extendible Hashing for Disaggregated Memory”, Proceedings of the USENIX Annual Technical Conference (USENIX ATC), 2021.
(Recommended for fast-track publication in ACM Transactions on Storage: recommendations by ATC 2021 program co-chairs) -
[DAC] Pengfei Zuo, Yu Hua, Ling Liang, Xingfeng Xie, Xing Hu, Yuan Xie, “SEALing Neural Network Models in Encrypted Deep Learning Accelerators”, Proceedings of the 58th Design Automation Conference (DAC), 2021.
-
[USENIX ATC] Zhangyu Chen, Yu Hua, Bo Ding, Pengfei Zuo, “Lock-free Concurrent Level Hashing for Persistent Memory”, Proceedings of the USENIX Annual Technical Conference (USENIX ATC), 2020.
-
[ASPLOS] Xing Hu, Ling Liang, Shuangchen Li, Lei Deng, Pengfei Zuo, Yu Ji, Xinfeng Xie, Yufei Ding, Chang Liu, Timothy Sherwood, Yuan Xie, “DeepSniffer: a DNN Model Extraction Framework Based on Learning Architectural Hints”, Proceedings of the 25th International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS), 2020.
-
[DAC] Zhangyu Chen, Yu Hua, Pengfei Zuo, Yuanyuan Sun, Yuncheng Guo, “Reducing Bit Writes in Non-volatile Main Memory by Similarity-aware Compression”, Proceedings of the 57th Design Automation Conference (DAC), 2020.
-
[MICRO] Pengfei Zuo, Yu Hua, Yuan Xie, “SuperMem: Enabling Application-transparent Secure Persistent Memory with Low Overheads”, Proceedings of the 52nd IEEE/ACM International Symposium on Microarchitecture (MICRO), 2019. [slides]
-
[OSDI] Pengfei Zuo, Yu Hua, Jie Wu, “Write-Optimized and High-Performance Hashing Index Scheme for Persistent Memory”, in Proceedings of the 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI), 2018. [slides] [code]
(Recommended for fast-track publication in ACM Transactions on Storage: recommendations by OSDI 2018 program co-chairs)
(Finalist for the Memorable Paper Award in NVMW 2019) -
[MICRO] Pengfei Zuo, Yu Hua, Ming Zhao, Wen Zhou, Yuncheng Guo, “Improving the Performance and Endurance of Encrypted Non-volatile Main Memory through Deduplicating Writes”, in Proceedings of the 51st IEEE/ACM International Symposium on Microarchitecture (MICRO), 2018. [slides][video]
-
[USENIX ATC] Yuanyuan Sun, Yu Hua, Song Jiang, Qiuyu Li, Shunde Cao, Pengfei Zuo, “SmartCuckoo: A Fast and Cost-Efficient Hashing Index Scheme for Cloud Storage Systems”, in Proceedings of USENIX Annual Technical Conference (USENIX ATC), 2017.
Awards and Honors
- Best Paper Award in FAST 2023.
- ACM China Doctoral Dissertation Award, 2020. (Only two awardees among all computer disciplines across China every year.)
- ACM SIGOPS ChinaSys Doctoral Dissertation Award, 2020.
- CCF Doctoral Dissertation Award Nominee, 2020.
- Student Grant from MICRO, 2019.
- Shenzhen Stock Exchange Scholarship, 2019.
- Finalist for the Memorable Paper Award in NVMW 2019.
- National Scholarship for Ph.D. Graduate Students, 2018.
- Student Grant from OSDI, 2018.
- Student Grant from MICRO, 2018.
- National Scholarship for Ph.D. Graduate Students, 2017.
- ZhiXing Excellent Graduate Scholarship, 2015.
- Excellent Graduate Student in HUST, 2015, 2016, 2017, and 2018.
- Award of Excellent B.E. Thesis in Hubei Province, 2014.
Professional Services
PC Member at conferences:
- NSDI 2025
- Cloud 2021, 2022
- ICDCS 2021
- CloudCom 2020
- ICPADS 2020
- ChinaSys 2021, 2022, 2023, 2024
- Eurosys 2019 (Shadow)
Reviewer for Journals:
- IEEE Transactions on Parallel and Distributed Systems (TPDS)
- IEEE Transactions on Computers (TC)
- ACM Transactions on Storage (TOS)
- IEEE Transactions on Dependable and Secure Computing (TDSC)
- IEEE Micro
Talks
12/2021 | “One-sided RDMA-Conscious Extendible Hashing for Disaggregated Memory”, Invited Talk in CCF Sys 2021. | Hangzhou, China |
08/2021 | “One-sided RDMA-Conscious Extendible Hashing for Disaggregated Memory”, Invited Talk in the Young Scientists of Computing Academy (YSCA) workshop in TURC 2021. | Hefei, China |
12/2020 | “The Growing Course of A PhD in Computer System and Architecture”, Invited Talk in the 19th ChinaSys workshop. | Chongqing, China |
01/2020 | “The Growing Course of A PhD in Computer System and Architecture”, Invited Talk in the 2nd Summit Forum of SKL of Computer Architecture. | Beijing, China |
12/2019 | “SuperMem: Revitalizing the Write-through Cache for Secure Persistent Memory”, Invited Talk in the 17th ChinaSys workshop. | Zhuhai, China |
03/2019 | “Write-Optimized and High-Performance Hashing Index Scheme for Persistent Memory”, Presentation in the 10th Non-volatile Memory Workshop (NVMW 2019). | San Deigo, USA |
10/2018 | “Improving the Performance and Endurance of Encrypted Non-volatile Main Memory through Deduplicating Writes”, Paper Presentation in the 51st International Symposium on Microarchitecture (MICRO 2018). | Fukuoka, Japan |
10/2018 | “Write-Optimized and High-Performance Hashing Index Scheme for Persistent Memory”, Paper Presentation in the 13th USENIX Symposium on Operating Systems Design and Implementation (OSDI 2018). | Carlsbad, USA |
09/2018 | “Write-Optimized and High-Performance Hashing Index Scheme for Non-volatile Memory”, Invited Talk in the 24th National Conference of Information Storage (NCIS 2018). | Beijing, China |
06/2018 | “Mitigating Traffic-based Side Channel Attacks in Bandwidth-efficient Cloud Storage”, Paper Presentation in the 32nd IEEE International Parallel and Distributed Processing Symposium (IPDPS 2018). | Vancouver, Canada |
04/2018 | “A Write-friendly and Cache-optimized Hashing Scheme for Non-volatile Memory Systems”, Invited Talk in the Aliyun-CCF Excellent Paper Exchange Meeting, Shanghai Jiaotong University. | Shanghai, China |
09/2017 | “A Write-friendly Hashing Scheme for Non-volatile Memory Systems”, Invited Talk in the 23th National Conference of Information Storage (NCIS 2017), Northwestern Polytechnical University. | Xi’an, China |
06/2017 | “BEES: Bandwidth- and Energy- Efficient Image Sharing for Real-time Situation Awareness”, Paper Presentation in the 37th International Conference on Distributed Computing Systems (ICDCS 2017). | Atlanta, USA |
05/2017 | “A Write-friendly Hashing Scheme for Non-volatile Memory Systems”, Paper Presentation in the 33rd International Conference on Massive Storage Systems and Technology (MSST 2017). | San Jose, USA |
05/2017 | “A Cost-efficient Rewriting Scheme to Improve Restore Performance in Deduplication Systems”, Paper Presentation in the 33rd International Conference on Massive Storage Systems and Technology (MSST 2017). | San Jose, USA |
06/2015 | “An Efficient Cuckoo Hashing Scheme for Cloud Storage Systems”, Invited Talk in the 8th ChinaSys workshop, Xiamen University. | Xiamen, China |