Publications
[USENIX ATC] Gen Dong, Yu Hua, Yongle Zhang, Zhangyu Chen, Menglei Chen, "Understanding and Detecting Fail-Slow Hardware Failure Bugs in Cloud Systems", Proceedings of the USENIX Annual Technical Conference (USENIX ATC), 2025.
[Paper]
[Slides]
[Code]
[ICS] Aoyang Tong, Yu Hua, Menglei Chen, "DALdex: A DPU-Accelerated Persistent Learned Index via Incremental Learning", Proceedings of the ACM International Conference on Supercomputing (ICS), 2025.
[Paper]
[Slides]
[Code]
[FAST] Menglei Chen, Yu Hua, Zhangyu Chen, Ming Zhang, Gen Dong, "GPHash: An Efficient Hash Index for GPU with Byte-Granularity Persistent Memory", Proceedings of the 23rd USENIX Conference on File and Storage Technologies (FAST), 2025.
[Paper]
[Slides]
[Code]
[ICCD] Menglei Chen, Yu Hua, Rong Bai, Jianming Huang, "A Cost-Efficient Failure-Tolerant Scheme for Distributed DNN Training", Proceedings of the 41st IEEE International Conference on Computer Design (ICCD), 2023.
[Paper]
[Slides]
[Code]