news

Nov 03, 2024 “Ditto: Accelerating Diffusion Model via Temporal Value Similarity” has been accepted for HPCA 2025! I am a co-first author of the paper. Details will be shared after the camera-ready version is finalized.
Aug 01, 2024 The paper, ‘AirGun: Adaptive Granularity Quantization for Accelerating Large Language Models’, has been accepted to IEEE International Conference on Computer Design (ICCD) 2024.
May 20, 2024 The paper, ‘GUMSO: Gating Unnecessary On-Chip Memory Slices for Power Optimization on GPUs’, has been accepted to IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED) 2024.