Zihan Liu, Xinhao Luo, Junxian Guo, Wentao Ni, Yangjie Zhou, Yue Guan, Cong Guo, Weihao Cui, Yu Feng, Minyi Guo, Yuhao Zhu, Minjia Zhang, Jingwen Leng, Chen Jin
(2025).
VQ-LLM: High-performance Code Generation for Vector Quantization Augmented LLM Inference.
HPCA 2025.