Wentao Ni
Open Menu
Close Menu
Bio
Papers
Experience
Projects
Paper-Conference
VQ-LLM: High-performance Code Generation for Vector Quantization Augmented LLM Inference
Mar 5, 2025
JUNO: Optimizing High-Dimensional Approximate Nearest Neighbour Search with Sparsity-Aware Algorithm and Ray-Tracing Core Mapping
Jan 1, 2024