Peiqiang Wang
Publications
WindowQuant: Mixed-Precision KV Cache Quantization based on Window-Level Similarity for VLMs Inference Optimization
(2026), In TACO2026 (CCF-A)