Home
People
Events
Research
Publications
Contact
News
Bin Zhang
Publications
Cocktait:Chunk-AdaptiveMixed-PrecisionOuanization for Long-Context LLM inference
(2024) In
DATE2025
(CCF-B)
Cite
×