Home
People
Events
Research
Publications
Contact
News
Cocktait:Chunk-AdaptiveMixed-PrecisionOuanization for Long-Context LLM inference
Wei Tao
,
Bin Zhang
,
Xiaoyang Qu
,
Jiguang Wan
,
Jianzong Wang
January 2025
Cite
Abstract
TBD
Type
1
Publication
In
Design, Automation, and Test in Europe 2025
Click the
Cite
button above to demo the feature to enable visitors to import publication metadata into their reference management software.
LLM
Jianzong Wang
Honorary Director
[email protected]
Cite
×