Home
People
Events
Research
Publications
Contact
News
Cocktait:Chunk-AdaptiveMixed-PrecisionOuanization for Long-Context LLM inference
WeiTao
,
Bin Zhang
,
Xiaoyang Qu
,
Jiguang Wan
,
Jianzong Wang
November 2024
Cite
Abstract
TBD
Type
1
Publication
In
Design, Automation, and Test in Europe 2025
Click the
Cite
button above to demo the feature to enable visitors to import publication metadata into their reference management software.
LLM
Jianzong Wang
Honorary Director
[email protected]
Cite
×