Home
People
Events
Research
Publications
Contact
News
3
DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks
Generating realistic talking faces is a complex and widely discussed task with numerous applications. In this paper, we present …
Zipeng Qi
,
Xulong Zhang
,
Ning Cheng
,
Jing Xiao
,
Jianzong Wang
Cite
Code
arXiv
Sparks of Large Audio Models: A Survey and Outlook
This survey paper provides a comprehensive overview of the recent advancements and challenges in applying large language models to the …
Siddique Latif
,
Moazzam Shoukat
,
Fahad Shamshad
,
Muhammad Usama
,
Yi Ren
,
Heriberto Cuayáhuitl
,
Wenwu Wang
,
Xulong Zhang
,
Roberto Togneri
,
Björn W. Schuller
Cite
Code
arXiv
Cite
×