Search

Home
People
Events
Research
Publications
Contact
News

3

DiffTalker: Co-driven audio-image diffusion for talking faces via intermediate landmarks

Generating realistic talking faces is a complex and widely discussed task with numerous applications. In this paper, we present …

Zipeng Qi, Xulong Zhang, Ning Cheng, Jing Xiao, Jianzong Wang

Cite Code arXiv

Sparks of Large Audio Models: A Survey and Outlook

This survey paper provides a comprehensive overview of the recent advancements and challenges in applying large language models to the …

Siddique Latif, Moazzam Shoukat, Fahad Shamshad, Muhammad Usama, Yi Ren, Heriberto Cuayáhuitl, Wenwu Wang, Xulong Zhang, Roberto Togneri, Björn W. Schuller

Cite Code arXiv

Sparks of Large Audio Models: A Survey and Outlook

© 2008-2026 Lab of Large Audio Model All Rights Reserved.

Cite