Programme
West Meeting Room 114, 115 Sat 14 Dec, 2024
The times are in PDT time zone.
Time | Schedule |
---|---|
08:15 - 08:30 | Welcome and opening remarks |
08:30 - 09:00 | Alexis Conneau, CEO of Waveforms AI. Conversational Speech Turing Test |
09:00 - 09:30 | Joon Soon Chung, Associate Professor at KAIST. Giving Voice and Face to AI |
09:30 - 09:45 | Oral Presentation - Improving Musical Accompaniment Co-creation via Diffusion Transformers |
09:45 - 10:00 | Oral Presentation - AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation |
10:00 - 10:15 | Oral Presentation - AudioSetCaps: Enriched Audio Captioning Dataset Generation Using Large Audio Language Models |
10:15 - 10:30 | Coffee Break |
10:30 - 12:00 | Poster + Demos Session |
12:00 - 13:30 | Lunch Break |
13:30 - 13:45 | Oral Presentation - LOCKEY: A Novel Approach to Model Authentication and Deepfake Tracking |
13:45 - 14:00 | Oral Presentation - BLAP: Bootstrapping Language-Audio Pre-training for Music Captioning |
14:00 - 14:15 | Oral Presentation - Improving Source Extraction with Diffusion and Consistency Models |
14:15 - 14:45 | Yao Xie, Professor at Georgia Tech. Generative Models for Statistical Inference: Advancing Probabilistic Representations |
14:45 - 15:15 | Vikas Chandra, Director of Core AI at Meta. Audio Generation for VR/MR |
15:15 - 15:30 | Coffee Break |
15:30 - 16:15 | Panel Discussion and Closing Remarks |
16:15 - 17:30 | Poster + Demos Session |