Skip to the content.

Programme

West Meeting Room 114, 115 Sat 14 Dec, 2024

The times are in PDT time zone.

Time Schedule
08:15 - 08:30 Welcome and opening remarks
08:30 - 09:00 Alexis Conneau, CEO of Waveforms AI. Conversational Speech Turing Test
09:00 - 09:30 Joon Soon Chung, Associate Professor at KAIST. Giving Voice and Face to AI
09:30 - 09:45 Oral Presentation - Improving Musical Accompaniment Co-creation via Diffusion Transformers
09:45 - 10:00 Oral Presentation - AV-DiT: Efficient Audio-Visual Diffusion Transformer for Joint Audio and Video Generation
10:00 - 10:15 Oral Presentation - AudioSetCaps: Enriched Audio Captioning Dataset Generation Using Large Audio Language Models
10:15 - 10:30 Coffee Break
10:30 - 12:00 Poster + Demos Session
12:00 - 13:30 Lunch Break
13:30 - 13:45 Oral Presentation - LOCKEY: A Novel Approach to Model Authentication and Deepfake Tracking
13:45 - 14:00 Oral Presentation - BLAP: Bootstrapping Language-Audio Pre-training for Music Captioning
14:00 - 14:15 Oral Presentation - Improving Source Extraction with Diffusion and Consistency Models
14:15 - 14:45 Yao Xie, Professor at Georgia Tech. Generative Models for Statistical Inference: Advancing Probabilistic Representations
14:45 - 15:15 Vikas Chandra, Director of Core AI at Meta. Audio Generation for VR/MR
15:15 - 15:30 Coffee Break
15:30 - 16:15 Panel Discussion and Closing Remarks
16:15 - 17:30 Poster + Demos Session