Call for Papers
The goals of this workshop are to (1) present and discuss the latest trends in the audio and computer vision fields that serve common research goals, (2) understand the state-of-the-art techniques and bottlenecks of each other's discipline on common topics, and (3) investigate research opportunities for joint audio-visual scene understanding in multimedia content production.
This workshop will be a good opportunity to bring together leading experts in audio and vision, and to bridge the gap between the two research fields in multimedia content production and reproduction. We welcome research contributions related to (but not limited to) the following topics:
- 3D audio-visual capture systems
- Object segmentation and audio source separation
- Audio-visual tracking
- Speaker identification and speech recognition
- Scene understanding using audio-visual sensors
- Deep learning for audio-visual data analysis
- Geometry-aware auditory scene analysis
- Virtual/Augmented reality content production
- 360-degree video and spatial audio
- Adaptive audio-visual content rendering