The 1st Workshop in Video Understanding and its Applications

In conjunction with BMVC 2023.
Aberdeen, United Kingdom
November 24th, 2023

The 1st Workshop in Video Understanding and its Applications (VUA 2023)

Video understanding is an active field of computer vision and AI that aims to learn about and assess the world around us from video footage. It can benefit many real-world applications, such as training and education, patient monitoring, sports assessment, and security systems. Automating these applications through video analysis not only saves their users money and time but also reduces human error. Despite recent advances in other areas of computer vision, e.g. image analysis, video understanding remains an unsolved and very challenging problem.

The proposed workshop on video understanding aims to address the challenges in this field by making the following contributions:

  • Bringing together leading experts in the field of video understanding to help propel the field forward. This includes junior and senior researchers, with equal representation and contribution from academia and industry.
  • Stimulating and accelerating research progress in video understanding to match the requirements of real-world applications, by identifying the challenges and ways to address them.

Potential topics include, but are not limited to:

  • Applications of video understanding to healthcare and media production
  • View-invariant and 3D video understanding (e.g. 3D action recognition)
  • Transformers for video understanding
  • Generating synthetic data for video understanding tasks
  • Self-supervised learning for video understanding
  • Multi-modal video understanding
  • Action/event detection
  • Video captioning
  • Video editing and summarization
  • Videography/virtual cinematography
  • Video search and retrieval

Submission

Papers are limited to 9 pages in the BMVC format (see the main conference author guidelines). Accepted papers will be published in the BMVC 2023 workshop proceedings.

All papers should be submitted through the CMT website: https://cmt3.research.microsoft.com/VUABMVC2023

Important Dates

  • Deadline for submission: August 20th, 2023 - 23:59 British Summer Time
  • Notification of acceptance: September 10th, 2023
  • Camera-ready submission deadline: September 25th, 2023
  • Workshop date: November 24th, 2023

Program

Robert Gordon University, Sir Ian Wood Building, Garthdee Campus

  • 10:00-10:05 - Welcome and Introduction
  • 10:05-10:55 - Keynote 1 - Dr. João Carreira (Google DeepMind) (40-min talk followed by 10-min Q&A)
  • 11:00-11:50 - Keynote 2 - Professor Dima Damen (University of Bristol) (40-min talk followed by 10-min Q&A)
  • 11:50-12:10 - AI4ME Presentation - Faegheh Sardari & Asmar Nadeem (Video Understanding for personalized media production)
  • 12:10-12:30 - Lunch break
  • 12:30-14:00 - Oral Session (10-min presentations + 2-min Q&A)
  • 14:00-14:50 - Keynote 3 - Dr. Fabian Caba (Adobe) (40-min talk followed by 10-min Q&A)

Invited Speakers

João Carreira

João is a senior research scientist at Google DeepMind, and prior to that, he was a postdoctoral researcher at the University of California, Berkeley. He is the first author of the paper 'Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset,' a groundbreaking work in the field of video understanding.

Dima Damen

Dima is a full professor of computer vision at the University of Bristol and a senior research scientist at Google DeepMind. She is currently an EPSRC Fellow (2020-2025), and her research focuses on the automatic understanding of object interactions, actions and activities using wearable visual (and depth) sensors. She is the project lead for EPIC-KITCHENS, the largest dataset in egocentric vision, with accompanying open challenges.

Fabian Caba

Fabian is a Senior Research Scientist at Adobe working at the intersection of video understanding and generation. His main interests center on the development of ML models aligned with creative human intent. He has co-organized multiple editions of the ActivityNet and CVEU workshops.

Organizers

Faegheh Sardari

University of Surrey, United Kingdom

Armin Mustafa

University of Surrey, United Kingdom

Asmar Nadeem

University of Surrey, United Kingdom

Robert Dawes

BBC R&D, United Kingdom

Adrian Hilton

University of Surrey, United Kingdom

Acknowledgments

We gratefully acknowledge our reviewers:

  • Helge Rhodin - University of British Columbia
  • Mohammad Sabokrou - Institute for Research in Fundamental Sciences (IPM)
  • Sauradip Nag - University of Surrey
  • Davide Moltisanti - University of Edinburgh
  • Ayushi Dutta - University of Surrey
  • Hanyuan Wang - University of Bristol
  • Ozge Mercanoglu Sincan - University of Surrey
  • Mohammad Khalooei - Amirkabir University of Technology
  • Otto Brookes - University of Bristol
  • Xinyu Yang - Lancaster University

Contacts

For additional info please contact us here