Trustworthy Multimodal Learning with Foundation Models: Bridging the Gap between AI Research and Real World Applications

One Day Meeting: Trustworthy Multimodal Learning with Foundation Models: Bridging the Gap between AI Research and Real World Applications

Wednesday 24 April 2024

Chairs: Chao Zhang (Toshiba Europe Ltd), Jindong Gu (University of Oxford), Shitong Sun (Queen Mary University of London), Onay Urfalioglu (Vivo Tech GmbH)

< vivo

We invite academic and industry presentations, bringing together researchers interested in all aspects of foundational models (GPT-4, CLIP, SAM, etc) and multimodal learning involving, but not limited to, image, video, audio, depth, text, drawings, laser, IMU, etc.

Please register via charitysuite on this link:    Register Here

Invited Speakers

Videos of Talks

On our BMVA YouTube channel there are recorded talks of the slides and speaker from the day here

Programme

Start   End   Title
09:00   09:15   Registration/Poster Set-up
09:15   09:20   Opening Remarks
09:20   10:00   Invited Speaker - Guohao Li,
10:00   10:40   Invited Speaker - Oleg Sinavski
10:40   11:05   Coffee Break + Posters
11:05   12:20   Accepted Talks - Pt. 1
12:20   13:20   Lunch + Posters
13:20   14:00   Invited Speaker - Da Li
14:00   15:15   Accepted Talks - Pt. 2
15:15   15:40   Coffee Break + Posters
15:40   16:20   Invited Speaker - Rudra Poudel
16:20   17:00   Invited Speaker - Ashkan Khakzar
17:00   17:05   Past, Present, and Future of Vision-Language

Talk Part 1 (15 mins each)

Talk Part 2 (15 mins each)

Posters

Meeting Location

The meeting will take place at:

British Computer Society (BCS), 25 Copthall Avenue, London EC2R 7BP

Registration

We keep the cost of attending these events as low as possible to ensure no barriers from the whole computer vision community attending. The registration costs are as follows

Both include lunch and refreshments for the day

Please register via charitysuite on this link:    Register Here