Visual Semantics Events

Overview

Explore cutting-edge research in visual event classification and multilingual video analysis through this informative talk by Kate Sanders, a PhD student at Johns Hopkins University's Center for Language & Speech Processing. Dive into two groundbreaking works: the SQUID-E dataset, which addresses the challenge of ambiguous images in visual classification tasks, and MultiVENT, a multilingual video dataset for event-centric analysis. Learn how these projects aim to improve model performance on uncertain visual data and leverage diverse online news sources. Gain insights into the creation of robust datasets, the characterization of human uncertainty in vision tasks, and the development of complex multilingual video retrieval models. Discover the potential applications of these research efforts in enhancing visual event classification and multimodal information retrieval across languages.

Syllabus

Visual Semantics Events - Kate Sanders (Johns Hopkins University) - 2023

Taught by

Center for Language & Speech Processing(CLSP), JHU

Reviews

Start your review of Visual Semantics Events

Taught by

Takeaways from the SCALE 2024 Workshop on Video-based Event Retrieval

Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models

Never Stop Learning.