First FAIR Workshop on Human-Centered AI

WP 3

ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models

Andrea Pedrotti

 Overview  Program