OpenMixer: A new open vocabulary action detection approach

0
OpenMixer is a novel open vocabulary action detection method that leverages the semantics and localizability of a large visual language model (VLM) combined with the design of a query-based detection transformer (DETR) to successfully address the problem of action detection in the open world. Experiments show that OpenMixer outperforms baseline methods in detecting both seen and unseen actions.