Qwen3-VL-Embedding and Reranker in action 馃檶馃徏
the query: a cartoon guy drinks merlot wine i like this query because we see how it can retrieve based on semantics (a cartoon), text (the label merlot), and temporal action in the video (the cartoon guy drinks the wine mid-way through the video)