Tag: Technique

Joint-GRPO

A method that orchestrates the collaboration between a Vision-Language Model and a Video Diffusion Model to optimize their outputs based on a shared reward.

GRPO

An optimization technique used in reinforcement learning to improve policy performance based on feedback from the environment.

Benchmarking

Benchmarking is the process of comparing performance metrics of systems or models against a standard or set of criteria.

TransV

A token information transfer module that compresses vision tokens into instruction tokens while preserving multimodal understanding.