CPPO: Contrastive Perception for Vision Language Policy Optimization
reinforcement-learning-algorithms policy-optimization contrastive-learning perception-aware vision-language-model entropy-based-approach cppo vision-token
-
Updated
Feb 12, 2026 - Python