Reinforcement Learning

Synaptic Q-learning

November 23, 2025 2 Reinforcement Learning

A reinforcement learning algorithm that updates action-value functions based on the Bellman equations at the synaptic level.

GRPO

November 23, 2025 2 Reinforcement Learning

Technique

An optimization technique used in reinforcement learning to improve policy performance based on feedback from the environment.

Continuous Self-Rewarding Process

November 23, 2025 3 Reinforcement Learning

Concept

A learning mechanism where agents receive feedback based on their performance, allowing them to improve autonomously.

D-GARA

November 23, 2025 1 Reinforcement Learning

Framework

D-GARA is a dynamic benchmarking framework for evaluating the robustness of GUI agents against real-world anomalies.

Continuous Control

November 23, 2025 4 Reinforcement Learning

Application

A type of reinforcement learning task where the agent must make decisions in a continuous action space, often used in robotics and simulation environments.

Reward Profiling

November 23, 2025 4 Reinforcement Learning

Technique

A method for selectively updating the policy based on high-confidence performance estimations to improve the stability and convergence of reinforcement learning algorithms.

Policy Gradient

November 23, 2025 4 Reinforcement Learning

Technique

A class of algorithms in reinforcement learning that optimize the policy directly by adjusting the parameters in the direction of the gradient of expected reward.

November 23, 2025 3 Reinforcement Learning

Technique

A type of machine learning where an agent learns to make decisions by receiving rewards or penalties based on its actions in an environment.