rl_agent
- class rbgame.agent.rl_agent.RLAgent(policy, memory=None, update_per_step=1.0, repeat_per_collect=1000)[source]
Bases:
BaseAgentBase Reinforcement Learning agent.
- Parameters:
policy (
BasePolicy) – Policy.memory (
Optional[VectorReplayBuffer]) – Replay Buffer.update_per_step (
float) – How many times agent samples from memory and learns per one step, using only in offpolicy algorithms.repeat_per_collect (
int) – How many times agents learns on sampled data, using only in onpolicy algorithms.
- abstract get_action(obs)
Compute action from observation.
- class rbgame.agent.rl_agent.OffPolicyAgent(policy, memory=None, update_per_step=1.0, repeat_per_collect=1000)[source]
Bases:
RLAgent- infer_act(obs_b_o, mask_b, exploration_noise)[source]
Forward batch of observations through network.