cntk.contrib.deeprl.agent.random_agent module

Baseline agent that selects action uniformly randomly.

class RandomAgent(o_space, a_space)[source]

Bases: cntk.contrib.deeprl.agent.agent.AgentBaseClass

Agent that selects action uniformly randomly.

end(reward, next_state)[source]

Last observed reward/state of the episode (which then terminates).

save(filename)[source]

Save best model to file.

save_parameter_settings(filename)[source]

Save parameter settings to file.

set_as_best_model()[source]

Copy current model to best model.

start(state)[source]

Start a new episode.

step(reward, next_state)[source]

Observe one transition and choose an action.