tf_agents.environments.PyEnvironment

Abstract base class for Python RL environments.

View aliases

Main aliases

tf_agents.environments.py_environment.PyEnvironment

tf_agents.environments.PyEnvironment(
    handle_auto_reset: bool = False
)

Used in the notebooks

Used in the tutorials
Environments Tutorial on Multi Armed Bandits in TF-Agents

Observations and valid actions are described with ArraySpecs, defined in the specs module.

If the environment can run multiple steps at the same time and take a batched set of actions and return a batched set of observations, it should overwrite the property batched to True.

Environments are assumed to auto reset once they reach the end of the episode, if prefer that the base class handle auto_reset set handle_auto_reset=True.

Args
`handle_auto_reset`	When `True` the base class will handle auto_reset of the Environment.

Attributes
`batch_size`	The batch size of the environment.
`batched`	Whether the environment is batched or not. If the environment supports batched observations and actions, then overwrite this property to True. A batched environment takes in a batched set of actions and returns a batched set of observations. This means for all numpy arrays in the input and output nested structures, the first dimension is the batch size. When batched, the left-most dimension is not part of the action_spec or the observation_spec and corresponds to the batch dimension. When batched and handle_auto_reset, it checks `np.all(steps.is_last())`.

Attributes

batch_size The batch size of the environment.

batched

Whether the environment is batched or not.

If the environment supports batched observations and actions, then overwrite this property to True.

A batched environment takes in a batched set of actions and returns a batched set of observations. This means for all numpy arrays in the input and output nested structures, the first dimension is the batch size.

When batched, the left-most dimension is not part of the action_spec or the observation_spec and corresponds to the batch dimension.

When batched and handle_auto_reset, it checks np.all(steps.is_last()).

tf_agents.environments.PyEnvironment Stay organized with collections Save and categorize content based on your preferences.

View aliases

Used in the notebooks

Args

Attributes

Methods

action_spec

close

current_time_step

discount_spec

get_info

get_state

observation_spec

render

reset

reward_spec

seed

set_state

should_reset

step

time_step_spec

__enter__

__exit__

tf_agents.environments.PyEnvironment

`action_spec`

`close`

`current_time_step`

`discount_spec`

`get_info`

`get_state`

`observation_spec`

`render`

`reset`

`reward_spec`

`seed`

`set_state`

`should_reset`

`step`

`time_step_spec`

`enter`

`exit`