View source on GitHub |
Trajectories module.
Modules
policy_step
module: Policy Step.
time_step
module: TimeStep representing a step in the environment.
trajectory
module: Trajectory containing time_step transition information.
Classes
class PolicyInfo
: PolicyInfo(log_probability,)
class PolicyStep
: Returned with every call to policy.action()
and policy.distribution()
.
class StepType
: Defines the status of a TimeStep
within a sequence.
class TimeStep
: Returned with every call to step
and reset
on an environment.
class Trajectory
: A tuple that represents a trajectory.
class Transition
: A tuple that represents a transition.
Functions
boundary(...)
: Create a Trajectory transitioning between StepTypes LAST
and FIRST
.
first(...)
: Create a Trajectory transitioning between StepTypes FIRST
and MID
.
from_transition(...)
: Returns a Trajectory
given transitions.
last(...)
: Create a Trajectory transitioning between StepTypes MID
and LAST
.
mid(...)
: Create a Trajectory transitioning between StepTypes MID
and MID
.
restart(...)
: Returns a TimeStep
with step_type
set equal to StepType.FIRST
.
single_step(...)
: Create a Trajectory transitioning between StepTypes FIRST
and LAST
.
termination(...)
: Returns a TimeStep
with step_type
set to StepType.LAST
.
time_step_spec(...)
: Returns a TimeStep
spec given the observation_spec.
to_n_step_transition(...)
: Create an n-step transition from a trajectory with T=N + 1
frames.
to_transition(...)
: Create a transition from a trajectory or two adjacent trajectories.
to_transition_spec(...)
: Create a transition spec from a trajectory spec.
transition(...)
: Returns a TimeStep
with step_type
set equal to StepType.MID
.
truncation(...)
: Returns a TimeStep
with step_type
set to StepType.LAST
.