tf_agents.trajectories.from_transition

Returns a Trajectory given transitions.

View aliases

Main aliases

tf_agents.trajectories.trajectory.from_transition

tf_agents.trajectories.from_transition(
    time_step: tf_agents.trajectories.TimeStep,
    action_step: tf_agents.trajectories.PolicyStep,
    next_time_step: tf_agents.trajectories.TimeStep
) -> tf_agents.trajectories.Trajectory

Used in the notebooks

Used in the tutorials
DQN C51/Rainbow

from_transition is used by a driver to convert sequence of transitions into a Trajectory for efficient storage. Then an agent (e.g. ppo_agent.PPOAgent) converts it back to transitions by invoking to_transition.

Note that this method does not add a time dimension to the Tensors in the resulting Trajectory. This means that if your transitions don't already include a time dimension, the Trajectory cannot be passed to agent.train().

Args
`time_step`	A `time_step.TimeStep` representing the first step in a transition.
`action_step`	A `policy_step.PolicyStep` representing actions corresponding to observations from time_step.
`next_time_step`	A `time_step.TimeStep` representing the second step in a transition.

tf_agents.trajectories.from_transition

View aliases

Used in the notebooks

Args