TFDS はCroissant 🥐 形式をサポートするようになりました。詳細については、ドキュメントをお読みください。

このページは Cloud Translation API によって翻訳されました。

rlu_rwrl

説明:

RL Unplugged は、オフライン強化学習のベンチマークスイートです。 RL Unplugged は、次の考慮事項に基づいて設計されています。使いやすさを促進するために、一般的なパイプラインが確立されると、実践者がスイート内のすべてのデータを簡単に操作できるようにする統合 API をデータセットに提供します。

データセットはRLDS 形式に従って、ステップとエピソードを表します。

データセットの例は、https://arxiv.org/abs/1904.12901 で説明されているように、部分的にオンラインでトレーニングされたエージェントを実行したときに保存された SAR 遷移を表しています。https ://github.com/google-researchで指定されているように、RLDS データセット形式に従います。 /rlds#データセット形式

合計 8 つのタスクで 40 のデータセットをリリースします。カートポール、ウォーカー、四足歩行、およびヒューマノイドタスクでは、組み合わせチャレンジなしと簡単な組み合わせチャレンジがあります。各タスクには、1%、5%、20%、40%、100% の 5 つの異なるサイズのデータセットが含まれています。小さいデータセットが大きいデータセットのサブセットであるとは限らないことに注意してください。データセットの生成方法の詳細については、論文を参照してください。

ホームページ: https://github.com/deepmind/deepmind-research/tree/master/rl_unplugged
ソースコード: tfds.rl_unplugged.rlu_rwrl.RluRwrl
バージョン:
- 1.0.0 : 初期リリース。
- 1.0.1 (デフォルト): ヒューマノイドデータセットの 1 つでエピソード ID が重複している RLU RWRL データセットのバグを修正します。
ダウンロードサイズ: サイズUnknown size
監視されたキー( as_supervised docを参照): None
図( tfds.show_examples ): サポートされていません。
引用：

@misc{gulcehre2020rl,
    title={RL Unplugged: Benchmarks for Offline Reinforcement Learning},
    author={Caglar Gulcehre and Ziyu Wang and Alexander Novikov and Tom Le Paine
        and  Sergio Gómez Colmenarejo and Konrad Zolna and Rishabh Agarwal and
        Josh Merel and Daniel Mankowitz and Cosmin Paduraru and Gabriel
        Dulac-Arnold and Jerry Li and Mohammad Norouzi and Matt Hoffman and
        Ofir Nachum and George Tucker and Nicolas Heess and Nando deFreitas},
    year={2020},
    eprint={2006.13888},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

rlu_rwrl/cartpole_swingup_combined_challenge_none_1_percent (デフォルト設定)

データセットサイズ: 172.43 KiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	5

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(1)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
歩数/観察/位置	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(2)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_5_percent

データセットサイズ: 862.13 KiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	25

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(1)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
歩数/観察/位置	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(2)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_20_percent

データセットのサイズ: 3.37 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	100

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(1)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
歩数/観察/位置	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(2)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_40_percent

データセットのサイズ: 6.74 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	200

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(1)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
歩数/観察/位置	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(2)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_none_100_percent

データセットサイズ: 16.84 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	500

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(1)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
歩数/観察/位置	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(2)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_1_percent

データセットサイズ: 1.77 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	5

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(12)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/利己的な状態	テンソル	(44)	float32
ステップ/観察/力_トルク	テンソル	(24)	float32
歩数/観察/imu	テンソル	(6,)	float32
steps/observation/torso_upright	テンソル	(1)	float32
steps/observation/torso_velocity	テンソル	(3)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_5_percent

データセットサイズ: 8.86 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	25

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(12)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/利己的な状態	テンソル	(44)	float32
ステップ/観察/力_トルク	テンソル	(24)	float32
歩数/観察/imu	テンソル	(6,)	float32
steps/observation/torso_upright	テンソル	(1)	float32
steps/observation/torso_velocity	テンソル	(3)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_20_percent

データセットのサイズ: 35.46 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	100

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(12)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/利己的な状態	テンソル	(44)	float32
ステップ/観察/力_トルク	テンソル	(24)	float32
歩数/観察/imu	テンソル	(6,)	float32
steps/observation/torso_upright	テンソル	(1)	float32
steps/observation/torso_velocity	テンソル	(3)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_40_percent

データセットのサイズ: 70.92 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	200

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(12)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/利己的な状態	テンソル	(44)	float32
ステップ/観察/力_トルク	テンソル	(24)	float32
歩数/観察/imu	テンソル	(6,)	float32
steps/observation/torso_upright	テンソル	(1)	float32
steps/observation/torso_velocity	テンソル	(3)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_none_100_percent

データセットのサイズ: 177.29 MiB
自動キャッシュ(ドキュメント): shuffle_files=Falseの場合のみ (トレーニング)
スプリット:

スプリット	例
`'train'`	500

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(12)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/利己的な状態	テンソル	(44)	float32
ステップ/観察/力_トルク	テンソル	(24)	float32
歩数/観察/imu	テンソル	(6,)	float32
steps/observation/torso_upright	テンソル	(1)	float32
steps/observation/torso_velocity	テンソル	(3)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_1_percent

データセットサイズ: 6.27 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	50

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(6,)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
歩数/観察/高さ	テンソル	(1)	float32
手順/観察/向き	テンソル	(14)	float32
ステップ/観察/速度	テンソル	(9)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_5_percent

データセットのサイズ: 31.34 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	250

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(6,)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
歩数/観察/高さ	テンソル	(1)	float32
手順/観察/向き	テンソル	(14)	float32
ステップ/観察/速度	テンソル	(9)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_20_percent

データセットのサイズ: 125.37 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	1,000

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(6,)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
歩数/観察/高さ	テンソル	(1)	float32
手順/観察/向き	テンソル	(14)	float32
ステップ/観察/速度	テンソル	(9)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_40_percent

データセットのサイズ: 250.75 MiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'train'`	2,000

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(6,)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
歩数/観察/高さ	テンソル	(1)	float32
手順/観察/向き	テンソル	(14)	float32
ステップ/観察/速度	テンソル	(9)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_none_100_percent

データセットサイズ: 626.86 MiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'train'`	5,000

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(6,)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
歩数/観察/高さ	テンソル	(1)	float32
手順/観察/向き	テンソル	(14)	float32
ステップ/観察/速度	テンソル	(9)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_1_percent

データセットのサイズ: 69.40 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	200

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(21)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観測/com_velocity	テンソル	(3)	float32
ステップ/観察/四肢	テンソル	(12)	float32
ステップ/観察/head_height	テンソル	(1)	float32
ステップ/観察/ジョイント_角度	テンソル	(21)	float32
steps/observation/torso_vertical	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(27)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_5_percent

データセットサイズ: 346.98 MiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'train'`	1,000

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(21)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観測/com_velocity	テンソル	(3)	float32
ステップ/観察/四肢	テンソル	(12)	float32
ステップ/観察/head_height	テンソル	(1)	float32
ステップ/観察/ジョイント_角度	テンソル	(21)	float32
steps/observation/torso_vertical	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(27)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_20_percent

データセットサイズ: 1.36 GiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'train'`	4,000

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(21)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観測/com_velocity	テンソル	(3)	float32
ステップ/観察/四肢	テンソル	(12)	float32
ステップ/観察/head_height	テンソル	(1)	float32
ステップ/観察/ジョイント_角度	テンソル	(21)	float32
steps/observation/torso_vertical	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(27)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_40_percent

データセットサイズ: 2.71 GiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'train'`	8,000

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(21)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観測/com_velocity	テンソル	(3)	float32
ステップ/観察/四肢	テンソル	(12)	float32
ステップ/観察/head_height	テンソル	(1)	float32
ステップ/観察/ジョイント_角度	テンソル	(21)	float32
steps/observation/torso_vertical	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(27)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_none_100_percent

データセットサイズ: 6.78 GiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'train'`	20,000

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(21)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観測/com_velocity	テンソル	(3)	float32
ステップ/観察/四肢	テンソル	(12)	float32
ステップ/観察/head_height	テンソル	(1)	float32
ステップ/観察/ジョイント_角度	テンソル	(21)	float32
steps/observation/torso_vertical	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(27)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_1_percent

データセットサイズ: 369.84 KiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	5

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(1)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
歩数/観察/位置	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(2)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_5_percent

データセットサイズ: 1.81 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	25

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(1)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
歩数/観察/位置	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(2)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_20_percent

データセットサイズ: 7.22 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	100

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(1)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
歩数/観察/位置	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(2)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_40_percent

データセットのサイズ: 14.45 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	200

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(1)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
歩数/観察/位置	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(2)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/cartpole_swingup_combined_challenge_easy_100_percent

データセットサイズ: 36.12 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	500

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(1,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'position': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(2,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(1)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
歩数/観察/位置	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(2)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_1_percent

データセットのサイズ: 1.97 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	5

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(12)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
ステップ/観察/利己的な状態	テンソル	(44)	float32
ステップ/観察/力_トルク	テンソル	(24)	float32
歩数/観察/imu	テンソル	(6,)	float32
steps/observation/torso_upright	テンソル	(1)	float32
steps/observation/torso_velocity	テンソル	(3)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_5_percent

データセットサイズ: 9.83 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	25

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(12)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
ステップ/観察/利己的な状態	テンソル	(44)	float32
ステップ/観察/力_トルク	テンソル	(24)	float32
歩数/観察/imu	テンソル	(6,)	float32
steps/observation/torso_upright	テンソル	(1)	float32
steps/observation/torso_velocity	テンソル	(3)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_20_percent

データセットサイズ: 39.31 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	100

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(12)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
ステップ/観察/利己的な状態	テンソル	(44)	float32
ステップ/観察/力_トルク	テンソル	(24)	float32
歩数/観察/imu	テンソル	(6,)	float32
steps/observation/torso_upright	テンソル	(1)	float32
steps/observation/torso_velocity	テンソル	(3)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_40_percent

データセットのサイズ: 78.63 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	200

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(12)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
ステップ/観察/利己的な状態	テンソル	(44)	float32
ステップ/観察/力_トルク	テンソル	(24)	float32
歩数/観察/imu	テンソル	(6,)	float32
steps/observation/torso_upright	テンソル	(1)	float32
steps/observation/torso_velocity	テンソル	(3)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/quadruped_walk_combined_challenge_easy_100_percent

データセットサイズ: 196.57 MiB
自動キャッシュ(ドキュメント): shuffle_files=Falseの場合のみ (トレーニング)
スプリット:

スプリット	例
`'train'`	500

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(12,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'egocentric_state': Tensor(shape=(44,), dtype=float32),
            'force_torque': Tensor(shape=(24,), dtype=float32),
            'imu': Tensor(shape=(6,), dtype=float32),
            'torso_upright': Tensor(shape=(1,), dtype=float32),
            'torso_velocity': Tensor(shape=(3,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(12)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
ステップ/観察/利己的な状態	テンソル	(44)	float32
ステップ/観察/力_トルク	テンソル	(24)	float32
歩数/観察/imu	テンソル	(6,)	float32
steps/observation/torso_upright	テンソル	(1)	float32
steps/observation/torso_velocity	テンソル	(3)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_1_percent

データセットサイズ: 8.20 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	50

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(6,)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
歩数/観察/高さ	テンソル	(1)	float32
手順/観察/向き	テンソル	(14)	float32
ステップ/観察/速度	テンソル	(9)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_5_percent

データセットのサイズ: 40.98 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	250

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(6,)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
歩数/観察/高さ	テンソル	(1)	float32
手順/観察/向き	テンソル	(14)	float32
ステップ/観察/速度	テンソル	(9)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_20_percent

データセットサイズ: 163.93 MiB
自動キャッシュ(ドキュメント): shuffle_files=Falseの場合のみ (トレーニング)
スプリット:

スプリット	例
`'train'`	1,000

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(6,)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
歩数/観察/高さ	テンソル	(1)	float32
手順/観察/向き	テンソル	(14)	float32
ステップ/観察/速度	テンソル	(9)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_40_percent

データセットのサイズ: 327.86 MiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'train'`	2,000

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(6,)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
歩数/観察/高さ	テンソル	(1)	float32
手順/観察/向き	テンソル	(14)	float32
ステップ/観察/速度	テンソル	(9)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/walker_walk_combined_challenge_easy_100_percent

データセットサイズ: 819.65 MiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'train'`	5,000

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(6,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'height': Tensor(shape=(1,), dtype=float32),
            'orientations': Tensor(shape=(14,), dtype=float32),
            'velocity': Tensor(shape=(9,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(6,)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
歩数/観察/高さ	テンソル	(1)	float32
手順/観察/向き	テンソル	(14)	float32
ステップ/観察/速度	テンソル	(9)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_1_percent

データセットのサイズ: 77.11 MiB
自動キャッシュ(ドキュメント): はい
スプリット:

スプリット	例
`'train'`	200

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(21)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観測/com_velocity	テンソル	(3)	float32
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
ステップ/観察/四肢	テンソル	(12)	float32
ステップ/観察/head_height	テンソル	(1)	float32
ステップ/観察/ジョイント_角度	テンソル	(21)	float32
steps/observation/torso_vertical	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(27)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_5_percent

データセットサイズ: 385.54 MiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'train'`	1,000

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(21)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観測/com_velocity	テンソル	(3)	float32
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
ステップ/観察/四肢	テンソル	(12)	float32
ステップ/観察/head_height	テンソル	(1)	float32
ステップ/観察/ジョイント_角度	テンソル	(21)	float32
steps/observation/torso_vertical	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(27)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_20_percent

データセットサイズ: 1.51 GiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'train'`	4,000

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(21)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観測/com_velocity	テンソル	(3)	float32
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
ステップ/観察/四肢	テンソル	(12)	float32
ステップ/観察/head_height	テンソル	(1)	float32
ステップ/観察/ジョイント_角度	テンソル	(21)	float32
steps/observation/torso_vertical	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(27)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_40_percent

データセットサイズ: 3.01 GiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'train'`	8,000

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(21)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観測/com_velocity	テンソル	(3)	float32
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
ステップ/観察/四肢	テンソル	(12)	float32
ステップ/観察/head_height	テンソル	(1)	float32
ステップ/観察/ジョイント_角度	テンソル	(21)	float32
steps/observation/torso_vertical	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(27)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):

rlu_rwrl/humanoid_walk_combined_challenge_easy_100_percent

データセットサイズ: 7.53 GiB
自動キャッシュ(ドキュメント): いいえ
スプリット:

スプリット	例
`'train'`	20,000

機能構造:

FeaturesDict({
    'episode_return': float32,
    'steps': Dataset({
        'action': Tensor(shape=(21,), dtype=float32),
        'discount': Tensor(shape=(1,), dtype=float32),
        'is_first': bool,
        'is_last': bool,
        'is_terminal': bool,
        'observation': FeaturesDict({
            'com_velocity': Tensor(shape=(3,), dtype=float32),
            'dummy-0': Tensor(shape=(1,), dtype=float32),
            'dummy-1': Tensor(shape=(1,), dtype=float32),
            'dummy-2': Tensor(shape=(1,), dtype=float32),
            'dummy-3': Tensor(shape=(1,), dtype=float32),
            'dummy-4': Tensor(shape=(1,), dtype=float32),
            'dummy-5': Tensor(shape=(1,), dtype=float32),
            'dummy-6': Tensor(shape=(1,), dtype=float32),
            'dummy-7': Tensor(shape=(1,), dtype=float32),
            'dummy-8': Tensor(shape=(1,), dtype=float32),
            'dummy-9': Tensor(shape=(1,), dtype=float32),
            'extremities': Tensor(shape=(12,), dtype=float32),
            'head_height': Tensor(shape=(1,), dtype=float32),
            'joint_angles': Tensor(shape=(21,), dtype=float32),
            'torso_vertical': Tensor(shape=(3,), dtype=float32),
            'velocity': Tensor(shape=(27,), dtype=float32),
        }),
        'reward': Tensor(shape=(1,), dtype=float32),
    }),
})

機能のドキュメント:

特徴	クラス	形	Dtype
	特徴辞書
episode_return	テンソル		float32
ステップ	データセット
ステップ/アクション	テンソル	(21)	float32
ステップ/割引	テンソル	(1)	float32
ステップ/is_first	テンソル		ブール
ステップ/is_last	テンソル		ブール
ステップ/is_terminal	テンソル		ブール
ステップ/観察	特徴辞書
ステップ/観測/com_velocity	テンソル	(3)	float32
ステップ/観察/ダミー-0	テンソル	(1)	float32
ステップ/観察/ダミー-1	テンソル	(1)	float32
ステップ/観察/ダミー-2	テンソル	(1)	float32
ステップ/観察/ダミー-3	テンソル	(1)	float32
ステップ/観察/ダミー-4	テンソル	(1)	float32
ステップ/観察/ダミー-5	テンソル	(1)	float32
ステップ/観察/ダミー-6	テンソル	(1)	float32
ステップ/観察/ダミー-7	テンソル	(1)	float32
ステップ/観察/ダミー-8	テンソル	(1)	float32
ステップ/観察/ダミー-9	テンソル	(1)	float32
ステップ/観察/四肢	テンソル	(12)	float32
ステップ/観察/head_height	テンソル	(1)	float32
ステップ/観察/ジョイント_角度	テンソル	(21)	float32
steps/observation/torso_vertical	テンソル	(3)	float32
ステップ/観察/速度	テンソル	(27)	float32
ステップ/報酬	テンソル	(1)	float32

例( tfds.as_dataframe ):