example_envs.tag_gridworld package¶

Submodules¶

example_envs.tag_gridworld.tag_gridworld module¶

class example_envs.tag_gridworld.tag_gridworld.CUDATagGridWorld(num_taggers=10, grid_length=10, episode_length=100, starting_location_x=None, starting_location_y=None, seed=None, wall_hit_penalty=0.1, tag_reward_for_tagger=10.0, tag_penalty_for_runner=2.0, step_cost_for_tagger=0.01, use_full_observation=True)¶

Bases: example_envs.tag_gridworld.tag_gridworld.TagGridWorld, warp_drive.utils.gpu_environment_context.CUDAEnvironmentContext

CUDA version of the TagGridWorld environment. Note: this class subclasses the Python environment class TagGridWorld, and also the CUDAEnvironmentContext

get_data_dictionary()¶

get_tensor_dictionary()¶

step(actions=None)¶

class example_envs.tag_gridworld.tag_gridworld.TagGridWorld(num_taggers=10, grid_length=10, episode_length=100, starting_location_x=None, starting_location_y=None, seed=None, wall_hit_penalty=0.1, tag_reward_for_tagger=10.0, tag_penalty_for_runner=2.0, step_cost_for_tagger=0.01, use_full_observation=True)¶

Bases: object

The game of tag on a 2D square grid plane. This is a simplified version of the continuous tag. There are a number of taggers trying to tag 1 runner. The taggers want to catch the runner. Once the runner is tagged, the game is over.

generate_observation()¶

name = 'TagGridWorld'¶

reset()¶

seed(seed=None)¶

set_global_state(key=None, value=None, t=None, dtype=None)¶

step(actions=None)¶

update_state(actions_x, actions_y)¶

example_envs.tag_gridworld package¶

Submodules¶

example_envs.tag_gridworld.tag_gridworld module¶

Module contents¶