example_envs.tag_gridworld package

Submodules

example_envs.tag_gridworld.tag_gridworld module

class example_envs.tag_gridworld.tag_gridworld.CUDATagGridWorld(num_taggers=10, grid_length=10, episode_length=100, starting_location_x=None, starting_location_y=None, seed=None, wall_hit_penalty=0.1, tag_reward_for_tagger=10.0, tag_penalty_for_runner=2.0, step_cost_for_tagger=0.01, use_full_observation=True)

Bases: example_envs.tag_gridworld.tag_gridworld.TagGridWorld, warp_drive.utils.gpu_environment_context.CUDAEnvironmentContext

CUDA version of the TagGridWorld environment. Note: this class subclasses both the Python environment class TagGridWorld and CUDAEnvironmentContext.

get_data_dictionary()
get_tensor_dictionary()
step(actions=None)

class example_envs.tag_gridworld.tag_gridworld.TagGridWorld(num_taggers=10, grid_length=10, episode_length=100, starting_location_x=None, starting_location_y=None, seed=None, wall_hit_penalty=0.1, tag_reward_for_tagger=10.0, tag_penalty_for_runner=2.0, step_cost_for_tagger=0.01, use_full_observation=True)

Bases: object

The game of tag on a 2D square grid. This is a simplified version of continuous tag: several taggers try to catch a single runner, and the game ends once the runner is tagged.

generate_observation()
name = 'TagGridWorld'
reset()
seed(seed=None)
set_global_state(key=None, value=None, t=None, dtype=None)
step(actions=None)
update_state(actions_x, actions_y)
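
The rules described for TagGridWorld can be illustrated with a minimal, self-contained sketch. This is not the library's implementation: the function toy_tag_step, its array layout (the runner stored as the last row), and the exact way rewards are combined are assumptions made for illustration. Only the default values are taken from the constructor signature above.

```python
import numpy as np


# Hypothetical helper, NOT part of example_envs.tag_gridworld.
# Defaults mirror the TagGridWorld constructor; the reward logic is assumed.
def toy_tag_step(locs, actions, grid_length=10, wall_hit_penalty=0.1,
                 tag_reward_for_tagger=10.0, tag_penalty_for_runner=2.0,
                 step_cost_for_tagger=0.01):
    """Advance one step of a toy grid tag game.

    locs, actions: integer arrays of shape (num_agents, 2).
    Assumption: the last row is the runner, all other rows are taggers.
    Returns (new_locs, rewards, done).
    """
    proposed = locs + actions
    # Agents that would leave the grid stay on the boundary cell.
    new_locs = np.clip(proposed, 0, grid_length - 1)
    hit_wall = np.any(proposed != new_locs, axis=1)

    rewards = np.zeros(len(locs))
    rewards[hit_wall] -= wall_hit_penalty
    rewards[:-1] -= step_cost_for_tagger  # every tagger pays a step cost

    # A tagger tags the runner when both occupy the same cell (assumed rule).
    tagged = np.all(new_locs[:-1] == new_locs[-1], axis=1)
    done = bool(tagged.any())
    rewards[np.flatnonzero(tagged)] += tag_reward_for_tagger
    if done:
        rewards[-1] -= tag_penalty_for_runner
    return new_locs, rewards, done


# Two taggers and one runner (last row) on a 10 x 10 grid.
locs = np.array([[0, 0], [5, 5], [5, 6]])
actions = np.array([[-1, 0], [0, 1], [0, 0]])
new_locs, rewards, done = toy_tag_step(locs, actions)
```

In this example the first tagger walks into a wall and is penalized, while the second tagger moves onto the runner's cell and ends the game. The actual environment may differ in details such as action encoding and how the tag reward is shared among taggers.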

Module contents