Cooperative Reaching

These policies are for the Cooperative Reaching environment. Read environment page for detailed information about the environment.

Generic

These policies can be used for any version of this environment.

env = posggym.make("CooperativeReaching-v0")

Policy	ID	Valid Agent IDs	Description
`H1`	`CooperativeReaching-v0/H1-v0`	All	H1 always goes to the closest rewarding goal.
`H2`	`CooperativeReaching-v0/H2-v0`	All	H2 always goes to the furthest rewarding goal.
`H3`	`CooperativeReaching-v0/H3-v0`	All	H3 always goes to the closest optimal goal.
`H4`	`CooperativeReaching-v0/H4-v0`	All	H4 always goes to the furthest optimal goal.
`H5`	`CooperativeReaching-v0/H5-v0`	All	H5 always goes to the closest suboptimal goal.
`H6`	`CooperativeReaching-v0/H6-v0`	All	H6 always goes to the furthest suboptimal goal.
`H7`	`CooperativeReaching-v0/H7-v0`	All	H7 goes to a randomly selected goal.
`H8`	`CooperativeReaching-v0/H8-v0`	All	H8 goes to the goal closest to the other agent at each time step.
`H9`	`CooperativeReaching-v0/H9-v0`	All	H9 goes to the optimal goal closest to the other agent.
`H10`	`CooperativeReaching-v0/H10-v0`	All	H10 goes to the sub-optimal goal closest to the other agent.
`H11`	`CooperativeReaching-v0/H11-v0`	All	H11 follows the other agent.