Cooperative Reaching
These policies are for the Cooperative Reaching environment. Read environment page for detailed information about the environment.
Generic
These policies can be used for any version of this environment.
env = posggym.make("CooperativeReaching-v0")
Policy |
ID |
Valid Agent IDs |
Description |
|---|---|---|---|
|
|
All |
H1 always goes to the closest rewarding goal. |
|
|
All |
H2 always goes to the furthest rewarding goal. |
|
|
All |
H3 always goes to the closest optimal goal. |
|
|
All |
H4 always goes to the furthest optimal goal. |
|
|
All |
H5 always goes to the closest suboptimal goal. |
|
|
All |
H6 always goes to the furthest suboptimal goal. |
|
|
All |
H7 goes to a randomly selected goal. |
|
|
All |
H8 goes to the goal closest to the other agent at each time step. |
|
|
All |
H9 goes to the optimal goal closest to the other agent. |
|
|
All |
H10 goes to the sub-optimal goal closest to the other agent. |
|
|
All |
H11 follows the other agent. |