Generic Explorer#

Versatile implementation of Explorer.

class momba.gym.generic.Actions(value)[source]#

Specifies how edges of the controlled automaton are chosen.

EDGE_BY_INDEX = 1#

The edge is chosen based on its index.

EDGE_BY_LABEL = 2#

The edge is chosen based on its label.

class momba.gym.generic.Observations(value)[source]#

Specifies what is observable by the agent.

GLOBAL_ONLY = 1#

Only global variables are observable.

LOCAL_AND_GLOBAL = 2#

Local and global variables are observable.

OMNISCIENT = 3#

All (non-transient) variables are observable.

class momba.gym.generic.Rewards(goal_reached=100, dead_end=-100, step_taken=0, invalid_action=-100)[source]#

Specifies the rewards for a reachability objective.

dead_end = -100#

The reward when a dead end or bad state is reached.

goal_reached = 100#

The reward when a goal state is reached.

invalid_action = -100#

The reward when an invalid decision has been taken.

step_taken = 0#

The reward when a valid decision has been taken.

class momba.gym.generic.Objective(goal_predicate, dead_predicate)[source]#

A reach-avoid objective.

Represents a reach-avoid objective of the form \(\lnot\phi\mathbin{\mathbf{U}}\psi\), i.e., \(\lnot\phi\) has to be true until the goal \(\psi\) is reached. Used in conjunction with Rewards to provide rewards.

dead_predicate#

A boolean expression for \(\phi\).

goal_predicate#

A boolean expression for \(\psi\).