Algorithms
Ramo currently contains more than 10 different algorithms capable of computing or learning (approximate) equilibria in multi-objective games. The following tables give an overview of our current algorithms, the equilibria they aim to find and the utility functions they support.
Computing Equilibria
The following table gives an overview of the current algorithms for computing equilibria from a given MONFG and utility functions.
Algorithm |
Equilibrium |
Utility functions |
|---|---|---|
MOSE |
Pure-strategy Nash equilibria |
All |
MOQUPS |
Pure-strategy Nash equilibria |
Quasiconvex |
Learning Algorithms
This table show the learning algorithms currently included in Ramo. Please note that only the iterated best-response and fictitious play algorithms get access to their full individual payoff matrices. All other algorithms have to learn the payoffs together with an optimal strategy.
Algorithm |
Equilibrium |
Utility functions |
|---|---|---|
Iterated best-response |
Nash equilibrium |
All |
Fictitious play |
Nash equilibrium |
All |
Independent Q-learners |
Nash equilibrium |
All |
Independent actor-critic learners |
Nash equilibrium |
All |
Joint-action Q-learners |
Nash equilibrium |
All |
Joint-action actor-critic learners |
Nash equilibrium |
All |
Cooperative action communication |
Nash equilibrium |
All |
Self-interested action communication |
Nash equilibrium |
All |
Cooperative policy communication |
Nash equilibrium |
All |
Hierarchical communication |
Nash equilibrium |
All |
Non-stationary agent |
Leadership equilibrium |
All |
Best-response agent |
Leadership equilibrium |
All |