Algorithms

Ramo currently contains more than 10 different algorithms capable of computing or learning (approximate) equilibria in multi-objective games. The following tables give an overview of our current algorithms, the equilibria they aim to find and the utility functions they support.

Computing Equilibria

The following table gives an overview of the current algorithms for computing equilibria from a given MONFG and utility functions.

Algorithm	Equilibrium	Utility functions
MOSE	Pure-strategy Nash equilibria	All
MOQUPS	Pure-strategy Nash equilibria	Quasiconvex

Learning Algorithms

This table show the learning algorithms currently included in Ramo. Please note that only the iterated best-response and fictitious play algorithms get access to their full individual payoff matrices. All other algorithms have to learn the payoffs together with an optimal strategy.

Algorithm	Equilibrium	Utility functions
Iterated best-response	Nash equilibrium	All
Fictitious play	Nash equilibrium	All
Independent Q-learners	Nash equilibrium	All
Independent actor-critic learners	Nash equilibrium	All
Joint-action Q-learners	Nash equilibrium	All
Joint-action actor-critic learners	Nash equilibrium	All
Cooperative action communication	Nash equilibrium	All
Self-interested action communication	Nash equilibrium	All
Cooperative policy communication	Nash equilibrium	All
Hierarchical communication	Nash equilibrium	All
Non-stationary agent	Leadership equilibrium	All
Best-response agent	Leadership equilibrium	All