Minimax regret criterion (Savage criterion)

From Systems Analysis Wiki
Jump to navigation Jump to search

Savage's criterion (also known as the minimax regret criterion) is a method for decision-making under uncertainty. It is applied in situations where the probabilities of different outcomes are unknown, and the goal is to minimize potential losses resulting from making a non-optimal decision.

General Characteristics

Under conditions of uncertainty, the consequences of choosing each strategy are not precisely defined. A number of criteria are used to evaluate possible alternatives, such as the criteria of Wald, Hurwicz, Laplace, and Savage. Savage's criterion is not focused on achieving maximum profit, but on minimizing the maximum regret (losses compared to the best possible outcome).

Regret is a value that reflects the opportunity loss incurred because a non-optimal strategy was chosen for a specific outcome.

Algorithm for Applying Savage's Criterion

  • Constructing the payoff matrix: A table is created where rows correspond to possible strategies and columns correspond to possible outcomes (states of nature). The intersection records the expected result for a specific strategy and outcome.
  • Constructing the regret matrix (risk matrix): For each outcome (column), the maximum payoff value is determined. Then, for each cell, the amount of regret is calculated.
  • Determining the maximum regret for each strategy: In each row of the regret matrix, the maximum value is selected (the worst-case scenario for that strategy).
  • Choosing the optimal strategy: The strategy with the minimum maximum regret is chosen.

Thus, Savage's criterion implements the principle of minimizing the potential loss from an incorrect decision.

Mathematical Formulation

Let the following be given:

  • S={s1,s2,,sm} — the set of available strategies (alternatives).
  • Θ={θ1,θ2,,θn} — the set of possible states of nature.
  • u(si,θj) — the payoff (utility) function for choosing strategy si when state θj occurs. This is often represented by a payoff matrix A=[aij], where aij=u(si,θj).

Savage's criterion is based on the concept of regret or opportunity loss. The regret r(si,θj) for strategy si under state of nature θj is defined as the difference between the maximum possible payoff that could have been obtained for that state of nature θj (if the best strategy for that state had been chosen) and the actual payoff from strategy si.

The algorithm for applying Savage's criterion:

  1. Calculate the regret (risk) matrix:
    a) Find the maximum payoff for each state of nature (each column of the payoff matrix):
    uj*=maxk=1,,mu(sk,θj)=maxk=1,,makj
    This is the best possible result if state θj occurs.
    b) Calculate the elements of the regret matrix R=[rij]:**
    rij=r(si,θj)=uj*u(si,θj)=(maxk=1,,makj)aij
    The element rij shows how much the payoff from strategy si is less than the maximum possible payoff under state θj. All elements rij0.
  1. Find the maximum regret for each strategy: For each strategy si (each row of the regret matrix R), its worst possible outcome in terms of regret is determined:
    rimax=maxj=1,,nrij=maxj=1,,n((maxk=1,,makj)aij)
  1. Choose the strategy with the minimum maximum regret (the minimax regret principle): The strategy sSavage* that minimizes the found maximum regret is chosen:
    sSavage*=argmini=1,,m(rimax)=argminsiS(maxθjΘr(si,θj))
    Or, substituting the expression for rij:
    sSavage*=argmini=1,,m(maxj=1,,n[(maxk=1,,makj)aij])

The minimum value of the maximum regret achieved using Savage's criterion is: VSavage=mini=1,,m(rimax)=mini=1,,m(maxj=1,,nrij)

Thus, Savage's criterion aims to select the strategy that guarantees the smallest losses relative to the best possible action for each state of nature.


Key points in the mathematical formulation:

  • Definition of regret rij: This is the central concept. It is important to show that it is calculated as the difference between the best outcome in column j and the current outcome a_{ij}.
  • Regret matrix R: It is explicitly stated how it is constructed.
  • Finding rimax: The search for the maximum in each row of the regret matrix is shown.
  • Minimax principle: The choice of strategy is clearly formulated using argmin of the max of regrets.
  • Notation used: Standard for game theory and decision theory (S, Θ, u, a_ij, r_ij, max, min, arg min).


Advantages and Disadvantages

Advantages:

  • Focuses on minimizing risks.
  • Particularly effective under conditions of high uncertainty.

Disadvantages:

  • Ignores expected profit, focusing only on potential losses.
  • Can lead to overly conservative decisions.

Decision Criteria

  • Hurwicz's Criterion
  • Laplace's Criterion
  • Wald's Criterion