Minimax regret criterion (Savage criterion)

Savage's criterion (also known as the minimax regret criterion) is a method for decision-making under uncertainty. It is applied in situations where the probabilities of different outcomes are unknown, and the goal is to minimize potential losses resulting from making a non-optimal decision.

General Characteristics

Under conditions of uncertainty, the consequences of choosing each strategy are not precisely defined. A number of criteria are used to evaluate possible alternatives, such as the criteria of Wald, Hurwicz, Laplace, and Savage. Savage's criterion is not focused on achieving maximum profit, but on minimizing the maximum regret (losses compared to the best possible outcome).

Regret is a value that reflects the opportunity loss incurred because a non-optimal strategy was chosen for a specific outcome.

Algorithm for Applying Savage's Criterion

Constructing the payoff matrix: A table is created where rows correspond to possible strategies and columns correspond to possible outcomes (states of nature). Each cell records the payoff obtained when that strategy is chosen and that outcome occurs.
Constructing the regret matrix (risk matrix): For each outcome (column), the maximum payoff value is determined. Then, for each cell, the regret is calculated as that column maximum minus the payoff in the cell.
Determining the maximum regret for each strategy: In each row of the regret matrix, the maximum value is selected (the worst-case scenario for that strategy).
Choosing the optimal strategy: The strategy with the minimum maximum regret is chosen.

Thus, Savage's criterion implements the principle of minimizing the potential loss from an incorrect decision.
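
The four steps above can be sketched in Python as follows. This is a minimal illustration, not a reference implementation: the payoff numbers are invented for the example, and the variable names (`payoffs`, `column_best`, `regrets`, `max_regrets`, `chosen`) are assumptions that do not come from the source.

```python
# Step 1: hypothetical payoff matrix (invented numbers).
# Rows are strategies, columns are states of nature.
payoffs = [
    [50, 30, 10],  # strategy 1
    [40, 40, 20],  # strategy 2
    [20, 30, 60],  # strategy 3
]

# Step 2: best payoff in each column, then regret = column best - payoff.
column_best = [max(column) for column in zip(*payoffs)]
regrets = [
    [best - value for value, best in zip(row, column_best)]
    for row in payoffs
]

# Step 3: worst-case (maximum) regret of each strategy, one per row.
max_regrets = [max(row) for row in regrets]

# Step 4: pick the row whose maximum regret is smallest (zero-based index).
chosen = min(range(len(payoffs)), key=lambda i: max_regrets[i])

print(column_best)  # [50, 40, 60]
print(regrets)      # [[0, 10, 50], [10, 0, 40], [30, 10, 0]]
print(max_regrets)  # [50, 40, 30]
print(chosen)       # 2, i.e. the third strategy
```

With these numbers the third strategy is selected: its payoff can fall at most 30 short of the best achievable result, whichever state occurs.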

Mathematical Formulation

Let the following be given:

$S = \{s_{1}, s_{2}, \dots, s_{m}\}$ — the set of available strategies (alternatives).
$Θ = \{θ_{1}, θ_{2}, \dots, θ_{n}\}$ — the set of possible states of nature.
$u (s_{i}, θ_{j})$ — the payoff (utility) function for choosing strategy $s_{i}$ when state $θ_{j}$ occurs. This is often represented by a payoff matrix $A = [a_{i j}]$ , where $a_{i j} = u (s_{i}, θ_{j})$ .

Savage's criterion is based on the concept of regret or opportunity loss. The regret $r (s_{i}, θ_{j})$ for strategy $s_{i}$ under state of nature $θ_{j}$ is defined as the difference between the maximum possible payoff that could have been obtained for that state of nature $θ_{j}$ (if the best strategy for that state had been chosen) and the actual payoff from strategy $s_{i}$ .

The algorithm for applying Savage's criterion:

Calculate the regret (risk) matrix:
a) Find the maximum payoff for each state of nature (each column of the payoff matrix):

$u_{j}^{*} = \max_{k = 1, \dots, m} u (s_{k}, θ_{j}) = \max_{k = 1, \dots, m} a_{k j}$

This is the best possible result if state $θ_{j}$ occurs.

b) Calculate the elements of the regret matrix $R = [r_{i j}]$ :

$r_{i j} = r (s_{i}, θ_{j}) = u_{j}^{*} - u (s_{i}, θ_{j}) = (\max_{k = 1, \dots, m} a_{k j}) - a_{i j}$

The element $r_{i j}$ shows how much the payoff from strategy $s_{i}$ is less than the maximum possible payoff under state $θ_{j}$ . All elements $r_{i j} \geq 0$ .
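
As a small worked illustration of steps a) and b), with numbers assumed for the example rather than taken from the source, consider two strategies and two states of nature with payoff matrix

$A = \begin{pmatrix} 8 & 2 \\ 5 & 6 \end{pmatrix}$

The column maxima are $u_{1}^{*} = \max(8, 5) = 8$ and $u_{2}^{*} = \max(2, 6) = 6$ , so

$R = \begin{pmatrix} 8 - 8 & 6 - 2 \\ 8 - 5 & 6 - 6 \end{pmatrix} = \begin{pmatrix} 0 & 4 \\ 3 & 0 \end{pmatrix}$

Every column of $R$ contains at least one zero, located in the row of the strategy that is best for that state.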

Find the maximum regret for each strategy: For each strategy $s_{i}$ (each row of the regret matrix $R$ ), its worst possible outcome in terms of regret is determined:
$r_{i}^{\max} = \max_{j = 1, \dots, n} r_{i j} = \max_{j = 1, \dots, n} ((\max_{k = 1, \dots, m} a_{k j}) - a_{i j})$

Choose the strategy with the minimum maximum regret (the minimax regret principle): The strategy $s_{Savage}^{*}$ that minimizes the found maximum regret is chosen:
$s_{Savage}^{*} = \arg \min_{i = 1, \dots, m} (r_{i}^{\max}) = \arg \min_{s_{i} \in S} (\max_{θ_{j} \in Θ} r (s_{i}, θ_{j}))$

Or, substituting the expression for $r_{i j}$ :

$s_{Savage}^{*} = \arg \min_{i = 1, \dots, m} (\max_{j = 1, \dots, n} [(\max_{k = 1, \dots, m} a_{k j}) - a_{i j}])$

The minimum value of the maximum regret achieved using Savage's criterion is:
$V_{Savage} = \min_{i = 1, \dots, m} (r_{i}^{\max}) = \min_{i = 1, \dots, m} (\max_{j = 1, \dots, n} r_{i j})$

Thus, Savage's criterion aims to select the strategy that guarantees the smallest losses relative to the best possible action for each state of nature.
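
The substituted formula can be transcribed almost literally into code. The sketch below is one possible reading, under stated assumptions: the function name `savage` and the example matrices are hypothetical, indices are zero-based (so index 1 corresponds to $s_{2}$ ), and the function returns every strategy that attains the minimum, because $\arg \min$ need not be unique.

```python
def savage(a):
    """Return (V, minimizers) for a payoff matrix `a` given as a list of rows.

    V is the minimum over rows of the maximum regret; minimizers is the list
    of zero-based row indices that attain it.
    """
    m, n = len(a), len(a[0])
    # r_max[i] = max over j of [ (max over k of a[k][j]) - a[i][j] ]
    r_max = [
        max(max(a[k][j] for k in range(m)) - a[i][j] for j in range(n))
        for i in range(m)
    ]
    v = min(r_max)
    return v, [i for i, r in enumerate(r_max) if r == v]


# Hypothetical payoff matrices (invented numbers).
value, best = savage([[7, 1], [4, 4], [1, 7]])
print(value, best)  # 3 [1]

tie_value, tie_best = savage([[5, 0], [0, 5]])
print(tie_value, tie_best)  # 5 [0, 1]
```

Recomputing the column maximum inside the inner loop keeps the code close to the formula; for large matrices it would be cheaper to compute the column maxima once, as in step a).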

Key points in the mathematical formulation:

Definition of regret $r_{i j}$ : This is the central concept. It is the difference between the best payoff in column $j$ and the actual payoff $a_{i j}$ .
Regret matrix $R$ : It is built column by column, by subtracting each payoff from its column maximum $u_{j}^{*}$ .
Finding $r_{i}^{\max}$ : The maximum is taken over each row of the regret matrix.
Minimax principle: The strategy is chosen as the $\arg \min$ over strategies of the $\max$ over states of nature of the regrets.
Notation used: Standard for game theory and decision theory ( $S$ , $Θ$ , $u$ , $a_{i j}$ , $r_{i j}$ , $\max$ , $\min$ , $\arg \min$ ).

Advantages and Disadvantages

Advantages:

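Focuses on limiting the worst-case opportunity loss rather than on pursuing the highest payoff.
Requires no probability estimates, so it remains applicable when the probabilities of the states of nature are unknown.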

Disadvantages:

Ignores expected profit, focusing only on potential losses.
Can lead to overly conservative decisions.

Decision Criteria

Hurwicz's Criterion
Laplace's Criterion
Wald's Criterion
