A Mathematical Analysis and Simulation-Based Evaluation of Local Decision Rules in Skyjo

Felix Reichel

doi:10.20944/preprints202601.0112.v1

Submitted:

31 December 2025

Posted:

04 January 2026

You are already at the latest version

Abstract

Skyjo is a simple stochastic card game with partial information, local replacement decisions, and score-reducing column removal events. This paper develops a formal mathematical model of the game, derives expected-score rules for turn-level actions, proves several dominance and threshold results, and evaluates a family of heuristic strategies through Monte Carlo simulation. The focus here lies on local optimality under explicit belief assumptions rather than a full equilibrium solution of the multiplayer game. Finally a simulation code is provided for reproducibility.

Keywords:

Skyjo

;

stochastic games

;

partial information

;

expected value

;

Monte Carlo simulation

;

heuristic strategies

;

card games

Subject:

Computer Science and Mathematics - Probability and Statistics

MSC: Primary: 91A60; Secondary: 60G40, 68T20

1. Introduction

Games with partial information and stochastic draws present a recurring challenge in applied game theory. Skyjo is a representative example in which players repeatedly compare known outcomes against uncertain alternatives under score minimization. The game combines features of replacement games, stopping-time effects, and pattern-based removal incentives.

The contribution of this work is threefold:

a formal expected-score model for single-turn decisions,
provable dominance and threshold results under a specific deck-belief assumption,
empirical validation through large-scale Monte Carlo simulation.

The analysis does not claim a globally optimal multiplayer strategy. Instead, it isolates local decisions for which simple and rigorous comparison is then possible.

2. Basic Rules and Game Structure of Skyjo

Skyjo is a finite-horizon stochastic card game with partial information and score minimization. Although the official rules allow for multiple players, the strategic structure of the game can be analyzed at the level of a single representative player interacting with a shared deck and discard pile.

2.1. Card Set and Distribution

The game is played with a fixed multiset of numeric cards. Each card carries an integer value and contributes additively to the final score if not removed.

Definition 1 (Skyjo Card Multiset).

The Skyjo deck consists of 150 cards with the following value distribution:

# (- 2) = 5, # (- 1) = 10, # (0) = 15, # (v) = 10 for v = 1, \dots, 12

Negative-valued cards are scarce and therefore highly desirable, while large positive values create strong incentives for early replacement or column removal.

Figure 1. Card value distribution in the Skyjo deck.

2.2. Player Tableau

Each player maintains a private tableau consisting of a

3 \times 4

grid of cards. Cards may be either face-down (hidden) or face-up (revealed).

At the start of the round, all grid positions are filled with cards drawn uniformly from the deck, after which exactly two randomly chosen positions are revealed.

Definition 2 (Tableau State).

A tableau position is in one of three states:

Hidden: card value unknown to the player,
Revealed: card value known and fixed,
Removed: position cleared due to column completion.

Figure 2. Example player tableau with revealed and hidden cards.

2.3. Player’s Turn Structure

A player’s turn consists of a sequence of deterministic decisions following a stochastic draw.

(i) Draw decision: The player chooses between

drawing the top card of the discard pile (known value), or
drawing the top card of the draw pile (unknown value).

(ii) Replacement decision: After observing the drawn card, the player must either

discard it immediately, or
replace exactly one card in the tableau with the drawn card.

If a hidden card is replaced, it becomes revealed.

(iii) Column resolution: If, after replacement, all three cards in a column are revealed and equal in value, the entire column is removed from the tableau and contributes zero to the final score.

2.4. Round Termination and Scoring

A round ends when one player has revealed all remaining (non-removed) tableau positions. All other players receive exactly one additional turn.

Definition 3 (Final Score).

The final score of a player equals the sum of all remaining card values in the tableau. Lower scores are strictly preferred.

2.5. Strategic Implications

The Skyjo rules induce three interacting incentives:

replacement of large revealed values,
risk-reward trade-offs for hidden cards,
strong nonlinear payoffs from column completion.

These features make Skyjo a natural candidate for expected-value analysis under partial information, while remaining computationally tractable for Monte Carlo evaluation.

3. Game-Theoretic Model

We model Skyjo as a stochastic, imperfect-information, finite-horizon game. The analysis focuses on a representative player interacting with a shared environment, abstracting from opponent-specific signaling.

3.1. State Space

Let the game state at time

t

be

s_{t} = (G_{t}, R_{t}, D_{t})

where:

$G_{t} \in (Z \cup {\emptyset})^{12}$ is the player’s tableau,
$R_{t} \in {0,1}^{12}$ is the reveal indicator,
$D_{t}$ is the multiset of unseen cards (draw pile + hidden tableau cards).

Removed cards are denoted by

\emptyset

and contribute zero to the score.

3.2. Action Space

At each nonterminal state, the player chooses a composite action

a_{t} = (a_{t}^{draw}, a_{t}^{replace})

where:

$a_{t}^{draw} \in {$ draw pile, discard pile $}$ ,
$a_{t}^{replace} \in {$ discard $} \cup {1, \dots, 12}$ .

Replacement actions specify the tableau index to be replaced.

3.3. Transition Kernel

Transitions are governed by:

uniform draws from $D_{t}$ ,
deterministic tableau updates,
deterministic column-removal rules.

The resulting transition kernel

P (s_{t + 1} ∣ s_{t}, a_{t})

is fully specified by the deck composition and replacement rules.

3.4. Payoff Function

Terminal payoff is defined as

u (s_{T}) = - \sum_{i = 1}^{12} G_{T} (i),

where lower tableau sums correspond to higher utility.

The negative sign converts Skyjo into a utility maximization problem.

4. Mapping Rules to Simulation Code

This section establishes a precise correspondence between formal rules and the Python implementation provided in the Appendix.

4.1. Deck and Belief Model

Card multiset: create_draw_pile()
Deck-belief mean $μ (U)$ :

4.2. Turn Logic

The formal turn sequence:

draw → replace / discard → column removal

corresponds to the function:

4.3. Round Termination

The formal round-ending condition

\forall i : R (i) = 1 or G (i) = \emptyset

is implemented by:

5. Skyjo Rule Variants and Extensions

Several Skyjo variants can be incorporated without altering the core model.

5.1. Skyjo Action Variant

The Skyjo Action deck introduces special cards allowing:

additional draws,
card swaps,
forced reveals.

Formally, these correspond to temporary action-set expansions:

A_{t}^{Action} \supset A_{t} .

Expected-value analysis remains valid, but transition kernels become history-dependent.

5.2. Alternative Column Rules

Some house rules remove columns only after explicit confirmation. This replaces deterministic removal with a voluntary action, introducing a stopping-time component similar to optimal stopping problems.

6. Strategy Taxonomy

We classify Skyjo strategies by informational sophistication.

6.1. Naive Strategies

Always draw from the draw pile.
Replace a random hidden card.

These ignore both revealed information and deck composition.

6.2. Threshold Strategies

Parameterized by

t \in R

:

take discard if $x \leq t$ ,
replace revealed card if dominated,
otherwise replace hidden card if $y < μ (U)$ .

This class includes the strategy family studied in Section 9.

6.3. Column-Seeking Strategies

Augment threshold rules with:

priority for completing 2-of-a-kind columns,

preference ordering by value $v$ .

These strategies exploit Proposition 6.1 directly.

6.4. Dominance Relations

Proposition 1.

Within the class of belief-consistent strategies, discarding a card

y < μ (U)

without replacement is weakly dominated.

This explains the empirical failure of conservative discard policies.

7. Model Scope and Generalization

The framework extends naturally to:

larger tableaux,
asymmetric deck compositions,
replacement games with pattern-based removal.

Skyjo thus serves as a canonical example of local expected-value optimization in games with partial information and non-linear payoffs.

8. Deck belief state

Let

U

denote the multiset of unseen cards from the perspective of a fixed player. This includes the draw pile and all unrevealed grid cards.

Definition 4.

The deck-belief mean is

μ (U) = \frac{1}{| U |} \sum_{v \in U} v .

9. Expected value of hidden cards

Lemma 1. Under the deck-belief assumption, the expected value of a hidden grid card equals

μ (U)

.

Proof. Each hidden card was drawn uniformly from

U

. The expectation equals the arithmetic mean of the multiset.

◻

10. Replacement decisions

Proposition 2. Replacing a hidden card with a known card of value

y

reduces expected score if and only if

y < μ (U) .

Proof. The expected score change equals

y - μ (U)

. Negativity gives the condition.

◻

Proposition 3. If the discard pile shows a card

x < μ (U)

and at least one hidden card remains, taking the discard card and replacing a hidden card yields lower expected score than drawing from the unknown pile and discarding.

Proof. Discard replacement produces expected change

x - μ (U) < 0

. Drawing and discarding leaves the grid unchanged.

◻

Proposition 4. Given a drawn card of value

y

, replacing the revealed card with maximal value greater than

y

minimizes the immediate resulting score.

Proof. Replacing position

i

changes the score by

y - g_{i}

. Minimization is achieved by choosing the largest admissible

g_{i}

.

◻

11. Incentives of Column removal

Proposition 5.Suppose a column contains two revealed cards of value

v

and one hidden card. Completing the column yields expected score change

Δ (v) = - 3 * v + μ (U) .

Proof. Before completion, the expected contribution equals

2 * v + μ (U)

. After completion, the column contributes zero.

◻

Corollary 1.

Completion attempts for smaller

v

give larger expected score reduction.

12. Player’s Grid

Figure 3. Example grid with one highlighted column.

13. Turn decision structure

Figure 4. Action flow for a single turn.

14. Simulation design

Monte Carlo simulations were performed for a two-player self-play setting. Each player used the same heuristic strategy parameterized by a discard threshold

t

.

Strategy rules:

Take the discard card if its value $x \leq t$ .
Replace the largest revealed card exceeding the drawn value.
Otherwise, replace a hidden card if the drawn value is below $μ (U)$ .
If possible, target completion of a two-of-a-kind column.

Each data point consists of 3000 games, counting both players.

15. Simulation results

Table 1. Self-play outcomes for varying discard thresholds.

Threshold $x \leq t$	mean score	standard deviation
0	3.08	8.88
1	4.37	8.52
2	7.02	7.68
3	11.12	7.66
4	15.51	7.75

Figure 5. Mean score as a function of the discard threshold.

16. Discussion

The analytical results explain the monotonic behavior observed in simulation. The discard-threshold parameter directly controls whether a player replaces an expected-value draw with a known quantity. Since the deck-belief mean remains positive for most of the round, aggressive discard selection for small values leads to sustained score reduction.

Column completion introduces a linear dependence on the repeated value. This creates a strong preference ordering among otherwise similar replacement opportunities. Simulation outcomes match these structural predictions without requiring tuning beyond the threshold parameter.

17. Limitations

The belief model assumes uniform uncertainty over unseen cards and does not track opponent-specific information. Multiplayer interactions affect round termination timing, which is approximated here by a single additional turn. These simplifications isolate local decision quality rather than full equilibrium behavior.

18. Concluding remarks

This study identifies turn-level rules in Skyjo that admit exact mathematical comparison. The combination of closed-form expected-score expressions and Monte Carlo evidence explains why simple threshold-based play performs well. The framework generalizes to other replacement games with partial information and pattern-based removal.

Funding

Supported by Johannes Kepler Open Access Publishing Fund and the federal state Upper Austria.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A. Python Simulation Code

Appendix B. Technical Proofs

All results are derived under the deck-belief assumption and concern local, turn-level expected-score comparisons.

Appendix B.1. Belief Consistency and Linearity

Lemma 2 (Linearity of Expected Score). Expected tableau score is additive across positions.

Proof. Each tableau position contributes additively to the final score. Linearity of expectation implies that expected total score equals the sum of expected contributions of individual positions, independently of correlations.

This justifies all subsequent marginal comparisons.

Appendix B.2. Hidden Card Expectation

Lemma 3 (Expected Value of Hidden Cards).Under the deck-belief assumption, the expected value of a hidden card equals

μ (U)

.

Proof. Hidden cards are drawn uniformly from the multiset

U

. Therefore, the expected value equals the arithmetic mean of

U

by definition.

Appendix B.3. Replacement of Hidden Cards

Proposition 6 (Hidden Replacement Criterion). Replacing a hidden card with a known card of value

y

reduces expected score if and only if

y < μ (U)

.

Proof. The hidden card contributes expected value

μ (U)

. Replacing it with value

y

changes expected score by

y - μ (U)

. The change is negative if and only if

y < μ (U)

.

◻

Appendix B.4. Discard vs. Unknown Draw

Proposition 7 (Discard Dominance).If the discard pile shows a card

x < μ (U)

and at least one hidden card remains, then taking the discard card and replacing a hidden card yields lower expected score than drawing from the draw pile and discarding.

Proof. Replacing a hidden card with

x

changes expected score by

x - μ (U) < 0

. Drawing from the draw pile and discarding produces zero expected score change. Hence discard replacement strictly dominates.

◻

Appendix B.5. Replacement of Revealed Cards

Proposition 8 (Optimal Revealed Replacement).Given a drawn card of value

y

, replacing the revealed card with maximal value exceeding y minimizes the immediate resulting score.

Proof. Replacing revealed card

g_{i}

changes score by

y - g_{i}

. Among admissible replacements (

g_{i} > y

), this is minimized by choosing the largest

g_{i}

.

◻

Appendix B.6. Column Completion Incentives

Proposition 9 (Expected Column Removal Gain).Suppose a column contains two revealed cards of value

v

and one hidden card. Completing the column yields expected score change

Δ (v) = - 3 * v + μ (U) .

Proof. Before completion, the expected column contribution equals

2 * v + μ (U)

. After completion, it equals 0. The difference gives the stated expression.

◻

Corollary 2 (Value Ordering).For

v_{1} < v_{2}

, we have

Δ (v_{1}) > Δ (v_{2})

.

Proof.

Δ (v)

is strictly decreasing in

v

.

◻

Appendix B.7. Dominance of Conservative Discarding

Proposition 10 (Dominance of Replacement).Within the class of belief-consistent strategies, discarding a card

y < μ (U)

without replacement is weakly dominated.

Proof. Discarding produces zero expected score change. Replacing a hidden card produces expected change

y - μ (U) < 0

. Thus replacement weakly dominates discarding.

◻

Appendix B.8. Threshold Monotonicity

Proposition 11 (Threshold Monotonicity).Expected final score is weakly increasing in the discard threshold

t

.

Proof. A larger threshold weakly reduces the set of discard cards taken. This weakly increases reliance on unknown draws, which have expected value

μ (U)

. Thus expected score weakly increases.

Appendix B.9. Simulation Correctness

Proposition 12 (Implementation Fidelity).The simulation code in the Appendix correctly implements the formal model defined inSection 4.

Proof. Each component of the state, action space, transition kernel, and payoff function has a direct implementation: deck draws are uniform, replacements deterministic, and column removal rule exact. Termination conditions coincide.

Appendix B.10. Scope of Validity

All proofs rely solely on:

the deck-belief assumption,
linearity of expectation,
local one-step comparisons.

No claim is made regarding global equilibrium or multiplayer optimality.

References

Magilano GmbH. Skyjo Official Rules. Magilano, Germany, 2015. Available online: https://www. magilano.com. Magilano, Germany. Available online: https://www.
Skyjo Action Rulebook; Magilano GmbH: Magilano, Germany, 2023.
Ross, S.M. Introduction to Probability Models, 11th ed.; Academic Press: Amsterdam, The Netherlands, 2014. [Google Scholar]
Puterman, M.L. Markov Decision Processes: Discrete Stochastic Dynamic Programming; Wiley: Hoboken, NJ, USA, 1994. [Google Scholar]
Fishman, G.S. Monte Carlo: Concepts, Algorithms, and Applications; Springer: New York, NY, USA, 1996. [Google Scholar]
Osborne, M.J.; Rubinstein, A. A Course in Game Theory; MIT Press: Cambridge, MA, USA, 1994. [Google Scholar]
Ferguson, T.S. Optimal stopping and applications. Electronic Journal of Probability 2008. [Google Scholar]

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Copyright: This open access article is published under a Creative Commons CC BY 4.0 license, which permit the free download, distribution, and reuse, provided that the author and preprint are cited in any reuse.

A Mathematical Analysis and Simulation-Based Evaluation of Local Decision Rules in Skyjo

Abstract

Keywords:

Subject:

1. Introduction

2. Basic Rules and Game Structure of Skyjo

2.1. Card Set and Distribution

2.2. Player Tableau

2.3. Player’s Turn Structure

2.4. Round Termination and Scoring

2.5. Strategic Implications

3. Game-Theoretic Model

3.1. State Space

3.2. Action Space

3.3. Transition Kernel

3.4. Payoff Function

4. Mapping Rules to Simulation Code

4.1. Deck and Belief Model

4.2. Turn Logic

4.3. Round Termination

5. Skyjo Rule Variants and Extensions

5.1. Skyjo Action Variant

5.2. Alternative Column Rules

6. Strategy Taxonomy

6.1. Naive Strategies

6.2. Threshold Strategies

6.3. Column-Seeking Strategies

6.4. Dominance Relations

7. Model Scope and Generalization

8. Deck belief state

9. Expected value of hidden cards

10. Replacement decisions

11. Incentives of Column removal

12. Player’s Grid

13. Turn decision structure

14. Simulation design

15. Simulation results

16. Discussion

17. Limitations

18. Concluding remarks

Funding

Institutional Review Board Statement

Data Availability Statement

Conflicts of Interest

Appendix A. Python Simulation Code

Appendix B. Technical Proofs

Appendix B.1. Belief Consistency and Linearity

Appendix B.2. Hidden Card Expectation

Appendix B.3. Replacement of Hidden Cards

Appendix B.4. Discard vs. Unknown Draw

Appendix B.5. Replacement of Revealed Cards

Appendix B.6. Column Completion Incentives

Appendix B.7. Dominance of Conservative Discarding

Appendix B.8. Threshold Monotonicity

Appendix B.9. Simulation Correctness

Appendix B.10. Scope of Validity

References

MDPI Initiatives

Important Links

Subscribe