Philip Jama

Articles / Decision Science / Part 3

The Secretary Problem and Optimal Stopping

When to stop looking and commit: the 37% rule and beyond

Decision Science · Optimal Stopping · Probability · Python

You're hiring for a critical role. Candidates arrive one at a time; you can rank each against those you've already seen, but once you reject someone, they're gone forever. How do you maximize your chance of picking the best? This is the secretary problem, one of the most elegant results in probability and decision theory. The optimal strategy is surprisingly simple: observe the first ~37% of candidates without hiring anyone, then immediately hire the next candidate who is better than all of them.

The Classic Setup

The problem has clean assumptions:

  • There are n candidates, arriving in random order.
  • After each interview, you can rank the current candidate relative to all previous ones.
  • You must accept or reject each candidate immediately: no callbacks.
  • Your goal is to select the single best candidate (not just a good one).

Under these rules, a naive strategy (pick randomly) gives you a 1/n chance of getting the best. The optimal strategy does dramatically better.
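As a quick sanity check on that baseline (a minimal sketch, not part of the original simulations), picking one of n candidates uniformly at random succeeds with probability exactly 1/n:

```python
import random

random.seed(0)

def random_pick_success(n, trials=100_000):
    """Pick one of n candidates uniformly at random; success = picked the best (rank n - 1)."""
    return sum(random.randrange(n) == n - 1 for _ in range(trials)) / trials

print(random_pick_success(20))  # hovers near 1/20 = 0.05
```

Compare this to the ~0.368 the 37% rule achieves at any n, and the gap only widens as the pool grows.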

The 1/e Rule: Why 37%?

The optimal strategy is a look-then-leap rule:

  1. Look phase: Interview the first r candidates and reject them all, but remember the best candidate among them.
  2. Leap phase: From candidate r+1 onward, hire the first person who beats every candidate in the look phase.

The optimal cutoff r is approximately n/e (where e ≈ 2.718), which means you should observe about 37% of the pool before switching to selection mode. With this strategy, you select the best candidate with probability ~1/e ≈ 0.368 -- regardless of how large n is. That's a striking result: even with 1,000 candidates, you have a 37% chance of picking the absolute best by using this simple rule.

Figure: Success probability vs. look-phase cutoff, showing the 1/e optimum

Python source:
import math
import random
import matplotlib
matplotlib.use('Agg')
import matplotlib.pyplot as plt

random.seed(42)

FT_BG = '#FFF1E5'
FT_CLARET = '#990F3D'
FT_OXFORD = '#0F5499'
FT_TEAL = '#0D7680'
FT_MANDARIN = '#FF8833'

plt.rcParams.update({
    'figure.facecolor': FT_BG,
    'axes.facecolor': FT_BG,
    'savefig.facecolor': FT_BG,
    'font.family': 'sans-serif',
    'font.sans-serif': ['Helvetica Neue', 'Arial', 'sans-serif'],
    'axes.spines.top': False,
    'axes.spines.right': False,
})

def simulate_secretary(n, cutoff_frac, trials):
    """Estimate P(win) for a look-then-leap rule with a look phase of cutoff_frac * n."""
    successes = 0
    k = max(1, int(cutoff_frac * n))  # look-phase length (at least 1)
    for _ in range(trials):
        candidates = list(range(n))   # higher value = better candidate; best is n - 1
        random.shuffle(candidates)
        best_in_look = max(candidates[:k])
        selected = None
        for c in candidates[k:]:
            if c > best_in_look:      # first candidate beating the look-phase best
                selected = c
                break
        if selected == n - 1:         # win only if that candidate is the overall best
            successes += 1
    return successes / trials

configs = [
    (10, 5000, FT_OXFORD, 'n = 10'),
    (50, 3000, FT_TEAL, 'n = 50'),
    (200, 2000, FT_CLARET, 'n = 200'),
    (1000, 800, FT_MANDARIN, 'n = 1,000'),
]

fracs = [i / 50 for i in range(1, 50)]

fig, ax = plt.subplots(figsize=(9, 5))

for n, trials, color, label in configs:
    probs = [simulate_secretary(n, r, trials) for r in fracs]
    ax.plot(fracs, probs, color=color, linewidth=2, label=label, alpha=0.85)

inv_e = 1 / math.e
ax.axhline(y=inv_e, color='#999999', linewidth=1.2, linestyle='--')
ax.axvline(x=inv_e, color='#999999', linewidth=1.2, linestyle='--')
ax.text(0.72, inv_e + 0.012, f'1/e \u2248 {inv_e:.3f}', fontsize=10, color='#666666')

ax.set_xlim(0, 1)
ax.set_ylim(0, 0.5)
ax.set_xlabel('Look-phase cutoff fraction (r)', fontsize=11, color='#333333')
ax.set_ylabel('Success probability', fontsize=11, color='#333333')
ax.legend(fontsize=9, framealpha=0.9)

fig.text(0.5, 0.97, 'Secretary Problem: Optimal Look-Phase Cutoff',
         ha='center', fontsize=14, fontweight='bold', color='#333333')
fig.text(0.5, 0.935, 'Success probability by cutoff fraction for different pool sizes',
         ha='center', fontsize=10, color='#666666')
fig.text(0.02, 0.01, 'Source: Philip Jama via pjama.github.io',
         fontsize=8, color='#999999', ha='left')
fig.tight_layout(rect=[0, 0.03, 1, 0.92])
fig.savefig('secretary_simulation.png', dpi=150, bbox_inches='tight')

print('wrote secretary_simulation.png')

Intuition Behind the Proof

For a given cutoff r, the probability of selecting the best candidate is a sum over positions i > r: the term for position i is the probability that the best candidate sits at position i and that the best of the first i-1 candidates falls in the look phase (positions 1 through r), which is (1/n) · r/(i-1). Summing gives:

$$P(\text{win} \mid r) = \frac{r}{n} \sum_{i=r+1}^{n} \frac{1}{i-1}$$

As $n \to \infty$ with $r/n \to x$, this expression converges to $-x \ln x$. Maximizing over $x$ gives the optimal fraction $x = 1/e$, and the maximum probability likewise converges to $1/e$.
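The finite-n formula can also be evaluated exactly rather than simulated. This sketch (not from the original article) computes P(win | r) for every cutoff and confirms the optimum sits near n/e:

```python
import math

def win_probability(n, r):
    """Exact P(win | r) = (r/n) * sum_{i=r+1}^{n} 1/(i-1) for the look-then-leap rule."""
    if r == 0:
        # With no look phase you accept the first candidate, which is best w.p. 1/n.
        return 1 / n
    return (r / n) * sum(1 / (i - 1) for i in range(r + 1, n + 1))

n = 100
best_r = max(range(n), key=lambda r: win_probability(n, r))
# The optimal cutoff lands near n/e ≈ 36.8, with a win probability just above 1/e.
print(best_r, win_probability(n, best_r), n / math.e)
```

For small n the exact optimum can differ slightly from a blind int(n / e) cutoff, which is one reason the simulated curves above look flat near the peak.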

Real-World Applications

The secretary problem isn't just a mathematical curiosity -- it maps onto many sequential decision problems:

  • Hiring: Interviewing candidates for a role where you must decide on the spot.
  • House hunting: Viewing apartments or houses in a competitive market where offers must be immediate.
  • Parking: Driving past open spots, deciding when to commit vs. keep looking for something closer.
  • A/B test variants: Running sequential experiments where each variant costs traffic. Observe early results to calibrate expectations, then commit when a candidate outperforms the baseline by enough to act on.

In each case, the core tension is the same: explore too little and you miss better options; explore too long and the best options are gone.

Variants and Extensions

The classic problem has spawned a rich family of variants:

  • Unknown n: When you don't know how many candidates exist, the strategy adapts using a time-based cutoff.
  • Multiple choices: If you can hire k candidates instead of one, the optimal look phase shrinks.
  • Partial information: If you can see cardinal scores (not just rankings), threshold-based strategies can outperform the 1/e rule.
  • Cost of search: When each interview has a cost, the optimal strategy becomes more aggressive: you stop earlier because continued search has diminishing returns.
  • Satisficing: Herbert Simon's alternative: set an acceptability threshold and take the first candidate who clears it, rather than maximizing.

Figure: Comparison of success rates across secretary problem variants

Python source:
import math
import random
import matplotlib
matplotlib.use('Agg')
import matplotlib.pyplot as plt

random.seed(42)

FT_BG = '#FFF1E5'
FT_CLARET = '#990F3D'
FT_OXFORD = '#0F5499'
FT_TEAL = '#0D7680'
FT_CANDY = '#FF7FAA'
FT_MANDARIN = '#FF8833'

plt.rcParams.update({
    'figure.facecolor': FT_BG,
    'axes.facecolor': FT_BG,
    'savefig.facecolor': FT_BG,
    'font.family': 'sans-serif',
    'font.sans-serif': ['Helvetica Neue', 'Arial', 'sans-serif'],
    'axes.spines.top': False,
    'axes.spines.right': False,
})

TRIALS = 5000
N = 100

def classic_secretary(n, trials):
    k = max(1, int(n / math.e))
    wins = 0
    for _ in range(trials):
        c = list(range(n))
        random.shuffle(c)
        best_look = max(c[:k]) if k > 0 else -1
        for x in c[k:]:
            if x > best_look:
                if x == n - 1:
                    wins += 1
                break
    return wins / trials

def unknown_n(trials):
    # n is drawn fresh each trial; for simplicity the cutoff still uses the
    # realized n, so this only approximates a truly unknown-n setting.
    wins = 0
    for _ in range(trials):
        n = random.randint(10, 200)
        k = max(1, int(n / math.e))
        c = list(range(n))
        random.shuffle(c)
        best_look = max(c[:k])
        for x in c[k:]:
            if x > best_look:
                if x == n - 1:
                    wins += 1
                break
    return wins / trials

def multiple_choice(n, picks, trials):
    k = max(1, int(n / math.e))
    wins = 0
    for _ in range(trials):
        c = list(range(n))
        random.shuffle(c)
        best_look = max(c[:k]) if k > 0 else -1
        selected = []
        for x in c[k:]:
            if x > best_look:
                selected.append(x)
                if len(selected) == picks:
                    break
        if n - 1 in selected:
            wins += 1
    return wins / trials

def cost_of_search(n, cost_per_look, trials):
    k = max(1, int(n / math.e))
    wins = 0
    for _ in range(trials):
        c = list(range(n))
        random.shuffle(c)
        best_look = max(c[:k])
        for idx, x in enumerate(c[k:], start=k):
            if x > best_look:
                # Net payoff: normalized candidate quality minus accumulated search cost.
                payoff = x / (n - 1) - cost_per_look * idx
                if payoff > 0 and x == n - 1:
                    wins += 1
                break
    return wins / trials

def satisficing(n, threshold_frac, trials):
    # No look phase: accept the first candidate at or above the quality threshold.
    wins = 0
    threshold = int(n * threshold_frac)
    for _ in range(trials):
        c = list(range(n))
        random.shuffle(c)
        for x in c:
            if x >= threshold:
                if x == n - 1:
                    wins += 1
                break
    return wins / trials

results = [
    ('Classic (1/e rule)', classic_secretary(N, TRIALS), FT_OXFORD),
    ('Unknown n', unknown_n(TRIALS), FT_TEAL),
    ('Multiple choice (k=3)', multiple_choice(N, 3, TRIALS), FT_CLARET),
    ('Cost of search', cost_of_search(N, 0.005, TRIALS), FT_MANDARIN),
    ('Satisficing (top 10%)', satisficing(N, 0.9, TRIALS), FT_CANDY),
]

labels = [r[0] for r in results]
values = [r[1] for r in results]
colors = [r[2] for r in results]

fig, ax = plt.subplots(figsize=(9, 5))

y_pos = range(len(labels))
bars = ax.barh(y_pos, values, color=colors, alpha=0.85, height=0.6)

for bar, val in zip(bars, values):
    ax.text(bar.get_width() + 0.008, bar.get_y() + bar.get_height() / 2,
            f'{val:.3f}', va='center', fontsize=10, color='#333333')

ax.axvline(x=1/math.e, color='#999999', linewidth=1.2, linestyle='--')
ax.text(1/math.e + 0.008, len(labels) - 0.3, f'1/e \u2248 {1/math.e:.3f}',
        fontsize=9, color='#666666')

ax.set_yticks(y_pos)
ax.set_yticklabels(labels, fontsize=10, color='#333333')
ax.set_xlim(0, max(values) * 1.25)
ax.set_xlabel('Success probability', fontsize=11, color='#333333')
ax.invert_yaxis()

fig.text(0.5, 0.97, 'Secretary Problem Variants: Success Rate Comparison',
         ha='center', fontsize=14, fontweight='bold', color='#333333')
fig.text(0.5, 0.935, 'Simulated success probability under different problem formulations',
         ha='center', fontsize=10, color='#666666')
fig.text(0.02, 0.01, 'Source: Philip Jama via pjama.github.io',
         fontsize=8, color='#999999', ha='left')
fig.tight_layout(rect=[0, 0.03, 1, 0.92])
fig.savefig('variant_comparison.png', dpi=150, bbox_inches='tight')

print('wrote variant_comparison.png')

Connection to Optimal Stopping Theory

The secretary problem is the gateway to optimal stopping theory: the mathematics of deciding when to take an action to maximize reward. Related problems include:

  • The gambler's problem: When to cash out a sequence of gains.
  • Option pricing: The American option in finance is an optimal stopping problem.
  • Markov decision processes: Sequential decisions under uncertainty. The Bayesian updating from Part 1 (Online Experiments with a Bayesian Lens) and Part 2 (Bayesian Sample Efficiency) connects naturally here: each new observation updates your belief about whether you've found the best option.

Takeaway

The secretary problem teaches a deep lesson about the explore-exploit tradeoff: gathering information has value, but so does acting on it. The 37% rule gives a principled answer to "how long should you keep looking?" -- and the fact that it works regardless of the pool size makes it one of the most practically useful results in decision science.

The secretary problem shares the explore-exploit tension of Bayesian experimentation, but the mechanics differ: no priors, no posteriors, just ordinal rankings and an irrevocable accept-or-reject rule.
