When Random Voting
Predicts Real Elections

by Ritam Pal

PRL 134, 017401 (2025)

Ritam Pal, Aanjaneya Kumar, M.S. Santhanam

The Puzzle

Elections Are Messy

Millions of people. Different cultures, languages, local issues, campaign strategies, media influence, caste dynamics, economic anxieties.

Every voter has their own reasons. Every election is unique.

But zoom out, and something strange happens.

The messy details wash away. A single, clean mathematical pattern emerges — the same one, every time, everywhere.

The Discovery

One Curve to Rule Them All

Take any election result. Compute the specific margin: the gap between winner and runner-up, divided by the total votes cast.

Now scale it by its average value across all constituencies.

Plot the distribution. Do this for India, South Korea, Germany, the United States. Municipal elections. Parliamentary elections. Different decades.

The Scaling

x = (M / T) / ⟨M / T⟩, where M = winner − runner-up, T = total votes

They all collapse onto the same curve.

The Model

The Random Voting Model

Imagine the dumbest possible model of an election: every voter picks a candidate completely at random. No preferences, no campaigns, no issues. Just dice rolls.

This "random voting model" produces a specific mathematical curve for the margin distribution:

Predicted Distribution (for 3 candidates)

P(x) = (1 - x)(5 + 7x) / [(1 + x)²(1 + 2x)²]

No free parameters. No fitting. Pure math derived from the structure of random competition.

The crazy part: real elections follow this curve.

The Intuition

Why Does This Work?

For the same reason the bell curve appears everywhere.

Height, IQ, measurement errors — the individual details are wildly different, but when you aggregate millions of small independent effects, the same shape always emerges. The microscopic details don't matter.

This idea has a name in physics: universality.

Completely different systems — magnets, fluids, elections — can produce identical statistical patterns. Not similar. Identical. Because the math only cares about a few structural features (how many candidates, the scaling), not the messy details underneath.

It's the same reason you don't need to simulate every water molecule to predict when water boils.

The Test

Does It Work on Real Data?

We tested the theory against actual Indian election data: 698 constituencies across 11 states, spanning 3 general elections (2009, 2014, 2019).

Log-log universality test showing election data collapsing onto the predicted curve

Log-log plot of the scaled margin distribution. The data points cluster tightly around the theoretical prediction.

The Result

It Works.

When you account for the number of effective candidates in each constituency, the data matches the theory. We used the Kolmogorov-Smirnov test — a statistical test that measures how well data fits a predicted distribution. High p-values mean good fit.

Uttar Pradesh 2019

p = 0.98

KS test p-value

West Bengal 2009

p = 0.49

KS test p-value

Rajasthan 2014

p = 0.74

KS test p-value

CDF comparison across multiple Indian states showing theory vs data

Cumulative distribution comparison: theory (curves) vs actual election results (data points) across multiple states.

Going Deeper

Constituency-Level Breakdown

Here's the fit tested at the individual constituency level — each point is one constituency in one election year. The theory holds across wildly different political landscapes.

Constituency-level RVM test results across Indian states and election years

A Surprise

The Finite-Size Puzzle

Big elections have millions of voters. But what about small voting booths with only 200 voters? Does universality hold there too?

We ran simulations from 10 to 10,000 voters and found something surprising:

The distributional shape converges faster than the information content.

Translation: by the time you have ~200 voters per booth, the universal curve already fits well (KS statistic ~0.03). You can trust the pattern before you can trust the individual data points.

The convergence follows a power law: KS ~ T^-0.73. The exponent -0.73 sits between naive statistics (-0.5) and perfect convergence (-1.0) — an open theoretical puzzle.

Application

A Tool for Detecting Election Fraud

If fair elections must produce this universal curve, then deviations signal something unusual.

The paper tested this against known cases:

1 Ethiopia 2010 — significant deviation from the universal curve. Independently documented as fraudulent.

2 Belarus 2004-2019 — persistent deviations across multiple elections. Consistent with widespread reports of rigging.

The mathematics gives us a baseline for "normal." No need to prove how fraud happened — just that the statistical fingerprint doesn't match what fair competition produces.

The Bigger Picture

Why This Matters Beyond Elections

Universality is one of the deepest ideas in physics. It says: the same math appears in completely different systems because the large-scale behavior depends only on a few structural features, not the microscopic details.

Phase transitions in magnets. The bell curve in statistics. Critical phenomena in fluids. And now: election margins.

The microscopic details don't matter.
The statistics are universal.

This is what makes physics beautiful — the unreasonable effectiveness of simple models in explaining complex systems.

Thank You

Paper PRL 134, 017401 (2025)
Authors Ritam Pal, Aanjaneya Kumar, M.S. Santhanam
Data Harvard Dataverse
Analysis Summer
Website summerslog

Build Sesh / OpenClaw · March 2026

When Random VotingPredicts Real Elections

Elections Are Messy

One Curve to Rule Them All

The Random Voting Model

Why Does This Work?

Does It Work on Real Data?

It Works.

Constituency-Level Breakdown

The Finite-Size Puzzle

A Tool for Detecting Election Fraud

Why This Matters Beyond Elections

Thank You

When Random Voting
Predicts Real Elections