Information Theory on Polymarket: How 77-year old formula prints million for quants

0xDipper · @Dipper_pol · May 4

The 77-year-old formula that built the only hedge fund with zero losing years in 20 seasons - and why it still works on prediction markets today.

In 1986, *Barron's* ranked 1,026 mutual funds by performance. Claude Shannon's personal portfolio *outperformed 1,025 of them.*

Shannon wasn't a fund manager. He wasn't a Wall Street quant. He was the MIT professor who invented information theory - the math behind every JPEG, every modem, every compression algorithm on Earth. Between the late 1950s and 1986, his stock portfolio returned 28% annually. Over the same three decades, Warren Buffett's Berkshire Hathaway returned 27%.

Shannon beat Buffett using the same math he'd used to design telephone lines

This sounds like a coincidence. It isn't. Shannon's approach was methodical, derived directly from his own information theory papers. He wasn't picking stocks by gut. He was measuring the information content of every bet he took - in bits - and sizing positions accordingly.

And one of his colleagues took the method even further.

Enter Edward Thorp

Edward Thorp - math professor, blackjack legend - walked into Shannon's MIT office in the early 1960s with a card-counting system. Shannon listened, then handed him a Bell Labs paper by John Kelly titled *A New Interpretation of Information Rate.* The paper used Shannon's entropy to derive the mathematically optimal bet size for any game with positive edge.

Thorp took the math to Vegas. Won $11,000 in a single weekend at blackjack. Then he took it to Wall Street.

So why does this matter for Polymarket?

Most Polymarket traders measure edge in dollars. They track wins, ignore losses, and confuse luck with skill. Shannon and Thorp measured edge in bits of information - and built portfolios that compounded for decades while everyone else blew up on a single bad call.

*"Bits of information" isn't a metaphor. It's literally how the math works. A Polymarket position has a measurable information-theoretic edge before you place it *-* expressed as a real number, in bits.*

If you know how to compute that number, you know exactly how much alpha you're picking up per trade. If you don't, you're guessing. This article is about three specific tools from information theory that work on Polymarket right now:

1. Measuring edge in bits via KL-divergence - before you place the trade.

1. Fusing contradicting signals via max-entropy - without overfitting on backtests.

1. Detecting insiders via entropy collapse - before the news breaks.

The framework is 77 years old. It beat Vegas, then Wall Street, then bond markets, then options desks. Polymarket is just the newest venue.

By the end of this article, you'll be able to calculate your edge in bits on any open market on Polymarket — the same way Shannon calculated his in stocks and Thorp calculated his in warrants. That number will tell you whether to trade or walk away.

*Let's start with the math Shannon used.*

*What Is Entropy, In Plain English ?*

*Entropy is a measure of uncertainty. The higher it is, the less you know about the outcome.*

Example 1 - The Coin

A fair coin is 50/50. Entropy = 1 bit. That's the maximum for a binary event. An unfair coin 90/10 (almost always heads) has entropy of just 0.47 bits. Less uncertainty.

Example 2 - Polymarket

A "Will X happen?" market at 0.50 has entropy 1 bit. At 0.95 - just 0.29 bits. The closer to 0.50, the more unknown is packed into the market, the more information you extract when the outcome resolves.

Example 3 - Language

Letter E appears in 13% of English text, Z in 0.07%. English has ~4.1 bits/character. This is why zip-archives work - they exploit letter predictability. You can do the same with Polymarket prices: if you predict better than the market, you are literally compressing its uncertainty.

X - random variable (the market outcome) p(xᵢ) - probability of each outcome log₂ - log base 2 (that's why the answer is in bits) minus sign - makes the result positive (log of a probability is always negative)

*Entropy tells you how uncertain a market is. But how do you measure how much your estimate differs from the market? Enter KL-divergence.*

Use-Case I - Measuring Edge via KL-Divergence

*Thorp calculated a version of this number on every warrant he traded. Buffett uses it implicitly when sizing positions. On Polymarket, it has a clean name: KL-divergence.*

Polymarket has 200+ active markets at any moment. Which ones to trade? Common answer: *"the ones where I have edge."* But how do you quantify edge before placing?

*P* - your probability estimate *Q* - the market's estimate (the price) *D_KL* - literally how many bits of info you have that the market doesn't

*KL-divergence isn't a metaphor for edge. It literally is edge *-* measured in bits.*

Market: *"Will Trump pardon X by Dec 31?"*. Current price: 0.35 (market thinks 35% YES).

Your model (based on 20 historical pardons): 0.55.

Calculation:

DKL = 0.55 · log2(0.55 / 0.35) + 0.45 · log2(0.45 / 0.65) = 0.55 · 0.652 + 0.45 · (−0.530) = 0.359 − 0.239 = 0.120 bits

0.00 bits - No edge. Skip.
0.05 bits - Weak. Trap zone. Fees win.
0.10-0.20 - Real edge. Pro zone. Trade.
> 0.30 - You're wrong. Check your math.

Workflow

1. Pull all open markets via Polymarket API (or scrape).

1. Compute your probability estimate for each

1. Calculate D_KL for every market.

1. Sort descending by edge.

1. Trade top 5–10 only. Ignore the rest.

Common mistakes

KL is asymmetric: D_KL(P∥Q) ≠ D_KL(Q∥P). First arg is always your estimate.
Breaks on p=0 or p=1. Always clip to [0.01, 0.99].
High D_KL ≠ guaranteed profit. Fees, liquidity, slippage still matter.

*You can measure edge from a single signal. But what if you have five signals that contradict each other?*

Use-Case II - Max-Entropy Signal Fusion

You have several signals: news, a historical base rate, the order book, Twitter sentiment, other models' predictions. They contradict. How to combine?

Why naive approaches fail?

Averaging - throws away information, weights all signals equally even if one dominates.
ML ensembles - need tons of data, overfit on backtests, die in live trading.
Manual weights - subjective, collapse in new market regimes.

*When you have incomplete information, the only honest probability distribution is the one with maximum entropy under the known constraints.*

Don't sneak in hidden assumptions. Use only the facts you actually have. Leave everything else maximally uncertain.

Three signals on a single market:

Signal 1 (base model): p ≈ 0.40
Signal 2 (news scoring): p ≈ 0.60
Signal 3 (order book): p ≈ 0.50

Naive average: 0.50. Max-entropy, weighted by signal confidence (variance), gives a single distribution that satisfies all constraints and adds nothing else. No overfitting - because there's *nothing to overfit to*.

Max-entropy doesn't overfit. It *can't* - by construction. You cannot extract more information from a distribution than the constraints you fed it. Jaynes proved this in 1957.

*You can measure edge and fuse signals. One question left: how do you know if someone on the other side knows more than you?*

Use-Case III - The Insider Detector

Polymarket has insiders. Political markets - people know decisions before the press release. Sports - staff knows injuries. Person-specific markets - friends know plans. Can you detect them moving?

A normal market drifts slowly, reacting to small news flow. An insider moves differently: they know the outcome, open a large position, the price snaps in one direction, and entropy collapses sharply.

Build a time series of market entropy and watch its derivative. Insider signals show up *before* public news.

*dH/dt* - rate of change of entropy over time *σ_H* - historical std deviation of entropy changes *k *- threshold (typically 3 - 3-sigma rule)

Method, step-by-step

Compute market entropy H(t) every 5 minutes using current price.
Build time series of H(t).
Compute rolling derivative dH/dt.
Alert when |dH/dt| exceeds 3 standard deviations from historical mean.
Check: was there public news at that moment? If no - someone knows.

Oct 13, 2024 - market *"Will Trump win Pennsylvania?"*. Between 14:00 and 14:15 EST, price moves from 0.52 to 0.68 on large orders from a single wallet. Entropy drops from 0.998 to 0.905 bits in 15 minutes - a 7-sigma event. No public news at that moment. Next public news arrives 47 minutes later. Wallet nets ~$340k during that hour.

The SEC uses similar techniques to detect insider trading in equities - academic literature dates back to early 2000s. Almost nobody applies it to Polymarket, despite public data and *more* insiders.

*Everyone watches price. Quants watch entropy. It tells them an insider arrived *-* an hour before the news does.*

The Closer

$0 → measurable edge in 5 lines of Python.Three tools. One framework. 77 years old.

Edge in bits. KL-divergence quantifies your alpha before you place. If D_KL < 0.05, you're guessing. If D_KL > 0.10, you're trading. If D_KL > 0.30, your math is wrong.
No overfitting. Max-entropy fuses signals without inventing information. Jaynes proved it in 1957. ML ensembles still haven't caught up.
Insiders, not news. Entropy collapses minutes before the press release. Watch dH/dt or stay surprised.
The framework compounds. Shannon: 28%/yr for 30 years. Thorp: 19%/yr for 20 years. Zero losing seasons. Same math.

*The barrier was never intelligence. It was access. That barrier is gone.*

Follow for more @Dipper_pol

Recent discoveries

0xDipper@Dipper_pol·May 4

Information Theory on Polymarket: How 77-year old formula prints million for quants

1. Measuring edge in bits via KL-divergence - before you place the trade.

1. Fusing contradicting signals via max-entropy - without overfitting on backtests.

1. Detecting insiders via entropy collapse - before the news breaks.

Use-Case I - Measuring Edge via KL-Divergence

1. Pull all open markets via Polymarket API (or scrape).

1. Compute your probability estimate for each

1. Calculate D_KL for every market.

1. Sort descending by edge.

1. Trade top 5–10 only. Ignore the rest.

Use-Case II - Max-Entropy Signal Fusion

Use-Case III - The Insider Detector

The Closer

Recent discoveries

Information Theory on Polymarket: How 77-year old formula prints million for quants

The only Google Maps + Claude $10k+/week method you need

5 AI personas. $127,000/month. No models. No team. Claude runs the agency

How to Build & Sell AI Automations That Generate $10K Per Month (Full Course)

OnlyFans + Claude Code = $43,000 in 30 days. No camera. No team. The 4-file system runs alone

5 AI personas. $127,000/month. No models. No team. Claude runs the agency

21 Things most claude users have Never Set Up and It's Costing them Hours Every Week

Hermes + Polymarket - how i built a self-learning BTC up/down trading agent 100$ → 5000$ ( guide )

The .env Setup That Keeps Claude Code From Leaking Your Secrets (Full Config Included)

What to Learn, Build, and Skip in AI Agents (2026)

20 Claude Prompts that turn a $20 Subscription into a personal Assistant, Editor, Coach, and Analyst

69 Best Open-Source AI Repositories in April 2026

The Exact Blueprint To Make $650,000/Year (Quant Roadmap)

I sold 100 books on Amazon without writing a single word. $34,900 a month ( full guide )

How to Build a Multi-Agent Coding Stack in 2026 (Full Course)

Everyone can make $5,000/month in GTA. Here is how:

Hermes + Polymarket - how i built a self-learning BTC up/down trading agent 100$ → 5000$ ( guide )

The CLAUDE.md File That 10x'd My Output (Full File Included)

how to run 10 faceless accounts and build a $100k/m ai marketing business

Fortnite Paid Me $4,200 Last Month. I Haven't Opened The Game Once

40 Powerful AI Prompts for Claude That Produce Expert-Level Results

ChatGPT 5.5 found my first clients on Google Maps in 10 minutes. Now it's $15,000 a month

The Anatomy of a Perfect Skill: Reverse-Engineered from 100 Best Examples

How I'm Making $300/Day As a Teenager (Still at Uni)

The Exact Blueprint To Make $650,000/Year (Quant Roadmap)

If You're Not Using These 100 Repos, Delete CLAUDE CODE - a large collection of repositories

How to Give Claude Perfect Memory (complete guide)

Meta Open Sourced the Cheat Code to go Viral: I used it to make my client $1M

Scale your app to $480k/yr with AI influencers ( full guide )

How I'm Making $300/Day As a Teenager (Still at Uni)

$5K a Month If You're Lazy. $30K If You're Not. Here's How to Start Earning From AI Music on Spotify

Claude Code Masterclass: The Complete Guide From Zero to Shipping

how to run 10 faceless accounts and build a $100k/m ai marketing business

I FOUND 1,116 CLAUDE CODE SKILLS FROM 500+ REPOS SO YOU DON'T HAVE TO. [ALL LINKS]

How to build a Deep Researcher

40 Claude Cowork Commands, Workflows & Automations Most Users Don't Know — The Complete List

The solo founder stack of 2026

Every AI trend you need to know in 2026

How to literally escape your fate

Scale your app to $480k/yr with AI influencers ( full guide )

Claude is dumb by default, one file changes that

the complete guide to building skills for claude/codex

I Replaced Karpathy's LLM Wiki with Something That Actually Works

Don't be lazy to automate your work with routines

Hermes Agent: What People Are Actually Using It For (/Last30Days of Reddit, X, and YouTube)

Everyone can make $5,000/month in GTA. Here is how:

The Harness Is Everything: What Cursor, Claude Code, and Perplexity Actually Built

Anthropic hackathon winner automated his entire Workflow. Free repo replaces a $15K/Month Dev Team

How I built harness for my agent using Claude Code leaks

Building a Football Match Prediction System with Claude AI

THE ROOT CAUSE

Opus 4.7 is the first model that punishes bad prompting. here's how to correct it. (FULL GUIDE)

top 30 MCP servers that make your claude 100x more powerful (full links)

How to Build Services-as-Software Business

How to Automate TikTok Slideshow Content Creation with Claude Opus 4.7 (Step-by-Step Guide)

AI Agent Stack Everyone Must Use in 2026 (Builder’s Guide)

Top 67 Claude Skills That Turn a $20 Subscription Into a Full Dev Team - (Full Links)

5 pipelines I'd sell today using Claude Code (none of them are coding)

OpenClaw + GLM 5.1 = FREE AI AGENTS

How my $600 Mac Mini Runs a 35B AI Model

Anthropic hackathon winner automated his entire Workflow. Free repo replaces a $15K/Month Dev Team

Claude + Obsidian | How to use your second brain

The AI Businesses That Will Print Money in 2027 (That Nobody Is Building Yet)

How to become an AI Automation Builder in 6 months (RESOURCES)

Google engineer automated 80% of his work with Claude Code. here's the exact system he built

claude code + obsidian + skill graph. how to build a local research engine?