
Shannon vs. Bateson: Two Irreconcilable Definitions of Information
One bit that changed the war.
Shannon information measures how many bits travel across a channel; Bateson information measures whether those bits actually matter to someone. Confusing the two leads to deep misunderstandings about cognition, AI, and what it means to think.
Actions
The Source

EP 278 Peter Wang on AI, Copyright, and the Future of Intelligence
The Observer
Complexity science, Game B, social technology — systems thinking and civilizational design from the Santa Fe Institute
The Translation
AI-assisted summaryFamiliar terms
The conflation of Shannon Information with Bateson information represents one of the most consequential category errors in contemporary discussions of cognition and artificial intelligence. Shannon's information theory quantifies the reduction of uncertainty across a communication channel — it is a measure of transmission capacity, indifferent to meaning. Bateson's formulation — 'a difference that makes a difference' — is irreducibly relational and contextual. Information in the Batesonian sense exists only relative to an interpreting system with a history, a structure, and stakes.
Operation Fortitude during World War II provides a striking illustration. The Allied deception at Pas-de-Calais involved the generation and transmission of vast quantities of Shannon bits — fake radio traffic, inflatable tanks, double agents feeding false reports. Yet from the perspective of German strategic command, the operative information reduced to a single bit: real or fake? That bit, however, could only be evaluated against a deep hierarchical stack of military doctrine, intelligence methodology, and geopolitical reasoning accumulated over decades. The Shannon cost was trivial; the Bateson weight was world-historical.
This framework recontextualizes the recent claim that human conscious experience processes roughly ten bits per second. The experimental paradigms generating that figure force subjects into binary discrimination tasks, which by design measure Shannon throughput. But conscious cognition operates in a different register. An expert deploying a term like 'complexity' is invoking a compressed pointer into a vast semantic network — each Shannon bit carries enormous Batesonian payload. This is precisely why theoretical sophistication matters: it multiplies the effective work that limited conscious bandwidth can perform. Failing to distinguish these two registers — raw channel capacity and contextual meaning — systematically distorts how intelligence, human or artificial, is understood and evaluated.