Semantic Rate-Distortion for Bounded Multi-Agent Communication: Capacity-Derived Semantic Spaces and the Communication Cost of Alignment

Computer Vision & MultiModal AI
Published: arXiv: 2604.09521v1

Abstract

When two agents of different computational capacities interact with the same environment, they need not compress a common semantic alphabet differently; they can induce different semantic alphabets altogether. We show that the quotient POMDP $Q_{m,T}(M)$ - the unique coarsest abstraction consistent with an agent's capacity - serves as a capacity-derived semantic space for any bounded agent, and that communication between heterogeneous agents exhibits a sharp structural phase transition. Below a critical rate $R_{\text{crit}}$ determined by the quotient mismatch, intent-preserving communication is structurally impossible. In the supported one-way memoryless regime, classical side-information coding then yields exponential decay above the induced benchmark. Classical coding theorems tell you the rate once the source alphabet is fixed; our contribution is to derive that alphabet from bounded interaction itself. Concretely, we prove: (1) a fixed-$\varepsilon$ structural phase-transition theorem whose lower bound is fully general on the common-history quotient comparison; (2) a one-way Wyner-Ziv benchmark identification on quotient alphabets, with exact converse, exact operational equality for memoryless quotient sources, and an ergodic long-run bridge via explicit mixing bounds; (3) an asymptotic one-way converse in the shrinking-distortion regime $\varepsilon = O(1/T)$, proved from the message stream and decoder side information; and (4) alignment traversal bounds enabling compositional communication through intermediate capacity levels. Experiments on eight POMDP environments (including RockSample(4,4)) illustrate the phase transition, a structured-policy benchmark shows the one-way rate can drop by up to $19\times$ relative to the counting bound, and a shrinking-distortion sweep matches the regime of the asymptotic converse.

Paper Summary

Problem
When two agents with different computational capacities interact in the same environment, they face a fundamental communication challenge. Their capacity mismatch creates a barrier to effective communication, making it difficult for them to understand and act on each other's intentions.
Key Innovation
This research introduces a new approach to communication between agents with different capacities, called semantic rate-distortion theory. This theory recognizes that agents with different capacities inhabit different semantic spaces, even when acting in the same physical world. The key innovation is the use of quotient POMDPs (Partially Observable Markov Decision Processes) as a capacity-derived semantic space for each agent.
Practical Impact
The practical impact of this research is significant, as it provides a new framework for understanding and addressing the communication challenges between agents with different capacities. This is particularly relevant in applications such as human-AI collaboration, where humans and AI models may have different capacities and need to communicate effectively. The research also has implications for multi-agent systems, robotics, and other areas where agents with different capacities interact.
Analogy / Intuitive Explanation
Think of two agents with different capacities as living in different neighborhoods with different street signs and mapping systems. Even if they're trying to communicate, they may not be able to understand each other's directions because their mapping systems are different. The research shows that there's a critical rate (Rcrit) below which communication becomes impossible, and that agents need to find a way to bridge the gap between their semantic spaces to communicate effectively.
Paper Information
Categories:
cs.IT cs.AI
Published Date:

arXiv ID:

2604.09521v1

Quick Actions