A free probabilistic framework for analyzing the transformer-based language models
Article Type
Research Article
Publication Title
Statistics and Probability Letters
Abstract
We present a formal operator-theoretic framework for analyzing Transformer-based language models using free probability theory. By modeling token embeddings and attention mechanisms as self-adjoint operators in a tracial W∗-probability space, we reinterpret attention as non-commutative convolution and describe representation propagation via free additive convolution. This leads to a spectral dynamic system interpretation of deep Transformers. We derive entropy-based generalization bounds under freeness assumptions and provide insight into positional encoding, spectral evolution, and representational complexity. This work offers a principled, though theoretical, perspective on structural dynamics in large language models
DOI
10.1016/j.spl.2025.110516
Publication Date
11-1-2025
Recommended Citation
Das, Swagatam, "A free probabilistic framework for analyzing the transformer-based language models" (2025). Journal Articles. 5202.
https://digitalcommons.isical.ac.in/journal-articles/5202