book

Information Theory Meets Power Laws

Name: Information Theory Meets Power Laws
Author: Lukasz Debowski
ISBN: 9781119625278

by Lukasz Debowski

December 2020

Intermediate to advanced

384 pages

11h 54m

English

Wiley

Read now

Unlock full access

Cover
Information Theory Meets Power Laws
Copyright
dedication-page
Preface
Acknowledgments
Basic Notations
1 Guiding Ideas
1.1 The Motivating Question1.2 Further Questions About Texts1.3 Zipf's and Herdan's Laws1.4 Markov and Finite‐State Processes1.5 More General Stochastic Processes1.6 Two Interpretations of Probability1.7 Insights from Information Theory1.8 Estimation of Entropy Rate1.9 Entropy of Natural Language1.10 Algorithmic Information Theory1.11 Descriptions of a Random World1.12 Facts and Words Related1.13 Repetitions and Entropies1.14 Decay of Correlations1.15 Recapitulation
2 Probabilistic Preliminaries
2.1 Probability Measures2.2 Product Measurable Spaces2.3 Discrete Random Variables2.4 From IID to Finite‐State Processes
3 Probabilistic Toolbox
3.1 Borel ‐Fields and a Fair Coin3.2 Integral and Expectation3.3 Inequalities and Corollaries3.4 Semidistributions3.5 Conditional Probability3.6 Modes of Convergence3.7 Complete Spaces

4 Ergodic Properties
4.1 Plain Relative Frequency4.2 Birkhoff Ergodic Theorem4.3 Ergodic and Mixing Criteria4.4 Ergodic Decomposition
5 Entropy and Information
5.1 Shannon Measures for Partitions5.2 Block Entropy and Its Limits5.3 Shannon Measures for Fields5.4 Block Entropy Limits Revisited 5.5 Convergence of Entropy5.6 Entropy as Self‐Information
6 Equipartition and Universality
6.1 SMB Theorem6.2 Universal Semidistributions6.3 PPM Probability6.4 SMB Theorem Revisited 6.5 PPM‐based Statistics
7 Coding and Computation
7.1 Elements of Coding7.2 Kolmogorov Complexity7.3 Algorithmic Coding Theorems7.4 Limits of Mathematics7.5 Algorithmic Randomness
8 Power Laws for Information
8.1 Hilberg Exponents 8.2 Second Order SMB Theorem8.3 Probabilistic and Algorithmic Facts8.4 Theorems About Facts and Words
9 Power Laws for Repetitions
9.1 Rényi–Arimoto Entropies 9.2 Generalized Entropy Rates9.3 Recurrence Times9.4 Subword Complexity 9.5 Two Maximal Lengths9.6 Logarithmic Power Laws
10 AMS Processes
10.1 AMS and Pseudo AMS Measures10.2 Quasiperiodic Coding10.3 Synchronizable Coding 10.4 Entropy Rate in the AMS Case
11 Toy Examples
11.1 Finite and Ultrafinite Energy 11.2 Santa Fe Processes and Alike11.3 Encoding into a Finite Alphabet11.4 Random Hierarchical Association11.5 Toward Better Models
Future Research
Bibliography
Index
End User License Agreement

Content preview from Information Theory Meets Power Laws

4Ergodic Properties

According to a preformal intuition, the probability of any event can be defined as the limiting relative frequency of this event in an infinite sequence of repeated experiments. In probability theory, this sort of a statement can be formulated as a theorem, the law of large numbers. Namely, for a sequence of independent identically distributed (IID) real random variables, their averages tend to their common expectation almost surely when the number of averaged random variables tends to infinity – see Problem 3.8, where we have discussed the Hoeffding inequality. This result is the main motivation for the frequentist interpretation of probability. As we have noted in the introductory Section 1.6, another well‐known interpretation of probability is called Bayesian, and in this interpretation, probabilities are odds of an intelligent agent making predictions.

Linguists may rightly object that there are no repeatable experiments and no probabilistically independent variables in language, so the frequentist interpretation of probability need not be valid for natural language. Partly accepting this point of view, information theorists investigating the phenomenon of human communication sought for some generalizations of the law of large numbers for dependent stochastic processes. They found a plausible generalization in ergodic theory, a branch of mathematics that sprung up from pondering over origins of randomness and probability in classical mechanics, a branch ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9781119625278Purchase Link

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Information Theory Meets Power Laws

by Lukasz Debowski

4Ergodic Properties

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.