book

Information Theory Meets Power Laws

Name: Information Theory Meets Power Laws
Author: Lukasz Debowski
ISBN: 9781119625278

by Lukasz Debowski

December 2020

Intermediate to advanced

384 pages

11h 54m

English

Wiley

Read now

Unlock full access

Cover
Information Theory Meets Power Laws
Copyright
dedication-page
Preface
Acknowledgments
Basic Notations
1 Guiding Ideas
1.1 The Motivating Question1.2 Further Questions About Texts1.3 Zipf's and Herdan's Laws1.4 Markov and Finite‐State Processes1.5 More General Stochastic Processes1.6 Two Interpretations of Probability1.7 Insights from Information Theory1.8 Estimation of Entropy Rate1.9 Entropy of Natural Language1.10 Algorithmic Information Theory1.11 Descriptions of a Random World1.12 Facts and Words Related1.13 Repetitions and Entropies1.14 Decay of Correlations1.15 Recapitulation
2 Probabilistic Preliminaries
2.1 Probability Measures2.2 Product Measurable Spaces2.3 Discrete Random Variables2.4 From IID to Finite‐State Processes
3 Probabilistic Toolbox
3.1 Borel ‐Fields and a Fair Coin3.2 Integral and Expectation3.3 Inequalities and Corollaries3.4 Semidistributions3.5 Conditional Probability3.6 Modes of Convergence3.7 Complete Spaces

4 Ergodic Properties
4.1 Plain Relative Frequency4.2 Birkhoff Ergodic Theorem4.3 Ergodic and Mixing Criteria4.4 Ergodic Decomposition
5 Entropy and Information
5.1 Shannon Measures for Partitions5.2 Block Entropy and Its Limits5.3 Shannon Measures for Fields5.4 Block Entropy Limits Revisited 5.5 Convergence of Entropy5.6 Entropy as Self‐Information
6 Equipartition and Universality
6.1 SMB Theorem6.2 Universal Semidistributions6.3 PPM Probability6.4 SMB Theorem Revisited 6.5 PPM‐based Statistics
7 Coding and Computation
7.1 Elements of Coding7.2 Kolmogorov Complexity7.3 Algorithmic Coding Theorems7.4 Limits of Mathematics7.5 Algorithmic Randomness
8 Power Laws for Information
8.1 Hilberg Exponents 8.2 Second Order SMB Theorem8.3 Probabilistic and Algorithmic Facts8.4 Theorems About Facts and Words
9 Power Laws for Repetitions
9.1 Rényi–Arimoto Entropies 9.2 Generalized Entropy Rates9.3 Recurrence Times9.4 Subword Complexity 9.5 Two Maximal Lengths9.6 Logarithmic Power Laws
10 AMS Processes
10.1 AMS and Pseudo AMS Measures10.2 Quasiperiodic Coding10.3 Synchronizable Coding 10.4 Entropy Rate in the AMS Case
11 Toy Examples
11.1 Finite and Ultrafinite Energy 11.2 Santa Fe Processes and Alike11.3 Encoding into a Finite Alphabet11.4 Random Hierarchical Association11.5 Toward Better Models
Future Research
Bibliography
Index
End User License Agreement

Content preview from Information Theory Meets Power Laws

1Guiding Ideas

This book concerns mathematical foundations of statistical language modeling, i.e. the question what kind of a probability distribution should be assigned to particular utterances of human languages. In this chapter, we will describe the core ideas of this book in a way which is less formalized mathematically, but more motivated linguistically. Based on the intuitions sketched in this chapter, in the following chapters, we will build rigorous mathematical constructions. The general goal is to develop a theory of discrete stochastic processes so that it would be able to account for certain statistical phenomena exhibited by human texts. The considered statistical phenomena take form of several power laws. We hope that if we were to succeed in a better modeling of these power laws, then in the long run, we may also obtain probabilistic models of language which are better in terms of performance measures used by engineers in computational linguistics. In other words, we hope that our quest for stochastic processes may turn out to be fruitful not only for purely theoretical interest but also for practical applications in engineering. We hope that the considered problems are also interesting enough on the theoretical side, and they can draw interest of professional mathematicians.

1.1 The Motivating Question

The fundamental question that motivates this book is

What kind of a statistical model may explain generation of texts in natural language, such as books, our ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Publisher Resources

ISBN: 9781119625278Purchase Link

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Information Theory Meets Power Laws

by Lukasz Debowski

1Guiding Ideas

1.1 The Motivating Question

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.