book

Building Programming Language Interpreters

Name: Building Programming Language Interpreters
Author: Daniel Ruoso
ISBN: 9781837638079

by Daniel Ruoso

January 2026

Intermediate to advanced

372 pages

8h 14m

English

Packt Publishing

Read now

Unlock full access

Contributors
Join us on Discord!
Preface
Who this book is forWhat this book coversTo get the most out of this bookDownload the example code filesGet in touchFree Benefits with Your BookHow to Unlock
Modeling the Programming Language Runtime Environment
Defining the Scope
Free Benefits with Your BookTechnical requirementsWhy do we keep creating new languages?Domain-specific languagesCompilers and interpretersThe exercise we will complete in this bookHow will the interpreter be integrated?SummaryGet This Book’s PDF Version and Exclusive Extras
The Blurred Lines Between Native Code, Virtual Machines, and Interpreters
Modern ISAs are virtual machines of sortsNative programming languages also have a virtual machine modelInterpreters as virtual machinesAbstraction of complexity versus execution overheadHow JIT compilers change the performance calculationDeciding which model our language will useSummaryGet This Book’s PDF Version and Exclusive Extras
Instructions, Concurrency, Inputs, and Outputs
Instructions as building blocksOperator stack versus registersInterpreter stack versus language stackInterruptions to the control flowContinuations across native and interpreted codeConcurrency modelDeciding on the execution modelSummaryGet This Book’s PDF Version and Exclusive Extras
Native Types, User Types, and Extension Points
Different types of type systemsStatic typing versus dynamic typingStrong typing versus weak typingLanguage types versus user-defined typesNominal typing versus structural typingDuck typingApproaches to modeling the type systemThe native types for your languageHow the users will declare their own typesInteraction with native extensionsSummaryGet This Book’s PDF Version and Exclusive Extras
Putting It All Together: Making Trade-Off Decisions
Designing the execution modelInteracting with inputs and outputsIdentifying continuations that are ready to executePerforming interpreted operationsMaking native callbacksConcurrency in an embedded languageMemory managementMemory access patternsManaging mutabilityData ownershipLife cycle managementFinalizing my designSummaryGet This Book’s PDF Version and Exclusive Extras
Modeling the Programming Language Syntax
Review of Programming Language Paradigms
Imperative programmingFunctional programmingDeclarative programmingLogic programmingChoosing a paradigm for our languageSummaryGet This Book’s PDF Version and Exclusive Extras

Values, Containers, and the Language Meta-Model
Variables and valuesValues and containersMutabilityCopies, references, and shared valuesThe language meta-modelSummaryGet This Book’s PDF Version and Exclusive Extras
Lexical Scopes
The lexical padFunction lexical scopesBlock lexical scopesClosure lexical scopesGlobal lexical scopesSummaryGet This Book’s PDF Version and Exclusive Extras
Putting It All Together and Creating a Coherent Vision
The shape of a declarative languageSpecifying the empty programHello World!Hello who?Immutability, containers, and valuesValue typesContainer typesVariables and referencesThe final language designSummaryGet This Book’s PDF Version and Exclusive Extras
Implementing the Interpreter Runtime
Initialization and Entry Point
The scaffoldingThe global interpreter stateThe interpreted program and the interpreterThe operation tree and the mutable interpreter stateExecuting the operations in the treeMaking it friendly to be embeddedRefactoring for multiple types of operationsDesigning the interface for I/OSummaryGet This Book’s PDF Version and Exclusive Extras
Execution Frames, the Stack, and Continuations
Code as a valueInvoking a callable valueMaking a functionCalling a functionThe end-to-end exampleSummaryGet This Book’s PDF Version and Exclusive Extras
Running and Testing Language Operators
Identifying and implementing operatorsI/O operatorsList operatorsTesting the operatorsFinalizing the integrationSummaryGet This Book’s PDF Version and Exclusive Extras
Interpreting Source Code
Lexing: Turning Text into a Stream of Tokens
Identifying token types and their data structuresSpecifying the rules of the lexerEvaluating different lexer librariesGetting a stream of tokensTesting the tokenizerSummaryGet This Book’s PDF Version and Exclusive Extras
Parsing: Turning a Stream of Tokens into a Parse Tree
Types of parse nodes and their data structuresSpecifying the grammar for the parserEvaluating different parser librariesBuilding a grammar engine in C++Getting the parse treeSummaryGet This Book’s PDF Version and Exclusive Extras
Analyzing: Turning a Parse Tree into an Abstract Syntax Tree
Difference between a parse tree and an abstract syntax treeModeling the node types and their data structuresTransforming one tree into anotherSummaryGet This Book’s PDF Version and Exclusive Extras
Generating: Turning an Abstract Syntax Tree into Instructions
Matching AST nodes to interpreter operationsIntroducing the TransitionLookAhead operationIntroducing StateMachineOperationGenerating the code for StateMachineOperationGenerating the entire programIntegrating into the interpreterExecuting code in our programming language for the first timeSummaryGet This Book’s PDF Version and Exclusive Extras
Proving That It Works
Revisiting our goalsDomain-specific language (DSL) for specifying network protocolsSynchronous request–response protocolsTransfer of control at multiple pointsTransferring values to the native languageUsing captured values within the protocolChecking our acceptance criteriaInitialize the interpreter onceMake the interpreter drive the interaction with the networkTransfer control at specific pointsWorking through an exampleUnderstanding SMTPExpressing SMTP in the DSLImplementing an SMTP serverImplementing callbacksWas it worth it?SummaryGet This Book’s PDF Version and Exclusive Extras
Unlock Your Exclusive Benefits
Unlock this Book’s Free Benefits in 3 Easy StepsStep 1Step 2Step 3Need help?
Other Books You May Enjoy
Subscribe to Deep Engineering
Index

Content preview from Building Programming Language Interpreters

14 Parsing: Turning a Stream of Tokens into a Parse Tree

The sequence of tokens produced by the tokenizer is easier to work with than the raw text, but if you were to try and convert those tokens directly into operations, you’d find yourself struggling a lot. The reason for that is that most programming languages have a hierarchy of how the code is represented, and in order to understand the code, we need it to be represented at a higher level of abstraction.

In this chapter, I will focus on the process of converting a sequence of tokens into a parse tree, which is the name we give to the data structure representing how the code needs to be read. To do that, we will do the following:

Identify what data structures we need to represent the code ...

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Build Your Own Programming Language - Second Edition

Publisher Resources

ISBN: 9781837638079

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills

Building Programming Language Interpreters

by Daniel Ruoso

14

Parsing: Turning a Stream of Tokens into a Parse Tree

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.