Chapter 4. Parsing SQL
SQL (which stands for Structured Query Language and is usually pronounced sequel) is the most common language used to handle relational databases.[15] We’ll develop a SQL parser that produces a compact tokenized version of SQL statements.
This parser is based on the version of SQL used in the popular MySQL open source database. MySQL actually uses a bison parser to parse its SQL input, although for a variety of reasons this parser isn’t based on mySQL’s parser but rather is based on the description of the language in the manual.
MySQL’s parser is much longer and more complex, since this
pedagogical example leaves out many of the less heavily used parts.
MySQL’s parser is written in an odd way that uses bison to generate a C
parser that’s compiled by the C++ compiler, with a handwritten C++ lexer.
There’s also the detail that its license doesn’t allow excerpting in a
book like this one. But if you’re interested, it’s the file sql/sql_yacc.yy, which is part of the source
code at http://dev.mysql.com/downloads/mysql/5.1.html.
The ultimate definitions for SQL are the standards documents published by ANSI and ISO including ISO/IEC 9075-2:2003, which defines SQL, and a variety of related documents that define the way to embed SQL in other programming languages and in XML.
A Quick Overview of SQL
SQL is a special-purpose language for relational databases. Rather than manipulating data in memory, it manipulates data in database tables, referring to memory only incidentally. ...