6

Semi-Structured Data Manipulation

Data is generated by a wide range of systems, in an endless variety of shapes. Sometimes, the data you’ll need to work with won’t conform to the tabular structure of rows and columns that we tend to associate with relational databases. Semi-structured data refers to data that does not follow a strict tabular schema or data model. Such data may feature some or all the following properties:

  • Composite data structures involving multiple values within a single entry, such as sequences of values and groupings of named values within a single entry
  • Flexible schemas, both within composite values and in the shape of data across records
  • Nested data, where composite values contain further composite values, enabling the ...

Get Getting Started with DuckDB now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.