Skip to Content
数据工程之道:设计和构建健壮的数据系统
book

数据工程之道:设计和构建健壮的数据系统

by Joe Reis, Matt Housley
February 2024
Intermediate to advanced
370 pages
7h
Chinese
China Machine Press
Content preview from 数据工程之道:设计和构建健壮的数据系统
243
8
查询、建模和转换
到目前为止,数据工程生命周期的各个阶段主要是将数据从一个地方转移到另一个地
方,或将其保存起来。在本章中,你将学习如何使数据变得有用。通过理解查询、建模
和转换(如图
8-1
所示),你会掌握将原始数据转化为下游利益相关者可用数据的工具。
DataOps
数据工程生命周期
生成
安全 数据管理 数据架构 编排 软件工程
反向 ETL
机器学习
分析
获取
转换 服务
存储
底层设计
8-1:数据转换使我们能够从数据中创造价值
我们首先讨论查询和它们背后的重要模式。其次,我们会看一下主要的数据建模方式,
你可以用它们把业务逻辑引入你的数据。再次,我们讨论转换,它将实现你的数据模型
的逻辑,并让查询结果对下游消费者更有用处。最后,我们将介绍你和谁一起工作,以
及与本章有关的底层设计。
SQL
NoSQL
数据库中,有多种多样的技术可以用来查询、建模和转换数据。本节
的重点是对数据仓库或数据湖等
OLAP
系统的查询。尽管存在许多用于查询的语言,在
本章的大部分内容中,我们将主要关注使用方便的同时也被很多人熟知的
SQL
,这是最
 
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

设计数据密集型应用程序

设计数据密集型应用程序

Martin Kleppmann
Understanding DeFi

Understanding DeFi

Alexandra Damsker
INSPIRED

INSPIRED

Marty Cagan

Publisher Resources

ISBN: 9787111745273