Tableau Prep即学即用
by Carl Allchin
August 2022
Beginner to intermediate
463 pages
9h 22m
Chinese
China Electric Power Press Ltd.
Using History Tables
48.2.2 Relevance of Information
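Unless you are a shoe retailer, storing the shoe sizes of your employees or customers will do nothing for your analysis. On the one hand, keeping a record of the products customers have bought, and of when they joined and left a service or organization, is useful for understanding patterns of consumer behavior. On the other hand, too much information makes analysis harder and more time-consuming. The key, therefore, is to retain only the data that is relevant.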
48.2.3 Update Frequency
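Taking historical snapshots of a data set is useful, but do not take them too often. In many cases, even if your customers interact with you every day, a monthly view is enough to show behavioral patterns. The more frequently you update, the fewer movements you will notice between one snapshot and the next. On the other hand, if snapshots are not frequent enough to capture your customers' characteristics, you will have no data points to show the trend in the business those customers hold with you.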
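The idea of thinning frequent snapshots down to a monthly view can be sketched outside Tableau Prep. The following is a minimal Python illustration, not the book's method: the row layout (`snapshot_date`, `customer_id`, `balance`), the sample values, and the function name `monthly_snapshots` are all assumptions made for this example.

```python
from datetime import date

# Hypothetical daily snapshot rows: (snapshot_date, customer_id, balance).
daily_rows = [
    (date(2022, 1, 30), "C1", 100),
    (date(2022, 1, 31), "C1", 120),
    (date(2022, 2, 27), "C1", 90),
    (date(2022, 2, 28), "C1", 95),
    (date(2022, 1, 31), "C2", 40),
]


def monthly_snapshots(rows):
    """Keep only the latest row per customer per calendar month."""
    latest = {}
    for snapshot_date, customer_id, balance in rows:
        key = (customer_id, snapshot_date.year, snapshot_date.month)
        # Replace the stored row only if this one is more recent.
        if key not in latest or snapshot_date > latest[key][0]:
            latest[key] = (snapshot_date, customer_id, balance)
    # Sort by date, then customer, so the history table reads chronologically.
    return sorted(latest.values())


history = monthly_snapshots(daily_rows)
for row in history:
    print(row)
```

Here five daily rows reduce to three monthly rows, one per customer per month. Keeping the last row of each month is only one possible rule; an average or a first-of-month value might suit other analyses better.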
48.2.4 Level of Granularity
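One data point per group of customers? One data point per customer? You need to decide which data is relevant to the analysis you want to carry out. Whatever you decide, you will probably need to aggregate the data further for analysis. That is only possible when moving from more granular to less granular data, because aggregating means removing detail from the data set. When analyzing business patterns over time, think about the comparisons you are likely to make. This month against the same month last year? This quarter against the same quarter last year? That decision affects how much data your history table needs to retain.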
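To make the direction of aggregation concrete, here is a small Python sketch under stated assumptions: the customer-level rows, the field order, and the helper names `aggregate_to_group` and `year_over_year_change` are hypothetical and chosen only for illustration. It rolls customer-level rows up to one data point per customer group, then compares a month with the same month a year earlier.

```python
from collections import defaultdict

# Hypothetical customer-level history rows:
# (year, month, customer_group, customer_id, sales)
customer_rows = [
    (2021, 8, "Retail", "C1", 100.0),
    (2021, 8, "Retail", "C2", 50.0),
    (2022, 8, "Retail", "C1", 120.0),
    (2022, 8, "Retail", "C2", 60.0),
]


def aggregate_to_group(rows):
    """Sum sales per (year, month, group), discarding customer-level detail."""
    totals = defaultdict(float)
    for year, month, group, _customer_id, sales in rows:
        totals[(year, month, group)] += sales
    return dict(totals)


def year_over_year_change(group_totals, year, month, group):
    """Relative change against the same month one year earlier, or None."""
    current = group_totals.get((year, month, group))
    previous = group_totals.get((year - 1, month, group))
    if current is None or previous is None or previous == 0:
        return None  # the history table does not reach back far enough
    return (current - previous) / previous


group_totals = aggregate_to_group(customer_rows)
change = year_over_year_change(group_totals, 2022, 8, "Retail")
print(group_totals)
print(change)
```

Once the rows are summed per group, the per-customer values cannot be recovered from `group_totals`, which is why the roll-up only works from more granular to less granular. The `None` result for a missing prior year shows why the comparisons you plan determine how far back the history table has to go.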
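All of these choices may change over time, but by building a history table you give yourself the opportunity to carry out analysis that might otherwise not be possible.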
48.3 Performance
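The relevance, frequency, and granularity of the data all affect performance, both when you build the analysis and when you use it. Data software is getting faster and faster at processing large data sets. For history tables, however, the challenge is keeping the data set small and concise enough to join to what may already be a large data set ...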
ISBN: 9787519864439