Skip to Content
Tableau Prep即学即用
book

Tableau Prep即学即用

by Carl Allchin
August 2022
Beginner to intermediate
463 pages
9h 22m
Chinese
China Electric Power Press Ltd.
Content preview from Tableau Prep即学即用
241
基于分组的数据清理
26
-
14
:按 Spelling(拼写)分组的结果
在这里你可以看到
3d
nburgh
已经和其他的爱丁堡值进行了分组,因为只有两个
字符与正确的拼写不同。
编写一个涵盖所有这些例子的计算方法需要花费很多时间,所以这个分组和替换功
能可以为你节省很多时间和精力。尽管这个数据字段现在已经可以进行分析了,但
如果你打算将这个字段加入一个源数据集中,其中的值仍然处于原始的“混乱”状态,
你必须小心。数据集连接操作在第
16
章和第
32
章中有所涉及,必须应用于同类值。
因此,这个数据集将不再完全连接到其先前的形式。了解数据集的需求,将决定在
工作流程中的哪个位置执行每个数据准备任务。幸运的是,有了
Prep Builder
,在你
迭代解决方案的过程中,可以快速、轻松地调整这种顺序。
26.4
小结
在使用
Tableau Prep
之前,将同类字符串进行分组涉及冗长的
IF
计算或大量的手动
选择。不幸的是,这些方法很少会随着不同数值的引入而更新。
Prep
使用分组和替
换算法将字符串分组到一起,不仅使最初的数据准备工作变得更容易,而且还能为
后续的处理流程提供保障。
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

深度学习:核心原理与案例分析

深度学习:核心原理与案例分析

Posts & Telecom Press, Ahmed Menshawy
Python金融实战

Python金融实战

Posts & Telecom Press, Yuxing Yan
Python机器学习案例精解

Python机器学习案例精解

Posts & Telecom Press, Yuxi (Hayden) Liu
HBase管理指南

HBase管理指南

Posts & Telecom Press, Yifeng Jiang

Publisher Resources

ISBN: 9787519864439