Skip to Content
Tableau Prep即学即用
book

Tableau Prep即学即用

by Carl Allchin
August 2022
Beginner to intermediate
463 pages
9h 22m
Chinese
China Electric Power Press Ltd.
Content preview from Tableau Prep即学即用
239
基于分组的数据清理
26
-
11
Preppin' Data
2019
:第
2
周挑战赛中城市字段的值列表
26
-
12
:采用 Pronunciation(发音)分组和替换选项
240
26
然而,并不是所有的数据都被正确地分组了。与概况窗格一样,分组和替换功能使
用直方图来显示主要的数值。
Edinburgh
(爱丁堡)和
London
(伦敦)的正确拼写,
也是这组中最常见的拼写,已经形成了两个主要的分组。通过选择这两个分组中的
一个——在本例中是爱丁堡,你可以看到
Prep Builder
已经分组的值。如果这些值
都不正确,你有两个选择:
通过移动分组标尺上的点来改变分组和替换功能的灵敏度。将点向负号方向移
动,以降低进行分组的算法的灵敏度。这意味着更多的数据可能被添加到一个
组中。反之,将点向正号方向移动,可提高算法的灵敏度,从而使包含的数据减少。
手动取消选择值。通过取消选中分组中的一个选择,你可以从分组中删除该值
以及所有相关记录。
因为基于发音的分组方式是根据字母在英语中的发音方式来工作的,而“
3d!nburgh
开头的
3
nodonL
重新排列的字母发音都不够相似,故而不算匹配。因此,首
先在
Group and Replace
(分组和替换)控件中点击
Done
(完成)来保存到目前为
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

深度学习:核心原理与案例分析

深度学习:核心原理与案例分析

Posts & Telecom Press, Ahmed Menshawy
Python金融实战

Python金融实战

Posts & Telecom Press, Yuxing Yan
Python机器学习案例精解

Python机器学习案例精解

Posts & Telecom Press, Yuxi (Hayden) Liu
HBase管理指南

HBase管理指南

Posts & Telecom Press, Yifeng Jiang

Publisher Resources

ISBN: 9787519864439