Skip to Content
Tableau Prep即学即用
book

Tableau Prep即学即用

by Carl Allchin
August 2022
Beginner to intermediate
463 pages
9h 22m
Chinese
China Electric Power Press Ltd.
Content preview from Tableau Prep即学即用
325
36
处理自由文本
自由不一定是好事。在数据领域,自由文本就是一个很好的例子。然而,在自由文
本字段中输入的数据有很多好处。
36.1
什么是自由文本
自由文本是人们在系统和表格中输入的答案所产生的基于字符串的数据。调查问卷、
社交媒体帖子和对业务系统的评论都可能有自由文本数据点。其所产生的数据通常
存储在一列中,每个单元格有一个答案。根据定义,自由文本意味着其内容可能是
任何东西——绝对是任何东西,而这就是你所获取的数据。从脏话到俚语,你将在
数据中发现的词语可能是一个挑战,但自由文本是收集客户
/
员工真实声音的最佳
方式。
自由文本字段很可能包含冗长、散漫的句子,从而无法进行简单的分析。大多数条
目完全是独一无二的,所以不太可能通过简单地计算每种情况出现的频率来进行任
何有意义的分析。分析的价值在于提交的句子和段落中的单个词语
/ID
号。
36.2
为什么自由文本有用
通过抓取简单的调查结果,如强烈支持或反对某事的人的百分比,并不会带来很多
关于你的业务可以改进的洞察力。而听取客户自己的“声音”会比世界上最优秀的
分析师能够告诉你的信息更多。
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Read now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

深度学习:核心原理与案例分析

深度学习:核心原理与案例分析

Posts & Telecom Press, Ahmed Menshawy
Python金融实战

Python金融实战

Posts & Telecom Press, Yuxing Yan
Python机器学习案例精解

Python机器学习案例精解

Posts & Telecom Press, Yuxi (Hayden) Liu
HBase管理指南

HBase管理指南

Posts & Telecom Press, Yifeng Jiang

Publisher Resources

ISBN: 9787519864439