R大数据分析实用指南

by Posts & Telecom Press, Simon Walkowiak
May 2024
Intermediate to advanced
387 pages
6h 29m
Chinese
Packt Publishing

Overview

Learn R's core functionality and third-party packages, and master the key techniques of big data processing.

Key Features

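  • Challenges the prejudice that the R language does not support big data workflows and analytics
  • Offers hands-on experience integrating a variety of tools with R across every stage of the big data product cycle, from data import and management to advanced analytics and predictive modeling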

Book Description

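R is a powerful, open-source, functional programming language that can be used for a wide range of programming tasks. It is applied mainly in statistics and data analysis, machine learning, and high-performance computing. R has won recognition in many fields, and because it is open source and free it continues to grow.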

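Across nine chapters, the book progressively introduces the concept of big data and shows how to process data with R, how to create a Hadoop virtual machine, and how to build and deploy SQL databases. It also covers MongoDB, HBase, Spark, and Hive, and closes with a look at potential application scenarios for R.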

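The book is suited to intermediate data analysts, data engineers, statisticians, researchers, and data scientists. Readers need basic knowledge of data analysis, data management, and big data algorithms.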

What you will learn

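  • How to process data with R
  • How to create a Hadoop virtual machine
  • How to build and deploy SQL databases
  • Working with MongoDB, HBase, Spark, and Hive
  • Potential application scenarios for R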

Who this book is for

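This book is for intermediate data analysts, data engineers, statisticians, researchers, and data scientists who want and plan to combine their current or future big data analytics workflows with the R programming language. It assumes readers already have some experience with data analysis, data management, and big data algorithms, and may lack only the skills to use the open-source big data tools that work with R.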

Publisher Resources

ISBN: 9781836205791