Chapter 5 BIG DATA PLATFORMS AND OPERATING TOOLS

LEARNING OBJECTIVES

After completing this chapter, you should be able to do the following:

    Recognize which Big Data software tools are available for use.

    Identify the open-source software known as Hadoop.

    Recall the role of map reduce and R software.

INTRODUCTION

This chapter identifies a variety of Big Data platforms as well as the operating tools that can be used on those platforms. Chief among the tools is the operating system known as Hadoop. Hadoop is an open-source framework that many organizations have chosen to support their Big Data efforts. This chapter will concentrate on information technology terms that are necessary for accountants to have foundational understanding in Big Data applications.

BIG DATA CAPABILITIES

The first step in all Big Data is understanding what the organization hopes to achieve. There should be two discussions that occur.

First, the organization should conduct a strategic planning retreat. The main question that should be asked is: What is the long-term vision for the company as it relates to Big Data?

Next, the organization should conduct an information planning retreat. This discussion should focus on how the organization can achieve the strategy for the first step with existing resources (hardware, software, staff, and future budget).

Both of these conversations are necessary and should take place in two different planning meetings. One way to approach the needs of the ...

Get Analytics and Big Data for Accountants now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.