O'Reilly logo

HBase High Performance Cookbook by Ruchir Choudhry

Stay ahead with the world's most comprehensive technology and business learning platform.

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, tutorials, and more.

Start Free Trial

No credit card required

Extracting data from Oracle

HBase doesn't allow direct interaction or a pipeline for data import from Oracle and MySQL to HBase. The basic concept remains the same: to first extract the data into flat / text files (ImportTsv format), transform the data into HFiles, and then load them into HBase by telling the region server where to find them.

Getting ready

Let's start with getting public data from the following URL:

http://databank.worldbank.org/data/download/WDI_csv.zip

This will have the following files:

  • WDI_Data.csv
  • WDI_Country.csv (this is the file we will use)
  • WDI_Series.csv
  • WDI_CS_Notes.csv
  • WDI_ST_Notes.csv
  • WDI_Footnotes.csv
  • WDI_Description.csv

We will be using this as data and nothing else; this is freely available on the aforementioned World Bank ...

With Safari, you learn the way you learn best. Get unlimited access to videos, live online training, learning paths, books, interactive tutorials, and more.

Start Free Trial

No credit card required