Seven Databases in Seven Weeks, 2nd Edition

by Luc Perkins, Eric Redmond, Jim Wilson
April 2018
Beginner to intermediate
360 pages
8h 54m
English
Pragmatic Bookshelf

Day 2: Working with Big Data

With Day 1’s table creation and manipulation under our belts, it’s time to start adding some serious data to our wiki table. Today, you’ll script against the HBase APIs, ultimately streaming Wikipedia content right into our wiki! Along the way, you’ll pick up some performance tricks for making import jobs faster. Finally, you’ll poke around in HBase’s internals to see how it partitions data into regions, a design that serves both performance and disaster recovery goals.

Importing Data, Invoking Scripts

One common problem people face when trying a new database system is how to migrate data into it. Handcrafting Put operations with static strings, as you did in Day 1, is all well and good, but you can do better. ...

ISBN: 9781680505962