HP Vertica Essentials

Book Description

Learn to deploy, administer, and manage HP Vertica, one of the most robust MPP solutions around

In Detail

With the rise of Massively Parallel Processing (MPP) and NewSQL databases, many users are unsure which MPP technology to choose. HP Vertica is gaining traction as a major MPP technology. Its distributed architecture allows fast query processing and is highly fault-tolerant, which makes it one of the most sought-after MPP databases today.

HP Vertica Essentials helps you learn day-to-day administration activities in a step-by-step format. You will start by learning how to install Vertica, then move on to managing and monitoring it. You will learn about the different backup and restore techniques, and then about the concept of projections in Vertica. Finally, you will explore various techniques to improve performance and to bulk load data into Vertica. By the end of this book, you will be able to install, manage, and monitor Vertica efficiently.

What You Will Learn

  • Successfully install Vertica
  • Manage clusters in Vertica
  • Create backups and restore databases
  • Master different techniques to improve performance
  • Understand the concept of projections in Vertica
  • Make effective projections in Vertica
  • Monitor performance in Vertica
  • Get to grips with the concept of bulk loading in Vertica

Downloading the example code for this book

You can download the example code files for all Packt books you have purchased from your account at http://www.PacktPub.com. If you purchased this book elsewhere, you can visit http://www.PacktPub.com/support and register to have the files e-mailed directly to you.

Table of Contents

  1. HP Vertica Essentials
    1. Table of Contents
    2. HP Vertica Essentials
    3. Credits
    4. About the Author
    5. About the Reviewers
    6. www.PacktPub.com
      1. Support files, eBooks, discount offers, and more
        1. Why subscribe?
        2. Free access for Packt account holders
        3. Instant updates on new Packt books
    7. Preface
      1. What this book covers
      2. What you need for this book
      3. Who this book is for
      4. Conventions
      5. Reader feedback
      6. Customer support
        1. Errata
        2. Piracy
        3. Questions
    8. 1. Installing Vertica
      1. Understanding the preinstallation steps
        1. Swap space
        2. Dynamic CPU frequency scaling
        3. Understanding disk space requirements
      2. Steps to install Vertica
      3. Summary
    9. 2. Cluster Management
      1. Comprehending the elastic cluster scaling factor
        1. Enabling and disabling an elastic cluster
        2. Viewing and setting the scaling factor settings
        3. Enabling and disabling local segmentation
        4. Understanding the best practices in cluster management
        5. Monitoring elastic cluster rebalancing
      2. Adding nodes in Vertica
        1. Method
          1. Using the Management Console to add nodes
        2. Adding nodes using administration tools
      3. Removing nodes in Vertica
        1. Lowering the K-safety level
        2. Removing nodes using administration tools
        3. Removing nodes using the Management Console
        4. Removing hosts from a cluster
      4. Replacing nodes
        1. Replacing a node using the same name and IP address
        2. Replacing a failed node using a different name and IP address
        3. Redistributing configuration files to nodes
        4. Using administration tools to replace nodes with different names and IP addresses
      5. Changing the IP addresses of a Vertica cluster
      6. Summary
    10. 3. Monitoring Vertica
      1. Monitoring through the system tables
        1. Understanding a system table example
      2. Looking at events
        1. Looking at events through logfiles
        2. Looking at events through the ACTIVE_EVENTS system table
      3. Monitoring Vertica through the Management Console
      4. Retaining monitoring information
        1. Enabling and disabling Data Collector
        2. Viewing the current data retention policy
        3. Configuring data retention policies
      5. Monitoring data collection components
      6. Summary
    11. 4. Backup and Restore
      1. Requirements for backup hosts
      2. Generating the vbr.py configuration file
        1. Miscellaneous settings
        2. Database access settings
        3. Data transmission during the backup process
        4. Mapping
      3. Creating full and incremental backups
        1. Understanding the requirements
        2. Running vbr.py
        3. Incremental snapshots
        4. Creating schema and table snapshots
      4. Restoring full database snapshots
        1. Restoring from a specific snapshot
        2. Restoring from the most recent snapshot
        3. Restoring schema and table snapshots
        4. Copying a database from one cluster to another
          1. Copying the database
      5. Using database snapshot functions
        1. Creating database snapshots
        2. Removing snapshots
      6. Summary
    12. 5. Performance Improvement
      1. Understanding projections
        1. Looking into high availability and recovery
          1. Comprehending unsegmented projections
          2. Comprehending segmented projections
        2. Creating projections using Database Designer
          1. The comprehensive design
          2. The query-specific design
        3. Creating projections manually
          1. Column list and encoding
          2. The base query
          3. The sort order
          4. Segmentation
          5. Keeping K-safety (K-Safe) in mind
            1. Creating buddy projections
            2. A note on table partitioning
      2. Understanding the storage model in Vertica
        1. Tuple Mover operations
          1. Moveout
          2. Mergeout
          3. Tuning Tuple Mover
        2. Adding storage locations
          1. Adding a new location
          2. Measuring location performance
          3. Setting location performance
          4. Understanding storage location tweaking functions
            1. Altering
            2. Dropping
            3. Retiring storage locations
            4. Restoring retired storage locations
      3. Summary
    13. 6. Bulk Loading
      1. Using the COPY command
        1. Aborting the COPY command
      2. Load methods
      3. Data transformation
      4. Summary
    14. Index