Apache Superset Quick Start Guide

Book description

Integrate open source data analytics and build business intelligence on SQL databases with Apache Superset. The quick, intuitive nature for data visualization in a web application makes it easy for creating interactive dashboards.

Key Features

  • Work with Apache Superset's rich set of data visualizations
  • Create interactive dashboards and data storytelling
  • Easily explore data

Book Description

Apache Superset is a modern, open source, enterprise-ready business intelligence (BI) web application. With the help of this book, you will see how Superset integrates with popular databases like Postgres, Google BigQuery, Snowflake, and MySQL. You will learn to create real time data visualizations and dashboards on modern web browsers for your organization using Superset.

First, we look at the fundamentals of Superset, and then get it up and running. You'll go through the requisite installation, configuration, and deployment. Then, we will discuss different columnar data types, analytics, and the visualizations available. You'll also see the security tools available to the administrator to keep your data safe.

You will learn how to visualize relationships as graphs instead of coordinates on plain orthogonal axes. This will help you when you upload your own entity relationship dataset and analyze the dataset in new, different ways. You will also see how to analyze geographical regions by working with location data.

Finally, we cover a set of tutorials on dashboard designs frequently used by analysts, business intelligence professionals, and developers.

What you will learn

  • Get to grips with the fundamentals of data exploration using Superset
  • Set up a working instance of Superset on cloud services like Google Compute Engine
  • Integrate Superset with SQL databases
  • Build dashboards with Superset
  • Calculate statistics in Superset for numerical, categorical, or text data
  • Understand visualization techniques, filtering, and grouping by aggregation
  • Manage user roles and permissions in Superset
  • Work with SQL Lab

Who this book is for

This book is for data analysts, BI professionals, and developers who want to learn Apache Superset. If you want to create interactive dashboards from SQL databases, this book is what you need. Working knowledge of Python will be an advantage but not necessary to understand this book.

Table of contents

  1. Title Page
  2. Copyright and Credits
    1. Apache Superset Quick Start Guide
  3. About Packt
    1. Why subscribe?
    2. Packt.com
  4. Foreword
  5. Contributors
    1. About the author
    2. About the reviewer
    3. Packt is searching for authors like you
  6. Preface
    1. Who this book is for
    2. What this book covers   
    3. To get the most out of this book
      1. Download the example code files
      2. Download the color images
      3. Conventions used
    4. Get in touch
      1. Reviews
  7. Getting Started with Data Exploration
    1. Datasets
    2. Installing Superset
    3. Sharing Superset
    4. Configuring Superset
    5. Adding a database
    6. Adding a table
    7. Creating a visualization
    8. Uploading a CSV
    9. Configuring the table schema
    10. Customizing the visualization
    11. Making a dashboard
    12. Summary
  8. Configuring Superset and Using SQL Lab
    1. Setting the web server
    2. Creating the metadata database
    3. Migrating data from SQLite to PostgreSQL
    4. Web server
      1. Gunicorn
    5. Setting up an NGINX reverse proxy
    6. Setting up HTTPS or SSL certification
    7. Flask-AppBuilder permissions
    8. Securing session data
    9. Caching queries
    10. Mapbox access token
    11. Long-running queries
    12. Main configuration file
    13. SQL Lab
    14. Summary
  9. User Authentication and Permissions
    1. Security features
    2. Setting up OAuth Google sign-in
    3. List Users page
    4. List Base Permissions page
    5. Views/Menus page
    6. List Permissions on Views/Menus pages
    7. Alpha and gamma – building blocks for custom roles
      1. Alpha
      2. Gamma
      3. Public
    8. User Statistics page
    9. Action log
    10. Summary
  10. Visualizing Data in a Column
    1. Dataset
    2. Distribution – histogram
    3. Comparison – relationship between feature values
    4. Comparison – box plots for groups of feature values
    5. Comparison – side-by-side visualization of two feature values
    6. Summary statistics – headline
    7. Summary
  11. Comparing Feature Values
    1. Dataset
    2. Comparing multiple time series
    3. Comparing two time series
    4. Identifying differences in trends for two feature values
    5. Summary
  12. Drawing Connections between Entity Columns
    1. Datasets
    2. Directed force networks
    3. Chord diagrams
    4. Sunburst chart
    5. Sankey's diagram
    6. Partitioning
    7. Summary
  13. Mapping Data That Has Location Information
    1. Data
    2. Scatter point
    3. Scatter grid
    4. Arcs
    5. Path
    6. Summary
  14. Building Dashboards
    1. Charts
      1. Getting started with Superset
      2. Visualizing data in a column
      3. Comparing feature values
      4. Drawing connections between entity columns
      5. Mapping data that has location information
    2. Dashboards
      1. Making a dashboard
      2. Selecting charts
      3. Separating charts into tabs
      4. Headlining sections using titles
      5. Inserting markdown
      6. Organizing charts in the dashboard layout
      7. Separating sections using dividers
    3. Summary
  15. Other Books You May Enjoy
    1. Leave a review - let other readers know what you think

Product information

  • Title: Apache Superset Quick Start Guide
  • Author(s): Shashank Shekhar
  • Release date: December 2018
  • Publisher(s): Packt Publishing
  • ISBN: 9781788992244