book

Communicating Data with Tableau

by Ben Jones

June 2014

Beginner to intermediate

334 pages

7h 35m

English

O'Reilly Media, Inc.

Read now

Unlock full access

Intended AudienceAssumptions This Book MakesContents of This BookConventions Used in This BookUsing Code ExamplesSafari® Books OnlineHow to Contact UsAcknowledgments
A Step in the ProcessA Model of CommunicationThree Types of Communication ProblemsSix Principles of Communicating DataPrinciple #1: Know Your GoalPrinciple #2: Use the Right DataPrinciple #3: Select Suitable VisualizationsWhat type of data do you have?What are the most effective types of visualizations for your data type?Principle #4: Design for AestheticsPrinciple #5: Choose an Effective Medium and ChannelPrinciple #6: Check the ResultsSummary
Using TableauMy Tableau StoryTableau ProductsConnecting to DataThe Tableau User InterfaceSheetsDashboardsThe toolbarData typesChanging data typesCalculated fieldsCreating visualizationsShow MeSummary
Communicating “How Much”An Example of How MuchComparing ComparisonsFine-Tuning the DefaultSortingThe Dot ChartCommunicating “How Many”A Tale of Two FormatsCounting DimensionsHistograms: How Many of How Much?Summary
RatiosTwo Ways of Adding RankRatesBlending Data SourcesVisualizing RatesSummary
Part-to-WholeIntroducing Filters and Quick FiltersIntroducing Table CalculationsProportions as Waterfall Charts Using GanttCurrent-to-HistoricalThe Bullet GraphReference LinesActual-to-TargetSummary
The Normal DistributionAn Example of “Normal” DataBox PlotsAn Example of “Non-Normal” DataSensitivity to OutliersVisualizing Typical Values of Non-Normal DistributionsSummary
Respecting VariationVisualizing VariationVariation Over Time: Control ChartsAnatomy of a Control ChartHow to Create a Control Chart in TableauThe quick methodThe rigorous methodUnderstanding UncertaintySummary
ScatterplotsWho Is Who?LabelsTooltipsAnnotationsMaking it ExploratoryAdding Background ImagesStacked BarsRegression and Trend LinesThe Quadrant ChartSummary
The Origin of Time ChartsThe Line ChartThe Dual-Axis Line ChartThe Connected ScatterplotThe Date Field Type and SeasonalityThe TimelineThe SlopegraphStep 1: Get the DataStep 2: Connect TableauStep 3: Create a Parameter and Matching Calculated FieldStep 4: Create the Basic SlopegraphStep 5: Add Line Coloring and ThicknessStep 6: Design the DashboardSummary

One Special MapCircle MapsAdding a Second EncodingWhen Marks MultiplyFilled MapsDual-Encoded MapsA Dual-Axis MapA Dual-Encoded Circle MapSummary
Maps with ShapesMaps Showing PathsPlotting Map Shapes Using AxesSummary
Dashboards in TableauA Word of Caution“Begin with the End in Mind”Types of DashboardsContext Is KingSummary
Building an Exploratory DashboardStep 1: DesignStep 2: SheetsMoving Things AroundStep 3: AnnotationsStep 4: Objects“Hover for More Info” iconsStep 5: ActionsQuick Filters on DashboardsDynamic Labels on DashboardsUsing Sheets as Filters on DashboardsHighlighting SheetsUsing Sheets to HighlightStep 6: FormattingSteps 7 and 8: Delivery and ResultsBuilding an Explanatory DashboardA Key Point to Explain: Nordic Countries in the LeadAnother Key Point to Explain: The Emergence of ChinaSummary
Animating DashboardsShowing Multiple TabsAdding Navigation with FiltersAdding Custom Header ImagesAdding Google Maps to DashboardsCreate the URLsAdding Dynamic Google Maps Satellite Images to Our DashboardAdding YouTube Videos to DashboardsSummary
TrainingExamplesBlogsOther Resources

Content preview from Communicating Data with Tableau

Chapter 6. Mean and Median

“...like a statistician who drowned in a lake of average depth six inches.”

—Anonymous

Try to think of a time when you listened to a presentation about data that didn’t include either an average or a median value. They’re almost as common as percentages. Whether we’re tracking home prices, the stock market, student test scores, or the price of gasoline, we come face to face with the notion of central tendency on a regular basis.

Why are they so commonly used? As humans, we have a hard time processing a simple list of more than a half dozen values, let alone reams and reams of raw data. The attractiveness of these measures of central tendency is that they condense a lot of data into digestible morsels that carry with them the notion of “typical.”

As useful as these statistics can be to communicate data, they need to be handled with care. In this chapter, we’ll see how they can be put to good use, but we’ll also see how they can mislead.

The three main measures of central tendency are mean, median, and mode. Let’s start with their definitions:

The mean (or average) is determined by summing all of the values in a data set and dividing by the number of values. The mean is considered a “representative value,” meaning if you replaced each value in the data set with the mean, the overall sum wouldn’t change.
The median is the middle value in a data set in which the values have been placed in order of magnitude. Thus, half the values in the data set are less than the ...