Creating a Data-Driven Organization

by Carl Anderson

August 2015

Beginner

300 pages

7h 29m

English

O'Reilly Media, Inc.


Summary; Who Should Read This Book?; Chapter Organization; Conventions Used in This Book; Safari® Books Online; How to Contact Us; Acknowledgments
Data Collection; Data Access; Reporting; Alerting; From Reporting and Alerting to Analysis; Hallmarks of Data-Drivenness; Analytics Maturity; Overview
Facets of Data Quality; Dirty Data; Data Generation; Data Entry; Missing Data; Duplicates; Truncated Data; Units; Default Values; Data Provenance; Data Quality Is a Shared Responsibility
Collect All the Things; Prioritizing Data Sources; Connecting the Dots; Data Collection; Purchasing Data; How Much Is a Dataset Worth?; Data Retention
Types of Analysts; Data Analyst; Data Engineers and Analytics Engineers; Business Analysts; Data Scientists; Statisticians; Quants; Accountants and Financial Analysts; Data Visualization Specialists; Analytics Is a Team Sport; Skills and Qualities; Just One More Tool; Exploratory Data Analysis and Statistical Modeling; Database Queries; File Inspection and Manipulation; Analytics-org Structure
What Is Analysis?; Types of Analysis; Descriptive Analysis; Exploratory Analysis; Inferential Analysis; Predictive Analysis; Causal Analysis
Metric Design; Simple; Standardized; Accurate; Precise; Relative Versus Absolute; Robust; Direct; Key Performance Indicators; KPI Examples; How Many KPIs?; KPI Definitions and Targets
Storytelling; First Steps; What Are You Trying to Achieve?; Who Is Your Audience?; What’s Your Medium?; Sell, Sell, Sell!; Data Visualization; Choosing a Chart; Designing Elements of the Chart; Delivery; Infographics; Dashboards; Summary
Why A/B Test?; How To: Best Practices in A/B Testing; Before the Experiment; Running the Experiment; Other Approaches; Multivariate Testing; Bayesian Bandits; Cultural Implications
How Are Decisions Made?; Data-Driven, -Informed, or -Influenced?; What Makes Decision Making Hard?; Data; Culture; The Cognitive Barriers; Where Does Intuition Work?; Solutions; Motivation; Ability; Triggers; Conclusion

Open, Trusting Culture; Broad Data Literacy; Goals-First Culture; Inquisitive, Questioning Culture; Iterative, Learning Culture; Anti-HiPPO Culture; Data Leadership
Chief Data Officer; CDO Role; Secrets of Success; Future of the CDO Role; Chief Analytics Officer; Conclusion
Respect Privacy; Inadvertent Leakage; Practice Empathy; Provide Choice; Data Quality; Security; Enforcement; Conclusions
Analytics Organizations; Data Analysis & Data Science; Decision Making; Data Visualization; A/B Testing
Nearest Neighbor Type Problems; Relative Frequency Problems; Estimating Univariate Distribution Problems; Multivariate Problems
Value; Activation

Content preview from Creating a Data-Driven Organization

Chapter 3. Data Collection

Errors using inadequate data are much less than those using no data at all.

Charles Babbage

It’s difficult to imagine the power that you’re going to have when so many different sorts of data are available.

Tim Berners-Lee

In the previous chapter, I covered data quality and collecting data right. In this chapter, we switch focus to choosing the right data sources to consume and provision to the analysts. That is, collecting the right data. I’ll cover such topics as prioritizing which data sources to consume, how to collect the data, and how to assess the value that the data provides to the organization.

Collect All the Things

Imagine that you are rolling out a new checkout process on your website. You will want to know exactly how it is performing against your metrics—you will want to track conversion, basket size, and so on—but it will also be instructive and insightful to understand how it is being used. For instance, on some sites, “add to cart” is a painless single click, so a pattern of customer behavior might be to add a bunch of items to the cart as a holding area and then prune that down to their final choices before clicking the checkout submit button. On other sites, however, “add to cart” might involve multiple clicks, and removing items might be harder or ambiguous—in short, there is more friction—so that customers essentially need to make their final decision before adding items to the cart. You can see why instrumenting the checkout ...
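As a rough illustration of the kind of instrumentation described above, the following Python sketch summarizes a small checkout event log into conversion, basket size, and how often customers prune their cart before submitting. It is not taken from the book: the event names (`add_to_cart`, `remove_from_cart`, `checkout_submit`), the tuple layout, and the `summarize_checkout` helper are all hypothetical, and the cart is simplified to a set of distinct items with no quantities.

```python
from collections import defaultdict

# Hypothetical event log: (session_id, event_name, item_id or None).
# Event names and fields are assumptions made for this illustration.
EVENTS = [
    ("s1", "add_to_cart", "A"),
    ("s1", "add_to_cart", "B"),
    ("s1", "add_to_cart", "C"),
    ("s1", "remove_from_cart", "B"),
    ("s1", "checkout_submit", None),
    ("s2", "add_to_cart", "A"),
    ("s3", "add_to_cart", "D"),
    ("s3", "checkout_submit", None),
]


def summarize_checkout(events):
    """Compute simple checkout metrics from an ordered event log."""
    carts = defaultdict(set)     # current distinct items per session
    removals = defaultdict(int)  # number of removals per session
    baskets = {}                 # basket size at checkout, per converting session
    sessions = set()

    for session_id, name, item_id in events:
        sessions.add(session_id)
        if name == "add_to_cart":
            carts[session_id].add(item_id)
        elif name == "remove_from_cart":
            carts[session_id].discard(item_id)
            removals[session_id] += 1
        elif name == "checkout_submit":
            baskets[session_id] = len(carts[session_id])

    converted = len(baskets)
    return {
        # Share of sessions that reached the checkout submit button.
        "conversion_rate": converted / len(sessions) if sessions else 0.0,
        # Mean number of distinct items in the cart at checkout.
        "avg_basket_size": sum(baskets.values()) / converted if converted else 0.0,
        # Mean removals among converting sessions: a crude signal of the
        # "add many items, then prune" behavior.
        "avg_removals_before_checkout": (
            sum(removals[s] for s in baskets) / converted if converted else 0.0
        ),
    }


if __name__ == "__main__":
    print(summarize_checkout(EVENTS))
```

On a low-friction site, one would expect the removal figure to be higher than on a site where customers decide before adding items. Comparing it across checkout designs is one way to see how the process is actually being used, beyond the headline conversion number.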


Publisher Resources

ISBN: 9781491916902
