Book description
Offers a clear view of the utility and place for survey data within the broader Big Data ecosystem
This book presents a collection of snapshots from two sides of the Big Data perspective. It assembles an array of tangible tools, methods, and approaches that illustrate how Big Data sources and methods are being used in the survey and social sciences to improve official statistics and estimates for human populations. It also provides examples of how survey data are being used to evaluate and improve the quality of insights derived from Big Data.
Big Data Meets Survey Science: A Collection of Innovative Methods shows how survey data and Big Data can be used together to the benefit of one or both sources, with numerous chapters illustrating how survey data enrich the evaluation of Big Data sources. It also presents examples of machine learning, data mining, and other data science techniques applied at virtually every stage of the survey lifecycle. Topics covered include: Total Error Frameworks for Found Data; Performance and Sensitivities of Home Detection on Mobile Phone Data; Assessing Community Wellbeing Using Google Street View and Satellite Imagery; Using Surveys to Build and Assess Registration-Based Sample (RBS) Religious Flags; and more.
- Presents groundbreaking survey methods being utilized today in the field of Big Data
- Explores how machine learning methods can be applied to the design, collection, and analysis of social science data
- Filled with examples and illustrations that show how survey data benefits Big Data evaluation
- Covers methods and applications used in combining Big Data with survey statistics
- Examines regulations as well as ethical and privacy issues
Big Data Meets Survey Science: A Collection of Innovative Methods is an excellent book for both the survey and social science communities as they learn to capitalize on this new revolution. It will also appeal to the broader data and computer science communities looking for new areas of application for emerging methods and data sources.
Table of contents
- Cover
- List of Contributors
- Introduction
- Section 1: The New Survey Landscape
- 1 Why Machines Matter for Survey and Social Science Researchers: Exploring Applications of Machine Learning Methods for Design, Data Collection, and Analysis
- 1.1 Introduction
- 1.2 Overview of Machine Learning Methods and Their Evaluation
- 1.3 Creating Sample Designs and Constructing Sampling Frames Using Machine Learning Methods
- 1.4 Questionnaire Design and Evaluation Using Machine Learning Methods
- 1.5 Survey Recruitment and Data Collection Using Machine Learning Methods
- 1.6 Survey Data Coding and Processing Using Machine Learning Methods
- 1.7 Sample Weighting and Survey Adjustments Using Machine Learning Methods
- 1.8 Survey Data Analysis and Estimation Using Machine Learning Methods
- 1.9 Discussion and Conclusions
- References
- Further Reading
- 2 The Future Is Now: How Surveys Can Harness Social Media to Address Twenty‐first Century Challenges
- 2.1 Introduction
- 2.2 New Ways of Thinking About Survey Research
- 2.3 The Challenge with … Sampling People
- 2.4 The Challenge with … Identifying People
- 2.5 The Challenge with … Reaching People
- 2.6 The Challenge with … Persuading People to Participate
- 2.7 The Challenge with … Interviewing People
- 2.8 Conclusion
- References
- 3 Linking Survey Data with Commercial or Administrative Data for Data Quality Assessment
- Section 2: Total Error and Data Quality
- 4 Total Error Frameworks for Found Data
- 5 Measuring the Strength of Attitudes in Social Media Data
- 6 Attention to Campaign Events: Do Twitter and Self‐Report Metrics Tell the Same Story?
- 6.1 What Can Social Media Tell Us About Social Phenomena?
- 6.2 The Empirical Evidence to Date
- 6.3 Tweets as Public Attention
- 6.4 Data Sources
- 6.5 Event Detection
- 6.6 Did Events Peak at the Same Time Across Data Streams?
- 6.7 Were Event Words Equally Prominent Across Data Streams?
- 6.8 Were Event Terms Similarly Associated with Particular Candidates?
- 6.9 Were Event Trends Similar Across Data Streams?
- 6.10 Unpacking Differences Between Samples
- 6.11 Conclusion
- References
- 7 Improving Quality of Administrative Data: A Case Study with FBI's National Incident‐Based Reporting System Data
- 8 Performance and Sensitivities of Home Detection on Mobile Phone Data
- Section 3: Big Data in Official Statistics
- 9 Big Data Initiatives in Official Statistics
- 10 Big Data in Official Statistics: A Perspective from Statistics Netherlands
- 10.1 Introduction
- 10.2 Big Data and Official Statistics
- 10.3 Examples of Big Data in Official Statistics
- 10.4 Principles for Assessing the Quality of Big Data Statistics
- 10.5 Integration of Big Data with Other Statistical Sources
- 10.6 Disclosure Control with Big Data
- 10.7 The Way Ahead: A Chance for Paradigm Fusion
- 10.8 Conclusion
- References
- Further Reading
- 11 Mining the New Oil for Official Statistics
- 11.1 Introduction
- 11.2 Statistical Inference for Binary Variables from Nonprobability Samples
- 11.3 Integrating Data Source B Subject to Undercoverage Bias
- 11.4 Integrating Data Sources Subject to Measurement Errors
- 11.5 Integrating Probability Sample A Subject to Unit Nonresponse
- 11.6 Empirical Studies
- 11.7 Examples of Official Statistics Applications
- 11.8 Limitations
- 11.9 Conclusion
- References
- Further Reading
- 12 Investigating Alternative Data Sources to Reduce Respondent Burden in United States Census Bureau Retail Economic Data Products
- Section 4: Combining Big Data with Survey Statistics: Methods and Applications
- 13 Effects of Incentives in Smartphone Data Collection
- 14 Using Machine Learning Models to Predict Attrition in a Survey Panel
- 15 Assessing Community Wellbeing Using Google Street‐View and Satellite Imagery
- 15.1 Introduction
- 15.2 Methods
- 15.3 Application Results
- 15.4 Conclusions
- 15.A Amazon Mechanical Turk Questionnaire
- 15.B Pictures and Maps
- 15.C Descriptive Statistics
- 15.D Stepwise AIC OLS Regression Models
- 15.E Generalized Linear Models via Penalized Maximum Likelihood with k-Fold Cross-Validation
- 15.F Heat Maps - Actual vs. Model-Based Outcomes
- References
- 16 Nonparametric Bootstrap and Small Area Estimation to Mitigate Bias in Crowdsourced Data: Simulation Study and Application to Perceived Safety
- 16.1 Introduction
- 16.2 The Rise of Crowdsourcing and Implications
- 16.3 Crowdsourcing Data to Analyze Social Phenomena: Limitations
- 16.4 Previous Approaches for Reweighting Crowdsourced Data
- 16.5 A New Approach: Small Area Estimation Under a Nonparametric Bootstrap Estimator
- 16.6 Simulation Study
- 16.7 Case Study: Safety Perceptions in London
- 16.8 Discussion and Conclusions
- References
- 17 Using Big Data to Improve Sample Efficiency
- Section 5: Combining Big Data with Survey Statistics: Tools
- 18 Feedback Loop: Using Surveys to Build and Assess Registration‐Based Sample Religious Flags for Survey Research
- 19 Artificial Intelligence and Machine Learning Derived Efficiencies for Large‐Scale Survey Estimation Efforts
- 20 Worldwide Population Estimates for Small Geographic Areas: Can We Do a Better Job?
- Section 6: The Fourth Paradigm, Regulations, Ethics, Privacy
- 21 Reproducibility in the Era of Big Data: Lessons for Developing Robust Data Management and Data Analysis Procedures
- 22 Combining Active and Passive Mobile Data Collection: A Survey of Concerns
- 23 Attitudes Toward Data Linkage: Privacy, Ethics, and the Potential for Harm
- 24 Moving Social Science into the Fourth Paradigm: The Data Life Cycle
- 24.1 Consequences and Reality of the Availability of Big Data and Massive Compute Power for Survey Research and Social Science
- 24.2 Technical Challenges for Data‐Intensive Social Science Research
- 24.3 The Solution: Social Science Researchers Become “Data‐Aware”
- 24.4 Data Awareness
- 24.5 Bridge the Gap Between Silos
- 24.6 Conclusion
- References
- Index
- End User License Agreement
Product information
- Title: Big Data Meets Survey Science
- Author(s):
- Release date: September 2020
- Publisher(s): Wiley
- ISBN: 9781118976326