Chapter 12: Synthetic data

Abstract

Synthetic data is artificially generated information that imitates the characteristics of real-world data without duplicating real-world observations. Synthetic data has emerged as a promising solution to data scarcity, privacy concerns, and data bias within artificial intelligence (AI) training. This chapter examines the benefits and limitations of synthetic data in AI training. The advantages include generating large volumes of data on demand, mitigating overfitting, ensuring data privacy, and improving model generalization. However, the quality and integrity of synthetic data can affect the performance of AI models, and producing high-quality synthetic data requires substantial computational resources. Despite ...

Get Mechanism Design, Behavioral Science and Artificial Intelligence in International Relations now with the O’Reilly learning platform.

O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.