Book description
Jump-start your career as a data scientist—learn to develop datasets for exploration, analysis, and machine learning
SQL for Data Scientists: A Beginner's Guide for Building Datasets for Analysis is a resource dedicated to the Structured Query Language (SQL) and dataset design skills that data scientists use most. Aspiring data scientists will learn how to construct datasets for exploration, analysis, and machine learning. You will also discover how to approach query design and develop SQL code to extract data insights while avoiding common pitfalls.
You may be one of many people who are entering the field of data science from a range of professions and educational backgrounds, such as business analytics, social science, physics, economics, and computer science. Like many of them, you may have conducted analyses using spreadsheets as data sources, but never retrieved and engineered datasets from a relational database using SQL, a programming language designed for managing databases and extracting data.
This guide for data scientists differs from other instructional guides on the subject. It doesn’t cover SQL broadly. Instead, you’ll learn the subset of SQL skills that data analysts and data scientists use frequently. You’ll also gain practical advice and direction on "how to think about constructing your dataset."
- Gain an understanding of relational database structure, query design, and SQL syntax
- Develop queries to construct datasets for use in applications like interactive reports and machine learning algorithms
- Review strategies and approaches so you can design analytical datasets
- Practice your techniques with the provided database and SQL code
In this book, author Renee Teate shares knowledge gained during a 15-year career working with data, in roles ranging from database developer to data analyst to data scientist. She guides you through SQL code and dataset design concepts from an industry practitioner’s perspective, moving your data scientist career forward!
Table of contents
- Cover
- Title Page
- Introduction
- CHAPTER 1: Data Sources
- CHAPTER 2: The SELECT Statement
  - The SELECT Statement
  - The Fundamental Syntax Structure of a SELECT Query
  - Selecting Columns and Limiting the Number of Rows Returned
  - The ORDER BY Clause: Sorting Results
  - Introduction to Simple Inline Calculations
  - More Inline Calculation Examples: Rounding
  - More Inline Calculation Examples: Concatenating Strings
  - Evaluating Query Output
  - SELECT Statement Summary
  - Exercises Using the Included Database
- CHAPTER 3: The WHERE Clause
- CHAPTER 4: CASE Statements
- CHAPTER 5: SQL JOINs
- CHAPTER 6: Aggregating Results for Analysis
- CHAPTER 7: Window Functions and Subqueries
- CHAPTER 8: Date and Time Functions
- CHAPTER 9: Exploratory Data Analysis with SQL
- CHAPTER 10: Building SQL Datasets for Analytical Reporting
- CHAPTER 11: More Advanced Query Structures
- CHAPTER 12: Creating Machine Learning Datasets Using SQL
- CHAPTER 13: Analytical Dataset Development Examples
- CHAPTER 14: Storing and Modifying Data
- APPENDIX: Answers to Exercises
  - Chapter 1: Data Sources
  - Chapter 2: The SELECT Statement
  - Chapter 3: The WHERE Clause
  - Chapter 4: CASE Statements
  - Chapter 5: SQL JOINs
  - Chapter 6: Aggregating Results for Analysis
  - Chapter 7: Window Functions and Subqueries
  - Chapter 8: Date and Time Functions
  - Chapter 9: Exploratory Data Analysis with SQL
  - Chapter 10: Building SQL Datasets for Analytical Reporting
  - Chapter 11: More Advanced Query Structures
  - Chapter 12: Creating Machine Learning Datasets Using SQL
  - Chapter 14: Storing and Modifying Data
- Index
- Copyright
- Dedication
- About the Author
- About the Technical Editor
- Acknowledgments
- End User License Agreement
Product information
- Title: SQL for Data Scientists
- Author(s): Renee Teate
- Release date: September 2021
- Publisher(s): Wiley
- ISBN: 9781119669364