3. SQL for Data Preparation
By the end of this chapter, you will be able to:
- Assemble multiple tables and queries together into a dataset
- Transform and clean data using SQL functions
- Remove duplicate data using DISTINCT and DISTINCT ON
In this chapter, we will learn to clean and prepare our data for analysis using SQL techniques.
In the previous chapter, we discussed the basics of SQL and how to work with individual tables in SQL. We also used CRUD (create, read, update and delete) operations on a table. These tables are the foundation for all the work undertaken in analytics. One of the first tasks implemented in analytics is to create clean datasets. According to Forbes, it is estimated that, almost ...