CHAPTER 3Database Fundamentals
In the previous chapter, we touched on storage as a part of the data engineering life cycle. Now, let’s zoom in on one of the most important tools in that stage: databases. As a data engineer, you’re going to spend a lot of time working with databases, designing them and understanding how they work.
In this chapter, we’ll introduce you to the foundational concepts you need to start working with databases confidently. You’ll learn what they are, the different types that exist, and how to interact with them in a real-world context.
HERE’S WHAT WE’LL COVER:
- What databases are and why they matter
- The difference between relational and NoSQL databases
- Various types of NoSQL databases and their use cases
- Primary and foreign key concepts
- Interacting with databases using SQL
- Applying ACID principles
- Choosing the best type of database for your project
Databases are organized collections of data that allow for efficient storage, retrieval, and management of information. In any digital application or system, databases are the foundation for storing data, making them essential in nearly every industry. One of the key strengths of a database is that it allows data to be structured in a way that supports queries. A query is a request for information from a database. This ability to query data is what makes databases powerful for reporting, analysis, and decision-making.
The primary purpose of databases is to efficiently manage large volumes of data while ensuring ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access