Video description
Python, SQL, and Tableau are three of the most widely used tools in the world of data science:
- Python is the leading programming language
- SQL is the most widely used means of communicating with database systems
- Tableau is the preferred solution for data visualization
The course starts by introducing software integration as a concept. We discuss important terms such as servers, clients, requests, and responses, and you will learn about data connectivity, APIs, and endpoints. We then introduce the real-life exercise the course is centred around: the Absenteeism at Work dataset. The preprocessing part that follows will give you a taste of what BI and data science look like in real-life, on-the-job situations.
Next, we apply some machine learning to our data. You will learn how to explore the problem at hand from a machine-learning perspective, how to create targets, what kind of statistical preprocessing is necessary for this part of the exercise, how to train a machine learning model, and how to test it. It is a truly comprehensive ML exercise.
Connecting Python and SQL is not straightforward, so an entire section of the course shows how it is done.
By the end of that section, you will be able to transfer data from Jupyter to Workbench. And finally, as promised, Tableau will allow us to visualize the data we have been working with. We will prepare several insightful charts and will interpret the results together.
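As a rough illustration of the connection, cursor, insert, and select steps described above, here is a minimal sketch. It is not the course's code: the course uses MySQL through 'pymysql', whereas this sketch swaps in Python's built-in sqlite3 module with an in-memory database so it can run without a server. The 'predicted_outputs' table name comes from the course outline; the column names and sample rows are hypothetical.

```python
import sqlite3

# Hypothetical model output: (employee_id, probability, prediction).
# The column names and values are illustrative assumptions, not course data.
predictions = [
    (1, 0.82, 1),
    (2, 0.13, 0),
    (3, 0.57, 1),
]


def transfer_predictions(rows):
    """Store rows in a 'predicted_outputs' table and read them back."""
    # Assumption: sqlite3 stands in for MySQL here. With pymysql the same
    # connect / cursor / execute / commit pattern applies, but connect()
    # takes server credentials and the placeholders are %s instead of ?.
    conn = sqlite3.connect(":memory:")
    cursor = conn.cursor()

    cursor.execute(
        "CREATE TABLE predicted_outputs ("
        "employee_id INTEGER, probability REAL, prediction INTEGER)"
    )

    # Parameterized insert: one statement executed once per row.
    cursor.executemany(
        "INSERT INTO predicted_outputs VALUES (?, ?, ?)", rows
    )
    conn.commit()

    # Run a SELECT from Python to confirm the data arrived.
    cursor.execute("SELECT COUNT(*) FROM predicted_outputs")
    count = cursor.fetchone()[0]

    cursor.execute(
        "SELECT employee_id, probability, prediction "
        "FROM predicted_outputs ORDER BY employee_id"
    )
    stored = cursor.fetchall()

    conn.close()
    return count, stored


row_count, stored_rows = transfer_predictions(predictions)
print(row_count, stored_rows)
```

Once the rows are committed to a real MySQL database, they would be visible from Workbench and available to Tableau as a data source.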
What You Will Learn
- Create a module of the ML model for later use
- Connect Python and SQL to transfer data from Jupyter to Workbench
- Visualize data in Tableau
- Analyze and interpret exercise outputs in Jupyter and Tableau
Audience
This course is for anyone looking for a career in Business Intelligence and Data Science. Aspiring data scientists who are eager to break into the field and learn the essentials of Data Science will benefit from this course, as will software engineers who are interested in building intelligent applications driven by Python and Machine Learning.
About The Author
365 Careers: 365 Careers’ courses have been taken by more than 203,000 students in 204 countries. People working at world-class firms such as Apple, PayPal, and Citibank have completed 365 Careers trainings. By choosing 365 Careers, you make sure you will learn from proven experts who have a passion for teaching, and can take you from beginner to pro in the shortest possible amount of time.
If you want to become a financial analyst, a finance manager, an FP&A analyst, an investment banker, a business executive, an entrepreneur, a business intelligence analyst, a data analyst, or a data scientist, 365 Careers’ courses are the perfect place to start.
Table of contents
- Chapter 1 : Introduction
- Chapter 2 : What is software integration?
- Chapter 3 : Setting up the working environment
- Chapter 4 : What's next in the course?
- Chapter 5 : Preprocessing
  - Data Sets in Python
  - Data at a Glance
  - A Note on Our Usage of Terms with Multiple Meanings
  - Picking the Appropriate Approach for the Task at Hand
  - Removing Irrelevant Data
  - Examining the Reasons for Absence
  - Splitting a Column into Multiple Dummies
  - Dummy Variables and Their Statistical Importance
  - Grouping - Transforming Dummy Variables into Categorical Variables
  - Concatenating Columns in Python
  - Changing Column Order in Pandas DataFrame
  - Implementing Checkpoints in Coding
  - Exploring the Initial "Date" Column
  - Using the "Date" Column to Extract the Appropriate Month Value
  - Introducing "Day of the Week"
  - Further Analysis of the DataFrame: Next 5 Columns
  - Further Analysis of the DataFrame: "Education", "Children", "Pets"
  - A Final Note on Preprocessing
- Chapter 6 : Machine Learning
  - Exploring the Problem from a Machine Learning Point of View
  - Creating the Targets for the Logistic Regression
  - Selecting the Inputs
  - A Bit of Statistical Preprocessing
  - Train-test Split of the Data
  - Training the Model and Assessing its Accuracy
  - Extracting the Intercept and Coefficients from a Logistic Regression
  - Interpreting the Logistic Regression Coefficients
  - Omitting the Dummy Variables from the Standardization
  - Interpreting the Important Predictors
  - Simplifying the Model (Backward Elimination)
  - Testing the Machine Learning Model
  - How to Save the Machine Learning Model and Prepare it for Future Deployment
  - Creating a Module for Later Use of the Model
- Chapter 7 : Installing MySQL and Getting Acquainted with the Interface
- Chapter 8 : Connecting Python and SQL
  - Implementing the 'absenteeism_module' - Part I
  - Implementing the 'absenteeism_module' - Part II
  - Creating a Database in MySQL
  - Importing and Installing 'pymysql'
  - Creating a Connection and Cursor
  - Creating the 'predicted_outputs' table in MySQL
  - Running an SQL SELECT Statement from Python
  - Transferring Data from Jupyter to Workbench - Part I
  - Transferring Data from Jupyter to Workbench - Part II
  - Transferring Data from Jupyter to Workbench - Part III
- Chapter 9 : Analyzing the Obtained Data in Tableau
Product information
- Title: Python, SQL, and Tableau: Integrating Python, SQL, and Tableau
- Author(s): 365 Careers
- Release date: May 2019
- Publisher(s): Packt Publishing
- ISBN: 9781838987916