on-demand course

AI Catalyst Conference: NLP with ChatGPT (and other Large Language Models): Transformer Architectures from Development to Deployment

with Jon Krohn

March 2023

Intermediate

2h 52m

English

Addison-Wesley Professional

Closed Captioning available in English

Overview

3 Hours of Video

Discover the astounding state-of-the-art in Natural Language Processing (NLP) that is enabled by Large Language Models (LLMs) like ChatGPT and T5
Understand Attention and Transformers, as well as how these essential modern NLP concepts relate to Deep Learning and LLMs
Survey the staggeringly broad range of LLMs’ natural-language capabilities
Learn how to use LLMs in practice, including how to train and deploy them into production NLP applications

Large Language Models (LLMs) such as GPT series architectures have dramatically accelerated the natural language processing (NLP) capabilities of machines in recent years. These capabilities, facilitated by LLMs’ hundreds of billions of model parameters, approach or exceed human-level performance on a staggeringly broad set of natural-language tasks — often without any task-specific training being required. In this event, leading subject-matter experts introduce LLMs and their associated concepts (e.g., Transformers, Attention), survey LLMs’ breadth of capabilities, and provide the best practices on how to leverage LLMs efficiently and confidently in order to supercharge your own natural-language applications.

AI Catalyst
The AI Catalyst Conference from Pearson brings together leading voices in AI to make complex topics understandable and actionable. Host Jon Krohn guides the conversation and explains how to bring state-of-the-art methods into practice. Gain new information or a different perspective to make an impact in your job and in the world.

By the end of the course, you’ll understand:

Large Language Models (LLMs)
Attention
Transformers
The breadth of state-of-the-art NLP applications

And you’ll be able to:

Select an appropriate LLM architecture for a given NLP application
Prompt pre-trained LLMs like ChatGPT and GPT-3 to effectively produce your desired output
Train and deploy LLMs into production NLP applications
Potentially accelerate your data science roadmap by months or years by leveraging a pre-trained LLM instead of needing to train individual task-specific models from scratch yourself

This course is for you because…

You’d like to appreciate the staggering breadth of NLP and Deep Learning capabilities
You are a data scientist, software developer, ML engineer, or other technical professional who would like to be able incorporate new NLP approaches into real-world applications

Prerequisites
All you need is an interest in how AI can impact you and your organization.

Recommended Follow-up

Read: Quick Start Guide to LLMs by Sinan Ozdemir, https://learning.oreilly.com/library/view/quick-start-guide/9780138199425/
Attend: Deploying GPT and Large Language Models by Sinan Ozdemir: https://learning.oreilly.com/search/?q=Sinan%20Ozdemir&type=live-event-series&rows=10&publishers=Pearson
Attend: Hands-on Natural Language Generation and GPT by Sinan Ozdemir: https://learning.oreilly.com/search/?q=Sinan%20Ozdemir&type=live-event-series&rows=10&publishers=Pearson
Read: Chapter 15 of Learning Deep Learning by Dr. Magnus Ekman: https://learning.oreilly.com/library/view/learning-deep-learning/9780137470198/
Watch: NLP using Transformer Architectures by Aurélien Géron: https://learning.oreilly.com/videos/natural-language-processing/0636920373605/0636920373605-video329383/
For a more general introduction to deep learning, check out the Deep Learning: The Complete Guide playlist by Dr. Jon Krohn: https://learning.oreilly.com/playlists/a40ea8fe-994d-4370-8b29-0d6c0f519a89/

Course Schedule

Jon Krohn: Welcome

Sinan Ozdemir: Introduction to Large Language Models (30 minutes)
We can’t talk about state-of-the-art Natural Language Processing (NLP) without talking about Transformers and large language models (LLMs) like ChatGPT, BERT, GPT, and T5. Sinan explores a brief history of modern NLP up to the rise of attention-based models and Transformers including the proliferation of LLMs that continues to this day along with all of the good and sometimes the not-so-good outcomes. He overviews the major architectures that influence the tasks and models that dominate NLP while peeking under the hood to understand how LLMs learn to read, write, and do so much more.

Sinan Ozdemir is an active lecturer focusing on large language models and a former lecturer of data science at the Johns Hopkins University. He is the author of multiple textbooks on data science and machine learning including The Principles of Data Science. Sinan is the Founder and CTO of LoopGenius where he uses State of the art AI to help people create and run their businesses. He holds a master’s degree in Pure Mathematics from Johns Hopkins University and is based in San Francisco.

Jon and Sinan Discussion + Q&A

Melanie Subbiah: The Broad Range of LLM Capabilities
Large language models have unlocked a huge number of exciting applications in the real world that were not possible before — capabilities that are creative, useful, and profitable. Through interactive demos of GPT-3, Melanie explores a broad range of these use cases, giving participants more intuition for how large language models have been effective.

Melanie Subbiah is a third-year PhD student in NLP at Columbia University where she researches narrative summarization and aspects of online text safety. Before starting graduate school, she was one of the lead authors on the GPT-3 paper, building out the evaluation suite for that work and helping early customers use the OpenAI API for their projects. Prior to that, she researched autonomous systems at Apple. Melanie obtained her Bachelor’s in computer science from Williams College.

Jon and Melanie Discussion + Q&A

Shaan Khosla: Training and Deploying LLMs
Shaan covers practical LLM tips over the full NLP lifecycle. These include topics such as efficient training practices, validation methods, and productionization considerations to ensure your design is optimized for implementation within your real-world natural-language application.

Shaan Khosla is a data scientist at Nebula where he researches, designs, and develops NLP models. He’s previously worked at Bank of America on an internal machine learning consulting team, where he used LLMs to build proof of concept systems for various lines of business. Shaan holds a BSBA in Computer Science and Finance from the University of Miami and is currently completing a master’s degree in Data Science at NYU. He has published multiple peer-reviewed papers applying LLMs, topic modeling, and recommendation systems to the fields of biochemistry and healthcare.

Jon and Shaan Discussion + Q&A

Jon Krohn: Closing Remarks

About the Host
Host: Jon Krohn is Co-Founder and Chief Data Scientist at the machine learning company Nebula. He authored the book Deep Learning Illustrated, an instant #1 bestseller that was translated into seven languages. He is also the host of SuperDataScience, the data science industry’s most listened-to podcast. Jon is renowned for his compelling lectures, which he offers at leading universities and conferences, as well as via his award-winning YouTube channel. He holds a PhD from Oxford and has been publishing on machine learning in prominent academic journals since 2010.

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Watch now

Unlock full access

More than 5,000 organizations count on O’Reilly

O’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.

Julian F.

Head of Cybersecurity

I wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.

Addison B.

Field Engineer

I’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.

Amir M.

Data Platform Tech Lead

I'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.

Mark W.

Embedded Software Engineer

Introduction to Transformer Models for NLP: Using BERT, GPT, and More to Solve Modern Natural Language Processing Tasks

Publisher Resources

ISBN: 9780138224912

Cloud Computing

Data Engineering

Data Science

AI & ML

Programming Languages

Software Architecture

IT/Ops

Security

Design

Business

Soft Skills