Skip to Content
What's New in AI: Multimodal AI with Purvanshi Mehta
video

What's New in AI: Multimodal AI with Purvanshi Mehta

by George Anadiotis, Purvanshi Mehta
June 2024
Intermediate
47m
English
O'Reilly Media, Inc.
Closed Captioning available in German, English, Spanish, French, Japanese, Korean, Portuguese (Portugal, Brazil), Chinese (Simplified), Chinese (Traditional)

Overview

Join host George Anadiotis and guest Purvanshi Mehta, cofounder of Lica World, for a discussion about multimodal AI and its applications. Trained on various types of data from text to images to audio and video, multimodal AI models are expanding the possibilities for the kinds of AI applications we can build.

New large AI models such as GPT-4, Gemini, and Claude 3 are all general-purpose multimodal foundational models. More specialized multimodal AI models, such as OpenAI’s yet-to-be-released Sora, which generates video from text, or Suno AI, which generates songs from text, are fueling the imagination with ways we might leverage AI to automate and augment tasks in robotics, entertainment, healthcare, manufacturing, and other industries.

George and Purvanshi discuss where this technology stands and share their thoughts on where the field is headed.

What you’ll learn and how you can apply it

  • Learn about state-of-the-art multimodal AI and technologies that you can leverage today
  • Understand the specific techniques and skills needed to build multimodal AI systems
  • Explore what’s in store for multimodal AI and how to keep up with the latest developments

This live course is for you because…

  • You want to stay up-to-date on the latest developments and breakthroughs in the field of AI.
  • You’re an AI practitioner who wants to expand your skills beyond one particular field of application.

Recommended follow-up:

Please note that slides or supplemental materials are not available for download from this recording. Resources are only provided at the time of the live event.

Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.

Watch now

Unlock full access

More than 5,000 organizations count on O’Reilly

AirBnbBlueOriginElectronic ArtsHomeDepotNasdaqRakutenTata Consultancy Services

QuotationMarkO’Reilly covers everything we've got, with content to help us build a world-class technology community, upgrade the capabilities and competencies of our teams, and improve overall team performance as well as their engagement.
Julian F.
Head of Cybersecurity
QuotationMarkI wanted to learn C and C++, but it didn't click for me until I picked up an O'Reilly book. When I went on the O’Reilly platform, I was astonished to find all the books there, plus live events and sandboxes so you could play around with the technology.
Addison B.
Field Engineer
QuotationMarkI’ve been on the O’Reilly platform for more than eight years. I use a couple of learning platforms, but I'm on O'Reilly more than anybody else. When you're there, you start learning. I'm never disappointed.
Amir M.
Data Platform Tech Lead
QuotationMarkI'm always learning. So when I got on to O'Reilly, I was like a kid in a candy store. There are playlists. There are answers. There's on-demand training. It's worth its weight in gold, in terms of what it allows me to do.
Mark W.
Embedded Software Engineer

You might also like

What’s New in AI: Generative AI with Dan Jeffries

What’s New in AI: Generative AI with Dan Jeffries

George Anadiotis, Dan Jeffries
AI Superstream: AI Agents

AI Superstream: AI Agents

Antje Barth, Lucas Soares, Patrick Debois, Tony Kipkemboi, Chris Hallenbeck, Erin Mikail Staples, Chris Fregly, Gabriela de Queiroz

Publisher Resources

ISBN: 0642572033507