Building LLM Powered Applications

by Valentina Alto
May 2024
Intermediate to advanced
342 pages
8h 45m
English
Packt Publishing

10

Building Multimodal Applications with LLMs

In this chapter, we go beyond LLMs to introduce the concept of multimodality in the context of building agents. We will look at the logic behind combining foundation models from different AI domains (language, images, and audio) into a single agent that can adapt to a variety of tasks. By the end of this chapter, you will be able to build your own multimodal agent, providing it with the tools and LLMs it needs to perform various AI tasks.

Throughout this chapter, we will cover the following topics:

  • Introduction to multimodality and large multimodal models (LMMs)
  • Examples of emerging LMMs
  • How to build a multimodal agent with single-modal LLMs using LangChain
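
As a rough illustration of the idea of one agent backed by several single-modal models, the sketch below routes each request to a tool for the matching modality. It is plain Python and does not use LangChain's API. The `Tool` structure, the stub functions, and the file-extension routing are all hypothetical stand-ins: in a real agent, each stub would call an actual model, and an LLM would typically decide which tool to use.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class Tool:
    """A single-modal capability the agent can call (hypothetical structure)."""
    name: str
    description: str
    run: Callable[[str], str]


# Stub tools: each one stands in for a real single-modal model.
def describe_image(path: str) -> str:
    return f"[caption for image at {path}]"


def transcribe_audio(path: str) -> str:
    return f"[transcript of audio at {path}]"


def answer_text(prompt: str) -> str:
    return f"[text answer to: {prompt}]"


TOOLS: Dict[str, Tool] = {
    "image": Tool("image", "Describes the content of an image file", describe_image),
    "audio": Tool("audio", "Transcribes an audio file", transcribe_audio),
    "text": Tool("text", "Answers a plain-text question", answer_text),
}


def pick_tool(user_input: str) -> Tool:
    """Pick a tool from the file extension (assumption: a real agent lets an LLM decide)."""
    lowered = user_input.lower()
    if lowered.endswith((".png", ".jpg", ".jpeg")):
        return TOOLS["image"]
    if lowered.endswith((".wav", ".mp3")):
        return TOOLS["audio"]
    return TOOLS["text"]


def run_agent(user_input: str) -> str:
    """Route the input to the selected tool and return that tool's output."""
    tool = pick_tool(user_input)
    return tool.run(user_input)


if __name__ == "__main__":
    for request in ["holiday.png", "meeting.mp3", "What is multimodality?"]:
        print(pick_tool(request).name, "->", run_agent(request))
```

The point of the sketch is the separation of concerns: the tools know nothing about each other, and the agent only needs a name and a description for each one in order to choose between them.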

Technical requirements

To complete the ...


Publisher Resources

ISBN: 9781835462317