Chapter 3. LLM-Based Applications
As of early 2025, only a few companies offer large multimodal models that can understand and generate text, images, and other media, like sound and video. For brevity, we will call these AI models. The most well-known examples are the GPT models created by OpenAI, but a few other popular examples are the Gemini models created by Google, the Claude Sonnet and Haiku models created by Anthropic, and the Llama models created by Meta.
In many cases, these companies partner with other companies to offer these models as a cloud service. For example, OpenAI has a partnership with Microsoft, which provides the infrastructure to host OpenAI’s models in cloud services that can be accessed via APIs. Other companies, like Meta, provide a model snapshot, a large binary file containing the weights of a pretrained model, which users can install in their own infrastructure. This infrastructure can be “bare metal,” meaning physical machines the companies own, or cloud infrastructure they purchase from other providers.
Model-building companies also offer user-facing applications. In many cases, the name of the model and the name of the user-facing application are the same or very similar, making it easy to confuse the two. For example, the Google Gemini application uses the Google Gemini model, and the Claude application uses the Anthropic Claude Sonnet and Haiku models. OpenAI’s names are slightly different: its user-facing application, ChatGPT, allows users to ...
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Read now
Unlock full access