1
What Are Transformers?
Transformers are industrialized, homogenized Large Language Models (LLMs) designed for parallel computing. The most well-known transformer is ChatGPT. A transformer model can carry out a wide range of tasks with no fine-tuning. Transformers can perform self-supervised learning on billions of records of raw unlabeled data with billions of parameters. From these billion-parameter models have emerged multimodal architectures that can process text, images, audio, and videos.
ChatGPT popularized the usage of transformer architectures that have become general-purpose technologies; just like printing, electricity, and computers.
Applications are burgeoning everywhere! Google Cloud AI, Amazon Web Services (AWS), Microsoft Azure, ...
Get Transformers for Natural Language Processing and Computer Vision - Third Edition now with the O’Reilly learning platform.
O’Reilly members experience books, live events, courses curated by job role, and more from O’Reilly and nearly 200 top publishers.