Chapter 7. Lean Inference
If a tree falls in the forest and no one is around to hear it, does it make a sound? We have all encountered this thought experiment in different settings. For AI practitioners, the related question should be: if a model is developed and no one uses it, what happens to all the resources consumed in its development?
All AI models are developed with the hope that they will be used extensively. Not every model finds users, though. This raises a philosophical question about how to allocate resources at the outset when developing any resource-intensive technology.1 We will discuss this dilemma further in Chapter 9.
In this chapter, we will focus on the resource efficiency and sustainability of AI models at deployment. Technically, using a trained AI model to make predictions is known as inference. I first present an overview of the inference costs of modern AI models and then look at some effective methods for reducing them. Many of the methods discussed in Chapter 6 for improving training efficiency, such as quantization and neural network pruning, can also be used to achieve lean inference. In addition, we will consider specialized methods that accelerate AI inference by translating high-level implementations into more efficient, lower-level programming languages such as C++.
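As a minimal sketch of how two of these ideas combine in practice (assuming PyTorch; the toy architecture, layer sizes, and file name are illustrative assumptions, not examples from this chapter), post-training dynamic quantization can compress a trained model, and a TorchScript export then lets the result run outside Python:

```python
import torch
import torch.nn as nn

# Stand-in for a trained model; the architecture and sizes here are
# illustrative assumptions, not a model from this chapter.
model = nn.Sequential(
    nn.Linear(512, 256),
    nn.ReLU(),
    nn.Linear(256, 10),
)
model.eval()  # inference mode: no gradients or training-only behavior needed

# Post-training dynamic quantization: weights of the listed module types
# are stored as int8 and dequantized on the fly, shrinking the model and
# often speeding up CPU inference.
quantized = torch.quantization.quantize_dynamic(
    model, {nn.Linear}, dtype=torch.qint8
)

# TorchScript compiles the model into a Python-independent artifact
# that can be loaded and executed from C++ via LibTorch.
scripted = torch.jit.script(quantized)
scripted.save("model_int8.pt")
```

The saved artifact can then be loaded in a C++ serving process with LibTorch's torch::jit::load, so predictions are served without a Python runtime on the critical path.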
Lifetime Cost of an AI Model
Consider the inference cost of a large GenAI model, such as Llama-3-405B. Let’s assume the energy consumption per prompt is ...
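To make the accounting concrete before plugging in numbers, here is the general form of the calculation, written with shorthand symbols of our own rather than the chapter's notation:

$$E_{\text{lifetime}} \approx E_{\text{training}} + N_{\text{prompts}} \cdot E_{\text{prompt}}$$

where $N_{\text{prompts}}$ is the total number of prompts served over the model's deployment lifetime and $E_{\text{prompt}}$ is the energy consumed per prompt. For a heavily used model, the inference term quickly dominates the one-time training cost, which is precisely why lean inference matters.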