Skip to Main Content
Automated Planning
book

Automated Planning

by Malik Ghallab, Dana Nau, Paolo Traverso
May 2004
Intermediate to advanced content levelIntermediate to advanced
635 pages
19h 46m
English
Morgan Kaufmann
Content preview from Automated Planning
16.2 Planning in Fully Observable Domains 385
We can define the utility in a state s with an action a as V (s, a) = R(s) C(s, a),
and the utility of a policy in a state as V (s|π ) = R(s) C(s, π (s)). This generalizes
to histories. Let h =s
0
, s
1
, ... be a history. The utility of history h induced by a
policy π is defined as
V (h|π ) =
i0
(R(s
i
) C(s
i
, π (s
i
)))
One problem with this definition is that it usually will not converge to a finite
value. In fact, it is important that the total accumulated utility is finite; otherwise,
there is no way to compare histories. A common way to ensure a bounded measure
of utilities for infinite histories is to
Become an O’Reilly member and get unlimited access to this title plus top books and audiobooks from O’Reilly and nearly 200 top publishers, thousands of courses curated by job role, 150+ live events each month,
and much more.
Start your free trial

You might also like

Communicate with Teams More Effectively

Communicate with Teams More Effectively

Charles Humble
How to Overcome a Power Deficit

How to Overcome a Power Deficit

Cyril Bouquet, Jean-Louis Barsoux

Publisher Resources

ISBN: 9781558608566