|
Stuart Russell's seminar |
|
|
|
|
Stuart Russell's seminar 22 November 2010
Stuart Russell's Invited Seminar at Laboratoire de Recherche en Informatique, will take place on November 22nd, Monday, 11am, room 79.
Everyone is welcome. |
|
Title: Life, play and win in 20 trillion moves Author: Stuart Russell, Computer Science Division, University of California, Berkeley Abstract
Given the impossibility of perfectly rational behaviour, the concept of bounded optimality provides a more satisfactory formal definition of intelligence; yet still we lack ideas for designing systems that can achieve reasonable decision quality over long time scales. The talk begins with a classical idea - hierarchical planning with high-level actions of extended duration - and resolves the longstanding open problem of "downward refinement," providing the first algorithms capable of proving that a high-level plan is correct and optimal without considering its concrete implementations.
The classical setting of hierarchical planning is then generalized to that of hierarchical reinforcement learning. I describe a concurrent partial-programming language, ALisp, that may be used to specify constraints on behavior, leaving unspecified those choices that the agent must learn to make on its own. ALisp comes with reinforcement learning algorithms that, in the limit, find the optimal completion of any given partial program. Initial scaling experiments are promising.
Finally, I will briefly explore the implications of this work for research on bounded rationality, metareasoning, and artificial intelligence.
[Joint work with Ron Parr, David Andre, Bhaskara Marthi, Andy Zimdars, David Latham, Carlos Guestrin, Jason Wolfe] |
|
|
|
|
News |
|
|
Yannis Manoussakis passed away6 June 2021We have just learned of the death of Yannis Manoussakis, Professor at the University of Paris-Saclay, on Saturday June 5.
He was the leader of the GALaC team and had been for many years director of the LRI, we lose a friend and a dear colleague.
Our Semaine du cerveau : Cerveau connecté16 March 2021Wizard project1 April 2021Innovation Area: Public Safety, IoT, Mobility
|
|
|
|
|