Français Anglais
Accueil Annuaire Plan du site
Accueil > Evenements > Séminaires
Séminaire d'équipe(s) Learning and Optimization
Continuous MCTS for hydroelectric scheduling
Adrien Couëtoux

16 July 2013, 14:30 - 16 July 2013, 15:30
Salle/Bat : 1/DIG-Moulon
Contact :

Activités de recherche :

Résumé :
Hydroelectric scheduling (HS) problems, so far, have mostly been solved by mathematical programming methods (linear programming, stochastic dual dynamic programming, etc), because it's the only known method that works well to solve sequential decision making with very large action space (~10000 variables and more).
However, these methods need to simplify the model in several ways, including: assuming the cost function is convex, and simplifying the underlying random processes (like assuming they are Markovian).
Both these assumptions are wrong in reality for hydroelectric systems.

We investigate an alternative way to solve these problems, Monte Carlo Tree Search. It became famous for improving significantly the level of computers for the game of Go. We extended its reach to continuous domains.
However, MCTS suffers from some limitations to work on HS problems: it does not do well on very large action spaces, and it struggles with long time horizon.
This is why we worked on two fronts: first, we developped a framework to mix MCTS with existing suboptimal policies (like Linear Programming), in a way that can get the best out of both worlds.
Second, we are working on modifications of the traditional bandit inspired formula that dictates how to spend computing power in the tree of MCTS. Alternative methods to direct the computing effort show promising results on simple problems.

Pour en savoir plus :
Séminaires
Programming computing media (reporté)
Combinatorics
Friday 18 September 2020 - 14:30
Salle : 445 - PCRI-N
Frédéric Gruau .............................................

forum-dev Continuous Integration
Friday 05 June 2020 - 10:00
Salle : 0 - 650
Erik Bray .............................................

Large-scale Spectral Clustering for GPU-based Plat
High-performance computing
Tuesday 24 March 2020 - 10:30
Salle : 465 - PCRI-N
Guanlin He .............................................

Recherche Opérationnelle à Google
Stochastic Combinatorial Optimization
Thursday 12 March 2020 - 14:30
Salle : 445 - PCRI-N
Laurent Perron .............................................

Forum dev-LRI
Wednesday 05 February 2020 - 14:00
Salle : 455 - PCRI-N
Erik Bray .............................................