Petr Novotný
Botanická 554/68a
602 00 Brno
| e‑mail: |
|---|
Most real-world decisions contain tradeoffs between achievable rewards and accepted losses. Current mainstream game-theoretic methods typically reduce this tradeoff to a single maximum reward objective by encoding the risks into the (negative) rewards. However, this leads to the loss of control over the tradeoff. We develop techniques for solving games where both the reward and the risk are explicitly and quantitatively considered in players' decisions.
Reinforcement learning (RL) is a sub-area of machine learning concerned with learning agents acting in an interactive environment. Recent breakthroughs in gameplaying have been achieved using methods of reinforcement learning. We concentrate on developing RL methods for risk-aware agents that act to maximize their rewards while keeping their risk at an acceptable level. Our research comprises theoretical reasoning about various classes of agents/environments and experimental evaluation of developed algorithms.
RAlph - We develop a system for risk-aware reinforcement learning based on enhanced variants of the AlphaZero algorithm. The RAlph allows for an explicit specification of acceptable risk levels controlling thus aspects of its behavior that are typically difficult to express using rewards only. So far, the RAlph has been developed only for fully observable Markov decision processes. Possible research topics:
| e‑mail: |
|---|
Want to contribute to any part of our mission? Looking for a bachelor or master thesis topic? Interested in obtaining a Ph.D. degree? Join us!