Csaba Szepesvari
TalkRL: The Reinforcement Learning Podcast - A podcast by Robin Ranjit Singh Chauhan

Csaba Szepesvari of DeepMind shares his views on Bandits, Adversaries, PUCT in AlphaGo / AlphaZero / MuZero, AGI and RL, what is timeless, and more!