Acquisition of Chess Knowledge in Alphazero

AI Safety Fundamentals: Alignment - Un pódcast de BlueDot Impact

Categorías:

Abstract:What is learned by sophisticated neural network agents such as AlphaZero? This question is of both scientific and practical interest. If the representations of strong neural networks bear no resemblance to human concepts, our ability to understand faithful explanations of their decisions will be restricted, ultimately limiting what we can achieve with neural network interpretability. In this work we provide evidence that human knowledge is acquired by the AlphaZero neural network as ...

Visit the podcast's native language site