📑

AI Paper Research

AI 논문 조사 및 정리

Foundations
강화학습Reinforcement Learning
Mastering Diverse Domains through World ...
Decision Transformer: Reinforcement Lear...
Mastering Atari, Go, Chess and Shogi by ...Grandmaster level in StarCraft II using ...
Soft Actor-Critic: Off-Policy Maximum En...
Proximal Policy Optimization AlgorithmsMastering Chess and Shogi by Self-Play w...
Mastering the game of Go with deep neura...Asynchronous Methods for Deep Reinforcem...
Playing Atari with Deep Reinforcement Le...
홈/강화학습/2017

강화학습 — 2017

2편의 논문

arXiv15,000+

Proximal Policy Optimization Algorithms

근접 정책 최적화 알고리즘

John Schulman, Filip Wolski, Prafulla Dhariwal et al. (2017)

arXiv5,000+

Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm

범용 강화학습 알고리즘의 자기대국으로 체스와 쇼기 마스터하기

David Silver, Thomas Hubert, Julian Schrittwieser et al. (2017)

← 강화학습 전체