MDP(Markov Decision Process)

Notice

Recent Posts

Tags more

Archives

관리 메뉴

decimal

ML&DL/RL

silent 2022. 3. 24. 14:42

- 구성요소

S, A, P, R, γ

- Model free, Model based

MDP 구성요소 중 P, R을 모르는 경우 -> Model free
MDP 구성요소 중 P, R을 아는 경우 -> Model based

Exploition, Exploration (0)	2022.03.24
on-policy, off-policy 구분 (0)	2022.03.24

'ML&DL/RL' Related Articles