Contextual Bandits and Reinforcement Learning with Function Approximation > 강연영상

본문 바로가기
사이트 내 전체검색


강연영상

Contextual Bandits and Reinforcement Learning with Function Approximat…

2473
일자 이다빈
강연자
소속

In this talk, we discuss contextual bandits and reinforcement learning problems based on function approximation frameworks. For the first part, we consider neural logistic bandits, where the main task is to learn an unknown reward function within a logistic link function using a neural network. For the second part, we explain algorithms for learning Markov decision processes whose transition is governed by a multinomial logit model. 

Hot

인기 On the p-adic Group Cohomology of Finite Group Schemes

강연자 : 권혁준 | 소속 : 서울대학교
상단으로

Research Institute of Mathematics
서울특별시 관악구 대학동 서울대학교 자연과학대학 129동 305호
Tel. 02-880-6562 / Fax. 02-877-6541 su305@snu.ac.kr

COPYRIGHT ⓒ 자연과학대학 수학연구소 ALL RIGHT RESERVED.