Contextual Bandits and Reinforcement Learning with Function Approximation


Department of Mathematical Sciences, Seoul National University (서울대학교 수리과학부)


Mathematics Colloquium (수학강연회), 2025.09.12 12:29

Video: https://www.youtube.com/embed/hH0a4Q5VFFA

Speaker: 이다빈
Affiliation: Seoul National University (서울대학교)

In this talk, we discuss contextual bandits and reinforcement learning problems based on function approximation frameworks. For the first part, we consider neural logistic bandits, where the main task is to learn an unknown reward function within a logistic link function using a neural network. For the second part, we explain algorithms for learning Markov decision processes whose transition is governed by a multinomial logit model.
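To make the two models named in the abstract concrete, here is a minimal sketch and not the speaker's algorithm. It assumes the logistic bandit reward is a Bernoulli variable whose mean is the logistic link applied to a scalar score (in the neural setting that score would come from a neural network; here it is just a number), and that the multinomial logit transition model is a softmax over linear scores of candidate next states. The names `logistic`, `expected_reward`, `mnl_transition_probs`, `features`, and `theta` are hypothetical and chosen for illustration only.

```python
import math


def logistic(z):
    """Logistic link mu(z) = 1 / (1 + exp(-z)), computed without overflow."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def expected_reward(score):
    """Mean of a binary reward whose latent score passes through the logistic link.

    Assumption: `score` stands in for the output of the unknown reward
    function (for example, a neural network evaluated at a context-action pair).
    """
    return logistic(score)


def mnl_transition_probs(features, theta):
    """Multinomial logit probabilities over candidate next states.

    Assumption: features[i] is a (hypothetical) feature vector for the i-th
    candidate next state given the current state and action, and theta is
    the unknown parameter, so P(next = i) is proportional to
    exp(<features[i], theta>).
    """
    logits = [sum(f * t for f, t in zip(phi, theta)) for phi in features]
    shift = max(logits)  # subtract the maximum for numerical stability
    weights = [math.exp(v - shift) for v in logits]
    total = sum(weights)
    return [w / total for w in weights]
```

For instance, `expected_reward(0.0)` is 0.5, and `mnl_transition_probs` with an all-zero `theta` assigns equal probability to every candidate next state, since all scores coincide.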


