Sequential Decision Making Team Seminar (Talk by Canzhe Zhao, Shanghai Jiao Tong University).

Start: 2025-10-24T13:00+09:00
End: 2025-10-24T14:00+09:00

Organizer: RIKEN AIP Public

This is an online seminar. Registration is required.

【Sequential Decision Making Team】
【Date】October 24, 2025 (Fri) 13:00-14:00 (JST)
【Speaker】Canzhe Zhao, Shanghai Jiao Tong University, Department of Computer Science and Engineering

Title: Scalable Online Learning in Adversarial Environments: from Single-Agent to Multi-Agent

Abstract: Practical applications of sequential decision-making in complex and dynamic environments face critical challenges, including the curse of dimensionality and adversarial loss functions. In this talk, I will present a unified research program on scalable online learning in adversarial environments, addressing these core challenges from both single-agent reinforcement learning (RL) and multi-agent game-theoretic perspectives.

The first part of the talk focuses on adversarial bandits and RL with function approximation. I will introduce our advances on learning in adversarial linear mixture MDPs and low-rank MDPs. In addition, I will present our best-of-both-worlds algorithms for linear bandits, which achieve (nearly) optimal regret in both stochastic and adversarial environments, even under heavy-tailed noise distributions.

The second part of the talk extends to partially observable Markov games (POMGs). I will present the first algorithm achieving last-iterate convergence in POMGs under bandit feedback, alongside pioneering algorithms for learning POMGs with linear function approximation. These algorithms enable scalable and efficient learning in high-dimensional game environments.

Collectively, these advancements demonstrate how principled algorithmic designs can overcome fundamental limitations in online learning, leading to scalable and robust decision-making in complex and dynamic environments. The contributions presented in this talk have been published in premier machine learning venues, including ICML, ICLR, NeurIPS, UAI, and AAAI.
