Bartłomiej Cupiał
Hello there! I'm Bartek and I am a PhD student at the University of Warsaw and IDEAS NCBR. Currently visiting the UCL DARK Lab under the supervision of Professor Tim Rocktäschel.
I'm a researcher exploring the synergy between large language models and reinforcement learning. My work focuses on developing more efficient and adaptive AI agents that can learn when to plan and reason. I'm particularly interested in addressing the fundamental challenges of open-ended learning and maintaining behavioral diversity in RL systems.
When I'm not buried in research papers, you might find me battling in out in NetHack (though I'll admit, my AI is catching up to my skills). I'm always up for a good chat, so feel free to reach out!
News
-
BALROG was accepted to ICLR 2025! We will present BALROG at the poster session this Saturday 3:00-5:30 PM, Hall 3, #252.
April, 2025
-
I'm excited to announce that I'm currently visiting the UCL DARK Lab in London under the supervision of Professor Tim Rocktäschel.
January, 2025
-
I am happy to announce that I will be presenting our paper about fine-tuning in RL at this year's MLinPL conference!. Saturday / 9 November 12:30 - 12:55 Hall A (CfC Session 7) CEST!.
November, 2024
-
We talked about work on forgetting in RL fine-tuning at UCL Dark seminar. It was a great experience! You can see the recording here.
July, 2024
-
NetHack full moon bug, a story of, by far, the weirdest bug I've encountered in my CS career.
May, 2024
-
We will be at ICML 2024! Come by our spotlight poster
#1410 on Tue 23 Jul 11:30 a.m. CEST - 1 p.m. CEST!.
May, 2024
Research
BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games
Davide Paglieri*, Bartłomiej Cupiał*, Samuel Coward, Ulyana Piterbarg, Maciej Wolczyk, Akbir Khan, Eduardo Pignatelli, Łukasz Kuciński, Lerrel Pinto, Rob Fergus, Jakob Nicolaus Foerster, Jack Parker-Holder, Tim Rocktäschel
ICLR 2025
Fine-tuning Reinforcement Learning Models is Secretly a Forgetting Mitigation Problem
Bartłomiej Cupiał*, Maciej Wołczyk*, Mateusz Ostaszewski, Michał Bortkiewicz, Michał Zając, Razvan Pascanu, Łukasz Kuciński, Piotr Miłoś
ICML 2024 Spotlight
GAN-based Plugin Model for Video Generation with Applications in Colonoscopy
Łukasz Struski1, Tomasz Urbańczyk, Krzysztof Bucki, Bartłomiej Cupiał, Aneta Kaczyńska, Przemysław Spurek, Jacek Tabor
PLOS ONE