Accepted Papers#
Please see all of the contributed work at https://openreview.net/group?id=ICML.cc/2025/Workshop/PUT.
ID |
Title |
Authors |
Decision |
---|---|---|---|
1 |
JEDI: The Force of Jensen-Shannon Divergence in Disentangling Diffusion Models |
Eric Tillmann Bill, Enis Simsar, Thomas Hofmann |
Accept (Poster) |
2 |
Agentic Adversarial QA for Improving Domain-Specific LLMs |
Vincent Grari, ciprian tomoiaga, Sylvain Lamprier, Tatsunori Hashimoto, Marcin Detyniecki |
Accept (Poster) |
3 |
Scalable Temporal Domain Generalization via Prompting |
Sepidehsadat Hosseini, Mengyao Zhai, Hossein Hajimirsadeghi, Frederick Tung |
Accept (Poster) |
4 |
Right Question is Already Half the Answer: Fully Unsupervisedd LLM Reasoning Incentization |
Qingyang Zhang, Haitao Wu, Changqing Zhang, Peilin Zhao, Yatao Bian |
Accept (Poster) |
5 |
Accurate Parameter-Efficient Test-Time Adaptation for Time Series Forecasting |
Heitor Rapela Medeiros, Hossein Sharifi-Noghabi, Gabriel L. Oliveira, Saghar Irandoust |
Accept (Poster) |
6 |
AdaptMI: Adaptive Skill-based In-context Math Instructions for Small Language Models |
Yinghui He, Abhishek Panigrahi, Yong Lin, Sanjeev Arora |
Accept (Poster) |
7 |
UniTTA: Unified Benchmark and Versatile Framework Towards Realistic Test-Time Adaptation |
Chaoqun Du, Jiayi Guo, Yulin Wang, Gao Huang |
Accept (Poster) |
8 |
SwiTTA: Switching Domain Experts and Aggregating Contextual Features Towards Realistic Test-Time Adaptation |
Chaoqun Du, Jiayi Guo, Yulin Wang, Gao Huang |
Accept (Poster) |
10 |
Lightweight Online Adaption for Time Series Foundation Model Forecasts |
Thomas L Lee, William Toner, Martin Asenov, Artjom Joosen, Rajkarn Singh |
Accept (Poster) |
11 |
DPCore: Dynamic Prompt Coreset for Continual Test-Time Adaptation |
Yunbei Zhang, Akshay Mehra, Shuaicheng Niu, Jihun Hamm |
Accept (Poster) |
13 |
Language Model Personalization via Reward Factorization |
Idan Shenfeld, Felix Faltings, Pulkit Agrawal, Aldo Pacchiano |
Accept (Oral) |
14 |
MADCAT: Combating Malware Detection Under Concept Drift with Test-Time Adaptation |
Eunjin Roh, Yigitcan Kaya, Christopher Kruegel, Giovanni Vigna, Sanghyun Hong |
Accept (Poster) |
15 |
Beyond Multiple Choice: Evaluating Steering Vectors for Adaptive Free-Form Summarization |
Joschka Braun, Carsten Eickhoff, Seyed Ali Bahrainian |
Accept (Poster) |
17 |
Language System: A Lightweight Ranking Framework for Language Models |
Chenheng Zhang, Tianqi Du, Jizhe Zhang, Mingqing Xiao, Yifei Wang, Yisen Wang, Zhouchen Lin |
Accept (Poster) |
18 |
LIFT: Improving Long Context Understanding of Large Language Models through Long Input Fine-Tuning |
Yansheng Mao, Yufei Xu, Jiaqi Li, Fanxu Meng, Haotong Yang, Zilong Zheng, Xiyuan Wang, Muhan Zhang |
Accept (Poster) |
19 |
On Training-Test (Mis)alignment in Unsupervised Combinatorial Optimization: Observation, Empirical Exploration, and Analysis |
Fanchen Bu, Kijung Shin |
Accept (Poster) |
20 |
Prefix-Tuning+: Modernizing Prefix-Tuning by Decoupling the Prefix from Attention |
Haonan Wang, Brian K Chen, Li Siquan, Liang Xinhe, Tianyang Hu, Hwee Kuan Lee, Kenji Kawaguchi |
Accept (Poster) |
21 |
Replacing thinking with tool usage enables reasoning in small language models |
Corrado Rainone, Tim Bakker, Roland Memisevic |
Accept (Poster) |
22 |
Test-time Offline Reinforcement Learning on Goal-related Experience |
Marco Bagatella, Mert Albaba, Jonas Hübotter, Georg Martius, Andreas Krause |
Accept (Poster) |
23 |
Test-Time Adaptation for Generalizable Task Progress Estimation |
Christos Ziakas, Alessandra Russo |
Accept (Poster) |
25 |
Causal Fine-Tuning of Pre-trained Language Models for Robust Test Time Adaptation |
Jialin Yu, Yuxiang Zhou, Yulan He, Nevin L. Zhang, Junchi Yu, Philip Torr, Ricardo Silva |
Accept (Poster) |
26 |
Self-Generated In-Context Examples Improve LLM Agents for Sequential Decision-Making Tasks |
Vishnu Sarukkai, Zhiqiang Xie, Kayvon Fatahalian |
Accept (Poster) |
28 |
CCC: Enhancing Video Generation via Structured MLLM Feedback |
Jing Gu, Ashwin Nagarajan, Tejas Polu, Kaizhi Zheng, Ruijian Zha, Jie Yang, Xin Eric Wang |
Accept (Poster) |
29 |
Keep the Alignment, Skip the Overhead: Lightweight Instruction Alignment for Continually Trained LLMs |
Ishan Jindal, Badrinath chandana, Pranjal Bharti, Lakkidi Vinay, SACHIN DEV SHARMA |
Accept (Poster) |
32 |
OnDistributionalRobustnessofIn-ContextLearningforTextClassification |
Carolina Hatanpää, Noah A. Smith, Sachin Kumar |
Accept (Poster) |
33 |
Leto: Modeling Multivariate Time Series with Memorizing at Test Time |
Ali Behrouz, Daniel Yiming Cao, Ali Parviz, Michele Santacatterina, Ramin Zabih |
Accept (Oral) |
34 |
Inference-Time Alignment via Hypothesis Reweighting |
Yoonho Lee, Jonathan Williams, Henrik Marklund, Archit Sharma, Eric Mitchell, Anikait Singh, Chelsea Finn |
Accept (Poster) |
35 |
e3: Learning to Explore Enables Extrapolation of Test-Time Compute for LLMs |
Amrith Setlur, Matthew Y. R. Yang, Charlie Victor Snell, Jeremiah Greer, Ian Wu, Virginia Smith, Max Simchowitz, Aviral Kumar |
Accept (Oral) |
36 |
Beyond Markovian: Reflective Exploration via Bayes-Adaptive RL for LLM Reasoning |
Shenao Zhang, Yaqing Wang, Yinxiao Liu, Tianqi Liu, Peter Grabowski, Eugene Ie, Zhaoran Wang, Yunxuan Li |
Accept (Poster) |
37 |
An Evidence-Based Post-Hoc Adjustment Framework for Anomaly Detection Under Data Contamination |
Sukanya Patra, Souhaib Ben Taieb |
Accept (Poster) |
38 |
Distilling Prompts at Test-Time for Multimodal Few-Shot Learning |
Akash Gupta, Amos Storkey, Mirella Lapata |
Accept (Poster) |
39 |
Mitigating Forgetting in Low Rank Adaptation |
Joanna Sliwa, Frank Schneider, Philipp Hennig, José Miguel Hernández-Lobato |
Accept (Poster) |
40 |
Temporal Sampling for Forgotten Reasoning in LLMs |
Yuetai Li, Zhangchen Xu, Fengqing Jiang, Bhaskar Ramasubramanian, Luyao Niu, Bill Yuchen Lin, Xiang Yue, Radha Poovendran |
Accept (Poster) |
43 |
Adaptive Monocular Depth Estimation with Masked Image Consistency |
Damian Sójka, Marc Masana, Bartłomiej Twardowski, Sebastian Cygert |
Accept (Poster) |
44 |
Learning to Self-Correct through Chain-of-Thought Verification |
Bradley Guo, Jingwen Gu, Jin Peng Zhou, Wen Sun |
Accept (Poster) |
45 |
SteeringTTA: Guiding Diffusion Trajectories for Robust Test-Time-Adaptation |
Jihyun Yu, Yoojin Oh, Wonho Bae, Mingyu Kim, Junhyug Noh |
Accept (Poster) |
46 |
Context Tuning for In-Context Optimization |
Jack Lu, Ryan Teehan, Zhenbang Yang, Mengye Ren |
Accept (Poster) |
48 |
When and How Unlabeled Data Provably Improve In-Context Learning |
Yingcong Li, Xiangyu Chang, Muti Kara, Xiaofeng Liu, Amit Roy-Chowdhury, Samet Oymak |
Accept (Poster) |
49 |
Diffusion Tree Sampling: Scalable inference‑time alignment of diffusion models |
Vineet Jain, Kusha Sareen, Mohammad Pedramfar, Siamak Ravanbakhsh |
Accept (Poster) |
50 |
The Curious Language Model: Strategic Test-Time Information Acquisition |
Michael Cooper, Rohan Wadhawan, John Michael Giorgi, Chenhao Tan, Davis Liang |
Accept (Poster) |
51 |
Prune ’n Predict: Optimizing LLM Decision-making with Conformal Prediction |
Harit Vishwakarma, Alan Mishler, Thomas Cook, Niccolo Dalmasso, Natraj Raman, Sumitra Ganesh |
Accept (Poster) |
53 |
Test-Time Alignment of Discrete Diffusion Models with Sequential Monte Carlo |
Chinmay Pani, Zijing Ou, Yingzhen Li |
Accept (Poster) |
54 |
Reasoning as an Adaptive Defense for Safety |
Taeyoun Kim, Fahim Tajwar, Aditi Raghunathan, Aviral Kumar |
Accept (Poster) |
57 |
GRIP: In-Parameter Graph Reasoning through Fine-Tuning Large Language Models |
Jiarui Feng, Yixin Chen, Muhan Zhang |
Accept (Poster) |
58 |
Test Time Adaptation Using Adaptive Quantile Recalibration |
Paria Mehrbod, Pedro Vianna, geraldin nanfack, Guy Wolf, Eugene Belilovsky |
Accept (Poster) |
59 |
Adaptive Diffusion Denoised Smoothing : Certified Robustness via Randomized Smoothing with Differentially Private Guided Denoising Diffusion |
Frederick Shpilevskiy, Saiyue Lyu, Krishnamurthy Dj Dvijotham, Mathias Lécuyer, Pierre-Andre Noel |
Accept (Oral) |
60 |
LoRA-TTT: Low-Rank Test-Time Training for Vision-Language Models |
Yuto Kojima, Jiarui Xu, Xueyan Zou, Xiaolong Wang |
Accept (Poster) |
61 |
Shift-Aware Test Time Adaptation and Benchmarking for Time-Series Forecasting |
Shivam Grover, Ali Etemad |
Accept (Oral) |
63 |
Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval |
Taiye Chen, Zeming Wei, Ang Li, Yisen Wang |
Accept (Poster) |
64 |
Scaling Textual Gradients via Sampling-Based Momentum |
Zixin Ding, Junyuan Hong, Jiachen T. Wang, Zinan Lin, Zhangyang Wang, Yuxin Chen |
Accept (Poster) |
65 |
Rejection Sampling Based Fine Tuning Secretly Performs PPO |
Gautham Govind Anil, Dheeraj Mysore Nagaraj, Karthikeyan Shanmugam, Sanjay Shakkottai |
Accept (Poster) |
66 |
Value Conditioned Policy Fine Tuning for Test Time Domain Adaptation |
Harit Pandya, Ignas Budvytis, Rudra P. K. Poudel, Stephan Liwicki |
Accept (Poster) |
68 |
Zero-Shot Adaptation of Behavioral Foundation Models to Unseen Dynamics |
Maksim Bobrin, Ilya Zisman, Alexander Nikulin, Dmitry V. Dylov, Vladislav Kurenkov |
Accept (Poster) |
71 |
N-Gram Induction Heads for In-Context RL: Improving Stability and Reducing Data Needs |
Ilya Zisman, Alexander Nikulin, Viacheslav Sinii, Denis Tarasov, Lyubaykin Nikita, Andrei Polubarov, Igor Kiselev, Vladislav Kurenkov |
Accept (Poster) |
72 |
Monitoring Risks in Test-Time Adaptation |
Mona Schirmer, Metod Jazbec, Christian A. Naesseth, Eric Nalisnick |
Accept (Oral) |