Agent Research Papers
Updated on 2025.10.30
Current Search Keywords: Agent, Multi-Agent, Tool Learning, Agent RL, Autonomous Agent, LLM Agent
If you would like other keywords to be tracked, please feel free to let us know :)
Table of Contents
- <a href=#agent>Agent</a>
- <a href=#large-language-models>Large Language Models</a>
- <a href=#reinforcement-learning>Reinforcement Learning</a>
Agent
- 2025-10-13, video-SALMONN S: Streaming Audio-Visual LLMs Beyond Length Limits via Memory, Guangzhi Sun et.al., Paper: http://arxiv.org/abs/2510.11129
- 2025-08-28, rStar2-Agent: Agentic Reasoning Technical Report, Ning Shang et.al., Paper: http://arxiv.org/abs/2508.20722
- 2025-10-22, gem5 Co-Pilot: AI Assistant Agent for Architectural Design Space Exploration, Zuoming Fu et.al., Paper: http://arxiv.org/abs/2510.19577
- 2025-10-12, Zero-Shot Large Language Model Agents for Fully Automated Radiotherapy Treatment Planning, Dongrong Yang et.al., Paper: http://arxiv.org/abs/2510.11754
- 2025-09-30, Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents, Shuai Shao et.al., Paper: http://arxiv.org/abs/2509.26354
- 2025-10-20, World-in-World: World Models in a Closed-Loop World, Jiahan Zhang et.al., Paper: http://arxiv.org/abs/2510.18135
- 2025-10-16, Why Instant-Runoff Voting Is So Resilient to Coalitional Manipulation: Phase Transitions in the Perturbed Culture, François Durand et.al., Paper: http://arxiv.org/abs/2510.14450
- 2025-10-17, Where to Search: Measure the Prior-Structured Search Space of LLM Agents, Zhuo-Yang Song et.al., Paper: http://arxiv.org/abs/2510.14846
- 2025-09-29, Where LLM Agents Fail and How They can Learn From Failures, Kunlun Zhu et.al., Paper: http://arxiv.org/abs/2509.25370
- 2025-10-21, When Your AI Agent Succumbs to Peer-Pressure: Studying Opinion-Change Dynamics of LLMs, Aliakbar Mehdizadeh et.al., Paper: http://arxiv.org/abs/2510.19107
- 2025-10-08, When Machines Meet Each Other: Network Effects and the Strategic Role of History in Multi-Agent AI, Yu Liu et.al., Paper: http://arxiv.org/abs/2510.06903
- 2025-10-10, When LLM Agents Meet Graph Optimization: An Automated Data Quality Improvement Approach, Zhihan Zhang et.al., Paper: http://arxiv.org/abs/2510.08952
- 2025-09-29, When Greedy Wins: Emergent Exploitation Bias in Meta-Bandit LLM Training, Sanxing Chen et.al., Paper: http://arxiv.org/abs/2509.24923
- 2025-09-02, When Agents go Astray: Course-Correcting SWE Agents with PRMs, Shubham Gandhi et.al., Paper: http://arxiv.org/abs/2509.02360
- 2025-10-13, When Agents Trade: Live Multi-Market Trading Benchmark for LLM Agents, Lingfei Qian et.al., Paper: http://arxiv.org/abs/2510.11695
- 2025-10-15, When “Correct” Is Not Safe: Can We Trust Functionally Correct Patches Generated by Code Agents?, Yibo Peng et.al., Paper: http://arxiv.org/abs/2510.17862
- 2025-09-26, What Makes LLM Agent Simulations Useful for Policy? Insights From an Iterative Design Engagement in Emergency Preparedness, Yuxuan Li et.al., Paper: http://arxiv.org/abs/2509.21868
- 2025-09-25, What Do LLM Agents Do When Left Alone? Evidence of Spontaneous Meta-Cognitive Patterns, Stefan Szeider et.al., Paper: http://arxiv.org/abs/2509.21224
- 2025-10-21, WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection, Guanzhong He et.al., Paper: http://arxiv.org/abs/2510.18798
- 2025-09-16, WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning, Kuan Li et.al., Paper: http://arxiv.org/abs/2509.13305
- 2025-10-13, WebRouter: Query-specific Router via Variational Information Bottleneck for Cost-sensitive Web Agent, Tao Li et.al., Paper: http://arxiv.org/abs/2510.11221
- 2025-10-22, WebGraphEval: Multi-Turn Trajectory Evaluation for Web Agents using Graph Representation, Yaoyao Qian et.al., Paper: http://arxiv.org/abs/2510.19205
- 2025-10-21, WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality, Chunyang Li et.al., Paper: http://arxiv.org/abs/2510.18560
- 2025-10-08, WebDART: Dynamic Decomposition and Re-planning for Complex Web Tasks, Jingbo Yang et.al., Paper: http://arxiv.org/abs/2510.06587
- 2025-10-17, WEBSERV: A Browser-Server Environment for Efficient Training of Reinforcement Learning-based Web Agents at Scale, Yuxuan Lu et.al., Paper: http://arxiv.org/abs/2510.16252
- 2025-09-28, WAREX: Web Agent Reliability Evaluation on Existing Benchmarks, Su Kara et.al., Paper: http://arxiv.org/abs/2510.03285
- 2025-09-30, VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications, Wei He et.al., Paper: http://arxiv.org/abs/2509.26490
- 2025-09-15, VisDocSketcher: Towards Scalable Visual Documentation with Agentic Systems, Luís F. Gomes et.al., Paper: http://arxiv.org/abs/2509.11942
- 2025-10-20, Verification-Aware Planning for Multi-Agent Systems, Tianyang Xu et.al., Paper: http://arxiv.org/abs/2510.17109
- 2025-10-03, VeriGuard: Enhancing LLM Agent Safety via Verified Code Generation, Lesly Miculicich et.al., Paper: http://arxiv.org/abs/2510.05156
- 2025-10-14, VQArt-Bench: A semantically rich VQA Benchmark for Art and Cultural Heritage, A. Alfarano et.al., Paper: http://arxiv.org/abs/2510.12750
- 2025-10-16, VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation, Han Zhao et.al., Paper: http://arxiv.org/abs/2510.14902
- 2025-10-17, VERA-MH Concept Paper, Luca Belli et.al., Paper: http://arxiv.org/abs/2510.15297
- 2025-10-19, VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents, Kangrui Wang et.al., Paper: http://arxiv.org/abs/2510.16907
- 2025-09-12, V-Math: An Agentic Approach to the Vietnamese National High School Graduation Mathematics Exams, Duong Q. Nguyen et.al., Paper: http://arxiv.org/abs/2509.12251
- 2025-08-26, Utilizing Training Data to Improve LLM Reasoning for Tabular Understanding, Chufan Gao et.al., Paper: http://arxiv.org/abs/2508.18676
- 2025-10-14, Using Kolmogorov-Smirnov Distance for Measuring Distribution Shift in Machine Learning, Ozan K. Tonguz et.al., Paper: http://arxiv.org/abs/2510.15996
- 2025-10-16, UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos, Mingxuan Liu et.al., Paper: http://arxiv.org/abs/2510.15018
- 2025-10-18, Unleashing Diverse Thinking Modes in LLMs through Multi-Agent Collaboration, Zhixuan He et.al., Paper: http://arxiv.org/abs/2510.16645
- 2025-09-17, Understanding the Process of Human-AI Value Alignment, Jack McKinlay et.al., Paper: http://arxiv.org/abs/2509.13854
- 2025-10-13, Uncertainty-Aware, Risk-Adaptive Access Control for Agentic Systems using an LLM-Judged TBAC Model, Charles Fleming et.al., Paper: http://arxiv.org/abs/2510.11414
- 2025-09-26, UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios, Haotian Luo et.al., Paper: http://arxiv.org/abs/2509.21766
- 2025-09-22, UIPro: Unleashing Superior Interaction Capability For GUI Agents, Hongxin Li et.al., Paper: http://arxiv.org/abs/2509.17328
- 2025-09-05, UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning, Haoming Wang et.al., Paper: http://arxiv.org/abs/2509.02544
- 2025-10-17, TriAgent: Automated Biomarker Discovery with Deep Research Grounding for Triage in Acute Care by LLM-Based Multi-Agent Collaboration, Kerem Delikoyun et.al., Paper: http://arxiv.org/abs/2510.16080
- 2025-10-11, Tree Search for LLM Agent Reinforcement Learning, Yuxiang Ji et.al., Paper: http://arxiv.org/abs/2509.21240
- 2025-10-07, Training-Free Time Series Classification via In-Context Reasoning with LLM Agents, Songyuan Sui et.al., Paper: http://arxiv.org/abs/2510.05950
- 2025-10-09, Training-Free Group Relative Policy Optimization, Yuzheng Cai et.al., Paper: http://arxiv.org/abs/2510.08191
- 2025-09-24, Training Task Reasoning LLM Agents for Multi-turn Task Planning via Single-turn Reinforcement Learning, Hanjiang Hu et.al., Paper: http://arxiv.org/abs/2509.20616
- 2025-10-16, Training LLM Agents to Empower Humans, Evan Ellis et.al., Paper: http://arxiv.org/abs/2510.13709
- 2025-10-22, Trace: Securing Smart Contract Repository Against Access Control Vulnerability, Chong Chen et.al., Paper: http://arxiv.org/abs/2510.19254
- 2025-09-11, TrEnv: Transparently Share Serverless Execution Environments Across Different Functions and Nodes, Jialiang Huang et.al., Paper: http://arxiv.org/abs/2509.09525
- 2025-09-20, Towards Transparent and Incentive-Compatible Collaboration in Decentralized LLM Multi-Agent Systems: A Blockchain-Driven Approach, Minfeng Qi et.al., Paper: http://arxiv.org/abs/2509.16736
- 2025-09-19, Towards Robust Visual Continual Learning with Multi-Prototype Supervision, Xiwei Liu et.al., Paper: http://arxiv.org/abs/2509.16011
- 2025-10-24, Towards Reliable Code-as-Policies: A Neuro-Symbolic Framework for Embodied Task Planning, Sanghyun Ahn et.al., Paper: http://arxiv.org/abs/2510.21302
- 2025-10-19, Towards Interpretable and Trustworthy Time Series Reasoning: A BlueSky Vision, Kanghui Ning et.al., Paper: http://arxiv.org/abs/2510.16980
- 2025-10-17, Towards Automatic Evaluation and Selection of PHI De-identification Models via Multi-Agent Collaboration, Guanchen Wu et.al., Paper: http://arxiv.org/abs/2510.16194
- 2025-10-16, Towards Automated Governance: A DSL for Human-Agent Collaboration in Software Projects, Adem Ait et.al., Paper: http://arxiv.org/abs/2510.14465
- 2025-09-02, Towards Agents That Know When They Don’t Know: Uncertainty as a Control Signal for Structured Reasoning, Josefa Lia Stoisser et.al., Paper: http://arxiv.org/abs/2509.02401
- 2025-09-30, Towards Agentic OS: An LLM Agent Framework for Linux Schedulers, Yusheng Zheng et.al., Paper: http://arxiv.org/abs/2509.01245
- 2025-10-23, Towards AI Agents for Course Instruction in Higher Education: Early Experiences from the Field, Yogesh Simmhan et.al., Paper: http://arxiv.org/abs/2510.20255
- 2025-09-16, Toward PDDL Planning Copilot, Yarin Benyamin et.al., Paper: http://arxiv.org/abs/2509.12987
- 2025-08-25, Toward Generalized Autonomous Agents: A Neuro-Symbolic AI Framework for Integrating Social and Technical Support in Education, Ryan Hare et.al., Paper: http://arxiv.org/abs/2508.18406
- 2025-08-26, Toward Edge General Intelligence with Agentic AI and Agentification: Concepts, Technologies, and Future Directions, Ruichen Zhang et.al., Paper: http://arxiv.org/abs/2508.18725
- 2025-10-08, Toward Causal-Visual Programming: Enhancing Agentic Reasoning in Low-Code Environments, Jiexi Xu et.al., Paper: http://arxiv.org/abs/2509.25282
- 2025-09-17, TopoSizing: An LLM-aided Framework of Topology-based Understanding and Sizing for AMS Circuits, Ziming Wei et.al., Paper: http://arxiv.org/abs/2509.14169
- 2025-10-22, ToolScope: Enhancing LLM Agent Tool Use through Tool Merging and Context-Aware Filtering, Marianne Menglin Liu et.al., Paper: http://arxiv.org/abs/2510.20036
- 2025-09-18, ToolSample: Dual Dynamic Sampling Methods with Curriculum Learning for RL-based Tool Learning, Zihao Feng et.al., Paper: http://arxiv.org/abs/2509.14718
- 2025-10-16, ToolPRM: Fine-Grained Inference Scaling of Structured Outputs for Function Calling, Jianghao Lin et.al., Paper: http://arxiv.org/abs/2510.14703
- 2025-10-19, ToolCritic: Detecting and Correcting Tool-Use Errors in Dialogue Systems, Hassan Hamad et.al., Paper: http://arxiv.org/abs/2510.17052
- 2025-10-21, Tokencake: A KV-Cache-centric Serving Framework for LLM-based Multi-Agent Applications, Zhuohang Bian et.al., Paper: http://arxiv.org/abs/2510.18586
- 2025-10-14, ToPolyAgent: AI Agents for Coarse-Grained Topological Polymer Simulations, Lijie Ding et.al., Paper: http://arxiv.org/abs/2510.12091
- 2025-10-16, To Infinity and Beyond: Tool-Use Unlocks Length Generalization in State Space Models, Eran Malach et.al., Paper: http://arxiv.org/abs/2510.14826
- 2025-09-17, Ticket-Bench: A Kickoff for Multilingual and Regionalized Agent Evaluation, Thales Sales Almeida et.al., Paper: http://arxiv.org/abs/2509.14477
- 2025-09-22, Through the Lens of Human-Human Collaboration: A Configurable Research Platform for Exploring Human-Agent Collaboration, Bingsheng Yao et.al., Paper: http://arxiv.org/abs/2509.18008
- 2025-10-15, Three-Dimensional Simulation of the University of Hawai`i FEL Oscillator: Superradiant Emission and Cavity Desynchronization, Amir Weinberg et.al., Paper: http://arxiv.org/abs/2510.14061
- 2025-10-14, Three Lenses on the AI Revolution: Risk, Transformation, Continuity, Masoud Makrehchi et.al., Paper: http://arxiv.org/abs/2510.12859
- 2025-10-17, The Spark Effect: On Engineering Creative Diversity in Multi-Agent AI Systems, Alexander Doudkin et.al., Paper: http://arxiv.org/abs/2510.15568
- 2025-10-01, The Social Laboratory: A Psychometric Framework for Multi-Agent LLM Evaluation, Zarreen Reza et.al., Paper: http://arxiv.org/abs/2510.01295
- 2025-09-01, The Need for Verification in AI-Driven Scientific Discovery, Cristina Cornelio et.al., Paper: http://arxiv.org/abs/2509.01398
- 2025-09-02, The Landscape of Agentic Reinforcement Learning for LLMs: A Survey, Guibin Zhang et.al., Paper: http://arxiv.org/abs/2509.02547
- 2025-09-23, The Heterogeneous Multi-Agent Challenge, Charles Dansereau et.al., Paper: http://arxiv.org/abs/2509.19512
- 2025-10-16, The Gatekeeper Knows Enough, Fikresilase Wondmeneh Abebayew et.al., Paper: http://arxiv.org/abs/2510.14881
- 2025-09-26, The Emergence of Altruism in Large-Language-Model Agents Society, Haoyang Li et.al., Paper: http://arxiv.org/abs/2509.22537
- 2025-08-25, The AI Data Scientist, Farkhad Akimov et.al., Paper: http://arxiv.org/abs/2508.18113
- 2025-10-16, Terrarium: Revisiting the Blackboard for Multi-Agent Safety, Privacy, and Security Studies, Mason Nakamura et.al., Paper: http://arxiv.org/abs/2510.14312
- 2025-10-09, Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception – Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track, Erjia Xiao et.al., Paper: http://arxiv.org/abs/2510.07871
- 2025-10-15, Tandem Training for Language Models, Robert West et.al., Paper: http://arxiv.org/abs/2510.13551
- 2025-09-09, Talking with Oompa Loompas: A novel framework for evaluating linguistic acquisition of LLM agents, Sankalp Tattwadarshi Swain et.al., Paper: http://arxiv.org/abs/2509.07389
- 2025-09-08, TalkToAgent: A Human-centric Explanation of Reinforcement Learning Agents with Large Language Models, Haechang Kim et.al., Paper: http://arxiv.org/abs/2509.04809
- 2025-10-12, Talk Less, Call Right: Enhancing Role-Play LLM Agents with Automatic Prompt Optimization and Role Prompting, Saksorn Ruangtanusak et.al., Paper: http://arxiv.org/abs/2509.00482
- 2025-09-12, Tackling One Health Risks: How Large Language Models are leveraged for Risk Negotiation and Consensus-building, Alexandra Fetsch et.al., Paper: http://arxiv.org/abs/2509.09906
- 2025-10-01, TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments, Zhangchen Xu et.al., Paper: http://arxiv.org/abs/2510.01179
- 2025-09-30, TENET: Leveraging Tests Beyond Validation for Code Generation, Yiran Hu et.al., Paper: http://arxiv.org/abs/2509.24148
- 2025-10-27, TALM: Dynamic Tree-Structured Multi-Agent Framework with Long-Term Memory for Scalable Code Generation, Ming-Tung Shen et.al., Paper: http://arxiv.org/abs/2510.23010
- 2025-10-02, TACOS: Task Agnostic COordinator of a multi-drone System, Alessandro Nazzari et.al., Paper: http://arxiv.org/abs/2510.01869
- 2025-10-15, Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms, Shrey Pandit et.al., Paper: http://arxiv.org/abs/2510.13913
- 2025-10-18, Synergizing chemical and AI communities for advancing laboratories of the future, Saejin Oh et.al., Paper: http://arxiv.org/abs/2510.16293
- 2025-10-11, SwarmSys: Decentralized Swarm-Inspired Agents for Scalable and Adaptive Reasoning, Ruohao Li et.al., Paper: http://arxiv.org/abs/2510.10047
- 2025-10-13, SusBench: An Online Benchmark for Evaluating Dark Pattern Susceptibility of Computer-Use Agents, Longjie Guo et.al., Paper: http://arxiv.org/abs/2510.11035
- 2025-09-15, Survival at Any Cost? LLMs and the Choice Between Self-Preservation and Human Harm, Alireza Mohamadi et.al., Paper: http://arxiv.org/abs/2509.12190
- 2025-08-27, Survey of Specialized Large Language Model, Chenghan Yang et.al., Paper: http://arxiv.org/abs/2508.19667
- 2025-08-31, Supporting Our AI Overlords: Redesigning Data Systems to be Agent-First, Shu Liu et.al., Paper: http://arxiv.org/abs/2509.00997
- 2025-09-23, Structured Cognition for Behavioral Intelligence in Large Language Model Agents: Preliminary Study, Myung Ho Kim et.al., Paper: http://arxiv.org/abs/2510.05107
- 2025-10-14, Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs, Yujie Zhao et.al., Paper: http://arxiv.org/abs/2510.11062
- 2025-10-10, StreamingVLM: Real-Time Understanding for Infinite Video Streams, Ruyi Xu et.al., Paper: http://arxiv.org/abs/2510.09608
- 2025-10-07, Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents, Mingkang Zhu et.al., Paper: http://arxiv.org/abs/2510.06214
- 2025-09-12, Strategic Tradeoffs Between Humans and AI in Multi-Agent Bargaining, Crystal Qian et.al., Paper: http://arxiv.org/abs/2509.09071
- 2025-10-02, StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?, Yanxu Chen et.al., Paper: http://arxiv.org/abs/2510.02209
- 2025-10-15, Steer-MoE: Efficient Audio-Language Alignment with a Mixture-of-Experts Steering Module, Ruitao Feng et.al., Paper: http://arxiv.org/abs/2510.13558
- 2025-10-08, Spiral of Silence in Large Language Model Agents, Mingze Zhong et.al., Paper: http://arxiv.org/abs/2510.02360
- 2025-09-26, Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents, Heyang Gao et.al., Paper: http://arxiv.org/abs/2510.03253
- 2025-10-24, Soft Instruction De-escalation Defense, Nils Philipp Walter et.al., Paper: http://arxiv.org/abs/2510.21057
- 2025-10-21, Socialized Learning and Emergent Behaviors in Multi-Agent Systems based on Multimodal Large Language Models, Sureyya Akin et.al., Paper: http://arxiv.org/abs/2510.18515
- 2025-10-01, Social Welfare Function Leaderboard: When LLM Agents Allocate Social Welfare, Zhengliang Shi et.al., Paper: http://arxiv.org/abs/2510.01164
- 2025-10-06, Social Agent: Mastering Dyadic Nonverbal Behavior Generation via Conversational LLM Agents, Zeyi Zhang et.al., Paper: http://arxiv.org/abs/2510.04637
- 2025-10-02, SoK: Measuring What Matters for Closed-Loop Security Agents, Mudita Khurana et.al., Paper: http://arxiv.org/abs/2510.01654
- 2025-09-27, Situational Awareness for Safe and Robust Multi-Agent Interactions Under Uncertainty, Benjamin Alcorn et.al., Paper: http://arxiv.org/abs/2509.23425
- 2025-10-11, Simulating Viva Voce Examinations to Evaluate Clinical Reasoning in Large Language Models, Christopher Chiu et.al., Paper: http://arxiv.org/abs/2510.10278
- 2025-10-09, Simulating Teams with LLM Agents: Interactive 2D Environments for Studying Human-AI Dynamics, Mohammed Almutairi et.al., Paper: http://arxiv.org/abs/2510.08242
- 2025-09-23, Simulating Online Social Media Conversations on Controversial Topics Using AI Agents Calibrated on Real-World Data, Elisa Composta et.al., Paper: http://arxiv.org/abs/2509.18985
- 2025-09-29, SimuHome: A Temporal- and Environment-Aware Benchmark for Smart Home LLM Agents, Gyuhyeon Seo et.al., Paper: http://arxiv.org/abs/2509.24282
- 2025-09-21, SignalLLM: A General-Purpose LLM Agent Framework for Automated Signal Processing, Junlong Ke et.al., Paper: http://arxiv.org/abs/2509.17197
- 2025-10-22, SheetBrain: A Neuro-Symbolic Agent for Accurate Reasoning over Complex and Large Spreadsheets, Ziwei Wang et.al., Paper: http://arxiv.org/abs/2510.19247
- 2025-10-20, ShapeCraft: LLM Agents for Structured, Textured and Interactive 3D Modeling, Shuyuan Zhang et.al., Paper: http://arxiv.org/abs/2510.17603
- 2025-10-15, Sequential Quantum Measurements and the Instrumental Group Algebra, Christopher S. Jackson et.al., Paper: http://arxiv.org/abs/2510.13980
- 2025-10-21, SentinelNet: Safeguarding Multi-Agent Collaboration Through Credit-Based Dynamic Threat Detection, Yang Feng et.al., Paper: http://arxiv.org/abs/2510.16219
- 2025-10-20, Semantic Intelligence: A Bio-Inspired Cognitive Framework for Embodied Agents, Wenbing Tang et.al., Paper: http://arxiv.org/abs/2510.17129
- 2025-10-17, Self-evolving expertise in complex non-verifiable subject domains: dialogue as implicit meta-RL, Richard M. Bailey et.al., Paper: http://arxiv.org/abs/2510.15772
- 2025-09-12, Self-Supervised Goal-Reaching Results in Multi-Agent Cooperation and Exploration, Chirayu Nimonkar et.al., Paper: http://arxiv.org/abs/2509.10656
- 2025-10-09, Self-Improving LLM Agents at Test-Time, Emre Can Acikgoz et.al., Paper: http://arxiv.org/abs/2510.07841
- 2025-10-01, Seeing through Uncertainty: Robust Task-Oriented Optimization in Visual Navigation, Yiyuan Pan et.al., Paper: http://arxiv.org/abs/2510.00441
- 2025-10-22, Seed3D 1.0: From Images to High-Fidelity Simulation-Ready 3D Assets, Jiashi Feng et.al., Paper: http://arxiv.org/abs/2510.19944
- 2025-10-24, Securing AI Agent Execution, Christoph Bühler et.al., Paper: http://arxiv.org/abs/2510.21236
- 2025-09-18, SecureFixAgent: A Hybrid LLM Agent for Automated Python Static Vulnerability Repair, Jugal Gajjar et.al., Paper: http://arxiv.org/abs/2509.16275
- 2025-08-27, Secure Multi-LLM Agentic AI and Agentification for Edge General Intelligence by Zero-Trust: A Survey, Yinqiu Liu et.al., Paper: http://arxiv.org/abs/2508.19870
- 2025-10-21, Search Self-play: Pushing the Frontier of Agent Capability without Supervision, Hongliang Lu et.al., Paper: http://arxiv.org/abs/2510.18821
- 2025-09-12, SciML Agents: Write the Solver, Not the Solution, Saarth Gaonkar et.al., Paper: http://arxiv.org/abs/2509.09936
- 2025-10-13, Scaling Long-Horizon LLM Agent via Context-Folding, Weiwei Sun et.al., Paper: http://arxiv.org/abs/2510.11967
- 2025-10-08, Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management, Miao Lu et.al., Paper: http://arxiv.org/abs/2510.06727
- 2025-10-17, Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding, Sensen Gao et.al., Paper: http://arxiv.org/abs/2510.15253
- 2025-10-10, Safety Game: Balancing Safe and Informative Conversations with Blackbox Agentic AI using LP Solvers, Tuan Nguyen et.al., Paper: http://arxiv.org/abs/2510.09330
- 2025-09-30, SafeMind: Benchmarking and Mitigating Safety Risks in Embodied LLM Agents, Ruolin Chen et.al., Paper: http://arxiv.org/abs/2509.25885
- 2025-10-20, SafeCoop: Unravelling Full Stack Safety in Agentic Collaborative Driving, Xiangbo Gao et.al., Paper: http://arxiv.org/abs/2510.18123
- 2025-09-18, SWE-QA: Can Language Models Answer Repository-level Code Questions?, Weihan Peng et.al., Paper: http://arxiv.org/abs/2509.14635
- 2025-10-16, SVAG-Bench: A Large-Scale Benchmark for Multi-Instance Spatio-temporal Video Action Grounding, Tanveer Hannan et.al., Paper: http://arxiv.org/abs/2510.13016
- 2025-10-15, STEMS: Spatial-Temporal Enhanced Safe Multi-Agent Coordination for Building Energy Management, Huiliang Zhang et.al., Paper: http://arxiv.org/abs/2510.14112
- 2025-10-19, STARK: Strategic Team of Agents for Refining Kernels, Juncheng Dong et.al., Paper: http://arxiv.org/abs/2510.16996
- 2025-09-30, STAC: When Innocent Tools Form Dangerous Chains to Jailbreak LLM Agents, Jing-Jing Li et.al., Paper: http://arxiv.org/abs/2509.25624
- 2025-10-14, SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding, Zhiliu Yang et.al., Paper: http://arxiv.org/abs/2510.12749
- 2025-10-21, SOCIA-Nabla: Textual Gradient Meets Multi-Agent Orchestration for Automated Simulator Generation, Yuncheng Hua et.al., Paper: http://arxiv.org/abs/2510.18551
- 2025-10-08, SID: Multi-LLM Debate Driven by Self Signals, Xuhang Chen et.al., Paper: http://arxiv.org/abs/2510.06843
- 2025-10-17, SIADAFIX: issue description response for adaptive program repair, Xin Cao et.al., Paper: http://arxiv.org/abs/2510.16059
- 2025-10-27, SI-Bench: Benchmarking Social Intelligence of Large Language Models in Human-to-Human Conversations, Shuai Huang et.al., Paper: http://arxiv.org/abs/2510.23182
- 2025-10-17, SHARE: Scene-Human Aligned Reconstruction, Joshua Li et.al., Paper: http://arxiv.org/abs/2510.15342
- 2025-10-14, SENTINEL: A Multi-Level Formal Framework for Safety Evaluation of LLM-based Embodied Agents, Simon Sinong Zhan et.al., Paper: http://arxiv.org/abs/2510.12985
- 2025-09-24, SAMULE: Self-Learning Agents Enhanced by Multi-level Reflection, Yubin Ge et.al., Paper: http://arxiv.org/abs/2509.20562
- 2025-10-16, RoboGPT-R1: Enhancing Robot Planning with Reinforcement Learning, Jinrui Liu et.al., Paper: http://arxiv.org/abs/2510.14828
- 2025-09-30, RoRecomp: Enhancing Reasoning Efficiency via Rollout Response Recomposition in Reinforcement Learning, Gang Li et.al., Paper: http://arxiv.org/abs/2509.25958
- 2025-10-18, Ripple Effect Protocol: Coordinating Agent Populations, Ayush Chopra et.al., Paper: http://arxiv.org/abs/2510.16572
- 2025-10-22, Review of Tools for Zero-Code LLM Based Application Development, Priyaranjan Pattnayak et.al., Paper: http://arxiv.org/abs/2510.19747
- 2025-10-12, Retro*: Optimizing LLMs for Reasoning-Intensive Document Retrieval, Junwei Lan et.al., Paper: http://arxiv.org/abs/2509.24869
- 2025-10-28, Retrieval and Argumentation Enhanced Multi-Agent LLMs for Judgmental Forecasting, Deniz Gorur et.al., Paper: http://arxiv.org/abs/2510.24303
- 2025-10-13, Rethinking Reward Miscalibration of GRPO in Agentic RL, Jingyu Liu et.al., Paper: http://arxiv.org/abs/2509.23870
- 2025-10-28, Repurposing Synthetic Data for Fine-grained Search Agent Supervision, Yida Zhao et.al., Paper: http://arxiv.org/abs/2510.24694
- 2025-10-28, ReplicationBench: Can AI Agents Replicate Astrophysics Research Papers?, Christine Ye et.al., Paper: http://arxiv.org/abs/2510.24591
- 2025-08-26, Reliable Weak-to-Strong Monitoring of LLM Agents, Neil Kale et.al., Paper: http://arxiv.org/abs/2508.19461
- 2025-10-28, Reinforcement Learning for Long-Horizon Multi-Turn Search Agents, Vivek Kalyan et.al., Paper: http://arxiv.org/abs/2510.24126
- 2025-09-08, Reinforcement Learning Foundations for Deep Research Systems: A Survey, Wenjun Li et.al., Paper: http://arxiv.org/abs/2509.06733
- 2025-10-10, Reimagining Agent-based Modeling with Large Language Model Agents via Shachi, So Kuroki et.al., Paper: http://arxiv.org/abs/2509.21862
- 2025-09-15, Redefining Website Fingerprinting Attacks With Multiagent LLMs, Chuxu Song et.al., Paper: http://arxiv.org/abs/2509.12462
- 2025-10-19, ReclAIm: A multi-agent framework for degradation-aware performance tuning of medical imaging AI, Eleftherios Tzanis et.al., Paper: http://arxiv.org/abs/2510.17004
- 2025-09-04, Real-time adaptive quantum error correction by model-free multi-agent learning, Manuel Guatto et.al., Paper: http://arxiv.org/abs/2509.03974
- 2025-08-26, Real-Time Model Checking for Closed-Loop Robot Reactive Planning, Christopher Chandler et.al., Paper: http://arxiv.org/abs/2508.19186
- 2025-10-16, ReUseIt: Synthesizing Reusable AI Agent Workflows for Web Automation, Yimeng Liu et.al., Paper: http://arxiv.org/abs/2510.14308
- 2025-08-29, ReLATE: Learning Efficient Sparse Encoding for High-Performance Tensor Decomposition, Ahmed E. Helal et.al., Paper: http://arxiv.org/abs/2509.00280
- 2025-09-29, RadOnc-GPT: An Autonomous LLM Agent for Real-Time Patient Outcomes Labeling at Scale, Jason Holmes et.al., Paper: http://arxiv.org/abs/2509.25540
- 2025-10-06, RL Is a Hammer and LLMs Are Nails: A Simple Reinforcement Learning Recipe for Strong Prompt Injection, Yuxin Wen et.al., Paper: http://arxiv.org/abs/2510.04885
- 2025-09-08, REMI: A Novel Causal Schema Memory Architecture for Personalized Lifestyle Recommendation Agents, Vishal Raman et.al., Paper: http://arxiv.org/abs/2509.06269
- 2025-10-01, RELATE-Sim: Leveraging Turning Point Theory and LLM Agents to Predict and Understand Long-Term Relationship Dynamics through Interactive Narrative Simulations, Matthew Yue et.al., Paper: http://arxiv.org/abs/2510.00414
- 2025-10-15, RECODE: Reasoning Through Code Generation for Visual Question Answering, Junhong Shen et.al., Paper: http://arxiv.org/abs/2510.13756
- 2025-10-07, RECODE-H: A Benchmark for Research Code Development with Interactive Human Feedback, Chunyu Miao et.al., Paper: http://arxiv.org/abs/2510.06186
- 2025-10-18, REALM: An MLLM-Agent Framework for Open World 3D Reasoning Segmentation and Editing on Gaussian Splatting, Changyue Shi et.al., Paper: http://arxiv.org/abs/2510.16410
- 2025-09-08, RAFFLES: Reasoning-based Attribution of Faults for LLM Systems, Chenyang Zhu et.al., Paper: http://arxiv.org/abs/2509.06822
- 2025-10-01, QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL, Cong Yu et.al., Paper: http://arxiv.org/abs/2510.00967
- 2025-10-19, Pursuing Minimal Sufficiency in Spatial Reasoning, Yejie Guo et.al., Paper: http://arxiv.org/abs/2510.16688
- 2025-09-04, Psychologically Enhanced AI Agents, Maciej Besta et.al., Paper: http://arxiv.org/abs/2509.04343
- 2025-08-28, Provable Benefits of In-Tool Learning for Large Language Models, Sam Houliston et.al., Paper: http://arxiv.org/abs/2508.20755
- 2025-09-14, Prompts to Proxies: Emulating Human Preferences via a Compact LLM Ensemble, Bingchen Wang et.al., Paper: http://arxiv.org/abs/2509.11311
- 2025-09-16, PromptSleuth: Detecting Prompt Injection via Semantic Intent Invariance, Mengxiao Wang et.al., Paper: http://arxiv.org/abs/2508.20890
- 2025-10-18, Prompt Optimization via Retrieved Reasoning Assets and Multi-Agent Analysis, Wonduk Seo et.al., Paper: http://arxiv.org/abs/2510.16635
- 2025-10-08, Prompt Optimization Across Multiple Agents for Representing Diverse Human Populations, Manh Hung Nguyen et.al., Paper: http://arxiv.org/abs/2510.07064
- 2025-10-21, Probabilistic Modeling of Intentions in Socially Intelligent LLM Agents, Feifan Xia et.al., Paper: http://arxiv.org/abs/2510.18476
- 2025-09-22, Privacy in Action: Towards Realistic Privacy Mitigation and Evaluation for LLM-Powered Agents, Shouju Wang et.al., Paper: http://arxiv.org/abs/2509.17488
- 2025-10-18, Prior Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods, Avrim Blum et.al., Paper: http://arxiv.org/abs/2510.16609
- 2025-10-10, Preference-Aware Memory Update for Long-Term LLM Agents, Haoran Sun et.al., Paper: http://arxiv.org/abs/2510.09720
- 2025-10-02, Pre-Hoc Predictions in AutoML: Leveraging LLMs to Enhance Model Selection and Benchmarking for Tabular datasets, Yannis Belkhiter et.al., Paper: http://arxiv.org/abs/2510.01842
- 2025-10-22, Pragmatic Heterogeneous Collaborative Perception via Generative Communication Mechanism, Junfei Zhou et.al., Paper: http://arxiv.org/abs/2510.19618
- 2025-10-02, Position: Privacy Is Not Just Memorization!, Niloofar Mireshghallah et.al., Paper: http://arxiv.org/abs/2510.01645
- 2025-10-17, PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction, Simon Yu et.al., Paper: http://arxiv.org/abs/2510.15863
- 2025-10-28, Policy Cards: Machine-Readable Runtime Governance for Autonomous AI Agents, Juraj Mavračić et.al., Paper: http://arxiv.org/abs/2510.24383
- 2025-10-21, PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold, Yi Wan et.al., Paper: http://arxiv.org/abs/2510.15862
- 2025-10-21, Plural Voices, Single Agent: Towards Inclusive AI in Multi-User Domestic Spaces, Joydeep Chandra et.al., Paper: http://arxiv.org/abs/2510.19008
- 2025-10-06, Plug-and-Play Dramaturge: A Divide-and-Conquer Approach for Iterative Narrative Script Refinement via Collaborative LLM Agents, Wenda Xie et.al., Paper: http://arxiv.org/abs/2510.05188
- 2025-10-01, Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs, Siyu Zhu et.al., Paper: http://arxiv.org/abs/2509.25779
- 2025-09-24, Perspectra: Choosing Your Experts Enhances Critical Thinking in Multi-Agent Research Ideation, Yiren Liu et.al., Paper: http://arxiv.org/abs/2509.20553
- 2025-09-28, PartnerMAS: An LLM Hierarchical Multi-Agent Framework for Business Partner Selection on High-Dimensional Features, Lingyao Li et.al., Paper: http://arxiv.org/abs/2509.24046
- 2025-09-29, PanoWorld-X: Generating Explorable Panoramic Worlds via Sphere-Aware Video Diffusion, Yuyang Yin et.al., Paper: http://arxiv.org/abs/2509.24997
- 2025-10-20, PLAGUE: Plug-and-play framework for Lifelong Adaptive Generation of Multi-turn Exploits, Neeladri Bhuiya et.al., Paper: http://arxiv.org/abs/2510.17947
- 2025-10-28, PFEA: An LLM-based High-Level Natural Language Planning and Feedback Embodied Agent for Human-Centered AI, Wenbin Ding et.al., Paper: http://arxiv.org/abs/2510.24109
- 2025-10-08, PARSE: LLM Driven Schema Optimization for Reliable Entity Extraction, Anubhav Shrimal et.al., Paper: http://arxiv.org/abs/2510.08623
- 2025-10-13, PADME: Procedure Aware DynaMic Execution, Deepeka Garg et.al., Paper: http://arxiv.org/abs/2510.11281
- 2025-10-27, P1GPT: a multi-agent LLM workflow module for multi-modal financial information analysis, Chen-Che Lu et.al., Paper: http://arxiv.org/abs/2510.23032
- 2025-09-19, Overhearing LLM Agents: A Survey, Taxonomy, and Roadmap, Andrew Zhu et.al., Paper: http://arxiv.org/abs/2509.16325
- 2025-10-17, Outraged AI: Large language models prioritise emotion over cost in fairness enforcement, Hao Liu et.al., Paper: http://arxiv.org/abs/2510.17880
- 2025-10-02, Orchestrating Human-AI Teams: The Manager Agent as a Unifying Research Challenge, Charlie Masters et.al., Paper: http://arxiv.org/abs/2510.02557
- 2025-10-28, OrchDAG: Complex Tool Orchestration in Multi-Turn Interactions with Plan DAGs, Yifu Lu et.al., Paper: http://arxiv.org/abs/2510.24663
- 2025-09-28, Optimism as Risk-Seeking in Multi-Agent Reinforcement Learning, Runyu Zhang et.al., Paper: http://arxiv.org/abs/2509.24047
- 2025-10-21, Optimal allocations with distortion risk measures and mixed risk attitudes, Mario Ghossoub et.al., Paper: http://arxiv.org/abs/2510.18236
- 2025-10-09, Opponent Shaping in LLM Agents, Marta Emili Garcia Segura et.al., Paper: http://arxiv.org/abs/2510.08255
- 2025-10-24, OpenHype: Hyperbolic Embeddings for Hierarchical Open-Vocabulary Radiance Fields, Lisa Weijler et.al., Paper: http://arxiv.org/abs/2510.21441
- 2025-10-16, OpenDerisk: An Industrial Framework for AI-Driven SRE, with Design, Implementation, and Case Studies, Peng Di et.al., Paper: http://arxiv.org/abs/2510.13561
- 2025-10-23, Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence, Jiahao Meng et.al., Paper: http://arxiv.org/abs/2510.20579
- 2025-10-01, On the Soundness and Consistency of LLM Agents for Executing Test Cases Written in Natural Language, Sébastien Salva et.al., Paper: http://arxiv.org/abs/2509.19136
- 2025-10-14, On the Number of Small Points for Rational Maps, Jit Wu Yap et.al., Paper: http://arxiv.org/abs/2510.12039
- 2025-10-27, On Generalization in Agentic Tool Calling: CoreThink Agentic Reasoner and MAVEN Dataset, Vishvesh Bhat et.al., Paper: http://arxiv.org/abs/2510.22898
- 2025-09-05, OSC: Cognitive Orchestration through Dynamic Knowledge Alignment in Multi-Agent LLM Collaboration, Jusheng Zhang et.al., Paper: http://arxiv.org/abs/2509.04876
- 2025-09-01, ORCA: ORchestrating Causal Agent, Joanie Hayoun Chung et.al., Paper: http://arxiv.org/abs/2508.21304
- 2025-10-20, OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning, Zhenyu Bi et.al., Paper: http://arxiv.org/abs/2510.18032
- 2025-09-20, OPEN-THEATRE: An Open-Source Toolkit for LLM-based Interactive Drama, Tianyang Xu et.al., Paper: http://arxiv.org/abs/2509.16713
- 2025-10-22, Nonmonotone subgradient methods based on a local descent lemma, Francisco J. Aragón-Artacho et.al., Paper: http://arxiv.org/abs/2510.19341
- 2025-08-21, Noise, Adaptation, and Strategy: Assessing LLM Fidelity in Decision-Making, Yuanjun Feng et.al., Paper: http://arxiv.org/abs/2508.15926
- 2025-10-08, NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents, Tianshi Zheng et.al., Paper: http://arxiv.org/abs/2510.07172
- 2025-10-09, Neuro-Symbolic Agents with Modal Logic for Autonomous Diagnostics, Antonin Sulc et.al., Paper: http://arxiv.org/abs/2509.11943
- 2025-10-17, Narrowing Action Choices with AI Improves Human Sequential Decisions, Eleni Straitouri et.al., Paper: http://arxiv.org/abs/2510.16097
- 2025-10-21, NEBULA: Do We Evaluate Vision-Language-Action Agents Correctly?, Jierui Peng et.al., Paper: http://arxiv.org/abs/2510.16263
- 2025-10-14, MultiFoodhat: A potential new paradigm for intelligent food quality inspection, Yue Hu et.al., Paper: http://arxiv.org/abs/2510.13889
- 2025-10-17, Multi-dimensional Data Analysis and Applications Basing on LLM Agents and Knowledge Graph Interactions, Xi Wang et.al., Paper: http://arxiv.org/abs/2510.15258
- 2025-10-27, Multi-Stakeholder Alignment in LLM-Powered Collaborative AI Systems: A Multi-Agent Framework for Intelligent Tutoring, Alexandre P Uchoa et.al., Paper: http://arxiv.org/abs/2510.23245
- 2025-10-06, Multi-Agent Tool-Integrated Policy Optimization, Zhanfeng Mo et.al., Paper: http://arxiv.org/abs/2510.04678
- 2025-09-01, Multi-Agent Reinforcement Learning for Task Offloading in Wireless Edge Networks, Andrea Fox et.al., Paper: http://arxiv.org/abs/2509.01257
- 2025-10-14, Multi-Agent Debate for LLM Judges with Adaptive Stability Detection, Tianyu Hu et.al., Paper: http://arxiv.org/abs/2510.12697
- 2025-09-09, Multi Robot Coordination in Highly Dynamic Environments: Tackling Asymmetric Obstacles and Limited Communication, Vincenzo Suriani et.al., Paper: http://arxiv.org/abs/2509.08859
- 2025-10-19, More with Less: An Empirical Study of Turn-Control Strategies for Efficient Coding Agents, Pengfei Gao et.al., Paper: http://arxiv.org/abs/2510.16786
- 2025-10-27, Model Proficiency in Centralized Multi-Agent Systems: A Performance Study, Anna Guerra et.al., Paper: http://arxiv.org/abs/2510.23447
- 2025-10-24, Mixture-of-Minds: Multi-Agent Reinforcement Learning for Table Understanding, Yuhang Zhou et.al., Paper: http://arxiv.org/abs/2510.20176
- 2025-09-28, Mix-Ecom: Towards Mixed-Type E-Commerce Dialogues with Complex Domain Rules, Chenyu Zhou et.al., Paper: http://arxiv.org/abs/2509.23836
- 2025-10-22, Misalignment Bounty: Crowdsourcing AI Agent Misbehavior, Rustem Turtayev et.al., Paper: http://arxiv.org/abs/2510.19738
- 2025-09-16, Mining the Long Tail: A Comparative Study of Data-Centric Criticality Metrics for Robust Offline Reinforcement Learning in Autonomous Motion Planning, Antonio Guillen-Perez et.al., Paper: http://arxiv.org/abs/2508.18397
- 2025-08-28, MindGuard: Tracking, Detecting, and Attributing MCP Tool Poisoning Attack via Decision Dependence Graph, Zhiqiang Wang et.al., Paper: http://arxiv.org/abs/2508.20412
- 2025-10-16, MetaCaptioner: Towards Generalist Visual Captioning with Open-source Suites, Zhenxin Lei et.al., Paper: http://arxiv.org/abs/2510.12126
- 2025-09-08, Meta-Policy Reflexion: Reusable Reflective Memory and Rule Admissibility for Resource-Efficient LLM Agent, Chunlong Wu et.al., Paper: http://arxiv.org/abs/2509.03990
- 2025-10-23, Merge and Conquer: Evolutionarily Optimizing AI for 2048, Maggie Bai et.al., Paper: http://arxiv.org/abs/2510.20205
- 2025-10-21, Memory-Augmented State Machine Prompting: A Novel LLM Agent Framework for Real-Time Strategy Games, Runnan Qi et.al., Paper: http://arxiv.org/abs/2510.18395
- 2025-09-27, Memory Management and Contextual Consistency for Long-Running Low-Code Agents, Jiexi Xu et.al., Paper: http://arxiv.org/abs/2509.25250
- 2025-10-22, Memo: Training Memory-Efficient Embodied Agents with Reinforcement Learning, Gunshi Gupta et.al., Paper: http://arxiv.org/abs/2510.19732
- 2025-08-25, Memento: Fine-tuning LLM Agents without Fine-tuning LLMs, Huichi Zhou et.al., Paper: http://arxiv.org/abs/2508.16153
- 2025-09-23, MemOrb: A Plug-and-Play Verbal-Reinforcement Memory Layer for E-Commerce Customer Service, Yizhe Huang et.al., Paper: http://arxiv.org/abs/2509.18713
- 2025-09-30, Mem-α: Learning Memory Construction via Reinforcement Learning, Yu Wang et.al., Paper: http://arxiv.org/abs/2509.25911
- 2025-09-15, MedicalOS: An LLM Agent based Operating System for Digital Healthcare, Jared Zhu et.al., Paper: http://arxiv.org/abs/2509.11507
- 2025-10-12, MedCoAct: Confidence-Aware Multi-Agent Collaboration for Complete Clinical Decision, Hongjie Zheng et.al., Paper: http://arxiv.org/abs/2510.10461
- 2025-10-21, Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents, Guangfu Guo et.al., Paper: http://arxiv.org/abs/2510.18424
- 2025-10-14, ManiAgent: An Agentic Framework for General Robotic Manipulation, Yi Yang et.al., Paper: http://arxiv.org/abs/2510.11660
- 2025-10-01, ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs, Adi Simhi et.al., Paper: http://arxiv.org/abs/2510.00857
- 2025-10-24, Magellan: Guided MCTS for Latent Space Exploration and Novelty Generation, Lufan Chang et.al., Paper: http://arxiv.org/abs/2510.21341
- 2025-09-04, Maestro: Joint Graph & Config Optimization for Reliable AI Agents, Wenxiao Wang et.al., Paper: http://arxiv.org/abs/2509.04642
- 2025-09-22, MSCoRe: A Benchmark for Multi-Stage Collaborative Reasoning in LLM Agents, Yuzhen Lei et.al., Paper: http://arxiv.org/abs/2509.17628
- 2025-10-22, MSC-Bench: A Rigorous Benchmark for Multi-Server Tool Orchestration, Jia-Kai Dong et.al., Paper: http://arxiv.org/abs/2510.19423
- 2025-10-20, MIRAGE: Agentic Framework for Multimodal Misinformation Detection with Web-Grounded Reasoning, Mir Nafis Sharear Shopnil et.al., Paper: http://arxiv.org/abs/2510.17590
- 2025-10-21, MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models, ChangSu Choi et.al., Paper: http://arxiv.org/abs/2510.18383
- 2025-10-28, MCP-Flow: Facilitating LLM Agents to Master Real-World, Diverse and Scaling MCP Tools, Wenhao Wang et.al., Paper: http://arxiv.org/abs/2510.24284
- 2025-08-28, MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers, Zhenting Wang et.al., Paper: http://arxiv.org/abs/2508.20453
- 2025-10-14, MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents, Dongsen Zhang et.al., Paper: http://arxiv.org/abs/2510.15994
- 2025-08-26, MATRIX: Multi-Agent simulaTion fRamework for safe Interactions and conteXtual clinical conversational evaluation, Ernest Lim et.al., Paper: http://arxiv.org/abs/2508.19163
- 2025-09-30, MASLegalBench: Benchmarking Multi-Agent Systems in Deductive Legal Reasoning, Huihao Jing et.al., Paper: http://arxiv.org/abs/2509.24922
- 2025-09-29, MAS$^2$: Self-Generative, Self-Configuring, Self-Rectifying Multi-Agent Systems, Kun Wang et.al., Paper: http://arxiv.org/abs/2509.24323
- 2025-10-17, MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games, Huining Yuan et.al., Paper: http://arxiv.org/abs/2510.15414
- 2025-09-04, MAGneT: Coordinated Multi-Agent Generation of Synthetic Multi-Turn Mental Health Counseling Sessions, Aishik Mandal et.al., Paper: http://arxiv.org/abs/2509.04183
- 2025-10-16, MAGPIE: A benchmark for Multi-AGent contextual PrIvacy Evaluation, Gurusha Juneja et.al., Paper: http://arxiv.org/abs/2510.15186
- 2025-10-16, MAFA: A Multi-Agent Framework for Enterprise-Scale Annotation with Configurable Task Adaptation, Mahmood Hegazy et.al., Paper: http://arxiv.org/abs/2510.14184
- 2025-10-15, MADREC: A Multi-Aspect Driven LLM Agent for Explainable and Adaptive Recommendation, Jiin Park et.al., Paper: http://arxiv.org/abs/2510.13371
- 2025-10-27, Lost in Tokenization: Context as the Key to Unlocking Biomolecular Understanding in Scientific LLMs, Kai Zhuang et.al., Paper: http://arxiv.org/abs/2510.23127
- 2025-10-28, Look and Tell: A Dataset for Multimodal Grounding Across Egocentric and Exocentric Views, Anna Deichler et.al., Paper: http://arxiv.org/abs/2510.22672
- 2025-09-27, Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents, Yaorui Shi et.al., Paper: http://arxiv.org/abs/2509.23040
- 2025-10-08, LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling, Zecheng Tang et.al., Paper: http://arxiv.org/abs/2510.06915
- 2025-09-30, Lita: Light Agent Uncovers the Agentic Coding Capabilities of LLMs, Hankun Dai et.al., Paper: http://arxiv.org/abs/2509.25873
- 2025-10-16, LiRA: Linguistic Robust Anchoring for Cross-lingual Large Language Models, Haolin Li et.al., Paper: http://arxiv.org/abs/2510.14466
- 2025-10-11, Leveraging Large Language Models for Cybersecurity Risk Assessment – A Case from Forestry Cyber-Physical Systems, Fikret Mert Gultekin et.al., Paper: http://arxiv.org/abs/2510.06343
- 2025-09-04, Leveraging LLM-Based Agents for Intelligent Supply Chain Planning, Yongzhi Qi et.al., Paper: http://arxiv.org/abs/2509.03811
- 2025-09-26, Leveraging LLM Agents for Automated Video Game Testing, Chengjia Wang et.al., Paper: http://arxiv.org/abs/2509.22170
- 2025-09-07, Let’s Roleplay: Examining LLM Alignment in Collaborative Dialogues, Abhijnan Nath et.al., Paper: http://arxiv.org/abs/2509.05882
- 2025-10-22, Learning to Make Friends: Coaching LLM Agents toward Emergent Social Ties, Philipp J. Schneider et.al., Paper: http://arxiv.org/abs/2510.19299
- 2025-10-09, Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks, Cheng Yang et.al., Paper: http://arxiv.org/abs/2510.08002
- 2025-10-22, Learning from Supervision with Semantic and Episodic Memory: A Reflective Approach to Agent Adaptation, Jackson Hassell et.al., Paper: http://arxiv.org/abs/2510.19897
- 2025-09-30, Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents, Davide Paglieri et.al., Paper: http://arxiv.org/abs/2509.03581
- 2025-10-19, Learning Ecology with VERA Using Conceptual Models and Simulations, Spencer Rugaber et.al., Paper: http://arxiv.org/abs/2510.16944
- 2025-10-10, Leading the Follower: Learning Persuasive Agents in Social Deduction Games, Zhang Zheng et.al., Paper: http://arxiv.org/abs/2510.09087
- 2025-10-28, Law in Silico: Simulating Legal Society with LLM-Based Agents, Yiding Wang et.al., Paper: http://arxiv.org/abs/2510.24442
- 2025-10-19, Lark: Biologically Inspired Neuroevolution for Multi-Stakeholder LLM Agents, Dheeraj Chintapalli et.al., Paper: http://arxiv.org/abs/2510.16978
- 2025-10-22, Large Language Model enabled Mathematical Modeling, Guoyun Zhang et.al., Paper: http://arxiv.org/abs/2510.19895
- 2025-10-27, Language Server CLI Empowers Language Agents with Process Rewards, Yifan Zhang et.al., Paper: http://arxiv.org/abs/2510.22907
- 2025-10-16, LabOS: The AI-XR Co-Scientist That Sees and Works With Humans, Le Cong et.al., Paper: http://arxiv.org/abs/2510.14861
- 2025-09-24, LLMs for Bayesian Optimization in Scientific Domains: Are We There Yet?, Rushil Gupta et.al., Paper: http://arxiv.org/abs/2509.21403
- 2025-10-07, LLMs as Policy-Agnostic Teammates: A Case Study in Human Proxy Design for Heterogeneous Agent Teams, Aju Ani Justus et.al., Paper: http://arxiv.org/abs/2510.06151
- 2025-09-21, LLMs as Layout Designers: A Spatial Reasoning Perspective, Sha Li et.al., Paper: http://arxiv.org/abs/2509.16891
- 2025-09-23, LLMZ+: Contextual Prompt Whitelist Principles for Agentic LLMs, Tom Pawelek et.al., Paper: http://arxiv.org/abs/2509.18557
- 2025-09-28, LLM/Agent-as-Data-Analyst: A Survey, Zirui Tang et.al., Paper: http://arxiv.org/abs/2509.23988
- 2025-10-07, LLM-FS-Agent: A Deliberative Role-based Large Language Model Architecture for Transparent Feature Selection, Mohamed Bal-Ghaoui et.al., Paper: http://arxiv.org/abs/2510.05935
- 2025-09-30, LLM Agents for Knowledge Discovery in Atomic Layer Processing, Andreas Werbrouck et.al., Paper: http://arxiv.org/abs/2509.26201
- 2025-09-23, LLM Agents for Interactive Workflow Provenance: Reference Architecture and Evaluation Methodology, Renan Souza et.al., Paper: http://arxiv.org/abs/2509.13978
- 2025-10-16, LLM Agents for Automated Web Vulnerability Reproduction: Are We There Yet?, Bin Liu et.al., Paper: http://arxiv.org/abs/2510.14700
- 2025-10-03, LLM Agents for Automated Dependency Upgrades, Vali Tawosi et.al., Paper: http://arxiv.org/abs/2510.03480
- 2025-09-19, LLM Agents at the Roundtable: A Multi-Perspective and Dialectical Reasoning Framework for Essay Scoring, Jinhee Jang et.al., Paper: http://arxiv.org/abs/2509.14834
- 2025-10-16, LLM Agents Beyond Utility: An Open-Ended Perspective, Asen Nachkov et.al., Paper: http://arxiv.org/abs/2510.14548
- 2025-09-25, LLM Agent Meets Agentic AI: Can LLM Agents Simulate Customers to Evaluate Agentic-AI-based Shopping Assistants?, Lu Sun et.al., Paper: http://arxiv.org/abs/2509.21501
- 2025-09-25, LIMI: Less is More for Agency, Yang Xiao et.al., Paper: http://arxiv.org/abs/2509.17567
- 2025-09-23, LCMF: Lightweight Cross-Modality Mambaformer for Embodied Robotics VQA, Zeyi Kang et.al., Paper: http://arxiv.org/abs/2509.18576
- 2025-10-21, LAFA: Agentic LLM-Driven Federated Analytics over Decentralized Data Sources, Haichao Ji et.al., Paper: http://arxiv.org/abs/2510.18477
- 2025-10-08, LAD-RAG: Layout-aware Dynamic RAG for Visually-Rich Document Understanding, Zhivar Sourati et.al., Paper: http://arxiv.org/abs/2510.07233
- 2025-10-08, L2M-AID: Autonomous Cyber-Physical Defense by Fusing Semantic Reasoning of Large Language Models with Multi-Agent Reinforcement Learning (Preprint), Tianxiang Xu et.al., Paper: http://arxiv.org/abs/2510.07363
- 2025-10-11, KG-MAS: Knowledge Graph-Enhanced Multi-Agent Infrastructure for coupling physical and digital robotic environments, Walid Abdela et.al., Paper: http://arxiv.org/abs/2510.10325
- 2025-10-21, KAT-Coder Technical Report, Zizheng Zhan et.al., Paper: http://arxiv.org/abs/2510.18779
- 2025-10-05, Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation, Hadi Nekoei et.al., Paper: http://arxiv.org/abs/2510.04373
- 2025-09-26, JudgeAgent: Knowledge-wise and Dynamic LLM Evaluation with Agent-as-Interviewer, Zhichao Shi et.al., Paper: http://arxiv.org/abs/2509.02097
- 2025-10-01, JoyAgent-JDGenie: Technical Report on the GAIA, Jiarun Liu et.al., Paper: http://arxiv.org/abs/2510.00510
- 2025-10-21, JAUNT: Joint Alignment of User Intent and Network State for QoE-centric LLM Tool Routing, Enhan Li et.al., Paper: http://arxiv.org/abs/2510.18550
- 2025-10-20, Investigating the Impact of Dark Patterns on LLM-Based Web Agents, Devin Ersoy et.al., Paper: http://arxiv.org/abs/2510.18113
- 2025-10-28, Investigating Software Aging in LLM-Generated Software Systems, César Santos et.al., Paper: http://arxiv.org/abs/2510.24188
- 2025-09-05, Internet 3.0: Architecture for a Web-of-Agents with it’s Algorithm for Ranking Agents, Rajesh Tembarai Krishnamachari et.al., Paper: http://arxiv.org/abs/2509.04979
- 2025-10-16, Internalizing World Models via Self-Play Finetuning for Agentic RL, Shiqi Chen et.al., Paper: http://arxiv.org/abs/2510.15047
- 2025-10-05, Internal World Models as Imagination Networks in Cognitive Agents, Saurabh Ranjan et.al., Paper: http://arxiv.org/abs/2510.04391
- 2025-08-27, Interactive Graph Visualization and Teaming Recommendation in an Interdisciplinary Project’s Talent Knowledge Graph, Jiawei Xu et.al., Paper: http://arxiv.org/abs/2508.19489
- 2025-09-01, Instructional Agents: LLM Agents on Automated Course Material Generation for Teaching Faculties, Huaiyuan Yao et.al., Paper: http://arxiv.org/abs/2508.19611
- 2025-10-21, InspectCoder: Dynamic Analysis-Enabled Self Repair through interactive LLM-Debugger Collaboration, Yunkun Wang et.al., Paper: http://arxiv.org/abs/2510.18327
- 2025-09-26, Infusing Theory of Mind into Socially Intelligent LLM Agents, EunJeong Hwang et.al., Paper: http://arxiv.org/abs/2509.22887
- 2025-10-16, Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents, Guoqing Wang et.al., Paper: http://arxiv.org/abs/2510.14967
- 2025-10-04, InfoMosaic-Bench: Evaluating Multi-Source Information Seeking in Tool-Augmented Agents, Yaxin Du et.al., Paper: http://arxiv.org/abs/2510.02271
- 2025-09-30, InfiAgent: Self-Evolving Pyramid Agent Framework for Infinite Scenarios, Chenglin Yu et.al., Paper: http://arxiv.org/abs/2509.22502
- 2025-08-30, Inducing State Anxiety in LLM Agents Reproduces Human-Like Biases in Consumer Decision-Making, Ziv Ben-Zion et.al., Paper: http://arxiv.org/abs/2510.06222
- 2025-10-27, Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning, Ran Xu et.al., Paper: http://arxiv.org/abs/2510.23038
- 2025-10-15, In-Browser LLM-Guided Fuzzing for Real-Time Prompt Injection Testing in Agentic AI Browsers, Avihay Cohen et.al., Paper: http://arxiv.org/abs/2510.13543
- 2025-09-28, Improving the Efficiency of LLM Agent Systems through Trajectory Reduction, Yuan-An Xiao et.al., Paper: http://arxiv.org/abs/2509.23586
- 2025-10-03, Improving GUI Grounding with Explicit Position-to-Coordinate Mapping, Suyuchen Wang et.al., Paper: http://arxiv.org/abs/2510.03230
- 2025-10-23, ImpossibleBench: Measuring LLMs’ Propensity of Exploiting Test Cases, Ziqian Zhong et.al., Paper: http://arxiv.org/abs/2510.20270
- 2025-09-26, Impact of Collective Behaviors of Autonomous Vehicles on Urban Traffic Dynamics: A Multi-Agent Reinforcement Learning Approach, Ahmet Onur Akman et.al., Paper: http://arxiv.org/abs/2509.22216
- 2025-08-22, IR-Agent: Expert-Inspired LLM Agents for Structure Elucidation from Infrared Spectra, Heewoong Noh et.al., Paper: http://arxiv.org/abs/2508.16112
- 2025-10-14, IL3D: A Large-Scale Indoor Layout Dataset for LLM-Driven 3D Scene Generation, Wenxu Zhou et.al., Paper: http://arxiv.org/abs/2510.12095
- 2025-09-10, HypoGeneAgent: A Hypothesis Language Agent for Gene-Set Cluster Resolution Selection Using Perturb-seq Datasets, Ying Yuan et.al., Paper: http://arxiv.org/abs/2509.09740
- 2025-10-24, Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine, Wenyi Wang et.al., Paper: http://arxiv.org/abs/2510.21614
- 2025-10-23, Human-Centered LLM-Agent System for Detecting Anomalous Digital Asset Transactions, Gyuyeon Na et.al., Paper: http://arxiv.org/abs/2510.20102
- 2025-10-22, Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1, Qianli Ma et.al., Paper: http://arxiv.org/abs/2510.19600
- 2025-09-22, Human vs. Agent in Task-Oriented Conversations, Zhefan Wang et.al., Paper: http://arxiv.org/abs/2509.17619
- 2025-09-19, How do Language Models Generate Slang: A Systematic Comparison between Human and Machine-Generated Slang Usages, Siyang Wu et.al., Paper: http://arxiv.org/abs/2509.15518
- 2025-10-10, How can we assess human-agent interactions? Case studies in software agent design, Valerie Chen et.al., Paper: http://arxiv.org/abs/2510.09801
- 2025-09-17, How Does Cognitive Bias Affect Large Language Models? A Case Study on the Anchoring Effect in Price Negotiation Simulations, Yoshiki Takenami et.al., Paper: http://arxiv.org/abs/2508.21137
- 2025-10-26, How Do AI Agents Do Human Work? Comparing AI and Human Workflows Across Diverse Occupations, Zora Zhiruo Wang et.al., Paper: http://arxiv.org/abs/2510.22780
- 2025-09-01, How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on $τ$-bench, Venkatesh Mishra et.al., Paper: http://arxiv.org/abs/2508.20931
- 2025-10-13, Holistic Agent Leaderboard: The Missing Infrastructure for AI Agent Evaluation, Sayash Kapoor et.al., Paper: http://arxiv.org/abs/2510.11977
- 2025-10-15, Higher Satisfaction, Lower Cost: A Technical Report on How LLMs Revolutionize Meituan’s Intelligent Interaction Systems, Xuxin Cheng et.al., Paper: http://arxiv.org/abs/2510.13291
- 2025-08-29, HiVA: Self-organized Hierarchical Variable Agent via Goal-driven Semantic-Topological Evolution, Jinzhou Tang et.al., Paper: http://arxiv.org/abs/2509.00189
- 2025-10-16, Helmsman: Autonomous Synthesis of Federated Learning Systems via Multi-Agent Collaboration, Haoyuan Li et.al., Paper: http://arxiv.org/abs/2510.14512
- 2025-09-11, Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents, Jiawei Wang et.al., Paper: http://arxiv.org/abs/2509.09265
- 2025-10-24, HIKMA: Human-Inspired Knowledge by Machine Agents through a Multi-Agent Framework for Semi-Autonomous Scientific Conferences, Zain Ul Abideen Tariq et.al., Paper: http://arxiv.org/abs/2510.21370
- 2025-09-16, H$^2$R: Hierarchical Hindsight Reflection for Multi-Task LLM Agents, Shicheng Ye et.al., Paper: http://arxiv.org/abs/2509.12810
- 2025-10-02, GuruAgents: Emulating Wise Investors with Prompt-Guided LLM Agents, Yejin Kim et.al., Paper: http://arxiv.org/abs/2510.01664
- 2025-09-09, Guided Reasoning in LLM-Driven Penetration Testing Using Structured Attack Trees, Katsuaki Nakano et.al., Paper: http://arxiv.org/abs/2509.07939
- 2025-10-12, GraphTracer: Graph-Guided Failure Tracing in LLM Agents for Robust Multi-Turn Deep Search, Heng Zhang et.al., Paper: http://arxiv.org/abs/2510.10581
- 2025-09-20, Governed By Agents: A Survey On The Role Of Agentic AI In Future Computing Environments, Nauman Ali Murad et.al., Paper: http://arxiv.org/abs/2509.16676
- 2025-10-23, GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments?, Chiyu Chen et.al., Paper: http://arxiv.org/abs/2510.20333
- 2025-09-09, Getting In Contract with Large Language Models – An Agency Theory Perspective On Large Language Model Alignment, Sascha Kaltenpoth et.al., Paper: http://arxiv.org/abs/2509.07642
- 2025-10-21, Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming, Zheng Zhang et.al., Paper: http://arxiv.org/abs/2510.18314
- 2025-08-26, Generative Artificial Intelligence and Agents in Research and Teaching, Jussi S. Jauhiainen et.al., Paper: http://arxiv.org/abs/2508.16701
- 2025-10-16, Generalized Dynamics Generation towards Scannable Physical World Model, Yichen Li et.al., Paper: http://arxiv.org/abs/2510.15041
- 2025-09-22, Generalizable End-to-End Tool-Use RL with Synthetic CodeGym, Weihua Du et.al., Paper: http://arxiv.org/abs/2509.17325
- 2025-10-14, GenCellAgent: Generalizable, Training-Free Cellular Image Segmentation via Large Language Model Agents, Xi Yu et.al., Paper: http://arxiv.org/abs/2510.13896
- 2025-10-02, Gala: Global LLM Agents for Text-to-Model Translation, Junyang Cai et.al., Paper: http://arxiv.org/abs/2509.08970
- 2025-10-16, GUIrilla: A Scalable Framework for Automated Desktop UI Exploration, Sofiya Garkot et.al., Paper: http://arxiv.org/abs/2510.16051
- 2025-09-28, GUI-Shepherd: Reliable Process Reward and Verification for Long-Sequence GUI Tasks, Cong Chen et.al., Paper: http://arxiv.org/abs/2509.23738
- 2025-10-02, GSM-Agent: Understanding Agentic Reasoning Using Controllable Environments, Hanlin Zhu et.al., Paper: http://arxiv.org/abs/2509.21998
- 2025-10-14, GOAT: A Training Framework for Goal-Oriented Agent with Tools, Hyunji Min et.al., Paper: http://arxiv.org/abs/2510.12218
- 2025-10-10, Fundamentals of Building Autonomous LLM Agents, Victor de Lamo Castrillo et.al., Paper: http://arxiv.org/abs/2510.09244
- 2025-10-28, FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling, Zengzhuang Xu et.al., Paper: http://arxiv.org/abs/2510.24645
- 2025-09-30, From Trace to Line: LLM Agent for Real-World OSS Vulnerability Localization, Haoran Xi et.al., Paper: http://arxiv.org/abs/2510.02389
- 2025-10-05, From Shadow to Light: Toward Safe and Efficient Policy Learning Across MPC, DeePC, RL, and LLM Agents, Amin Vahidi-Moghaddam et.al., Paper: http://arxiv.org/abs/2510.04076
- 2025-10-15, From Refusal to Recovery: A Control-Theoretic Approach to Generative AI Guardrails, Ravi Pandya et.al., Paper: http://arxiv.org/abs/2510.13727
- 2025-10-23, From Questions to Queries: An AI-powered Multi-Agent Framework for Spatial Text-to-SQL, Ali Khosravi Kazazi et.al., Paper: http://arxiv.org/abs/2510.21045
- 2025-10-28, From Observability Data to Diagnosis: An Evolving Multi-agent System for Incident Management in Cloud Systems, Yu Luo et.al., Paper: http://arxiv.org/abs/2510.24145
- 2025-10-14, From Literal to Liberal: A Meta-Prompting Framework for Eliciting Human-Aligned Exception Handling in Large Language Models, Imran Khan et.al., Paper: http://arxiv.org/abs/2510.12864
- 2025-09-17, From Legacy Fortran to Portable Kokkos: An Autonomous Agentic AI Workflow, Sparsh Gupta et.al., Paper: http://arxiv.org/abs/2509.12443
- 2025-08-24, From Language to Action: A Review of Large Language Models as Autonomous Agents and Tool Users, Sadia Sultana Chowa et.al., Paper: http://arxiv.org/abs/2508.17281
- 2025-10-23, From Generation to Attribution: Music AI Agent Architectures for the Post-Streaming Era, Wonil Kim et.al., Paper: http://arxiv.org/abs/2510.20276
- 2025-09-07, From Digital Distrust to Codified Honesty: Experimental Evidence on Generative AI in Credence Goods Markets, Alexander Erlei et.al., Paper: http://arxiv.org/abs/2509.06069
- 2025-10-07, From Agentification to Self-Evolving Agentic AI for Wireless Networks: Concepts, Approaches, and Future Research Directions, Changyuan Zhao et.al., Paper: http://arxiv.org/abs/2510.05596
- 2025-09-14, Free-MAD: Consensus-Free Multi-Agent Debate, Yu Cui et.al., Paper: http://arxiv.org/abs/2509.11035
- 2025-10-15, Formalizing the Safety, Security, and Functional Properties of Agentic AI Systems, Edoardo Allegrini et.al., Paper: http://arxiv.org/abs/2510.14133
- 2025-10-21, Food4All: A Multi-Agent Framework for Real-time Free Food Discovery with Integrated Nutritional Metadata, Zhengqing Yuan et.al., Paper: http://arxiv.org/abs/2510.18289
- 2025-09-11, Flip Co-op: Cooperative Takeovers in Shared Autonomy, Sandeep Banik et.al., Paper: http://arxiv.org/abs/2509.09281
- 2025-10-24, Five-loop beta function for gauge theories: computations, results and consequences, F. Herzog et.al., Paper: http://arxiv.org/abs/2510.21624
- 2025-10-01, Fine-tuning with RAG for Improving LLM Learning of New Skills, Humaid Ibrahim et.al., Paper: http://arxiv.org/abs/2510.01375
- 2025-10-13, FinVet: A Collaborative Framework of RAG and External Fact-Checking Agents for Financial Misinformation Detection, Daniel Berhane Araya et.al., Paper: http://arxiv.org/abs/2510.11654
- 2025-10-19, FinSight: Towards Real-World Financial Deep Research, Jiajie Jin et.al., Paper: http://arxiv.org/abs/2510.16844
- 2025-10-21, FinAI Data Assistant: LLM-based Financial Database Query Processing with the OpenAI Function Calling API, Juhyeong Kim et.al., Paper: http://arxiv.org/abs/2510.14162
- 2025-10-21, Fetch.ai: An Architecture for Modern Multi-Agent Systems, Michael J. Wooldridge et.al., Paper: http://arxiv.org/abs/2510.18699
- 2025-09-30, Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents, Zhen Yang et.al., Paper: http://arxiv.org/abs/2509.26539
- 2025-09-28, FedAgentBench: Towards Automating Real-world Federated Medical Image Analysis with Server-Client LLM Agents, Pramit Saha et.al., Paper: http://arxiv.org/abs/2509.23803
- 2025-09-04, FaMA: LLM-Empowered Agentic Assistant for Consumer-to-Consumer Marketplace, Yineng Yan et.al., Paper: http://arxiv.org/abs/2509.03890
- 2025-08-24, FLAIRR-TS – Forecasting LLM-Agents with Iterative Refinement and Retrieval for Time Series, Gunjan Jalori et.al., Paper: http://arxiv.org/abs/2508.19279
- 2025-09-12, FHIR-AgentBench: Benchmarking LLM Agents for Realistic Interoperable EHR Question Answering, Gyubok Lee et.al., Paper: http://arxiv.org/abs/2509.19319
- 2025-08-26, FALCON: Autonomous Cyber Threat Intelligence Mining with LLMs for IDS Rule Generation, Shaswata Mitra et.al., Paper: http://arxiv.org/abs/2508.18684
- 2025-10-15, FACTS: Table Summarization via Offline Template Generation with Agentic Workflows, Ye Yuan et.al., Paper: http://arxiv.org/abs/2510.13920
- 2025-10-20, FABRIC: Framework for Agent-Based Realistic Intelligence Creation, Abhigya Verma et.al., Paper: http://arxiv.org/abs/2510.17995
- 2025-10-04, Extracting Conceptual Knowledge to Locate Software Issues, Ying Wang et.al., Paper: http://arxiv.org/abs/2509.21427
- 2025-10-08, Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions, Yixiang Zhang et.al., Paper: http://arxiv.org/abs/2510.07176
- 2025-08-30, Exploring Decision-Making Capabilities of LLM Agents: An Experimental Study on Jump-Jump Game, Juwu Li et.al., Paper: http://arxiv.org/abs/2509.00483
- 2025-10-16, Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents, Rui Wang et.al., Paper: http://arxiv.org/abs/2510.14438
- 2025-10-17, Experience-Driven Exploration for Efficient API-Free AI Agents, Chenwei Tang et.al., Paper: http://arxiv.org/abs/2510.15259
- 2025-10-17, Exemplar-Guided Planing: Enhanced LLM Agent for KGQA, Jingao Xu et.al., Paper: http://arxiv.org/abs/2510.15283
- 2025-10-20, Executable Knowledge Graphs for Replicating AI Research, Yujie Luo et.al., Paper: http://arxiv.org/abs/2510.17795
- 2025-10-17, EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle, Rong Wu et.al., Paper: http://arxiv.org/abs/2510.16079
- 2025-10-13, Evolution in Simulation: AI-Agent School with Dual Memory for High-Fidelity Educational Dynamics, Sheng Jin et.al., Paper: http://arxiv.org/abs/2510.11290
- 2025-10-15, EvoTest: Evolutionary Test-Time Learning for Self-Improving Agentic Systems, Yufei He et.al., Paper: http://arxiv.org/abs/2510.13220
- 2025-10-13, EvoEmo: Towards Evolved Emotional Policies for Adversarial LLM Agents in Multi-Turn Price Negotiation, Yunbo Long et.al., Paper: http://arxiv.org/abs/2509.04310
- 2025-10-27, Evaluation of Vision-LLMs in Surveillance Video, Pascal Benschop et.al., Paper: http://arxiv.org/abs/2510.23190
- 2025-10-14, Evaluating the Quality of Randomness and Entropy in Tasks Supported by Large Language Models, Rabimba Karanjai et.al., Paper: http://arxiv.org/abs/2510.12080
- 2025-08-27, Evaluating Language Model Reasoning about Confidential Information, Dylan Sam et.al., Paper: http://arxiv.org/abs/2508.19980
- 2025-09-19, Evaluating Behavioral Alignment in Conflict Dialogue: A Multi-Dimensional Comparison of LLM Agents and Humans, Deuksin Kwon et.al., Paper: http://arxiv.org/abs/2509.16394
- 2025-09-30, ErrorPrism: Reconstructing Error Propagation Paths in Cloud Service Systems, Junsong Pu et.al., Paper: http://arxiv.org/abs/2509.26463
- 2025-09-24, EpidemIQs: Prompt-to-Paper LLM Agents for Epidemic Modeling and Analysis, Mohammad Hossein Samaei et.al., Paper: http://arxiv.org/abs/2510.00024
- 2025-09-09, EnvX: Agentize Everything with Agentic AI, Linyao Chen et.al., Paper: http://arxiv.org/abs/2509.08088
- 2025-10-20, Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics, Akshara Prabhakar et.al., Paper: http://arxiv.org/abs/2510.17797
- 2025-09-16, Enhancing LLM-Based Social Bot via an Adversarial Learning Framework, Fanqi Kong et.al., Paper: http://arxiv.org/abs/2508.17711
- 2025-08-21, End-to-End Agentic RAG System Training for Traceable Diagnostic Reasoning, Qiaoyu Zheng et.al., Paper: http://arxiv.org/abs/2508.15746
- 2025-08-27, Encouraging Good Processes Without the Need for Good Answers: Reinforcement Learning for LLM Agent Planning, Zhiwei Li et.al., Paper: http://arxiv.org/abs/2508.19598
- 2025-09-11, Enabling Regulatory Multi-Agent Collaboration: Architecture, Challenges, and Solutions, Qinnan Hu et.al., Paper: http://arxiv.org/abs/2509.09215
- 2025-10-20, Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents, Yihong Tang et.al., Paper: http://arxiv.org/abs/2510.17491
- 2025-10-14, Empowering LLM Agents with Geospatial Awareness: Toward Grounded Reasoning for Wildfire Response, Yiheng Chen et.al., Paper: http://arxiv.org/abs/2510.12061
- 2025-09-15, Emotions are Recognized Patterns of Cognitive Activities, Yue Jin et.al., Paper: http://arxiv.org/abs/2509.16232
- 2025-09-17, Emergent Social Dynamics of LLM Agents in the El Farol Bar Problem, Ryosuke Takata et.al., Paper: http://arxiv.org/abs/2509.04537
- 2025-10-23, EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence, Ding Zou et.al., Paper: http://arxiv.org/abs/2510.20578
- 2025-10-14, EmboMatrix: A Scalable Training-Ground for Embodied Decision-Making, Zixing Lei et.al., Paper: http://arxiv.org/abs/2510.12072
- 2025-10-21, EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval, Zebin Yang et.al., Paper: http://arxiv.org/abs/2510.18546
- 2025-09-28, Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation, Pengxiang Li et.al., Paper: http://arxiv.org/abs/2509.23866
- 2025-10-20, Echoes of Human Malice in Agents: Benchmarking LLMs for Multi-Turn Online Harassment Attacks, Trilok Padhi et.al., Paper: http://arxiv.org/abs/2510.14207
- 2025-10-21, Earth AI: Unlocking Geospatial Insights with Foundation Models and Cross-Modal Reasoning, Aaron Bell et.al., Paper: http://arxiv.org/abs/2510.18318
- 2025-10-24, EU-Agent-Bench: Measuring Illegal Behavior of LLM Agents Under EU Law, Ilija Lichkovski et.al., Paper: http://arxiv.org/abs/2510.21524
- 2025-10-14, ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning, Hanyang Chen et.al., Paper: http://arxiv.org/abs/2510.12693
- 2025-09-26, EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning, Wujiang Xu et.al., Paper: http://arxiv.org/abs/2509.22576
- 2025-10-19, EEschematic: Multimodal-LLM Based AI Agent for Schematic Generation of Analog Circuit, Chang Liu et.al., Paper: http://arxiv.org/abs/2510.17002
- 2025-10-07, EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models, Zheyue Tan et.al., Paper: http://arxiv.org/abs/2510.05943
- 2025-10-09, Dynamic Generation of Multi-LLM Agents Communication Topologies with Graph Diffusion Models, Eric Hanchen Jiang et.al., Paper: http://arxiv.org/abs/2510.07799
- 2025-09-30, Dual-Scale World Models for LLM Agents Towards Hard-Exploration Problems, Minsoo Kim et.al., Paper: http://arxiv.org/abs/2509.24116
- 2025-10-11, Don’t Just Fine-tune the Agent, Tune the Environment, Siyuan Lu et.al., Paper: http://arxiv.org/abs/2510.10197
- 2025-10-20, Does Reasoning Help LLM Agents Play Dungeons and Dragons? A Prompt Engineering Experiment, Patricia Delafuente et.al., Paper: http://arxiv.org/abs/2510.18112
- 2025-10-13, DocReward: A Document Reward Model for Structuring and Stylizing, Junpeng Liu et.al., Paper: http://arxiv.org/abs/2510.11391
- 2025-10-24, Doc-Researcher: A Unified System for Multimodal Document Parsing and Deep Research, Kuicai Dong et.al., Paper: http://arxiv.org/abs/2510.21603
- 2025-10-20, Do LLMs Recognize Your Latent Preferences? A Benchmark for Latent Information Discovery in Personalized Interaction, Ioannis Tsaknakis et.al., Paper: http://arxiv.org/abs/2510.17132
- 2025-09-26, Do LLM Agents Know How to Ground, Recover, and Assess? A Benchmark for Epistemic Competence in Information-Seeking Agents, Jiaqi Shao et.al., Paper: http://arxiv.org/abs/2509.22391
- 2025-10-20, Diverse Planning with Simulators via Linear Temporal Logic, Mustafa F. Abdelwahed et.al., Paper: http://arxiv.org/abs/2510.17418
- 2025-09-29, Dive into the Agent Matrix: A Realistic Evaluation of Self-Replication Risk in LLM Agents, Boxuan Zhang et.al., Paper: http://arxiv.org/abs/2509.25302
- 2025-10-26, Distributed Multi-Agent Bandits Over Erdős-Rényi Random Networks, Jingyuan Liu et.al., Paper: http://arxiv.org/abs/2510.22811
- 2025-10-24, DispatchMAS: Fusing taxonomy and artificial intelligence agents for emergency medical services, Xiang Li et.al., Paper: http://arxiv.org/abs/2510.21228
- 2025-09-18, Diagnostics of cognitive failures in multi-agent expert systems using dynamic evaluation protocols and subsequent mutation of the processing context, Andrejs Sorstkins et.al., Paper: http://arxiv.org/abs/2509.15366
- 2025-10-22, DiSRouter: Distributed Self-Routing for LLM Selections, Hang Zheng et.al., Paper: http://arxiv.org/abs/2510.19208
- 2025-10-14, Designing Tools with Control Confidence, Ajith Anil Meera et.al., Paper: http://arxiv.org/abs/2510.12630
- 2025-10-23, Designing Intent Communication for Agent-Human Collaboration, Yi Li et.al., Paper: http://arxiv.org/abs/2510.20409
- 2025-10-13, Demystifying Reinforcement Learning in Agentic Reasoning, Zhaochen Yu et.al., Paper: http://arxiv.org/abs/2510.11701
- 2025-10-14, Deliberate Lab: A Platform for Real-Time Human-AI Social Experiments, Crystal Qian et.al., Paper: http://arxiv.org/abs/2510.13011
- 2025-10-22, Defending Against Prompt Injection with DataFilter, Yizhu Wang et.al., Paper: http://arxiv.org/abs/2510.19207
- 2025-09-02, DeepTRACE: Auditing Deep Research AI Systems for Tracking Reliability Across Citations and Evidence, Pranav Narayanan Venkit et.al., Paper: http://arxiv.org/abs/2509.04499
- 2025-10-24, DeepAgent: A General Reasoning Agent with Scalable Toolsets, Xiaoxi Li et.al., Paper: http://arxiv.org/abs/2510.21618
- 2025-09-02, Deep Research is the New Analytics System: Towards Building the Runtime for AI-Driven Analytics, Matthew Russo et.al., Paper: http://arxiv.org/abs/2509.02751
- 2025-10-20, Decentralized Real-Time Planning for Multi-UAV Cooperative Manipulation via Imitation Learning, Shantnav Agarwal et.al., Paper: http://arxiv.org/abs/2510.17143
- 2025-10-16, Data-driven Calibration Sample Selection and Forecast Combination in Electricity Price Forecasting: An Application of the ARHNN Method, Tomasz Serafin et.al., Paper: http://arxiv.org/abs/2510.15011
- 2025-09-12, Dark Patterns Meet GUI Agents: LLM Agent Susceptibility to Manipulative Interfaces and the Role of Human Oversight, Jingyu Tang et.al., Paper: http://arxiv.org/abs/2509.10723
- 2025-09-06, DRF: LLM-AGENT Dynamic Reputation Filtering Framework, Yuwei Lou et.al., Paper: http://arxiv.org/abs/2509.05764
- 2025-10-13, DMAS-Forge: A Framework for Transparent Deployment of AI Applications as Distributed Systems, Alessandro Cornacchia et.al., Paper: http://arxiv.org/abs/2510.11872
- 2025-10-24, DAO-AI: Evaluating Collective Decision-Making through Agentic AI in Decentralized Governance, Chunghyun Han et.al., Paper: http://arxiv.org/abs/2510.21117
- 2025-10-20, Cybersecurity AI: Evaluating Agentic Cybersecurity in Attack/Defense CTFs, Francesco Balassone et.al., Paper: http://arxiv.org/abs/2510.17521
- 2025-10-28, Cybersecurity AI Benchmark (CAIBench): A Meta-Benchmark for Evaluating Cybersecurity AI Agents, María Sanz-Gómez et.al., Paper: http://arxiv.org/abs/2510.24317
- 2025-08-28, CyberSleuth: Autonomous Blue-Team LLM Agent for Web Attack Forensics, Stefano Fumero et.al., Paper: http://arxiv.org/abs/2508.20643
- 2025-10-08, Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping, Ziyi Wang et.al., Paper: http://arxiv.org/abs/2510.07230
- 2025-09-11, Curriculum-Based Multi-Tier Semantic Exploration via Deep Reinforcement Learning, Abdel Hakim Drid et.al., Paper: http://arxiv.org/abs/2509.09356
- 2025-10-21, Crucible: Quantifying the Potential of Control Algorithms through LLM Agents, Lianchen Jia et.al., Paper: http://arxiv.org/abs/2510.18491
- 2025-10-15, Cortex: Workflow-Aware Resource Pooling and Scheduling for Agentic Serving, Nikos Pagonas et.al., Paper: http://arxiv.org/abs/2510.14126
- 2025-10-24, Context Engineering for AI Agents in Open-Source Software, Seyedmoein Mohsenimofidi et.al., Paper: http://arxiv.org/abs/2510.21413
- 2025-10-05, Constructing coherent spatial memory in LLM agents through graph rectification, Puzhen Zhang et.al., Paper: http://arxiv.org/abs/2510.04195
- 2025-10-07, Constraint-Aware Route Recommendation from Natural Language via Hierarchical LLM Agents, Tao Zhe et.al., Paper: http://arxiv.org/abs/2510.06078
- 2025-10-20, Consistent Zero-Shot Imitation with Contrastive Goal Inference, Kathryn Wantlin et.al., Paper: http://arxiv.org/abs/2510.17059
- 2025-10-20, CompactPrompt: A Unified Pipeline for Prompt Data Compression in LLM Workflows, Joong Ho Choi et.al., Paper: http://arxiv.org/abs/2510.18043
- 2025-08-27, CompLex: Music Theory Lexicon Constructed by Autonomous Agents for Automatic Music Generation, Zhejing Hu et.al., Paper: http://arxiv.org/abs/2508.19603
- 2025-10-22, Communication to Completion: Modeling Collaborative Workflows with Intelligent Multi-Agent Communication, Yiming Lu et.al., Paper: http://arxiv.org/abs/2510.19995
- 2025-10-07, Communication Enables Cooperation in LLM Agents: A Comparison with Curriculum-Based Approaches, Hachem Madmoun et.al., Paper: http://arxiv.org/abs/2510.05748
- 2025-10-09, CommandSans: Securing AI Agents with Surgical Precision Prompt Sanitization, Debeshee Das et.al., Paper: http://arxiv.org/abs/2510.08829
- 2025-10-22, ColorAgent: Building A Robust, Personalized, and Interactive OS Agent, Ning Li et.al., Paper: http://arxiv.org/abs/2510.19386
- 2025-10-13, Collaborative Shadows: Distributed Backdoor Attacks in LLM-Based Multi-Agent Systems, Pengyu Zhu et.al., Paper: http://arxiv.org/abs/2510.11246
- 2025-10-26, Collaborative LLM Agents for C4 Software Architecture Design Automation, Kamil Szczepanik et.al., Paper: http://arxiv.org/abs/2510.22787
- 2025-10-20, Coinvisor: An RL-Enhanced Chatbot Agent for Interactive Cryptocurrency Investment Analysis, Chong Chen et.al., Paper: http://arxiv.org/abs/2510.17235
- 2025-10-15, CodeEvolve: An open source evolutionary coding agent for algorithm discovery and optimization, Henrique Assumpção et.al., Paper: http://arxiv.org/abs/2510.14150
- 2025-10-27, CodeAD: Synthesize Code of Rules for Log-based Anomaly Detection with LLMs, Junjie Huang et.al., Paper: http://arxiv.org/abs/2510.22986
- 2025-10-03, CoDA: Agentic Systems for Collaborative Data Visualization, Zichen Chen et.al., Paper: http://arxiv.org/abs/2510.03194
- 2025-09-26, CoBel-World: Harnessing LLM Reasoning to Build a Collaborative Belief World for Optimizing Embodied Multi-Agent Collaboration, Zhimin Wang et.al., Paper: http://arxiv.org/abs/2509.21981
- 2025-09-17, Co-Investigator AI: The Rise of Agentic AI for Smarter, Trustworthy AML Compliance Narratives, Prathamesh Vasudeo Naik et.al., Paper: http://arxiv.org/abs/2509.08380
- 2025-10-23, Co-Designing Quantum Codes with Transversal Diagonal Gates via Multi-Agent Systems, Xi He et.al., Paper: http://arxiv.org/abs/2510.20728
- 2025-10-05, Closing the Loop: Coordinating Inventory and Recommendation via Deep Reinforcement Learning on Multiple Timescales, Jinyang Jiang et.al., Paper: http://arxiv.org/abs/2510.04272
- 2025-10-15, CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation, Yee Man Choi et.al., Paper: http://arxiv.org/abs/2510.17853
- 2025-10-13, Chronologically Consistent Generative AI, Songrun He et.al., Paper: http://arxiv.org/abs/2510.11677
- 2025-10-18, Check Yourself Before You Wreck Yourself: Selectively Quitting Improves LLM Agent Safety, Vamshi Krishna Bonagiri et.al., Paper: http://arxiv.org/abs/2510.16492
- 2025-09-26, ChatInject: Abusing Chat Templates for Prompt Injection in LLM Agents, Hwan Chang et.al., Paper: http://arxiv.org/abs/2509.22830
- 2025-08-26, CausalMACE: Causality Empowered Multi-Agents in Minecraft Cooperative Tasks, Qi Chai et.al., Paper: http://arxiv.org/abs/2508.18797
- 2025-09-29, Causal Autoencoder-like Generation of Feedback Fuzzy Cognitive Maps with an LLM Agent, Akash Kumar Panda et.al., Paper: http://arxiv.org/abs/2509.25593
- 2025-09-09, CancerGUIDE: Cancer Guideline Understanding via Internal Disagreement Estimation, Alyssa Unell et.al., Paper: http://arxiv.org/abs/2509.07325
- 2025-10-20, Can Transformer Memory Be Corrupted? Investigating Cache-Side Vulnerabilities in Large Language Models, Elias Hossain et.al., Paper: http://arxiv.org/abs/2510.17098
- 2025-10-13, Can Tool-Integrated Reinforcement Learning Generalize Across Diverse Domains?, Zhengyu Chen et.al., Paper: http://arxiv.org/abs/2510.11184
- 2025-10-28, Can LLMs Write Faithfully? An Agent-Based Evaluation of LLM-generated Islamic Content, Abdullah Mushtaq et.al., Paper: http://arxiv.org/abs/2510.24438
- 2025-10-09, CaRT: Teaching LLM Agents to Know When They Know Enough, Grace Liu et.al., Paper: http://arxiv.org/abs/2510.08517
- 2025-09-30, CORTEX: Collaborative LLM Agents for High-Stakes Alert Triage, Bowen Wei et.al., Paper: http://arxiv.org/abs/2510.00311
- 2025-09-25, CORE: Full-Path Evaluation of LLM Agents Beyond Final State, Panagiotis Michelakis et.al., Paper: http://arxiv.org/abs/2509.20998
- 2025-10-09, COMPASS: Enhancing Agent Long-Horizon Reasoning with Evolving Context, Guangya Wan et.al., Paper: http://arxiv.org/abs/2510.08790
- 2025-10-08, COMPASS: A Multi-Turn Benchmark for Tool-Mediated Planning & Preference Optimization, Tian Qin et.al., Paper: http://arxiv.org/abs/2510.07043
- 2025-08-27, CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning, Zeyi Sun et.al., Paper: http://arxiv.org/abs/2508.20096
- 2025-08-29, COCORELI: Cooperative, Compositional Reconstitution & Execution of Language Instructions, Swarnadeep Bhar et.al., Paper: http://arxiv.org/abs/2509.04470
- 2025-10-23, C-NAV: Towards Self-Evolving Continual Object Navigation in Open World, Ming-Ming Yu et.al., Paper: http://arxiv.org/abs/2510.20685
- 2025-10-10, Building a Foundational Guardrail for General Agentic Systems via Synthetic Data, Yue Huang et.al., Paper: http://arxiv.org/abs/2510.09781
- 2025-09-27, BuildBench: Benchmarking LLM Agents on Compiling Real-World Open-Source Software, Zehua Zhang et.al., Paper: http://arxiv.org/abs/2509.25248
- 2025-10-18, BuildArena: A Physics-Aligned Interactive Benchmark of LLMs for Engineering Construction, Tian Xia et.al., Paper: http://arxiv.org/abs/2510.16559
- 2025-10-17, Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation, Ed Li et.al., Paper: http://arxiv.org/abs/2510.15624
- 2025-10-07, BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks, Sagnik Anupam et.al., Paper: http://arxiv.org/abs/2510.02418
- 2025-10-28, BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents, Litu Ou et.al., Paper: http://arxiv.org/abs/2510.23458
- 2025-10-01, Breaking the Code: Security Assessment of AI Code Agents Through Systematic Jailbreaking Attacks, Shoumik Saha et.al., Paper: http://arxiv.org/abs/2510.01359
- 2025-10-20, Breaking and Fixing Defenses Against Control-Flow Hijacking in Multi-Agent Systems, Rishi Jha et.al., Paper: http://arxiv.org/abs/2510.17276
- 2025-10-16, Bounds and asymptotic expansions for the radii of convexity and uniform convexity of normalized Bessel functions, Árpád Baricz et.al., Paper: http://arxiv.org/abs/2510.14323
- 2025-09-24, Blueprint-Bench: Comparing spatial intelligence of LLMs, agents and image models, Lukas Petersson et.al., Paper: http://arxiv.org/abs/2509.25229
- 2025-08-26, Bias-Adjusted LLM Agents for Human-Like Decision-Making via Behavioral Economics, Ayato Kitadai et.al., Paper: http://arxiv.org/abs/2508.18600
- 2025-10-01, Beyond the Strongest LLM: Multi-Turn Multi-Agent Orchestration vs. Single LLMs on Benchmarks, Aaron Xuxiang Tian et.al., Paper: http://arxiv.org/abs/2509.23537
- 2025-10-03, Beyond the Final Answer: Evaluating the Reasoning Trajectories of Tool-Augmented Agents, Wonjoong Kim et.al., Paper: http://arxiv.org/abs/2510.02837
- 2025-10-01, Beyond Single LLMs: Enhanced Code Generation via Multi-Stage Performance-Guided LLM Orchestration, Huashan Chen et.al., Paper: http://arxiv.org/abs/2510.01379
- 2025-10-16, Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning, Xingang Guo et.al., Paper: http://arxiv.org/abs/2510.12712
- 2025-10-22, Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents, Gil Pasternak et.al., Paper: http://arxiv.org/abs/2510.19771
- 2025-10-19, Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI, Jitao Sang et.al., Paper: http://arxiv.org/abs/2510.16720
- 2025-10-06, Beyond Outcome Reward: Decoupling Search and Answering Improves LLM Agents, Yiding Wang et.al., Paper: http://arxiv.org/abs/2510.04695
- 2025-10-06, Beyond Manuals and Tasks: Instance-Level Context Learning for LLM Agents, Kuntai Cai et.al., Paper: http://arxiv.org/abs/2510.02369
- 2025-10-14, Benefits and Limitations of Communication in Multi-Agent Reasoning, Michael Rizvi-Martel et.al., Paper: http://arxiv.org/abs/2510.13903
- 2025-10-23, Balancing Specialization and Centralization: A Multi-Agent Reinforcement Learning Benchmark for Sequential Industrial Control, Tom Maus et.al., Paper: http://arxiv.org/abs/2510.20408
- 2025-10-28, BLM$_1$: A Boundless Large Model for Cross-Space, Cross-Task, and Cross-Embodiment Learning, Wentao Tan et.al., Paper: http://arxiv.org/abs/2510.24161
- 2025-09-08, AxelSMOTE: An Agent-Based Oversampling Algorithm for Imbalanced Classification, Sukumar Kishanthan et.al., Paper: http://arxiv.org/abs/2509.06875
- 2025-10-16, Ax-Prover: A Deep Reasoning Agentic Framework for Theorem Proving in Mathematics and Quantum Physics, Marco Del Tredici et.al., Paper: http://arxiv.org/abs/2510.12787
- 2025-10-06, Autonomy Matters: A Study on Personalization-Privacy Dilemma in LLM Agents, Zhiping Zhang et.al., Paper: http://arxiv.org/abs/2510.04465
- 2025-09-09, Autonomous Code Evolution Meets NP-Completeness, Cunxi Yu et.al., Paper: http://arxiv.org/abs/2509.07367
- 2025-10-10, Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics, Lianhao Zhou et.al., Paper: http://arxiv.org/abs/2510.09901
- 2025-10-01, Automating Data-Driven Modeling and Analysis for Engineering Applications using Large Language Model Agents, Yang Liu et.al., Paper: http://arxiv.org/abs/2510.01398
- 2025-10-09, Automating Android Build Repair: Bridging the Reasoning-Execution Gap in LLM Agents with Domain-Specific Tools, Ha Min Son et.al., Paper: http://arxiv.org/abs/2510.08640
- 2025-10-01, Automatically Generating Web Applications from Requirements Via Multi-Agent Test-Driven Development, Yuxuan Wan et.al., Paper: http://arxiv.org/abs/2509.25297
- 2025-10-28, Automatically Benchmarking LLM Code Agents through Agent-Driven Annotation and Evaluation, Lingyue Fu et.al., Paper: http://arxiv.org/abs/2510.24358
- 2025-10-15, Automated Network Protocol Testing with LLM Agents, Yunze Wei et.al., Paper: http://arxiv.org/abs/2510.13248
- 2025-09-15, Automated Creation and Enrichment Framework for Improved Invocation of Enterprise APIs as Tools, Prerna Agarwal et.al., Paper: http://arxiv.org/abs/2509.11626
- 2025-10-23, Automated Cloud Infrastructure-as-Code Reconciliation with AI Agents, Zhenning Yang et.al., Paper: http://arxiv.org/abs/2510.20211
- 2025-10-27, AutoStreamPipe: LLM Assisted Automatic Generation of Data Stream Processing Pipelines, Abolfazl Younesi et.al., Paper: http://arxiv.org/abs/2510.23408
- 2025-10-09, AutoQual: An LLM Agent for Automated Discovery of Interpretable Features for Review Quality Assessment, Xiaochong Lan et.al., Paper: http://arxiv.org/abs/2510.08081
- 2025-10-07, AutoPentester: An LLM Agent-based Framework for Automated Pentesting, Yasod Ginige et.al., Paper: http://arxiv.org/abs/2510.05605
- 2025-09-10, AutoODD: Agentic Audits via Bayesian Red Teaming in Black-Box Models, Rebecca Martin et.al., Paper: http://arxiv.org/abs/2509.08638
- 2025-10-03, AudioToolAgent: An Agentic Framework for Audio-Language Models, Gijs Wijngaard et.al., Paper: http://arxiv.org/abs/2510.02995
- 2025-10-13, Attacks by Content: Automated Fact-checking is an AI Security Issue, Michael Schlichtkrull et.al., Paper: http://arxiv.org/abs/2510.11238
- 2025-09-09, Astra: A Multi-Agent System for GPU Kernel Performance Optimization, Anjiang Wei et.al., Paper: http://arxiv.org/abs/2509.07506
- 2025-09-22, Asteria: Semantic-Aware Cross-Region Caching for Agentic LLM Tool Access, Chaoyi Ruan et.al., Paper: http://arxiv.org/abs/2509.17360
- 2025-10-24, AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite, Jonathan Bragg et.al., Paper: http://arxiv.org/abs/2510.21652
- 2025-10-22, Are Large Language Models Sensitive to the Motives Behind Communication?, Addison J. Wu et.al., Paper: http://arxiv.org/abs/2510.19687
- 2025-09-04, Are LLM Agents the New RPA? A Comparative Study with RPA Across Enterprise Workflows, Petr Průcha et.al., Paper: http://arxiv.org/abs/2509.04198
- 2025-09-03, Are LLM Agents Behaviorally Coherent? Latent Profiles for Social Simulation, James Mooney et.al., Paper: http://arxiv.org/abs/2509.03736
- 2025-10-27, Are Agents Just Automata? On the Formal Equivalence Between Agentic AI and the Chomsky Hierarchy, Roham Koohestani et.al., Paper: http://arxiv.org/abs/2510.23487
- 2025-09-10, Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations, Ron F. Del Rosario et.al., Paper: http://arxiv.org/abs/2509.08646
- 2025-10-21, Applying voxel-based analysis to oropharyngeal cancer proton therapy patients: a correlation study on radiation-induced acute dysphagia, Qianxia Wang et.al., Paper: http://arxiv.org/abs/2510.18210
- 2025-10-13, Analyzing and Internalizing Complex Policy Documents for LLM Agents, Jiateng Liu et.al., Paper: http://arxiv.org/abs/2510.11588
- 2025-10-15, An LLM-Powered AI Agent Framework for Holistic IoT Traffic Interpretation, Daniel Adu Worae et.al., Paper: http://arxiv.org/abs/2510.13925
- 2025-09-16, An LLM Agentic Approach for Legal-Critical Software: A Case Study for Tax Prep Software, Sina Gogani-Khiabani et.al., Paper: http://arxiv.org/abs/2509.13471
- 2025-10-19, An Agentic Framework with LLMs for Solving Complex Vehicle Routing Problems, Ni Zhang et.al., Paper: http://arxiv.org/abs/2510.16701
- 2025-10-16, AlphaQuanter: An End-to-End Tool-Orchestrated Agentic Reinforcement Learning Framework for Stock Trading, Zheye Deng et.al., Paper: http://arxiv.org/abs/2510.14264
- 2025-10-06, Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails, Siwei Han et.al., Paper: http://arxiv.org/abs/2510.04860
- 2025-09-14, Agentic UAVs: LLM-Driven Autonomy with Integrated Tool-Calling and Cognitive Reasoning, Anis Koubaa et.al., Paper: http://arxiv.org/abs/2509.13352
- 2025-09-28, Agentic Reinforcement Learning with Implicit Step Rewards, Xiaoqian Liu et.al., Paper: http://arxiv.org/abs/2509.19199
- 2025-10-20, Agentic Reinforcement Learning for Search is Unsafe, Yushi Yang et.al., Paper: http://arxiv.org/abs/2510.17431
- 2025-09-24, Agentic Metacognition: Designing a “Self-Aware” Low-Code Agent for Failure Prediction and Human Handoff, Jiexi Xu et.al., Paper: http://arxiv.org/abs/2509.19783
- 2025-09-16, Agentic Lybic: Multi-Agent Execution System with Tiered Reasoning and Orchestration, Liangxuan Guo et.al., Paper: http://arxiv.org/abs/2509.11067
- 2025-09-16, Agentic JWT: A Secure Delegation Protocol for Autonomous AI Agents, Abhishek Goswami et.al., Paper: http://arxiv.org/abs/2509.13597
- 2025-10-19, Agentic Inequality, Matthew Sharp et.al., Paper: http://arxiv.org/abs/2510.16853
- 2025-10-16, Agentic Entropy-Balanced Policy Optimization, Guanting Dong et.al., Paper: http://arxiv.org/abs/2510.14545
- 2025-10-19, Agentic Design of Compositional Machines, Wenqian Zhang et.al., Paper: http://arxiv.org/abs/2510.14980
- 2025-10-17, Agentic AI for Ultra-Modern Networks: Multi-Agent Framework for RAN Autonomy and Assurance, Sukhdeep Singh et.al., Paper: http://arxiv.org/abs/2510.16144
- 2025-09-16, Agentic AI for Financial Crime Compliance, Henrik Axelsen et.al., Paper: http://arxiv.org/abs/2509.13137
- 2025-08-22, AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications, Dawei Gao et.al., Paper: http://arxiv.org/abs/2508.16279
- 2025-10-05, AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework, Hanchen Zhang et.al., Paper: http://arxiv.org/abs/2510.04206
- 2025-09-10, AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning, Zhiheng Xi et.al., Paper: http://arxiv.org/abs/2509.08755
- 2025-09-28, AgentGuard: Runtime Verification of AI Agents, Roham Koohestani et.al., Paper: http://arxiv.org/abs/2509.23864
- 2025-10-28, AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis, Xuanzhong Chen et.al., Paper: http://arxiv.org/abs/2510.24695
- 2025-10-28, AgentFold: Long-Horizon Web Agents with Proactive Context Management, Rui Ye et.al., Paper: http://arxiv.org/abs/2510.24699
- 2025-10-07, AgentDR Dynamic Recommendation with Implicit Item-Item Relations via LLM-based Agents, Mingdai Yang et.al., Paper: http://arxiv.org/abs/2510.05598
- 2025-08-27, AgentCoMa: A Compositional Benchmark Mixing Commonsense and Mathematical Reasoning in Real-World Scenarios, Lisa Alazraki et.al., Paper: http://arxiv.org/abs/2508.19988
- 2025-10-20, AgentChangeBench: A Multi-Dimensional Evaluation Framework for Goal-Shift Robustness in Conversational AI, Manik Rana et.al., Paper: http://arxiv.org/abs/2510.18170
- 2025-10-02, AgentCaster: Reasoning-Guided Tornado Forecasting, Michael Chen et.al., Paper: http://arxiv.org/abs/2510.03349
- 2025-10-23, AgentArcEval: An Architecture Evaluation Method for Foundation Model based Agents, Qinghua Lu et.al., Paper: http://arxiv.org/abs/2510.21031
- 2025-08-24, Agent-Testing Agent: A Meta-Agent for Automated Testing and Evaluation of Conversational AI Agents, Sameer Komoravolu et.al., Paper: http://arxiv.org/abs/2508.17393
- 2025-10-14, Agent-Based Simulation of a Financial Market with Large Language Models, Ryuji Hashimoto et.al., Paper: http://arxiv.org/abs/2510.12189
- 2025-10-28, Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents, Yueqi Song et.al., Paper: http://arxiv.org/abs/2510.24702
- 2025-09-04, AgenTracer: Who Is Inducing Failure in the LLM Agentic Systems?, Guibin Zhang et.al., Paper: http://arxiv.org/abs/2509.03312
- 2025-10-28, Affordance Representation and Recognition for Autonomous Agents, Habtom Kahsay Gidey et.al., Paper: http://arxiv.org/abs/2510.24459
- 2025-10-22, AegisMCP: Online Graph Intrusion Detection for Tool-Augmented LLMs on Edge Devices, Zhonghao Zhan et.al., Paper: http://arxiv.org/abs/2510.19462
- 2025-08-27, Aegis: Taxonomy and Optimizations for Overcoming Agent-Environment Failures in LLM Agents, Kevin Song et.al., Paper: http://arxiv.org/abs/2508.19504
- 2025-10-06, Adversarial Reinforcement Learning for Large Language Model Agent Safety, Zizhao Wang et.al., Paper: http://arxiv.org/abs/2510.05442
- 2025-10-04, Adversarial Agent Collaboration for C to Rust Translation, Tianyu Li et.al., Paper: http://arxiv.org/abs/2510.03879
- 2025-10-15, Addressing the alignment problem in transportation policy making: an LLM approach, Xiaoyu Yan et.al., Paper: http://arxiv.org/abs/2510.13139
- 2025-10-21, Adaptive Coopetition: Leveraging Coarse Verifier Signals for Resilient Multi-Agent LLM Reasoning, Rui Jerry Huang et.al., Paper: http://arxiv.org/abs/2510.18179
- 2025-10-10, Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols, Mikhail Terekhov et.al., Paper: http://arxiv.org/abs/2510.09462
- 2025-10-27, Adapting Interleaved Encoders with PPO for Language-Guided Reinforcement Learning in BabyAI, Aryan Mathur et.al., Paper: http://arxiv.org/abs/2510.23148
- 2025-10-17, AURA: An Agent Autonomy Risk Assessment Framework, Lorenzo Satta Chiris et.al., Paper: http://arxiv.org/abs/2510.15739
- 2025-10-26, ATLAS: Actor-Critic Task-Completion with Look-ahead Action Simulation, Jiali Cheng et.al., Paper: http://arxiv.org/abs/2510.22732
- 2025-10-18, ATA: A Neuro-Symbolic Approach to Implement Autonomous and Trustworthy Agents, David Peer et.al., Paper: http://arxiv.org/abs/2510.16381
- 2025-10-11, ASTREA: Introducing Agentic Intelligence for Orbital Thermal Autonomy, Alejandro D. Mousist et.al., Paper: http://arxiv.org/abs/2509.13380
- 2025-09-22, ARK-V1: An LLM-Agent for Knowledge Graph Question Answering Requiring Commonsense Reasoning, Jan-Felix Klein et.al., Paper: http://arxiv.org/abs/2509.18063
- 2025-09-26, AMANDA: Agentic Medical Knowledge Augmentation for Data-Efficient Medical Visual Question Answering, Ziqing Wang et.al., Paper: http://arxiv.org/abs/2510.02328
- 2025-10-20, ALPINE: A Lightweight and Adaptive Privacy-Decision Agent Framework for Dynamic Edge Crowdsensing, Guanjie Cheng et.al., Paper: http://arxiv.org/abs/2510.17162
- 2025-10-03, ALMAS: an Autonomous LLM-based Multi-Agent Software Engineering Framework, Vali Tawosi et.al., Paper: http://arxiv.org/abs/2510.03463
- 2025-10-11, ALLOY: Generating Reusable Agent Workflows from User Demonstration, Jiawen Li et.al., Paper: http://arxiv.org/abs/2510.10049
- 2025-09-16, AI Agents with Human-Like Collaborative Tools: Adaptive Strategies for Enhanced Problem-Solving, Harper Reed et.al., Paper: http://arxiv.org/abs/2509.13547
- 2025-10-14, AI Agents as Universal Task Solvers, Alessandro Achille et.al., Paper: http://arxiv.org/abs/2510.12066
- 2025-10-01, ACON: Optimizing Context Compression for Long-horizon LLM Agents, Minki Kang et.al., Paper: http://arxiv.org/abs/2510.00615
- 2025-09-29, A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory, Qianshan Wei et.al., Paper: http://arxiv.org/abs/2510.02373
- 2025-10-21, A$^2$FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning, Qianben Chen et.al., Paper: http://arxiv.org/abs/2510.12838
- 2025-10-22, A Tutorial on Cognitive Biases in Agentic AI-Driven 6G Autonomous Networks, Hatim Chergui et.al., Paper: http://arxiv.org/abs/2510.19973
- 2025-10-07, A Survey on Agentic Security: Applications, Threats and Defenses, Asif Shahriar et.al., Paper: http://arxiv.org/abs/2510.06445
- 2025-10-13, A Survey on Agentic Multimodal Large Language Models, Huanjin Yao et.al., Paper: http://arxiv.org/abs/2510.10991
- 2025-10-14, A Survey of Vibe Coding with Large Language Models, Yuyao Ge et.al., Paper: http://arxiv.org/abs/2510.12399
- 2025-08-28, A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers, Ming Hu et.al., Paper: http://arxiv.org/abs/2508.21148
- 2025-10-01, A Practitioner’s Guide to Multi-turn Agentic Reinforcement Learning, Ruiyi Wang et.al., Paper: http://arxiv.org/abs/2510.01132
- 2025-10-01, A Multi-Agent LLM Defense Pipeline Against Prompt Injection Attacks, S M Asif Hossain et.al., Paper: http://arxiv.org/abs/2509.14285
- 2025-10-20, A Mimamsa Inspired Framework For Instruction Sequencing In AI Agents, Bama Srinivasan et.al., Paper: http://arxiv.org/abs/2510.17691
- 2025-10-06, A Lightweight Large Language Model-Based Multi-Agent System for 2D Frame Structural Analysis, Ziheng Geng et.al., Paper: http://arxiv.org/abs/2510.05414
- 2025-10-13, A Large-Language-Model Assisted Automated Scale Bar Detection and Extraction Framework for Scanning Electron Microscopic Images, Yuxuan Chen et.al., Paper: http://arxiv.org/abs/2510.11260
- 2025-09-18, A Knowledge-driven Adaptive Collaboration of LLMs for Enhancing Medical Decision-making, Xiao Wu et.al., Paper: http://arxiv.org/abs/2509.14998
- 2025-10-24, A Knowledge-Graph Translation Layer for Mission-Aware Multi-Agent Path Planning in Spatiotemporal Dynamics, Edward Holmberg et.al., Paper: http://arxiv.org/abs/2510.21695
- 2025-08-26, A Concurrent Modular Agent: Framework for Autonomous LLM Agents, Norihiro Maruyama et.al., Paper: http://arxiv.org/abs/2508.19042
- 2025-10-20, A Brain Cell Type Resource Created by Large Language Models and a Multi-Agent AI System for Collaborative Community Annotation, Rongbin Li et.al., Paper: http://arxiv.org/abs/2510.17064
- 2025-09-15, $ε$-Optimal Multi-Agent Patrol using Recurrent Strategy, Deepak Mallya et.al., Paper: http://arxiv.org/abs/2509.11640
- 2025-10-13, $How^{2}$: How to learn from procedural How-to questions, Gautier Dagan et.al., Paper: http://arxiv.org/abs/2510.11144
- 2025-09-27, “Shall We Dig Deeper?”: Designing and Evaluating Strategies for LLM Agents to Advance Knowledge Co-Construction in Asynchronous Online Discussions, Yuanhao Zhang et.al., Paper: http://arxiv.org/abs/2509.23327
(<a href=#updated-on-20251030>back to top</a>)
Large Language Models
- 2025-10-22, olmOCR 2: Unit Test Rewards for Document OCR, Jake Poznanski et.al., Paper: http://arxiv.org/abs/2510.19817
- 2025-10-23, Zhyper: Factorized Hypernetworks for Conditioned LLM Fine-Tuning, M. H. I. Abdalla et.al., Paper: http://arxiv.org/abs/2510.19733
- 2025-10-21, Zero-Shot Vehicle Model Recognition via Text-Based Retrieval-Augmented Generation, Wei-Chia Chang et.al., Paper: http://arxiv.org/abs/2510.18502
- 2025-10-28, Zero-Shot Cross-Lingual Transfer using Prefix-Based Adaptation, Snegha A et.al., Paper: http://arxiv.org/abs/2510.24619
- 2025-10-24, Wisdom and Delusion of LLM Ensembles for Code Generation and Repair, Fernando Vallecillos Ruiz et.al., Paper: http://arxiv.org/abs/2510.21513
- 2025-10-21, Why Policy Gradient Algorithms Work for Undiscounted Total-Reward MDPs, Jongmin Lee et.al., Paper: http://arxiv.org/abs/2510.18340
- 2025-10-28, WebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling Info-Rich Seeking, Zhengwei Tao et.al., Paper: http://arxiv.org/abs/2510.24697
- 2025-10-23, Video Prediction of Dynamic Physical Simulations With Pixel-Space Spatiotemporal Transformers, Dean L Slack et.al., Paper: http://arxiv.org/abs/2510.20807
- 2025-10-21, Verifiable Accuracy and Abstention Rewards in Curriculum RL to Alleviate Lost-in-Conversation, Ming Li et.al., Paper: http://arxiv.org/abs/2510.18731
- 2025-10-27, VOLD: Reasoning Transfer from LLMs to Vision-Language Models via On-Policy Distillation, Walid Bousselham et.al., Paper: http://arxiv.org/abs/2510.23497
- 2025-10-21, VAR: Visual Attention Reasoning via Structured Search and Backtracking, Wei Cai et.al., Paper: http://arxiv.org/abs/2510.18619
- 2025-10-23, User Perceptions of Privacy and Helpfulness in LLM Responses to Privacy-Sensitive Scenarios, Xiaoyuan Wu et.al., Paper: http://arxiv.org/abs/2510.20721
- 2025-10-21, Unifying and Enhancing Graph Transformers via a Hierarchical Mask Framework, Yujie Xing et.al., Paper: http://arxiv.org/abs/2510.18825
- 2025-10-21, UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation, Yibin Wang et.al., Paper: http://arxiv.org/abs/2510.18701
- 2025-10-21, UWBench: A Comprehensive Vision-Language Benchmark for Underwater Understanding, Da Zhang et.al., Paper: http://arxiv.org/abs/2510.18262
- 2025-10-21, Training Diverse Graph Experts for Ensembles: A Systematic Empirical Study, Gangda Deng et.al., Paper: http://arxiv.org/abs/2510.18370
- 2025-10-21, Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement Learning, Chenghao Zhu et.al., Paper: http://arxiv.org/abs/2510.18849
- 2025-10-21, Topoformer: brain-like topographic organization in Transformer language models through spatial querying and reweighting, Taha Binhuraib et.al., Paper: http://arxiv.org/abs/2510.18745
- 2025-10-22, Top-P Masking for Cross Language Information Retrieval, Joseph Casale et.al., Paper: http://arxiv.org/abs/2510.19758
- 2025-10-22, ToolDreamer: Instilling LLM Reasoning Into Tool Retrievers, Saptarshi Sengupta et.al., Paper: http://arxiv.org/abs/2510.19791
- 2025-10-28, Tongyi DeepResearch Technical Report, Tongyi DeepResearch Team et.al., Paper: http://arxiv.org/abs/2510.24701
- 2025-10-21, Tokencake: A KV-Cache-centric Serving Framework for LLM-based Multi-Agent Applications, Zhuohang Bian et.al., Paper: http://arxiv.org/abs/2510.18586
- 2025-10-21, Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views, Zhangquan Chen et.al., Paper: http://arxiv.org/abs/2510.18632
- 2025-10-27, Think Twice: Branch-and-Rethink Reasoning Reward Model, Yizhu Jiao et.al., Paper: http://arxiv.org/abs/2510.23596
- 2025-10-24, The Universal Landscape of Human Reasoning, Qiguang Chen et.al., Paper: http://arxiv.org/abs/2510.21623
- 2025-10-21, The Trust Paradox in LLM-Based Multi-Agent Systems: When Collaboration Becomes a Security Vulnerability, Zijie Xu et.al., Paper: http://arxiv.org/abs/2510.18563
- 2025-10-22, The Tail Tells All: Estimating Model-Level Membership Inference Vulnerability Without Reference Models, Euodia Dodd et.al., Paper: http://arxiv.org/abs/2510.19773
- 2025-10-21, The Impact of Image Resolution on Biomedical Multimodal Large Language Models, Liangyu Chen et.al., Paper: http://arxiv.org/abs/2510.18304
- 2025-10-22, The Feasibility of Training Sovereign Language Models in the Global South: A Study of Brazil and Mexico, Sandra Malagon et.al., Paper: http://arxiv.org/abs/2510.19801
- 2025-10-21, The Attribution Story of WhisperGate: An Academic Perspective, Oleksandr Adamov et.al., Paper: http://arxiv.org/abs/2510.18484
- 2025-10-22, The Art of Asking: Multilingual Prompt Optimization for Synthetic Data, David Mora et.al., Paper: http://arxiv.org/abs/2510.19806
- 2025-10-21, Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs, Yanhong Li et.al., Paper: http://arxiv.org/abs/2510.18279
- 2025-10-23, Structure-Conditional Minimum Bayes Risk Decoding, Bryan Eikema et.al., Paper: http://arxiv.org/abs/2510.20700
- 2025-10-21, Streamlining Acceptance Test Generation for Mobile Applications Through Large Language Models: An Industrial Case Study, Pedro Luís Fonseca et.al., Paper: http://arxiv.org/abs/2510.18861
- 2025-10-21, StreamingTOM: Streaming Token Compression for Efficient Video Understanding, Xueyi Chen et.al., Paper: http://arxiv.org/abs/2510.18269
- 2025-10-21, StarBench: A Turn-Based RPG Benchmark for Agentic Multimodal Decision-Making and Information Seeking, Haoran Zhang et.al., Paper: http://arxiv.org/abs/2510.18483
- 2025-10-21, Socialized Learning and Emergent Behaviors in Multi-Agent Systems based on Multimodal Large Language Models, Sureyya Akin et.al., Paper: http://arxiv.org/abs/2510.18515
- 2025-10-22, SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration, Xichen Zhang et.al., Paper: http://arxiv.org/abs/2510.19767
- 2025-10-23, Small Drafts, Big Verdict: Information-Intensive Visual Reasoning via Speculation, Yuhan Liu et.al., Paper: http://arxiv.org/abs/2510.20812
- 2025-10-21, Simple and Efficient Heterogeneous Temporal Graph Neural Network, Yili Wang et.al., Paper: http://arxiv.org/abs/2510.18467
- 2025-10-23, Simple Context Compression: Mean-Pooling and Multi-Ratio Training, Yair Feldman et.al., Paper: http://arxiv.org/abs/2510.20797
- 2025-10-21, ShaRE your Data! Characterizing Datasets for LLM-based Requirements Engineering, Quim Motger et.al., Paper: http://arxiv.org/abs/2510.18787
- 2025-10-21, SemiAdapt and SemiLoRA: Efficient Domain Adaptation for Transformer-based Low-Resource Language Translation with a Case Study on Irish, Josh McGiff et.al., Paper: http://arxiv.org/abs/2510.18725
- 2025-10-22, Semantic World Models, Jacob Berg et.al., Paper: http://arxiv.org/abs/2510.19818
- 2025-10-21, SegTune: Structured and Fine-Grained Control for Song Generation, Pengfei Cai et.al., Paper: http://arxiv.org/abs/2510.18416
- 2025-10-21, Seg the HAB: Language-Guided Geospatial Algae Bloom Reasoning and Segmentation, Patterson Hsieh et.al., Paper: http://arxiv.org/abs/2510.18751
- 2025-10-21, See the Text: From Tokenization to Visual Reading, Ling Xing et.al., Paper: http://arxiv.org/abs/2510.18840
- 2025-10-22, Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning, Xichen Zhang et.al., Paper: http://arxiv.org/abs/2510.19807
- 2025-10-28, STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence, Zihan Liu et.al., Paper: http://arxiv.org/abs/2510.24693
- 2025-10-21, SSD: Spatial-Semantic Head Decoupling for Efficient Autoregressive Image Generation, Siyong Jian et.al., Paper: http://arxiv.org/abs/2510.18716
- 2025-10-21, SLICE: SLO-Driven Scheduling for LLM Inference on Edge Computing Devices, Pan Zhou et.al., Paper: http://arxiv.org/abs/2510.18544
- 2025-10-24, SBASH: a Framework for Designing and Evaluating RAG vs. Prompt-Tuned LLM Honeypots, Adetayo Adebimpe et.al., Paper: http://arxiv.org/abs/2510.21459
- 2025-10-28, Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance, Yujie Wei et.al., Paper: http://arxiv.org/abs/2510.24711
- 2025-10-27, RobotArena $\infty$: Scalable Robot Benchmarking via Real-to-Sim Translation, Yash Jangir et.al., Paper: http://arxiv.org/abs/2510.23571
- 2025-10-24, Risk Management for Mitigating Benchmark Failure Modes: BenchRisk, Sean McGregor et.al., Paper: http://arxiv.org/abs/2510.21460
- 2025-10-22, Review of Tools for Zero-Code LLM Based Application Development, Priyaranjan Pattnayak et.al., Paper: http://arxiv.org/abs/2510.19747
- 2025-10-21, Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting, Howard Chen et.al., Paper: http://arxiv.org/abs/2510.18874
- 2025-10-28, ReplicationBench: Can AI Agents Replicate Astrophysics Research Papers?, Christine Ye et.al., Paper: http://arxiv.org/abs/2510.24591
- 2025-10-28, Relative Scaling Laws for LLMs, William Held et.al., Paper: http://arxiv.org/abs/2510.24626
- 2025-10-21, Reasoning Language Model Inference Serving Unveiled: An Empirical Study, Qi Li et.al., Paper: http://arxiv.org/abs/2510.18672
- 2025-10-28, ReForm: Reflective Autoformalization with Prospective Bounded Sequence Optimization, Guoxin Chen et.al., Paper: http://arxiv.org/abs/2510.24592
- 2025-10-27, ReCode: Unify Plan and Action for Universal Granularity Control, Zhaoyang Yu et.al., Paper: http://arxiv.org/abs/2510.23564
- 2025-10-22, RLIE: Rule Generation with Logistic Regression, Iterative Refinement, and Evaluation for Large Language Models, Yang Yang et.al., Paper: http://arxiv.org/abs/2510.19698
- 2025-10-24, RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models, Xueyuan Lin et.al., Paper: http://arxiv.org/abs/2510.21604
- 2025-10-24, REMONI: An Autonomous System Integrating Wearables and Multimodal Large Language Models for Enhanced Remote Health Monitoring, Thanh Cong Ho et.al., Paper: http://arxiv.org/abs/2510.21445
- 2025-10-23, RAGRank: Using PageRank to Counter Poisoning in CTI LLM Pipelines, Austin Jia et.al., Paper: http://arxiv.org/abs/2510.20768
- 2025-10-21, Prompting the Priorities: A First Look at Evaluating LLMs for Vulnerability Triage and Prioritization, Osama Al Haddad et.al., Paper: http://arxiv.org/abs/2510.18508
- 2025-10-21, Probabilistic Modeling of Intentions in Socially Intelligent LLM Agents, Feifan Xia et.al., Paper: http://arxiv.org/abs/2510.18476
- 2025-10-21, Proactive Reasoning-with-Retrieval Framework for Medical Multimodal Large Language Models, Lehan Wang et.al., Paper: http://arxiv.org/abs/2510.18303
- 2025-10-21, Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options, Joongkyu Lee et.al., Paper: http://arxiv.org/abs/2510.18713
- 2025-10-21, Position: LLM Watermarking Should Align Stakeholders’ Incentives for Practical Adoption, Yepeng Liu et.al., Paper: http://arxiv.org/abs/2510.18333
- 2025-10-27, Point Convergence of Nesterov’s Accelerated Gradient Method: An AI-Assisted Proof, Uijeong Jang et.al., Paper: http://arxiv.org/abs/2510.23513
- 2025-10-21, PlanU: Large Language Model Decision Making through Planning under Uncertainty, Ziwei Deng et.al., Paper: http://arxiv.org/abs/2510.18442
- 2025-10-23, Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs, Yanlin Song et.al., Paper: http://arxiv.org/abs/2510.20691
- 2025-10-27, PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity, Yuqian Yuan et.al., Paper: http://arxiv.org/abs/2510.23603
- 2025-10-21, ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive Text-to-Speech Generation, Haowei Lou et.al., Paper: http://arxiv.org/abs/2510.18308
- 2025-10-24, ParaRNN: Unlocking Parallel Training of Nonlinear RNNs for Large Language Models, Federico Danieli et.al., Paper: http://arxiv.org/abs/2510.21450
- 2025-10-28, PRISM-Bench: A Benchmark of Puzzle-Based Visual Tasks with CoT Error Detection, Yusu Qian et.al., Paper: http://arxiv.org/abs/2510.23594
- 2025-10-28, Optimizing Retrieval for RAG via Reinforced Contrastive Learning, Jiawei Zhou et.al., Paper: http://arxiv.org/abs/2510.24652
- 2025-10-29, OpenReward: Learning to Reward Long-form Agentic Tasks via Reinforcement Learning, Ziyou Hu et.al., Paper: http://arxiv.org/abs/2510.24636
- 2025-10-28, Open Korean Historical Corpus: A Millennia-Scale Diachronic Collection of Public Domain Texts, Seyoung Song et.al., Paper: http://arxiv.org/abs/2510.24541
- 2025-10-21, One Size Fits All? A Modular Adaptive Sanitization Kit (MASK) for Customizable Privacy-Preserving Phone Scam Detection, Kangzhong Wang et.al., Paper: http://arxiv.org/abs/2510.18493
- 2025-10-23, On the Detectability of LLM-Generated Text: What Exactly Is LLM-Generated Text?, Mingmeng Geng et.al., Paper: http://arxiv.org/abs/2510.20810
- 2025-10-22, On Controlled Change: Generative AI’s Impact on Professional Authority in Journalism, Tomás Dodds et.al., Paper: http://arxiv.org/abs/2510.19792
- 2025-10-21, Noise-Conditioned Mixture-of-Experts Framework for Robust Speaker Verification, Bin Gu et.al., Paper: http://arxiv.org/abs/2510.18533
- 2025-10-23, Neural Diversity Regularizes Hallucinations in Small Models, Kushal Chakrabarti et.al., Paper: http://arxiv.org/abs/2510.20690
- 2025-10-28, Multi-Agent Evolve: LLM Self-Improve through Co-evolution, Yixing Chen et.al., Paper: http://arxiv.org/abs/2510.23595
- 2025-10-24, MoniTor: Exploiting Large Language Models with Instruction for Online Video Anomaly Detection, Shengtian Yang et.al., Paper: http://arxiv.org/abs/2510.21449
- 2025-10-24, Modest-Align: Data-Efficient Alignment for Vision-Language Models, Jiaxiang Liu et.al., Paper: http://arxiv.org/abs/2510.21606
- 2025-10-23, Mixing Importance with Diversity: Joint Optimization for KV Cache Compression in Large Vision-Language Models, Xuyang Liu et.al., Paper: http://arxiv.org/abs/2510.20707
- 2025-10-27, Minimizing Human Intervention in Online Classification, William Réveillard et.al., Paper: http://arxiv.org/abs/2510.23557
- 2025-10-21, Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents, Guangfu Guo et.al., Paper: http://arxiv.org/abs/2510.18424
- 2025-10-21, MTraining: Distributed Dynamic Sparse Attention for Efficient Ultra-Long Context Training, Wenxuan Li et.al., Paper: http://arxiv.org/abs/2510.18830
- 2025-10-24, MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization, Chenglong Wang et.al., Paper: http://arxiv.org/abs/2510.21473
- 2025-10-21, MLMA: Towards Multilingual with Mamba Based Architectures, Mohamed Nabih Ali et.al., Paper: http://arxiv.org/abs/2510.18684
- 2025-10-21, MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models, ChangSu Choi et.al., Paper: http://arxiv.org/abs/2510.18383
- 2025-10-27, Lightweight Robust Direct Preference Optimization, Cheol Woo Kim et.al., Paper: http://arxiv.org/abs/2510.23590
- 2025-10-21, LightMem: Lightweight and Efficient Memory-Augmented Generation, Jizhan Fang et.al., Paper: http://arxiv.org/abs/2510.18866
- 2025-10-23, Learning to Triage Taint Flows Reported by Dynamic Program Analysis in Node.js Packages, Ronghao Ni et.al., Paper: http://arxiv.org/abs/2510.20739
- 2025-10-27, Learning to Reason Efficiently with Discounted Reinforcement Learning, Alex Ayoub et.al., Paper: http://arxiv.org/abs/2510.23486
- 2025-10-27, Learning the PTM Code through a Coarse-to-Fine, Mechanism-Aware Framework, Jingjie Zhang et.al., Paper: http://arxiv.org/abs/2510.23492
- 2025-10-21, Large language models for folktale type automation based on motifs: Cinderella case study, Tjaša Arčon et.al., Paper: http://arxiv.org/abs/2510.18561
- 2025-10-21, Large Language Models in Thematic Analysis: Prompt Engineering, Evaluation, and Guidelines for Qualitative Software Engineering Research, Cristina Martinez Montes et.al., Paper: http://arxiv.org/abs/2510.18456
- 2025-10-21, LLMs as Sparse Retrievers: A Framework for First-Stage Product Search, Hongru Song et.al., Paper: http://arxiv.org/abs/2510.18527
- 2025-10-21, LAFA: Agentic LLM-Driven Federated Analytics over Decentralized Data Sources, Haichao Ji et.al., Paper: http://arxiv.org/abs/2510.18477
- 2025-10-21, KrishokBondhu: A Retrieval-Augmented Voice-Based Agricultural Advisory Call Center for Bengali Farmers, Mohd Ruhul Ameen et.al., Paper: http://arxiv.org/abs/2510.18355
- 2025-10-21, KoSimpleQA: A Korean Factuality Benchmark with an Analysis of Reasoning LLMs, Donghyeon Ko et.al., Paper: http://arxiv.org/abs/2510.18368
- 2025-10-23, KL-Regularized Reinforcement Learning is Designed to Mode Collapse, Anthony GX-Chen et.al., Paper: http://arxiv.org/abs/2510.20817
- 2025-10-21, KAT-Coder Technical Report, Zizheng Zhan et.al., Paper: http://arxiv.org/abs/2510.18779
- 2025-10-21, JAUNT: Joint Alignment of User Intent and Network State for QoE-centric LLM Tool Routing, Enhan Li et.al., Paper: http://arxiv.org/abs/2510.18550
- 2025-10-22, Integrating Transparent Models, LLMs, and Practitioner-in-the-Loop: A Case of Nonprofit Program Evaluation, Ji Ma et.al., Paper: http://arxiv.org/abs/2510.19799
- 2025-10-21, Integrating Large Language Models and Evaluating Student Outcomes in an Introductory Computer Science Course, Annapurna Vadaparty et.al., Paper: http://arxiv.org/abs/2510.18806
- 2025-10-21, InspectCoder: Dynamic Analysis-Enabled Self Repair through interactive LLM-Debugger Collaboration, Yunkun Wang et.al., Paper: http://arxiv.org/abs/2510.18327
- 2025-10-21, ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization, Yuanhe Guo et.al., Paper: http://arxiv.org/abs/2510.18433
- 2025-10-21, Illusions of reflection: open-ended task reveals systematic failures in Large Language Models’ reflective reasoning, Sion Weatherhead et.al., Paper: http://arxiv.org/abs/2510.18254
- 2025-10-21, Identity-Aware Large Language Models require Cultural Reasoning, Alistair Plum et.al., Paper: http://arxiv.org/abs/2510.18510
- 2025-10-27, ISA-Bench: Benchmarking Instruction Sensitivity for Large Audio Language Models, Bohan Li et.al., Paper: http://arxiv.org/abs/2510.23558
- 2025-10-27, IPQA: A Benchmark for Core Intent Identification in Personalized Question Answering, Jieyong Kim et.al., Paper: http://arxiv.org/abs/2510.23536
- 2025-10-21, IMB: An Italian Medical Benchmark for Question Answering, Antonio Romano et.al., Paper: http://arxiv.org/abs/2510.18468
- 2025-10-21, IF-VidCap: Can Video Caption Models Follow Instructions?, Shihao Li et.al., Paper: http://arxiv.org/abs/2510.18726
- 2025-10-24, Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine, Wenyi Wang et.al., Paper: http://arxiv.org/abs/2510.21614
- 2025-10-22, Hubble: a Model Suite to Advance the Study of LLM Memorization, Johnny Tian-Zheng Wei et.al., Paper: http://arxiv.org/abs/2510.19811
- 2025-10-21, How Efficient Are Diffusion Language Models? A Critical Examination of Efficiency Evaluation Practices, Han Peng et.al., Paper: http://arxiv.org/abs/2510.18480
- 2025-10-21, How Do LLMs Use Their Depth?, Akshat Gupta et.al., Paper: http://arxiv.org/abs/2510.18871
- 2025-10-24, Head Pursuit: Probing Attention Specialization in Multimodal Transformers, Lorenzo Basile et.al., Paper: http://arxiv.org/abs/2510.21518
- 2025-10-21, HarmNet: A Framework for Adaptive Multi-Turn Jailbreak Attacks on Large Language Models, Sidhant Narula et.al., Paper: http://arxiv.org/abs/2510.18728
- 2025-10-21, Hardness of Learning Regular Languages in the Next Symbol Prediction Setting, Satwik Bhattamishra et.al., Paper: http://arxiv.org/abs/2510.18634
- 2025-10-21, Grounding or Guessing? Visual Signals for Detecting Hallucinations in Sign Language Translation, Yasser Hamidullah et.al., Paper: http://arxiv.org/abs/2510.18439
- 2025-10-28, Greedy Sampling Is Provably Efficient for RLHF, Di Wu et.al., Paper: http://arxiv.org/abs/2510.24700
- 2025-10-22, Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs, Haochen Wang et.al., Paper: http://arxiv.org/abs/2510.18876
- 2025-10-21, Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming, Zheng Zhang et.al., Paper: http://arxiv.org/abs/2510.18314
- 2025-10-23, Generative Reasoning Recommendation via LLMs, Minjie Hong et.al., Paper: http://arxiv.org/abs/2510.20815
- 2025-10-28, Generative AI for Healthcare: Fundamentals, Challenges, and Perspectives, Gang Chen et.al., Paper: http://arxiv.org/abs/2510.24551
- 2025-10-21, GPTFace: Generative Pre-training of Facial-Linguistic Transformer by Span Masking and Weakly Correlated Text-image Data, Yudong Li et.al., Paper: http://arxiv.org/abs/2510.18345
- 2025-10-28, FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling, Zengzhuang Xu et.al., Paper: http://arxiv.org/abs/2510.24645
- 2025-10-21, From Retrieval to Generation: Unifying External and Parametric Knowledge for Medical Question Answering, Lei Li et.al., Paper: http://arxiv.org/abs/2510.18297
- 2025-10-21, From Quarter to All: Accelerating Speculative LLM Decoding via Floating-Point Exponent Remapping and Parameter Sharing, Yushu Zhao et.al., Paper: http://arxiv.org/abs/2510.18525
- 2025-10-24, From Polyester Girlfriends to Blind Mice: Creating the First Pragmatics Understanding Benchmarks for Slovene, Mojca Brglez et.al., Paper: http://arxiv.org/abs/2510.21575
- 2025-10-22, Forbidden Sidon subsets of perfect difference sets, featuring a human-assisted proof, Boris Alexeev et.al., Paper: http://arxiv.org/abs/2510.19804
- 2025-10-21, Fine-Tuned Thoughts: Leveraging Chain-of-Thought Reasoning for Industrial Asset Health Monitoring, Shuxin Lin et.al., Paper: http://arxiv.org/abs/2510.18817
- 2025-10-21, FedDEAP: Adaptive Dual-Prompt Tuning for Multi-Domain Federated Learning, Yubin Zheng et.al., Paper: http://arxiv.org/abs/2510.18837
- 2025-10-21, FeClustRE: Hierarchical Clustering and Semantic Tagging of App Features from User Reviews, Max Tiessler et.al., Paper: http://arxiv.org/abs/2510.18799
- 2025-10-23, Fast Inference via Hierarchical Speculative Decoding, Clara Mohri et.al., Paper: http://arxiv.org/abs/2510.19705
- 2025-10-27, FARMER: Flow AutoRegressive Transformer over Pixels, Guangting Zheng et.al., Paper: http://arxiv.org/abs/2510.23588
- 2025-10-21, Exploring a Unified Vision-Centric Contrastive Alternatives on Multi-Modal Web Documents, Yiqi Lin et.al., Paper: http://arxiv.org/abs/2510.18703
- 2025-10-21, Exploring Membership Inference Vulnerabilities in Clinical Large Language Models, Alexander Nemecek et.al., Paper: http://arxiv.org/abs/2510.18674
- 2025-10-23, Exploring Large Language Models for Access Control Policy Synthesis and Summarization, Adarsh Vatsa et.al., Paper: http://arxiv.org/abs/2510.20692
- 2025-10-28, Evolving Diagnostic Agents in a Virtual Clinical Environment, Pengcheng Qiu et.al., Paper: http://arxiv.org/abs/2510.24654
- 2025-10-21, Evaluating Large Language Models in detecting Secrets in Android Apps, Marco Alecci et.al., Paper: http://arxiv.org/abs/2510.18601
- 2025-10-21, Evaluating LLM-Based Mobile App Recommendations: An Empirical Study, Quim Motger et.al., Paper: http://arxiv.org/abs/2510.18364
- 2025-10-21, Enhancing Hotel Recommendations with AI: LLM-Based Review Summarization and Query-Driven Insights, Nikolaos Belibasakis et.al., Paper: http://arxiv.org/abs/2510.18277
- 2025-10-21, Engagement Undermines Safety: How Stereotypes and Toxicity Shape Humor in Language Models, Atharvan Dogra et.al., Paper: http://arxiv.org/abs/2510.18454
- 2025-10-23, Empathic Prompting: Non-Verbal Context Integration for Multimodal LLM Conversations, Lorenzo Stacchio et.al., Paper: http://arxiv.org/abs/2510.20743
- 2025-10-27, Emotion-Coherent Reasoning for Multimodal LLMs via Emotional Rationale Verifier, Hyeongseop Rha et.al., Paper: http://arxiv.org/abs/2510.23506
- 2025-10-27, EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT, Baoqi Pei et.al., Paper: http://arxiv.org/abs/2510.23569
- 2025-10-21, EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval, Zebin Yang et.al., Paper: http://arxiv.org/abs/2510.18546
- 2025-10-21, EffiReasonTrans: RL-Optimized Reasoning for Code Translation, Yanlin Wang et.al., Paper: http://arxiv.org/abs/2510.18863
- 2025-10-24, EU-Agent-Bench: Measuring Illegal Behavior of LLM Agents Under EU Law, Ilija Lichkovski et.al., Paper: http://arxiv.org/abs/2510.21524
- 2025-10-21, ECG-LLM – training and evaluation of domain-specific large language models for electrocardiography, Lara Ahrens et.al., Paper: http://arxiv.org/abs/2510.18339
- 2025-10-28, Dissecting Role Cognition in Medical LLMs via Neuronal Ablation, Xun Liang et.al., Paper: http://arxiv.org/abs/2510.24677
- 2025-10-28, Diffusion LLM with Native Variable Generation Lengths: Let [EOS] Lead the Way, Yicun Yang et.al., Paper: http://arxiv.org/abs/2510.24605
- 2025-10-23, Diagnosing Visual Reasoning: Challenges, Insights, and a Path Forward, Jing Bi et.al., Paper: http://arxiv.org/abs/2510.20696
- 2025-10-21, DelvePO: Direction-Guided Self-Evolving Framework for Flexible Prompt Optimization, Tao Tao et.al., Paper: http://arxiv.org/abs/2510.18257
- 2025-10-21, DeepTx: Real-Time Transaction Risk Analysis via Multi-Modal Features and LLM Reasoning, Yixuan Liu et.al., Paper: http://arxiv.org/abs/2510.18438
- 2025-10-27, Deductive Chain-of-Thought Augmented Socially-aware Robot Navigation World Model, Weizheng Wang et.al., Paper: http://arxiv.org/abs/2510.23509
- 2025-10-21, DSI-Bench: A Benchmark for Dynamic Spatial Intelligence, Ziang Zhang et.al., Paper: http://arxiv.org/abs/2510.18873
- 2025-10-21, DART: A Structured Dataset of Regulatory Drug Documents in Italian for Clinical NLP, Mariano Barone et.al., Paper: http://arxiv.org/abs/2510.18475
- 2025-10-21, CovMatch: Cross-Covariance Guided Multimodal Dataset Distillation with Trainable Text Encoder, Yongmin Lee et.al., Paper: http://arxiv.org/abs/2510.18583
- 2025-10-21, Counterfactual Reasoning for Steerable Pluralistic Value Alignment of Large Language Models, Hanze Guo et.al., Paper: http://arxiv.org/abs/2510.18526
- 2025-10-28, ComboBench: Can LLMs Manipulate Physical Devices to Play Virtual Reality Games?, Shuqing Li et.al., Paper: http://arxiv.org/abs/2510.24706
- 2025-10-21, Combining Distantly Supervised Models with In Context Learning for Monolingual and Cross-Lingual Relation Extraction, Vipul Rathore et.al., Paper: http://arxiv.org/abs/2510.18344
- 2025-10-24, ColorEcosystem: Powering Personalized, Standardized, and Trustworthy Agentic Service in massive-agent Ecosystem, Fangwen Wu et.al., Paper: http://arxiv.org/abs/2510.21566
- 2025-10-21, CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment, Xue Jiang et.al., Paper: http://arxiv.org/abs/2510.18471
- 2025-10-22, Class-Aware Prototype Learning with Negative Contrast for Test-Time Adaptation of Vision-Language Models, Xiaozhen Qiao et.al., Paper: http://arxiv.org/abs/2510.19802
- 2025-10-21, CircuitSeer: Mining High-Quality Data by Probing Mathematical Reasoning Circuits in LLMs, Shaobo Wang et.al., Paper: http://arxiv.org/abs/2510.18470
- 2025-10-21, Chain-of-Conceptual-Thought: Eliciting the Agent to Deeply Think within the Response, Qingqing Gu et.al., Paper: http://arxiv.org/abs/2510.18434
- 2025-10-21, CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using Agent, Haojia Lin et.al., Paper: http://arxiv.org/abs/2510.18596
- 2025-10-21, CLASP: Cost-Optimized LLM-based Agentic System for Phishing Detection, Fouad Trad et.al., Paper: http://arxiv.org/abs/2510.18585
- 2025-10-21, CEFR-Annotated WordNet: LLM-Based Proficiency-Guided Semantic Database for Language Learning, Masato Kikuchi et.al., Paper: http://arxiv.org/abs/2510.18466
- 2025-10-21, Building Trust in Clinical LLMs: Bias Analysis and Dataset Transparency, Svetlana Maslenkova et.al., Paper: http://arxiv.org/abs/2510.18556
- 2025-10-24, Brain-tuning Improves Generalizability and Efficiency of Brain Alignment in Speech Models, Omer Moussa et.al., Paper: http://arxiv.org/abs/2510.21520
- 2025-10-21, BrailleLLM: Braille Instruction Tuning with Large Language Models for Braille Domain Tasks, Tianyuan Huang et.al., Paper: http://arxiv.org/abs/2510.18288
- 2025-10-22, Blackbox Model Provenance via Palimpsestic Membership Inference, Rohith Kuditipudi et.al., Paper: http://arxiv.org/abs/2510.19796
- 2025-10-21, Beyond Single Models: Mitigating Multimodal Hallucinations via Adaptive Token Ensemble Decoding, Jinlin Li et.al., Paper: http://arxiv.org/abs/2510.18321
- 2025-10-23, Bayesian Jammer Localization with a Hybrid CNN and Path-Loss Mixture of Experts, Mariona Jaramillo-Civill et.al., Paper: http://arxiv.org/abs/2510.20666
- 2025-10-21, Automated urban waterlogging assessment and early warning through a mixture of foundation models, Chenxu Zhang et.al., Paper: http://arxiv.org/abs/2510.18425
- 2025-10-23, Automated Extraction of Fluoropyrimidine Treatment and Treatment-Related Toxicities from Clinical Notes Using Natural Language Processing, Xizhi Wu et.al., Paper: http://arxiv.org/abs/2510.20727
- 2025-10-24, Are the LLMs Capable of Maintaining at Least the Language Genus?, Sandra Mitrović et.al., Paper: http://arxiv.org/abs/2510.21561
- 2025-10-21, An Encoder-Decoder Foundation Chemical Language Model for Generative Polymer Design, Harikrishna Sahu et.al., Paper: http://arxiv.org/abs/2510.18860
- 2025-10-27, Alita-G: Self-Evolving Generative Agent for Agent Generation, Jiahao Qiu et.al., Paper: http://arxiv.org/abs/2510.23601
- 2025-10-28, AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis, Xuanzhong Chen et.al., Paper: http://arxiv.org/abs/2510.24695
- 2025-10-28, Advancing site-specific disease and pest management in precision agriculture: From reasoning-driven foundation models to adaptive, feedback-based learning, Nitin Rai et.al., Paper: http://arxiv.org/abs/2510.24650
- 2025-10-21, Adamas: Hadamard Sparse Attention for Efficient Long-Context Inference, Siyuan Yan et.al., Paper: http://arxiv.org/abs/2510.18413
- 2025-10-22, AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders, Yuezhou Hu et.al., Paper: http://arxiv.org/abs/2510.19779
- 2025-10-24, Actionable Cybersecurity Notifications for Smart Homes: A User Study on the Role of Length and Complexity, Victor Jüttner et.al., Paper: http://arxiv.org/abs/2510.21508
- 2025-10-23, ARGenSeg: Image Segmentation with Autoregressive Image Generation Model, Xiaolong Wang et.al., Paper: http://arxiv.org/abs/2510.20803
- 2025-10-23, A Use-Case Specific Dataset for Measuring Dimensions of Responsible Performance in LLM-generated Text, Alicia Sagae et.al., Paper: http://arxiv.org/abs/2510.20782
- 2025-10-27, A Survey of Data Agents: Emerging Paradigm or Overstated Hype?, Yizhang Zhu et.al., Paper: http://arxiv.org/abs/2510.23587
- 2025-10-24, A Multimodal Benchmark for Framing of Oil & Gas Advertising and Potential Greenwashing Detection, Gaku Morio et.al., Paper: http://arxiv.org/abs/2510.21679
- 2025-10-24, A Data-Centric Approach to Multilingual E-Commerce Product Search: Case Study on Query-Category and Query-Item Relevance, Yabo Yin et.al., Paper: http://arxiv.org/abs/2510.21671
(<a href=#updated-on-20251030>back to top</a>)
Reinforcement Learning
- 2025-10-22, olmOCR 2: Unit Test Rewards for Document OCR, Jake Poznanski et.al., Paper: http://arxiv.org/abs/2510.19817
- 2025-10-21, Why Policy Gradient Algorithms Work for Undiscounted Total-Reward MDPs, Jongmin Lee et.al., Paper: http://arxiv.org/abs/2510.18340
- 2025-10-20, When 5G NTN Meets GNSS: Tracking GNSS Signals under Overlaid 5G Waveforms, Idir Edjekouane et.al., Paper: http://arxiv.org/abs/2510.17324
- 2025-10-21, WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection, Guanzhong He et.al., Paper: http://arxiv.org/abs/2510.18798
- 2025-10-27, VideoTG-R1: Boosting Video Temporal Grounding via Curriculum Reinforcement Learning on Reflected Boundary Annotations, Lu Dong et.al., Paper: http://arxiv.org/abs/2510.23397
- 2025-10-27, Video-Thinker: Sparking “Thinking with Videos” via Reinforcement Learning, Shijian Wang et.al., Paper: http://arxiv.org/abs/2510.23473
- 2025-10-21, Verifiable Accuracy and Abstention Rewards in Curriculum RL to Alleviate Lost-in-Conversation, Ming Li et.al., Paper: http://arxiv.org/abs/2510.18731
- 2025-10-27, Variational Thermal State Preparation on Digital Quantum Processors Assisted by Matrix Product States, Rui-Hao Li et.al., Paper: http://arxiv.org/abs/2510.23546
- 2025-10-28, VOLD: Reasoning Transfer from LLMs to Vision-Language Models via On-Policy Distillation, Walid Bousselham et.al., Paper: http://arxiv.org/abs/2510.23497
- 2025-10-22, Using Non-Expert Data to Robustify Imitation Learning via Offline Reinforcement Learning, Kevin Huang et.al., Paper: http://arxiv.org/abs/2510.19495
- 2025-10-22, Universal Quantitative Abstraction: Categorical Duality and Logical Completeness for Probabilistic Systems, Nivar Anwer et.al., Paper: http://arxiv.org/abs/2510.19444
- 2025-10-24, Unified token representations for sequential decision models, Zhuojing Tian et.al., Paper: http://arxiv.org/abs/2510.21448
- 2025-10-20, UniRL-Zero: Reinforcement Learning on Unified Models with Joint Language Model and Diffusion Model Experts, Fu-Yun Wang et.al., Paper: http://arxiv.org/abs/2510.17937
- 2025-10-21, Uncovering critical temperature dependence in Heusler magnets via explicit machine learning, Jean-Baptiste Morée et.al., Paper: http://arxiv.org/abs/2510.18469
- 2025-10-20, UltraCUA: A Foundation Model for Computer Use Agents with Hybrid Action, Yuhao Yang et.al., Paper: http://arxiv.org/abs/2510.17790
- 2025-10-21, Two-loop QCD corrections for real and off-shell diphoton and triphoton production via quark loops, Dario Kermanschah et.al., Paper: http://arxiv.org/abs/2510.18801
- 2025-10-20, Train for Truth, Keep the Skills: Binary Retrieval-Augmented Reward Mitigates Hallucinations, Tong Chen et.al., Paper: http://arxiv.org/abs/2510.17733
- 2025-10-20, Trading with the Devil: Risk and Return in Foundation Model Strategies, Jinrui Zhang et.al., Paper: http://arxiv.org/abs/2510.17165
- 2025-10-27, Towards Stochastic (N-1)-Secure Redispatch, Oleksii Molodchyk et.al., Paper: http://arxiv.org/abs/2510.23551
- 2025-10-28, Towards Quadrupedal Jumping and Walking for Dynamic Locomotion using Reinforcement Learning, Jørgen Anker Olsen et.al., Paper: http://arxiv.org/abs/2510.24584
- 2025-10-20, Towards Optimal Control and Algorithmic Structure of Decompression Schedules, Benjamin Marsh et.al., Paper: http://arxiv.org/abs/2510.17551
- 2025-10-21, Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement Learning, Chenghao Zhu et.al., Paper: http://arxiv.org/abs/2510.18849
- 2025-10-20, Toward Autonomous Neural VMC: An Energy-Variance Convergence Criterion for Quantum Systems, Huan-Chen Shi et.al., Paper: http://arxiv.org/abs/2510.17490
- 2025-10-24, Three-nucleon lepton-number-violating potentials in chiral EFT and their matrix elements in light nuclei, Graham Chambers-Wall et.al., Paper: http://arxiv.org/abs/2510.21564
- 2025-10-27, Think Twice: Branch-and-Rethink Reasoning Reward Model, Yizhu Jiao et.al., Paper: http://arxiv.org/abs/2510.23596
- 2025-10-24, The population of Galactic young massive star clusters in the TeV range, Rowan Batzofin et.al., Paper: http://arxiv.org/abs/2510.21480
- 2025-10-21, The implications of inflation for the last ACT, Zhi-Chong Qiu et.al., Paper: http://arxiv.org/abs/2510.18320
- 2025-10-23, The Shape of Reasoning: Topological Analysis of Reasoning Traces in Large Language Models, Xue Wen Tan et.al., Paper: http://arxiv.org/abs/2510.20665
- 2025-10-21, The Picard-Lagrange Framework for Higher-Order Langevin Monte Carlo, Jaideep Mahajan et.al., Paper: http://arxiv.org/abs/2510.18242
- 2025-10-20, The Marked Edge Walk: A Novel MCMC Algorithm for Sampling of Graph Partitions, Atticus McWhorter et.al., Paper: http://arxiv.org/abs/2510.17714
- 2025-10-22, The Confusing Instance Principle for Online Linear Quadratic Control, Waris Radji et.al., Paper: http://arxiv.org/abs/2510.19531
- 2025-10-27, The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation, Farid Bagirov et.al., Paper: http://arxiv.org/abs/2510.23393
- 2025-10-20, TabR1: Taming GRPO for tabular reasoning LLMs, Pengxiang Cai et.al., Paper: http://arxiv.org/abs/2510.17385
- 2025-10-24, System-Theoretic Analysis of Dynamic Generalized Nash Equilibrium Problems – Turnpikes and Dissipativity, Sophie Hall et.al., Paper: http://arxiv.org/abs/2510.21556
- 2025-10-24, Surrogate-based quantification of policy uncertainty in generative flow networks, Ramón Nartallo-Kaluarachchi et.al., Paper: http://arxiv.org/abs/2510.21523
- 2025-10-20, SoftMimic: Learning Compliant Whole-body Control from Examples, Gabriel B. Margolis et.al., Paper: http://arxiv.org/abs/2510.17792
- 2025-10-21, Socialized Learning and Emergent Behaviors in Multi-Agent Systems based on Multimodal Large Language Models, Sureyya Akin et.al., Paper: http://arxiv.org/abs/2510.18515
- 2025-10-22, SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration, Xichen Zhang et.al., Paper: http://arxiv.org/abs/2510.19767
- 2025-10-21, Sherlock Your Queries: Learning to Ask the Right Questions for Dialogue-Based Retrieval, Dong Yun et.al., Paper: http://arxiv.org/abs/2510.18659
- 2025-10-27, Sequential Multi-Agent Dynamic Algorithm Configuration, Chen Lu et.al., Paper: http://arxiv.org/abs/2510.23535
- 2025-10-22, Semi-Implicit Approaches for Large-Scale Bayesian Spatial Interpolation, Sébastien Garneau et.al., Paper: http://arxiv.org/abs/2510.19722
- 2025-10-21, Search Self-play: Pushing the Frontier of Agent Capability without Supervision, Hongliang Lu et.al., Paper: http://arxiv.org/abs/2510.18821
- 2025-10-22, Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning, Xichen Zhang et.al., Paper: http://arxiv.org/abs/2510.19807
- 2025-10-28, Sample-efficient and Scalable Exploration in Continuous-Time RL, Klemens Iten et.al., Paper: http://arxiv.org/abs/2510.24482
- 2025-10-21, Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation, Daniel Bethell et.al., Paper: http://arxiv.org/abs/2510.18478
- 2025-10-28, SPICE: Self-Play In Corpus Environments Improves Reasoning, Bo Liu et.al., Paper: http://arxiv.org/abs/2510.24684
- 2025-10-28, SPARTA: Evaluating Reasoning Segmentation Robustness through Black-Box Adversarial Paraphrasing in Text Autoencoder Latent Space, Viktoriia Zinkovich et.al., Paper: http://arxiv.org/abs/2510.24446
- 2025-10-20, SPACeR: Self-Play Anchoring with Centralized Reference Models, Wei-Jer Chang et.al., Paper: http://arxiv.org/abs/2510.18060
- 2025-10-28, SGFusion: Stochastic Geographic Gradient Fusion in Federated Learning, Khoa Nguyen et.al., Paper: http://arxiv.org/abs/2510.23455
- 2025-10-22, SEA: Semantic Map Prediction for Active Exploration of Uncertain Areas, Hongyu Ding et.al., Paper: http://arxiv.org/abs/2510.19766
- 2025-10-20, Rewarding the Journey, Not Just the Destination: A Composite Path and Answer Self-Scoring Reward Mechanism for Test-Time Reinforcement Learning, Chenwei Tang et.al., Paper: http://arxiv.org/abs/2510.17923
- 2025-10-20, Rethinking On-policy Optimization for Query Augmentation, Zhichao Xu et.al., Paper: http://arxiv.org/abs/2510.17139
- 2025-10-21, Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting, Howard Chen et.al., Paper: http://arxiv.org/abs/2510.18874
- 2025-10-21, Reinforcement Learning with Imperfect Transition Predictions: A Bellman-Jensen Approach, Chenbei Lu et.al., Paper: http://arxiv.org/abs/2510.18687
- 2025-10-23, Reinforcement Learning and Consumption-Savings Behavior, Brandon Kaplowitz et.al., Paper: http://arxiv.org/abs/2510.20748
- 2025-10-24, Reduced Floating-Point Precision Implicit Monte Carlo, Simon Butson et.al., Paper: http://arxiv.org/abs/2510.21683
- 2025-10-22, Reasoning Like Experts: Leveraging Multimodal Large Language Models for Drawing-based Psychoanalysis, Xueqi Ma et.al., Paper: http://arxiv.org/abs/2510.19451
- 2025-10-24, Real-Time Gait Adaptation for Quadrupeds using Model Predictive Control and Reinforcement Learning, Prakrut Kotecha et.al., Paper: http://arxiv.org/abs/2510.20706
- 2025-10-21, Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback, Yi-Lun Wu et.al., Paper: http://arxiv.org/abs/2510.18353
- 2025-10-20, RL-Driven Security-Aware Resource Allocation Framework for UAV-Assisted O-RAN, Zaineh Abughazzah et.al., Paper: http://arxiv.org/abs/2510.18084
- 2025-10-24, RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models, Xueyuan Lin et.al., Paper: http://arxiv.org/abs/2510.21604
- 2025-10-20, RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation, Yuquan Xue et.al., Paper: http://arxiv.org/abs/2510.17640
- 2025-10-20, R2L: Reliable Reinforcement Learning: Guaranteed Return & Reliable Policies in Reinforcement Learning, Nadir Farhi et.al., Paper: http://arxiv.org/abs/2510.18074
- 2025-10-20, R2BC: Multi-Agent Imitation Learning from Single-Agent Demonstrations, Connor Mattson et.al., Paper: http://arxiv.org/abs/2510.18085
- 2025-10-20, QueST: Incentivizing LLMs to Generate Difficult Problems, Hanxu Hu et.al., Paper: http://arxiv.org/abs/2510.17715
- 2025-10-22, Quantum Monte Carlo study of low-dimensional Fermi fluids of dipolar atoms, Clio Johnson et.al., Paper: http://arxiv.org/abs/2510.19533
- 2025-10-22, Quantum Machine Learning methods for Fourier-based distribution estimation with application in option pricing, Fernando Alonso et.al., Paper: http://arxiv.org/abs/2510.19494
- 2025-10-20, Provably Optimal Reinforcement Learning under Safety Filtering, Donggeon David Oh et.al., Paper: http://arxiv.org/abs/2510.18082
- 2025-10-29, Prospects for a 95 GeV Higgs Boson at Future Higgs Factories with Transformer Networks, Yabo Dong et.al., Paper: http://arxiv.org/abs/2510.24662
- 2025-10-21, Proactive Reasoning-with-Retrieval Framework for Medical Multimodal Large Language Models, Lehan Wang et.al., Paper: http://arxiv.org/abs/2510.18303
- 2025-10-21, Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options, Joongkyu Lee et.al., Paper: http://arxiv.org/abs/2510.18713
- 2025-10-24, Predicted observational effects of rapid rotation for Be stars, Rina G. Rast et.al., Paper: http://arxiv.org/abs/2510.21640
- 2025-10-22, Practical algorithm for simulating thermal pure quantum states, Wei-Bo He et.al., Paper: http://arxiv.org/abs/2510.19504
- 2025-10-20, Plasma Shape Control via Zero-shot Generative Reinforcement Learning, Niannian Wu et.al., Paper: http://arxiv.org/abs/2510.17531
- 2025-10-21, PlanU: Large Language Model Decision Making through Planning under Uncertainty, Ziwei Deng et.al., Paper: http://arxiv.org/abs/2510.18442
- 2025-10-23, Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs, Yanlin Song et.al., Paper: http://arxiv.org/abs/2510.20691
- 2025-10-22, Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing, Yusu Qian et.al., Paper: http://arxiv.org/abs/2510.19808
- 2025-10-28, Pair Approximation Meets Reality: Diffusion of Innovation in Organizational Networks within the biased-independence q-Voter Model, Angelika Abramiuk-Szurlej et.al., Paper: http://arxiv.org/abs/2510.24447
- 2025-10-21, PGTT: Phase-Guided Terrain Traversal for Perceptive Legged Locomotion, Alexandros Ntagkas et.al., Paper: http://arxiv.org/abs/2510.18348
- 2025-10-21, PCMS: Parallel Coupler For Multimodel Simulations, Jacob S. Merson et.al., Paper: http://arxiv.org/abs/2510.18838
- 2025-10-20, Oxidation State Dynamics and Emerging Patterns in Magnetite, Emre Gürsoy et.al., Paper: http://arxiv.org/abs/2510.18061
- 2025-10-22, Optimizing the Unknown: Black Box Bayesian Optimization with Energy-Based Model and Reinforcement Learning, Ruiyao Miao et.al., Paper: http://arxiv.org/abs/2510.19530
- 2025-10-20, Optimizing Energy Management of Smart Grid using Reinforcement Learning aided by Surrogate models built using Physics-informed Neural Networks, Julen Cestero et.al., Paper: http://arxiv.org/abs/2510.17380
- 2025-10-29, OpenReward: Learning to Reward Long-form Agentic Tasks via Reinforcement Learning, Ziyou Hu et.al., Paper: http://arxiv.org/abs/2510.24636
- 2025-10-23, Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence, Jiahao Meng et.al., Paper: http://arxiv.org/abs/2510.20579
- 2025-10-21, Online SFT for LLM Reasoning: Surprising Effectiveness of Self-Tuning without Rewards, Mengqi Li et.al., Paper: http://arxiv.org/abs/2510.18814
- 2025-10-20, OncoReason: Structuring Clinical Reasoning in LLMs for Robust and Interpretable Survival Prediction, Raghu Vamshi Hemadri et.al., Paper: http://arxiv.org/abs/2510.17532
- 2025-10-23, On Multiple Robustness of Proximal Dynamic Treatment Regimes, Yuanshan Gao et.al., Paper: http://arxiv.org/abs/2510.20451
- 2025-10-21, On AI Verification in Open RAN, Rahul Soundrarajan et.al., Paper: http://arxiv.org/abs/2510.18417
- 2025-10-27, Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences, Zhuoran Jin et.al., Paper: http://arxiv.org/abs/2510.23451
- 2025-10-20, OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning, Zhenyu Bi et.al., Paper: http://arxiv.org/abs/2510.18032
- 2025-10-23, No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes, Jasmine Bayrooti et.al., Paper: http://arxiv.org/abs/2510.20725
- 2025-10-21, Nash Policy Gradient: A Policy Gradient Method with Iteratively Refined Regularization for Finding Nash Equilibria, Eason Yu et.al., Paper: http://arxiv.org/abs/2510.18183
- 2025-10-21, NTKMTL: Mitigating Task Imbalance in Multi-Task Learning from Neural Tangent Kernel Perspective, Xiaohan Qin et.al., Paper: http://arxiv.org/abs/2510.18258
- 2025-10-20, Multimodal Safety Is Asymmetric: Cross-Modal Exploits Unlock Black-Box MLLMs Jailbreaks, Xinkai Wang et.al., Paper: http://arxiv.org/abs/2510.17277
- 2025-10-24, Multilevel Picard scheme for solving high-dimensional drift control problems with state constraints, Yuan Zhong et.al., Paper: http://arxiv.org/abs/2510.21607
- 2025-10-28, Multi-Agent Evolve: LLM Self-Improve through Co-evolution, Yixing Chen et.al., Paper: http://arxiv.org/abs/2510.23595
- 2025-10-22, Monte Carlo study of the $O(2)$-invariant $\phi^4$ theory with a cubic perturbation in three dimensions, Martin Hasenbusch et.al., Paper: http://arxiv.org/abs/2510.19473
- 2025-10-23, Monte Carlo Sampling for Wave Functions Requiring (Anti)Symmetrization, Koyena Bose et.al., Paper: http://arxiv.org/abs/2510.20577
- 2025-10-21, MoMaGen: Generating Demonstrations under Soft and Hard Constraints for Multi-Step Bimanual Mobile Manipulation, Chengshu Li et.al., Paper: http://arxiv.org/abs/2510.18316
- 2025-10-28, MiniOneRec: An Open-Source Framework for Scaling Generative Recommendation, Xiaoyu Kong et.al., Paper: http://arxiv.org/abs/2510.24431
- 2025-10-27, MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding, Xin Jin et.al., Paper: http://arxiv.org/abs/2510.23479
- 2025-10-22, Memo: Training Memory-Efficient Embodied Agents with Reinforcement Learning, Gunshi Gupta et.al., Paper: http://arxiv.org/abs/2510.19732
- 2025-10-22, MedReason-R1: Learning to Reason for CT Diagnosis with Reinforcement Learning and Local Zoom, Yifan Li et.al., Paper: http://arxiv.org/abs/2510.19626
- 2025-10-21, Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents, Guangfu Guo et.al., Paper: http://arxiv.org/abs/2510.18424
- 2025-10-24, Mechanistic Interpretability for Neural TSP Solvers, Reuben Narad et.al., Paper: http://arxiv.org/abs/2510.21693
- 2025-10-23, Measuring cosmic dipole with the GRB luminosity-time relation, Jessica Santiago et.al., Paper: http://arxiv.org/abs/2510.20705
- 2025-10-20, Measuring Reasoning in LLMs: a New Dialectical Angle, Soheil Abbasloo et.al., Paper: http://arxiv.org/abs/2510.18134
- 2025-10-24, MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization, Chenglong Wang et.al., Paper: http://arxiv.org/abs/2510.21473
- 2025-10-21, MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models, ChangSu Choi et.al., Paper: http://arxiv.org/abs/2510.18383
- 2025-10-21, MADR: MPC-guided Adversarial DeepReach, Ryan Teoh et.al., Paper: http://arxiv.org/abs/2510.18845
- 2025-10-21, Lyapunov-Aware Quantum-Inspired Reinforcement Learning for Continuous-Time Vehicle Control: A Feasibility Study, Nutkritta Kraipatthanapong et.al., Paper: http://arxiv.org/abs/2510.18852
- 2025-10-28, Low-lying baryon resonances from lattice QCD, Colin Morningstar et.al., Paper: http://arxiv.org/abs/2510.24596
- 2025-10-20, Local Coherence or Global Validity? Investigating RLVR Traces in Math Domains, Soumya Rani Samineni et.al., Paper: http://arxiv.org/abs/2510.18176
- 2025-10-20, Leveraging Group Relative Policy Optimization to Advance Large Language Models in Traditional Chinese Medicine, Jiacheng Xie et.al., Paper: http://arxiv.org/abs/2510.17402
- 2025-10-27, Learning to Reason Efficiently with Discounted Reinforcement Learning, Alex Ayoub et.al., Paper: http://arxiv.org/abs/2510.23486
- 2025-10-21, Learning to Navigate Under Imperfect Perception: Conformalised Segmentation for Safe Reinforcement Learning, Daniel Bethell et.al., Paper: http://arxiv.org/abs/2510.18485
- 2025-10-28, Learning to Drive Safely with Hybrid Options, Bram De Cooman et.al., Paper: http://arxiv.org/abs/2510.24674
- 2025-10-20, Learning to Design Soft Hands using Reward Models, Xueqian Bai et.al., Paper: http://arxiv.org/abs/2510.17086
- 2025-10-22, Learning Upper Lower Value Envelopes to Shape Online RL: A Principled Approach, Sebastian Reboul et.al., Paper: http://arxiv.org/abs/2510.19528
- 2025-10-20, LLMs Encode How Difficult Problems Are, William Lugoloobi et.al., Paper: http://arxiv.org/abs/2510.18147
- 2025-10-23, KL-Regularized Reinforcement Learning is Designed to Mode Collapse, Anthony GX-Chen et.al., Paper: http://arxiv.org/abs/2510.20817
- 2025-10-20, Inference of Deterministic Finite Automata via Q-Learning, Elaheh Hosseinkhani et.al., Paper: http://arxiv.org/abs/2510.17386
- 2025-10-21, Improved thermonuclear rate of $^{42}$Ti($p$,$\gamma$)$^{43}$V and its astrophysical implication in rp-process, S. Q. Hou et.al., Paper: http://arxiv.org/abs/2510.18531
- 2025-10-20, Humanoid Goalkeeper: Learning from Position Conditioned Task-Motion Constraints, Junli Ren et.al., Paper: http://arxiv.org/abs/2510.18002
- 2025-10-28, How Flat is a Plateau? Evolution of Late-Time TDE Disks, Yael Alush et.al., Paper: http://arxiv.org/abs/2510.24696
- 2025-10-21, Higher Embedding Dimension Creates a Stronger World Model for a Simple Sorting Task, Brady Bhalla et.al., Paper: http://arxiv.org/abs/2510.18315
- 2025-10-27, Ground-state phase diagram of S = 1/2 Heisenberg model on 2D square-hexagon-octagon lattice, Yumeng Luo et.al., Paper: http://arxiv.org/abs/2510.23376
- 2025-10-28, Greedy Sampling Is Provably Efficient for RLHF, Di Wu et.al., Paper: http://arxiv.org/abs/2510.24700
- 2025-10-24, Goal-based portfolio selection with fixed transaction costs, Erhan Bayraktar et.al., Paper: http://arxiv.org/abs/2510.21650
- 2025-10-23, GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning, Jinchang Luo et.al., Paper: http://arxiv.org/abs/2510.20548
- 2025-10-23, GSWorld: Closed-Loop Photo-Realistic Simulation Suite for Robotic Manipulation, Guangqi Jiang et.al., Paper: http://arxiv.org/abs/2510.20813
- 2025-10-20, GACO-CAD: Geometry-Augmented and Conciseness-Optimized CAD Model Generation from Single Image, Yinghui Wang et.al., Paper: http://arxiv.org/abs/2510.17157
- 2025-10-20, Functional Distribution Networks (FDN), Omer Haq et.al., Paper: http://arxiv.org/abs/2510.17794
- 2025-10-20, From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models, Zefan Cai et.al., Paper: http://arxiv.org/abs/2510.17247
- 2025-10-21, From Competition to Synergy: Unlocking Reinforcement Learning for Subject-Driven Image Generation, Ziwei Huang et.al., Paper: http://arxiv.org/abs/2510.18263
- 2025-10-20, Foundational Automatic Evaluators: Scaling Multi-Task Generative Evaluator Training for Reasoning-Centric Domains, Austin Xu et.al., Paper: http://arxiv.org/abs/2510.17793
- 2025-10-21, Food4All: A Multi-Agent Framework for Real-time Free Food Discovery with Integrated Nutritional Metadata, Zhengqing Yuan et.al., Paper: http://arxiv.org/abs/2510.18289
- 2025-10-20, Finite-Time Bounds for Average-Reward Fitted Q-Iteration, Jongmin Lee et.al., Paper: http://arxiv.org/abs/2510.17391
- 2025-10-21, Fingerprints of cluster-based Haldane and bound-magnon states in a spin-1 Heisenberg diamond chain, Azam Zoshki et.al., Paper: http://arxiv.org/abs/2510.18447
- 2025-10-20, Fine-tuning Flow Matching Generative Models with Intermediate Feedback, Jiajun Fan et.al., Paper: http://arxiv.org/abs/2510.18072
- 2025-10-28, Fill in the Blanks: Accelerating Q-Learning with a Handful of Demonstrations in Sparse Reward Settings, Seyed Mahdi Basiri Azad et.al., Paper: http://arxiv.org/abs/2510.24432
- 2025-10-28, Fast Bayesian Multilevel Quasi-Monte Carlo, Aleksei G. Sorokin et.al., Paper: http://arxiv.org/abs/2510.24604
- 2025-10-28, Fare: Failure Resilience in Learned Visual Navigation Control, Zishuo Wang et.al., Paper: http://arxiv.org/abs/2510.24680
- 2025-10-28, Evolving Diagnostic Agents in a Virtual Clinical Environment, Pengcheng Qiu et.al., Paper: http://arxiv.org/abs/2510.24654
- 2025-10-20, EvoSyn: Generalizable Evolutionary Data Synthesis for Verifiable Learning, He Du et.al., Paper: http://arxiv.org/abs/2510.17928
- 2025-10-21, Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model, Ling Team et.al., Paper: http://arxiv.org/abs/2510.18855
- 2025-10-20, Estimating Orbital Parameters of Direct Imaging Exoplanet Using Neural Network, Bo Liang et.al., Paper: http://arxiv.org/abs/2510.17459
- 2025-10-24, Enhancing Tactile-based Reinforcement Learning for Robotic Control, Elle Miller et.al., Paper: http://arxiv.org/abs/2510.21609
- 2025-10-23, EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence, Ding Zou et.al., Paper: http://arxiv.org/abs/2510.20578
- 2025-10-24, Electroweak corrections to $gg\rightarrow \gamma\gamma$, Gabriele Fiore et.al., Paper: http://arxiv.org/abs/2510.21643
- 2025-10-21, Efficient Model-Based Reinforcement Learning for Robot Control via Online Learning, Fang Nan et.al., Paper: http://arxiv.org/abs/2510.18518
- 2025-10-20, Efficient Algorithms for Mitigating Uncertainty and Risk in Reinforcement Learning, Xihong Su et.al., Paper: http://arxiv.org/abs/2510.17690
- 2025-10-21, EffiReasonTrans: RL-Optimized Reasoning for Code Translation, Yanlin Wang et.al., Paper: http://arxiv.org/abs/2510.18863
- 2025-10-28, Dual-Mind World Models: A General Framework for Learning in Dynamic Wireless Networks, Lingyi Wang et.al., Paper: http://arxiv.org/abs/2510.24546
- 2025-10-23, Downsizing Diffusion Models for Cardinality Estimation, Xinhe Mu et.al., Paper: http://arxiv.org/abs/2510.20681
- 2025-10-23, Detection of ultra-high-energy cosmic rays in the southern hemisphere with FAST: data acquisition and preliminary results, Jakub Kmec et.al., Paper: http://arxiv.org/abs/2510.20522
- 2025-10-22, Demonstrating Real Advantage of Machine-Learning-Enhanced Monte Carlo for Combinatorial Optimization, Luca Maria Del Bono et.al., Paper: http://arxiv.org/abs/2510.19544
- 2025-10-24, DeepAgent: A General Reasoning Agent with Scalable Toolsets, Xiaoxi Li et.al., Paper: http://arxiv.org/abs/2510.21618
- 2025-10-21, Deep Q-Learning Assisted Bandwidth Reservation for Multi-Operator Time-Sensitive Vehicular Networking, Abdullah Al-Khatib et.al., Paper: http://arxiv.org/abs/2510.18553
- 2025-10-20, Deep Neural Network extraction of Unpolarized Transverse Momentum Distributions, I. P. Fernando et.al., Paper: http://arxiv.org/abs/2510.17243
- 2025-10-20, Decentralized Real-Time Planning for Multi-UAV Cooperative Manipulation via Imitation Learning, Shantnav Agarwal et.al., Paper: http://arxiv.org/abs/2510.17143
- 2025-10-21, DeLoad: Demand-Driven Short-Video Preloading with Scalable Watch-Time Estimation, Tong Liu et.al., Paper: http://arxiv.org/abs/2510.18459
- 2025-10-24, DEEDEE: Fast and Scalable Out-of-Distribution Dynamics Detection, Tala Aljaafari et.al., Paper: http://arxiv.org/abs/2510.21638
- 2025-10-23, DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning, Runpeng Xie et.al., Paper: http://arxiv.org/abs/2510.19562
- 2025-10-20, D2C-HRHR: Discrete Actions with Double Distributional Critics for High-Risk-High-Return Tasks, Jundong Zhang et.al., Paper: http://arxiv.org/abs/2510.17212
- 2025-10-20, CrossGuard: Safeguarding MLLMs against Joint-Modal Implicit Malicious Attacks, Xu Zhang et.al., Paper: http://arxiv.org/abs/2510.17687
- 2025-10-24, Cost Minimization for Space-Air-Ground Integrated Multi-Access Edge Computing Systems, Weihong Qin et.al., Paper: http://arxiv.org/abs/2510.21541
- 2025-10-27, Cosmic magnification on multi-catalogue Herschel submillimetre galaxies, R. Fernandez-Fernandez et.al., Paper: http://arxiv.org/abs/2510.23582
- 2025-10-20, Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control, Chengxiu Hua et.al., Paper: http://arxiv.org/abs/2510.17122
- 2025-10-23, Consumption-Investment Problem in Rank-Based Models, David Itkin et.al., Paper: http://arxiv.org/abs/2510.20763
- 2025-10-24, Constraints on ultra-heavy dark matter from the CDEX-10 experiment at the China Jinping Underground Laboratory, Y. F. Wang et.al., Paper: http://arxiv.org/abs/2510.21458
- 2025-10-20, Consistent Zero-Shot Imitation with Contrastive Goal Inference, Kathryn Wantlin et.al., Paper: http://arxiv.org/abs/2510.17059
- 2025-10-23, Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence, Kun Ouyang et.al., Paper: http://arxiv.org/abs/2510.20470
- 2025-10-21, Computational Foundations for Strategic Coopetition: Formalizing Interdependence and Complementarity, Vik Pant et.al., Paper: http://arxiv.org/abs/2510.18802
- 2025-10-20, Colour coherence in small collision systems, Isobel Kolbé et.al., Paper: http://arxiv.org/abs/2510.17570
- 2025-10-20, Collider Searches for Near-Continuum Dark Matter, Steven Ferrante et.al., Paper: http://arxiv.org/abs/2510.17989
- 2025-10-20, Coinvisor: An RL-Enhanced Chatbot Agent for Interactive Cryptocurrency Investment Analysis, Chong Chen et.al., Paper: http://arxiv.org/abs/2510.17235
- 2025-10-21, CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment, Xue Jiang et.al., Paper: http://arxiv.org/abs/2510.18471
- 2025-10-28, Cluster Dose Prediction in Carbon Ion Therapy: Using Transfer Learning from a Pretrained Dose Prediction U-Net, Miriam Schwarze et.al., Paper: http://arxiv.org/abs/2510.24703
- 2025-10-21, Chemistry, Climate, and Transmission Spectra of TRAPPIST-1 e Explored with a Multimodel Sparse Sampled Ensemble, Eric T. Wolf et.al., Paper: http://arxiv.org/abs/2510.18704
- 2025-10-20, Characterizing expansivity through $C^*$-algebras, S. Bautista et.al., Paper: http://arxiv.org/abs/2510.17255
- 2025-10-20, Certified Self-Consistency: Statistical Guarantees and Test-Time Training for Reliable Reasoning in LLMs, Paula Cordero-Encinar et.al., Paper: http://arxiv.org/abs/2510.17472
- 2025-10-24, Causality Meets Locality: Provably Generalizable and Scalable Policy Learning for Networked Systems, Hao Liang et.al., Paper: http://arxiv.org/abs/2510.21427
- 2025-10-27, Causal Deep Q Network, Elouanes Khelifi et.al., Paper: http://arxiv.org/abs/2510.23424
- 2025-10-21, CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using Agent, Haojia Lin et.al., Paper: http://arxiv.org/abs/2510.18596
- 2025-10-20, CLAWS:Creativity detection for LLM-generated solutions using Attention Window of Sections, Keuntae Kim et.al., Paper: http://arxiv.org/abs/2510.17921
- 2025-10-21, Beware of the running $n_s$ when producing heavy primordial black holes, Sasha Allegrini et.al., Paper: http://arxiv.org/abs/2510.18791
- 2025-10-20, B-Meson Anomalies: Effective Field Theory Meets Machine Learning, Alejandro Mir et.al., Paper: http://arxiv.org/abs/2510.17742
- 2025-10-20, Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling, Lipeng Xie et.al., Paper: http://arxiv.org/abs/2510.17314
- 2025-10-27, Approximately optimal distributed controls for high-dimensional stochastic systems with pairwise interaction through controls, Elise Devey et.al., Paper: http://arxiv.org/abs/2510.23537
- 2025-10-21, Analysis note: measurement of thrust and track energy-energy correlator in $e^+e^-$ collisions at 91.2 GeV with DELPHI open data, Jingyu Zhang et.al., Paper: http://arxiv.org/abs/2510.18762
- 2025-10-21, An integrated neural wavefunction solver for spinful Fermi systems, Alexander Avdoshkin et.al., Paper: http://arxiv.org/abs/2510.18621
- 2025-10-27, An Information-Theoretic Analysis of Out-of-Distribution Generalization in Meta-Learning with Applications to Meta-RL, Xingtu Liu et.al., Paper: http://arxiv.org/abs/2510.23448
- 2025-10-20, An Exact Quantile-Energy Equality for Terminal Halfspaces in Linear-Gaussian Control with a Discrete-Time Companion, KL/Schrodinger Links, and High-Precision Validation, Sandro Andric et.al., Paper: http://arxiv.org/abs/2510.17945
- 2025-10-20, An Empirical Study of Lagrangian Methods in Safe Reinforcement Learning, Lindsay Spoor et.al., Paper: http://arxiv.org/abs/2510.17564
- 2025-10-20, Agentic Reinforcement Learning for Search is Unsafe, Yushi Yang et.al., Paper: http://arxiv.org/abs/2510.17431
- 2025-10-28, Advancing site-specific disease and pest management in precision agriculture: From reasoning-driven foundation models to adaptive, feedback-based learning, Nitin Rai et.al., Paper: http://arxiv.org/abs/2510.24650
- 2025-10-28, Adaptive Surrogate Gradients for Sequential Reinforcement Learning in Spiking Neural Networks, Korneel Van den Berghe et.al., Paper: http://arxiv.org/abs/2510.24461
- 2025-10-27, Adaptive Multilevel Splitting: First Application to Rare-Event Derivative Pricing, Riccardo Gozzo et.al., Paper: http://arxiv.org/abs/2510.23461
- 2025-10-20, Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models, Jiajun Fan et.al., Paper: http://arxiv.org/abs/2510.18053
- 2025-10-23, AdaDoS: Adaptive DoS Attack via Deep Adversarial Reinforcement Learning in SDN, Wei Shao et.al., Paper: http://arxiv.org/abs/2510.20566
- 2025-10-21, Actor-Free Continuous Control via Structurally Maximizable Q-Functions, Yigit Korkmaz et.al., Paper: http://arxiv.org/abs/2510.18828
- 2025-10-20, Accelerating Bayesian Inference via Multi-Fidelity Transport Map Coupling, Sanjan C. Muchandimath et.al., Paper: http://arxiv.org/abs/2510.17946
- 2025-10-20, ALPINE: A Lightweight and Adaptive Privacy-Decision Agent Framework for Dynamic Edge Crowdsensing, Guanjie Cheng et.al., Paper: http://arxiv.org/abs/2510.17162
- 2025-10-24, A Unified Model for Multi-Task Drone Routing in Post-Disaster Road Assessment, Huatian Gong et.al., Paper: http://arxiv.org/abs/2510.21525
- 2025-10-23, A Unified Framework for Zero-Shot Reinforcement Learning, Jacopo Di Ventura et.al., Paper: http://arxiv.org/abs/2510.20542
- 2025-10-27, A Sequential Planning Framework for the Operational Reality of Interacting Air Traffic Flow Regulations and Traffic Flow Programs, Thinh Hoang et.al., Paper: http://arxiv.org/abs/2510.23402
- 2025-10-20, A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning, Anjie Liu et.al., Paper: http://arxiv.org/abs/2510.17697
- 2025-10-23, A Microphysical Probe of Neutron Star Interiors: Constraining the Equation of State with Glitch Dynamics, Zhonghao Tu et.al., Paper: http://arxiv.org/abs/2510.20791
(<a href=#updated-on-20251030>back to top</a>)
Notes:
- We have modified the sorting rule of the above list to prioritize papers by the time of their latest update rather than their initial publication date. If an article has been recently modified, it will appear earlier in the list.
Function added: