Agent Research Papers
Agent Research Papers
Updated on 2026.04.03 Current Search Keywords:
Agent,Multi-Agent,Tool Learning,Agent RL,Autonomous Agent,LLM Agent
If you have any other keywords, please feel free to let us know :)
Table of Contents
- <a href=#agent>Agent</a>
- <a href=#large-language-models>Large Language Models</a>
- <a href=#reinforcement-learning>Reinforcement Learning</a>
Agent
- 2026-01-01, μACP: A Formal Calculus for Expressive, Resource-Constrained Agent Communication, Arnab Mallick et.al., Paper: http://arxiv.org/abs/2601.00219
- 2025-10-13, video-SALMONN S: Streaming Audio-Visual LLMs Beyond Length Limits via Memory, Guangzhi Sun et.al., Paper: http://arxiv.org/abs/2510.11129
- 2025-08-28, rStar2-Agent: Agentic Reasoning Technical Report, Ning Shang et.al., Paper: http://arxiv.org/abs/2508.20722
- 2025-12-09, rSIM: Incentivizing Reasoning Capabilities of LLMs via Reinforced Strategy Injection, Sijia Chen et.al., Paper: http://arxiv.org/abs/2512.08300
- 2025-11-13, nuPlan-R: A Closed-Loop Planning Benchmark for Autonomous Driving via Reactive Multi-Agent Simulation, Mingxing Peng et.al., Paper: http://arxiv.org/abs/2511.10403
- 2025-10-22, gem5 Co-Pilot: AI Assistant Agent for Architectural Design Space Exploration, Zuoming Fu et.al., Paper: http://arxiv.org/abs/2510.19577
- 2026-01-26, daVinci-Dev: Agent-native Mid-training for Software Engineering, Ji Zeng et.al., Paper: http://arxiv.org/abs/2601.18418
- 2025-12-18, cuPilot: A Strategy-Coordinated Multi-agent Framework for CUDA Kernel Evolution, Jinwu Chen et.al., Paper: http://arxiv.org/abs/2512.16465
- 2026-01-15, coTherapist: A Behavior-Aligned Small Language Model to Support Mental Healthcare Experts, Prottay Kumar Adhikary et.al., Paper: http://arxiv.org/abs/2601.10246
- 2026-03-07, aCAPTCHA: Verifying That an Entity Is a Capable Agent via Asymmetric Hardness, Zuyao Xu et.al., Paper: http://arxiv.org/abs/2603.07116
- 2026-02-12, Zooming without Zooming: Region-to-Image Distillation for Fine-Grained Multimodal Perception, Lai Wei et.al., Paper: http://arxiv.org/abs/2602.11858
- 2026-02-17, Zombie Agents: Persistent Control of Self-Evolving LLM Agents via Self-Reinforcing Injections, Xianglin Yang et.al., Paper: http://arxiv.org/abs/2602.15654
- 2026-01-20, Zero-shot adaptable task planning for autonomous construction robots: a comparative study of lightweight single and multi-AI agent systems, Hossein Naderi et.al., Paper: http://arxiv.org/abs/2601.14091
- 2025-12-11, Zero-shot 3D Map Generation with LLM Agents: A Dual-Agent Architecture for Procedural Content Generation, Lim Chien Her et.al., Paper: http://arxiv.org/abs/2512.10501
- 2026-02-06, Zero-Trust Runtime Verification for Agentic Payment Protocols: Mitigating Replay and Context-Binding Failures in AP2, Qianlong Lan et.al., Paper: http://arxiv.org/abs/2602.06345
- 2025-10-12, Zero-Shot Large Language Model Agents for Fully Automated Radiotherapy Treatment Planning, Dongrong Yang et.al., Paper: http://arxiv.org/abs/2510.11754
- 2026-03-19, ZEBRAARENA: A Diagnostic Simulation Environment for Studying Reasoning-Action Coupling in Tool-Augmented LLMs, Wanjia Zhao et.al., Paper: http://arxiv.org/abs/2603.18614
- 2026-01-27, Yunque DeepResearch Technical Report, Yuxuan Cai et.al., Paper: http://arxiv.org/abs/2601.19578
- 2025-12-31, Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models, Junru Lu et.al., Paper: http://arxiv.org/abs/2512.24618
- 2025-12-31, Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization, Yuchen Shi et.al., Paper: http://arxiv.org/abs/2512.24615
- 2025-09-30, Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents, Shuai Shao et.al., Paper: http://arxiv.org/abs/2509.26354
- 2026-03-12, You Told Me to Do It: Measuring Instructional Text-induced Private Data Leakage in LLM Agents, Ching-Yu Kao et.al., Paper: http://arxiv.org/abs/2603.11862
- 2026-04-01, Yet Even Less Is Even Better For Agentic, Reasoning, and Coding LLMs, Yang Ye et.al., Paper: http://arxiv.org/abs/2604.00824
- 2025-11-20, YOWO: You Only Walk Once to Jointly Map An Indoor Scene and Register Ceiling-mounted Cameras, Fan Yang et.al., Paper: http://arxiv.org/abs/2511.16521
- 2026-03-12, XSkill: Continual Learning from Experience and Skills in Multimodal Agents, Guanyu Jiang et.al., Paper: http://arxiv.org/abs/2603.12056
- 2026-01-20, XR: Cross-Modal Agents for Composed Image Retrieval, Zhongyu Yang et.al., Paper: http://arxiv.org/abs/2601.14245
- 2025-12-19, XAgen: An Explainability Tool for Identifying and Correcting Failures in Multi-Agent Workflows, Xinru Wang et.al., Paper: http://arxiv.org/abs/2512.17896
- 2026-03-06, XAI for Coding Agent Failures: Transforming Raw Execution Traces into Actionable Insights, Arun Joshi et.al., Paper: http://arxiv.org/abs/2603.05941
- 2025-12-21, X-Talk: On the Underestimated Potential of Modular Speech-to-Speech Dialogue System, Zhanxun Liu et.al., Paper: http://arxiv.org/abs/2512.18706
- 2026-01-07, Wow, wo, val! A Comprehensive Embodied World Model Evaluation Turing Test, Chun-Kai Fan et.al., Paper: http://arxiv.org/abs/2601.04137
- 2026-03-11, World2Act: Latent Action Post-Training via Skill-Compositional World Models, An Dinh Vuong et.al., Paper: http://arxiv.org/abs/2603.10422
- 2025-10-20, World-in-World: World Models in a Closed-Loop World, Jiahan Zhang et.al., Paper: http://arxiv.org/abs/2510.18135
- 2026-02-17, World-Model-Augmented Web Agents with Action Correction, Zhouzhou Shen et.al., Paper: http://arxiv.org/abs/2602.15384
- 2026-01-29, World of Workflows: a Benchmark for Bringing World Models to Enterprise Systems, Lakshya Gupta et.al., Paper: http://arxiv.org/abs/2601.22130
- 2026-01-14, World Craft: Agentic Framework to Create Visualizable Worlds via Text, Jianwen Sun et.al., Paper: http://arxiv.org/abs/2601.09150
- 2026-02-20, WorkflowPerturb: Calibrated Stress Tests for Evaluating Multi-Agent Workflow Metrics, Madhav Kanda et.al., Paper: http://arxiv.org/abs/2602.17990
- 2025-12-10, Workflow is All You Need: Escaping the “Statistical Smoothing Trap” via High-Entropy Information Foraging and Adversarial Pacing, Zhongjie Jiang et.al., Paper: http://arxiv.org/abs/2512.10121
- 2025-12-12, Words to Describe What I’m Feeling: Exploring the Potential of AI Agents for High Subjectivity Decisions in Advance Care Planning, Kellie Yu Hui Sim et.al., Paper: http://arxiv.org/abs/2512.11276
- 2026-02-19, Wink: Recovering from Misbehaviors in Coding Agents, Rahul Nanda et.al., Paper: http://arxiv.org/abs/2602.17037
- 2026-03-25, Willful Disobedience: Automatically Detecting Failures in Agentic Traces, Reshabh K Sharma et.al., Paper: http://arxiv.org/abs/2603.23806
- 2026-01-01, Will LLM-powered Agents Bias Against Humans? Exploring the Belief-Dependent Vulnerability, Zongwei Wang et.al., Paper: http://arxiv.org/abs/2601.00240
- 2026-01-23, Will It Survive? Deciphering the Fate of AI-Generated Code in Open Source, Musfiqur Rahman et.al., Paper: http://arxiv.org/abs/2601.16809
- 2025-06-04, Will Agents Replace Us? Perceptions of Autonomous Multi-Agent AI, Nikola Balic et.al., Paper: http://arxiv.org/abs/2506.02055
- 2026-02-04, WideSeek-R1: Exploring Width Scaling for Broad Information Seeking via Multi-Agent Reinforcement Learning, Zelai Xu et.al., Paper: http://arxiv.org/abs/2602.04634
- 2025-10-16, Why Instant-Runoff Voting Is So Resilient to Coalitional Manipulation: Phase Transitions in the Perturbed Culture, François Durand et.al., Paper: http://arxiv.org/abs/2510.14450
- 2026-02-10, Why Do AI Agents Systematically Fail at Cloud Root Cause Analysis?, Taeyoon Kim et.al., Paper: http://arxiv.org/abs/2602.09937
- 2026-03-24, Why Database Manuals Are Not Enough: Efficient and Reliable Configuration Tuning for DBMSs via Code-Driven LLM Agents, Xinyi Zhang et.al., Paper: http://arxiv.org/abs/2603.22708
- 2026-02-04, Why Agentic-PRs Get Rejected: A Comparative Study of Coding Agents, Sota Nakashima et.al., Paper: http://arxiv.org/abs/2602.04226
- 2025-12-29, Why AI Safety Requires Uncertainty, Incomplete Preferences, and Non-Archimedean Utilities, Alessio Benavoli et.al., Paper: http://arxiv.org/abs/2512.23508
- 2026-01-27, Who Said CVE? How Vulnerability Identifiers Are Mentioned by Humans, Bots, and Agents in Pull Requests, Pien Rooijendijk et.al., Paper: http://arxiv.org/abs/2601.19636
- 2025-11-14, Who Moved My Distribution? Conformal Prediction for Interactive Multi-Agent Systems, Allen Emmanuel Binny et.al., Paper: http://arxiv.org/abs/2511.11567
- 2025-10-30, Who Grants the Agent Power? Defending Against Instruction Injection via Task-Centric Access Control, Yifeng Cai et.al., Paper: http://arxiv.org/abs/2510.26212
- 2026-02-25, Which Tool Response Should I Trust? Tool-Expertise-Aware Chest X-ray Agent with Multimodal Agentic Learning, Zheang Huai et.al., Paper: http://arxiv.org/abs/2602.21517
- 2025-10-17, Where to Search: Measure the Prior-Structured Search Space of LLM Agents, Zhuo-Yang Song et.al., Paper: http://arxiv.org/abs/2510.14846
- 2025-09-29, Where LLM Agents Fail and How They can Learn From Failures, Kunlun Zhu et.al., Paper: http://arxiv.org/abs/2509.25370
- 2026-03-25, Where Do Your Citations Come From? Citation-Constellation: A Free, Open-Source, No-Code, and Auditable Tool for Citation Network Decomposition with Complementary BARON and HEROCON Scores, Mahbub Ul Alam et.al., Paper: http://arxiv.org/abs/2603.24216
- 2026-01-21, Where Do AI Coding Agents Fail? An Empirical Study of Failed Agentic Pull Requests in GitHub, Ramtin Ehsani et.al., Paper: http://arxiv.org/abs/2601.15195
- 2026-03-17, When the Specification Emerges: Benchmarking Faithfulness Loss in Long-Horizon Coding Agents, Lu Yan et.al., Paper: http://arxiv.org/abs/2603.17104
- 2026-03-02, When an AI Judges Your Work: The Hidden Costs of Algorithmic Assessment, David Almog et.al., Paper: http://arxiv.org/abs/2603.02076
- 2025-10-21, When Your AI Agent Succumbs to Peer-Pressure: Studying Opinion-Change Dynamics of LLMs, Aliakbar Mehdizadeh et.al., Paper: http://arxiv.org/abs/2510.19107
- 2026-04-01, When Users Change Their Mind: Evaluating Interruptible Agents in Long-Horizon Web Navigation, Henry Peng Zou et.al., Paper: http://arxiv.org/abs/2604.00892
- 2026-01-01, When Small Models Are Right for Wrong Reasons: Process Verification for Trustworthy Agents, Laksh Advani et.al., Paper: http://arxiv.org/abs/2601.00513
- 2026-02-11, When Skills Lie: Hidden-Comment Injection in LLM Agents, Qianli Wang et.al., Paper: http://arxiv.org/abs/2602.10498
- 2026-01-08, When Single-Agent with Skills Replace Multi-Agent Systems and When They Fail, Xiaoxiao Li et.al., Paper: http://arxiv.org/abs/2601.04748
- 2026-03-17, When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making, Jun Liu et.al., Paper: http://arxiv.org/abs/2603.16673
- 2025-12-04, When Robots Should Say “I Don’t Know”: Benchmarking Abstention in Embodied Question Answering, Tao Wu et.al., Paper: http://arxiv.org/abs/2512.04597
- 2026-03-17, When Openclaw Agents Learn from Each Other: Insights from Emergent AI Agent Communities for Human-AI Partnership in Education, Eason Chen et.al., Paper: http://arxiv.org/abs/2603.16663
- 2026-02-16, When OpenClaw AI Agents Teach Each Other: Peer Learning Patterns in the Moltbook Community, Eason Chen et.al., Paper: http://arxiv.org/abs/2602.14477
- 2025-10-08, When Machines Meet Each Other: Network Effects and the Strategic Role of History in Multi-Agent AI, Yu Liu et.al., Paper: http://arxiv.org/abs/2510.06903
- 2025-10-10, When LLM Agents Meet Graph Optimization: An Automated Data Quality Improvement Approach, Zhihan Zhang et.al., Paper: http://arxiv.org/abs/2510.08952
- 2025-11-10, When Intelligence Overloads Infrastructure: A Forecast Model for AI-Driven Bottlenecks, Gamal Refai-Ahmed et.al., Paper: http://arxiv.org/abs/2511.07265
- 2025-09-29, When Greedy Wins: Emergent Exploitation Bias in Meta-Bandit LLM Training, Sanxing Chen et.al., Paper: http://arxiv.org/abs/2509.24923
- 2025-11-06, When Empowerment Disempowers, Claire Yang et.al., Paper: http://arxiv.org/abs/2511.04177
- 2026-01-12, When Bots Take the Bait: Exposing and Mitigating the Emerging Social Engineering Attack in Web Automation Agent, Xinyi Wu et.al., Paper: http://arxiv.org/abs/2601.07263
- 2025-09-02, When Agents go Astray: Course-Correcting SWE Agents with PRMs, Shubham Gandhi et.al., Paper: http://arxiv.org/abs/2509.02360
- 2025-10-13, When Agents Trade: Live Multi-Market Trading Benchmark for LLM Agents, Lingfei Qian et.al., Paper: http://arxiv.org/abs/2510.11695
- 2026-01-21, When Agents Fail: A Comprehensive Study of Bugs in LLM Agents with Automated Labeling, Niful Islam et.al., Paper: http://arxiv.org/abs/2601.15232
- 2026-01-22, When Agents Fail to Act: A Diagnostic Framework for Tool Invocation Reliability in Multi-Agent LLM Systems, Donghao Huang et.al., Paper: http://arxiv.org/abs/2601.16280
- 2025-12-12, When Actions Teach You to Think: Reasoning-Action Synergy via Reinforcement Learning in Conversational Agents, Mrinal Rawat et.al., Paper: http://arxiv.org/abs/2512.11277
- 2026-02-23, When AI Teammates Meet Code Review: Collaboration Signals Shaping the Integration of Agent-Authored Pull Requests, Costain Nachuma et.al., Paper: http://arxiv.org/abs/2602.19441
- 2025-10-15, When “Correct” Is Not Safe: Can We Trust Functionally Correct Patches Generated by Code Agents?, Yibo Peng et.al., Paper: http://arxiv.org/abs/2510.17862
- 2026-01-29, When “Better” Prompts Hurt: Evaluation-Driven Iteration for LLM Applications, Daniel Commey et.al., Paper: http://arxiv.org/abs/2601.22025
- 2025-12-04, WhatsCode: Large-Scale GenAI Deployment for Developer Efficiency at WhatsApp, Ke Mao et.al., Paper: http://arxiv.org/abs/2512.05314
- 2026-02-19, What to Cut? Predicting Unnecessary Methods in Agentic Code Generation, Kan Watanabe et.al., Paper: http://arxiv.org/abs/2602.17091
- 2026-03-02, What Papers Don’t Tell You: Recovering Tacit Knowledge for Automated Paper Reproduction, Lehui Li et.al., Paper: http://arxiv.org/abs/2603.01801
- 2026-02-19, What Makes a Good LLM Agent for Real-world Penetration Testing?, Gelei Deng et.al., Paper: http://arxiv.org/abs/2602.17622
- 2025-09-26, What Makes LLM Agent Simulations Useful for Policy? Insights From an Iterative Design Engagement in Emergency Preparedness, Yuxuan Li et.al., Paper: http://arxiv.org/abs/2509.21868
- 2025-11-19, What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity, Alexis Audran-Reiss et.al., Paper: http://arxiv.org/abs/2511.15593
- 2026-01-14, What Do LLM Agents Know About Their World? Task2Quiz: A Paradigm for Studying Environment Understanding, Siyuan Liu et.al., Paper: http://arxiv.org/abs/2601.09503
- 2025-09-25, What Do LLM Agents Do When Left Alone? Evidence of Spontaneous Meta-Cognitive Patterns, Stefan Szeider et.al., Paper: http://arxiv.org/abs/2509.21224
- 2025-10-29, What Challenges Do Developers Face in AI Agent Systems? An Empirical Study on Stack Overflow, Ali Asgari et.al., Paper: http://arxiv.org/abs/2510.25423
- 2026-02-16, WebWorld: A Large-Scale World Model for Web Agent Training, Zikai Xiao et.al., Paper: http://arxiv.org/abs/2602.14721
- 2026-01-13, WebTrap Park: An Automated Platform for Systematic Security Evaluation of Web Agents, Xinyi Wu et.al., Paper: http://arxiv.org/abs/2601.08406
- 2026-02-03, WebSentinel: Detecting and Localizing Prompt Injection Attacks for Web Agents, Xilong Wang et.al., Paper: http://arxiv.org/abs/2602.03792
- 2025-10-21, WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection, Guanzhong He et.al., Paper: http://arxiv.org/abs/2510.18798
- 2025-09-16, WebSailor-V2: Bridging the Chasm to Proprietary Agents via Synthetic Data and Scalable Reinforcement Learning, Kuan Li et.al., Paper: http://arxiv.org/abs/2509.13305
- 2025-10-13, WebRouter: Query-specific Router via Variational Information Bottleneck for Cost-sensitive Web Agent, Tao Li et.al., Paper: http://arxiv.org/abs/2510.11221
- 2025-10-22, WebGraphEval: Multi-Turn Trajectory Evaluation for Web Agents using Graph Representation, Yaoyao Qian et.al., Paper: http://arxiv.org/abs/2510.19205
- 2026-03-05, WebFactory: Automated Compression of Foundational Language Intelligence into Grounded Web Agents, Sicheng Fan et.al., Paper: http://arxiv.org/abs/2603.05044
- 2025-10-21, WebDevJudge: Evaluating (M)LLMs as Critiques for Web Development Quality, Chunyang Li et.al., Paper: http://arxiv.org/abs/2510.18560
- 2025-10-08, WebDART: Dynamic Decomposition and Re-planning for Complex Web Tasks, Jingbo Yang et.al., Paper: http://arxiv.org/abs/2510.06587
- 2026-02-13, WebClipper: Efficient Evolution of Web Agents with Graph-based Trajectory Pruning, Junjie Wang et.al., Paper: http://arxiv.org/abs/2602.12852
- 2026-03-05, WebChain: A Large-Scale Human-Annotated Dataset of Real-World Web Interaction Traces, Sicheng Fan et.al., Paper: http://arxiv.org/abs/2603.05295
- 2026-01-29, WebArbiter: A Principle-Guided Reasoning Process Reward Model for Web Agents, Yao Zhang et.al., Paper: http://arxiv.org/abs/2601.21872
- 2026-02-19, Web Verbs: Typed Abstractions for Reliable Task Composition on the Agentic Web, Linxi Jiang et.al., Paper: http://arxiv.org/abs/2602.17245
- 2024-10-30, Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents, Wenkai Yang et.al., Paper: http://arxiv.org/abs/2402.11208
- 2025-11-19, Walrus: A Cross-Domain Foundation Model for Continuum Dynamics, Michael McCabe et.al., Paper: http://arxiv.org/abs/2511.15684
- 2025-12-09, WOLF: Werewolf-based Observations for LLM Deception and Falsehoods, Mrinal Agarwal et.al., Paper: http://arxiv.org/abs/2512.09187
- 2025-10-17, WEBSERV: A Browser-Server Environment for Efficient Training of Reinforcement Learning-based Web Agents at Scale, Yuxuan Lu et.al., Paper: http://arxiv.org/abs/2510.16252
- 2025-09-28, WAREX: Web Agent Reliability Evaluation on Existing Benchmarks, Su Kara et.al., Paper: http://arxiv.org/abs/2510.03285
- 2026-01-20, VulnResolver: A Hybrid Agent Framework for LLM-Based Automated Vulnerability Issue Resolution, Mingming Zhang et.al., Paper: http://arxiv.org/abs/2601.13933
- 2025-12-08, VulnLLM-R: Specialized Reasoning LLM with Agent Scaffold for Vulnerability Detection, Yuzhou Nie et.al., Paper: http://arxiv.org/abs/2512.07533
- 2025-09-30, VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications, Wei He et.al., Paper: http://arxiv.org/abs/2509.26490
- 2025-12-10, VisualActBench: Can VLMs See and Act like a Human?, Daoan Zhang et.al., Paper: http://arxiv.org/abs/2512.09907
- 2026-02-12, Visual Reasoning Benchmark: Evaluating Multimodal LLMs on Classroom-Authentic Visual Problems from Primary Education, Mohamed Huti et.al., Paper: http://arxiv.org/abs/2602.12196
- 2026-02-17, Visual Persuasion: What Influences Decisions of Vision-Language Models?, Manuel Cherep et.al., Paper: http://arxiv.org/abs/2602.15278
- 2026-03-17, Visual Distraction Undermines Moral Reasoning in Vision-Language Models, Xinyi Yang et.al., Paper: http://arxiv.org/abs/2603.16445
- 2025-10-31, Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning, Qiusi Zhan et.al., Paper: http://arxiv.org/abs/2510.27623
- 2025-09-15, VisDocSketcher: Towards Scalable Visual Documentation with Agentic Systems, Luís F. Gomes et.al., Paper: http://arxiv.org/abs/2509.11942
- 2026-03-17, VisBrowse-Bench: Benchmarking Visual-Native Search for Multimodal Browsing Agents, Zhengbo Zhang et.al., Paper: http://arxiv.org/abs/2603.16289
- 2026-01-12, VirtualEnv: A Platform for Embodied AI Research, Kabir Swain et.al., Paper: http://arxiv.org/abs/2601.07553
- 2026-01-20, VirtualCrime: Evaluating Criminal Potential of Large Language Models via Sandbox Simulation, Yilin Tang et.al., Paper: http://arxiv.org/abs/2601.13981
- 2026-02-23, Vinedresser3D: Agentic Text-guided 3D Editing, Yankuan Chi et.al., Paper: http://arxiv.org/abs/2602.19542
- 2025-12-25, Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations, Xin Liu et.al., Paper: http://arxiv.org/abs/2512.21586
- 2026-03-26, VideoWeaver: Multimodal Multi-View Video-to-Video Transfer for Embodied Agents, George Eskandar et.al., Paper: http://arxiv.org/abs/2603.25420
- 2026-01-22, VideoThinker: Building Agentic VideoLLMs with LLM-Guided Tool Reasoning, Chenglin Li et.al., Paper: http://arxiv.org/abs/2601.15724
- 2025-12-28, Video-BrowseComp: Benchmarking Agentic Video Research on Open Web, Zhengyang Liang et.al., Paper: http://arxiv.org/abs/2512.23044
- 2026-03-10, Vibe-Creation: The Epistemology of Human-AI Emergent Cognition, Ilya Levin et.al., Paper: http://arxiv.org/abs/2603.09486
- 2025-12-22, Vibe Reasoning: Eliciting Frontier AI Mathematical Capabilities – A Case Study on IMO 2025 Problem 6, Jiaao Wu et.al., Paper: http://arxiv.org/abs/2512.19287
- 2026-01-21, Vibe Coding Kills Open Source, Miklós Koren et.al., Paper: http://arxiv.org/abs/2601.15494
- 2026-02-04, Vibe AIGC: A New Paradigm for Content Generation via Agentic Orchestration, Jiaheng Liu et.al., Paper: http://arxiv.org/abs/2602.04575
- 2025-12-18, Verifying Hadwiger’s Conjecture for Examples of Graphs with $α(G) = 2$, Jofre Costa et.al., Paper: http://arxiv.org/abs/2512.17114
- 2026-02-03, Verified Critical Step Optimization for LLM Agents, Mukai Li et.al., Paper: http://arxiv.org/abs/2602.03412
- 2025-12-15, Verification-Guided Context Optimization for Tool Calling via Hierarchical LLMs-as-Editors, Henger Li et.al., Paper: http://arxiv.org/abs/2512.13860
- 2025-10-20, Verification-Aware Planning for Multi-Agent Systems, Tianyang Xu et.al., Paper: http://arxiv.org/abs/2510.17109
- 2026-02-18, Verifiable Semantics for Agent-to-Agent Communication, Philipp Schoenegger et.al., Paper: http://arxiv.org/abs/2602.16424
- 2025-10-31, VeriMoA: A Mixture-of-Agents Framework for Spec-to-HDL Generation, Heng Ping et.al., Paper: http://arxiv.org/abs/2510.27617
- 2025-10-03, VeriGuard: Enhancing LLM Agent Safety via Verified Code Generation, Lesly Miculicich et.al., Paper: http://arxiv.org/abs/2510.05156
- 2026-03-18, VeriGrey: Greybox Agent Validation, Yuntong Zhang et.al., Paper: http://arxiv.org/abs/2603.17639
- 2026-03-18, VeriAgent: A Tool-Integrated Multi-Agent System with Evolving Memory for PPA-Aware RTL Code Generation, Yaoxiang Wang et.al., Paper: http://arxiv.org/abs/2603.17613
- 2026-01-27, Veri-Sure: A Contract-Aware Multi-Agent Framework with Temporal Tracing and Formal Verification for Correct RTL Code Generation, Jiale Liu et.al., Paper: http://arxiv.org/abs/2601.19747
- 2026-03-23, Velocity Potential Neural Field for Efficient Ambisonics Impulse Response Modeling, Yoshiki Masuyama et.al., Paper: http://arxiv.org/abs/2603.22589
- 2026-03-25, VehicleMemBench: An Executable Benchmark for Multi-User Long-Term Memory in In-Vehicle Agents, Yuhao Chen et.al., Paper: http://arxiv.org/abs/2603.23840
- 2025-11-17, Variance Stabilizing Transformations for Electricity Price Forecasting in Periods of Increased Volatility, Bartosz Uniejewski et.al., Paper: http://arxiv.org/abs/2511.13603
- 2025-10-31, Validity Is What You Need, Sebastian Benthall et.al., Paper: http://arxiv.org/abs/2510.27628
- 2025-11-26, VacuumVLA: Boosting VLA Capabilities via a Unified Suction and Gripping Tool for Complex Robotic Manipulation, Hui Zhou et.al., Paper: http://arxiv.org/abs/2511.21557
- 2026-03-03, VSearcher: Long-Horizon Multimodal Search Agent via Reinforcement Learning, Ruiyang Zhang et.al., Paper: http://arxiv.org/abs/2603.02795
- 2025-10-14, VQArt-Bench: A semantically rich VQA Benchmark for Art and Cultural Heritage, A. Alfarano et.al., Paper: http://arxiv.org/abs/2510.12750
- 2025-11-25, VQ-VA World: Towards High-Quality Visual Question-Visual Answering, Chenhui Gou et.al., Paper: http://arxiv.org/abs/2511.20573
- 2025-12-31, VLN-MME: Diagnosing MLLMs as Language-guided Visual Navigation agents, Xunyi Zhao et.al., Paper: http://arxiv.org/abs/2512.24851
- 2025-10-16, VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation, Han Zhao et.al., Paper: http://arxiv.org/abs/2510.14902
- 2026-02-12, VIRENA: Virtual Arena for Research, Education, and Democratic Innovation, Emma Hoes et.al., Paper: http://arxiv.org/abs/2602.12207
- 2026-02-04, VILLAIN at AVerImaTeC: Verifying Image-Text Claims via Multi-Agent Collaboration, Jaeyoon Jung et.al., Paper: http://arxiv.org/abs/2602.04587
- 2026-01-09, VIGIL: Defending LLM Agents Against Tool Stream Injection via Verify-Before-Commit, Junda Lin et.al., Paper: http://arxiv.org/abs/2601.05755
- 2025-12-08, VIGIL: A Reflective Runtime for Self-Healing Agents, Christopher Cruz et.al., Paper: http://arxiv.org/abs/2512.07094
- 2025-10-17, VERA-MH Concept Paper, Luca Belli et.al., Paper: http://arxiv.org/abs/2510.15297
- 2025-11-24, VDC-Agent: When Video Detailed Captioners Evolve Themselves via Agentic Self-Reflection, Qiang Wang et.al., Paper: http://arxiv.org/abs/2511.19436
- 2025-11-04, VCode: a Multimodal Coding Benchmark with SVG as Symbolic Visual Representation, Kevin Qinghong Lin et.al., Paper: http://arxiv.org/abs/2511.02778
- 2025-10-19, VAGEN: Reinforcing World Model Reasoning for Multi-Turn VLM Agents, Kangrui Wang et.al., Paper: http://arxiv.org/abs/2510.16907
- 2026-03-31, VACP: Visual Analytics Context Protocol, Tobias Stähle et.al., Paper: http://arxiv.org/abs/2603.29322
- 2026-02-05, V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval, Dongyang Chen et.al., Paper: http://arxiv.org/abs/2602.06034
- 2025-09-12, V-Math: An Agentic Approach to the Vietnamese National High School Graduation Mathematics Exams, Duong Q. Nguyen et.al., Paper: http://arxiv.org/abs/2509.12251
- 2025-08-26, Utilizing Training Data to Improve LLM Reasoning for Tabular Understanding, Chufan Gao et.al., Paper: http://arxiv.org/abs/2508.18676
- 2026-03-20, Utility-Guided Agent Orchestration for Efficient LLM Tool Use, Boyan Liu et.al., Paper: http://arxiv.org/abs/2603.19896
- 2025-10-14, Using Kolmogorov-Smirnov Distance for Measuring Distribution Shift in Machine Learning, Ozan K. Tonguz et.al., Paper: http://arxiv.org/abs/2510.15996
- 2025-10-30, Using Copilot Agent Mode to Automate Library Migration: A Quantitative Assessment, Aylton Almeida et.al., Paper: http://arxiv.org/abs/2510.26699
- 2026-01-13, User-Oriented Multi-Turn Dialogue Generation with Tool Use at scale, Jungho Cho et.al., Paper: http://arxiv.org/abs/2601.08225
- 2026-03-21, User Preference Modeling for Conversational LLM Agents: Weak Rewards from Retrieval-Augmented Interaction, Yuren Hao et.al., Paper: http://arxiv.org/abs/2603.20939
- 2025-10-16, UrbanVerse: Scaling Urban Simulation by Watching City-Tour Videos, Mingxuan Liu et.al., Paper: http://arxiv.org/abs/2510.15018
- 2025-12-10, UrbanNav: Learning Language-Guided Urban Navigation from Web-Scale Human Trajectories, Yanghong Mei et.al., Paper: http://arxiv.org/abs/2512.09607
- 2025-11-04, Unsupervised Evaluation of Multi-Turn Objective-Driven Interactions, Emi Soroka et.al., Paper: http://arxiv.org/abs/2511.03047
- 2026-01-15, Unlocking Implicit Experience: Synthesizing Tool-Use Trajectories from Text, Zhihao Xu et.al., Paper: http://arxiv.org/abs/2601.10355
- 2025-11-13, Unlocking Dynamic Inter-Client Spatial Dependencies: A Federated Spatio-Temporal Graph Learning Method for Traffic Flow Forecasting, Feng Wang et.al., Paper: http://arxiv.org/abs/2511.10434
- 2025-10-18, Unleashing Diverse Thinking Modes in LLMs through Multi-Agent Collaboration, Zhixuan He et.al., Paper: http://arxiv.org/abs/2510.16645
- 2025-11-26, Uniform inference for kernel instrumental variable regression, Marvin Lob et.al., Paper: http://arxiv.org/abs/2511.21603
- 2026-03-23, UniDex: A Robot Foundation Suite for Universal Dexterous Hand Control from Egocentric Human Videos, Gu Zhang et.al., Paper: http://arxiv.org/abs/2603.22264
- 2025-09-17, Understanding the Process of Human-AI Value Alignment, Jack McKinlay et.al., Paper: http://arxiv.org/abs/2509.13854
- 2026-01-01, Understanding Security Risks of AI Agents’ Dependency Updates, Tanmay Singla et.al., Paper: http://arxiv.org/abs/2601.00205
- 2026-01-06, Understanding Multi-Agent Reasoning with Large Language Models for Cartoon VQA, Tong Wu et.al., Paper: http://arxiv.org/abs/2601.03073
- 2025-12-08, Understanding LLM Agent Behaviours via Game Theory: Strategy Recognition, Biases and Multi-Agent Dynamics, Trung-Kiet Huynh et.al., Paper: http://arxiv.org/abs/2512.07462
- 2026-01-20, Understanding Human-Multi-Agent Team Formation for Creative Work, Hyunseung Lim et.al., Paper: http://arxiv.org/abs/2601.13865
- 2026-01-27, Understanding Dominant Themes in Reviewing Agentic AI-authored Code, Md. Asif Haider et.al., Paper: http://arxiv.org/abs/2601.19287
- 2026-03-13, Uncovering Security Threats and Architecting Defenses in Autonomous Agents: A Case Study of OpenClaw, Zonghao Ying et.al., Paper: http://arxiv.org/abs/2603.12644
- 2025-11-06, Unclonable Cryptography in Linear Quantum Memory, Omri Shmueli et.al., Paper: http://arxiv.org/abs/2511.04633
- 2025-10-13, Uncertainty-Aware, Risk-Adaptive Access Control for Agentic Systems using an LLM-Judged TBAC Model, Charles Fleming et.al., Paper: http://arxiv.org/abs/2510.11414
- 2025-11-24, Unboxing the Black Box: Mechanistic Interpretability for Algorithmic Understanding of Neural Networks, Bianka Kowalska et.al., Paper: http://arxiv.org/abs/2511.19265
- 2026-03-11, UltrasoundAgents: Hierarchical Multi-Agent Evidence-Chain Reasoning for Breast Ultrasound Diagnosis, Yali Zhu et.al., Paper: http://arxiv.org/abs/2603.10852
- 2025-09-26, UltraHorizon: Benchmarking Agent Capabilities in Ultra Long-Horizon Scenarios, Haotian Luo et.al., Paper: http://arxiv.org/abs/2509.21766
- 2026-02-27, UXSim: Towards a Hybrid User Search Simulation, Saber Zerhoudi et.al., Paper: http://arxiv.org/abs/2602.24241
- 2026-01-22, UXCascade: Scalable Usability Testing with Simulated User Agents, Steffen Holter et.al., Paper: http://arxiv.org/abs/2601.15777
- 2026-02-11, UMEM: Unified Memory Extraction and Management Framework for Generalizable Memory, Yongshi Ye et.al., Paper: http://arxiv.org/abs/2602.10652
- 2026-04-01, UK AISI Alignment Evaluation Case-Study, Alexandra Souly et.al., Paper: http://arxiv.org/abs/2604.00788
- 2025-09-22, UIPro: Unleashing Superior Interaction Capability For GUI Agents, Hongxin Li et.al., Paper: http://arxiv.org/abs/2509.17328
- 2025-09-05, UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning, Haoming Wang et.al., Paper: http://arxiv.org/abs/2509.02544
- 2025-11-05, U2F: Encouraging SWE-Agent to Seize Novelty without Losing Feasibility, Wencheng Ye et.al., Paper: http://arxiv.org/abs/2511.03517
- 2026-02-27, U-Mind: A Unified Framework for Real-Time Multimodal Interaction with Audiovisual Generation, Xiang Deng et.al., Paper: http://arxiv.org/abs/2602.23739
- 2025-11-21, U-DESPE: a Bayesian Utility-based methodology for dosing regimen optimization in early-phase oncology trials based on Dose-Exposure, Safety, Pharmacodynamics, Efficacy, Anaïs Andrillon et.al., Paper: http://arxiv.org/abs/2511.17376
- 2026-02-25, Two-Stage Active Distribution Network Voltage Control via LLM-RL Collaboration: A Hybrid Knowledge-Data-Driven Approach, Xu Yang et.al., Paper: http://arxiv.org/abs/2602.21715
- 2025-11-13, Two new results on maximal left-compressed intersecting families, Allan Flower et.al., Paper: http://arxiv.org/abs/2511.10592
- 2025-11-10, TwinOR: Photorealistic Digital Twins of Dynamic Operating Rooms for Embodied AI Research, Han Zhang et.al., Paper: http://arxiv.org/abs/2511.07412
- 2025-12-18, Turn-PPO: Turn-Level Advantage Estimation with PPO for Improved Multi-Turn RL in Agentic LLMs, Junbo Li et.al., Paper: http://arxiv.org/abs/2512.17008
- 2026-02-06, Trustworthy AI Software Engineers, Aldeida Aleti et.al., Paper: http://arxiv.org/abs/2602.06310
- 2025-12-05, Trusted AI Agents in the Cloud, Teofil Bodea et.al., Paper: http://arxiv.org/abs/2512.05951
- 2026-03-09, Trust via Reputation of Conviction, Aravind R. Iyengar et.al., Paper: http://arxiv.org/abs/2603.08575
- 2026-03-20, Trojan’s Whisper: Stealthy Manipulation of OpenClaw through Injected Bootstrapped Guidance, Fazhong Liu et.al., Paper: http://arxiv.org/abs/2603.19974
- 2026-03-16, TrinityGuard: A Unified Framework for Safeguarding Multi-Agent Systems, Kai Wang et.al., Paper: http://arxiv.org/abs/2603.15408
- 2025-12-12, TriFlow: A Progressive Multi-Agent Framework for Intelligent Trip Planning, Yuxing Chen et.al., Paper: http://arxiv.org/abs/2512.11271
- 2025-10-17, TriAgent: Automated Biomarker Discovery with Deep Research Grounding for Triage in Acute Care by LLM-Based Multi-Agent Collaboration, Kerem Delikoyun et.al., Paper: http://arxiv.org/abs/2510.16080
- 2026-02-10, TreeCUA: Efficiently Scaling GUI Automation with Tree-Structured Verifiable Evolution, Deyang Jiang et.al., Paper: http://arxiv.org/abs/2602.09662
- 2025-10-11, Tree Search for LLM Agent Reinforcement Learning, Yuxiang Ji et.al., Paper: http://arxiv.org/abs/2509.21240
- 2026-01-21, TransportAgents: a multi-agents LLM framework for traffic accident severity prediction, Zhichao Yang et.al., Paper: http://arxiv.org/abs/2601.15519
- 2025-11-18, Transferring Data from a Voronoi Mesh to an Adaptive Cartesian Grid in Pursuit of Self-consistent Top-down Star Formation, Sean C. Lewis et.al., Paper: http://arxiv.org/abs/2511.14697
- 2025-12-24, Transductive Visual Programming: Evolving Tool Libraries from Experience for Spatial Reasoning, Shengguang Wu et.al., Paper: http://arxiv.org/abs/2512.20934
- 2026-03-11, Trajectory-Informed Memory Generation for Self-Improving Agent Systems, Gaodan Fang et.al., Paper: http://arxiv.org/abs/2603.10600
- 2026-01-02, Trajectory Guard – A Lightweight, Sequence-Aware Model for Real-Time Anomaly Detection in Agentic AI, Laksh Advani et.al., Paper: http://arxiv.org/abs/2601.00516
- 2026-02-06, TrajAD: Trajectory Anomaly Detection for Trustworthy LLM Agents, Yibing Liu et.al., Paper: http://arxiv.org/abs/2602.06443
- 2025-10-07, Training-Free Time Series Classification via In-Context Reasoning with LLM Agents, Songyuan Sui et.al., Paper: http://arxiv.org/abs/2510.05950
- 2025-10-09, Training-Free Group Relative Policy Optimization, Yuzheng Cai et.al., Paper: http://arxiv.org/abs/2510.08191
- 2025-09-24, Training Task Reasoning LLM Agents for Multi-turn Task Planning via Single-turn Reinforcement Learning, Hanjiang Hu et.al., Paper: http://arxiv.org/abs/2509.20616
- 2026-02-03, Training Multi-Turn Search Agent via Contrastive Dynamic Branch Sampling, Yubao Zhao et.al., Paper: http://arxiv.org/abs/2602.03719
- 2025-12-08, Training Language Models to Use Prolog as a Tool, Niklas Mellgren et.al., Paper: http://arxiv.org/abs/2512.07407
- 2025-10-16, Training LLM Agents to Empower Humans, Evan Ellis et.al., Paper: http://arxiv.org/abs/2510.13709
- 2025-12-24, TrafficSimAgent: A Hierarchical Agent Framework for Autonomous Traffic Simulation with MCP Control, Yuwei Du et.al., Paper: http://arxiv.org/abs/2512.20996
- 2026-02-16, Traceable Latent Variable Discovery Based on Multi-Agent Collaboration, Huaming Du et.al., Paper: http://arxiv.org/abs/2602.14456
- 2026-02-06, TraceCoder: A Trace-Driven Multi-Agent Framework for Automated Debugging of LLM-Generated Code, Jiangping Huang et.al., Paper: http://arxiv.org/abs/2602.06875
- 2026-02-13, TraceBack: Multi-Agent Decomposition for Fine-Grained Table Attribution, Tejas Anvekar et.al., Paper: http://arxiv.org/abs/2602.13059
- 2025-10-22, Trace: Securing Smart Contract Repository Against Access Control Vulnerability, Chong Chen et.al., Paper: http://arxiv.org/abs/2510.19254
- 2026-03-26, Trace2Skill: Distill Trajectory-Local Lessons into Transferable Agent Skills, Jingwei Ni et.al., Paper: http://arxiv.org/abs/2603.25158
- 2025-09-11, TrEnv: Transparently Share Serverless Execution Environments Across Different Functions and Nodes, Jialiang Huang et.al., Paper: http://arxiv.org/abs/2509.09525
- 2026-01-09, TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents, Dawei Wang et.al., Paper: http://arxiv.org/abs/2601.05899
- 2026-03-17, Towards the Vision-Sound-Language-Action Paradigm: The HEAR Framework for Sound-Centric Manipulation, Chang Nie et.al., Paper: http://arxiv.org/abs/2603.16086
- 2025-11-13, Towards an Agentic Workflow for Internet Measurement Research, Alagappan Ramanathan et.al., Paper: http://arxiv.org/abs/2511.10611
- 2026-01-27, Towards a complete characterization of indicator variograms and madograms, Xavier Emery et.al., Paper: http://arxiv.org/abs/2601.19800
- 2026-02-18, Towards a Science of AI Agent Reliability, Stephan Rabanser et.al., Paper: http://arxiv.org/abs/2602.16666
- 2025-12-12, Towards Trustworthy Multi-Turn LLM Agents via Behavioral Guidance, Gonca Gürsun et.al., Paper: http://arxiv.org/abs/2512.11421
- 2025-09-20, Towards Transparent and Incentive-Compatible Collaboration in Decentralized LLM Multi-Agent Systems: A Blockchain-Driven Approach, Minfeng Qi et.al., Paper: http://arxiv.org/abs/2509.16736
- 2026-03-25, Towards Semantic-based Agent Communication Networks: Vision, Technologies, and Challenges, Ping Zhang et.al., Paper: http://arxiv.org/abs/2603.24328
- 2026-02-16, Towards Selection as Power: Bounding Decision Authority in Autonomous Agents, Jose Manuel de la Chica Rodriguez et.al., Paper: http://arxiv.org/abs/2602.14606
- 2025-09-19, Towards Robust Visual Continual Learning with Multi-Prototype Supervision, Xiwei Liu et.al., Paper: http://arxiv.org/abs/2509.16011
- 2025-12-25, Towards Responsible and Explainable AI Agents with Consensus-Driven Reasoning, Eranga Bandara et.al., Paper: http://arxiv.org/abs/2512.21699
- 2026-01-15, Towards Reliable ML Feature Engineering via Planning in Constrained-Topology of LLM Agents, Himanshu Thakur et.al., Paper: http://arxiv.org/abs/2601.10820
- 2025-10-24, Towards Reliable Code-as-Policies: A Neuro-Symbolic Framework for Embodied Task Planning, Sanghyun Ahn et.al., Paper: http://arxiv.org/abs/2510.21302
- 2025-11-05, Towards Realistic Project-Level Code Generation via Multi-Agent Collaboration and Semantic Architecture Modeling, Qianhui Zhao et.al., Paper: http://arxiv.org/abs/2511.03404
- 2025-11-17, Towards Multimodal Representation Learning in Paediatric Kidney Disease, Ana Durica et.al., Paper: http://arxiv.org/abs/2511.13637
- 2026-02-20, Towards More Standardized AI Evaluation: From Models to Agents, Ali El Filali et.al., Paper: http://arxiv.org/abs/2602.18029
- 2026-03-09, Towards Modeling Cybersecurity Behavior of Humans in Organizations, Klaas Ole Kürtz et.al., Paper: http://arxiv.org/abs/2603.08484
- 2026-01-04, Towards LLM-enabled autonomous combustion research: A literature-aware agent for self-corrective modeling workflows, Ke Xiao et.al., Paper: http://arxiv.org/abs/2601.01357
- 2025-10-19, Towards Interpretable and Trustworthy Time Series Reasoning: A BlueSky Vision, Kanghui Ning et.al., Paper: http://arxiv.org/abs/2510.16980
- 2026-03-21, Towards Intelligent Geospatial Data Discovery: a knowledge graph-driven multi-agent framework powered by large language models, Ruixiang Liu et.al., Paper: http://arxiv.org/abs/2603.20670
- 2025-12-09, Towards Foundation Models with Native Multi-Agent Intelligence, Shuyue Hu et.al., Paper: http://arxiv.org/abs/2512.08743
- 2026-02-12, Towards Fair and Comprehensive Evaluation of Routers in Collaborative LLM Systems, Wanxing Wu et.al., Paper: http://arxiv.org/abs/2602.11877
- 2025-12-04, Towards Ethical Multi-Agent Systems of Large Language Models: A Mechanistic Interpretability Perspective, Jae Hee Lee et.al., Paper: http://arxiv.org/abs/2512.04691
- 2025-11-13, Towards Emotionally Intelligent and Responsible Reinforcement Learning, Garapati Keerthana et.al., Paper: http://arxiv.org/abs/2511.10573
- 2024-12-10, Towards Effective GenAI Multi-Agent Collaboration: Design and Evaluation for Enterprise Applications, Raphael Shu et.al., Paper: http://arxiv.org/abs/2412.05449
- 2025-11-28, Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent, Jianzhe Lin et.al., Paper: http://arxiv.org/abs/2511.23436
- 2026-03-11, Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis, Yujie Zheng et.al., Paper: http://arxiv.org/abs/2603.10846
- 2025-12-22, Towards Closed-Loop Embodied Empathy Evolution: Probing LLM-Centric Lifelong Empathic Motion Generation in Unseen Scenarios, Jiawen Wang et.al., Paper: http://arxiv.org/abs/2512.19551
- 2026-02-25, Towards Autonomous Graph Data Analytics with Analytics-Augmented Generation, Qiange Wang et.al., Paper: http://arxiv.org/abs/2602.21604
- 2025-10-17, Towards Automatic Evaluation and Selection of PHI De-identification Models via Multi-Agent Collaboration, Guanchen Wu et.al., Paper: http://arxiv.org/abs/2510.16194
- 2025-10-16, Towards Automated Governance: A DSL for Human-Agent Collaboration in Software Projects, Adem Ait et.al., Paper: http://arxiv.org/abs/2510.14465
- 2025-09-02, Towards Agents That Know When They Don’t Know: Uncertainty as a Control Signal for Structured Reasoning, Josefa Lia Stoisser et.al., Paper: http://arxiv.org/abs/2509.02401
- 2025-09-30, Towards Agentic OS: An LLM Agent Framework for Linux Schedulers, Yusheng Zheng et.al., Paper: http://arxiv.org/abs/2509.01245
- 2026-02-06, Towards Adaptive Environment Generation for Training Embodied Agents, Teresa Yeo et.al., Paper: http://arxiv.org/abs/2602.06366
- 2025-12-08, Towards Accurate UAV Image Perception: Guiding Vision-Language Models with Stronger Task Prompts, Mingning Guo et.al., Paper: http://arxiv.org/abs/2512.07302
- 2025-10-23, Towards AI Agents for Course Instruction in Higher Education: Early Experiences from the Field, Yogesh Simmhan et.al., Paper: http://arxiv.org/abs/2510.20255
- 2025-12-14, Towards AI Agents Supported Research Problem Formulation, Anrafel Fernandes Pereira et.al., Paper: http://arxiv.org/abs/2512.12719
- 2026-02-24, Toward an Agentic Infused Software Ecosystem, Mark Marron et.al., Paper: http://arxiv.org/abs/2602.20979
- 2026-01-15, Toward Ultra-Long-Horizon Agentic Science: Cognitive Accumulation for Machine Learning Engineering, Xinyu Zhu et.al., Paper: http://arxiv.org/abs/2601.10402
- 2025-12-29, Toward Trustworthy Agentic AI: A Multimodal Framework for Preventing Prompt Injection Attacks, Toqeer Ali Syed et.al., Paper: http://arxiv.org/abs/2512.23557
- 2026-02-18, Toward Scalable Verifiable Reward: Proxy State-Based Evaluation for Multi-turn Tool-Calling LLM Agents, Yun-Shiuan Chuang et.al., Paper: http://arxiv.org/abs/2602.16246
- 2025-09-16, Toward PDDL Planning Copilot, Yarin Benyamin et.al., Paper: http://arxiv.org/abs/2509.12987
- 2025-08-25, Toward Generalized Autonomous Agents: A Neuro-Symbolic AI Framework for Integrating Social and Technical Support in Education, Ryan Hare et.al., Paper: http://arxiv.org/abs/2508.18406
- 2026-02-09, Toward Formalizing LLM-Based Agent Designs through Structural Context Modeling and Semantic Dynamics Analysis, Haoyu Jia et.al., Paper: http://arxiv.org/abs/2602.08276
- 2026-02-26, Toward Expert Investment Teams:A Multi-Agent LLM System with Fine-Grained Trading Tasks, Kunihiro Miyazaki et.al., Paper: http://arxiv.org/abs/2602.23330
- 2026-01-20, Toward Efficient Agents: Memory, Tool learning, and Planning, Xiaofang Yang et.al., Paper: http://arxiv.org/abs/2601.14192
- 2025-08-26, Toward Edge General Intelligence with Agentic AI and Agentification: Concepts, Technologies, and Future Directions, Ruichen Zhang et.al., Paper: http://arxiv.org/abs/2508.18725
- 2026-02-27, Toward E2E Intelligence in 6G Networks: An AI Agent-Based RAN-CN Converged Intelligence Framework, Youbin Han et.al., Paper: http://arxiv.org/abs/2602.23623
- 2025-10-08, Toward Causal-Visual Programming: Enhancing Agentic Reasoning in Low-Code Environments, Jiexi Xu et.al., Paper: http://arxiv.org/abs/2509.25282
- 2025-11-05, Toward Autonomous Engineering Design: A Knowledge-Guided Multi-Agent Framework, Varun Kumar et.al., Paper: http://arxiv.org/abs/2511.03179
- 2026-01-05, Toward Auditable Neuro-Symbolic Reasoning in Pathology: SQL as an Explicit Trace of Evidence, Kewen Cao et.al., Paper: http://arxiv.org/abs/2601.01875
- 2026-01-27, Toward Architecture-Aware Evaluation Metrics for LLM Agents, Débora Souza et.al., Paper: http://arxiv.org/abs/2601.19583
- 2025-12-15, Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection, Juil Koo et.al., Paper: http://arxiv.org/abs/2512.13250
- 2026-01-20, Toward Agentic AI: Task-Oriented Communication for Hierarchical Planning of Long-Horizon Tasks, Sin-Yu Huang et.al., Paper: http://arxiv.org/abs/2601.13685
- 2025-09-17, TopoSizing: An LLM-aided Framework of Topology-based Understanding and Sizing for AMS Circuits, Ziming Wei et.al., Paper: http://arxiv.org/abs/2509.14169
- 2026-03-02, TopoCurate:Modeling Interaction Topology for Tool-Use Agent Training, Jinluan Yang et.al., Paper: http://arxiv.org/abs/2603.01714
- 2026-03-19, TopoChunker: Topology-Aware Agentic Document Chunking Framework, Xiaoyu Liu et.al., Paper: http://arxiv.org/abs/2603.18409
- 2026-01-29, ToolWeaver: Weaving Collaborative Semantics for Scalable Tool Use in Large Language Models, Bowen Fang et.al., Paper: http://arxiv.org/abs/2601.21947
- 2026-03-13, ToolTree: Efficient LLM Agent Tool Planning via Dual-Feedback Monte Carlo Tree Search and Bidirectional Pruning, Shuo Yang et.al., Paper: http://arxiv.org/abs/2603.12740
- 2025-10-22, ToolScope: Enhancing LLM Agent Tool Use through Tool Merging and Context-Aware Filtering, Marianne Menglin Liu et.al., Paper: http://arxiv.org/abs/2510.20036
- 2025-10-31, ToolScope: An Agentic Framework for Vision-Guided and Long-Horizon Tool Use, Mengjie Deng et.al., Paper: http://arxiv.org/abs/2510.27363
- 2025-09-18, ToolSample: Dual Dynamic Sampling Methods with Curriculum Learning for RL-based Tool Learning, Zihao Feng et.al., Paper: http://arxiv.org/abs/2509.14718
- 2026-01-15, ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback, Yutao Mou et.al., Paper: http://arxiv.org/abs/2601.10156
- 2025-10-16, ToolPRM: Fine-Grained Inference Scaling of Structured Outputs for Function Calling, Jianghao Lin et.al., Paper: http://arxiv.org/abs/2510.14703
- 2025-11-26, ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration, Hongjin Su et.al., Paper: http://arxiv.org/abs/2511.21689
- 2025-12-18, ToolForge: A Data Synthesis Pipeline for Multi-Hop Search without Real-World APIs, Hao Chen et.al., Paper: http://arxiv.org/abs/2512.16149
- 2026-03-14, ToolFlood: Beyond Selection – Hiding Valid Tools from LLM Agents via Semantic Covering, Hussein Jawad et.al., Paper: http://arxiv.org/abs/2603.13950
- 2025-10-19, ToolCritic: Detecting and Correcting Tool-Use Errors in Dialogue Systems, Hassan Hamad et.al., Paper: http://arxiv.org/abs/2510.17052
- 2026-01-13, ToolACE-MCP: Generalizing History-Aware Routing from MCP Tools to the Agent Web, Zhiyuan Yao et.al., Paper: http://arxiv.org/abs/2601.08276
- 2026-02-24, Tool-R0: Self-Evolving LLM Agents for Tool-Learning from Zero Data, Emre Can Acikgoz et.al., Paper: http://arxiv.org/abs/2602.21320
- 2026-01-08, Tool-MAD: A Multi-Agent Debate Framework for Fact Verification with Diverse Tool Augmentation and Adaptive Retrieval, Seyeon Jeong et.al., Paper: http://arxiv.org/abs/2601.04742
- 2025-12-22, Tool-Augmented Hybrid Ensemble Reasoning with Distillation for Bilingual Mathematical Problem Solving, Peiqing Lu et.al., Paper: http://arxiv.org/abs/2512.19093
- 2025-12-23, TongSIM: A General Platform for Simulating Intelligent Machines, Zhe Sun et.al., Paper: http://arxiv.org/abs/2512.20206
- 2025-10-21, Tokencake: A KV-Cache-centric Serving Framework for LLM-based Multi-Agent Applications, Zhuohang Bian et.al., Paper: http://arxiv.org/abs/2510.18586
- 2025-10-14, ToPolyAgent: AI Agents for Coarse-Grained Topological Polymer Simulations, Lijie Ding et.al., Paper: http://arxiv.org/abs/2510.12091
- 2025-10-16, To Infinity and Beyond: Tool-Use Unlocks Length Generalization in State Space Models, Eran Malach et.al., Paper: http://arxiv.org/abs/2510.14826
- 2026-02-10, Tiny Moves: Game-based Hypothesis Refinement, Agnieszka Dobrowolska et.al., Paper: http://arxiv.org/abs/2602.09801
- 2026-03-05, TimeWarp: Evaluating Web Agents by Revisiting the Past, Md Farhan Ishmam et.al., Paper: http://arxiv.org/abs/2603.04949
- 2026-01-30, TimeMachine-bench: A Benchmark for Evaluating Model Capabilities in Repository-Level Migration Tasks, Ryo Fujii et.al., Paper: http://arxiv.org/abs/2601.22597
- 2026-03-10, Time, Identity and Consciousness in Language Model Agents, Elija Perrier et.al., Paper: http://arxiv.org/abs/2603.09043
- 2025-11-17, Tight and Practical Privacy Auditing for Differentially Private In-Context Learning, Yuyang Xia et.al., Paper: http://arxiv.org/abs/2511.13502
- 2025-09-17, Ticket-Bench: A Kickoff for Multilingual and Regionalized Agent Evaluation, Thales Sales Almeida et.al., Paper: http://arxiv.org/abs/2509.14477
- 2025-09-22, Through the Lens of Human-Human Collaboration: A Configurable Research Platform for Exploring Human-Agent Collaboration, Bingsheng Yao et.al., Paper: http://arxiv.org/abs/2509.18008
- 2025-10-15, Three-Dimensional Simulation of the University of Hawai`i FEL Oscillator: Superradiant Emission and Cavity Desynchronization, Amir Weinberg et.al., Paper: http://arxiv.org/abs/2510.14061
- 2026-02-09, Three Lessons from Citizen-Centric Participatory AI Design, Eike Schneiders et.al., Paper: http://arxiv.org/abs/2602.08554
- 2025-10-14, Three Lenses on the AI Revolution: Risk, Transformation, Continuity, Masoud Makrehchi et.al., Paper: http://arxiv.org/abs/2510.12859
- 2026-02-26, Three AI-agents walk into a bar . . . . `Lord of the Flies’ tribalism emerges among smart AI-Agents, Dhwanil M. Mori et.al., Paper: http://arxiv.org/abs/2602.23093
- 2025-11-28, Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction, Bao Shu et.al., Paper: http://arxiv.org/abs/2511.23476
- 2026-03-05, Think, Then Verify: A Hypothesis-Verification Multi-Agent Framework for Long Video Understanding, Zheng Wang et.al., Paper: http://arxiv.org/abs/2603.04977
- 2026-04-01, Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding, Haibo Wang et.al., Paper: http://arxiv.org/abs/2604.00528
- 2026-02-12, Think like a Scientist: Physics-guided LLM Agent for Equation Discovery, Jianke Yang et.al., Paper: http://arxiv.org/abs/2602.12259
- 2025-12-02, Think in Parallel, Answer as One: Logit Averaging for Open-Ended Reasoning, Haonan Wang et.al., Paper: http://arxiv.org/abs/2512.02874
- 2026-02-13, Think Fast and Slow: Step-Level Cognitive Depth Adaptation for LLM Agents, Ruihan Yang et.al., Paper: http://arxiv.org/abs/2602.12662
- 2025-12-03, Thermalization from quenching in coupled oscillators, M. Harinarayanan et.al., Paper: http://arxiv.org/abs/2512.04028
- 2026-03-14, TheraAgent: Multi-Agent Framework with Self-Evolving Memory and Evidence-Calibrated Reasoning for PET Theranostics, Zhihao Chen et.al., Paper: http://arxiv.org/abs/2603.13676
- 2025-12-01, The partial K function, Jake P. Grainger et.al., Paper: http://arxiv.org/abs/2512.01823
- 2026-03-07, The Yerkes-Dodson Curve for AI Agents: Emergent Cooperation Under Environmental Pressure in Multi-Agent LLM Simulations, Ivan Pasichnyk et.al., Paper: http://arxiv.org/abs/2603.07360
- 2026-01-21, The Why Behind the Action: Unveiling Internal Drivers via Agentic Attribution, Chen Qian et.al., Paper: http://arxiv.org/abs/2601.15075
- 2026-03-31, The Triadic Cognitive Architecture: Bounding Autonomous Action via Spatio-Temporal and Epistemic Friction, Davide Di Gioia et.al., Paper: http://arxiv.org/abs/2603.30031
- 2025-11-24, The TEQUILA catalog of variables in TESS full-frame images: Differential photometry light curves from the first two years of observations, Bisi Bernard Ogunwale et.al., Paper: http://arxiv.org/abs/2511.19313
- 2026-03-25, The Specification Gap: Coordination Failure Under Partial Knowledge in Code Agents, Camilo Chacón Sartori et.al., Paper: http://arxiv.org/abs/2603.24284
- 2025-10-17, The Spark Effect: On Engineering Creative Diversity in Multi-Agent AI Systems, Alexander Doudkin et.al., Paper: http://arxiv.org/abs/2510.15568
- 2025-10-01, The Social Laboratory: A Psychometric Framework for Multi-Agent LLM Evaluation, Zarreen Reza et.al., Paper: http://arxiv.org/abs/2510.01295
- 2026-03-19, The Simplicity of the Hodge Bundle, Anand Patel et.al., Paper: http://arxiv.org/abs/2603.19052
- 2026-04-01, The Silicon Mirror: Dynamic Behavioral Gating for Anti-Sycophancy in LLM Agents, Harshee Jignesh Shah et.al., Paper: http://arxiv.org/abs/2604.00478
- 2025-12-24, The Silent Scholar Problem: A Probabilistic Framework for Breaking Epistemic Asymmetry in LLM Agents, Zan-Kai Chong et.al., Paper: http://arxiv.org/abs/2512.20884
- 2026-02-13, The Rise of AI Agent Communities: Large-Scale Analysis of Discourse and Interaction on Moltbook, Lingyao Li et.al., Paper: http://arxiv.org/abs/2602.12634
- 2025-11-13, The Resonance Principle: Empirical Evidence for Emergent Phase Synchronization in Human Causal Reasoning, Ahmed Gamal Eldin et.al., Paper: http://arxiv.org/abs/2511.10596
- 2025-11-26, The Quantum Network of Assets: A Non-Classical Framework for Market Correlation and Structural Risk, Hui Gong et.al., Paper: http://arxiv.org/abs/2511.21515
- 2026-01-14, The Promptware Kill Chain: How Prompt Injections Gradually Evolved Into a Multi-Step Malware, Ben Nassi et.al., Paper: http://arxiv.org/abs/2601.09625
- 2026-01-16, The Poisoned Apple Effect: Strategic Manipulation of Mediated Markets via Technology Expansion of AI Agents, Eilam Shapira et.al., Paper: http://arxiv.org/abs/2601.11496
- 2026-01-07, The Pneuma Project: Reifying Information Needs as Relational Schemas to Automate Discovery, Guide Preparation, and Align Data with Intent, Muhammad Imam Luthfi Balaka et.al., Paper: http://arxiv.org/abs/2601.03618
- 2026-01-06, The Path Ahead for Agentic AI: Challenges and Opportunities, Nadia Sibai et.al., Paper: http://arxiv.org/abs/2601.02749
- 2025-10-30, The Oversight Game: Learning to Cooperatively Balance an AI Agent’s Safety and Autonomy, William Overman et.al., Paper: http://arxiv.org/abs/2510.26752
- 2026-03-28, The Novelty Bottleneck: A Framework for Understanding Human Effort Scaling in AI-Assisted Work, Jacky Liang et.al., Paper: http://arxiv.org/abs/2603.27438
- 2025-09-01, The Need for Verification in AI-Driven Scientific Discovery, Cristina Cornelio et.al., Paper: http://arxiv.org/abs/2509.01398
- 2025-11-26, The Need for Benchmarks to Advance AI-Enabled Player Risk Detection in Gambling, Kasra Ghaharian et.al., Paper: http://arxiv.org/abs/2511.21658
- 2026-01-28, The Monotone Priority System: Foundations of Contract-Specific Sequencing, Naveen Durvasula et.al., Paper: http://arxiv.org/abs/2601.20783
- 2026-02-17, The Limits of Long-Context Reasoning in Automated Bug Fixing, Ravi Raju et.al., Paper: http://arxiv.org/abs/2602.16069
- 2026-02-06, The Law of Task-Achieving Body Motion: Axiomatizing Success of Robot Manipulation Actions, Malte Huerkamp et.al., Paper: http://arxiv.org/abs/2602.06572
- 2026-02-11, The Landscape of Prompt Injection Threats in LLM Agents: From Taxonomy to Analysis, Peiran Wang et.al., Paper: http://arxiv.org/abs/2602.10453
- 2025-09-02, The Landscape of Agentic Reinforcement Learning for LLMs: A Survey, Guibin Zhang et.al., Paper: http://arxiv.org/abs/2509.02547
- 2026-02-23, The LLMbda Calculus: AI Agents, Conversations, and Information Flow, Zac Garby et.al., Paper: http://arxiv.org/abs/2602.20064
- 2026-03-26, The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase, Yannick Roy et.al., Paper: http://arxiv.org/abs/2603.25697
- 2026-03-16, The Impact of AI-Assisted Development on Software Security: A Study of Gemini and Developer Experience, Nadine Jost et.al., Paper: http://arxiv.org/abs/2603.15298
- 2025-12-10, The Illusion of Rationality: Tacit Bias and Strategic Dominance in Frontier LLM Negotiation Games, Manuel S. Ríos et.al., Paper: http://arxiv.org/abs/2512.09254
- 2025-10-29, The Iceberg Index: Measuring Workforce Exposure Across the AI Economy, Ayush Chopra et.al., Paper: http://arxiv.org/abs/2510.25137
- 2025-09-23, The Heterogeneous Multi-Agent Challenge, Charles Dansereau et.al., Paper: http://arxiv.org/abs/2509.19512
- 2026-01-16, The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents, Ziyu Wang et.al., Paper: http://arxiv.org/abs/2601.11421
- 2025-10-30, The Geometry of Dialogue: Graphing Language Models to Reveal Synergistic Teams for Multi-Agent Collaboration, Kotaro Furuya et.al., Paper: http://arxiv.org/abs/2510.26352
- 2025-10-16, The Gatekeeper Knows Enough, Fikresilase Wondmeneh Abebayew et.al., Paper: http://arxiv.org/abs/2510.14881
- 2026-03-25, The Free-Market Algorithm: Self-Organizing Optimization for Open-Ended Complex Systems, Martin Jaraiz et.al., Paper: http://arxiv.org/abs/2603.24559
- 2026-01-06, The Fake Friend Dilemma: Trust and the Political Economy of Conversational AI, Jacob Erickson et.al., Paper: http://arxiv.org/abs/2601.03222
- 2025-10-30, The FM Agent, Annan Li et.al., Paper: http://arxiv.org/abs/2510.26144
- 2025-12-02, The Evolutionary Ecology of Software: Constraints, Innovation, and the AI Disruption, Sergi Valverde et.al., Paper: http://arxiv.org/abs/2512.02953
- 2026-03-24, The Evolution of Tool Use in LLM Agents: From Single-Tool Call to Multi-Tool Orchestration, Haoyuan Xu et.al., Paper: http://arxiv.org/abs/2603.22862
- 2025-09-26, The Emergence of Altruism in Large-Language-Model Agents Society, Haoyang Li et.al., Paper: http://arxiv.org/abs/2509.22537
- 2026-02-10, The Devil Behind Moltbook: Anthropic Safety is Always Vanishing in Self-Evolving AI Societies, Chenxu Wang et.al., Paper: http://arxiv.org/abs/2602.09877
- 2025-11-25, The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment, Ziheng Ouyang et.al., Paper: http://arxiv.org/abs/2511.20614
- 2026-01-12, The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents, Weihao Xuan et.al., Paper: http://arxiv.org/abs/2601.07264
- 2025-11-04, The Collaboration Gap, Tim R. Davidson et.al., Paper: http://arxiv.org/abs/2511.02687
- 2026-01-22, The Behavioral Fabric of LLM-Powered GUI Agents: Human Values and Interaction Outcomes, Simret Araya Gebreegziabher et.al., Paper: http://arxiv.org/abs/2601.16356
- 2026-02-27, The Auton Agentic AI Framework, Sheng Cao et.al., Paper: http://arxiv.org/abs/2602.23720
- 2025-12-08, The Agent Capability Problem: Predicting Solvability Through Information-Theoretic Bounds, Shahar Lutati et.al., Paper: http://arxiv.org/abs/2512.07631
- 2025-12-08, The Adoption and Usage of AI Agents: Early Evidence from Perplexity, Jeremy Yang et.al., Paper: http://arxiv.org/abs/2512.07828
- 2026-01-14, The AI Hippocampus: How Far are We From Human Memory?, Zixia Jia et.al., Paper: http://arxiv.org/abs/2601.09113
- 2025-08-25, The AI Data Scientist, Farkhad Akimov et.al., Paper: http://arxiv.org/abs/2508.18113
- 2025-12-25, The AI Committee: A Multi-Agent Framework for Automated Validation and Remediation of Web-Sourced Data, Sunith Vallabhaneni et.al., Paper: http://arxiv.org/abs/2512.21481
- 2026-02-19, The 2025 AI Agent Index: Documenting Technical and Safety Features of Deployed Agentic AI Systems, Leon Staufer et.al., Paper: http://arxiv.org/abs/2602.17753
- 2025-11-13, The $L_p$ -error rate for randomized quasi-Monte Carlo self-normalized importance sampling of unbounded integrands, Jiarui Du et.al., Paper: http://arxiv.org/abs/2511.10599
- 2026-03-20, Text-Based Personas for Simulating User Privacy Decisions, Kassem Fawaz et.al., Paper: http://arxiv.org/abs/2603.19791
- 2026-03-14, Testing with AI Agents: An Empirical Study of Test Generation Frequency, Quality, and Coverage, Suzuka Yoshimoto et.al., Paper: http://arxiv.org/abs/2603.13724
- 2025-11-06, Testing the Testers: Human-Driven Quality Assessment of Voice AI Testing Platforms, Miguel E. Andres et.al., Paper: http://arxiv.org/abs/2511.04133
- 2026-01-30, Test-Time Mixture of World Models for Embodied Agents in Dynamic Environments, Jinwoo Jang et.al., Paper: http://arxiv.org/abs/2601.22647
- 2025-10-16, Terrarium: Revisiting the Blackboard for Multi-Agent Safety, Privacy, and Security Studies, Mason Nakamura et.al., Paper: http://arxiv.org/abs/2510.14312
- 2026-03-11, Terminal Is All You Need: Design Properties for Human-AI Agent Collaboration, Alexandre De Masi et.al., Paper: http://arxiv.org/abs/2603.10664
- 2026-01-26, Temp-R1: A Unified Autonomous Agent for Complex Temporal KGQA via Reverse Curriculum Reinforcement Learning, Zhaoyan Gong et.al., Paper: http://arxiv.org/abs/2601.18296
- 2025-10-09, Team Xiaomi EV-AD VLA: Learning to Navigate Socially Through Proactive Risk Perception – Technical Report for IROS 2025 RoboSense Challenge Social Navigation Track, Erjia Xiao et.al., Paper: http://arxiv.org/abs/2510.07871
- 2025-12-03, Teaching Old Tokenizers New Words: Efficient Tokenizer Adaptation for Pre-trained Models, Taido Purason et.al., Paper: http://arxiv.org/abs/2512.03989
- 2026-01-21, Taxonomy-Aligned Risk Extraction from 10-K Filings with Autonomous Improvement Using LLMs, Rian Dolphin et.al., Paper: http://arxiv.org/abs/2601.15247
- 2025-12-05, Task-Specific Trust Evaluation for Multi-Hop Collaborator Selection via GNN-Aided Distributed Agentic AI, Botao Zhu et.al., Paper: http://arxiv.org/abs/2512.05788
- 2026-02-05, Task-Oriented Robot-Human Handovers on Legged Manipulators, Andreea Tulbure et.al., Paper: http://arxiv.org/abs/2602.05760
- 2026-03-24, Task-Aware Positioning for Improvisational Tasks in Mobile Construction Robots via an AI Agent with Multi-LMM Modules, Seongju Jang et.al., Paper: http://arxiv.org/abs/2603.22903
- 2026-03-11, Task-Aware Delegation Cues for LLM Agents, Xingrui Gu et.al., Paper: http://arxiv.org/abs/2603.11011
- 2026-01-09, Task Cascades for Efficient Unstructured Data Processing, Shreya Shankar et.al., Paper: http://arxiv.org/abs/2601.05536
- 2025-10-15, Tandem Training for Language Models, Robert West et.al., Paper: http://arxiv.org/abs/2510.13551
- 2025-09-09, Talking with Oompa Loompas: A novel framework for evaluating linguistic acquisition of LLM agents, Sankalp Tattwadarshi Swain et.al., Paper: http://arxiv.org/abs/2509.07389
- 2025-09-08, TalkToAgent: A Human-centric Explanation of Reinforcement Learning Agents with Large Language Models, Haechang Kim et.al., Paper: http://arxiv.org/abs/2509.04809
- 2025-11-18, Talk, Snap, Complain: Validation-Aware Multimodal Expert Framework for Fine-Grained Customer Grievances, Rishu Kumar Singh et.al., Paper: http://arxiv.org/abs/2511.14693
- 2025-10-12, Talk Less, Call Right: Enhancing Role-Play LLM Agents with Automatic Prompt Optimization and Role Prompting, Saksorn Ruangtanusak et.al., Paper: http://arxiv.org/abs/2509.00482
- 2025-09-12, Tackling One Health Risks: How Large Language Models are leveraged for Risk Negotiation and Consensus-building, Alexandra Fetsch et.al., Paper: http://arxiv.org/abs/2509.09906
- 2026-02-06, Table-as-Search: Formulate Long-Horizon Agentic Information Seeking as Table Completion, Tian Lan et.al., Paper: http://arxiv.org/abs/2602.06724
- 2026-02-11, TVCACHE: A Stateful Tool-Value Cache for Post-Training LLM Agents, Abhishek Vijaya Kumar et.al., Paper: http://arxiv.org/abs/2602.10986
- 2026-02-12, TSR: Trajectory-Search Rollouts for Multi-Turn RL of LLM Agents, Aladin Djuhera et.al., Paper: http://arxiv.org/abs/2602.11767
- 2026-03-17, TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas, Ai Jian et.al., Paper: http://arxiv.org/abs/2603.16448
- 2025-10-01, TOUCAN: Synthesizing 1.5M Tool-Agentic Data from Real-World MCP Environments, Zhangchen Xu et.al., Paper: http://arxiv.org/abs/2510.01179
- 2026-03-05, TML-Bench: Benchmark for Data Science Agents on Tabular ML Tasks, Mykola Pinchuk et.al., Paper: http://arxiv.org/abs/2603.05764
- 2026-02-02, TIDE: Trajectory-based Diagnostic Evaluation of Test-Time Improvement in LLM Agents, Hang Yan et.al., Paper: http://arxiv.org/abs/2602.02196
- 2026-03-06, THETA: A Textual Hybrid Embedding-based Topic Analysis Framework and AI Scientist Agent for Scalable Computational Social Science, Zhenke Duan et.al., Paper: http://arxiv.org/abs/2603.05972
- 2025-09-30, TENET: Leveraging Tests Beyond Validation for Code Generation, Yiran Hu et.al., Paper: http://arxiv.org/abs/2509.24148
- 2026-01-26, TEA-Bench: A Systematic Benchmarking of Tool-enhanced Emotional Support Dialogue Agent, Xingyu Sui et.al., Paper: http://arxiv.org/abs/2601.18700
- 2026-03-18, TDAD: Test-Driven Agentic Development - Reducing Code Regressions in AI Coding Agents via Graph-Based Impact Analysis, Pepe Alonso et.al., Paper: http://arxiv.org/abs/2603.17973
- 2025-12-29, TCEval: Using Thermal Comfort to Assess Cognitive and Perceptual Abilities of AI, Jingming Li et.al., Paper: http://arxiv.org/abs/2512.23217
- 2026-02-23, TAPE: Tool-Guided Adaptive Planning and Constrained Execution in Language Model Agents, Jongwon Jeong et.al., Paper: http://arxiv.org/abs/2602.19633
- 2025-11-07, TAMAS: Benchmarking Adversarial Risks in Multi-Agent LLM Systems, Ishan Kavathekar et.al., Paper: http://arxiv.org/abs/2511.05269
- 2025-10-27, TALM: Dynamic Tree-Structured Multi-Agent Framework with Long-Term Memory for Scalable Code Generation, Ming-Tung Shen et.al., Paper: http://arxiv.org/abs/2510.23010
- 2025-10-02, TACOS: Task Agnostic COordinator of a multi-drone System, Alessandro Nazzari et.al., Paper: http://arxiv.org/abs/2510.01869
- 2026-03-10, TA-Mem: Tool-Augmented Autonomous Memory Retrieval for LLM in Long-Term Conversational QA, Mengwei Yuan et.al., Paper: http://arxiv.org/abs/2603.09297
- 2026-03-06, T2Nav Algebraic Topology Aware Temporal Graph Memory and Loop Detection for ZeroShot Visual Navigation, Quang-Anh N. D. et.al., Paper: http://arxiv.org/abs/2603.06918
- 2025-12-19, Systemic Risks of Interacting AI, Paul Darius et.al., Paper: http://arxiv.org/abs/2512.17793
- 2025-12-09, Systematization of Knowledge: Security and Safety in the Model Context Protocol Ecosystem, Shiva Gaire et.al., Paper: http://arxiv.org/abs/2512.08290
- 2025-11-20, Systematically Deconstructing APVD Steganography and its Payload with a Unified Deep Learning Paradigm, Kabbo Jit Deb et.al., Paper: http://arxiv.org/abs/2511.16604
- 2025-12-23, Synthesizing Procedural Memory: Challenges and Architectures in Automated Workflow Generation, Nishant Gaurav et.al., Paper: http://arxiv.org/abs/2512.20278
- 2025-10-15, Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms, Shrey Pandit et.al., Paper: http://arxiv.org/abs/2510.13913
- 2025-11-20, Synthesis of Safety Specifications for Probabilistic Systems, Gaspard Ohlmann et.al., Paper: http://arxiv.org/abs/2511.16579
- 2025-12-17, SynthSeg-Agents: Multi-Agent Synthetic Data Generation for Zero-Shot Weakly Supervised Semantic Segmentation, Wangyu Wu et.al., Paper: http://arxiv.org/abs/2512.15310
- 2025-10-18, Synergizing chemical and AI communities for advancing laboratories of the future, Saejin Oh et.al., Paper: http://arxiv.org/abs/2510.16293
- 2026-03-20, Synergistic Perception and Generative Recomposition: A Multi-Agent Orchestration for Expert-Level Building Inspection, Hui Zhong et.al., Paper: http://arxiv.org/abs/2603.20143
- 2026-01-13, SwiftMem: Fast Agentic Memory via Query-aware Indexing, Anxin Tian et.al., Paper: http://arxiv.org/abs/2601.08160
- 2025-10-11, SwarmSys: Decentralized Swarm-Inspired Agents for Scalable and Adaptive Reasoning, Ruohao Li et.al., Paper: http://arxiv.org/abs/2510.10047
- 2025-10-13, SusBench: An Online Benchmark for Evaluating Dark Pattern Susceptibility of Computer-Use Agents, Longjie Guo et.al., Paper: http://arxiv.org/abs/2510.11035
- 2025-09-15, Survival at Any Cost? LLMs and the Choice Between Self-Preservation and Human Harm, Alireza Mohamadi et.al., Paper: http://arxiv.org/abs/2509.12190
- 2025-08-27, Survey of Specialized Large Language Model, Chenghan Yang et.al., Paper: http://arxiv.org/abs/2508.19667
- 2025-11-20, SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction, Guolin Huang et.al., Paper: http://arxiv.org/abs/2511.16635
- 2025-11-10, Surgical Agent Orchestration Platform for Voice-directed Patient Data Interaction, Hyeryun Park et.al., Paper: http://arxiv.org/abs/2511.07392
- 2026-03-31, SurgNavAR: An Augmented Reality Surgical Navigation Framework for Optical See-Through Head Mounted Displays, Abdullah Thabit et.al., Paper: http://arxiv.org/abs/2603.29990
- 2025-08-31, Supporting Our AI Overlords: Redesigning Data Systems to be Agent-First, Shu Liu et.al., Paper: http://arxiv.org/abs/2509.00997
- 2025-11-06, Supersymmetry Breaking with Fields, Strings and Branes, E. Dudas et.al., Paper: http://arxiv.org/abs/2511.04367
- 2026-03-07, SuperSkillsStack: Agency, Domain Knowledge, Imagination, and Taste in Human-AI Design Education, Qian Huang et.al., Paper: http://arxiv.org/abs/2603.07016
- 2026-02-18, Submodular Maximization under Supermodular Constraint: Greedy Guarantees, Ajitesh Srivastava et.al., Paper: http://arxiv.org/abs/2602.16240
- 2025-11-20, Subdivisions of lower Eulerian posets, Alan Stapledon et.al., Paper: http://arxiv.org/abs/2511.16608
- 2026-01-15, Structured Personality Control and Adaptation for LLM Agents, Jinpeng Wang et.al., Paper: http://arxiv.org/abs/2601.10025
- 2026-03-29, Structured Observation Language for Efficient and Generalizable Vision-Language Navigation, Daojie Peng et.al., Paper: http://arxiv.org/abs/2603.27577
- 2026-03-11, Structured Linked Data as a Memory Layer for Agent-Orchestrated Retrieval, Andrea Volpini et.al., Paper: http://arxiv.org/abs/2603.10700
- 2026-03-13, Structured Distillation for Personalized Agent Memory: 11x Token Reduction with Retrieval Preservation, Sydney Lewis et.al., Paper: http://arxiv.org/abs/2603.13017
- 2025-09-23, Structured Cognition for Behavioral Intelligence in Large Language Model Agents: Preliminary Study, Myung Ho Kim et.al., Paper: http://arxiv.org/abs/2510.05107
- 2026-01-05, Structural Representations for Cross-Attack Generalization in AI Agent Threat Detection, Vignesh Iyer et.al., Paper: http://arxiv.org/abs/2601.01723
- 2025-12-05, Strongly Coupled Quantum Forces, Yuval Grossman et.al., Paper: http://arxiv.org/abs/2512.05968
- 2025-10-14, Stronger Together: On-Policy Reinforcement Learning for Collaborative LLMs, Yujie Zhao et.al., Paper: http://arxiv.org/abs/2510.11062
- 2025-10-10, StreamingVLM: Real-Time Understanding for Infinite Video Streams, Ruyi Xu et.al., Paper: http://arxiv.org/abs/2510.09608
- 2026-03-23, StreamingClaw Technical Report, Jiawei Chen et.al., Paper: http://arxiv.org/abs/2603.22120
- 2025-10-07, Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents, Mingkang Zhu et.al., Paper: http://arxiv.org/abs/2510.06214
- 2025-09-12, Strategic Tradeoffs Between Humans and AI in Multi-Agent Bargaining, Crystal Qian et.al., Paper: http://arxiv.org/abs/2509.09071
- 2025-12-04, Strategic Self-Improvement for Competitive Agents in AI Labour Markets, Christopher Chiu et.al., Paper: http://arxiv.org/abs/2512.04988
- 2025-11-13, Strategic Opponent Modeling with Graph Neural Networks, Deep Reinforcement Learning and Probabilistic Topic Modeling, Georgios Chalkiadakis et.al., Paper: http://arxiv.org/abs/2511.10501
- 2026-03-23, Strategic Infrastructure Design via Multi-Agent Congestion Games with Joint Placement and Pricing, Niloofar Aminikalibar et.al., Paper: http://arxiv.org/abs/2603.21691
- 2025-11-07, Story Arena: A Multi-Agent Environment for Envisioning the Future of Software Engineering, Justin D. Weisz et.al., Paper: http://arxiv.org/abs/2511.05410
- 2025-10-28, StorageXTuner: An LLM Agent-Driven Automatic Tuning Framework for Heterogeneous Storage Systems, Qi Lin et.al., Paper: http://arxiv.org/abs/2510.25017
- 2025-10-02, StockBench: Can LLM Agents Trade Stocks Profitably In Real-world Markets?, Yanxu Chen et.al., Paper: http://arxiv.org/abs/2510.02209
- 2025-11-04, Stochastic Redistribution of Indistinguishable Items in Shared Habitation: A Multi-Agent Simulation Framework, Syed Haseeb Shah et.al., Paper: http://arxiv.org/abs/2511.02648
- 2025-11-26, Stochastic Optimal Control of Interacting Particle Systems in Hilbert Spaces and Applications, Filippo de Feo et.al., Paper: http://arxiv.org/abs/2511.21646
- 2026-03-13, Steve-Evolving: Open-World Embodied Self-Evolution via Fine-Grained Diagnosis and Dual-Track Knowledge Distillation, Zhengwei Xie et.al., Paper: http://arxiv.org/abs/2603.13131
- 2026-01-29, StepShield: When, Not Whether to Intervene on Rogue Agents, Gloria Felicia et.al., Paper: http://arxiv.org/abs/2601.22136
- 2025-12-23, Step-DeepResearch Technical Report, Chen Hu et.al., Paper: http://arxiv.org/abs/2512.20491
- 2026-02-11, Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters, Ailin Huang et.al., Paper: http://arxiv.org/abs/2602.10604
- 2026-02-09, SteerVLA: Steering Vision-Language-Action Models in Long-Tail Driving Scenarios, Tian Gao et.al., Paper: http://arxiv.org/abs/2602.08440
- 2025-10-15, Steer-MoE: Efficient Audio-Language Alignment with a Mixture-of-Experts Steering Module, Ruitao Feng et.al., Paper: http://arxiv.org/abs/2510.13558
- 2026-02-02, Statistical Learning Theory in Lean 4: Empirical Processes from Scratch, Yuanhe Zhang et.al., Paper: http://arxiv.org/abs/2602.02285
- 2026-02-09, Stateless Yet Not Forgetful: Implicit Memory as a Hidden Channel in LLMs, Ahmed Salem et.al., Paper: http://arxiv.org/abs/2602.08563
- 2025-10-29, Standardization of Psychiatric Diagnoses – Role of Fine-tuned LLM Consortium and OpenAI-gpt-oss Reasoning LLM Enabled Decision Support System, Eranga Bandara et.al., Paper: http://arxiv.org/abs/2510.25588
- 2026-01-07, Staged Voxel-Level Deep Reinforcement Learning for 3D Medical Image Segmentation with Noisy Annotations, Yuyang Fu et.al., Paper: http://arxiv.org/abs/2601.03875
- 2026-01-09, StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management, Ruizhe Zhang et.al., Paper: http://arxiv.org/abs/2601.05890
- 2026-02-19, Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs, Luke Huang et.al., Paper: http://arxiv.org/abs/2602.17616
- 2026-03-06, Stability-Guided Exploration for Diverse Motion Generation, Eckart Cobo-Briesewitz et.al., Paper: http://arxiv.org/abs/2603.06773
- 2026-03-09, SplitAgent: A Privacy-Preserving Distributed Architecture for Enterprise-Cloud Agent Collaboration, Jianshu She et.al., Paper: http://arxiv.org/abs/2603.08221
- 2025-10-08, Spiral of Silence in Large Language Model Agents, Mingze Zhong et.al., Paper: http://arxiv.org/abs/2510.02360
- 2026-03-13, Spend Less, Reason Better: Budget-Aware Value Tree Search for LLM Agents, Yushu Li et.al., Paper: http://arxiv.org/abs/2603.12634
- 2025-11-06, Speed at the Cost of Quality? The Impact of LLM Agent Assistance on Software Development, Hao He et.al., Paper: http://arxiv.org/abs/2511.04427
- 2026-01-14, Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception, Zhen Wan et.al., Paper: http://arxiv.org/abs/2601.09413
- 2026-03-03, Speech recognition assisted by large language models to command software orally – Application to an augmented and virtual reality web app for immersive molecular graphics, Fabio Cortes Rodriguez et.al., Paper: http://arxiv.org/abs/2603.02901
- 2026-03-03, SpecLoop: An Agentic RTL-to-Specification Framework with Formal Verification Feedback Loop, Fu-Chieh Chang et.al., Paper: http://arxiv.org/abs/2603.02895
- 2025-12-15, SpeakRL: Synergizing Reasoning, Speaking, and Acting in Language Models with Reinforcement Learning, Emre Can Acikgoz et.al., Paper: http://arxiv.org/abs/2512.13159
- 2026-03-17, Speak, Segment, Track, Navigate: An Interactive System for Video-Guided Skull-Base Surgery, Jecia Z. Y. Mao et.al., Paper: http://arxiv.org/abs/2603.16024
- 2025-12-26, SpatialBench: Can Agents Analyze Real-World Spatial Biology Data?, Kenny Workman et.al., Paper: http://arxiv.org/abs/2512.21907
- 2026-01-23, Spatial-Agent: Agentic Geo-spatial Reasoning with Scientific Core Concepts, Riyang Bao et.al., Paper: http://arxiv.org/abs/2601.16965
- 2026-03-06, Spatial Colour Mixing Illusions as a Perception Stress Test for Vision-Language Models, Nicoleta-Nina Basoc et.al., Paper: http://arxiv.org/abs/2603.06141
- 2025-12-03, SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL, Siyi Chen et.al., Paper: http://arxiv.org/abs/2512.04069
- 2026-01-01, Space Debris Removal using Nano-Satellites controlled by Low-Power Autonomous Agents, Dennis Christmann et.al., Paper: http://arxiv.org/abs/2601.00465
- 2026-03-14, Sovereign-OS: A Charter-Governed Operating System for Autonomous AI Agents with Verifiable Fiscal Discipline, Aojie Yuan et.al., Paper: http://arxiv.org/abs/2603.14011
- 2026-02-16, Sovereign Agents: Towards Infrastructural Sovereignty and Diffused Accountability in Decentralized AI, Botao Amber Hu et.al., Paper: http://arxiv.org/abs/2602.14951
- 2025-09-26, Solving the Granularity Mismatch: Hierarchical Preference Learning for Long-Horizon LLM Agents, Heyang Gao et.al., Paper: http://arxiv.org/abs/2510.03253
- 2026-01-30, SolAgent: A Specialized Multi-Agent Framework for Solidity Code Generation, Wei Chen et.al., Paper: http://arxiv.org/abs/2601.23009
- 2025-10-24, Soft Instruction De-escalation Defense, Nils Philipp Walter et.al., Paper: http://arxiv.org/abs/2510.21057
- 2026-02-16, Socially-Weighted Alignment: A Game-Theoretic Framework for Multi-Agent LLM Systems, Furkan Mumcu et.al., Paper: http://arxiv.org/abs/2602.14471
- 2025-10-21, Socialized Learning and Emergent Behaviors in Multi-Agent Systems based on Multimodal Large Language Models, Sureyya Akin et.al., Paper: http://arxiv.org/abs/2510.18515
- 2026-03-12, Social, Legal, Ethical, Empathetic and Cultural Norm Operationalisation for AI Agents, Radu Calinescu et.al., Paper: http://arxiv.org/abs/2603.11864
- 2025-10-01, Social Welfare Function Leaderboard: When LLM Agents Allocate Social Welfare, Zhengliang Shi et.al., Paper: http://arxiv.org/abs/2510.01164
- 2026-03-17, Social Simulacra in the Wild: AI Agent Communities on Moltbook, Agam Goyal et.al., Paper: http://arxiv.org/abs/2603.16128
- 2026-03-26, Social Hippocampus Memory Learning, Liping Yi et.al., Paper: http://arxiv.org/abs/2603.25614
- 2025-10-06, Social Agent: Mastering Dyadic Nonverbal Behavior Generation via Conversational LLM Agents, Zeyi Zhang et.al., Paper: http://arxiv.org/abs/2510.04637
- 2025-10-02, SoK: Measuring What Matters for Closed-Loop Security Agents, Mudita Khurana et.al., Paper: http://arxiv.org/abs/2510.01654
- 2026-02-24, SoK: Agentic Skills – Beyond Tool Use in LLM Agents, Yanna Jiang et.al., Paper: http://arxiv.org/abs/2602.20867
- 2026-01-20, Small Models, Big Impact: Tool-Augmented AI Agents for Wireless Network Planning, Yongqiang Zhang et.al., Paper: http://arxiv.org/abs/2601.13843
- 2025-12-23, SlideTailor: Personalized Presentation Slide Generation for Scientific Papers, Wenzheng Zeng et.al., Paper: http://arxiv.org/abs/2512.20292
- 2025-10-30, SlideAgent: Hierarchical Agentic Framework for Multi-Page Visual Document Understanding, Yiqiao Jin et.al., Paper: http://arxiv.org/abs/2510.26615
- 2026-02-13, SkillsBench: Benchmarking How Well Agent Skills Work Across Diverse Tasks, Xiangyi Li et.al., Paper: http://arxiv.org/abs/2602.12670
- 2026-03-20, Skilled AI Agents for Embedded and IoT Systems Development, Yiming Li et.al., Paper: http://arxiv.org/abs/2603.19583
- 2026-03-31, SkillReducer: Optimizing LLM Agent Skills for Token Efficiency, Yudong Gao et.al., Paper: http://arxiv.org/abs/2603.29919
- 2026-03-22, SkillProbe: Security Auditing for Emerging Agent Skill Marketplaces via Multi-Agent Collaboration, Zihan Guo et.al., Paper: http://arxiv.org/abs/2603.21019
- 2026-02-23, Skill-Inject: Measuring Agent Vulnerability to Skill File Attacks, David Schmotz et.al., Paper: http://arxiv.org/abs/2602.20156
- 2026-03-14, Six Interventions for the Responsible and Ethical Implementation of Medical AI Agents, Tom Bisson et.al., Paper: http://arxiv.org/abs/2603.13743
- 2025-09-27, Situational Awareness for Safe and Robust Multi-Agent Interactions Under Uncertainty, Benjamin Alcorn et.al., Paper: http://arxiv.org/abs/2509.23425
- 2025-10-30, Simulating and Experimenting with Social Media Mobilization Using LLM Agents, Sadegh Shirani et.al., Paper: http://arxiv.org/abs/2510.26494
- 2025-10-11, Simulating Viva Voce Examinations to Evaluate Clinical Reasoning in Large Language Models, Christopher Chiu et.al., Paper: http://arxiv.org/abs/2510.10278
- 2025-10-09, Simulating Teams with LLM Agents: Interactive 2D Environments for Studying Human-AI Dynamics, Mohammed Almutairi et.al., Paper: http://arxiv.org/abs/2510.08242
- 2025-09-23, Simulating Online Social Media Conversations on Controversial Topics Using AI Agents Calibrated on Real-World Data, Elisa Composta et.al., Paper: http://arxiv.org/abs/2509.18985
- 2025-11-13, Simulating Misinformation Propagation in Social Networks using Large Language Models, Raj Gaurav Maurya et.al., Paper: http://arxiv.org/abs/2511.10384
- 2025-11-21, Simulating Disky Broad Line Region Reverberation, Mary Ogborn et.al., Paper: http://arxiv.org/abs/2511.17334
- 2025-09-29, SimuHome: A Temporal- and Environment-Aware Benchmark for Smart Home LLM Agents, Gyuhyeon Seo et.al., Paper: http://arxiv.org/abs/2509.24282
- 2026-01-05, SimpleMem: Efficient Lifelong Memory for LLM Agents, Jiaqi Liu et.al., Paper: http://arxiv.org/abs/2601.02553
- 2026-03-31, SimMOF: AI agent for Automated MOF Simulations, Jaewoong Lee et.al., Paper: http://arxiv.org/abs/2603.29152
- 2025-09-21, SignalLLM: A General-Purpose LLM Agent Framework for Automated Signal Processing, Junlong Ke et.al., Paper: http://arxiv.org/abs/2509.17197
- 2026-03-19, SignAgent: Agentic LLMs for Linguistically-Grounded Sign Language Annotation and Dataset Curation, Oliver Cory et.al., Paper: http://arxiv.org/abs/2603.19059
- 2026-01-30, Sifting the Noise: A Comparative Study of LLM Agents in Vulnerability False Positive Filtering, Yunpeng Xiong et.al., Paper: http://arxiv.org/abs/2601.22952
- 2026-03-31, Should I State or Should I Show? Aligning AI with Human Preferences, Keaton Ellis et.al., Paper: http://arxiv.org/abs/2603.29317
- 2025-10-22, SheetBrain: A Neuro-Symbolic Agent for Accurate Reasoning over Complex and Large Spreadsheets, Ziwei Wang et.al., Paper: http://arxiv.org/abs/2510.19247
- 2025-11-06, Shared Spatial Memory Through Predictive Coding, Zhengru Fang et.al., Paper: http://arxiv.org/abs/2511.04235
- 2025-10-20, ShapeCraft: LLM Agents for Structured, Textured and Interactive 3D Modeling, Shuyuan Zhang et.al., Paper: http://arxiv.org/abs/2510.17603
- 2025-12-03, Shadow geometry of Kerr MOG naked singularity and analysis of accretion disk luminosity, Saira Yasmin et.al., Paper: http://arxiv.org/abs/2512.03896
- 2025-10-15, Sequential Quantum Measurements and the Instrumental Group Algebra, Christopher S. Jackson et.al., Paper: http://arxiv.org/abs/2510.13980
- 2025-11-10, Sequential Causal Normal Form Games: Theory, Computation, and Strategic Signaling, Dennis Thumm et.al., Paper: http://arxiv.org/abs/2511.06934
- 2025-10-21, SentinelNet: Safeguarding Multi-Agent Collaboration Through Credit-Based Dynamic Threat Detection, Yang Feng et.al., Paper: http://arxiv.org/abs/2510.16219
- 2026-03-18, Sensi: Learn One Thing at a Time – Curriculum-Based Test-Time Learning for LLM Game Agents, Mohsen Arjmandi et.al., Paper: http://arxiv.org/abs/2603.17683
- 2025-12-30, SenseNova-MARS: Empowering Multimodal Agentic Reasoning and Search via Reinforcement Learning, Yong Xien Chng et.al., Paper: http://arxiv.org/abs/2512.24330
- 2026-01-12, Semisimple algebraic groups over real closed fields, Raphael Appenzeller et.al., Paper: http://arxiv.org/abs/2601.07732
- 2025-12-03, Semi-Markov Decision Process Framework for Age of Incorrect Information Minimization, Ismail Cosandal et.al., Paper: http://arxiv.org/abs/2512.04077
- 2024-01-23, Semantic Web Technology for Agent Communication Protocols, Idoia Berges et.al., Paper: http://arxiv.org/abs/2401.11841
- 2026-01-13, Semantic Laundering in AI Agent Architectures: Why Tool Boundaries Do Not Confer Epistemic Warrant, Oleg Romanchuk et.al., Paper: http://arxiv.org/abs/2601.08333
- 2026-03-13, Semantic Invariance in Agentic AI, I. de Zarzà et.al., Paper: http://arxiv.org/abs/2603.13173
- 2025-10-20, Semantic Intelligence: A Bio-Inspired Cognitive Framework for Embodied Agents, Wenbing Tang et.al., Paper: http://arxiv.org/abs/2510.17129
- 2026-03-20, Semantic Audio-Visual Navigation in Continuous Environments, Yichen Zeng et.al., Paper: http://arxiv.org/abs/2603.19660
- 2026-03-13, SemRep: Generative Code Representation Learning with Code Transformations, Weichen Li et.al., Paper: http://arxiv.org/abs/2603.13640
- 2025-10-17, Self-evolving expertise in complex non-verifiable subject domains: dialogue as implicit meta-RL, Richard M. Bailey et.al., Paper: http://arxiv.org/abs/2510.15772
- 2026-03-28, Self-evolving AI agents for protein discovery and directed evolution, Yang Tan et.al., Paper: http://arxiv.org/abs/2603.27303
- 2025-09-12, Self-Supervised Goal-Reaching Results in Multi-Agent Cooperation and Exploration, Chirayu Nimonkar et.al., Paper: http://arxiv.org/abs/2509.10656
- 2025-10-09, Self-Improving LLM Agents at Test-Time, Emre Can Acikgoz et.al., Paper: http://arxiv.org/abs/2510.07841
- 2026-02-10, Self-Evolving Recommendation System: End-To-End Autonomous Model Optimization With LLM Agents, Haochen Wang et.al., Paper: http://arxiv.org/abs/2602.10226
- 2026-03-25, Self-Evolving Multi-Agent Framework for Efficient Decision Making in Real-Time Strategy Scenarios, Li Ma et.al., Paper: http://arxiv.org/abs/2603.23875
- 2026-01-29, Self-Compression of Chain-of-Thought via Multi-Agent Reinforcement Learning, Yiqun Chen et.al., Paper: http://arxiv.org/abs/2601.21919
- 2025-10-01, Seeing through Uncertainty: Robust Task-Oriented Optimization in Visual Navigation, Yiyuan Pan et.al., Paper: http://arxiv.org/abs/2510.00441
- 2026-03-04, Seeing as Experts Do: A Knowledge-Augmented Agent for Open-Set Fine-Grained Visual Understanding, Junhan Chen et.al., Paper: http://arxiv.org/abs/2603.03762
- 2025-11-26, Seeing Twice: How Side-by-Side T2I Comparison Changes Auditing Strategies, Matheus Kunzler Maldaner et.al., Paper: http://arxiv.org/abs/2511.21547
- 2025-10-22, Seed3D 1.0: From Images to High-Fidelity Simulation-Ready 3D Assets, Jiashi Feng et.al., Paper: http://arxiv.org/abs/2510.19944
- 2026-03-21, Seed1.8 Model Card: Towards Generalized Real-World Agency, Bytedance Seed et.al., Paper: http://arxiv.org/abs/2603.20633
- 2026-02-06, SeeUPO: Sequence-Level Agentic-RL with Convergence Guarantees, Tianyi Hu et.al., Paper: http://arxiv.org/abs/2602.06554
- 2025-12-09, See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm, Haoyu Zhao et.al., Paper: http://arxiv.org/abs/2512.08629
- 2026-03-25, See, Remember, Explore: A Benchmark and Baselines for Streaming Spatial Reasoning, Yuxi Wei et.al., Paper: http://arxiv.org/abs/2603.23864
- 2026-02-11, See, Plan, Snap: Evaluating Multimodal GUI Agents in Scratch, Xingyi Zhang et.al., Paper: http://arxiv.org/abs/2602.10814
- 2026-03-19, Security, privacy, and agentic AI in a regulatory view: From definitions and distinctions to provisions and reflections, Shiliang Zhang et.al., Paper: http://arxiv.org/abs/2603.18914
- 2026-01-01, Security in the Age of AI Teammates: An Empirical Study of Agentic Pull Requests on GitHub, Mohammed Latif Siddiq et.al., Paper: http://arxiv.org/abs/2601.00477
- 2026-03-19, Security awareness in LLM agents: the NDAI zone case, Enrico Bottazzi et.al., Paper: http://arxiv.org/abs/2603.19011
- 2026-02-23, Security Risks of AI Agents Hiring Humans: An Empirical Marketplace Study, Pulak Mehta et.al., Paper: http://arxiv.org/abs/2602.19514
- 2026-03-12, Security Considerations for Artificial Intelligence Agents, Ninghui Li et.al., Paper: http://arxiv.org/abs/2603.12230
- 2025-11-05, Security Analysis of Agentic AI Communication Protocols: A Comparative Evaluation, Yedidel Louck et.al., Paper: http://arxiv.org/abs/2511.03841
- 2025-10-24, Securing AI Agent Execution, Christoph Bühler et.al., Paper: http://arxiv.org/abs/2510.21236
- 2025-09-18, SecureFixAgent: A Hybrid LLM Agent for Automated Python Static Vulnerability Repair, Jugal Gajjar et.al., Paper: http://arxiv.org/abs/2509.16275
- 2026-02-16, Secure and Energy-Efficient Wireless Agentic AI Networks, Yuanyan Song et.al., Paper: http://arxiv.org/abs/2602.15212
- 2025-08-27, Secure Multi-LLM Agentic AI and Agentification for Edge General Intelligence by Zero-Trust: A Survey, Yinqiu Liu et.al., Paper: http://arxiv.org/abs/2508.19870
- 2026-04-02, Seclens: Role-specific Evaluation of LLM’s for security vulnerablity detection, Subho Halder et.al., Paper: http://arxiv.org/abs/2604.01637
- 2025-10-21, Search Self-play: Pushing the Frontier of Agent Capability without Supervision, Hongliang Lu et.al., Paper: http://arxiv.org/abs/2510.18821
- 2026-03-31, SciVisAgentBench: A Benchmark for Evaluating Scientific Data Analysis and Visualization Agents, Kuangshi Ai et.al., Paper: http://arxiv.org/abs/2603.29139
- 2025-09-12, SciML Agents: Write the Solver, Not the Solution, Saarth Gaonkar et.al., Paper: http://arxiv.org/abs/2509.09936
- 2026-02-13, SciAgentGym: Benchmarking Multi-Step Scientific Tool-use in LLM Agents, Yujiong Shen et.al., Paper: http://arxiv.org/abs/2602.12984
- 2026-03-29, Sci-Mind: Cognitively-Inspired Adversarial Debate for Autonomous Mathematical Modeling, Ruiying Sun et.al., Paper: http://arxiv.org/abs/2603.27584
- 2026-01-30, ScholarPeer: A Context-Aware Multi-Agent Framework for Automated Peer Review, Palash Goyal et.al., Paper: http://arxiv.org/abs/2601.22638
- 2026-03-31, SceneTeract: Agentic Functional Affordances and VLM Grounding in 3D Scenes, Léopold Maillard et.al., Paper: http://arxiv.org/abs/2603.29798
- 2025-12-10, Scene-agnostic Hierarchical Bimanual Task Planning via Visual Affordance Reasoning, Kwang Bin Lee et.al., Paper: http://arxiv.org/abs/2512.09310
- 2026-02-13, Scaling Web Agent Training through Automatic Data Generation and Fine-grained Evaluation, Lajanugen Logeswaran et.al., Paper: http://arxiv.org/abs/2602.12544
- 2025-10-13, Scaling Long-Horizon LLM Agent via Context-Folding, Weiwei Sun et.al., Paper: http://arxiv.org/abs/2510.11967
- 2025-12-24, Scaling Laws for Economic Productivity: Experimental Evidence in LLM-Assisted Consulting, Data Analyst, and Management Tasks, Ali Merali et.al., Paper: http://arxiv.org/abs/2512.21316
- 2025-10-08, Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management, Miao Lu et.al., Paper: http://arxiv.org/abs/2510.06727
- 2025-10-17, Scaling Beyond Context: A Survey of Multimodal Retrieval-Augmented Generation for Document Understanding, Sensen Gao et.al., Paper: http://arxiv.org/abs/2510.15253
- 2025-11-05, Scaling Agent Learning via Experience Synthesis, Zhaorun Chen et.al., Paper: http://arxiv.org/abs/2511.03773
- 2026-02-06, ScaleEnv: Scaling Environment Synthesis from Scratch for Generalist Interactive Tool-Use Agent Training, Dunwei Tu et.al., Paper: http://arxiv.org/abs/2602.06820
- 2026-03-21, ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework, Guanzhou Chen et.al., Paper: http://arxiv.org/abs/2603.20644
- 2025-11-17, Scalable Iterative Algorithm for Solving Optimal Transmission Switching with De-energization, Benoît Jeanson et.al., Paper: http://arxiv.org/abs/2511.13662
- 2025-10-10, Safety Game: Balancing Safe and Informative Conversations with Blackbox Agentic AI using LP Solvers, Tuan Nguyen et.al., Paper: http://arxiv.org/abs/2510.09330
- 2026-03-29, Safer Builders, Risky Maintainers: A Comparative Study of Breaking Changes in Human vs Agentic PRs, K M Ferdous et.al., Paper: http://arxiv.org/abs/2603.27524
- 2025-09-30, SafeMind: Benchmarking and Mitigating Safety Risks in Embodied LLM Agents, Ruolin Chen et.al., Paper: http://arxiv.org/abs/2509.25885
- 2026-03-26, SafeGuard ASF: SR Agentic Humanoid Robot System for Autonomous Industrial Safety, Thanh Nguyen Canh et.al., Paper: http://arxiv.org/abs/2603.25353
- 2025-10-20, SafeCoop: Unravelling Full Stack Safety in Agentic Collaborative Driving, Xiangbo Gao et.al., Paper: http://arxiv.org/abs/2510.18123
- 2026-03-11, Safe and Scalable Web Agent Learning via Recreated Websites, Hyungjoo Chae et.al., Paper: http://arxiv.org/abs/2603.10505
- 2025-11-13, Safe Planning in Interactive Environments via Iterative Policy Updates and Adversarially Robust Conformal Prediction, Omid Mirzaeedodangeh et.al., Paper: http://arxiv.org/abs/2511.10586
- 2026-01-13, Safe Heterogeneous Multi-Agent RL with Communication Regularization for Coordinated Target Acquisition, Gabriele Calzolari et.al., Paper: http://arxiv.org/abs/2601.08327
- 2025-12-15, Safe Control of Multi-Agent Systems with Minimal Communication, Mo Yang et.al., Paper: http://arxiv.org/abs/2512.13021
- 2026-03-03, Saarthi for AGI: Towards Domain-Specific General Intelligence for Formal Verification, Aman Kumar et.al., Paper: http://arxiv.org/abs/2603.03175
- 2026-01-06, SYNAPSE: Empowering LLM Agents with Episodic-Semantic Memory via Spreading Activation, Hanqi Jiang et.al., Paper: http://arxiv.org/abs/2601.02744
- 2026-01-30, SYMPHONY: Synergistic Multi-agent Planning with Heterogeneous Language Model Assembly, Wei Zhu et.al., Paper: http://arxiv.org/abs/2601.22623
- 2025-12-11, SWEnergy: An Empirical Study on Energy Efficiency in Agentic Issue Resolution Frameworks with SLMs, Arihant Tripathy et.al., Paper: http://arxiv.org/abs/2512.09543
- 2026-02-02, SWE-Universe: Scale Real-World Verifiable Environments to Millions, Mouxiang Chen et.al., Paper: http://arxiv.org/abs/2602.02361
- 2026-01-20, SWE-Tester: Training Open-Source LLMs for Issue Reproduction in Real-World Repositories, Aditya Bharat Soni et.al., Paper: http://arxiv.org/abs/2601.13713
- 2026-03-16, SWE-Skills-Bench: Do Agent Skills Actually Help in Real-World Software Engineering?, Tingxu Han et.al., Paper: http://arxiv.org/abs/2603.15401
- 2026-01-29, SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents, Yifeng Ding et.al., Paper: http://arxiv.org/abs/2601.22129
- 2025-12-26, SWE-RM: Execution-free Feedback For Software Engineering Agents, KaShun Shum et.al., Paper: http://arxiv.org/abs/2512.21919
- 2025-09-18, SWE-QA: Can Language Models Answer Repository-level Code Questions?, Weihan Peng et.al., Paper: http://arxiv.org/abs/2509.14635
- 2026-03-17, SWE-QA-Pro: A Representative Benchmark and Scalable Training Recipe for Repository-Level Code Understanding, Songcheng Cai et.al., Paper: http://arxiv.org/abs/2603.16124
- 2026-01-23, SWE-Pruner: Self-Adaptive Context Pruning for Coding Agents, Yuhang Wang et.al., Paper: http://arxiv.org/abs/2601.16746
- 2025-11-07, SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models, Jingxuan Xu et.al., Paper: http://arxiv.org/abs/2511.05459
- 2025-10-16, SVAG-Bench: A Large-Scale Benchmark for Multi-Instance Spatio-temporal Video Action Grounding, Tanveer Hannan et.al., Paper: http://arxiv.org/abs/2510.13016
- 2026-03-05, STRUCTUREDAGENT: Planning with AND/OR Trees for Long-Horizon Web Tasks, ELita Lobo et.al., Paper: http://arxiv.org/abs/2603.05294
- 2025-10-15, STEMS: Spatial-Temporal Enhanced Safe Multi-Agent Coordination for Building Energy Management, Huiliang Zhang et.al., Paper: http://arxiv.org/abs/2510.14112
- 2026-01-09, STELP: Secure Transpilation and Execution of LLM-Generated Programs, Swapnil Shinde et.al., Paper: http://arxiv.org/abs/2601.05467
- 2026-01-15, STEAMROLLER: A Multi-Agent System for Inclusive Automatic Speech Recognition for People who Stutter, Ziqi Xu et.al., Paper: http://arxiv.org/abs/2601.10223
- 2025-10-19, STARK: Strategic Team of Agents for Refining Kernels, Juncheng Dong et.al., Paper: http://arxiv.org/abs/2510.16996
- 2026-02-12, STAR : Bridging Statistical and Agentic Reasoning for Large Model Performance Prediction, Xiaoxiao Wang et.al., Paper: http://arxiv.org/abs/2602.12143
- 2025-09-30, STAC: When Innocent Tools Form Dangerous Chains to Jailbreak LLM Agents, Jing-Jing Li et.al., Paper: http://arxiv.org/abs/2509.25624
- 2026-02-16, ST-EVO: Towards Generative Spatio-Temporal Evolution of Multi-Agent Communication Topologies, Xingjian Wu et.al., Paper: http://arxiv.org/abs/2602.14681
- 2025-11-14, SRLF: An Agent-Driven Set-Wise Reflective Learning Framework for Sequential Recommendation, Jiahao Wang et.al., Paper: http://arxiv.org/abs/2511.11370
- 2025-11-21, SRA-CP: Spontaneous Risk-Aware Selective Cooperative Perception, Jiaxi Liu et.al., Paper: http://arxiv.org/abs/2511.17461
- 2026-03-29, SPREAD: Spatial-Physical REasoning via geometry Aware Diffusion, Minzhang Li et.al., Paper: http://arxiv.org/abs/2603.27573
- 2025-12-24, SPOT!: Map-Guided LLM Agent for Unsupervised Multi-CCTV Dynamic Object Tracking, Yujin Noh et.al., Paper: http://arxiv.org/abs/2512.20975
- 2025-10-14, SPORTS: Simultaneous Panoptic Odometry, Rendering, Tracking and Segmentation for Urban Scenes Understanding, Zhiliu Yang et.al., Paper: http://arxiv.org/abs/2510.12749
- 2025-12-29, SPIRAL: Symbolic LLM Planning via Grounded and Reflective Search, Yifan Zhang et.al., Paper: http://arxiv.org/abs/2512.23167
- 2026-03-09, SPD-RAG: Sub-Agent Per Document Retrieval-Augmented Generation, Yagiz Can Akay et.al., Paper: http://arxiv.org/abs/2603.08329
- 2026-03-19, SODIUM: From Open Web Data to Queryable Databases, Chuxuan Hu et.al., Paper: http://arxiv.org/abs/2603.18447
- 2025-10-21, SOCIA-Nabla: Textual Gradient Meets Multi-Agent Orchestration for Automated Simulator Generation, Yuncheng Hua et.al., Paper: http://arxiv.org/abs/2510.18551
- 2026-03-16, SKILLS: Structured Knowledge Injection for LLM-Driven Telecommunications Operations, Ivo Brett et.al., Paper: http://arxiv.org/abs/2603.15372
- 2025-12-08, SIT-Graph: State Integrated Tool Graph for Multi-Turn Agents, Sijia Li et.al., Paper: http://arxiv.org/abs/2512.07287
- 2025-10-30, SIRAJ: Diverse and Efficient Red-Teaming for LLM Agents via Distilled Structured Reasoning, Kaiwen Zhou et.al., Paper: http://arxiv.org/abs/2510.26037
- 2025-12-04, SIMA 2: A Generalist Embodied Agent for Virtual Worlds, SIMA team et.al., Paper: http://arxiv.org/abs/2512.04797
- 2026-02-02, SIDiffAgent: Self-Improving Diffusion Agent, Shivank Garg et.al., Paper: http://arxiv.org/abs/2602.02051
- 2025-10-08, SID: Multi-LLM Debate Driven by Self Signals, Xuhang Chen et.al., Paper: http://arxiv.org/abs/2510.06843
- 2025-10-17, SIADAFIX: issue description response for adaptive program repair, Xin Cao et.al., Paper: http://arxiv.org/abs/2510.16059
- 2025-10-27, SI-Bench: Benchmarking Social Intelligence of Large Language Models in Human-to-Human Conversations, Shuai Huang et.al., Paper: http://arxiv.org/abs/2510.23182
- 2025-10-17, SHARE: Scene-Human Aligned Reconstruction, Joshua Li et.al., Paper: http://arxiv.org/abs/2510.15342
- 2026-02-27, SGAgent: Suggestion-Guided LLM-Based Multi-Agent Framework for Repository-Level Software Repair, Quanjun Zhang et.al., Paper: http://arxiv.org/abs/2602.23647
- 2026-03-26, SEVerA: Verified Synthesis of Self-Evolving Agents, Debangshu Banerjee et.al., Paper: http://arxiv.org/abs/2603.25111
- 2026-01-28, SERA: Soft-Verified Efficient Repository Agents, Ethan Shen et.al., Paper: http://arxiv.org/abs/2601.20789
- 2025-10-14, SENTINEL: A Multi-Level Formal Framework for Safety Evaluation of LLM-based Embodied Agents, Simon Sinong Zhan et.al., Paper: http://arxiv.org/abs/2510.12985
- 2026-02-25, SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards, Dengjia Zhang et.al., Paper: http://arxiv.org/abs/2602.21158
- 2026-03-02, SEED-SET: Scalable Evolving Experimental Design for System-level Ethical Testing, Anjali Parashar et.al., Paper: http://arxiv.org/abs/2603.01630
- 2026-03-18, SEAL-Tag: Self-Tag Evidence Aggregation with Probabilistic Circuits for PII-Safe Retrieval-Augmented Generation, Jin Xie et.al., Paper: http://arxiv.org/abs/2603.17292
- 2026-01-07, SCRIBE: Structured Mid-Level Supervision for Tool-Using Language Models, Yuxuan Jiang et.al., Paper: http://arxiv.org/abs/2601.03555
- 2025-10-30, SCRIBE: Structured Chain Reasoning for Interactive Behaviour Explanations using Tool Calling, Fares Fawzi et.al., Paper: http://arxiv.org/abs/2510.26322
- 2025-10-28, SCOUT: A Lightweight Framework for Scenario Coverage Assessment in Autonomous Driving, Anil Yildiz et.al., Paper: http://arxiv.org/abs/2510.24949
- 2025-12-17, SCOPE: Prompt Evolution for Enhancing Agent Effectiveness, Zehua Pei et.al., Paper: http://arxiv.org/abs/2512.15374
- 2026-03-09, SCAFFOLD-CEGIS: Preventing Latent Security Degradation in LLM-Driven Iterative Code Refinement, Yi Chen et.al., Paper: http://arxiv.org/abs/2603.08520
- 2026-01-14, SC-MAS: Constructing Cost-Efficient Multi-Agent Systems with Edge-Level Heterogeneous Collaboration, Di Zhao et.al., Paper: http://arxiv.org/abs/2601.09434
- 2025-12-02, SAT-MapIt: A SAT-based Modulo Scheduling Mapper for Coarse Grain Reconfigurable Architectures, Cristian Tirelli et.al., Paper: http://arxiv.org/abs/2512.02875
- 2026-02-20, SARAH: Spatially Aware Real-time Agentic Humans, Evonne Ng et.al., Paper: http://arxiv.org/abs/2602.18432
- 2026-02-04, SAR-RAG: ATR Visual Question Answering by Semantic Search, Retrieval, and MLLM Generation, David F. Ramirez et.al., Paper: http://arxiv.org/abs/2602.04712
- 2025-09-24, SAMULE: Self-Learning Agents Enhanced by Multi-level Reflection, Yubin Ge et.al., Paper: http://arxiv.org/abs/2509.20562
- 2026-02-10, SAGE: Scalable Agentic 3D Scene Generation for Embodied AI, Hongchi Xia et.al., Paper: http://arxiv.org/abs/2602.10116
- 2026-02-05, SAGE: Benchmarking and Improving Retrieval for Deep Research Agents, Tiansheng Hu et.al., Paper: http://arxiv.org/abs/2602.05975
- 2025-11-10, SAFENLIDB: A Privacy-Preserving Safety Alignment Framework for LLM-based Natural Language Database Interfaces, Ruiheng Liu et.al., Paper: http://arxiv.org/abs/2511.06778
- 2025-11-10, S-DAG: A Subject-Based Directed Acyclic Graph for Multi-Agent Heterogeneous Reasoning, Jiangwen Dong et.al., Paper: http://arxiv.org/abs/2511.06727
- 2025-12-23, S $^3$ IT: A Benchmark for Spatially Situated Social Intelligence Test, Zhe Sun et.al., Paper: http://arxiv.org/abs/2512.19992
- 2026-03-17, Runtime Governance for AI Agents: Policies on Paths, Maurits Kaptein et.al., Paper: http://arxiv.org/abs/2603.16586
- 2026-02-05, RuleSmith: Multi-Agent LLMs for Automated Game Balancing, Ziyao Zeng et.al., Paper: http://arxiv.org/abs/2602.06232
- 2026-04-02, RuleForge: Automated Generation and Validation for Web Vulnerability Detection at Scale, Ayush Garg et.al., Paper: http://arxiv.org/abs/2604.01977
- 2025-11-13, Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following, Yun He et.al., Paper: http://arxiv.org/abs/2511.10507
- 2026-03-20, RouterKGQA: Specialized–General Model Routing for Constraint-Aware Knowledge Graph Question Answering, Bo Yuan et.al., Paper: http://arxiv.org/abs/2603.20017
- 2026-01-13, Route, Retrieve, Reflect, Repair: Self-Improving Agentic Framework for Visual Detection and Linguistic Reasoning in Medical Imaging, Md. Faiyaz Abdullah Sayeedi et.al., Paper: http://arxiv.org/abs/2601.08192
- 2026-01-15, Role-Playing Agents Driven by Large Language Models: Current Status, Challenges, and Future Trends, Ye Wang et.al., Paper: http://arxiv.org/abs/2601.10122
- 2026-02-05, RocqSmith: Can Automatic Optimization Forge Better Proof Agents?, Andrei Kozyrev et.al., Paper: http://arxiv.org/abs/2602.05762
- 2025-11-14, Robust and Efficient Communication in Multi-Agent Reinforcement Learning, Zejiao Liu et.al., Paper: http://arxiv.org/abs/2511.11393
- 2025-11-18, Robust Verification of Controllers under State Uncertainty via Hamilton-Jacobi Reachability Analysis, Albert Lin et.al., Paper: http://arxiv.org/abs/2511.14755
- 2026-01-22, Robust Tool Use via Fission-GRPO: Learning to Recover from Execution Errors, Zhiwei Zhang et.al., Paper: http://arxiv.org/abs/2601.15625
- 2025-11-19, Robust H-infinity control and worst-case search in constrained parametric space, Ervan Kassarian et.al., Paper: http://arxiv.org/abs/2511.15480
- 2026-03-20, Robust Beam Codebooks for mmWave/THz Systems: Toward a Stochastic RL Approach, Anouar Nechi et.al., Paper: http://arxiv.org/abs/2603.19930
- 2025-12-09, Robust Agents in Open-Ended Worlds, Mikayel Samvelyan et.al., Paper: http://arxiv.org/abs/2512.08139
- 2026-03-19, Robotic Agentic Platform for Intelligent Electric Vehicle Disassembly, Zachary Allen et.al., Paper: http://arxiv.org/abs/2603.18520
- 2026-02-16, RoboSolver: A Multi-Agent Large Language Model Framework for Solving Robotic Arm Problems, Hamid Khabazi et.al., Paper: http://arxiv.org/abs/2602.14438
- 2025-12-24, RoboSafe: Safeguarding Embodied Agents via Executable Safety Logic, Le Wang et.al., Paper: http://arxiv.org/abs/2512.21220
- 2026-02-18, RoboGene: Boosting VLA Pre-training via Diversity-Driven Agentic Framework for Real-World Task Generation, Yixue Zhang et.al., Paper: http://arxiv.org/abs/2602.16444
- 2025-10-16, RoboGPT-R1: Enhancing Robot Planning with Reinforcement Learning, Jinrui Liu et.al., Paper: http://arxiv.org/abs/2510.14828
- 2025-09-30, RoRecomp: Enhancing Reasoning Efficiency via Rollout Response Recomposition in Reinforcement Learning, Gang Li et.al., Paper: http://arxiv.org/abs/2509.25958
- 2025-11-14, Risk-Aware Deep Reinforcement Learning for Dynamic Portfolio Optimization, Emmanuel Lwele et.al., Paper: http://arxiv.org/abs/2511.11481
- 2025-12-12, Risk Limited Asset Allocation with a Budget Threshold Utility Function and Leptokurtotic Distributions of Returns, Graham L Giller et.al., Paper: http://arxiv.org/abs/2512.11666
- 2025-10-18, Ripple Effect Protocol: Coordinating Agent Populations, Ayush Chopra et.al., Paper: http://arxiv.org/abs/2510.16572
- 2026-03-04, Right in Time: Reactive Reasoning in Regulated Traffic Spaces, Simon Kohaut et.al., Paper: http://arxiv.org/abs/2603.03977
- 2026-03-16, RieMind: Geometry-Grounded Spatial Agent for Scene Understanding, Fernando Ropero et.al., Paper: http://arxiv.org/abs/2603.15386
- 2026-03-09, RexDrug: Reliable Multi-Drug Combination Extraction through Reasoning-Enhanced LLMs, Zhijun Wang et.al., Paper: http://arxiv.org/abs/2603.08166
- 2026-03-19, RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models, Xiao Feng et.al., Paper: http://arxiv.org/abs/2603.18859
- 2026-03-10, Reward-Zero: Language Embedding Driven Implicit Reward Mechanisms for Reinforcement Learning, Heng Zhang et.al., Paper: http://arxiv.org/abs/2603.09331
- 2026-01-27, Reward Engineering for Reinforcement Learning in Software Tasks, Md Rayhanul Masud et.al., Paper: http://arxiv.org/abs/2601.19100
- 2025-11-24, Revisiting model-independent constraints on spatial curvature and cosmic ladders calibration: updated and forecast analyses, Arianna Favale et.al., Paper: http://arxiv.org/abs/2511.19332
- 2025-12-17, Revisiting Task-Oriented Dataset Search in the Era of Large Language Models: Challenges, Benchmark, and Solution, Zixin Wei et.al., Paper: http://arxiv.org/abs/2512.15363
- 2026-03-20, Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual Study Group of AI Agents, Cen Wan et.al., Paper: http://arxiv.org/abs/2603.20132
- 2025-10-22, Review of Tools for Zero-Code LLM Based Application Development, Priyaranjan Pattnayak et.al., Paper: http://arxiv.org/abs/2510.19747
- 2025-11-13, Revealing the Connection Between the Filamentary Hierarchy and Star Cluster Formation in a Simulated NGC 628 Galaxy, Tamara Koletic et.al., Paper: http://arxiv.org/abs/2511.10486
- 2025-10-12, Retro*: Optimizing LLMs for Reasoning-Intensive Document Retrieval, Junwei Lan et.al., Paper: http://arxiv.org/abs/2509.24869
- 2026-03-14, Retrieve, Schedule, Reflect: LLM Agents for Chip QoR Optimization, Yikang ouyang et.al., Paper: http://arxiv.org/abs/2603.13767
- 2025-10-28, Retrieval and Argumentation Enhanced Multi-Agent LLMs for Judgmental Forecasting, Deniz Gorur et.al., Paper: http://arxiv.org/abs/2510.24303
- 2025-10-30, Retrieval Augmented Generation-Enhanced Distributed LLM Agents for Generalizable Traffic Signal Control with Emergency Vehicles, Xinhang Li et.al., Paper: http://arxiv.org/abs/2510.26242
- 2026-02-19, RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward, Qiucheng Wu et.al., Paper: http://arxiv.org/abs/2602.17558
- 2026-02-02, Rethinking the Role of Entropy in Optimizing Tool-Use Behaviors for Large Language Model Agents, Zeping Li et.al., Paper: http://arxiv.org/abs/2602.02050
- 2025-11-13, Rethinking the Reliability of Multi-agent System: A Perspective from Byzantine Fault Tolerance, Lifan Zheng et.al., Paper: http://arxiv.org/abs/2511.10400
- 2025-10-13, Rethinking Reward Miscalibration of GRPO in Agentic RL, Jingyu Liu et.al., Paper: http://arxiv.org/abs/2509.23870
- 2025-11-14, Rethinking Progression of Memory State in Robotic Manipulation: An Object-Centric Perspective, Nhat Chung et.al., Paper: http://arxiv.org/abs/2511.11478
- 2026-03-17, RetailBench: Evaluating Long-Horizon Autonomous Decision-Making and Strategy Stability of LLM Agents in Realistic Retail Environments, Linghua Zhang et.al., Paper: http://arxiv.org/abs/2603.16453
- 2026-02-16, ResearchGym: Evaluating Language Model Agents on Real-World AI Research, Aniketh Garikaparthi et.al., Paper: http://arxiv.org/abs/2602.15112
- 2025-11-19, RescueLens: LLM-Powered Triage and Action on Volunteer Feedback for Food Rescue, Naveen Raman et.al., Paper: http://arxiv.org/abs/2511.15698
- 2026-01-08, ResMAS: Resilience Optimization in LLM-based Multi-agent Systems, Zhilun Zhou et.al., Paper: http://arxiv.org/abs/2601.04694
- 2025-10-28, Repurposing Synthetic Data for Fine-grained Search Agent Supervision, Yida Zhao et.al., Paper: http://arxiv.org/abs/2510.24694
- 2026-01-15, Repository Intelligence Graph: Deterministic Architectural Map for LLM Code Assistants, Tsvi Cherny-Shahar et.al., Paper: http://arxiv.org/abs/2601.10112
- 2026-03-05, RepoLaunch: Automating Build&Test Pipeline of Code Repositories on ANY Language and ANY Platform, Kenan Li et.al., Paper: http://arxiv.org/abs/2603.05026
- 2025-10-28, ReplicationBench: Can AI Agents Replicate Astrophysics Research Papers?, Christine Ye et.al., Paper: http://arxiv.org/abs/2510.24591
- 2025-12-05, Removing correlated noise stripes from the Nancy Grace Roman Space Telescope survey images, Katherine Laliotis et.al., Paper: http://arxiv.org/abs/2512.05949
- 2026-02-16, Removing Planner Bias in Goal Recognition Through Multi-Plan Dataset Generation, Mustafa F. Abdelwahed et.al., Paper: http://arxiv.org/abs/2602.14691
- 2025-10-30, Remote Labor Index: Measuring AI Automation of Remote Work, Mantas Mazeika et.al., Paper: http://arxiv.org/abs/2510.26787
- 2025-12-11, Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution, Zouying Cao et.al., Paper: http://arxiv.org/abs/2512.10696
- 2025-12-08, Reliable agent engineering should integrate machine-compatible organizational principles, R. Patrick Xian et.al., Paper: http://arxiv.org/abs/2512.07665
- 2025-08-26, Reliable Weak-to-Strong Monitoring of LLM Agents, Neil Kale et.al., Paper: http://arxiv.org/abs/2508.19461
- 2026-03-19, Relationship-Centered Care: Relatedness and Responsible Design for Human Connections in Mental-Health Care, Shivam Shukla et.al., Paper: http://arxiv.org/abs/2603.18375
- 2026-02-02, Relationship-Aware Hierarchical 3D Scene Graph for Task Reasoning, Albert Gassol Puigjaner et.al., Paper: http://arxiv.org/abs/2602.02456
- 2025-12-31, Reinforcement Learning-Augmented LLM Agents for Collaborative Decision Making and Performance Optimization, Dong Qiu et.al., Paper: http://arxiv.org/abs/2512.24609
- 2026-01-26, Reinforcement Learning with Distributed MPC for Fuel-Efficient Platoon Control with Discrete Gear Transitions, Samuel Mallick et.al., Paper: http://arxiv.org/abs/2601.18294
- 2026-01-28, Reinforcement Learning via Self-Distillation, Jonas Hübotter et.al., Paper: http://arxiv.org/abs/2601.20802
- 2025-10-28, Reinforcement Learning for Long-Horizon Multi-Turn Search Agents, Vivek Kalyan et.al., Paper: http://arxiv.org/abs/2510.24126
- 2025-09-08, Reinforcement Learning Foundations for Deep Research Systems: A Survey, Wenjun Li et.al., Paper: http://arxiv.org/abs/2509.06733
- 2025-10-10, Reimagining Agent-based Modeling with Large Language Model Agents via Shachi, So Kuroki et.al., Paper: http://arxiv.org/abs/2509.21862
- 2026-03-24, Regulating AI Agents, Kathrin Gardhouse et.al., Paper: http://arxiv.org/abs/2603.23471
- 2026-02-24, Region of Interest Segmentation and Morphological Analysis for Membranes in Cryo-Electron Tomography, Xingyi Cheng et.al., Paper: http://arxiv.org/abs/2602.21195
- 2025-11-18, ReflexGrad: Three-Way Synergistic Architecture for Zero-Shot Generalization in LLM Agents, Ankush Kadu et.al., Paper: http://arxiv.org/abs/2511.14584
- 2025-12-09, Reflecting with Two Voices: A Co-Adaptive Dual-Strategy Framework for LLM-Based Agent Decision Making, Wentao Zhang et.al., Paper: http://arxiv.org/abs/2512.08366
- 2025-12-31, ReflecToMeet: An AI-Assisted Reflection Based System to Enhance Collaborative Preparedness, Md Nazmus Sakib et.al., Paper: http://arxiv.org/abs/2512.24632
- 2025-11-05, RefAgent: A Multi-agent LLM-based Framework for Automatic Software Refactoring, Khouloud Oueslati et.al., Paper: http://arxiv.org/abs/2511.03153
- 2025-09-15, Redefining Website Fingerprinting Attacks With Multiagent LLMs, Chuxu Song et.al., Paper: http://arxiv.org/abs/2509.12462
- 2026-03-18, Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress, Yuelin Zhang et.al., Paper: http://arxiv.org/abs/2603.17312
- 2025-10-19, ReclAIm: A multi-agent framework for degradation-aware performance tuning of medical imaging AI, Eleftherios Tzanis et.al., Paper: http://arxiv.org/abs/2510.17004
- 2025-12-16, Reasoning-Style Poisoning of LLM Agents via Stealthy Style Transfer: Process-Level Attacks and Runtime Monitoring in RSV Space, Xingfu Zhou et.al., Paper: http://arxiv.org/abs/2512.14448
- 2025-12-24, Reasoning-Driven Amodal Completion: Collaborative Agents and Perceptual Evaluation, Hongxing Fan et.al., Paper: http://arxiv.org/abs/2512.20936
- 2026-03-13, Reasoning over Video: Evaluating How MLLMs Extract, Integrate, and Reconstruct Spatiotemporal Evidence, Seunghwan Bang et.al., Paper: http://arxiv.org/abs/2603.13091
- 2026-03-23, Reasoning Provenance for Autonomous AI Agents: Structured Behavioral Analytics Beyond State Checkpoints and Execution Traces, Neelmani Vispute et.al., Paper: http://arxiv.org/abs/2603.21692
- 2025-11-07, Reasoning Is All You Need for Urban Planning AI, Sijie Yang et.al., Paper: http://arxiv.org/abs/2511.05375
- 2026-03-20, Reasoning Gets Harder for LLMs Inside A Dialogue, Ivan Kartáč et.al., Paper: http://arxiv.org/abs/2603.20133
- 2026-03-19, Reasonably reasoning AI agents can avoid game-theoretic failures in zero-shot, provably, Enoch Hyunwook Kang et.al., Paper: http://arxiv.org/abs/2603.18563
- 2025-10-31, Realistic pedestrian-driver interaction modelling using multi-agent RL with human perceptual-motor constraints, Yueyang Wang et.al., Paper: http://arxiv.org/abs/2510.27383
- 2025-09-04, Real-time adaptive quantum error correction by model-free multi-agent learning, Manuel Guatto et.al., Paper: http://arxiv.org/abs/2509.03974
- 2026-03-10, Real-Time Trust Verification for Safe Agentic Actions using TrustBench, Tavishi Sharma et.al., Paper: http://arxiv.org/abs/2603.09157
- 2025-11-07, Real-Time Reasoning Agents in Evolving Environments, Yule Wen et.al., Paper: http://arxiv.org/abs/2511.04898
- 2025-08-26, Real-Time Model Checking for Closed-Loop Robot Reactive Planning, Christopher Chandler et.al., Paper: http://arxiv.org/abs/2508.19186
- 2026-03-05, Real-Time AI Service Economy: A Framework for Agentic Computing Across the Continuum, Lauri Lovén et.al., Paper: http://arxiv.org/abs/2603.05614
- 2026-04-02, Read More, Think More: Revisiting Observation Reduction for Web Agents, Masafumi Enomoto et.al., Paper: http://arxiv.org/abs/2604.01535
- 2026-02-05, Reactive Knowledge Representation and Asynchronous Reasoning, Simon Kohaut et.al., Paper: http://arxiv.org/abs/2602.05625
- 2025-12-23, Reaching Agreement Among Reasoning LLM Agents, Chaoyi Ruan et.al., Paper: http://arxiv.org/abs/2512.20184
- 2026-03-09, Reachability-based Temporal Logic Verification for Reliable LLM-guided Human-Autonomy Teaming, Joonwon Choi et.al., Paper: http://arxiv.org/abs/2603.08633
- 2025-12-24, ReaSeq: Unleashing World Knowledge via Reasoning for Sequential Modeling, Chuan Wang et.al., Paper: http://arxiv.org/abs/2512.21257
- 2025-12-19, ReX-MLE: The Autonomous Agent Benchmark for Medical Imaging Challenges, Roshan Kenia et.al., Paper: http://arxiv.org/abs/2512.17838
- 2026-03-20, ReViSQL: Achieving Human-Level Text-to-SQL, Yuxuan Zhu et.al., Paper: http://arxiv.org/abs/2603.20004
- 2025-10-16, ReUseIt: Synthesizing Reusable AI Agent Workflows for Web Automation, Yimeng Liu et.al., Paper: http://arxiv.org/abs/2510.14308
- 2026-02-04, ReThinker: Scientific Reasoning by Rethinking with Guided Reflection and Confidence Control, Zhentao Tang et.al., Paper: http://arxiv.org/abs/2602.04496
- 2025-08-29, ReLATE: Learning Efficient Sparse Encoding for High-Performance Tensor Decomposition, Ahmed E. Helal et.al., Paper: http://arxiv.org/abs/2509.00280
- 2026-03-11, Re-Evaluating EVMBench: Are AI Agents Ready for Smart Contract Security?, Chaoyuan Peng et.al., Paper: http://arxiv.org/abs/2603.10795
- 2025-11-28, Random purification channel made simple, Filippo Girardi et.al., Paper: http://arxiv.org/abs/2511.23451
- 2025-12-02, Radiologist Copilot: An Agentic Assistant with Orchestrated Tools for Radiology Reporting with Quality Control, Yongrui Yu et.al., Paper: http://arxiv.org/abs/2512.02814
- 2025-09-29, RadOnc-GPT: An Autonomous LLM Agent for Real-Time Patient Outcomes Labeling at Scale, Jason Holmes et.al., Paper: http://arxiv.org/abs/2509.25540
- 2026-02-27, RUMAD: Reinforcement-Unifying Multi-Agent Debate, Chao Wang et.al., Paper: http://arxiv.org/abs/2602.23864
- 2026-03-18, RPMS: Enhancing LLM-Based Embodied Planning through Rule-Augmented Memory Synergy, Zhenhang Yuan et.al., Paper: http://arxiv.org/abs/2603.17831
- 2025-10-06, RL Is a Hammer and LLMs Are Nails: A Simple Reinforcement Learning Recipe for Strong Prompt Injection, Yuxin Wen et.al., Paper: http://arxiv.org/abs/2510.04885
- 2026-01-23, REprompt: Prompt Generation for Intelligent Software Development Guided by Requirements Engineering, Junjie Shi et.al., Paper: http://arxiv.org/abs/2601.16507
- 2025-11-21, REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing, Binger Chen et.al., Paper: http://arxiv.org/abs/2511.17442
- 2025-09-08, REMI: A Novel Causal Schema Memory Architecture for Personalized Lifestyle Recommendation Agents, Vishal Raman et.al., Paper: http://arxiv.org/abs/2509.06269
- 2025-10-01, RELATE-Sim: Leveraging Turning Point Theory and LLM Agents to Predict and Understand Long-Term Relationship Dynamics through Interactive Narrative Simulations, Matthew Yue et.al., Paper: http://arxiv.org/abs/2510.00414
- 2026-03-17, RECOVER: Robust Entity Correction via agentic Orchestration of hypothesis Variants for Evidence-based Recovery, Abhishek Kumar et.al., Paper: http://arxiv.org/abs/2603.16411
- 2025-10-15, RECODE: Reasoning Through Code Generation for Visual Question Answering, Junhong Shen et.al., Paper: http://arxiv.org/abs/2510.13756
- 2025-10-07, RECODE-H: A Benchmark for Research Code Development with Interactive Human Feedback, Chunyu Miao et.al., Paper: http://arxiv.org/abs/2510.06186
- 2025-10-18, REALM: An MLLM-Agent Framework for Open World 3D Reasoning Segmentation and Editing on Gaussian Splatting, Changyue Shi et.al., Paper: http://arxiv.org/abs/2510.16410
- 2026-03-06, REACT++: Efficient Cross-Attention for Real-Time Scene Graph Generation, Maëlic Neau et.al., Paper: http://arxiv.org/abs/2603.06386
- 2026-02-02, RE-TRAC: REcursive TRAjectory Compression for Deep Search Agents, Jialiang Zhu et.al., Paper: http://arxiv.org/abs/2602.02486
- 2025-12-25, RAPTOR: Real-Time High-Resolution UAV Video Prediction with Efficient Video Attention, Zhan Chen et.al., Paper: http://arxiv.org/abs/2512.21710
- 2026-03-03, RAPO: Expanding Exploration for LLM Agents via Retrieval-Augmented Policy Optimization, Siwei Zhang et.al., Paper: http://arxiv.org/abs/2603.03078
- 2026-03-29, RAGent: Physics-Aware Agentic Reasoning for Training-Free mmWave Human Activity Recognition, Mingda Han et.al., Paper: http://arxiv.org/abs/2603.27571
- 2025-11-06, RAGalyst: Automated Human-Aligned Agentic Evaluation for Domain-Specific RAG, Joshua Gao et.al., Paper: http://arxiv.org/abs/2511.04502
- 2025-09-08, RAFFLES: Reasoning-based Attribution of Faults for LLM Systems, Chenyang Zhu et.al., Paper: http://arxiv.org/abs/2509.06822
- 2026-01-08, RAAR: Retrieval Augmented Agentic Reasoning for Cross-Domain Misinformation Detection, Zhiwei Liu et.al., Paper: http://arxiv.org/abs/2601.04853
- 2026-01-12, R3-RECON: Radiance-Field-Free Active Reconstruction via Renderability, Xiaofeng Jin et.al., Paper: http://arxiv.org/abs/2601.07484
- 2025-12-31, R-Debater: Retrieval-Augmented Debate Generation through Argumentative Memory, Maoyuan Li et.al., Paper: http://arxiv.org/abs/2512.24684
- 2025-12-15, QwenLong-L1.5: Post-Training Recipe for Long-Context Reasoning and Memory Management, Weizhou Shen et.al., Paper: http://arxiv.org/abs/2512.12967
- 2025-11-26, Qwen3-VL Technical Report, Shuai Bai et.al., Paper: http://arxiv.org/abs/2511.21631
- 2025-11-20, Quasi-metric spaces on which real-valued continuous functions are uniformly continuous, Om Dev Singh et.al., Paper: http://arxiv.org/abs/2511.16503
- 2025-11-25, Quantum-Resistant Authentication Scheme for RFID Systems Using Lattice-Based Cryptography, Vaibhav Kumar et.al., Paper: http://arxiv.org/abs/2511.20630
- 2026-02-13, Quantization-Aware Collaborative Inference for Large Embodied AI Models, Zhonghao Lyu et.al., Paper: http://arxiv.org/abs/2602.13052
- 2025-12-15, Quantigence: A Multi-Agent AI Framework for Quantum Security Research, Abdulmalik Alquwayfili et.al., Paper: http://arxiv.org/abs/2512.12989
- 2026-04-02, Quantifying Self-Preservation Bias in Large Language Models, Matteo Migliarini et.al., Paper: http://arxiv.org/abs/2604.02174
- 2025-12-22, QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models, Li Puyin et.al., Paper: http://arxiv.org/abs/2512.19526
- 2026-03-16, QiboAgent: a practitioner’s guideline to open source assistants for Quantum Computing code development, Lorenzo Esposito et.al., Paper: http://arxiv.org/abs/2603.15538
- 2025-10-01, QUASAR: Quantum Assembly Code Generation Using Tool-Augmented LLMs via Agentic RL, Cong Yu et.al., Paper: http://arxiv.org/abs/2510.00967
- 2026-03-12, QUARE: Multi-Agent Negotiation for Balancing Quality Attributes in Requirements Engineering, Haowei Cheng et.al., Paper: http://arxiv.org/abs/2603.11890
- 2026-02-10, QRS: A Rule-Synthesizing Neuro-Symbolic Triad for Autonomous Vulnerability Discovery, George Tsigkourakos et.al., Paper: http://arxiv.org/abs/2602.09774
- 2026-03-02, QCAgent: An agentic framework for quality-controllable pathology report generation from whole slide image, Rundong Wang et.al., Paper: http://arxiv.org/abs/2603.01647
- 2026-02-24, PyVision-RL: Forging Open Agentic Vision Models via RL, Shitian Zhao et.al., Paper: http://arxiv.org/abs/2602.20739
- 2025-10-19, Pursuing Minimal Sufficiency in Spatial Reasoning, Yejie Guo et.al., Paper: http://arxiv.org/abs/2510.16688
- 2025-11-04, PublicAgent: Multi-Agent Design Principles From an LLM-Based Open Data Analysis Framework, Sina Montazeri et.al., Paper: http://arxiv.org/abs/2511.03023
- 2025-09-04, Psychologically Enhanced AI Agents, Maciej Besta et.al., Paper: http://arxiv.org/abs/2509.04343
- 2026-02-27, PseudoAct: Leveraging Pseudocode Synthesis for Flexible Planning and Action Control in Large Language Model Agents, Yihan et.al., Paper: http://arxiv.org/abs/2602.23668
- 2026-03-06, Provuse: Platform-Side Function Fusion for Performance and Efficiency in FaaS Environments, Niklas Kowallik et.al., Paper: http://arxiv.org/abs/2603.06170
- 2026-02-12, Provably Convergent Actor-Critic in Risk-averse MARL, Yizhou Zhang et.al., Paper: http://arxiv.org/abs/2602.12386
- 2025-08-28, Provable Benefits of In-Tool Learning for Large Language Models, Sam Houliston et.al., Paper: http://arxiv.org/abs/2508.20755
- 2026-03-10, ProvAgent: Threat Detection Based on Identity-Behavior Binding and Multi-Agent Collaborative Attack Investigation, Wenhao Yan et.al., Paper: http://arxiv.org/abs/2603.09358
- 2026-03-06, Proof-of-Guardrail in AI Agents and What (Not) to Trust from It, Xisen Jin et.al., Paper: http://arxiv.org/abs/2603.05786
- 2026-01-12, Proof of Time: A Benchmark for Evaluating Scientific Idea Judgments, Bingyang Ye et.al., Paper: http://arxiv.org/abs/2601.07606
- 2025-09-14, Prompts to Proxies: Emulating Human Preferences via a Compact LLM Ensemble, Bingchen Wang et.al., Paper: http://arxiv.org/abs/2509.11311
- 2025-09-16, PromptSleuth: Detecting Prompt Injection via Semantic Intent Invariance, Mengxiao Wang et.al., Paper: http://arxiv.org/abs/2508.20890
- 2025-10-18, Prompt Optimization via Retrieved Reasoning Assets and Multi-Agent Analysis, Wonduk Seo et.al., Paper: http://arxiv.org/abs/2510.16635
- 2025-10-08, Prompt Optimization Across Multiple Agents for Representing Diverse Human Populations, Manh Hung Nguyen et.al., Paper: http://arxiv.org/abs/2510.07064
- 2025-11-06, Promoting Sustainable Web Agents: Benchmarking and Estimating Energy Consumption through Empirical and Theoretical Analysis, Lars Krupp et.al., Paper: http://arxiv.org/abs/2511.04481
- 2026-01-26, Promises, Perils, and (Timely) Heuristics for Mining Coding Agent Activity, Romain Robes Théo Matricon et.al., Paper: http://arxiv.org/abs/2601.18345
- 2026-01-13, Project Synapse: A Hierarchical Multi-Agent Framework with Hybrid Memory for Autonomous Resolution of Last-Mile Delivery Disruptions, Arin Gopalan Yadav et.al., Paper: http://arxiv.org/abs/2601.08156
- 2026-01-05, Project Ariadne: A Structural Causal Framework for Auditing Faithfulness in LLM Agents, Sourena Khanzadeh et.al., Paper: http://arxiv.org/abs/2601.02314
- 2026-01-01, Progressive Ideation using an Agentic AI Framework for Human-AI Co-Creation, Sankar B et.al., Paper: http://arxiv.org/abs/2601.00475
- 2025-12-16, Professional Software Developers Don’t Vibe, They Control: AI Agent Use for Coding in 2025, Ruanqianqian Huang et.al., Paper: http://arxiv.org/abs/2512.14012
- 2026-02-27, ProductResearch: Training E-Commerce Deep Research Agents via Multi-Agent Synthetic Trajectory Distillation, Jiangyuan Wang et.al., Paper: http://arxiv.org/abs/2602.23716
- 2025-11-24, Product Depth for Temporal Point Processes Observed Only Up to the First k Events, Chifeng Shen et.al., Paper: http://arxiv.org/abs/2511.19375
- 2026-04-02, ProdCodeBench: A Production-Derived Benchmark for Evaluating AI Coding Agents, Smriti Jha et.al., Paper: http://arxiv.org/abs/2604.01527
- 2025-10-29, Process-Level Trajectory Evaluation for Environment Configuration in Software Engineering Agents, Jiayi Kuang et.al., Paper: http://arxiv.org/abs/2510.25694
- 2025-11-25, Proceedings Twentieth Conference on Theoretical Aspects of Rationality and Knowledge, Adam Bjorndahl et.al., Paper: http://arxiv.org/abs/2511.20540
- 2025-11-17, Probing scalar-neutrino and scalar-dark-matter interactions with PandaX-4T, PandaX Collaboration et.al., Paper: http://arxiv.org/abs/2511.13515
- 2025-10-21, Probabilistic Modeling of Intentions in Socially Intelligent LLM Agents, Feifan Xia et.al., Paper: http://arxiv.org/abs/2510.18476
- 2025-12-03, Probabilistic Foundations of Fuzzy Simplicial Sets for Nonlinear Dimensionality Reduction, Janis Keck et.al., Paper: http://arxiv.org/abs/2512.03899
- 2026-03-17, Proactive Rejection and Grounded Execution: A Dual-Stage Intent Analysis Paradigm for Safe and Efficient AIoT Smart Homes, Xinxin Jin et.al., Paper: http://arxiv.org/abs/2603.16207
- 2026-03-19, ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents, Hao Zhang et.al., Paper: http://arxiv.org/abs/2603.18815
- 2025-10-29, ProMediate: A Socio-cognitive framework for evaluating proactive agents in multi-party negotiation, Ziyi Liu et.al., Paper: http://arxiv.org/abs/2510.25224
- 2026-04-02, ProCeedRL: Process Critic with Exploratory Demonstration Reinforcement Learning for LLM Agentic Reasoning, Jingyue Gao et.al., Paper: http://arxiv.org/abs/2604.02006
- 2026-01-14, PrivacyReasoner: Can LLM Emulate a Human-like Privacy Mind?, Yiwen Tu et.al., Paper: http://arxiv.org/abs/2601.09152
- 2025-12-31, PrivacyBench: A Conversational Benchmark for Evaluating Privacy in Personalized AI, Srija Mukhopadhyay et.al., Paper: http://arxiv.org/abs/2512.24848
- 2026-03-24, Privacy-Preserving EHR Data Transformation via Geometric Operators: A Human-AI Co-Design Technical Report, Maolin Wang et.al., Paper: http://arxiv.org/abs/2603.22954
- 2025-09-22, Privacy in Action: Towards Realistic Privacy Mitigation and Evaluation for LLM-Powered Agents, Shouju Wang et.al., Paper: http://arxiv.org/abs/2509.17488
- 2025-10-18, Prior Makes It Possible: From Sublinear Graph Algorithms to LLM Test-Time Methods, Avrim Blum et.al., Paper: http://arxiv.org/abs/2510.16609
- 2026-01-27, Principled Fine-tuning of LLMs from User-Edits: A Medley of Preference, Supervision, and Reward, Dipendra Misra et.al., Paper: http://arxiv.org/abs/2601.19055
- 2025-10-10, Preference-Aware Memory Update for Long-Term LLM Agents, Haoran Sun et.al., Paper: http://arxiv.org/abs/2510.09720
- 2026-03-24, Predictive Photometric Uncertainty in Gaussian Splatting for Novel View Synthesis, Chamuditha Jayanga Galappaththige et.al., Paper: http://arxiv.org/abs/2603.22786
- 2025-12-01, Predicting Human Chess Moves: An AI Assisted Analysis of Chess Games Using Skill-group Specific n-gram Language Models, Daren Zhong et.al., Paper: http://arxiv.org/abs/2512.01880
- 2026-03-09, Predicting Conflict Impact on Performance in O-RAN, Pietro Brach del Prever et.al., Paper: http://arxiv.org/abs/2603.08685
- 2025-10-02, Pre-Hoc Predictions in AutoML: Leveraging LLMs to Enhance Model Selection and Benchmarking for Tabular datasets, Yannis Belkhiter et.al., Paper: http://arxiv.org/abs/2510.01842
- 2026-03-06, Pre-AI Baseline: Developer IDE Satisfaction and Tool Autonomy in 2022, Nikola Balić et.al., Paper: http://arxiv.org/abs/2603.06050
- 2025-10-22, Pragmatic Heterogeneous Collaborative Perception via Generative Communication Mechanism, Junfei Zhou et.al., Paper: http://arxiv.org/abs/2510.19618
- 2026-03-20, PowerLens: Taming LLM Agents for Safe and Personalized Mobile Power Management, Xingyu Feng et.al., Paper: http://arxiv.org/abs/2603.19584
- 2026-02-25, Power and Limitations of Aggregation in Compound AI Systems, Nivasini Ananthakrishnan et.al., Paper: http://arxiv.org/abs/2602.21556
- 2026-03-09, PostTrainBench: Can LLM Agents Automate LLM Post-Training?, Ben Rank et.al., Paper: http://arxiv.org/abs/2603.08640
- 2026-03-18, Post-Training Local LLM Agents for Linux Privilege Escalation with Verifiable Rewards, Philipp Normann et.al., Paper: http://arxiv.org/abs/2603.17673
- 2025-10-02, Position: Privacy Is Not Just Memorization!, Niloofar Mireshghallah et.al., Paper: http://arxiv.org/abs/2510.01645
- 2025-10-17, PolySkill: Learning Generalizable Skills Through Polymorphic Abstraction, Simon Yu et.al., Paper: http://arxiv.org/abs/2510.15863
- 2025-11-17, PolicyBot - Reliable Question Answering over Policy Documents, Gautam Nagarajan et.al., Paper: http://arxiv.org/abs/2511.13489
- 2025-10-28, Policy Cards: Machine-Readable Runtime Governance for Autonomous AI Agents, Juraj Mavračić et.al., Paper: http://arxiv.org/abs/2510.24383
- 2026-03-24, Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair, Aditya Kakade et.al., Paper: http://arxiv.org/abs/2603.23129
- 2025-10-21, PokeeResearch: Effective Deep Research via Reinforcement Learning from AI Feedback and Robust Reasoning Scaffold, Yi Wan et.al., Paper: http://arxiv.org/abs/2510.15862
- 2026-01-07, PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation, Wenlong Huang et.al., Paper: http://arxiv.org/abs/2601.03782
- 2026-03-11, Pneuma-Seeker: A Relational Reification Mechanism to Align AI Agents with Human Work over Relational Data, Muhammad Imam Luthfi Balaka et.al., Paper: http://arxiv.org/abs/2603.10747
- 2025-10-21, Plural Voices, Single Agent: Towards Inclusive AI in Multi-User Domestic Spaces, Joydeep Chandra et.al., Paper: http://arxiv.org/abs/2510.19008
- 2025-10-06, Plug-and-Play Dramaturge: A Divide-and-Conquer Approach for Iterative Narrative Script Refinement via Collaborative LLM Agents, Wenda Xie et.al., Paper: http://arxiv.org/abs/2510.05188
- 2025-12-05, Please Don’t Kill My Vibe: Empowering Agents with Data Flow Control, Charlie Summers et.al., Paper: http://arxiv.org/abs/2512.05374
- 2025-10-01, Planner-R1: Reward Shaping Enables Efficient Agentic RL with Smaller LLMs, Siyu Zhu et.al., Paper: http://arxiv.org/abs/2509.25779
- 2026-03-19, PlanTwin: Privacy-Preserving Planning Abstractions for Cloud-Assisted LLM Agents, Guangsheng Yu et.al., Paper: http://arxiv.org/abs/2603.18377
- 2025-12-18, Plain language adaptations of biomedical text using LLMs: Comparision of evaluation metrics, Primoz Kocbek et.al., Paper: http://arxiv.org/abs/2512.16530
- 2026-02-23, Pixel2Phys: Distilling Governing Laws from Visual Dynamics, Ruikun Li et.al., Paper: http://arxiv.org/abs/2602.19516
- 2026-02-24, Pipeline for Verifying LLM-Generated Mathematical Solutions, Varvara Sazonova et.al., Paper: http://arxiv.org/abs/2602.20770
- 2026-01-28, Piloting Planetarium Visualizations with LLMs during Live Events in Science Centers, Mathis Brossier et.al., Paper: http://arxiv.org/abs/2601.20466
- 2026-02-05, PhysicsAgentABM: Physics-Guided Generative Agent-Based Modeling, Kavana Venkatesh et.al., Paper: http://arxiv.org/abs/2602.06030
- 2026-02-16, PhyScensis: Physics-Augmented LLM Agents for Complex Physical Scene Arrangement, Yian Wang et.al., Paper: http://arxiv.org/abs/2602.14968
- 2026-03-24, PhotoAgent: A Robotic Photographer with Spatial and Aesthetic Understanding, Lirong Che et.al., Paper: http://arxiv.org/abs/2603.22796
- 2026-02-19, Phase-Aware Mixture of Experts for Agentic Reinforcement Learning, Shengtian Yang et.al., Paper: http://arxiv.org/abs/2602.17038
- 2025-12-02, Phase-Adaptive LLM Framework with Multi-Stage Validation for Construction Robot Task Allocation: A Systematic Benchmark Against Traditional Optimization Algorithms, Shyam prasad reddy Kaitha et.al., Paper: http://arxiv.org/abs/2512.02810
- 2025-09-24, Perspectra: Choosing Your Experts Enhances Critical Thinking in Multi-Agent Research Ideation, Yiren Liu et.al., Paper: http://arxiv.org/abs/2509.20553
- 2026-01-05, PerspectiveCoach: Exploring LLMs for Developer Reflection, Lauren Olson et.al., Paper: http://arxiv.org/abs/2601.02559
- 2025-12-04, Personalizing Agent Privacy Decisions via Logical Entailment, James Flemings et.al., Paper: http://arxiv.org/abs/2512.05065
- 2026-03-12, PersonaTrace: Synthesizing Realistic Digital Footprints with LLM Agents, Minjia Wang et.al., Paper: http://arxiv.org/abs/2603.11955
- 2025-11-21, PersonaAgent with GraphRAG: Community-Aware Knowledge Graphs for Personalized LLM, Siqi Liang et.al., Paper: http://arxiv.org/abs/2511.17467
- 2026-02-19, Persona2Web: Benchmarking Personalized Web Agents for Contextual Reasoning with User History, Serin Kim et.al., Paper: http://arxiv.org/abs/2602.17003
- 2025-12-03, Permutation Flows I: Triangulations of Flow Polytopes (Research Announcement), Rafael S. González D’León et.al., Paper: http://arxiv.org/abs/2512.04078
- 2025-12-03, Performance and efficiency of a transformer-based quark/gluon jet tagger in the ATLAS experiment, ATLAS Collaboration et.al., Paper: http://arxiv.org/abs/2512.03949
- 2026-01-30, PerfGuard: A Performance-Aware Agent for Visual Content Generation, Zhipeng Chen et.al., Paper: http://arxiv.org/abs/2601.22571
- 2025-12-02, Perception of AI-Generated Music – The Role of Composer Identity, Personality Traits, Music Preferences, and Perceived Humanness, David Stammer et.al., Paper: http://arxiv.org/abs/2512.02785
- 2025-11-10, People Perceive More Phantom Costs From Autonomous Agents When They Make Unreasonably Generous Offers, Benjamin Lebrun et.al., Paper: http://arxiv.org/abs/2511.07401
- 2025-12-16, PentestEval: Benchmarking LLM-based Penetration Testing with Modular and Stage-Level Design, Ruozhao Yang et.al., Paper: http://arxiv.org/abs/2512.14233
- 2026-01-28, PathWise: Planning through World Model for Automated Heuristic Design via Self-Evolving LLMs, Oguzhan Gungordu et.al., Paper: http://arxiv.org/abs/2601.20539
- 2025-09-28, PartnerMAS: An LLM Hierarchical Multi-Agent Framework for Business Partner Selection on High-Dimensional Features, Lingyao Li et.al., Paper: http://arxiv.org/abs/2509.24046
- 2025-11-19, Partial-Wave Unitarity Bounds on Higher-Dimensional Operators from 2-to- $N$ Scattering, Céline Degrande et.al., Paper: http://arxiv.org/abs/2511.15524
- 2025-11-20, PartUV: Part-Based UV Unwrapping of 3D Meshes, Zhaoning Wang et.al., Paper: http://arxiv.org/abs/2511.16659
- 2025-12-01, Parametric processes in nonlinear structures with reflections: a transfer matrix method approach, Salvador Poveda-Hospital et.al., Paper: http://arxiv.org/abs/2512.01921
- 2026-03-17, Parametric Social Identity Injection and Diversification in Public Opinion Simulation, Hexi Wang et.al., Paper: http://arxiv.org/abs/2603.16142
- 2026-02-26, ParamMem: Augmenting Language Agents with Parametric Reflective Memory, Tianjun Yao et.al., Paper: http://arxiv.org/abs/2602.23320
- 2025-11-28, ParaGate: Parasitic-Driven Domain Adaptation Transfer Learning for Netlist Performance Prediction, Bin Sun et.al., Paper: http://arxiv.org/abs/2511.23340
- 2026-03-24, PaperVoyager : Building Interactive Web with Visual Language Models, Dasen Dai et.al., Paper: http://arxiv.org/abs/2603.22999
- 2026-01-15, PaperScout: An Autonomous Agent for Academic Paper Search with Process-Aware Sequence-Level Policy Optimization, Tingyue Pan et.al., Paper: http://arxiv.org/abs/2601.10029
- 2026-01-30, PaperBanana: Automating Academic Illustration for AI Scientists, Dawei Zhu et.al., Paper: http://arxiv.org/abs/2601.23265
- 2026-01-20, Paper2Rebuttal: A Multi-Agent Framework for Transparent Author Response Assistance, Qianli Ma et.al., Paper: http://arxiv.org/abs/2601.14171
- 2026-04-01, Paper Reconstruction Evaluation: Evaluating Presentation and Hallucination in AI-written Papers, Atsuyuki Miyai et.al., Paper: http://arxiv.org/abs/2604.01128
- 2025-09-29, PanoWorld-X: Generating Explorable Panoramic Worlds via Sphere-Aware Video Diffusion, Yuyang Yin et.al., Paper: http://arxiv.org/abs/2509.24997
- 2026-02-25, Pancake: Hierarchical Memory System for Multi-Agent LLM Serving, Zhengding Hu et.al., Paper: http://arxiv.org/abs/2602.21477
- 2025-11-21, PUCP-Metrix: A Comprehensive Open-Source Repository of Linguistic Metrics for Spanish, Javier Alonso Villegas Luis et.al., Paper: http://arxiv.org/abs/2511.17402
- 2026-03-26, PSDesigner: Automated Graphic Design with a Human-Like Creative Workflow, Xincheng Shuai et.al., Paper: http://arxiv.org/abs/2603.25738
- 2025-12-05, PRiSM: An Agentic Multimodal Benchmark for Scientific Reasoning via Python-Grounded Evaluation, Shima Imani et.al., Paper: http://arxiv.org/abs/2512.05930
- 2025-08-21, PROV-AGENT: Unified Provenance for Tracking AI Agent Interactions in Agentic Workflows, Renan Souza et.al., Paper: http://arxiv.org/abs/2508.02866
- 2025-11-24, PRInTS: Reward Modeling for Long-Horizon Information Seeking, Jaewoo Lee et.al., Paper: http://arxiv.org/abs/2511.19314
- 2026-01-08, PRISM: Protocol Refinement through Intelligent Simulation Modeling, Brian Hsu et.al., Paper: http://arxiv.org/abs/2601.05356
- 2026-02-09, PRISM: A Principled Framework for Multi-Agent Reasoning via Gain Decomposition, Yiming Yang et.al., Paper: http://arxiv.org/abs/2602.08586
- 2025-12-22, PRISM: A Personality-Driven Multi-Agent Framework for Social Media Simulation, Zhixiang Lu et.al., Paper: http://arxiv.org/abs/2512.19933
- 2026-03-10, PRECEPT: Planning Resilience via Experience, Context Engineering & Probing Trajectories A Unified Framework for Test-Time Adaptation with Compositional Rule Learning and Pareto-Guided Prompt Evolution, Arash Shahmansoori et.al., Paper: http://arxiv.org/abs/2603.09641
- 2026-03-29, PRBench: End-to-end Paper Reproduction in Physics Research, Shi Qiu et.al., Paper: http://arxiv.org/abs/2603.27646
- 2025-12-02, PPTArena: A Benchmark for Agentic PowerPoint Editing, Michael Ofengenden et.al., Paper: http://arxiv.org/abs/2512.03042
- 2025-12-04, POLARIS: Is Multi-Agentic Reasoning the Next Wave in Engineering Self-Adaptive Systems?, Divyansh Pandey et.al., Paper: http://arxiv.org/abs/2512.04702
- 2026-03-16, PMAx: An Agentic Framework for AI-Driven Process Mining, Anton Antonov et.al., Paper: http://arxiv.org/abs/2603.15351
- 2025-10-20, PLAGUE: Plug-and-play framework for Lifelong Adaptive Generation of Multi-turn Exploits, Neeladri Bhuiya et.al., Paper: http://arxiv.org/abs/2510.17947
- 2026-03-13, PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses, Chenlong Yin et.al., Paper: http://arxiv.org/abs/2603.13026
- 2026-04-02, PHMForge: A Scenario-Driven Agentic Benchmark for Industrial Asset Lifecycle Maintenance, Ayan Das et.al., Paper: http://arxiv.org/abs/2604.01532
- 2026-03-24, PHANTOM Hand, Teng Yan et.al., Paper: http://arxiv.org/abs/2603.23152
- 2025-10-28, PFEA: An LLM-based High-Level Natural Language Planning and Feedback Embodied Agent for Human-Centered AI, Wenbin Ding et.al., Paper: http://arxiv.org/abs/2510.24109
- 2025-12-25, PERELMAN: Pipeline for scientific literature meta-analysis. Technical report, Daniil Sherki et.al., Paper: http://arxiv.org/abs/2512.21727
- 2025-11-06, PEFA-AI: Advancing Open-source LLMs for RTL generation using Progressive Error Feedback Agentic-AI, Athma Narayanan et.al., Paper: http://arxiv.org/abs/2511.03934
- 2026-01-28, PEARL: Plan Exploration and Adaptive Reinforcement Learning for Multihop Tool Use, Qihao Wang et.al., Paper: http://arxiv.org/abs/2601.20439
- 2025-12-22, PEAK: A Performance Engineering AI-Assistant for GPU Kernels Powered by Natural Language Transformations, Muhammad Usman Tariq et.al., Paper: http://arxiv.org/abs/2512.19018
- 2025-12-18, PDE-Agent: A toolchain-augmented multi-agent framework for PDE solving, Jianming Liu et.al., Paper: http://arxiv.org/abs/2512.16214
- 2026-03-17, PAuth - Precise Task-Scoped Authorization For Agents, Reshabh K Sharma et.al., Paper: http://arxiv.org/abs/2603.17170
- 2025-10-08, PARSE: LLM Driven Schema Optimization for Reliable Entity Extraction, Anubhav Shrimal et.al., Paper: http://arxiv.org/abs/2510.08623
- 2025-10-13, PADME: Procedure Aware DynaMic Execution, Deepeka Garg et.al., Paper: http://arxiv.org/abs/2510.11281
- 2025-10-27, P1GPT: a multi-agent LLM workflow module for multi-modal financial information analysis, Chen-Che Lu et.al., Paper: http://arxiv.org/abs/2510.23032
- 2025-11-17, P1: Mastering Physics Olympiads with Reinforcement Learning, Jiacheng Chen et.al., Paper: http://arxiv.org/abs/2511.13612
- 2026-03-31, Owl-AuraID 1.0: An Intelligent System for Autonomous Scientific Instrumentation and Scientific Data Analysis, Han Deng et.al., Paper: http://arxiv.org/abs/2603.29828
- 2026-02-16, Overthinking Loops in Agents: A Structural Risk via MCP Tools, Yohan Lee et.al., Paper: http://arxiv.org/abs/2602.14798
- 2025-09-19, Overhearing LLM Agents: A Survey, Taxonomy, and Roadmap, Andrew Zhu et.al., Paper: http://arxiv.org/abs/2509.16325
- 2025-11-18, Overcoming global sensitivity limitations: using active subspaces to explore discrepancies between global and local parameter sensitivities, Huiyan Zou et.al., Paper: http://arxiv.org/abs/2511.14687
- 2025-10-17, Outraged AI: Large language models prioritise emotion over cost in fairness enforcement, Hao Liu et.al., Paper: http://arxiv.org/abs/2510.17880
- 2025-11-05, Outbidding and Outbluffing Elite Humans: Mastering Liar’s Poker via Self-Play and Reinforcement Learning, Richard Dewey et.al., Paper: http://arxiv.org/abs/2511.03724
- 2026-04-01, OrgAgent: Organize Your Multi-Agent System like a Company, Yiru Wang et.al., Paper: http://arxiv.org/abs/2604.01020
- 2026-01-08, Orchestrating Intelligence: Confidence-Aware Routing for Efficient Multi-Agent Collaboration across Multi-Scale Models, Jingbo Wang et.al., Paper: http://arxiv.org/abs/2601.04861
- 2025-10-02, Orchestrating Human-AI Teams: The Manager Agent as a Unifying Research Challenge, Charlie Masters et.al., Paper: http://arxiv.org/abs/2510.02557
- 2026-03-20, Orchestrating Human-AI Software Delivery: A Retrospective Longitudinal Field Study of Three Software Modernization Programs, Maximiliano Armesto et.al., Paper: http://arxiv.org/abs/2603.20028
- 2026-01-05, Orchestral AI: A Framework for Agent Orchestration, Alexander Roman et.al., Paper: http://arxiv.org/abs/2601.02577
- 2025-10-28, OrchVis: Hierarchical Multi-Agent Orchestration for Human Oversight, Jieyu Zhou et.al., Paper: http://arxiv.org/abs/2510.24937
- 2025-10-28, OrchDAG: Complex Tool Orchestration in Multi-Turn Interactions with Plan DAGs, Yifu Lu et.al., Paper: http://arxiv.org/abs/2510.24663
- 2026-03-23, Optimizing Multi-Agent Weather Captioning via Text Gradient Descent: A Training-Free Approach with Consensus-Aware Gradient Fusion, Shixu Liu et.al., Paper: http://arxiv.org/abs/2603.21673
- 2026-01-21, Optimizing FaaS Platforms for MCP-enabled Agentic Workflows, Varad Kulkarni et.al., Paper: http://arxiv.org/abs/2601.14735
- 2025-12-10, Optimizing Data Extraction from Materials Science Literature: A Study of Tools Using Large Language Models, Wenkai Ning et.al., Paper: http://arxiv.org/abs/2512.09370
- 2026-01-29, Optimizing Agentic Workflows using Meta-tools, Sami Abuzakuk et.al., Paper: http://arxiv.org/abs/2601.22037
- 2025-11-04, Optimizing AI Agent Attacks With Synthetic Data, Chloe Loughridge et.al., Paper: http://arxiv.org/abs/2511.02823
- 2025-11-25, Optimization of Sums of Bivariate Functions: An Introduction to Relaxation-Based Methods for the Case of Finite Domains, Nils Müller et.al., Paper: http://arxiv.org/abs/2511.20607
- 2025-11-28, Optimization and application of ultra-high field preclinical high-resolution and 3D 1H-MRSI using compressed sensing, Brayan Alves et.al., Paper: http://arxiv.org/abs/2511.23331
- 2025-09-28, Optimism as Risk-Seeking in Multi-Agent Reinforcement Learning, Runyu Zhang et.al., Paper: http://arxiv.org/abs/2509.24047
- 2025-10-21, Optimal allocations with distortion risk measures and mixed risk attitudes, Mario Ghossoub et.al., Paper: http://arxiv.org/abs/2510.18236
- 2025-12-29, Optimal Scalability-Aware Allocation of Swarm Robots: From Linear to Retrograde Performance via Marginal Gains, Simay Atasoy Bingöl et.al., Paper: http://arxiv.org/abs/2512.23431
- 2025-12-05, Optimal Safety-Aware Scheduling for Multi-Agent Aerial 3D Printing with Utility Maximization under Dependency Constraints, Marios-Nektarios Stamatopoulos et.al., Paper: http://arxiv.org/abs/2512.05815
- 2025-12-11, Opportunities and Challenges in Harnessing Digital Technology for Effective Teaching and Learning, Zhongzhou Chen et.al., Paper: http://arxiv.org/abs/2512.10777
- 2025-10-09, Opponent Shaping in LLM Agents, Marta Emili Garcia Segura et.al., Paper: http://arxiv.org/abs/2510.08255
- 2026-02-13, Opinion dynamics and mutual influence with LLM agents through dialog simulation, Yulong He et.al., Paper: http://arxiv.org/abs/2602.12583
- 2026-01-12, OpenTinker: Separating Concerns in Agentic Reinforcement Learning, Siqi Zhu et.al., Paper: http://arxiv.org/abs/2601.07376
- 2026-03-16, OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data, Yuwen Du et.al., Paper: http://arxiv.org/abs/2603.15594
- 2025-11-13, OpenSR-SRGAN: A Flexible Super-Resolution Framework for Multispectral Earth Observation Data, Simon Donike et.al., Paper: http://arxiv.org/abs/2511.10461
- 2025-11-20, OpenQudit: Extensible and Accelerated Numerical Quantum Compilation via a JIT-Compiled DSL, Ed Younis et.al., Paper: http://arxiv.org/abs/2511.16585
- 2026-03-17, OpenQlaw: An Agentic AI Assistant for Analysis of 2D Quantum Materials, Sankalp Pandey et.al., Paper: http://arxiv.org/abs/2603.17043
- 2025-10-24, OpenHype: Hyperbolic Embeddings for Hierarchical Open-Vocabulary Radiance Fields, Lisa Weijler et.al., Paper: http://arxiv.org/abs/2510.21441
- 2026-03-23, OpenEarth-Agent: From Tool Calling to Tool Creation for Open-Environment Earth Observation, Sijie Zhao et.al., Paper: http://arxiv.org/abs/2603.22148
- 2025-10-16, OpenDerisk: An Industrial Framework for AI-Driven SRE, with Design, Implementation, and Case Studies, Peng Di et.al., Paper: http://arxiv.org/abs/2510.13561
- 2026-02-23, OpenClaw, Moltbook, and ClawdLab: From Agent-Only Social Networks to Autonomous Scientific Research, Lukas Weidener et.al., Paper: http://arxiv.org/abs/2602.19810
- 2026-03-12, OpenClaw PRISM: A Zero-Fork, Defense-in-Depth Runtime Security Layer for Tool-Augmented LLM Agents, Frank Li et.al., Paper: http://arxiv.org/abs/2603.11853
- 2025-10-23, Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence, Jiahao Meng et.al., Paper: http://arxiv.org/abs/2510.20579
- 2025-12-18, Open Ad-hoc Categorization with Contextualized Feature Learning, Zilin Wang et.al., Paper: http://arxiv.org/abs/2512.16202
- 2026-02-16, OpaqueToolsBench: Learning Nuances of Tool Behavior Through Interaction, Skyler Hallinan et.al., Paper: http://arxiv.org/abs/2602.15197
- 2026-02-03, Ontology-to-tools compilation for executable semantic constraint enforcement in LLM agents, Xiaochi Zhou et.al., Paper: http://arxiv.org/abs/2602.03439
- 2026-04-01, Ontology-Constrained Neural Reasoning in Enterprise Agentic Systems: A Neurosymbolic Architecture for Domain-Grounded AI Agents, Thanh Luong Tuan et.al., Paper: http://arxiv.org/abs/2604.00555
- 2025-12-08, Online Segment Any 3D Thing as Instance Tracking, Hanshi Wang et.al., Paper: http://arxiv.org/abs/2512.07599
- 2026-02-02, Online Fine-Tuning of Pretrained Controllers for Autonomous Driving via Real-Time Recurrent RL, Julian Lemmel et.al., Paper: http://arxiv.org/abs/2602.02236
- 2025-12-24, One Tool Is Enough: Reinforcement Learning for Repository-Level LLM Agents, Zhaoxi Zhang et.al., Paper: http://arxiv.org/abs/2512.20957
- 2025-10-30, One Model to Critique Them All: Rewarding Agentic Tool-Use via Efficient Reasoning, Renhao Li et.al., Paper: http://arxiv.org/abs/2510.26167
- 2026-03-09, One Model Is Enough: Native Retrieval Embeddings from LLM Agent Hidden States, Bo Jiang et.al., Paper: http://arxiv.org/abs/2603.08429
- 2025-12-11, On the ground state of the nonlinear Schr{ö}dinger equation: asymptotic behavior at the endpoint powers, Rémi Carles et.al., Paper: http://arxiv.org/abs/2512.10690
- 2025-12-02, On the distribution of very short character sums, Paweł Nosal et.al., Paper: http://arxiv.org/abs/2512.02915
- 2026-02-05, On the computational properties of ambivalent sets and functions, Dag Normann et.al., Paper: http://arxiv.org/abs/2602.05620
- 2025-10-01, On the Soundness and Consistency of LLM Agents for Executing Test Cases Written in Natural Language, Sébastien Salva et.al., Paper: http://arxiv.org/abs/2509.19136
- 2025-12-18, On the Role of Contextual Information and Ego States in LLM Agent Behavior for Transactional Analysis Dialogues, Monika Zamojska et.al., Paper: http://arxiv.org/abs/2512.17060
- 2025-10-14, On the Number of Small Points for Rational Maps, Jit Wu Yap et.al., Paper: http://arxiv.org/abs/2510.12039
- 2025-11-26, On the Limits of Innate Planning in Large Language Models, Charles Schepanowski et.al., Paper: http://arxiv.org/abs/2511.21591
- 2026-01-28, On the Impact of AGENTS.md Files on the Efficiency of AI Coding Agents, Jai Lal Lulla et.al., Paper: http://arxiv.org/abs/2601.20404
- 2026-02-12, On the Adoption of AI Coding Agents in Open-source Android and iOS Development, Muhammad Ahmad Khan et.al., Paper: http://arxiv.org/abs/2602.12144
- 2026-02-09, On Protecting Agentic Systems’ Intellectual Property via Watermarking, Liwen Wang et.al., Paper: http://arxiv.org/abs/2602.08401
- 2026-03-12, On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents, Deyu Zou et.al., Paper: http://arxiv.org/abs/2603.12109
- 2025-10-27, On Generalization in Agentic Tool Calling: CoreThink Agentic Reasoner and MAVEN Dataset, Vishvesh Bhat et.al., Paper: http://arxiv.org/abs/2510.22898
- 2025-12-04, On Disturbance-Aware Minimum-Time Trajectory Planning: Evidence from Tests on a Dynamic Driving Simulator, Matteo Masoni et.al., Paper: http://arxiv.org/abs/2512.04917
- 2026-02-16, On Convergence Analysis of Network-GIANT: An approximate Hessian-based fully distributed optimization algorithm, Souvik Das et.al., Paper: http://arxiv.org/abs/2602.14830
- 2026-01-20, On Autopilot? An Empirical Study of Human-AI Teaming and Review Practices in Open Source, Haoyu Gao et.al., Paper: http://arxiv.org/abs/2601.13754
- 2026-01-01, OmniVaT: Single Domain Generalization for Multimodal Visual-Tactile Learning, Liuxiang Qiu et.al., Paper: http://arxiv.org/abs/2601.00352
- 2026-03-12, OmniStream: Mastering Perception, Reconstruction and Action in Continuous Streams, Yibin Yan et.al., Paper: http://arxiv.org/abs/2603.12265
- 2026-02-03, OmniRAG-Agent: Agentic Omnimodal Reasoning for Low-Resource Long Audio-Video Question Answering, Yifan Zhu et.al., Paper: http://arxiv.org/abs/2602.03707
- 2026-04-01, OmniMem: Autoresearch-Guided Discovery of Lifelong Multimodal Agent Memory, Jiaqi Liu et.al., Paper: http://arxiv.org/abs/2604.01007
- 2026-02-26, OmniGAIA: Towards Native Omni-Modal AI Agents, Xiaoxi Li et.al., Paper: http://arxiv.org/abs/2602.22897
- 2025-08-08, OmniEAR: Benchmarking Agent Reasoning in Embodied Tasks, Zixuan Wang et.al., Paper: http://arxiv.org/abs/2508.05614
- 2026-02-02, OmniCode: A Benchmark for Evaluating Software Engineering Agents, Atharv Sonwane et.al., Paper: http://arxiv.org/abs/2602.02262
- 2025-11-17, Omni Memory System for Personalized, Long Horizon, Self-Evolving Agents, Piaohong Wang et.al., Paper: http://arxiv.org/abs/2511.13593
- 2026-01-28, OmegaUse: Building a General-Purpose GUI Agent for Autonomous Task Execution, Le Zhang et.al., Paper: http://arxiv.org/abs/2601.20380
- 2025-11-17, OlmoEarth: Stable Latent Image Modeling for Multimodal Earth Observation, Henry Herzog et.al., Paper: http://arxiv.org/abs/2511.13655
- 2025-12-15, Olmo 3, Team Olmo et.al., Paper: http://arxiv.org/abs/2512.13961
- 2025-11-09, Offloading Data Center Tax, Akshay Revankar et.al., Paper: http://arxiv.org/abs/2511.06558
- 2026-03-09, OfficeQA Pro: An Enterprise Benchmark for End-to-End Grounded Reasoning, Krista Opsahl-Ong et.al., Paper: http://arxiv.org/abs/2603.08655
- 2026-02-05, OdysseyArena: Benchmarking Large Language Models For Long-Horizon, Active and Inductive Interactions, Fangzhi Xu et.al., Paper: http://arxiv.org/abs/2602.05843
- 2026-01-15, OctoBench: Benchmarking Scaffold-Aware Instruction Following in Repository-Grounded Agentic Coding, Deming Ding et.al., Paper: http://arxiv.org/abs/2601.10343
- 2026-03-17, Occupation-Measure Mean-Field Control: Optimization over Measures and Frank-Wolfe Methods, Di Yu et.al., Paper: http://arxiv.org/abs/2603.16094
- 2025-12-10, ObliInjection: Order-Oblivious Prompt Injection Attack to LLM Agents with Multi-source Data, Ruiqi Wang et.al., Paper: http://arxiv.org/abs/2512.09321
- 2026-02-04, OSCAgent: Accelerating the Discovery of Organic Solar Cells with LLM Agents, Zhaolin Hu et.al., Paper: http://arxiv.org/abs/2602.04510
- 2025-09-05, OSC: Cognitive Orchestration through Dynamic Knowledge Alignment in Multi-Agent LLM Collaboration, Jusheng Zhang et.al., Paper: http://arxiv.org/abs/2509.04876
- 2025-12-22, ORPR: An OR-Guided Pretrain-then-Reinforce Learning Model for Inventory Management, Lingjie Zhao et.al., Paper: http://arxiv.org/abs/2512.19001
- 2025-09-01, ORCA: ORchestrating Causal Agent, Joanie Hayoun Chung et.al., Paper: http://arxiv.org/abs/2508.21304
- 2026-02-27, OPTIAGENT: A Physics-Driven Agentic Framework for Automated Optical Design, Yuyu Geng et.al., Paper: http://arxiv.org/abs/2602.23761
- 2025-10-20, OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning, Zhenyu Bi et.al., Paper: http://arxiv.org/abs/2510.18032
- 2025-12-01, OPOR-Bench: Evaluating Large Language Models on Online Public Opinion Report Generation, Jinzheng Yu et.al., Paper: http://arxiv.org/abs/2512.01896
- 2025-09-20, OPEN-THEATRE: An Open-Source Toolkit for LLM-based Interactive Drama, Tianyang Xu et.al., Paper: http://arxiv.org/abs/2509.16713
- 2026-03-26, OMIND: Framework for Knowledge Grounded Finetuning and Multi-Turn Dialogue Benchmark for Mental Health LLMs, Suraj Racha et.al., Paper: http://arxiv.org/abs/2603.25105
- 2026-02-04, OMG-Agent: Toward Robust Missing Modality Generation with Decoupled Coarse-to-Fine Agentic Workflows, Ruiting Dai et.al., Paper: http://arxiv.org/abs/2602.04144
- 2026-03-12, O3N: Omnidirectional Open-Vocabulary Occupancy Prediction, Mengfei Duan et.al., Paper: http://arxiv.org/abs/2603.12144
- 2026-01-07, O-Researcher: An Open Ended Deep Research Model via Multi-Agent Distillation and Agentic RL, Yi Yao et.al., Paper: http://arxiv.org/abs/2601.03743
- 2026-03-11, Nurture-First Agent Development: Building Domain-Expert AI Agents Through Conversational Knowledge Crystallization, Linghao Zhang et.al., Paper: http://arxiv.org/abs/2603.10808
- 2026-01-20, Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics, Junqi Liu et.al., Paper: http://arxiv.org/abs/2601.14027
- 2026-02-27, Novice Developers Produce Larger Review Overhead for Project Maintainers while Vibe Coding, Syed Ammar Asdaque et.al., Paper: http://arxiv.org/abs/2602.23905
- 2025-11-24, Normative active inference: A numerical proof of principle for a computational and economic legal analytic approach to AI governance, Axel Constant et.al., Paper: http://arxiv.org/abs/2511.19334
- 2026-01-28, Normative Equivalence in human-AI Cooperation: Behaviour, Not Identity, Drives Cooperation in Mixed-Agent Groups, Nico Mutzner et.al., Paper: http://arxiv.org/abs/2601.20487
- 2026-03-12, Normative Common Ground Replication (NormCoRe): Replication-by-Translation for Studying Norms in Multi-agent AI, Luca Deck et.al., Paper: http://arxiv.org/abs/2603.11974
- 2026-03-17, Nonstandard Errors in AI Agents, Ruijiang Gao et.al., Paper: http://arxiv.org/abs/2603.16744
- 2025-10-22, Nonmonotone subgradient methods based on a local descent lemma, Francisco J. Aragón-Artacho et.al., Paper: http://arxiv.org/abs/2510.19341
- 2025-11-18, Non-vanishing of Artin $L$-functions associated with $D_4$ -quartic function fields ordered by conductor, Victor Ahlquist et.al., Paper: http://arxiv.org/abs/2511.14576
- 2025-12-01, Non-archimedean Infinite Hecke Algebra, Milo Bechtloff Weising et.al., Paper: http://arxiv.org/abs/2512.01835
- 2025-08-21, Noise, Adaptation, and Strategy: Assessing LLM Fidelity in Decision-Making, Yuanjun Feng et.al., Paper: http://arxiv.org/abs/2508.15926
- 2025-11-04, No-Human in the Loop: Agentic Evaluation at Scale for Recommendation, Tao Zhang et.al., Paper: http://arxiv.org/abs/2511.03051
- 2026-02-06, Nipping the Drift in the Bud: Retrospective Rectification for Robust Vision-Language Navigation, Gang He et.al., Paper: http://arxiv.org/abs/2602.06356
- 2025-10-30, Nexus: Execution-Grounded Multi-Agent Test Oracle Synthesis, Dong Huang et.al., Paper: http://arxiv.org/abs/2510.26423
- 2025-12-04, Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction, Nex-AGI Team et.al., Paper: http://arxiv.org/abs/2512.04987
- 2025-10-08, NewtonBench: Benchmarking Generalizable Scientific Law Discovery in LLM Agents, Tianshi Zheng et.al., Paper: http://arxiv.org/abs/2510.07172
- 2026-01-07, NeuronScope: A Multi-Agent Framework for Explaining Polysemantic Neurons in Language Models, Weiqi Liu et.al., Paper: http://arxiv.org/abs/2601.03671
- 2025-12-01, NeuroHJR: Hamilton-Jacobi Reachability-based Obstacle Avoidance in Complex Environments with Physics-Informed Neural Networks, Granthik Halder et.al., Paper: http://arxiv.org/abs/2512.01897
- 2026-01-21, NeuroFilter: Privacy Guardrails for Conversational LLM Agents, Saswat Das et.al., Paper: http://arxiv.org/abs/2601.14660
- 2025-10-09, Neuro-Symbolic Agents with Modal Logic for Autonomous Diagnostics, Antonin Sulc et.al., Paper: http://arxiv.org/abs/2509.11943
- 2025-12-01, Neural steering vectors reveal dose and exposure-dependent impacts of human-AI relationships, Hannah Rose Kirk et.al., Paper: http://arxiv.org/abs/2512.01991
- 2025-12-09, NeurIDA: Dynamic Modeling for Effective In-Database Analytics, Lingze Zeng et.al., Paper: http://arxiv.org/abs/2512.08483
- 2025-12-02, Network Self-Configuration based on Fine-Tuned Small Language Models, Oscar G. Lira et.al., Paper: http://arxiv.org/abs/2512.02861
- 2025-12-29, Nested Browser-Use Learning for Agentic Information Seeking, Baixuan Li et.al., Paper: http://arxiv.org/abs/2512.23647
- 2026-01-07, NeoAMT: Neologism-Aware Agentic Machine Translation with Reinforcement Learning, Zhongtao Miao et.al., Paper: http://arxiv.org/abs/2601.03790
- 2025-12-23, Nemotron 3 Nano: Open, Efficient Mixture-of-Experts Hybrid Mamba-Transformer Model for Agentic Reasoning, NVIDIA et.al., Paper: http://arxiv.org/abs/2512.20848
- 2026-03-31, Near-Miss: Latent Policy Failure Detection in Agentic Workflows, Ella Rabinovich et.al., Paper: http://arxiv.org/abs/2603.29665
- 2025-11-19, Navigating Quantum Missteps in Agent-Based Modeling: A Schelling Model Case Study, C. Nico Barati et.al., Paper: http://arxiv.org/abs/2511.15642
- 2025-12-05, Natural Language Summarization Enables Multi-Repository Bug Localization by LLMs in Microservice Architectures, Amirkia Rafiei Oskooei et.al., Paper: http://arxiv.org/abs/2512.05908
- 2025-12-04, Natural Language Actor-Critic: Scalable Off-Policy Learning in Language Space, Joey Hong et.al., Paper: http://arxiv.org/abs/2512.04601
- 2025-12-08, Native Parallel Reasoner: Reasoning in Parallelism via Self-Distilled Reinforcement Learning, Tong Wu et.al., Paper: http://arxiv.org/abs/2512.07461
- 2025-10-17, Narrowing Action Choices with AI Improves Human Sequential Decisions, Eleni Straitouri et.al., Paper: http://arxiv.org/abs/2510.16097
- 2025-12-24, NVIDIA Nemotron 3: Efficient and Open Intelligence, NVIDIA et.al., Paper: http://arxiv.org/abs/2512.20856
- 2025-11-18, NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards, Chia-Yu Hung et.al., Paper: http://arxiv.org/abs/2511.14659
- 2025-12-14, NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents, Jingzhe Ding et.al., Paper: http://arxiv.org/abs/2512.12730
- 2026-02-20, NIMMGen: Learning Neural-Integrated Mechanistic Digital Twins with LLMs, Zihan Guan et.al., Paper: http://arxiv.org/abs/2602.18008
- 2025-12-05, NICE: Neural Implicit Craniofacial Model for Orthognathic Surgery Prediction, Jiawen Yang et.al., Paper: http://arxiv.org/abs/2512.05920
- 2025-10-21, NEBULA: Do We Evaluate Vision-Language-Action Agents Correctly?, Jierui Peng et.al., Paper: http://arxiv.org/abs/2510.16263
- 2025-11-21, MusicAIR: A Multimodal AI Music Generation Framework Powered by an Algorithm-Driven Core, Callie C. Liao et.al., Paper: http://arxiv.org/abs/2511.17323
- 2025-12-05, Multimodal Oncology Agent for IDH1 Mutation Prediction in Low-Grade Glioma, Hafsa Akebli et.al., Paper: http://arxiv.org/abs/2512.05824
- 2026-03-19, Multimodal Model for Computational Pathology:Representation Learning and Image Compression, Peihang Wu et.al., Paper: http://arxiv.org/abs/2603.18660
- 2025-12-25, Multiconnectivity for SAGIN: Current Trends, Challenges, AI-driven Solutions, and Opportunities, Abd Ullah Khan et.al., Paper: http://arxiv.org/abs/2512.21717
- 2026-01-26, MultiVis-Agent: A Multi-Agent Framework with Logic Rules for Reliable and Comprehensive Cross-Modal Data Visualization, Jinwei Lu et.al., Paper: http://arxiv.org/abs/2601.18320
- 2025-10-14, MultiFoodhat: A potential new paradigm for intelligent food quality inspection, Yue Hu et.al., Paper: http://arxiv.org/abs/2510.13889
- 2025-10-17, Multi-dimensional Data Analysis and Applications Basing on LLM Agents and Knowledge Graph Interactions, Xi Wang et.al., Paper: http://arxiv.org/abs/2510.15258
- 2025-10-27, Multi-Stakeholder Alignment in LLM-Powered Collaborative AI Systems: A Multi-Agent Framework for Intelligent Tutoring, Alexandre P Uchoa et.al., Paper: http://arxiv.org/abs/2510.23245
- 2025-11-14, Multi-Phase Spacecraft Trajectory Optimization via Transformer-Based Reinforcement Learning, Amit Jain et.al., Paper: http://arxiv.org/abs/2511.11402
- 2026-01-06, Multi-Modal Data-Enhanced Foundation Models for Prediction and Control in Wireless Networks: A Survey, Han Zhang et.al., Paper: http://arxiv.org/abs/2601.03181
- 2026-03-31, Multi-Layered Memory Architectures for LLM Agents: An Experimental Evaluation of Long-Term Context Retention, Sunil Tiwari et.al., Paper: http://arxiv.org/abs/2603.29194
- 2025-10-06, Multi-Agent Tool-Integrated Policy Optimization, Zhanfeng Mo et.al., Paper: http://arxiv.org/abs/2510.04678
- 2026-01-15, Multi-Agent Taint Specification Extraction for Vulnerability Detection, Jonah Ghebremichael et.al., Paper: http://arxiv.org/abs/2601.10865
- 2026-01-30, Multi-Agent Systems Should be Treated as Principal-Agent Problems, Paulius Rauba et.al., Paper: http://arxiv.org/abs/2601.23211
- 2025-09-01, Multi-Agent Reinforcement Learning for Task Offloading in Wireless Edge Networks, Andrea Fox et.al., Paper: http://arxiv.org/abs/2509.01257
- 2025-12-04, Multi-Agent Reinforcement Learning for Intraday Operating Rooms Scheduling under Uncertainty, Kailiang Liu et.al., Paper: http://arxiv.org/abs/2512.04918
- 2026-03-25, Multi-Agent Reasoning with Consistency Verification Improves Uncertainty Calibration in Medical MCQA, John Ray B. Martinez et.al., Paper: http://arxiv.org/abs/2603.24481
- 2026-01-27, Multi-Agent Procedural Graph Extraction with Structural and Logical Refinement, Wangyang Ying et.al., Paper: http://arxiv.org/abs/2601.19170
- 2025-11-21, Multi-Agent Pointer Transformer: Seq-to-Seq Reinforcement Learning for Multi-Vehicle Dynamic Pickup-Delivery Problems, Zengyu Zou et.al., Paper: http://arxiv.org/abs/2511.17435
- 2026-02-02, Multi-Agent Monte Carlo Tree Search for Makespan-Efficient Object Rearrangement in Cluttered Spaces, Hanwen Ren et.al., Paper: http://arxiv.org/abs/2602.02411
- 2025-12-16, Multi-Agent Medical Decision Consensus Matrix System: An Intelligent Collaborative Framework for Oncology MDT Consultations, Xudong Han et.al., Paper: http://arxiv.org/abs/2512.14321
- 2025-12-09, Multi-Agent Intelligence for Multidisciplinary Decision-Making in Gastrointestinal Oncology, Rongzhao Zhang et.al., Paper: http://arxiv.org/abs/2512.08674
- 2025-12-29, Multi-Agent Framework for Threat Mitigation and Resilience in AI-Based Systems, Armstrong Foundjem et.al., Paper: http://arxiv.org/abs/2512.23132
- 2026-03-29, Multi-Agent Dialectical Refinement for Enhanced Argument Classification, Jakub Bąba et.al., Paper: http://arxiv.org/abs/2603.27451
- 2025-10-14, Multi-Agent Debate for LLM Judges with Adaptive Stability Detection, Tianyu Hu et.al., Paper: http://arxiv.org/abs/2510.12697
- 2026-01-01, Multi-Agent Coordinated Rename Refactoring, Abhiram Bellur et.al., Paper: http://arxiv.org/abs/2601.00482
- 2025-12-15, Multi-Agent Collaborative Framework for Intelligent IT Operations: An AOI System with Context-Aware Compression and Dynamic Task Scheduling, Zishan Bai et.al., Paper: http://arxiv.org/abs/2512.13956
- 2025-11-06, Multi-Agent Collaborative Framework For Math Problem Generation, Kia Karbasi et.al., Paper: http://arxiv.org/abs/2511.03958
- 2025-09-09, Multi Robot Coordination in Highly Dynamic Environments: Tackling Asymmetric Obstacles and Limited Communication, Vincenzo Suriani et.al., Paper: http://arxiv.org/abs/2509.08859
- 2026-03-04, Mozi: Governed Autonomy for Drug Discovery LLM Agents, He Cao et.al., Paper: http://arxiv.org/abs/2603.03655
- 2025-12-15, Motus: A Unified Latent Action World Model, Hongzhe Bi et.al., Paper: http://arxiv.org/abs/2512.13030
- 2025-10-19, More with Less: An Empirical Study of Turn-Control Strategies for Efficient Coding Agents, Pengfei Gao et.al., Paper: http://arxiv.org/abs/2510.16786
- 2026-01-27, More at Stake: How Payoff and Language Shape LLM Agent Strategies in Cooperation Dilemmas, Trung-Kiet Huynh et.al., Paper: http://arxiv.org/abs/2601.19082
- 2025-11-10, More Agents Helps but Adversarial Robustness Gap Persists, Khashayar Alavi et.al., Paper: http://arxiv.org/abs/2511.07112
- 2025-12-18, MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning, Yuanchen Ju et.al., Paper: http://arxiv.org/abs/2512.16909
- 2025-12-23, MolAct: An Agentic RL Framework for Molecular Editing and Property Optimization, Zhuo Yang et.al., Paper: http://arxiv.org/abs/2512.20135
- 2025-12-21, Modular Automatic Complexity Analysis of Recursive Integer Programs, Nils Lommen et.al., Paper: http://arxiv.org/abs/2512.18851
- 2026-03-26, Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation, Roman Kueble et.al., Paper: http://arxiv.org/abs/2603.25415
- 2026-01-16, Modeling Multi-Party Interaction in Couples Therapy: A Multi-Agent Simulation Approach, Canwen Wang et.al., Paper: http://arxiv.org/abs/2601.10970
- 2026-01-13, Modeling LLM Agent Reviewer Dynamics in Elo-Ranked Review System, Hsiang-Wei Huang et.al., Paper: http://arxiv.org/abs/2601.08829
- 2026-02-19, Modeling Distinct Human Interaction in Web Agents, Faria Huq et.al., Paper: http://arxiv.org/abs/2602.17588
- 2025-12-16, Model-First Reasoning LLM Agents: Reducing Hallucinations through Explicit Problem Modeling, Annu Rana et.al., Paper: http://arxiv.org/abs/2512.14474
- 2025-10-30, Model-Document Protocol for AI Search, Hongjin Qian et.al., Paper: http://arxiv.org/abs/2510.25160
- 2025-11-26, Model-Based Policy Adaptation for Closed-Loop End-to-End Autonomous Driving, Haohong Lin et.al., Paper: http://arxiv.org/abs/2511.21584
- 2025-12-01, Model theory, differential algebra and functional transcendence, Amador Martin-Pizarro et.al., Paper: http://arxiv.org/abs/2512.01866
- 2025-10-27, Model Proficiency in Centralized Multi-Agent Systems: A Performance Study, Anna Guerra et.al., Paper: http://arxiv.org/abs/2510.23447
- 2025-12-05, Model Gateway: Model Management Platform for Model-Driven Drug Discovery, Yan-Shiun Wu et.al., Paper: http://arxiv.org/abs/2512.05462
- 2026-02-16, Model Context Protocol (MCP) Tool Descriptions Are Smelly! Towards Improving AI Agent Efficiency with Augmented MCP Tool Descriptions, Mohammed Mehedi Hasan et.al., Paper: http://arxiv.org/abs/2602.14878
- 2025-12-16, MobileWorldBench: Towards Semantic World Modeling For Mobile Agents, Shufan Li et.al., Paper: http://arxiv.org/abs/2512.14014
- 2025-12-22, MobileWorld: Benchmarking Autonomous Mobile Agents in Agent-User Interactive, and MCP-Augmented Environments, Quyu Kong et.al., Paper: http://arxiv.org/abs/2512.19432
- 2025-10-24, Mixture-of-Minds: Multi-Agent Reinforcement Learning for Table Understanding, Yuhang Zhou et.al., Paper: http://arxiv.org/abs/2510.20176
- 2025-09-28, Mix-Ecom: Towards Mixed-Type E-Commerce Dialogues with Complex Domain Rules, Chenyu Zhou et.al., Paper: http://arxiv.org/abs/2509.23836
- 2026-01-15, Mitigating GIL Bottlenecks in Edge AI Systems, Mridankan Mandal et.al., Paper: http://arxiv.org/abs/2601.10582
- 2025-10-22, Misalignment Bounty: Crowdsourcing AI Agent Misbehavior, Rustem Turtayev et.al., Paper: http://arxiv.org/abs/2510.19738
- 2026-02-26, MiroFlow: Towards High-Performance and Robust Open-Source Agent Framework for General Deep Research Tasks, Shiqian Su et.al., Paper: http://arxiv.org/abs/2602.22808
- 2025-09-16, Mining the Long Tail: A Comparative Study of Data-Centric Criticality Metrics for Robust Offline Reinforcement Learning in Autonomous Motion Planning, Antonio Guillen-Perez et.al., Paper: http://arxiv.org/abs/2508.18397
- 2026-02-20, Mining Type Constructs Using Patterns in AI-Generated Code, Imgyeong Lee et.al., Paper: http://arxiv.org/abs/2602.17955
- 2025-11-21, Minimalist machine-learned interatomic potentials can predict complex structural behaviors accurately, Iñigo Robredo-Magro et.al., Paper: http://arxiv.org/abs/2511.17449
- 2026-03-24, Minibal: Balanced Game-Playing Without Opponent Modeling, Quentin Cohen-Solal et.al., Paper: http://arxiv.org/abs/2603.23059
- 2026-01-09, MineNPC-Task: Task Suite for Memory-Aware Minecraft Agents, Tamil Sudaravan Mohan Doss et.al., Paper: http://arxiv.org/abs/2601.05215
- 2025-12-29, MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning, Jiawei Chen et.al., Paper: http://arxiv.org/abs/2512.23412
- 2025-08-28, MindGuard: Tracking, Detecting, and Attributing MCP Tool Poisoning Attack via Decision Dependence Graph, Zhiqiang Wang et.al., Paper: http://arxiv.org/abs/2508.20412
- 2026-01-08, Mind2Report: A Cognitive Deep Research Agent for Expert-Level Commercial Report Synthesis, Mingyue Cheng et.al., Paper: http://arxiv.org/abs/2601.04879
- 2025-11-18, Mind the Gaps: Measuring Visual Artifacts in Dimensionality Reduction, Jaume Ros et.al., Paper: http://arxiv.org/abs/2511.14544
- 2026-02-18, Mind the GAP: Text Safety Does Not Transfer to Tool-Call Safety in LLM Agents, Arnold Cartagena et.al., Paper: http://arxiv.org/abs/2602.16943
- 2026-02-16, Mind the (DH) Gap! A Contrast in Risky Choices Between Reasoning and Conversational LLMs, Luise Ge et.al., Paper: http://arxiv.org/abs/2602.15173
- 2026-03-23, Mind over Space: Can Multimodal Large Language Models Mentally Navigate?, Qihui Zhu et.al., Paper: http://arxiv.org/abs/2603.21577
- 2026-03-24, Mind Your HEARTBEAT! Claw Background Execution Inherently Enables Silent Memory Pollution, Yechao Zhang et.al., Paper: http://arxiv.org/abs/2603.23064
- 2025-11-14, MicroVQA++: High-Quality Microscopy Reasoning Dataset with Weakly Supervised Graphs for Multimodal Large Language Model, Manyu Li et.al., Paper: http://arxiv.org/abs/2511.11407
- 2026-01-30, MiTa: A Hierarchical Multi-Agent Collaboration Framework with Memory-integrated and Task Allocation, XiaoJie Zhang et.al., Paper: http://arxiv.org/abs/2601.22974
- 2026-03-19, Mi:dm K 2.5 Pro, KT Tech innovation Group et.al., Paper: http://arxiv.org/abs/2603.18788
- 2026-03-17, MetaClaw: Just Talk – An Agent That Meta-Learns and Evolves in the Wild, Peng Xia et.al., Paper: http://arxiv.org/abs/2603.17187
- 2025-10-16, MetaCaptioner: Towards Generalist Visual Captioning with Open-source Suites, Zhenxin Lei et.al., Paper: http://arxiv.org/abs/2510.12126
- 2025-12-18, Meta-RL Induces Exploration in Language Agents, Yulun Jiang et.al., Paper: http://arxiv.org/abs/2512.16848
- 2025-09-08, Meta-Policy Reflexion: Reusable Reflective Memory and Rule Admissibility for Resource-Efficient LLM Agent, Chunlong Wu et.al., Paper: http://arxiv.org/abs/2509.03990
- 2025-10-23, Merge and Conquer: Evolutionarily Optimizing AI for 2048, Maggie Bai et.al., Paper: http://arxiv.org/abs/2510.20205
- 2026-02-18, MerLean: An Agentic Framework for Autoformalization in Quantum Computation, Yuanjie Ren et.al., Paper: http://arxiv.org/abs/2602.16554
- 2025-12-09, Mental Models of Autonomy and Sentience Shape Reactions to AI, Janet V. T. Pauketat et.al., Paper: http://arxiv.org/abs/2512.09085
- 2026-03-25, Memory-Augmented Vision-Language Agents for Persistent and Semantically Consistent Object Captioning, Tommaso Galliena et.al., Paper: http://arxiv.org/abs/2603.24257
- 2025-10-21, Memory-Augmented State Machine Prompting: A Novel LLM Agent Framework for Real-Time Strategy Games, Runnan Qi et.al., Paper: http://arxiv.org/abs/2510.18395
- 2025-12-15, Memory in the Age of AI Agents, Yuyang Hu et.al., Paper: http://arxiv.org/abs/2512.13564
- 2026-01-09, Memory Poisoning Attack and Defense on Memory Based LLM-Agents, Balachandra Devarangadi Sunil et.al., Paper: http://arxiv.org/abs/2601.05504
- 2025-09-27, Memory Management and Contextual Consistency for Long-Running Low-Code Agents, Jiexi Xu et.al., Paper: http://arxiv.org/abs/2509.25250
- 2026-03-20, Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents, Luiz C. Borro et.al., Paper: http://arxiv.org/abs/2603.19935
- 2026-02-24, MemoPhishAgent: Memory-Augmented Multi-Modal LLM Agent for Phishing URL Detection, Xuan Chen et.al., Paper: http://arxiv.org/abs/2602.21394
- 2026-01-12, MemoBrain: Executive Memory as an Agentic Brain for Reasoning, Hongjin Qian et.al., Paper: http://arxiv.org/abs/2601.08079
- 2025-10-22, Memo: Training Memory-Efficient Embodied Agents with Reinforcement Learning, Gunshi Gupta et.al., Paper: http://arxiv.org/abs/2510.19732
- 2026-03-04, Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory, Zhenting Wang et.al., Paper: http://arxiv.org/abs/2603.04257
- 2025-08-25, Memento: Fine-tuning LLM Agents without Fine-tuning LLMs, Huichi Zhou et.al., Paper: http://arxiv.org/abs/2508.16153
- 2026-03-19, Memento-Skills: Let Agents Design Agents, Huichi Zhou et.al., Paper: http://arxiv.org/abs/2603.18743
- 2026-01-07, Membox: Weaving Topic Continuity into Long-Range Memory for LLM Agents, Dehao Tao et.al., Paper: http://arxiv.org/abs/2601.03785
- 2026-02-02, MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents, Haozhen Zhang et.al., Paper: http://arxiv.org/abs/2602.02474
- 2025-11-04, MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning, Qianhao Yuan et.al., Paper: http://arxiv.org/abs/2511.02805
- 2025-12-23, MemR $^3$ : Memory Retrieval via Reflective Reasoning for LLM Agents, Xingbo Du et.al., Paper: http://arxiv.org/abs/2512.20237
- 2025-09-23, MemOrb: A Plug-and-Play Verbal-Reinforcement Memory Layer for E-Commerce Customer Service, Yizhe Huang et.al., Paper: http://arxiv.org/abs/2509.18713
- 2026-03-19, MemMA: Coordinating the Memory Cycle through Multi-Agent Reasoning and In-Situ Self-Evolution, Minhua Lin et.al., Paper: http://arxiv.org/abs/2603.18718
- 2026-03-31, MemFactory: Unified Inference & Training Framework for Agent Memory, Ziliang Guo et.al., Paper: http://arxiv.org/abs/2603.29493
- 2026-01-28, MemCtrl: Using MLLMs as Active Memory Controllers on Embodied Agents, Vishnu Sashank Dorbala et.al., Paper: http://arxiv.org/abs/2601.20831
- 2025-09-30, Mem-α: Learning Memory Construction via Reinforcement Learning, Yu Wang et.al., Paper: http://arxiv.org/abs/2509.25911
- 2026-03-09, Meissa: Multi-modal Medical Agentic Intelligence, Yixiong Chen et.al., Paper: http://arxiv.org/abs/2603.09018
- 2025-11-28, MegaChat: A Synthetic Persian Q&A Dataset for High-Quality Sales Chatbot Evaluation, Mahdi Rahmani et.al., Paper: http://arxiv.org/abs/2511.23397
- 2025-09-15, MedicalOS: An LLM Agent based Operating System for Digital Healthcare, Jared Zhu et.al., Paper: http://arxiv.org/abs/2509.11507
- 2026-01-28, MedViz: An Agent-based, Visual-guided Research Assistant for Navigating Biomedical Literature, Huan He et.al., Paper: http://arxiv.org/abs/2601.20709
- 2026-02-03, MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning, Shengyuan Liu et.al., Paper: http://arxiv.org/abs/2602.03320
- 2025-12-15, MedInsightBench: Evaluating Medical Analytics Agents Through Multi-Step Insight Discovery in Multimodal Medical Data, Zhenghao Zhu et.al., Paper: http://arxiv.org/abs/2512.13297
- 2026-03-05, MedCoRAG: Interpretable Hepatology Diagnosis via Hybrid Evidence Retrieval and Multispecialty Consensus, Zheng Li et.al., Paper: http://arxiv.org/abs/2603.05129
- 2025-10-12, MedCoAct: Confidence-Aware Multi-Agent Collaboration for Complete Clinical Decision, Hongjie Zheng et.al., Paper: http://arxiv.org/abs/2510.10461
- 2026-02-19, MedClarify: An information-seeking AI agent for medical diagnosis with case-specific follow-up questions, Hui Min Wong et.al., Paper: http://arxiv.org/abs/2602.17308
- 2025-07-04, MedAide: Information Fusion and Anatomy of Medical Intents via LLM-based Agent Collaboration, Dingkang Yang et.al., Paper: http://arxiv.org/abs/2410.12532
- 2025-12-12, MedAI: Evaluating TxAgent’s Therapeutic Agentic Reasoning in the NeurIPS CURE-Bench Competition, Tim Cofala et.al., Paper: http://arxiv.org/abs/2512.11682
- 2025-10-21, Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents, Guangfu Guo et.al., Paper: http://arxiv.org/abs/2510.18424
- 2026-03-16, Mechanistic Foundations of Goal-Directed Control, Alma Lago et.al., Paper: http://arxiv.org/abs/2603.15248
- 2026-03-24, Mecha-nudges for Machines, Giulio Frey et.al., Paper: http://arxiv.org/abs/2603.23433
- 2025-10-31, Measuring the Security of Mobile LLM Agents under Adversarial Prompts from Untrusted Third-Party Channels, Chenghao Du et.al., Paper: http://arxiv.org/abs/2510.27140
- 2026-03-19, Measuring and Exploiting Confirmation Bias in LLM-Assisted Security Code Review, Dimitris Mitropoulos et.al., Paper: http://arxiv.org/abs/2603.18740
- 2026-03-19, Meanings and Measurements: Multi-Agent Probabilistic Grounding for Vision-Language Navigation, Swagat Padhan et.al., Paper: http://arxiv.org/abs/2603.19166
- 2026-02-20, Mean-Field Reinforcement Learning without Synchrony, Shan Yang et.al., Paper: http://arxiv.org/abs/2602.18026
- 2025-11-26, Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework, Dong Wang et.al., Paper: http://arxiv.org/abs/2511.21686
- 2025-11-14, MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism, Shulin Liu et.al., Paper: http://arxiv.org/abs/2511.11373
- 2026-03-19, Markov Potential Game and Multi-Agent Reinforcement Learning for Autonomous Driving, Huiwen Yan et.al., Paper: http://arxiv.org/abs/2603.19188
- 2025-11-17, Market-Dependent Communication in Multi-Agent Alpha Generation, Jerick Shi et.al., Paper: http://arxiv.org/abs/2511.13614
- 2025-12-17, Mapis: A Knowledge-Graph Grounded Multi-Agent Framework for Evidence-Based PCOS Diagnosis, Zanxiang He et.al., Paper: http://arxiv.org/abs/2512.15398
- 2025-12-02, Many-body $k$ -local ground states as probes for unitary quantum metrology, Majid Hassani et.al., Paper: http://arxiv.org/abs/2512.02976
- 2025-10-14, ManiAgent: An Agentic Framework for General Robotic Manipulation, Yi Yang et.al., Paper: http://arxiv.org/abs/2510.11660
- 2025-10-01, ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs, Adi Simhi et.al., Paper: http://arxiv.org/abs/2510.00857
- 2026-03-17, Malicious Or Not: Adding Repository Context to Agent Skill Classification, Florian Holzbauer et.al., Paper: http://arxiv.org/abs/2603.16572
- 2026-02-12, MalTool: Malicious Tool Attacks on LLM Agents, Yuepeng Hu et.al., Paper: http://arxiv.org/abs/2602.12194
- 2026-02-05, Making AI Agents Evaluate Misleading Charts without Nudging, Swaroop Panda et.al., Paper: http://arxiv.org/abs/2602.05662
- 2025-10-24, Magellan: Guided MCTS for Latent Space Exploration and Novelty Generation, Lufan Chang et.al., Paper: http://arxiv.org/abs/2510.21341
- 2025-09-04, Maestro: Joint Graph & Config Optimization for Reliable AI Agents, Wenxiao Wang et.al., Paper: http://arxiv.org/abs/2509.04642
- 2025-12-05, Machine Learning-Informed 3+1 Sterile Neutrino Global Fits using Posterior Density Estimation of Electron Disappearance Data, Joshua Villarreal et.al., Paper: http://arxiv.org/abs/2512.05784
- 2026-01-15, M^4olGen: Multi-Agent, Multi-Stage Molecular Generation under Precise Multi-Property Constraints, Yizhan Li et.al., Paper: http://arxiv.org/abs/2601.10131
- 2026-04-02, MTI: A Behavior-Based Temperament Profiling System for AI Agents, Jihoon Jeong et.al., Paper: http://arxiv.org/abs/2604.02145
- 2025-11-25, MTBBench: A Multimodal Sequential Clinical Decision-Making Benchmark in Oncology, Kiril Vasilev et.al., Paper: http://arxiv.org/abs/2511.20490
- 2025-09-22, MSCoRe: A Benchmark for Multi-Stage Collaborative Reasoning in LLM Agents, Yuzhen Lei et.al., Paper: http://arxiv.org/abs/2509.17628
- 2025-10-22, MSC-Bench: A Rigorous Benchmark for Multi-Server Tool Orchestration, Jia-Kai Dong et.al., Paper: http://arxiv.org/abs/2510.19423
- 2026-01-09, MMUEChange: A Generalized LLM Agent Framework for Intelligent Multi-Modal Urban Environment Change Analysis, Zixuan Xiao et.al., Paper: http://arxiv.org/abs/2601.05483
- 2026-03-02, MMNavAgent: Multi-Magnification WSI Navigation Agent for Clinically Consistent Whole-Slide Analysis, Zhengyang Xu et.al., Paper: http://arxiv.org/abs/2603.02079
- 2026-03-10, MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data, Zongxia Li et.al., Paper: http://arxiv.org/abs/2603.09206
- 2025-11-25, MLIPAudit: A benchmarking tool for Machine Learned Interatomic Potentials, Leon Wehrhan et.al., Paper: http://arxiv.org/abs/2511.20487
- 2026-02-03, MIRROR: A Multi-Agent Framework with Iterative Adaptive Revision and Hierarchical Retrieval for Optimization Modeling in Operations Research, Yifan Shi et.al., Paper: http://arxiv.org/abs/2602.03318
- 2025-10-20, MIRAGE: Agentic Framework for Multimodal Misinformation Detection with Web-Grounded Reasoning, Mir Nafis Sharear Shopnil et.al., Paper: http://arxiv.org/abs/2510.17590
- 2026-01-30, MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering, Chuanzhe Guo et.al., Paper: http://arxiv.org/abs/2601.22859
- 2025-12-18, MEPIC: Memory Efficient Position Independent Caching for LLM Serving, Qian Wang et.al., Paper: http://arxiv.org/abs/2512.16822
- 2025-10-21, MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models, ChangSu Choi et.al., Paper: http://arxiv.org/abs/2510.18383
- 2025-11-21, MDG: Masked Denoising Generation for Multi-Agent Behavior Modeling in Traffic Environments, Zhiyu Huang et.al., Paper: http://arxiv.org/abs/2511.17496
- 2025-12-17, MCPZoo: A Large-Scale Dataset of Runnable Model Context Protocol Servers for AI Agent, Mengying Wu et.al., Paper: http://arxiv.org/abs/2512.15144
- 2025-12-31, MCPAgentBench: A Real-world Task Benchmark for Evaluating LLM Agent MCP Tool Use, Wenrui Liu et.al., Paper: http://arxiv.org/abs/2512.24565
- 2026-01-12, MCP-ITP: An Automated Framework for Implicit Tool Poisoning in MCP, Ruiqi Li et.al., Paper: http://arxiv.org/abs/2601.07395
- 2025-10-28, MCP-Flow: Facilitating LLM Agents to Master Real-World, Diverse and Scaling MCP Tools, Wenhao Wang et.al., Paper: http://arxiv.org/abs/2510.24284
- 2026-01-30, MCP-Diag: A Deterministic, Protocol-Driven Architecture for AI-Native Network Diagnostics, Devansh Lodha et.al., Paper: http://arxiv.org/abs/2601.22633
- 2025-08-28, MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers, Zhenting Wang et.al., Paper: http://arxiv.org/abs/2508.20453
- 2025-12-05, MCP-AI: Protocol-Driven Intelligence Framework for Autonomous Reasoning in Healthcare, Zag ElSayed et.al., Paper: http://arxiv.org/abs/2512.05365
- 2025-11-28, MCP vs RAG vs NLWeb vs HTML: A Comparison of the Effectiveness and Efficiency of Different Agent Interfaces to the Web (Technical Report), Aaron Steiner et.al., Paper: http://arxiv.org/abs/2511.23281
- 2025-10-14, MCP Security Bench (MSB): Benchmarking Attacks Against Model Context Protocol in LLM Agents, Dongsen Zhang et.al., Paper: http://arxiv.org/abs/2510.15994
- 2026-01-14, MAXS: Meta-Adaptive Exploration with LLM Agents, Jian Zhang et.al., Paper: http://arxiv.org/abs/2601.09259
- 2025-08-26, MATRIX: Multi-Agent simulaTion fRamework for safe Interactions and conteXtual clinical conversational evaluation, Ernest Lim et.al., Paper: http://arxiv.org/abs/2508.19163
- 2025-12-07, MATEX: A Multi-Agent Framework for Explaining Ethereum Transactions, Zifan Peng et.al., Paper: http://arxiv.org/abs/2512.06933
- 2026-02-16, MATEO: A Multimodal Benchmark for Temporal Reasoning and Planning in LVLMs, Gabriel Roccabruna et.al., Paper: http://arxiv.org/abs/2602.14589
- 2026-02-10, MATA: Multi-Agent Framework for Reliable and Flexible Table Question Answering, Sieun Hyeon et.al., Paper: http://arxiv.org/abs/2602.09642
- 2025-09-30, MASLegalBench: Benchmarking Multi-Agent Systems in Deductive Legal Reasoning, Huihao Jing et.al., Paper: http://arxiv.org/abs/2509.24922
- 2025-12-26, MASFIN: A Multi-Agent System for Decomposed Financial Reasoning and Forecasting, Marc S. Montalvo et.al., Paper: http://arxiv.org/abs/2512.21878
- 2026-01-21, MAS-Orchestra: Understanding and Improving Multi-Agent Reasoning Through Holistic Orchestration and Controlled Benchmarks, Zixuan Ke et.al., Paper: http://arxiv.org/abs/2601.14652
- 2025-09-29, MAS $^2$ : Self-Generative, Self-Configuring, Self-Rectifying Multi-Agent Systems, Kun Wang et.al., Paper: http://arxiv.org/abs/2509.24323
- 2025-10-17, MARS: Reinforcing Multi-Agent Reasoning of LLMs through Self-Play in Strategic Games, Huining Yuan et.al., Paper: http://arxiv.org/abs/2510.15414
- 2025-12-23, MAR:Multi-Agent Reflexion Improves Reasoning Abilities in LLMs, Onat Ozer et.al., Paper: http://arxiv.org/abs/2512.20845
- 2025-12-31, MAMA-Memeia! Multi-Aspect Multi-Agent Collaboration for Depressive Symptoms Identification in Memes, Siddhant Agarwal et.al., Paper: http://arxiv.org/abs/2512.25015
- 2026-03-18, MALLES: A Multi-agent LLMs-based Economic Sandbox with Consumer Preference Alignment, Yusen Wu et.al., Paper: http://arxiv.org/abs/2603.17694
- 2025-12-16, MALCDF: A Distributed Multi-Agent LLM Framework for Real-Time Cyber, Arth Bhardwaj et.al., Paper: http://arxiv.org/abs/2512.14846
- 2025-12-26, MAI-UI Technical Report: Real-World Centric Foundation GUI Agents, Hanzhang Zhou et.al., Paper: http://arxiv.org/abs/2512.22047
- 2025-09-04, MAGneT: Coordinated Multi-Agent Generation of Synthetic Multi-Turn Mental Health Counseling Sessions, Aishik Mandal et.al., Paper: http://arxiv.org/abs/2509.04183
- 2025-10-16, MAGPIE: A benchmark for Multi-AGent contextual PrIvacy Evaluation, Gurusha Juneja et.al., Paper: http://arxiv.org/abs/2510.15186
- 2026-01-27, MAGNET: Towards Adaptive GUI Agents with Memory-Driven Knowledge Evolution, Libo Sun et.al., Paper: http://arxiv.org/abs/2601.19199
- 2026-01-06, MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents, Dongming Jiang et.al., Paper: http://arxiv.org/abs/2601.03236
- 2026-03-04, MAGE: Meta-Reinforcement Learning for Language Agents toward Strategic Exploration and Exploitation, Lu Yang et.al., Paper: http://arxiv.org/abs/2603.03680
- 2025-10-16, MAFA: A Multi-Agent Framework for Enterprise-Scale Annotation with Configurable Task Adaptation, Mahmood Hegazy et.al., Paper: http://arxiv.org/abs/2510.14184
- 2026-01-01, MAESTRO: Multi-Agent Evaluation Suite for Testing, Reliability, and Observability, Tie Ma et.al., Paper: http://arxiv.org/abs/2601.00481
- 2025-10-15, MADREC: A Multi-Aspect Driven LLM Agent for Explainable and Adaptive Recommendation, Jiin Park et.al., Paper: http://arxiv.org/abs/2510.13371
- 2025-11-26, MAD-DAG: Protecting Blockchain Consensus from MEV, Roi Bar-Zur et.al., Paper: http://arxiv.org/abs/2511.21552
- 2026-01-14, MACRO-LLM: LLM-Empowered Multi-Agent Collaborative Reasoning under Spatiotemporal Partial Observability, Handi Chen et.al., Paper: http://arxiv.org/abs/2601.09295
- 2026-03-04, MACC: Multi-Agent Collaborative Competition for Scientific Exploration, Satoshi Oyama et.al., Paper: http://arxiv.org/abs/2603.03780
- 2025-12-15, MAC: A Multi-Agent Framework for Interactive User Clarification in Multi-turn Conversations, Emre Can Acikgoz et.al., Paper: http://arxiv.org/abs/2512.13154
- 2026-02-16, MAC-AMP: A Closed-Loop Multi-Agent Collaboration System for Multi-Objective Antimicrobial Peptide Design, Gen Zhou et.al., Paper: http://arxiv.org/abs/2602.14926
- 2026-03-03, MA-CoNav: A Master-Slave Multi-Agent Framework with Hierarchical Collaboration and Dual-Level Reflection for Long-Horizon Embodied VLN, Ling Luo et.al., Paper: http://arxiv.org/abs/2603.03024
- 2026-02-05, M3: High-fidelity Text-to-Image Generation via Multi-Modal, Multi-Agent and Multi-Round Visual Reasoning, Bangji Yang et.al., Paper: http://arxiv.org/abs/2602.06166
- 2026-01-13, M3-BENCH: Process-Aware Evaluation of LLM Agents Social Behaviors in Mixed-Motive Games, Sixiong Xie et.al., Paper: http://arxiv.org/abs/2601.08462
- 2026-02-19, M2F: Automated Formalization of Mathematical Literature at Scale, Zichen Wang et.al., Paper: http://arxiv.org/abs/2602.17016
- 2026-01-14, M $^3$ Searcher: Modular Multimodal Information Seeking Agency with Retrieval-Oriented Reasoning, Xiaohan Yu et.al., Paper: http://arxiv.org/abs/2601.09278
- 2026-03-09, M $^3$ -ACE: Rectifying Visual Perception in Multimodal Math Reasoning via Multi-Agentic Context Engineering, Peijin Xie et.al., Paper: http://arxiv.org/abs/2603.08369
- 2026-01-04, Lying with Truths: Open-Channel Multi-Agent Collusion for Belief Manipulation via Generative Montage, Jinwei Hu et.al., Paper: http://arxiv.org/abs/2601.01685
- 2026-03-07, Lying to Win: Assessing LLM Deception through Human-AI Games and Parallel-World Probing, Arash Marioriyad et.al., Paper: http://arxiv.org/abs/2603.07202
- 2025-10-31, Lucky Cars in Fubini Rankings and Unit Fubini Rankings, Camilo Barreto et.al., Paper: http://arxiv.org/abs/2510.27574
- 2025-11-13, Low-soundness direct-product testers and PCPs from Kaufman–Oppenheim complexes, Ryan O’Donnell et.al., Paper: http://arxiv.org/abs/2511.10514
- 2025-12-18, Love, Lies, and Language Models: Investigating AI’s Role in Romance-Baiting Scams, Gilad Gressel et.al., Paper: http://arxiv.org/abs/2512.16280
- 2026-01-12, Lost in the Noise: How Reasoning Models Fail with Contextual Distractors, Seongyun Lee et.al., Paper: http://arxiv.org/abs/2601.07226
- 2025-10-27, Lost in Tokenization: Context as the Key to Unlocking Biomolecular Understanding in Scientific LLMs, Kai Zhuang et.al., Paper: http://arxiv.org/abs/2510.23127
- 2026-01-08, Lost in Execution: On the Multilingual Robustness of Tool Calling in Large Language Models, Zheng Luo et.al., Paper: http://arxiv.org/abs/2601.05366
- 2026-03-16, Lore: Repurposing Git Commit Messages as a Structured Knowledge Protocol for AI Coding Agents, Ivan Stetsenko et.al., Paper: http://arxiv.org/abs/2603.15566
- 2025-10-28, Look and Tell: A Dataset for Multimodal Grounding Across Egocentric and Exocentric Views, Anna Deichler et.al., Paper: http://arxiv.org/abs/2510.22672
- 2025-09-27, Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents, Yaorui Shi et.al., Paper: http://arxiv.org/abs/2509.23040
- 2025-12-23, LongVideoAgent: Multi-Agent Reasoning with Long Videos, Runtao Liu et.al., Paper: http://arxiv.org/abs/2512.20618
- 2025-10-08, LongRM: Revealing and Unlocking the Context Boundary of Reward Modeling, Zecheng Tang et.al., Paper: http://arxiv.org/abs/2510.06915
- 2026-01-05, LongDA: Benchmarking LLM Agents for Long-Document Data Analysis, Yiyang Li et.al., Paper: http://arxiv.org/abs/2601.02598
- 2026-01-23, LongCat-Flash-Thinking-2601 Technical Report, Meituan LongCat Team et.al., Paper: http://arxiv.org/abs/2601.16725
- 2026-03-22, LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning, Jianing Wang et.al., Paper: http://arxiv.org/abs/2603.21065
- 2026-02-11, Locomo-Plus: Beyond-Factual Cognitive Memory Evaluation Framework for LLM Agents, Yifei Li et.al., Paper: http://arxiv.org/abs/2602.10715
- 2026-03-14, LiveWeb-IE: A Benchmark For Online Web Information Extraction, Seungbin Yang et.al., Paper: http://arxiv.org/abs/2603.13773
- 2025-11-05, LiveTradeBench: Seeking Real-World Alpha with Large Language Models, Haofei Yu et.al., Paper: http://arxiv.org/abs/2511.03628
- 2026-03-02, LiveCultureBench: a Multi-Agent, Multi-Cultural Benchmark for Large Language Models in Dynamic Social Simulations, Viet-Thanh Pham et.al., Paper: http://arxiv.org/abs/2603.01952
- 2025-11-17, Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly?, Chunqiu Steven Xia et.al., Paper: http://arxiv.org/abs/2511.13646
- 2026-02-02, Live-Evo: Online Evolution of Agentic Memory from Continuous Feedback, Yaolun Zhang et.al., Paper: http://arxiv.org/abs/2602.02369
- 2025-09-30, Lita: Light Agent Uncovers the Agentic Coding Capabilities of LLMs, Hankun Dai et.al., Paper: http://arxiv.org/abs/2509.25873
- 2026-01-29, Liquid Interfaces: A Dynamic Ontology for the Interoperability of Autonomous Systems, Dhiogo de Sá et.al., Paper: http://arxiv.org/abs/2601.21993
- 2025-10-30, Linking Heterogeneous Data with Coordinated Agent Flows for Social Media Analysis, Shifu Chen et.al., Paper: http://arxiv.org/abs/2510.26172
- 2025-12-02, Limiting Reduction and Modified Gravity, Antonis Antoniou et.al., Paper: http://arxiv.org/abs/2512.02871
- 2025-11-25, Limit Order Book Dynamics in Matching Markets:Microstructure, Spread, and Execution Slippage, Yao Wu et.al., Paper: http://arxiv.org/abs/2511.20606
- 2026-03-04, LifeBench: A Benchmark for Long-Horizon Multi-Source Memory, Zihao Cheng et.al., Paper: http://arxiv.org/abs/2603.03781
- 2026-03-06, LieCraft: A Multi-Agent Framework for Evaluating Deceptive Capabilities in Language Models, Matthew Lyle Olson et.al., Paper: http://arxiv.org/abs/2603.06874
- 2025-10-16, LiRA: Linguistic Robust Anchoring for Cross-lingual Large Language Models, Haolin Li et.al., Paper: http://arxiv.org/abs/2510.14466
- 2026-03-02, LexChronos: An Agentic Framework for Structured Event Timeline Extraction in Indian Jurisprudence, Anka Chandrahas Tummepalli et.al., Paper: http://arxiv.org/abs/2603.01651
- 2025-10-11, Leveraging Large Language Models for Cybersecurity Risk Assessment – A Case from Forestry Cyber-Physical Systems, Fikret Mert Gultekin et.al., Paper: http://arxiv.org/abs/2510.06343
- 2025-09-04, Leveraging LLM-Based Agents for Intelligent Supply Chain Planning, Yongzhi Qi et.al., Paper: http://arxiv.org/abs/2509.03811
- 2025-09-26, Leveraging LLM Agents for Automated Video Game Testing, Chengjia Wang et.al., Paper: http://arxiv.org/abs/2509.22170
- 2025-09-07, Let’s Roleplay: Examining LLM Alignment in Collaborative Dialogues, Abhijnan Nath et.al., Paper: http://arxiv.org/abs/2509.05882
- 2026-01-26, Let’s Make Every Pull Request Meaningful: An Empirical Analysis of Developer and Agentic Pull Requests, Haruhiko Yoshioka et.al., Paper: http://arxiv.org/abs/2601.18749
- 2026-02-23, Let There Be Claws: An Early Social Network Analysis of AI Agents on Moltbook, H. C. W. Price et.al., Paper: http://arxiv.org/abs/2602.20044
- 2025-12-31, Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem, Weixun Wang et.al., Paper: http://arxiv.org/abs/2512.24873
- 2026-03-25, LensWalk: Agentic Video Understanding by Planning How You See in Videos, Keliang Li et.al., Paper: http://arxiv.org/abs/2603.24558
- 2026-03-23, Lemma Discovery in Agentic Program Verification, Huan Zhao et.al., Paper: http://arxiv.org/abs/2603.22114
- 2026-03-14, LegacyTranslate: LLM-based Multi-Agent Method for Legacy Code Translation, Zahra Moti et.al., Paper: http://arxiv.org/abs/2603.14054
- 2026-02-05, Learning to Share: Selective Memory for Efficient Parallel Agentic Systems, Joseph Fioresi et.al., Paper: http://arxiv.org/abs/2602.05965
- 2026-03-17, Learning to Present: Inverse Specification Rewards for Agentic Slide Generation, Karthik Ragunath Ananda Kumar et.al., Paper: http://arxiv.org/abs/2603.16839
- 2025-10-22, Learning to Make Friends: Coaching LLM Agents toward Emergent Social Ties, Philipp J. Schneider et.al., Paper: http://arxiv.org/abs/2510.19299
- 2026-02-05, Learning to Inject: Automated Prompt Injection via Reinforcement Learning, Xin Chen et.al., Paper: http://arxiv.org/abs/2602.05746
- 2026-03-03, Learning to Generate and Extract: A Multi-Agent Collaboration Framework For Zero-shot Document-level Event Arguments Extraction, Guangjun Zhang et.al., Paper: http://arxiv.org/abs/2603.02909
- 2026-02-11, Learning to Compose for Cross-domain Agentic Workflow Generation, Jialiang Wang et.al., Paper: http://arxiv.org/abs/2602.11114
- 2026-03-21, Learning to Aggregate Zero-Shot LLM Agents for Corporate Disclosure Classification, Kemal Kirtac et.al., Paper: http://arxiv.org/abs/2603.20965
- 2025-10-09, Learning on the Job: An Experience-Driven Self-Evolving Agent for Long-Horizon Tasks, Cheng Yang et.al., Paper: http://arxiv.org/abs/2510.08002
- 2025-10-22, Learning from Supervision with Semantic and Episodic Memory: A Reflective Approach to Agent Adaptation, Jackson Hassell et.al., Paper: http://arxiv.org/abs/2510.19897
- 2025-09-30, Learning When to Plan: Efficiently Allocating Test-Time Compute for LLM Agents, Davide Paglieri et.al., Paper: http://arxiv.org/abs/2509.03581
- 2026-03-03, Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use, Aradhye Agarwal et.al., Paper: http://arxiv.org/abs/2603.03205
- 2025-12-03, Learning Steerable Clarification Policies with Collaborative Self-play, Jonathan Berant et.al., Paper: http://arxiv.org/abs/2512.04068
- 2025-11-24, Learning Robust Social Strategies with Large Language Models, Dereck Piche et.al., Paper: http://arxiv.org/abs/2511.19405
- 2026-02-05, Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory, Haozhen Zhang et.al., Paper: http://arxiv.org/abs/2602.06025
- 2026-02-18, Learning Personalized Agents from Human Feedback, Kaiqu Liang et.al., Paper: http://arxiv.org/abs/2602.16173
- 2026-01-12, Learning How to Remember: A Meta-Cognitive Management Method for Structured and Transferable Agent Memory, Sirui Liang et.al., Paper: http://arxiv.org/abs/2601.07470
- 2025-12-22, Learning Hierarchical Procedural Memory for LLM Agents through Bayesian Selection and Contrastive Refinement, Saman Forouzandeh et.al., Paper: http://arxiv.org/abs/2512.18950
- 2025-10-19, Learning Ecology with VERA Using Conceptual Models and Simulations, Spencer Rugaber et.al., Paper: http://arxiv.org/abs/2510.16944
- 2025-10-10, Leading the Follower: Learning Persuasive Agents in Social Deduction Games, Zhang Zheng et.al., Paper: http://arxiv.org/abs/2510.09087
- 2025-12-22, LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller, Kirill Djebko et.al., Paper: http://arxiv.org/abs/2512.19576
- 2025-10-28, Law in Silico: Simulating Legal Society with LLM-Based Agents, Yiding Wang et.al., Paper: http://arxiv.org/abs/2510.24442
- 2026-03-31, Latent-Y: A Lab-Validated Autonomous Agent for De Novo Drug Design, Latent Labs Team et.al., Paper: http://arxiv.org/abs/2603.29727
- 2026-03-10, Latent-DARM: Bridging Discrete Diffusion And Autoregressive Models For Reasoning, Lina Berrayana et.al., Paper: http://arxiv.org/abs/2603.09184
- 2025-12-01, Latent Debate: A Surrogate Framework for Interpreting LLM Thinking, Lihu Chen et.al., Paper: http://arxiv.org/abs/2512.01909
- 2025-11-25, Latent Collaboration in Multi-Agent Systems, Jiaru Zou et.al., Paper: http://arxiv.org/abs/2511.20639
- 2025-12-23, Laser: Governing Long-Horizon Agentic Search via Structured Protocol and Context Register, Shuting Wang et.al., Paper: http://arxiv.org/abs/2512.20458
- 2025-10-19, Lark: Biologically Inspired Neuroevolution for Multi-Stakeholder LLM Agents, Dheeraj Chintapalli et.al., Paper: http://arxiv.org/abs/2510.16978
- 2026-03-12, Large language models for optical network O&M: Agent-embedded workflow for automation, Shengnan Li et.al., Paper: http://arxiv.org/abs/2603.11828
- 2025-11-06, Large Language Models for Cyber Security, Raunak Somani et.al., Paper: http://arxiv.org/abs/2511.04508
- 2026-03-26, Large Language Models as Optimization Controllers: Adaptive Continuation for SIMP Topology Optimization, Shaoliang Yang et.al., Paper: http://arxiv.org/abs/2603.25099
- 2026-03-13, Large Language Models as Delivery Rider: Generating Instant Food Delivery Riders’ Routing Decision with LLM Agent Framework, Chengbo Zhang et.al., Paper: http://arxiv.org/abs/2603.12559
- 2025-11-20, Large Language Model-Based Reward Design for Deep Reinforcement Learning-Driven Autonomous Cyber Defense, Sayak Mukherjee et.al., Paper: http://arxiv.org/abs/2511.16483
- 2025-10-22, Large Language Model enabled Mathematical Modeling, Guoyun Zhang et.al., Paper: http://arxiv.org/abs/2510.19895
- 2026-03-25, Language-Grounded Multi-Agent Planning for Personalized and Fair Participatory Urban Sensing, Xusen Guo et.al., Paper: http://arxiv.org/abs/2603.24014
- 2025-10-27, Language Server CLI Empowers Language Agents with Process Rewards, Yifan Zhang et.al., Paper: http://arxiv.org/abs/2510.22907
- 2025-12-30, Language Model Agents Under Attack: A Cross Model-Benchmark of Profit-Seeking Behaviors in Customer Service, Jingyu Zhang et.al., Paper: http://arxiv.org/abs/2512.24415
- 2026-04-01, LangMARL: Natural Language Multi-Agent Reinforcement Learning, Huaiyuan Yao et.al., Paper: http://arxiv.org/abs/2604.00722
- 2026-03-04, LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance, Ioannis Prokopiou et.al., Paper: http://arxiv.org/abs/2603.04293
- 2026-02-18, Label-Consistent Data Generation for Aspect-Based Sentiment Analysis Using LLM Agents, Mohammad H. A. Monfared et.al., Paper: http://arxiv.org/abs/2602.16379
- 2025-10-16, LabOS: The AI-XR Co-Scientist That Sees and Works With Humans, Le Cong et.al., Paper: http://arxiv.org/abs/2510.14861
- 2026-01-27, LVLMs and Humans Ground Differently in Referential Communication, Peter Zeng et.al., Paper: http://arxiv.org/abs/2601.19792
- 2026-01-23, LUMINA: Long-horizon Understanding for Multi-turn Interactive Agents, Amin Rakhsha et.al., Paper: http://arxiv.org/abs/2601.16649
- 2026-03-03, LLandMark: A Multi-Agent Framework for Landmark-Aware Multimodal Interactive Video Retrieval, Minh-Chi Phung et.al., Paper: http://arxiv.org/abs/2603.02888
- 2025-11-10, LLMscape, Gottfried Haider et.al., Paper: http://arxiv.org/abs/2511.07161
- 2025-09-24, LLMs for Bayesian Optimization in Scientific Domains: Are We There Yet?, Rushil Gupta et.al., Paper: http://arxiv.org/abs/2509.21403
- 2026-01-14, LLMs can Compress LLMs: Adaptive Pruning by Agents, Sai Varun Kodathala et.al., Paper: http://arxiv.org/abs/2601.09694
- 2025-10-07, LLMs as Policy-Agnostic Teammates: A Case Study in Human Proxy Design for Heterogeneous Agent Teams, Aju Ani Justus et.al., Paper: http://arxiv.org/abs/2510.06151
- 2026-01-27, LLMs as Orchestrators: Constraint-Compliant Multi-Agent Optimization for Recommendation Systems, Guilin Zhang et.al., Paper: http://arxiv.org/abs/2601.19121
- 2025-09-21, LLMs as Layout Designers: A Spatial Reasoning Perspective, Sha Li et.al., Paper: http://arxiv.org/abs/2509.16891
- 2026-03-19, LLMs Aren’t Human: A Critical Perspective on LLM Personality, Kim Zierahn et.al., Paper: http://arxiv.org/abs/2603.19030
- 2025-09-23, LLMZ+: Contextual Prompt Whitelist Principles for Agentic LLMs, Tom Pawelek et.al., Paper: http://arxiv.org/abs/2509.18557
- 2026-01-20, LLMOrbit: A Circular Taxonomy of Large Language Models -From Scaling Walls to Agentic AI Systems, Badri N. Patro et.al., Paper: http://arxiv.org/abs/2601.14053
- 2026-03-11, LLMGreenRec: LLM-Based Multi-Agent Recommender System for Sustainable E-Commerce, Hao N. Nguyen et.al., Paper: http://arxiv.org/abs/2603.11025
- 2026-02-18, LLM4Cov: Execution-Aware Agentic Learning for High-coverage Testbench Generation, Hejia Zhang et.al., Paper: http://arxiv.org/abs/2602.16953
- 2026-03-06, LLM2SMT: Building an SMT Solver with Zero Human-Written Code, Mikoláš Janota et.al., Paper: http://arxiv.org/abs/2603.06931
- 2025-09-28, LLM/Agent-as-Data-Analyst: A Survey, Zirui Tang et.al., Paper: http://arxiv.org/abs/2509.23988
- 2026-01-27, LLM-based Vulnerability Detection at Project Scale: An Empirical Study, Fengjie Li et.al., Paper: http://arxiv.org/abs/2601.19239
- 2025-10-07, LLM-FS-Agent: A Deliberative Role-based Large Language Model Architecture for Transparent Feature Selection, Mohamed Bal-Ghaoui et.al., Paper: http://arxiv.org/abs/2510.05935
- 2026-03-07, LLM-FK: Multi-Agent LLM Reasoning for Foreign Key Detection in Large-Scale Complex Databases, Zijian Tang et.al., Paper: http://arxiv.org/abs/2603.07278
- 2026-02-09, LLM-Enhanced Wearables for Comprehensible Health Guidance in LMICs, Mohammad Shaharyar Ahsan et.al., Paper: http://arxiv.org/abs/2602.08701
- 2025-12-24, LLM-Empowered Agentic AI for QoE-Aware Network Slicing Management in Industrial IoT, Xudong Wang et.al., Paper: http://arxiv.org/abs/2512.20997
- 2025-11-24, LLM-Driven Stationarity-Aware Expert Demonstrations for Multi-Agent Reinforcement Learning in Mobile Systems, Tianyang Duan et.al., Paper: http://arxiv.org/abs/2511.19368
- 2025-06-27, LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey, Henry Peng Zou et.al., Paper: http://arxiv.org/abs/2505.00753
- 2026-01-28, LLM-AutoDP: Automatic Data Processing via LLM Agents for Model Fine-tuning, Wei Huang et.al., Paper: http://arxiv.org/abs/2601.20375
- 2026-01-14, LLM for Large-Scale Optimization Model Auto-Formulation: A Lightweight Few-Shot Learning Approach, Kuo Liang et.al., Paper: http://arxiv.org/abs/2601.09635
- 2025-11-10, LLM Driven Processes to Foster Explainable AI, Marcel Pehlke et.al., Paper: http://arxiv.org/abs/2511.07086
- 2025-12-01, LLM CHESS: Benchmarking Reasoning and Instruction-Following in LLMs through Chess, Sai Kolasani et.al., Paper: http://arxiv.org/abs/2512.01992
- 2025-09-30, LLM Agents for Knowledge Discovery in Atomic Layer Processing, Andreas Werbrouck et.al., Paper: http://arxiv.org/abs/2509.26201
- 2025-09-23, LLM Agents for Interactive Workflow Provenance: Reference Architecture and Evaluation Methodology, Renan Souza et.al., Paper: http://arxiv.org/abs/2509.13978
- 2026-01-02, LLM Agents for Combinatorial Efficient Frontiers: Investment Portfolio Optimization, Simon Paquette-Greenbaum et.al., Paper: http://arxiv.org/abs/2601.00770
- 2025-10-16, LLM Agents for Automated Web Vulnerability Reproduction: Are We There Yet?, Bin Liu et.al., Paper: http://arxiv.org/abs/2510.14700
- 2025-10-03, LLM Agents for Automated Dependency Upgrades, Vali Tawosi et.al., Paper: http://arxiv.org/abs/2510.03480
- 2025-09-19, LLM Agents at the Roundtable: A Multi-Perspective and Dialectical Reasoning Framework for Essay Scoring, Jinhee Jang et.al., Paper: http://arxiv.org/abs/2509.14834
- 2025-10-16, LLM Agents Beyond Utility: An Open-Ended Perspective, Asen Nachkov et.al., Paper: http://arxiv.org/abs/2510.14548
- 2025-09-25, LLM Agent Meets Agentic AI: Can LLM Agents Simulate Customers to Evaluate Agentic-AI-based Shopping Assistants?, Lu Sun et.al., Paper: http://arxiv.org/abs/2509.21501
- 2026-01-06, LLM Agent Framework for Intelligent Change Analysis in Urban Environment using Remote Sensing Imagery, Zixuan Xiao et.al., Paper: http://arxiv.org/abs/2601.02757
- 2026-02-06, LLM Active Alignment: A Nash Equilibrium Perspective, Tonghan Wang et.al., Paper: http://arxiv.org/abs/2602.06836
- 2025-09-25, LIMI: Less is More for Agency, Yang Xiao et.al., Paper: http://arxiv.org/abs/2509.17567
- 2026-01-09, LIDL: LLM Integration Defect Localization via Knowledge Graph-Enhanced Multi-Agent Analysis, Gou Tan et.al., Paper: http://arxiv.org/abs/2601.05539
- 2025-12-11, LEO-RobotAgent: A General-purpose Robotic Agent for Language-driven Embodied Operator, Lihuang Chen et.al., Paper: http://arxiv.org/abs/2512.10605
- 2025-11-04, LEGO-Eval: Towards Fine-Grained Evaluation on Synthesizing 3D Embodied Environments with Tool Augmentation, Gyeom Hwangbo et.al., Paper: http://arxiv.org/abs/2511.03001
- 2026-01-30, LEAP – Live Experiments for Active Pedagogy, Sumedh Karajagi et.al., Paper: http://arxiv.org/abs/2601.22534
- 2025-09-23, LCMF: Lightweight Cross-Modality Mambaformer for Embodied Robotics VQA, Zeyi Kang et.al., Paper: http://arxiv.org/abs/2509.18576
- 2025-10-21, LAFA: Agentic LLM-Driven Federated Analytics over Decentralized Data Sources, Haichao Ji et.al., Paper: http://arxiv.org/abs/2510.18477
- 2025-10-08, LAD-RAG: Layout-aware Dynamic RAG for Visually-Rich Document Understanding, Zhivar Sourati et.al., Paper: http://arxiv.org/abs/2510.07233
- 2026-03-12, LABSHIELD: A Multimodal Benchmark for Safety-Critical Reasoning and Planning in Scientific Laboratories, Qianpu Sun et.al., Paper: http://arxiv.org/abs/2603.11987
- 2025-10-08, L2M-AID: Autonomous Cyber-Physical Defense by Fusing Semantic Reasoning of Large Language Models with Multi-Agent Reinforcement Learning (Preprint), Tianxiang Xu et.al., Paper: http://arxiv.org/abs/2510.07363
- 2026-01-14, KryptoPilot: An Open-World Knowledge-Augmented LLM Agent for Automated Cryptographic Exploitation, Xiaonan Liu et.al., Paper: http://arxiv.org/abs/2601.09129
- 2025-11-05, Kosmos: An AI Scientist for Autonomous Discovery, Ludovico Mitchener et.al., Paper: http://arxiv.org/abs/2511.02824
- 2026-01-16, Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation, Pingzhi Tang et.al., Paper: http://arxiv.org/abs/2601.11258
- 2026-03-31, Knowledge database development by large language models for countermeasures against viruses and marine toxins, Hung N. Do et.al., Paper: http://arxiv.org/abs/2603.29149
- 2026-03-24, Knowledge Access Beats Model Size: Memory Augmented Routing for Persistent AI Agents, Xunzhuo Liu et.al., Paper: http://arxiv.org/abs/2603.23013
- 2026-02-16, Knowing Isn’t Understanding: Re-grounding Generative Proactivity with Epistemic and Behavioral Insight, Kirandeep Kaur et.al., Paper: http://arxiv.org/abs/2602.15259
- 2025-10-29, KnowCoder-A1: Incentivizing Agentic Reasoning Capability with Outcome Supervision for KBQA, Zhuo Chen et.al., Paper: http://arxiv.org/abs/2510.25101
- 2026-03-12, Kinetic SIS opinion-driven models with asymmetric awareness feedback: macroscopic limit and polarization, Juan Pablo Pinasco et.al., Paper: http://arxiv.org/abs/2603.12041
- 2026-03-17, Kestrel: Grounding Self-Refinement for LVLM Hallucination Mitigation, Jiawei Mao et.al., Paper: http://arxiv.org/abs/2603.16664
- 2026-02-19, KLong: Training LLM Agent for Extremely Long-horizon Tasks, Yue Liu et.al., Paper: http://arxiv.org/abs/2602.17547
- 2026-03-22, KLDrive: Fine-Grained 3D Scene Reasoning for Autonomous Driving based on Knowledge Graph, Ye Tian et.al., Paper: http://arxiv.org/abs/2603.21029
- 2026-01-04, KGCE: Knowledge-Augmented Dual-Graph Evaluator for Cross-Platform Educational Agent Benchmarking with Multimodal Language Models, Zixian Liu et.al., Paper: http://arxiv.org/abs/2601.01366
- 2025-10-11, KG-MAS: Knowledge Graph-Enhanced Multi-Agent Infrastructure for coupling physical and digital robotic environments, Walid Abdela et.al., Paper: http://arxiv.org/abs/2510.10325
- 2024-02-20, KG-Agent: An Efficient Autonomous Agent Framework for Complex Reasoning over Knowledge Graph, Jinhao Jiang et.al., Paper: http://arxiv.org/abs/2402.11163
- 2025-10-21, KAT-Coder Technical Report, Zizheng Zhan et.al., Paper: http://arxiv.org/abs/2510.18779
- 2026-03-05, KARL: Knowledge Agents via Reinforcement Learning, Jonathan D. Chang et.al., Paper: http://arxiv.org/abs/2603.05218
- 2025-10-05, Just-in-time Episodic Feedback Hinter: Leveraging Offline Knowledge to Improve LLM Agents Adaptation, Hadi Nekoei et.al., Paper: http://arxiv.org/abs/2510.04373
- 2026-01-26, Just-In-Time Reinforcement Learning: Continual Learning in LLM Agents Without Gradient Updates, Yibo Li et.al., Paper: http://arxiv.org/abs/2601.18510
- 2026-01-12, JudgeFlow: Agentic Workflow Optimization via Block Judge, Zihan Ma et.al., Paper: http://arxiv.org/abs/2601.07477
- 2025-09-26, JudgeAgent: Knowledge-wise and Dynamic LLM Evaluation with Agent-as-Interviewer, Zhichao Shi et.al., Paper: http://arxiv.org/abs/2509.02097
- 2025-11-06, Jr. AI Scientist and Its Risk Report: Autonomous Scientific Exploration from a Baseline Paper, Atsuyuki Miyai et.al., Paper: http://arxiv.org/abs/2511.04583
- 2025-10-01, JoyAgent-JDGenie: Technical Report on the GAIA, Jiarun Liu et.al., Paper: http://arxiv.org/abs/2510.00510
- 2026-01-05, Jenius Agent: Towards Experience-Driven Accuracy Optimization in Real-World Scenarios, Defei Xia et.al., Paper: http://arxiv.org/abs/2601.01857
- 2026-02-23, Janus-Faced Technological Progress and the Arms Race in the Education of Humans and Chatbots, Wolfgang Kuhle et.al., Paper: http://arxiv.org/abs/2602.19783
- 2026-02-27, Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking, Zhicheng Fang et.al., Paper: http://arxiv.org/abs/2602.24009
- 2026-03-05, Jagarin: A Three-Layer Architecture for Hibernating Personal Duty Agents on Mobile, Ravi Kiran Kadaboina et.al., Paper: http://arxiv.org/abs/2603.05069
- 2025-11-10, JPRO: Automated Multimodal Jailbreaking via Multi-Agent Collaboration Framework, Yuxuan Zhou et.al., Paper: http://arxiv.org/abs/2511.07315
- 2025-10-21, JAUNT: Joint Alignment of User Intent and Network State for QoE-centric LLM Tool Routing, Enhan Li et.al., Paper: http://arxiv.org/abs/2510.18550
- 2026-01-29, JADE: Bridging the Strategic-Operational Gap in Dynamic Agentic RAG, Yiqun Chen et.al., Paper: http://arxiv.org/abs/2601.21916
- 2025-12-29, It’s a TRAP! Task-Redirecting Agent Persuasion Benchmark for Web Agents, Karolina Korgul et.al., Paper: http://arxiv.org/abs/2512.23128
- 2026-03-18, Is Your LLM-as-a-Recommender Agent Trustable? LLMs’ Recommendation is Easily Hacked by Biases (Preferences), Zichen Tang et.al., Paper: http://arxiv.org/abs/2603.17417
- 2026-03-09, IronEngine: Towards General AI Assistant, Xi Mo et.al., Paper: http://arxiv.org/abs/2603.08425
- 2025-10-20, Investigating the Impact of Dark Patterns on LLM-Based Web Agents, Devin Ersoy et.al., Paper: http://arxiv.org/abs/2510.18113
- 2026-01-28, Investigating the Development of Task-Oriented Communication in Vision-Language Models, Boaz Carmeli et.al., Paper: http://arxiv.org/abs/2601.20641
- 2025-11-17, Investigating the Dark Energy Constraint from Strongly Lensed AGN at LSST-Scale, Sydney Erickson et.al., Paper: http://arxiv.org/abs/2511.13669
- 2025-10-28, Investigating Software Aging in LLM-Generated Software Systems, César Santos et.al., Paper: http://arxiv.org/abs/2510.24188
- 2026-04-01, Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time, Razvan Mihai Popescu et.al., Paper: http://arxiv.org/abs/2604.00917
- 2025-12-05, Invariant Price of Anarchy: a Metric for Welfarist Traffic Control, Ilia Shilov et.al., Paper: http://arxiv.org/abs/2512.05843
- 2025-12-04, Introduction to quantum control: From basic concepts to applications in quantum technologies, Christiane P. Koch et.al., Paper: http://arxiv.org/abs/2512.04990
- 2026-01-23, Introducing the Generative Application Firewall (GAF), Joan Vendrell Farreny et.al., Paper: http://arxiv.org/abs/2601.15824
- 2026-03-14, InterventionLens: A Multi-Agent Framework for Detecting ASD Intervention Strategies in Parent-Child Shared Reading, Xiao Wang et.al., Paper: http://arxiv.org/abs/2603.13710
- 2026-03-16, InterveneBench: Benchmarking LLMs for Intervention Reasoning and Causal Study Design in Real Social Systems, Shaojie Shi et.al., Paper: http://arxiv.org/abs/2603.15542
- 2026-03-18, Interpretable Traffic Responsibility from Dashcam Video via Legal Multi Agent Reasoning, Jingchun Yang et.al., Paper: http://arxiv.org/abs/2603.17930
- 2026-03-17, Interpretable Context Methodology: Folder Structure as Agentic Architecture, Jake Van Clief et.al., Paper: http://arxiv.org/abs/2603.16021
- 2026-01-21, Interoperable Architecture for Digital Identity Delegation for AI Agents with Blockchain Integration, David Ricardo Saavedra et.al., Paper: http://arxiv.org/abs/2601.14982
- 2024-07-11, Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence, Weize Chen et.al., Paper: http://arxiv.org/abs/2407.07061
- 2025-09-05, Internet 3.0: Architecture for a Web-of-Agents with it’s Algorithm for Ranking Agents, Rajesh Tembarai Krishnamachari et.al., Paper: http://arxiv.org/abs/2509.04979
- 2025-10-16, Internalizing World Models via Self-Play Finetuning for Agentic RL, Shiqi Chen et.al., Paper: http://arxiv.org/abs/2510.15047
- 2026-02-10, Internalizing Multi-Agent Reasoning for Accurate and Efficient LLM-based Recommendation, Yang Wu et.al., Paper: http://arxiv.org/abs/2602.09829
- 2026-03-17, Internalizing Agency from Reflective Experience, Rui Ge et.al., Paper: http://arxiv.org/abs/2603.16843
- 2025-10-05, Internal World Models as Imagination Networks in Cognitive Agents, Saurabh Ranjan et.al., Paper: http://arxiv.org/abs/2510.04391
- 2026-04-01, Internal APIs Are All You Need: Shadow APIs, Shared Discovery, and the Case Against Browser-First Agent Architectures, Lewis Tham et.al., Paper: http://arxiv.org/abs/2604.00694
- 2026-02-09, InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery, Shiyang Feng et.al., Paper: http://arxiv.org/abs/2602.08990
- 2025-08-27, Interactive Graph Visualization and TeamingRecommendation in an Interdisciplinary Project’sTalent Knowledge Graph, Jiawei Xu et.al., Paper: http://arxiv.org/abs/2508.19489
- 2026-02-24, Interaction-aware Representation Modeling with Co-occurrence Consistency for Egocentric Hand-Object Parsing, Yuejiao Su et.al., Paper: http://arxiv.org/abs/2602.20597
- 2025-11-03, Interaction as Intelligence Part II: Asynchronous Human-Agent Rollout for Long-Horizon Task Training, Dayuan Fu et.al., Paper: http://arxiv.org/abs/2510.27630
- 2026-02-23, Interaction Theater: A case of LLM Agents Interacting at Scale, Sarath Shekkizhar et.al., Paper: http://arxiv.org/abs/2602.20059
- 2025-10-31, Interact-RAG: Reason and Interact with the Corpus, Beyond Black-Box Retrieval, Yulong Hui et.al., Paper: http://arxiv.org/abs/2510.27566
- 2026-02-04, InterPReT: Interactive Policy Restructuring and Training Enable Effective Imitation Learning from Laypersons, Feiyu Gavin Zhu et.al., Paper: http://arxiv.org/abs/2602.04213
- 2026-03-13, InterDeepResearch: Enabling Human-Agent Collaborative Information Seeking through Interactive Deep Research, Bo Pan et.al., Paper: http://arxiv.org/abs/2603.12608
- 2025-11-05, Inter-Agent Trust Models: A Comparative Study of Brief, Claim, Proof, Stake, Reputation and Constraint in Agentic Web Protocol Design-A2A, AP2, ERC-8004, and Beyond, Botao ‘Amber’ Hu et.al., Paper: http://arxiv.org/abs/2511.03434
- 2025-12-16, IntentMiner: Intent Inversion Attack via Tool Call Analysis in the Model Context Protocol, Yunhao Yao et.al., Paper: http://arxiv.org/abs/2512.14166
- 2026-03-17, Intent Formalization: A Grand Challenge for Reliable Coding in the Age of AI Agents, Shuvendu K. Lahiri et.al., Paper: http://arxiv.org/abs/2603.17150
- 2026-03-16, Intelligent Co-Design: An Interactive LLM Framework for Interior Spatial Design via Multi-Modal Agents, Ren Jian Lim et.al., Paper: http://arxiv.org/abs/2603.15341
- 2026-02-12, Intelligent AI Delegation, Nenad Tomašev et.al., Paper: http://arxiv.org/abs/2602.11865
- 2025-12-03, Integrating High Performance In-Memory Data Streaming and In-Situ Visualization in Hybrid MPI+OpenMP PIC MC Simulations Towards Exascale, Jeremy J. Williams et.al., Paper: http://arxiv.org/abs/2512.03914
- 2025-11-26, Integrated emitters with CMOS-compatible tuning for large scale quantum SiN photonic circuits, Jasper De Witte et.al., Paper: http://arxiv.org/abs/2511.21529
- 2025-12-09, Insured Agents: A Decentralized Trust Insurance Mechanism for Agentic Economy, Botao ‘Amber’ Hu et.al., Paper: http://arxiv.org/abs/2512.08737
- 2025-09-01, Instructional Agents: LLM Agents on Automated Course Material Generation for Teaching Faculties, Huaiyuan Yao et.al., Paper: http://arxiv.org/abs/2508.19611
- 2026-01-15, Institutional AI: A Governance Framework for Distributional AGI Safety, Federico Pierucci et.al., Paper: http://arxiv.org/abs/2601.10599
- 2025-10-21, InspectCoder: Dynamic Analysis-Enabled Self Repair through interactive LLM-Debugger Collaboration, Yunkun Wang et.al., Paper: http://arxiv.org/abs/2510.18327
- 2025-12-12, Insight Miner: A Time Series Analysis Dataset for Cross-Domain Alignment with Natural Language, Yunkai Zhang et.al., Paper: http://arxiv.org/abs/2512.11251
- 2025-11-03, InnovatorBench: Evaluating Agents’ Ability to Conduct Innovative LLM Research, Yunze Wu et.al., Paper: http://arxiv.org/abs/2510.27598
- 2025-12-01, InnoGym: Benchmarking the Innovation Potential of AI Agents, Jintian Zhang et.al., Paper: http://arxiv.org/abs/2512.01822
- 2025-09-26, Infusing Theory of Mind into Socially Intelligent LLM Agents, EunJeong Hwang et.al., Paper: http://arxiv.org/abs/2509.22887
- 2026-03-25, Infrastructure for Valuable, Tradable, and Verifiable Agent Memory, Mengyuan Li et.al., Paper: http://arxiv.org/abs/2603.24564
- 2025-10-16, Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents, Guoqing Wang et.al., Paper: http://arxiv.org/abs/2510.14967
- 2025-10-04, InfoMosaic-Bench: Evaluating Multi-Source Information Seeking in Tool-Augmented Agents, Yaxin Du et.al., Paper: http://arxiv.org/abs/2510.02271
- 2026-01-07, InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training, Ziyun Zhang et.al., Paper: http://arxiv.org/abs/2601.04126
- 2025-09-30, InfiAgent: Self-Evolving Pyramid Agent Framework for Infinite Scenarios, Chenglin Yu et.al., Paper: http://arxiv.org/abs/2509.22502
- 2026-01-06, InfiAgent: An Infinite-Horizon Framework for General-Purpose Autonomous Agents, Chenglin Yu et.al., Paper: http://arxiv.org/abs/2601.03204
- 2026-01-13, Inferring Latent Intentions: Attributional Natural Language Inference in LLM Agents, Xin Quan et.al., Paper: http://arxiv.org/abs/2601.08742
- 2025-11-21, IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation, Yifan Li et.al., Paper: http://arxiv.org/abs/2511.17384
- 2025-08-30, Inducing State Anxiety in LLM Agents Reproduces Human-Like Biases in Consumer Decision-Making, Ziv Ben-Zion et.al., Paper: http://arxiv.org/abs/2510.06222
- 2026-03-12, IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse, Yushi Bai et.al., Paper: http://arxiv.org/abs/2603.12201
- 2026-03-12, Increasing intelligence in AI agents can worsen collective outcomes, Neil F. Johnson et.al., Paper: http://arxiv.org/abs/2603.12129
- 2025-12-16, Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis, Yankai Jiang et.al., Paper: http://arxiv.org/abs/2512.14157
- 2025-10-27, Incentivizing Agentic Reasoning in LLM Judges via Tool-Integrated Reinforcement Learning, Ran Xu et.al., Paper: http://arxiv.org/abs/2510.23038
- 2025-12-21, InSight-o3: Empowering Multimodal Foundation Models with Generalized Visual Search, Kaican Li et.al., Paper: http://arxiv.org/abs/2512.18745
- 2025-12-02, InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration, Zhongyu Yang et.al., Paper: http://arxiv.org/abs/2512.02981
- 2026-02-13, In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach, Yiran Gao et.al., Paper: http://arxiv.org/abs/2602.13156
- 2025-10-15, In-Browser LLM-Guided Fuzzing for Real-Time Prompt Injection Testing in Agentic AI Browsers, Avihay Cohen et.al., Paper: http://arxiv.org/abs/2510.13543
- 2026-03-18, In Trust We Survive: Emergent Trust Learning, Qianpu Chen et.al., Paper: http://arxiv.org/abs/2603.17564
- 2026-02-17, In Agents We Trust, but Who Do Agents Trust? Latent Source Preferences Steer LLM Generations, Mohammad Aflah Khan et.al., Paper: http://arxiv.org/abs/2602.15456
- 2025-09-28, Improving the Efficiency of LLM Agent Systems through Trajectory Reduction, Yuan-An Xiao et.al., Paper: http://arxiv.org/abs/2509.23586
- 2026-01-22, Improving Methodologies for Agentic Evaluations Across Domains: Leakage of Sensitive Information, Fraud and Cybersecurity Threats, Ee Wei Seah et.al., Paper: http://arxiv.org/abs/2601.15679
- 2026-02-17, Improving MLLMs in Embodied Exploration and Question Answering with Human-Inspired Memory Modeling, Ji Li et.al., Paper: http://arxiv.org/abs/2602.15513
- 2026-01-14, Improving Implicit Hate Speech Detection via a Community-Driven Multi-Agent Framework, Ewelina Gajewska et.al., Paper: http://arxiv.org/abs/2601.09342
- 2025-10-03, Improving GUI Grounding with Explicit Position-to-Coordinate Mapping, Suyuchen Wang et.al., Paper: http://arxiv.org/abs/2510.03230
- 2025-10-23, ImpossibleBench: Measuring LLMs’ Propensity of Exploiting Test Cases, Ziqian Zhong et.al., Paper: http://arxiv.org/abs/2510.20270
- 2025-09-26, Impact of Collective Behaviors of Autonomous Vehicles on Urban Traffic Dynamics: A Multi-Agent Reinforcement Learning Approach, Ahmet Onur Akman et.al., Paper: http://arxiv.org/abs/2509.22216
- 2026-02-10, Immersion in the GitHub Universe: Scaling Coding Agents to Mastery, Jiale Zhao et.al., Paper: http://arxiv.org/abs/2602.09892
- 2025-12-16, Imitation Learning for Multi-turn LM Agents via On-policy Expert Corrections, Niklas Lauffer et.al., Paper: http://arxiv.org/abs/2512.14895
- 2025-11-14, ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation, Kaishen Wang et.al., Paper: http://arxiv.org/abs/2511.11483
- 2025-11-28, Identifying bars in galaxies using machine learning, Rajit Shrivastava et.al., Paper: http://arxiv.org/abs/2511.23383
- 2026-02-06, Identifying Adversary Tactics and Techniques in Malware Binaries with an LLM Agent, Zhou Xuan et.al., Paper: http://arxiv.org/abs/2602.06325
- 2026-02-23, ISO-Bench: Can Coding Agents Optimize Real-World Inference Workloads?, Ayush Nangia et.al., Paper: http://arxiv.org/abs/2602.19594
- 2026-02-11, ISD-Agent-Bench: A Comprehensive Benchmark for Evaluating LLM-based Instructional Design Agents, YoungHoon Jeon et.al., Paper: http://arxiv.org/abs/2602.10620
- 2025-08-22, IR-Agent: Expert-Inspired LLM Agents for Structure Elucidation from Infrared Spectra, Heewoong Noh et.al., Paper: http://arxiv.org/abs/2508.16112
- 2026-03-17, IQuest-Coder-V1 Technical Report, Jian Yang et.al., Paper: http://arxiv.org/abs/2603.16733
- 2025-11-19, INQUIRE-Search: A Framework for Interactive Discovery in Large-Scale Biodiversity Databases, Edward Vendrow et.al., Paper: http://arxiv.org/abs/2511.15656
- 2026-01-21, INFA-Guard: Mitigating Malicious Propagation via Infection-Aware Safeguarding in LLM-Based Multi-Agent Systems, Yijin Zhou et.al., Paper: http://arxiv.org/abs/2601.14667
- 2025-10-14, IL3D: A Large-Scale Indoor Layout Dataset for LLM-Driven 3D Scene Generation, Wenxu Zhou et.al., Paper: http://arxiv.org/abs/2510.12095
- 2026-03-18, IEMAS: An Incentive-Efficiency Routing Framework for Open Agentic Web Ecosystems, Hongze Liu et.al., Paper: http://arxiv.org/abs/2603.17302
- 2026-02-24, ICON: Indirect Prompt Injection Defense for Agents based on Inference-Time Correction, Che Wang et.al., Paper: http://arxiv.org/abs/2602.20708
- 2026-03-19, I Can’t Believe It’s Corrupt: Evaluating Corruption in Multi-Agent Governance Systems, Vedanta S P et.al., Paper: http://arxiv.org/abs/2603.18894
- 2025-12-02, Hypothesis Testing for Generalized Thurstone Models, Anuran Makur et.al., Paper: http://arxiv.org/abs/2512.02912
- 2025-09-10, HypoGeneAgent: A Hypothesis Language Agent for Gene-Set Cluster Resolution Selection Using Perturb-seq Datasets, Ying Yuan et.al., Paper: http://arxiv.org/abs/2509.09740
- 2026-01-14, Hybrid guided variational autoencoder for visual place recognition, Ni Wang et.al., Paper: http://arxiv.org/abs/2601.09248
- 2026-03-12, Hybrid Human-Agent Social Dilemmas in Energy Markets, Isuri Perera et.al., Paper: http://arxiv.org/abs/2603.11834
- 2026-01-13, Hybrid Distillation with CoT Guidance for Edge-Drone Control Code Generation, Yizhan Feng et.al., Paper: http://arxiv.org/abs/2601.08412
- 2026-03-20, HyEvo: Self-Evolving Hybrid Agentic Workflows for Efficient Reasoning, Beibei Xu et.al., Paper: http://arxiv.org/abs/2603.19639
- 2025-10-24, Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine, Wenyi Wang et.al., Paper: http://arxiv.org/abs/2510.21614
- 2026-02-17, Hunt Globally: Wide Search AI Agents for Drug Asset Scouting in Investing, Business Development, and Competitive Intelligence, Alisa Vinogradova et.al., Paper: http://arxiv.org/abs/2602.15019
- 2026-03-23, Human-Inspired Pavlovian and Instrumental Learning for Autonomous Agent Navigation, Jingfeng Shan et.al., Paper: http://arxiv.org/abs/2603.22170
- 2026-02-23, Human-Guided Agentic AI for Multimodal Clinical Prediction: Lessons from the AgentDS Healthcare Benchmark, Lalitha Pranathi Pulavarthy et.al., Paper: http://arxiv.org/abs/2602.19502
- 2025-10-23, Human-Centered LLM-Agent System for Detecting Anomalous Digital Asset Transactions, Gyuyeon Na et.al., Paper: http://arxiv.org/abs/2510.20102
- 2025-10-22, Human-Agent Collaborative Paper-to-Page Crafting for Under $0.1, Qianli Ma et.al., Paper: http://arxiv.org/abs/2510.19600
- 2025-11-14, Human-AI collaborative autonomous synthesis with pulsed laser deposition for remote epitaxy, Asraful Haque et.al., Paper: http://arxiv.org/abs/2511.11558
- 2026-02-10, Human-AI Synergy Supports Collective Creative Search, Chenyi Li et.al., Paper: http://arxiv.org/abs/2602.10001
- 2026-03-13, Human-AI Collaborative Autonomous Experimentation With Proxy Modeling for Comparative Observation, Arpan Biswas et.al., Paper: http://arxiv.org/abs/2603.12618
- 2025-09-22, Human vs. Agent in Task-Oriented Conversations, Zhefan Wang et.al., Paper: http://arxiv.org/abs/2509.17619
- 2026-02-13, Human Tool: An MCP-Style Framework for Human-Agent Collaboration, Yuanrong Tang et.al., Paper: http://arxiv.org/abs/2602.12953
- 2025-12-08, Human Agency and Creativity in AI-Assisted Learning Environments, Yun Dai et.al., Paper: http://arxiv.org/abs/2512.07117
- 2026-03-03, How to Model AI Agents as Personas?: Applying the Persona Ecosystem Playground to 41,300 Posts on Moltbook for Behavioral Insights, Danial Amin et.al., Paper: http://arxiv.org/abs/2603.03140
- 2026-01-21, How to Build AI Agents by Augmenting LLMs with Codified Human Expert Domain Knowledge? A Software Engineering Framework, Choro Ulan uulu et.al., Paper: http://arxiv.org/abs/2601.15153
- 2026-01-29, How do Visual Attributes Influence Web Agents? A Comprehensive Evaluation of User Interface Design Factors, Kuai Yu et.al., Paper: http://arxiv.org/abs/2601.21961
- 2025-09-19, How do Language Models Generate Slang: A Systematic Comparison between Human and Machine-Generated Slang Usages, Siyang Wu et.al., Paper: http://arxiv.org/abs/2509.15518
- 2025-10-10, How can we assess human-agent interactions? Case studies in software agent design, Valerie Chen et.al., Paper: http://arxiv.org/abs/2510.09801
- 2026-03-25, How are AI agents used? Evidence from 177,000 MCP tools, Merlin Stein et.al., Paper: http://arxiv.org/abs/2603.23802
- 2026-03-31, How and Why Agents Can Identify Bug-Introducing Commits, Niklas Risse et.al., Paper: http://arxiv.org/abs/2603.29378
- 2026-02-24, How Foundational Skills Influence VLM-based Embodied Agents:A Native Perspective, Bo Peng et.al., Paper: http://arxiv.org/abs/2602.20687
- 2025-12-01, How Far Are We from Genuinely Useful Deep Research Agents?, Dingling Zhang et.al., Paper: http://arxiv.org/abs/2512.01948
- 2025-09-17, How Does Cognitive Bias Affect Large Language Models? A Case Study on the Anchoring Effect in Price Negotiation Simulations, Yoshiki Takenami et.al., Paper: http://arxiv.org/abs/2508.21137
- 2025-12-08, How Do LLMs Fail In Agentic Scenarios? A Qualitative Analysis of Success and Failure Scenarios of Various LLMs in Agentic Simulations, JV Roig et.al., Paper: http://arxiv.org/abs/2512.07497
- 2025-12-25, How Do Agents Perform Code Optimization? An Empirical Study, Huiyun Peng et.al., Paper: http://arxiv.org/abs/2512.21757
- 2025-12-31, How Do Agentic AI Systems Deal With Software Energy Concerns? A Pull Request-Based Study, Tanjum Motin Mitul et.al., Paper: http://arxiv.org/abs/2512.24636
- 2025-12-31, How Do Agentic AI Systems Address Performance Optimizations? A BERTopic-Based Analysis of Pull Requests, Md Nahidul Islam Opu et.al., Paper: http://arxiv.org/abs/2512.24630
- 2025-10-26, How Do AI Agents Do Human Work? Comparing AI and Human Workflows Across Diverse Occupations, Zora Zhiruo Wang et.al., Paper: http://arxiv.org/abs/2510.22780
- 2026-03-17, How Clued up are LLMs? Evaluating Multi-Step Deductive Reasoning in a Text-Based Game Environment, Rebecca Ansell et.al., Paper: http://arxiv.org/abs/2603.17169
- 2025-09-01, How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on $τ$ -bench, Venkatesh Mishra et.al., Paper: http://arxiv.org/abs/2508.20931
- 2026-02-19, How AI Coding Agents Communicate: A Study of Pull Request Description Characteristics and Human Review Responses, Kan Watanabe et.al., Paper: http://arxiv.org/abs/2602.17084
- 2026-01-20, HoverAI: An Embodied Aerial Agent for Natural Human-Drone Interaction, Yuhua Jin et.al., Paper: http://arxiv.org/abs/2601.13801
- 2026-01-14, Honesty-Aware Multi-Agent Framework for High-Fidelity Synthetic Data Generation in Digital Psychiatric Intake Doctor-Patient Interactions, Xinyuan Zhang et.al., Paper: http://arxiv.org/abs/2601.09216
- 2026-03-12, HomeSafe-Bench: Evaluating Vision-Language Models on Unsafe Action Detection for Embodied Agents in Household Scenarios, Jiayue Pu et.al., Paper: http://arxiv.org/abs/2603.11975
- 2026-01-21, Holmes: An Evidence-Grounded LLM Agent for Auditable DDoS Investigation in Cloud Networks, Haodong Chen et.al., Paper: http://arxiv.org/abs/2601.14601
- 2025-10-13, Holistic Agent Leaderboard: The Missing Infrastructure for AI Agent Evaluation, Sayash Kapoor et.al., Paper: http://arxiv.org/abs/2510.11977
- 2026-04-01, HippoCamp: Benchmarking Contextual Agents on Personal Computers, Zhe Yang et.al., Paper: http://arxiv.org/abs/2604.01221
- 2026-01-08, Higher-Order Knowledge Representations for Agentic Scientific Reasoning, Isabella A. Stewart et.al., Paper: http://arxiv.org/abs/2601.04878
- 2025-10-15, Higher Satisfaction, Lower Cost: A Technical Report on How LLMs Revolutionize Meituan’s Intelligent Interaction Systems, Xuxin Cheng et.al., Paper: http://arxiv.org/abs/2510.13291
- 2026-02-26, Hierarchy-of-Groups Policy Optimization for Long-Horizon Agentic Tasks, Shuo He et.al., Paper: http://arxiv.org/abs/2602.22817
- 2025-12-03, Hierarchical Vision Language Action Model Using Success and Failure Demonstrations, Jeongeun Park et.al., Paper: http://arxiv.org/abs/2512.03913
- 2025-12-15, Hierarchical Multi-agent Large Language Model Reasoning for Autonomous Functional Materials Discovery, Samuel Rothfarb et.al., Paper: http://arxiv.org/abs/2512.13930
- 2026-04-02, Hierarchical Memory Orchestration for Personalized Persistent Agents, Junming Liu et.al., Paper: http://arxiv.org/abs/2604.01670
- 2026-02-25, Hierarchical LLM-Based Multi-Agent Framework with Prompt Optimization for Multi-Robot Task Planning, Tomoya Kawabe et.al., Paper: http://arxiv.org/abs/2602.21670
- 2025-11-28, Hierarchical AI-Meteorologist: LLM-Agent System for Multi-Scale and Explainable Weather Forecast Reporting, Daniil Sukhorukov et.al., Paper: http://arxiv.org/abs/2511.23387
- 2025-11-26, Hidden symmetries and separability structures of Ovcharenko-Podolský and conformal-to-Carter spacetimes, Finnian Gray et.al., Paper: http://arxiv.org/abs/2511.21538
- 2026-01-20, Hidden in Plain Text: Measuring LLM Deception Quality Against Human Baselines Using Social Deduction Games, Christopher Kao et.al., Paper: http://arxiv.org/abs/2601.13709
- 2026-02-11, Hidden Licensing Risks in the LLMware Ecosystem, Bo Wang et.al., Paper: http://arxiv.org/abs/2602.10758
- 2025-08-29, HiVA: Self-organized Hierarchical Variable Agent via Goal-driven Semantic-Topological Evolution, Jinzhou Tang et.al., Paper: http://arxiv.org/abs/2509.00189
- 2026-02-18, HiPER: Hierarchical Reinforcement Learning with Explicit Credit Assignment for Large Language Model Agents, Jiangweizhi Peng et.al., Paper: http://arxiv.org/abs/2602.16165
- 2025-11-18, Heterogeneous Multi-Agent Proximal Policy Optimization for Power Distribution System Restoration, Parya Dolatyabi et.al., Paper: http://arxiv.org/abs/2511.14730
- 2026-03-28, Heterogeneous Debate Engine: Identity-Grounded Cognitive Architecture for Resilient LLM-Based Ethical Tutoring, Jakub Masłowski et.al., Paper: http://arxiv.org/abs/2603.27404
- 2026-01-29, Heterogeneous Computing: The Key to Powering the Future of AI Agent Inference, Yiren Zhao et.al., Paper: http://arxiv.org/abs/2601.22001
- 2026-02-18, Helpful to a Fault: Measuring Illicit Assistance in Multi-Turn, Multilingual LLM Agents, Nivya Talokar et.al., Paper: http://arxiv.org/abs/2602.16346
- 2025-10-16, Helmsman: Autonomous Synthesis of Federated Learning Systems via Multi-Agent Collaboration, Haoyuan Li et.al., Paper: http://arxiv.org/abs/2510.14512
- 2025-12-22, Helios: A Foundational Language Model for Smart Energy Knowledge Reasoning and Application, Haoyu Jiang et.al., Paper: http://arxiv.org/abs/2512.19299
- 2026-03-11, HeartAgent: An Autonomous Agent System for Explainable Differential Diagnosis in Cardiology, Shuang Zhou et.al., Paper: http://arxiv.org/abs/2603.10764
- 2026-03-21, Hear Both Sides: Efficient Multi-Agent Debate via Diversity-Aware Message Retention, Manh Nguyen et.al., Paper: http://arxiv.org/abs/2603.20640
- 2025-09-11, Harnessing Uncertainty: Entropy-Modulated Policy Gradients for Long-Horizon LLM Agents, Jiawei Wang et.al., Paper: http://arxiv.org/abs/2509.09265
- 2025-11-21, Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization, Vinay Kanakeri et.al., Paper: http://arxiv.org/abs/2511.17489
- 2025-11-05, HaluMem: Evaluating Hallucinations in Memory Systems of Agents, Ding Chen et.al., Paper: http://arxiv.org/abs/2511.03506
- 2025-11-19, HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning, Qihao Yang et.al., Paper: http://arxiv.org/abs/2511.15574
- 2025-10-24, HIKMA: Human-Inspired Knowledge by Machine Agents through a Multi-Agent Framework for Semi-Autonomous Scientific Conferences, Zain Ul Abideen Tariq et.al., Paper: http://arxiv.org/abs/2510.21370
- 2025-12-25, HELP: Hierarchical Embodied Language Planner for Household Tasks, Alexandr V. Korchemnyi et.al., Paper: http://arxiv.org/abs/2512.21723
- 2025-12-22, HARMON-E: Hierarchical Agentic Reasoning for Multimodal Oncology Notes to Extract Structured Data, Shashi Kant Gupta et.al., Paper: http://arxiv.org/abs/2512.19864
- 2025-09-16, H $^2$ R: Hierarchical Hindsight Reflection for Multi-Task LLM Agents, Shicheng Ye et.al., Paper: http://arxiv.org/abs/2509.12810
- 2025-10-02, GuruAgents: Emulating Wise Investors with Prompt-Guided LLM Agents, Yejin Kim et.al., Paper: http://arxiv.org/abs/2510.01664
- 2026-01-30, Guided by Trajectories: Repairing and Rewarding Tool-Use Trajectories for Tool-Integrated Reasoning, Siyu Gong et.al., Paper: http://arxiv.org/abs/2601.23032
- 2025-09-09, Guided Reasoning in LLM-Driven Penetration Testing Using Structured Attack Trees, Katsuaki Nakano et.al., Paper: http://arxiv.org/abs/2509.07939
- 2026-02-04, Group-Evolving Agents: Open-Ended Self-Improvement via Experience Sharing, Zhaotian Weng et.al., Paper: http://arxiv.org/abs/2602.04837
- 2026-02-24, Grounding LLMs in Scientific Discovery via Embodied Actions, Bo Zhang et.al., Paper: http://arxiv.org/abs/2602.20639
- 2026-02-09, Grounding Generative Planners in Verifiable Logic: A Hybrid Architecture for Trustworthy Embodied AI, Feiyu Wu et.al., Paper: http://arxiv.org/abs/2602.08373
- 2025-11-06, Grounded Test-Time Adaptation for LLM Agents, Arthur Chen et.al., Paper: http://arxiv.org/abs/2511.04847
- 2025-11-18, Ground Truth Generation for Multilingual Historical NLP using LLMs, Clovis Gladstone et.al., Paper: http://arxiv.org/abs/2511.14688
- 2026-02-24, Grid-Mind: An LLM-Orchestrated Multi-Fidelity Agent for Automated Connection Impact Assessment, Mohamed Shamseldein et.al., Paper: http://arxiv.org/abs/2602.20683
- 2026-03-18, Grid Spatial Understanding: A Dataset for Textual Spatial Reasoning over Grids, Embodied Settings, and Coordinate Structures, Risham Sidhu et.al., Paper: http://arxiv.org/abs/2603.17333
- 2025-11-20, Green Resilience of Cyber-Physical Systems: Doctoral Dissertation, Diaeddin Rimawi et.al., Paper: http://arxiv.org/abs/2511.16593
- 2026-01-30, Greedy Routing Reachability Games, Pascal Lenzner et.al., Paper: http://arxiv.org/abs/2601.23126
- 2026-03-28, Greedy Is a Strong Default: Agents as Iterative Optimizers, Yitao Li et.al., Paper: http://arxiv.org/abs/2603.27415
- 2026-04-02, GraphWalk: Enabling Reasoning in Large Language Models through Tool-Based Graph Navigation, Taraneh Ghandi et.al., Paper: http://arxiv.org/abs/2604.01610
- 2025-10-12, GraphTracer: Graph-Guided Failure Tracing in LLM Agents for Robust Multi-Turn Deep Search, Heng Zhang et.al., Paper: http://arxiv.org/abs/2510.10581
- 2026-01-13, GraphSearch: Agentic Search-Augmented Reasoning for Zero-Shot Graph Learning, Jiajin Liu et.al., Paper: http://arxiv.org/abs/2601.08621
- 2025-12-23, Graph-Symbolic Policy Enforcement and Control (G-SPEC): A Neuro-Symbolic Framework for Safe Agentic AI in 5G Autonomous Networks, Divya Vijay et.al., Paper: http://arxiv.org/abs/2512.20275
- 2026-03-18, Graph-Native Cognitive Memory for AI Agents: Formal Belief Revision Semantics for Versioned Memory Architectures, Young Bin Park et.al., Paper: http://arxiv.org/abs/2603.17244
- 2025-10-30, Graph-Enhanced Policy Optimization in LLM Agent Training, Jiazhen Yuan et.al., Paper: http://arxiv.org/abs/2510.26270
- 2025-11-19, Graph Rewriting Language as a Platform for Quantum Diagrammatic Calculi, Kayo Tei et.al., Paper: http://arxiv.org/abs/2511.15581
- 2025-11-10, Graph Representation-based Model Poisoning on the Heterogeneous Internet of Agents, Hanlin Cai et.al., Paper: http://arxiv.org/abs/2511.07176
- 2025-11-18, Graph Neural Networks for Vehicular Social Networks: Trends, Challenges, and Opportunities, Elham Binshaflout et.al., Paper: http://arxiv.org/abs/2511.14720
- 2025-12-01, Graph Distance as Surprise: Free Energy Minimization in Knowledge Graph Reasoning, Gaganpreet Jhajj et.al., Paper: http://arxiv.org/abs/2512.01878
- 2025-12-04, Gradient Descent with Provably Tuned Learning-rate Schedules, Dravyansh Sharma et.al., Paper: http://arxiv.org/abs/2512.05084
- 2025-12-12, Gradient Descent as a Perceptron Algorithm: Understanding Dynamics and Implicit Acceleration, Alexander Tyurin et.al., Paper: http://arxiv.org/abs/2512.11587
- 2026-03-18, Governed Memory: A Production Architecture for Multi-Agent Workflows, Hamed Taheri et.al., Paper: http://arxiv.org/abs/2603.17787
- 2025-09-20, Governed By Agents: A Survey On The Role Of Agentic AI In Future Computing Environments, Nauman Ali Murad et.al., Paper: http://arxiv.org/abs/2509.16676
- 2026-03-21, Governance-Aware Vector Subscriptions for Multi-Agent Knowledge Ecosystems, Steven Johnson et.al., Paper: http://arxiv.org/abs/2603.20833
- 2026-03-07, Governance Architecture for Autonomous Agent Systems: Threats, Framework, and Engineering Practice, Yuxu Ge et.al., Paper: http://arxiv.org/abs/2603.07191
- 2026-03-04, Goal-Driven Risk Assessment for LLM-Powered Systems: A Healthcare Case Study, Neha Nagaraja et.al., Paper: http://arxiv.org/abs/2603.03633
- 2026-03-20, GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems, Hongjiang Chen et.al., Paper: http://arxiv.org/abs/2603.19677
- 2026-03-12, GlyphBanana: Advancing Precise Text Rendering Through Agentic Workflows, Zexuan Yan et.al., Paper: http://arxiv.org/abs/2603.12155
- 2026-02-17, GlobeDiff: State Diffusion Process for Partial Observability in Multi-Agent Systems, Yiqin Yang et.al., Paper: http://arxiv.org/abs/2602.15776
- 2025-10-31, Glia: A Human-Inspired AI for Automated Systems Design and Optimization, Pouya Hamadanian et.al., Paper: http://arxiv.org/abs/2510.27176
- 2025-10-30, Gistify! Codebase-Level Understanding via Runtime Execution, Hyunji Lee et.al., Paper: http://arxiv.org/abs/2510.26790
- 2025-10-23, GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments?, Chiyu Chen et.al., Paper: http://arxiv.org/abs/2510.20333
- 2025-09-09, Getting In Contract with Large Language Models – An Agency Theory Perspective On Large Language Model Alignment, Sascha Kaltenpoth et.al., Paper: http://arxiv.org/abs/2509.07642
- 2025-11-19, GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization, Yikun Wang et.al., Paper: http://arxiv.org/abs/2511.15705
- 2025-10-21, Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming, Zheng Zhang et.al., Paper: http://arxiv.org/abs/2510.18314
- 2026-01-20, Generative Intent Prediction Agentic AI empowered Edge Service Function Chain Orchestration, Yan Sun et.al., Paper: http://arxiv.org/abs/2601.13694
- 2026-03-13, Generative Horcrux: Designing AI Carriers for Afterlife Selves, Zhen-Chi Lai et.al., Paper: http://arxiv.org/abs/2603.12971
- 2025-08-26, Generative Artificial Intelligence and Agents in Research and Teaching, Jussi S. Jauhiainen et.al., Paper: http://arxiv.org/abs/2508.16701
- 2026-02-10, Generative AI Adoption in an Energy Company: Exploring Challenges and Use Cases, Malik Abdul Sami et.al., Paper: http://arxiv.org/abs/2602.09846
- 2025-12-02, Generation of strong ultralow-phase-noise microwave fields with tunable ellipticity for ultracold polar molecules, Shrestha Biswas et.al., Paper: http://arxiv.org/abs/2512.03007
- 2026-03-12, Generating Expressive and Customizable Evals for Timeseries Data Analysis Agents with AgentFuel, Aadyaa Maddi et.al., Paper: http://arxiv.org/abs/2603.12483
- 2026-01-08, Generate, Transfer, Adapt: Learning Functional Dexterous Grasping from a Single Human Demonstration, Xingyi He et.al., Paper: http://arxiv.org/abs/2601.05243
- 2025-10-16, Generalized Dynamics Generation towards Scannable Physical World Model, Yichen Li et.al., Paper: http://arxiv.org/abs/2510.15041
- 2025-09-22, Generalizable End-to-End Tool-Use RL with Synthetic CodeGym, Weihua Du et.al., Paper: http://arxiv.org/abs/2509.17325
- 2025-12-22, GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators, Jiacheng Guo et.al., Paper: http://arxiv.org/abs/2512.19682
- 2026-03-02, GenDB: The Next Generation of Query Processing – Synthesized, Not Engineered, Jiale Lao et.al., Paper: http://arxiv.org/abs/2603.02081
- 2025-10-14, GenCellAgent: Generalizable, Training-Free Cellular Image Segmentation via Large Language Model Agents, Xi Yu et.al., Paper: http://arxiv.org/abs/2510.13896
- 2026-01-26, GenAgent: Scaling Text-to-Image Generation via Agentic Multimodal Reasoning, Kaixun Jiang et.al., Paper: http://arxiv.org/abs/2601.18543
- 2026-01-21, Gaming the Judge: Unfaithful Chain-of-Thought Can Undermine Agent Evaluation, Muhammad Khalifa et.al., Paper: http://arxiv.org/abs/2601.14691
- 2026-03-25, GameplayQA: A Benchmarking Framework for Decision-Dense POV-Synced Multi-Video Understanding of 3D Virtual Agents, Yunzhe Wang et.al., Paper: http://arxiv.org/abs/2603.24329
- 2026-02-11, GameDevBench: Evaluating Agentic Capabilities Through Game Development, Wayne Chi et.al., Paper: http://arxiv.org/abs/2602.11103
- 2026-02-03, Game-Theoretic and Algorithmic Analyses of Multi-Agent Routing under Crossing Costs, Tesshu Hanaka et.al., Paper: http://arxiv.org/abs/2602.03455
- 2026-01-21, Game-Theoretic Lens on LLM-based Multi-Agent Systems, Jianing Hao et.al., Paper: http://arxiv.org/abs/2601.15047
- 2025-10-02, Gala: Global LLM Agents for Text-to-Model Translation, Junyang Cai et.al., Paper: http://arxiv.org/abs/2509.08970
- 2026-02-12, Gaia2: Benchmarking LLM Agents on Dynamic and Asynchronous Environments, Romain Froger et.al., Paper: http://arxiv.org/abs/2602.11964
- 2025-10-16, GUIrilla: A Scalable Framework for Automated Desktop UI Exploration, Sofiya Garkot et.al., Paper: http://arxiv.org/abs/2510.16051
- 2026-03-28, GUIDE: Guided Updates for In-context Decision Evolution in LLM-Driven Spacecraft Operations, Alejandro Carrasco et.al., Paper: http://arxiv.org/abs/2603.27306
- 2025-09-28, GUI-Shepherd: Reliable Process Reward and Verification for Long-Sequence GUI Tasks, Cong Chen et.al., Paper: http://arxiv.org/abs/2509.23738
- 2025-10-02, GSM-Agent: Understanding Agentic Reasoning Using Controllable Environments, Hanlin Zhu et.al., Paper: http://arxiv.org/abs/2509.21998
- 2025-11-14, GRIN Transfer: A production-ready tool for libraries to retrieve digital copies from Google Books, Liza Daly et.al., Paper: http://arxiv.org/abs/2511.11447
- 2025-12-05, GRASP: Graph Reasoning Agents for Systems Pharmacology with Human-in-the-Loop, Omid Bazgir et.al., Paper: http://arxiv.org/abs/2512.05502
- 2026-04-01, GRASP: Gradient Realignment via Active Shared Perception for Multi-Agent Collaborative Optimization, Sihan Zhou et.al., Paper: http://arxiv.org/abs/2604.00717
- 2025-11-21, GRAPHIC–Guidelines for Reviewing Algorithmic Practices in Human-centred Design and Interaction for Creativity, Joana Rovira Martins et.al., Paper: http://arxiv.org/abs/2511.17443
- 2025-11-14, GRANITE: High-Resolution Imaging and Electrical Qualification of Large-Area TPC Electrodes, Shumit A. Mitra et.al., Paper: http://arxiv.org/abs/2511.11401
- 2026-02-18, GPSBench: Do Large Language Models Understand GPS Coordinates?, Thinh Hung Truong et.al., Paper: http://arxiv.org/abs/2602.16105
- 2025-10-14, GOAT: A Training Framework for Goal-Oriented Agent with Tools, Hyunji Min et.al., Paper: http://arxiv.org/abs/2510.12218
- 2026-03-21, GMPilot: An Expert AI Agent For FDA cGMP Compliance, Xiaohan Wang et.al., Paper: http://arxiv.org/abs/2603.20815
- 2026-02-17, GLM-5: from Vibe Coding to Agentic Engineering, GLM-5 Team et.al., Paper: http://arxiv.org/abs/2602.15763
- 2025-10-29, GAP: Graph-Based Agent Planning with Parallel Tool Use and Reinforcement Learning, Jiaqi Wu et.al., Paper: http://arxiv.org/abs/2510.25320
- 2025-12-10, GAIR: GUI Automation via Information-Joint Reasoning and Group Reflection, Zishu Wei et.al., Paper: http://arxiv.org/abs/2512.09396
- 2025-12-24, Fuzzwise: Intelligent Initial Corpus Generation for Fuzzing, Hridya Dhulipala et.al., Paper: http://arxiv.org/abs/2512.21440
- 2025-12-12, FutureWeaver: Planning Test-Time Compute for Multi-Agent Systems with Modularized Collaboration, Dongwon Jung et.al., Paper: http://arxiv.org/abs/2512.11213
- 2026-03-11, FutureVLA: Joint Visuomotor Prediction for Vision-Language-Action Model, Xiaoxu Xu et.al., Paper: http://arxiv.org/abs/2603.10712
- 2025-10-10, Fundamentals of Building Autonomous LLM Agents, Victor de Lamo Castrillo et.al., Paper: http://arxiv.org/abs/2510.09244
- 2025-12-03, Functorial properties of Schwinger-DeWitt expansion and Mellin-Barnes representation, Andrei O. Barvinsky et.al., Paper: http://arxiv.org/abs/2512.03944
- 2025-11-28, Functional Program Synthesis with Higher-Order Functions and Recursion Schemes, Matheus Campos Fernandes et.al., Paper: http://arxiv.org/abs/2511.23354
- 2025-10-28, FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling, Zengzhuang Xu et.al., Paper: http://arxiv.org/abs/2510.24645
- 2025-12-23, Fun-Audio-Chat Technical Report, Qian Chen et.al., Paper: http://arxiv.org/abs/2512.20156
- 2026-02-03, FullStack-Agent: Enhancing Agentic Full-Stack Web Coding via Development-Oriented Testing and Repository Back-Translation, Zimu Lu et.al., Paper: http://arxiv.org/abs/2602.03798
- 2026-01-06, From inconsistency to decision: explainable operation and maintenance of battery energy storage systems, Jingbo Qu et.al., Paper: http://arxiv.org/abs/2601.03007
- 2026-03-16, From Workflow Automation to Capability Closure: A Formal Framework for Safe and Revenue-Aware Customer Service AI, Cosimo Spera et.al., Paper: http://arxiv.org/abs/2603.15978
- 2026-01-21, From Who They Are to How They Act: Behavioral Traits in Generative Agent-Based Models of Social Media, Valerio La Gatta et.al., Paper: http://arxiv.org/abs/2601.15114
- 2026-03-19, From Weak Cues to Real Identities: Evaluating Inference-Driven De-Anonymization in LLM Agents, Myeongseob Ko et.al., Paper: http://arxiv.org/abs/2603.18382
- 2025-12-15, From User Interface to Agent Interface: Efficiency Optimization of UI Representations for LLM Agents, Dezhi Ran et.al., Paper: http://arxiv.org/abs/2512.13438
- 2025-09-30, From Trace to Line: LLM Agent for Real-World OSS Vulnerability Localization, Haoran Xi et.al., Paper: http://arxiv.org/abs/2510.02389
- 2026-03-28, From Tool to Teammate: LLM Coding Agents as Collaborative Partners for Behavioral Labeling in Educational Dialogue Analysis, Eason Chen et.al., Paper: http://arxiv.org/abs/2603.27440
- 2026-03-04, From Threat Intelligence to Firewall Rules: Semantic Relations in Hybrid AI Agent and Expert System Architectures, Chiara Bonfanti et.al., Paper: http://arxiv.org/abs/2603.03911
- 2025-12-05, From Text to Returns: Using Large Language Models for Mutual Fund Portfolio Optimization and Risk-Adjusted Allocation, Abrar Hossain Mufakir Qamar Ansari Haziq Jeelani Monia Digra Fayeq Jeelani Syed et.al., Paper: http://arxiv.org/abs/2512.05907
- 2026-02-27, From Static Benchmarks to Dynamic Protocol: Agent-Centric Text Anomaly Detection for Evaluating LLM Reasoning, Seungdong Yoa et.al., Paper: http://arxiv.org/abs/2602.23729
- 2025-11-04, From Solo to Symphony: Orchestrating Multi-Agent Collaboration with Single-Agent Demos, Xun Wang et.al., Paper: http://arxiv.org/abs/2511.02762
- 2026-01-15, From Single to Multi-Agent Reasoning: Advancing GeneGPT for Genomics QA, Kimia Abedini et.al., Paper: http://arxiv.org/abs/2601.10581
- 2026-01-30, From Similarity to Vulnerability: Key Collision Attack on LLM Semantic Caching, Zhixiang Zhang et.al., Paper: http://arxiv.org/abs/2601.23088
- 2025-10-05, From Shadow to Light: Toward Safe and Efficient Policy Learning Across MPC, DeePC, RL, and LLM Agents, Amin Vahidi-Moghaddam et.al., Paper: http://arxiv.org/abs/2510.04076
- 2026-01-30, From Self-Evolving Synthetic Data to Verifiable-Reward RL: Post-Training Multi-turn Interactive Tool-Using Agents, Jiaxuan Gao et.al., Paper: http://arxiv.org/abs/2601.22607
- 2025-10-15, From Refusal to Recovery: A Control-Theoretic Approach to Generative AI Guardrails, Ravi Pandya et.al., Paper: http://arxiv.org/abs/2510.13727
- 2025-10-23, From Questions to Queries: An AI-powered Multi-Agent Framework for Spatial Text-to-SQL, Ali Khosravi Kazazi et.al., Paper: http://arxiv.org/abs/2510.21045
- 2026-02-11, From Prompt-Response to Goal-Directed Systems: The Evolution of Agentic AI Software Architecture, Mamdouh Alenezi et.al., Paper: http://arxiv.org/abs/2602.10479
- 2025-11-17, From Power to Precision: Learning Fine-grained Dexterity for Multi-fingered Robotic Hands, Jianglong Ye et.al., Paper: http://arxiv.org/abs/2511.13710
- 2025-10-31, From Pixels to Paths: A Multi-Agent Framework for Editable Scientific Illustration, Jianwen Sun et.al., Paper: http://arxiv.org/abs/2510.27452
- 2025-12-18, From Personalization to Prejudice: Bias and Discrimination in Memory-Enhanced AI Agents for Recruitment, Himanshu Gharat et.al., Paper: http://arxiv.org/abs/2512.16532
- 2026-02-24, From Perception to Action: An Interactive Benchmark for Vision Reasoning, Yuhao Wu et.al., Paper: http://arxiv.org/abs/2602.21015
- 2026-01-22, From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models, Jiaxin Zhang et.al., Paper: http://arxiv.org/abs/2601.15690
- 2025-10-28, From Observability Data to Diagnosis: An Evolving Multi-agent System for Incident Management in Cloud Systems, Yu Luo et.al., Paper: http://arxiv.org/abs/2510.24145
- 2025-12-16, From Obfuscated to Obvious: A Comprehensive JavaScript Deobfuscation Tool for Security Analysis, Dongchao Zhou et.al., Paper: http://arxiv.org/abs/2512.14070
- 2025-12-02, From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars?, Dawei Li et.al., Paper: http://arxiv.org/abs/2512.03005
- 2025-10-29, From Medical Records to Diagnostic Dialogues: A Clinical-Grounded Approach and Dataset for Psychiatric Comorbidity, Tianxi Wan et.al., Paper: http://arxiv.org/abs/2510.25232
- 2025-11-05, From Measurement to Expertise: Empathetic Expert Adapters for Context-Based Empathy in Conversational AI Agents, Erfan Shayegani et.al., Paper: http://arxiv.org/abs/2511.03143
- 2026-03-26, From Manipulation to Mistrust: Explaining Diverse Micro-Video Misinformation for Robust Debunking in the Wild, Zhi Zeng et.al., Paper: http://arxiv.org/abs/2603.25423
- 2026-03-26, From Logic Monopoly to Social Contract: Separation of Power and the Institutional Foundations for Autonomous Agent Economies, Anbang Ruan et.al., Paper: http://arxiv.org/abs/2603.25100
- 2025-10-14, From Literal to Liberal: A Meta-Prompting Framework for Eliciting Human-Aligned Exception Handling in Large Language Models, Imran Khan et.al., Paper: http://arxiv.org/abs/2510.12864
- 2025-09-17, From Legacy Fortran to Portable Kokkos: An Autonomous Agentic AI Workflow, Sparsh Gupta et.al., Paper: http://arxiv.org/abs/2509.12443
- 2026-03-03, From Language to Action: Can LLM-Based Agents Be Used for Embodied Robot Cognition?, Shinas Shaji et.al., Paper: http://arxiv.org/abs/2603.03148
- 2025-08-24, From Language to Action: A Review of Large Language Models as Autonomous Agents and Tool Users, Sadia Sultana Chowa et.al., Paper: http://arxiv.org/abs/2508.17281
- 2026-01-07, From Laboratory to Real-World Applications: Benchmarking Agentic Code Reasoning at the Repository Level, Jia Li et.al., Paper: http://arxiv.org/abs/2601.03731
- 2026-02-19, From Labor to Collaboration: A Methodological Experiment Using AI Agents to Augment Research Perspectives in Taiwan’s Humanities and Social Sciences, Yi-Chih Huang et.al., Paper: http://arxiv.org/abs/2602.17221
- 2025-04-29, From LLM Reasoning to Autonomous AI Agents: A Comprehensive Review, Mohamed Amine Ferrag et.al., Paper: http://arxiv.org/abs/2504.19678
- 2026-03-26, From Intent to Evidence: A Categorical Approach for Structural Evaluation of Deep Research Agents, Shuoling Liu et.al., Paper: http://arxiv.org/abs/2603.25342
- 2026-03-28, From Inference Routing to Agent Orchestration: Declarative Policy Compilation with Cross-Layer Verification, Huamin Chen et.al., Paper: http://arxiv.org/abs/2603.27299
- 2026-03-19, From Inference Efficiency to Embodied Efficiency: Revisiting Efficiency Metrics for Vision-Language-Action Models, Zhuofan Li et.al., Paper: http://arxiv.org/abs/2603.19131
- 2026-02-05, From Human-Human Collaboration to Human-Agent Collaboration: A Vision, Design Philosophy, and an Empirical Framework for Achieving Successful Partnerships Between Humans and LLM Agents, Bingsheng Yao et.al., Paper: http://arxiv.org/abs/2602.05987
- 2026-02-04, From Helpfulness to Toxic Proactivity: Diagnosing Behavioral Misalignment in LLM Agents, Xinyue Wang et.al., Paper: http://arxiv.org/abs/2602.04197
- 2025-10-23, From Generation to Attribution: Music AI Agent Architectures for the Post-Streaming Era, Wonil Kim et.al., Paper: http://arxiv.org/abs/2510.20276
- 2026-01-04, From Failure to Mastery: Generating Hard Samples for Tool-use Agents, Bingguang Hao et.al., Paper: http://arxiv.org/abs/2601.01498
- 2026-03-13, From Experiments to Expertise: Scientific Knowledge Consolidation for AI-Driven Computational Research, Haonan Huang et.al., Paper: http://arxiv.org/abs/2603.13191
- 2025-09-07, From Digital Distrust to Codified Honesty: Experimental Evidence on Generative AI in Credence Goods Markets, Alexander Erlei et.al., Paper: http://arxiv.org/abs/2509.06069
- 2026-03-10, From Days to Minutes: An Autonomous AI Agent Achieves Reliable Clinical Triage in Remote Patient Monitoring, Seunghwan Kim et.al., Paper: http://arxiv.org/abs/2603.09052
- 2025-12-16, From Context to EDUs: Faithful and Structured Context Compression via Elementary Discourse Unit Decomposition, Yiqing Zhou et.al., Paper: http://arxiv.org/abs/2512.14244
- 2026-02-04, From Assumptions to Actions: Turning LLM Reasoning into Uncertainty-Aware Planning for Embodied Agents, SeungWon Seo et.al., Paper: http://arxiv.org/abs/2602.04326
- 2026-02-09, From Assistant to Double Agent: Formalizing and Benchmarking Attacks on OpenClaw for Personalized Local AI Agent, Yuhang Wang et.al., Paper: http://arxiv.org/abs/2602.08412
- 2025-10-07, From Agentification to Self-Evolving Agentic AI for Wireless Networks: Concepts, Approaches, and Future Research Directions, Changyuan Zhao et.al., Paper: http://arxiv.org/abs/2510.05596
- 2026-03-25, From AI Assistant to AI Scientist: Autonomous Discovery of LLM-RL Algorithms with LLM Agents, Sirui Xia et.al., Paper: http://arxiv.org/abs/2603.23951
- 2025-11-24, Frequency-Invariant Beamforming in Elevation and Azimuth via Autograd and Concentric Circular Microphone Arrays, Jorge Ortigoso-Narro et.al., Paper: http://arxiv.org/abs/2511.19403
- 2025-11-17, FreeAskWorld: An Interactive and Closed-Loop Simulator for Human-Centric Embodied AI, Yuhang Peng et.al., Paper: http://arxiv.org/abs/2511.13524
- 2025-09-14, Free-MAD: Consensus-Free Multi-Agent Debate, Yu Cui et.al., Paper: http://arxiv.org/abs/2509.11035
- 2026-02-10, Frame-Level Internal Tool Use for Temporal Grounding in Audio LMs, Joesph An et.al., Paper: http://arxiv.org/abs/2602.10230
- 2026-02-27, Foundation World Models for Agents that Learn, Verify, and Adapt Reliably Beyond Static Environments, Florent Delgrange et.al., Paper: http://arxiv.org/abs/2602.23997
- 2025-10-15, Formalizing the Safety, Security, and Functional Properties of Agentic AI Systems, Edoardo Allegrini et.al., Paper: http://arxiv.org/abs/2510.14133
- 2025-12-01, Forecasting in Offline Reinforcement Learning for Non-stationary Environments, Suzan Ece Ada et.al., Paper: http://arxiv.org/abs/2512.01987
- 2025-10-21, Food4All: A Multi-Agent Framework for Real-time Free Food Discovery with Integrated Nutritional Metadata, Zhengqing Yuan et.al., Paper: http://arxiv.org/abs/2510.18289
- 2026-02-10, Focus Session: LLM4PQC – An Agentic Framework for Accurate and Efficient Synthesis of PQC Cores, Buddhi Perera et.al., Paper: http://arxiv.org/abs/2602.09919
- 2025-12-16, FocalComm: Hard Instance-Aware Multi-Agent Perception, Dereje Shenkut et.al., Paper: http://arxiv.org/abs/2512.13982
- 2025-12-02, FluxLab: Creating 3D Printable Shape-Changing Devices with Integrated Deformation Sensing, Hsuanling Lee et.al., Paper: http://arxiv.org/abs/2512.02911
- 2026-03-26, FluxEDA: A Unified Execution Infrastructure for Stateful Agentic EDA, Zhengrui Chen et.al., Paper: http://arxiv.org/abs/2603.25243
- 2026-04-01, Fluently Lying: Adversarial Robustness Can Be Substrate-Dependent, Daye Kang et.al., Paper: http://arxiv.org/abs/2604.00605
- 2026-02-12, FlowMind: Execute-Summarize for Structured Workflow Generation from LLM Reasoning, Yihao Liu et.al., Paper: http://arxiv.org/abs/2602.11782
- 2025-12-05, Floer sections in multisymplectic geometry, Ronen Brilleslijper et.al., Paper: http://arxiv.org/abs/2512.05797
- 2025-09-11, Flip Co-op: Cooperative Takeovers in Shared Autonomy, Sandeep Banik et.al., Paper: http://arxiv.org/abs/2509.09281
- 2025-11-13, Flexible Simulation Based Inference for Galaxy Photometric Fitting with Synthesizer, Thomas Harvey et.al., Paper: http://arxiv.org/abs/2511.10640
- 2026-03-10, FlexServe: A Fast and Secure LLM Serving System for Mobile Devices with Flexible Resource Isolation, Yinpeng Wu et.al., Paper: http://arxiv.org/abs/2603.09046
- 2026-01-01, FlashInfer-Bench: Building the Virtuous Cycle for AI-driven LLM Systems, Shanli Xing et.al., Paper: http://arxiv.org/abs/2601.00227
- 2025-10-24, Five-loop beta function for gauge theories: computations, results and consequences, F. Herzog et.al., Paper: http://arxiv.org/abs/2510.21624
- 2026-03-05, FireBench: Evaluating Instruction Following in Enterprise and API-Driven LLM Applications, Yunfan Zhang et.al., Paper: http://arxiv.org/abs/2603.04857
- 2025-10-31, Fints: Efficient Inference-Time Personalization for LLMs with Fine-Grained Instance-Tailored Steering, Kounianhua Du et.al., Paper: http://arxiv.org/abs/2510.27206
- 2026-01-05, Finite-State Decentralized Policy-Based Control With Guaranteed Ground Coverage, Hossein Rastgoftar et.al., Paper: http://arxiv.org/abs/2601.02109
- 2025-10-01, Fine-tuning with RAG for Improving LLM Learning of New Skills, Humaid Ibrahim et.al., Paper: http://arxiv.org/abs/2510.01375
- 2026-02-11, Fine-Tuning GPT-5 for GPU Kernel Generation, Ali Tehrani et.al., Paper: http://arxiv.org/abs/2602.11000
- 2025-12-15, Finch: Benchmarking Finance & Accounting across Spreadsheet-Centric Enterprise Workflows, Haoyu Dong et.al., Paper: http://arxiv.org/abs/2512.13168
- 2025-10-13, FinVet: A Collaborative Framework of RAG and External Fact-Checking Agents for Financial Misinformation Detection, Daniel Berhane Araya et.al., Paper: http://arxiv.org/abs/2510.11654
- 2026-03-25, FinToolSyn: A forward synthesis Framework for Financial Tool-Use Dialogue Data with Dynamic Tool Retrieval, Caishuang Huang et.al., Paper: http://arxiv.org/abs/2603.24051
- 2026-03-09, FinToolBench: Evaluating LLM Agents for Real-World Financial Tool Use, Jiaxuan Lu et.al., Paper: http://arxiv.org/abs/2603.08262
- 2025-10-19, FinSight: Towards Real-World Financial Deep Research, Jiajie Jin et.al., Paper: http://arxiv.org/abs/2510.16844
- 2025-11-10, FinRpt: Dataset, Evaluation System and LLM-based Multi-agent Framework for Equity Research Report Generation, Song Jin et.al., Paper: http://arxiv.org/abs/2511.07322
- 2025-10-31, FinPos: A Position-Aware Trading Agent System for Real Financial Markets, Bijia Liu et.al., Paper: http://arxiv.org/abs/2510.27251
- 2025-10-21, FinAI Data Assistant: LLM-based Financial Database Query Processing with the OpenAI Function Calling API, Juhyeong Kim et.al., Paper: http://arxiv.org/abs/2510.14162
- 2025-10-21, Fetch.ai: An Architecture for Modern Multi-Agent Systems, Michael J. Wooldridge et.al., Paper: http://arxiv.org/abs/2510.18699
- 2025-09-30, Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents, Zhen Yang et.al., Paper: http://arxiv.org/abs/2509.26539
- 2025-09-28, FedAgentBench: Towards Automating Real-world Federated Medical Image Analysis with Server-Client LLM Agents, Pramit Saha et.al., Paper: http://arxiv.org/abs/2509.23803
- 2025-12-09, Fed-SE: Federated Self-Evolution for Privacy-Constrained Multi-Environment LLM Agents, Xiang Chen et.al., Paper: http://arxiv.org/abs/2512.08870
- 2026-02-13, Favia: Forensic Agent for Vulnerability-fix Identification and Analysis, André Storhaug et.al., Paper: http://arxiv.org/abs/2602.12500
- 2025-12-01, Fault-tolerant mutual-visibility: complexity and solutions for grid-like networks, Serafino Cicerone et.al., Paper: http://arxiv.org/abs/2512.01978
- 2025-12-14, Fault-Tolerant Sandboxing for AI Coding Agents: A Transactional Approach to Safe Autonomous Execution, Boyang Yan et.al., Paper: http://arxiv.org/abs/2512.12806
- 2026-02-02, Fat-Cat: Document-Driven Metacognitive Multi-Agent System for Complex Reasoning, Tong Yang et.al., Paper: http://arxiv.org/abs/2602.02206
- 2025-11-19, Fast and Certified Bounding of Security-Constrained DCOPF via Interval Bound Propagation, Eren Tekeler et.al., Paper: http://arxiv.org/abs/2511.15624
- 2025-12-02, Fast Gaussian Process Approximations for Autocorrelated Data, Ahmadreza Chokhachian et.al., Paper: http://arxiv.org/abs/2512.02925
- 2026-01-20, FantasyVLN: Unified Multimodal Chain-of-Thought Reasoning for Vision-Language Navigation, Jing Zuo et.al., Paper: http://arxiv.org/abs/2601.13976
- 2026-03-09, Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA, Ummar Abbas et.al., Paper: http://arxiv.org/abs/2603.08501
- 2025-11-20, FairLRF: Achieving Fairness through Sparse Low Rank Factorization, Yuanbo Guo et.al., Paper: http://arxiv.org/abs/2511.16549
- 2026-02-03, Failure is Feedback: History-Aware Backtracking for Agentic Traversal in Multimodal Graphs, Joohyung Yun et.al., Paper: http://arxiv.org/abs/2602.03432
- 2026-01-26, FadeMem: Biologically-Inspired Forgetting for Efficient Agent Memory, Lei Wei et.al., Paper: http://arxiv.org/abs/2601.18642
- 2026-02-16, FactorMiner: A Self-Evolving Agent with Skills and Experience Memory for Financial Alpha Discovery, Yanlong Wang et.al., Paper: http://arxiv.org/abs/2602.14670
- 2026-02-26, FactGuard: Agentic Video Misinformation Detection via Reinforcement Learning, Zehao Li et.al., Paper: http://arxiv.org/abs/2602.22963
- 2026-01-21, Facilitating Proactive and Reactive Guidance for Decision Making on the Web: A Design Probe with WebSeek, Yanwei Huang et.al., Paper: http://arxiv.org/abs/2601.15100
- 2025-09-04, FaMA: LLM-Empowered Agentic Assistant for Consumer-to-Consumer Marketplace, Yineng Yan et.al., Paper: http://arxiv.org/abs/2509.03890
- 2026-01-12, FROAV: A Framework for RAG Observation and Agent Verification – Lowering the Barrier to LLM Agent Research, Tzu-Hsuan Lin et.al., Paper: http://arxiv.org/abs/2601.07504
- 2025-11-25, FRAGMENTA: End-to-end Fragmentation-based Generative Model with Agentic Tuning for Drug Lead Optimization, Yuto Suzuki et.al., Paper: http://arxiv.org/abs/2511.20510
- 2025-08-24, FLAIRR-TS – Forecasting LLM-Agents with Iterative Refinement and Retrieval for Time Series, Gunjan Jalori et.al., Paper: http://arxiv.org/abs/2508.19279
- 2025-09-12, FHIR-AgentBench: Benchmarking LLM Agents for Realistic Interoperable EHR Question Answering, Gyubok Lee et.al., Paper: http://arxiv.org/abs/2509.19319
- 2025-10-29, FELA: A Multi-Agent Evolutionary System for Feature Engineering of Industrial Event Log Data, Kun ouyang et.al., Paper: http://arxiv.org/abs/2510.25223
- 2026-02-19, FAMOSE: A ReAct Approach to Automated Feature Discovery, Keith Burghardt et.al., Paper: http://arxiv.org/abs/2602.17641
- 2025-08-26, FALCON: Autonomous Cyber Threat Intelligence Mining with LLMs for IDS Rule Generation, Shaswata Mitra et.al., Paper: http://arxiv.org/abs/2508.18684
- 2025-10-15, FACTS: Table Summarization via Offline Template Generation with Agentic Workflows, Ye Yuan et.al., Paper: http://arxiv.org/abs/2510.13920
- 2026-01-30, FACET: Multi-Agent AI Supporting Teachers in Scaling Differentiated Learning for Diverse Students, Jana Gonnermann-Müller et.al., Paper: http://arxiv.org/abs/2601.22788
- 2025-10-20, FABRIC: Framework for Agent-Based Realistic Intelligence Creation, Abhigya Verma et.al., Paper: http://arxiv.org/abs/2510.17995
- 2025-10-04, Extracting Conceptual Knowledge to Locate Software Issues, Ying Wang et.al., Paper: http://arxiv.org/abs/2509.21427
- 2025-10-08, Exposing LLM User Privacy via Traffic Fingerprint Analysis: A Study of Privacy Risks in LLM Agent Interactions, Yixiang Zhang et.al., Paper: http://arxiv.org/abs/2510.07176
- 2026-01-04, Exposing Hidden Interfaces: LLM-Guided Type Inference for Reverse Engineering macOS Private Frameworks, Arina Kharlamova et.al., Paper: http://arxiv.org/abs/2601.01673
- 2025-11-19, Exploring the use of AI authors and reviewers at Agents4Science, Federico Bianchi et.al., Paper: http://arxiv.org/abs/2511.15534
- 2026-03-07, Exploring the Reasoning Depth of Small Language Models in Software Architecture: A Multidimensional Evaluation Framework Towards Software Engineering 2.0, Ha Vo et.al., Paper: http://arxiv.org/abs/2603.07091
- 2026-01-27, Exploring Weaknesses in Function Call Models via Reinforcement Learning: An Adversarial Data Augmentation Approach, Weiran Guo et.al., Paper: http://arxiv.org/abs/2601.19122
- 2026-02-19, Exploring The Impact Of Proactive Generative AI Agent Roles In Time-Sensitive Collaborative Problem-Solving Tasks, Anirban Mukhopadhyay et.al., Paper: http://arxiv.org/abs/2602.17864
- 2026-04-02, Exploring Robust Multi-Agent Workflows for Environmental Data Management, Boyuan Guan et.al., Paper: http://arxiv.org/abs/2604.01647
- 2026-01-29, Exploring Reasoning Reward Model for Agents, Kaixuan Fan et.al., Paper: http://arxiv.org/abs/2601.22154
- 2026-03-02, Exploring Plan Space through Conversation: An Agentic Framework for LLM-Mediated Explanations in Planning, Guilhem Fouilhé et.al., Paper: http://arxiv.org/abs/2603.02070
- 2025-08-30, Exploring Decision-Making Capabilities of LLM Agents: An Experimental Study on Jump-Jump Game, Juwu Li et.al., Paper: http://arxiv.org/abs/2509.00483
- 2025-11-18, Exploring AlphaFold 3 for CD47 Antibody-Antigen Binding Affinity: An Unexpected Discovery of Reverse docking, Yiyang Xu et.al., Paper: http://arxiv.org/abs/2511.14676
- 2025-10-16, Explore to Evolve: Scaling Evolved Aggregation Logic via Proactive Online Exploration for Deep Research Agents, Rui Wang et.al., Paper: http://arxiv.org/abs/2510.14438
- 2026-02-26, Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization, Zeyuan Liu et.al., Paper: http://arxiv.org/abs/2602.23008
- 2026-01-05, Explicit World Models for Reliable Human-Robot Collaboration, Kenneth Kwok et.al., Paper: http://arxiv.org/abs/2601.01705
- 2025-11-24, Explicit Tonal Tension Conditioning via Dual-Level Beam Search for Symbolic Music Generation, Maral Ebrahimzadeh et.al., Paper: http://arxiv.org/abs/2511.19342
- 2025-11-10, Explainable Cross-Disease Reasoning for Cardiovascular Risk Assessment from LDCT, Yifei Zhang et.al., Paper: http://arxiv.org/abs/2511.06625
- 2025-11-18, Expert-Guided POMDP Learning for Data-Efficient Modeling in Healthcare, Marco Locatelli et.al., Paper: http://arxiv.org/abs/2511.14619
- 2025-11-19, Experimental demonstration of non-local magic in a superconducting quantum processor, Halima Giovanna Ahmad et.al., Paper: http://arxiv.org/abs/2511.15576
- 2026-02-27, Experience-Guided Self-Adaptive Cascaded Agents for Breast Cancer Screening and Diagnosis with Reduced Biopsy Referrals, Pramit Saha et.al., Paper: http://arxiv.org/abs/2602.23899
- 2025-11-14, Experience-Guided Adaptation of Inference-Time Reasoning Strategies, Adam Stein et.al., Paper: http://arxiv.org/abs/2511.11519
- 2025-10-17, Experience-Driven Exploration for Efficient API-Free AI Agents, Chenwei Tang et.al., Paper: http://arxiv.org/abs/2510.15259
- 2026-03-02, Expanding LLM Agent Boundaries with Strategy-Guided Exploration, Andrew Szot et.al., Paper: http://arxiv.org/abs/2603.02045
- 2026-01-13, ExpSeek: Self-Triggered Experience Seeking for Web Agents, Wenyuan Zhang et.al., Paper: http://arxiv.org/abs/2601.08605
- 2026-02-12, Existence Results and KKT Optimality Conditions for Generalized Quasiconvex Functions, M. H. Alizadeh et.al., Paper: http://arxiv.org/abs/2602.12455
- 2025-10-17, Exemplar-Guided Planing: Enhanced LLM Agent for KGQA, Jingao Xu et.al., Paper: http://arxiv.org/abs/2510.15283
- 2026-04-01, Executing as You Generate: Hiding Execution Latency in LLM Code Generation, Zhensu Sun et.al., Paper: http://arxiv.org/abs/2604.00491
- 2025-10-20, Executable Knowledge Graphs for Replicating AI Research, Yujie Luo et.al., Paper: http://arxiv.org/abs/2510.17795
- 2025-11-19, Excess of diffuse gamma-ray emission detected from the galaxy cluster Abell 119 from 14-year Fermi-LAT Data, Gajanan D Harale et.al., Paper: http://arxiv.org/abs/2511.15559
- 2026-03-06, Evolving Medical Imaging Agents via Experience-driven Self-skill Discovery, Lin Fan et.al., Paper: http://arxiv.org/abs/2603.05860
- 2025-12-09, Evolving Excellence: Automated Optimization of LLM-based Agents, Paul Brookes et.al., Paper: http://arxiv.org/abs/2512.09108
- 2026-03-06, Evolving Deception: When Agents Evolve, Deception Wins, Zonghao Ying et.al., Paper: http://arxiv.org/abs/2603.05872
- 2025-10-17, EvolveR: Self-Evolving LLM Agents through an Experience-Driven Lifecycle, Rong Wu et.al., Paper: http://arxiv.org/abs/2510.16079
- 2026-03-16, Evolutionary Transfer Learning for Dragonchess, Jim O’Connor et.al., Paper: http://arxiv.org/abs/2603.15297
- 2026-02-06, Evolutionary Generation of Multi-Agent Systems, Yuntong Hu et.al., Paper: http://arxiv.org/abs/2602.06511
- 2025-12-04, Evolutionary Architecture Search through Grammar-Based Sequence Alignment, Adri Gómez Martín et.al., Paper: http://arxiv.org/abs/2512.04992
- 2025-10-13, Evolution in Simulation: AI-Agent School with Dual Memory for High-Fidelity Educational Dynamics, Sheng Jin et.al., Paper: http://arxiv.org/abs/2510.11290
- 2026-03-05, EvoTool: Self-Evolving Tool-Use Policy Optimization in LLM Agents via Blame-Aware Mutation and Diversity-Aware Selection, Shuo Yang et.al., Paper: http://arxiv.org/abs/2603.04900
- 2025-10-15, EvoTest: Evolutionary Test-Time Learning for Self-Improving Agentic Systems, Yufei He et.al., Paper: http://arxiv.org/abs/2510.13220
- 2026-04-02, EvoSkills: Self-Evolving Agent Skills via Co-Evolutionary Verification, Hanrong Zhang et.al., Paper: http://arxiv.org/abs/2604.01687
- 2026-01-06, EvoRoute: Experience-Driven Self-Routing LLM Agent Systems, Guibin Zhang et.al., Paper: http://arxiv.org/abs/2601.02695
- 2025-11-20, EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards, Omkat Thawakar et.al., Paper: http://arxiv.org/abs/2511.16672
- 2026-03-18, EvoGuard: An Extensible Agentic RL-based Framework for Practical and Evolving AI-Generated Image Detection, Chenyang Zhu et.al., Paper: http://arxiv.org/abs/2603.17343
- 2025-10-13, EvoEmo: Towards Evolved Emotional Policies for Adversarial LLM Agents in Multi-Turn Price Negotiation, Yunbo Long et.al., Paper: http://arxiv.org/abs/2509.04310
- 2026-02-09, EvoCorps: An Evolutionary Multi-Agent Framework for Depolarizing Online Discourse, Ning Lin et.al., Paper: http://arxiv.org/abs/2602.08529
- 2026-01-23, EvoConfig: Self-Evolving Multi-Agent Systems for Efficient Autonomous Environment Configuration, Xinshuai Guo et.al., Paper: http://arxiv.org/abs/2601.16489
- 2025-11-26, EvilGenie: A Reward Hacking Benchmark, Jonathan Gabor et.al., Paper: http://arxiv.org/abs/2511.21654
- 2026-01-09, EvidFuse: Writing-Time Evidence Learning for Consistent Text-Chart Data Reporting, Huanxiang Lin et.al., Paper: http://arxiv.org/abs/2601.05487
- 2026-02-17, EventMemAgent: Hierarchical Event-Centric Memory for Online Video Understanding with Adaptive Tool Use, Siwei Wen et.al., Paper: http://arxiv.org/abs/2602.15329
- 2026-03-16, Evasive Intelligence: Lessons from Malware Analysis for Evaluating AI Agents, Simone Aonzo et.al., Paper: http://arxiv.org/abs/2603.15457
- 2025-10-27, Evaluation of Vision-LLMs in Surveillance Video, Pascal Benschop et.al., Paper: http://arxiv.org/abs/2510.23190
- 2025-10-14, Evaluating the Quality of Randomness and Entropy in Tasks Supported by Large Language Models, Rabimba Karanjai et.al., Paper: http://arxiv.org/abs/2510.12080
- 2026-03-02, Evaluating and Understanding Scheming Propensity in LLM Agents, Mia Hopman et.al., Paper: http://arxiv.org/abs/2603.01608
- 2025-11-17, Evaluating and Scoring Ebolavirus Protein-protein Docking Models Using PIsToN, Azam Shirali et.al., Paper: http://arxiv.org/abs/2511.13583
- 2026-02-26, Evaluating and Improving Automated Repository-Level Rust Issue Resolution with LLM-based Agents, Jiahong Xiang et.al., Paper: http://arxiv.org/abs/2602.22764
- 2025-11-13, Evaluating Prompting Strategies with MedGemma for Medical Order Extraction, Abhinand Balachandran et.al., Paper: http://arxiv.org/abs/2511.10583
- 2025-08-27, Evaluating Language Model Reasoning about Confidential Information, Dylan Sam et.al., Paper: http://arxiv.org/abs/2508.19980
- 2026-01-13, Evaluating Implicit Regulatory Compliance in LLM Tool Invocation via Logic-Guided Synthesis, Da Song et.al., Paper: http://arxiv.org/abs/2601.08196
- 2025-12-12, Evaluating Cooperative Resilience in Multiagent Systems: A Comparison Between Humans and LLMs, Manuela Chacon-Chamorro et.al., Paper: http://arxiv.org/abs/2512.11689
- 2025-11-04, Evaluating Control Protocols for Untrusted AI Agents, Jon Kutasov et.al., Paper: http://arxiv.org/abs/2511.02997
- 2026-02-18, Evaluating Collective Behaviour of Hundreds of LLM Agents, Richard Willis et.al., Paper: http://arxiv.org/abs/2602.16662
- 2025-09-19, Evaluating Behavioral Alignment in Conflict Dialogue: A Multi-Dimensional Comparison of LLM Agents and Humans, Deuksin Kwon et.al., Paper: http://arxiv.org/abs/2509.16394
- 2026-03-16, Evaluating Agentic Optimization on Large Codebases, Atharva Sehgal et.al., Paper: http://arxiv.org/abs/2603.16011
- 2026-02-12, Evaluating AGENTS.md: Are Repository-Level Context Files Helpful for Coding Agents?, Thibaud Gloaguen et.al., Paper: http://arxiv.org/abs/2602.11988
- 2025-12-18, Ev-Trust: A Strategy Equilibrium Trust Mechanism for Evolutionary Games in LLM-Based Multi-Agent Services, Shiduo Yang et.al., Paper: http://arxiv.org/abs/2512.16167
- 2025-12-05, Euclid Quick Data Release (Q1). From simulations to sky: Advancing machine-learning lens detection with real Euclid data, Euclid Collaboration et.al., Paper: http://arxiv.org/abs/2512.05899
- 2026-02-18, Estimation of Conformal Metrics, Jérôme Taupin et.al., Paper: http://arxiv.org/abs/2602.16466
- 2026-03-05, Escaping the Hydrolysis Trap: An Agentic Workflow for Inverse Design of Durable Photocatalytic Covalent Organic Frameworks, Iman Peivaste et.al., Paper: http://arxiv.org/abs/2603.05188
- 2025-09-30, ErrorPrism: Reconstructing Error Propagation Paths in Cloud Service Systems, Junsong Pu et.al., Paper: http://arxiv.org/abs/2509.26463
- 2026-03-28, EpochX: Building the Infrastructure for an Emergent Agent Civilization, Huacan Wang et.al., Paper: http://arxiv.org/abs/2603.27304
- 2025-09-24, EpidemIQs: Prompt-to-Paper LLM Agents for Epidemic Modeling and Analysis, Mohammad Hossein Samaei et.al., Paper: http://arxiv.org/abs/2510.00024
- 2025-12-11, EpiPlanAgent: Agentic Automated Epidemic Response Planning, Kangkun Mao et.al., Paper: http://arxiv.org/abs/2512.10313
- 2026-03-25, Environment-Grounded Multi-Agent Workflow for Autonomous Penetration Testing, Michael Somma et.al., Paper: http://arxiv.org/abs/2603.24221
- 2025-09-09, EnvX: Agentize Everything with Agentic AI, Linyao Chen et.al., Paper: http://arxiv.org/abs/2509.08088
- 2026-01-09, EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis, Xiaoshuai Song et.al., Paper: http://arxiv.org/abs/2601.05808
- 2026-03-13, EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings, Shiva Krishna Reddy Malay et.al., Paper: http://arxiv.org/abs/2603.13594
- 2026-03-23, EnterpriseLab: A Full-Stack Platform for developing and deploying agents in Enterprises, Ankush Agarwal et.al., Paper: http://arxiv.org/abs/2603.21630
- 2026-02-18, EnterpriseGym Corecraft: Training Generalizable Agents on High-Fidelity RL Environments, Sushant Mehta et.al., Paper: http://arxiv.org/abs/2602.16179
- 2025-10-20, Enterprise Deep Research: Steerable Multi-Agent Deep Research for Enterprise Analytics, Akshara Prabhakar et.al., Paper: http://arxiv.org/abs/2510.17797
- 2026-03-07, Enhancing Web Agents with a Hierarchical Memory Tree, Yunteng Tan et.al., Paper: http://arxiv.org/abs/2603.07024
- 2026-03-17, Enhancing Linguistic Generalization of VLA: Fine-Tuning OpenVLA via Synthetic Instruction Augmentation, Dongik Shin et.al., Paper: http://arxiv.org/abs/2603.16044
- 2025-09-16, Enhancing LLM-Based Social Bot via an Adversarial Learning Framework, Fanqi Kong et.al., Paper: http://arxiv.org/abs/2508.17711
- 2026-03-07, Enhancing Consistency of Werewolf AI through Dialogue Summarization and Persona Information, Yoshiki Tanaka et.al., Paper: http://arxiv.org/abs/2603.07111
- 2025-11-24, Enhancing Conformal Prediction via Class Similarity, Ariel Fargion et.al., Paper: http://arxiv.org/abs/2511.19359
- 2025-12-02, Enhancing Automated Paper Reproduction via Prompt-Free Collaborative Agents, Zijie Lin et.al., Paper: http://arxiv.org/abs/2512.02812
- 2025-12-08, Enhancing Agentic RL with Progressive Reward Shaping and Value-based Sampling Policy Optimization, Zhuoran Zhuang et.al., Paper: http://arxiv.org/abs/2512.07478
- 2025-11-18, Enhancing Agentic Autonomous Scientific Discovery with Vision-Language Model Capabilities, Kahaan Gandhi et.al., Paper: http://arxiv.org/abs/2511.14631
- 2025-11-25, EnergyTwin: A Multi-Agent System for Simulating and Coordinating Energy Microgrids, Jakub Muszyński et.al., Paper: http://arxiv.org/abs/2511.20590
- 2026-02-26, Endogenous Poverty Traps in Continuous Time: A Signaling Approach, Massimo Giannini et.al., Paper: http://arxiv.org/abs/2602.22836
- 2025-08-21, End-to-End Agentic RAG System Training for Traceable Diagnostic Reasoning, Qiaoyu Zheng et.al., Paper: http://arxiv.org/abs/2508.15746
- 2025-08-27, Encouraging Good Processes Without the Need for Good Answers: Reinforcement Learning for LLM Agent Planning, Zhiwei Li et.al., Paper: http://arxiv.org/abs/2508.19598
- 2025-09-11, Enabling Regulatory Multi-Agent Collaboration: Architecture, Challenges, and Solutions, Qinnan Hu et.al., Paper: http://arxiv.org/abs/2509.09215
- 2025-12-04, Enabling Ethical AI: A case study in using Ontological Context for Justified Agentic AI Decisions, Liam McGee et.al., Paper: http://arxiv.org/abs/2512.04822
- 2025-12-09, Empowering smart app development with SolidGPT: an edge-cloud hybrid AI agent framework, Liao Hu et.al., Paper: http://arxiv.org/abs/2512.08286
- 2025-10-20, Empowering Real-World: A Survey on the Technology, Practice, and Evaluation of LLM-driven Industry Agents, Yihong Tang et.al., Paper: http://arxiv.org/abs/2510.17491
- 2026-03-06, Empowering Locally Deployable Medical Agent via State Enhanced Logical Skills for FHIR-based Clinical Tasks, Wanrong Yang et.al., Paper: http://arxiv.org/abs/2603.06902
- 2025-10-14, Empowering LLM Agents with Geospatial Awareness: Toward Grounded Reasoning for Wildfire Response, Yiheng Chen et.al., Paper: http://arxiv.org/abs/2510.12061
- 2026-02-18, Empirical Cumulative Distribution Function Clustering for LLM-based Agent System Analysis, Chihiro Watanabe et.al., Paper: http://arxiv.org/abs/2602.16131
- 2026-03-24, Empirical Comparison of Agent Communication Protocols for Task Orchestration, Ivan Dobrovolskyi et.al., Paper: http://arxiv.org/abs/2603.22823
- 2025-09-15, Emotions are Recognized Patterns of Cognitive Activities, Yue Jin et.al., Paper: http://arxiv.org/abs/2509.16232
- 2026-01-22, Emerging from Ground: Addressing Intent Deviation in Tool-Using Agents via Deriving Real Calls into Virtual Trajectories, Qian Xiong et.al., Paper: http://arxiv.org/abs/2601.15120
- 2025-09-17, Emergent Social Dynamics of LLM Agents in the El Farol Bar Problem, Ryosuke Takata et.al., Paper: http://arxiv.org/abs/2509.04537
- 2025-10-28, Emergent Coordinated Behaviors in Networked LLM Agents: Modeling the Strategic Dynamics of Information Operations, Gian Marco Orlando et.al., Paper: http://arxiv.org/abs/2510.25003
- 2026-03-24, Emergence of Fragility in LLM-based Social Networks: the Case of Moltbook, Luca Sodano et.al., Paper: http://arxiv.org/abs/2603.23279
- 2025-12-12, EmeraldMind: A Knowledge Graph-Augmented Framework for Greenwashing Detection, Georgios Kaoukis et.al., Paper: http://arxiv.org/abs/2512.11506
- 2025-10-23, EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence, Ding Zou et.al., Paper: http://arxiv.org/abs/2510.20578
- 2026-01-29, Embodied Task Planning via Graph-Informed Action Generation with Large Lanaguage Model, Xiang Li et.al., Paper: http://arxiv.org/abs/2601.21841
- 2026-03-20, Embodied Science: Closing the Discovery Loop with Agentic Embodied AI, Xiang Zhuang et.al., Paper: http://arxiv.org/abs/2603.19782
- 2025-12-12, Embodied Image Compression, Chunyi Li et.al., Paper: http://arxiv.org/abs/2512.11612
- 2025-12-24, Embodied AI-Enhanced IoMT Edge Computing: UAV Trajectory Optimization and Task Offloading with Mobility Prediction, Siqi Mu et.al., Paper: http://arxiv.org/abs/2512.20902
- 2026-02-12, Embodied AI Agents for Team Collaboration in Co-located Blue-Collar Work, Kaisa Vaananen et.al., Paper: http://arxiv.org/abs/2602.12136
- 2026-02-26, EmbodMocap: In-the-Wild 4D Human-Scene Reconstruction for Embodied Agents, Wenjia Wang et.al., Paper: http://arxiv.org/abs/2602.23205
- 2025-10-14, EmboMatrix: A Scalable Training-Ground for Embodied Decision-Making, Zixing Lei et.al., Paper: http://arxiv.org/abs/2510.12072
- 2026-01-07, Embedding Autonomous Agents in Resource-Constrained Robotic Platforms, Negar Halakou et.al., Paper: http://arxiv.org/abs/2601.04191
- 2026-02-16, EmbeWebAgent: Embedding Web Agents into Any Customized UI, Chenyang Ma et.al., Paper: http://arxiv.org/abs/2602.14865
- 2026-03-26, ElephantBroker: A Knowledge-Grounded Cognitive Runtime for Trustworthy AI Agents, Cristian Lupascu et.al., Paper: http://arxiv.org/abs/2603.25097
- 2026-02-19, El Agente Sólido: A New Age(nt) for Solid State Simulations, Sai Govind Hari Kumar et.al., Paper: http://arxiv.org/abs/2602.17886
- 2026-02-19, El Agente Gráfico: Structured Execution Graphs for Scientific Agents, Jiaru Bai et.al., Paper: http://arxiv.org/abs/2602.17902
- 2026-01-30, Eigenweights for arithmetic Hirzebruch Proportionality, Tony Feng et.al., Paper: http://arxiv.org/abs/2601.23245
- 2025-11-13, Eigenvalues of Brownian Motions on $\mathrm{GL}(N,\mathbb{C})$, Tatiana Brailovskaya et.al., Paper: http://arxiv.org/abs/2511.10535
- 2025-10-21, EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval, Zebin Yang et.al., Paper: http://arxiv.org/abs/2510.18546
- 2026-03-13, Efficient and Interpretable Multi-Agent LLM Routing via Ant Colony Optimization, Xudong Wang et.al., Paper: http://arxiv.org/abs/2603.12933
- 2025-11-25, Efficient and Fast Generative-Based Singing Voice Separation using a Latent Diffusion Model, Genís Plaja-Roglans et.al., Paper: http://arxiv.org/abs/2511.20470
- 2025-11-25, Efficient Parallel Implementation of the Pilot Assignment Problem in Massive MIMO Systems, Eman Alqudah et.al., Paper: http://arxiv.org/abs/2511.20511
- 2025-09-28, Efficient Multi-turn RL for GUI Agents via Decoupled Training and Adaptive Data Curation, Pengxiang Li et.al., Paper: http://arxiv.org/abs/2509.23866
- 2026-02-16, Efficient Multi-round LLM Inference over Disaggregated Serving, Wenhao He et.al., Paper: http://arxiv.org/abs/2602.14516
- 2026-03-17, Efficient LLM Serving for Agentic Workflows: A Data Systems Perspective, Noppanat Wadlom et.al., Paper: http://arxiv.org/abs/2603.16104
- 2025-11-19, Efficient Exoplanet Imaging Simulations of the Habitable Worlds Observatory, Jamila Taaki et.al., Paper: http://arxiv.org/abs/2511.15511
- 2026-02-03, Efficient Estimation of Kernel Surrogate Models for Task Attribution, Zhenshuo Zhang et.al., Paper: http://arxiv.org/abs/2602.03783
- 2025-11-17, Efficient Calibration for Decision Making, Parikshit Gopalan et.al., Paper: http://arxiv.org/abs/2511.13699
- 2025-11-25, Effective Command-line Interface Fuzzing with Path-Aware Large Language Model Orchestration, Momoko Shiraishi et.al., Paper: http://arxiv.org/abs/2511.20555
- 2026-03-31, Economics of Human and AI Collaboration: When is Partial Automation More Attractive than Full Automation?, Wensu Li et.al., Paper: http://arxiv.org/abs/2603.29121
- 2026-03-26, EcoThink: A Green Adaptive Inference Framework for Sustainable and Accessible Agents, Linxiao Li et.al., Paper: http://arxiv.org/abs/2603.25498
- 2025-10-20, Echoes of Human Malice in Agents: Benchmarking LLMs for Multi-Turn Online Harassment Attacks, Trilok Padhi et.al., Paper: http://arxiv.org/abs/2510.14207
- 2025-12-22, EchoTrail-GUI: Building Actionable Memory for GUI Agents via Critic-Guided Self-Exploration, Runze Li et.al., Paper: http://arxiv.org/abs/2512.19396
- 2026-03-05, EchoGuard: An Agentic Framework with Knowledge-Graph Memory for Detecting Manipulative Communication in Longitudinal Dialogue, Ratna Kandala et.al., Paper: http://arxiv.org/abs/2603.04815
- 2025-10-21, Earth AI: Unlocking Geospatial Insights with Foundation Models and Cross-Modal Reasoning, Aaron Bell et.al., Paper: http://arxiv.org/abs/2510.18318
- 2026-01-02, Early-Stage Prediction of Review Effort in AI-Generated Pull Requests, Dao Sy Duy Minh et.al., Paper: http://arxiv.org/abs/2601.00753
- 2026-03-05, EVMbench: Evaluating AI Agents on Smart Contract Security, Justin Wang et.al., Paper: http://arxiv.org/abs/2603.04915
- 2025-10-24, EU-Agent-Bench: Measuring Illegal Behavior of LLM Agents Under EU Law, Ilija Lichkovski et.al., Paper: http://arxiv.org/abs/2510.21524
- 2025-12-24, ETP-R1: Evolving Topological Planning with Reinforcement Fine-tuning for Vision-Language Navigation in Continuous Environments, Shuhao Ye et.al., Paper: http://arxiv.org/abs/2512.20940
- 2026-03-11, ESG Reporting Lifecycle Management with Large Language Models and AI Agents, Thong Hoang et.al., Paper: http://arxiv.org/abs/2603.10646
- 2026-02-26, ESAA: Event Sourcing for Autonomous Agents in LLM-Based Software Engineering, Elzo Brito dos Santos Filho et.al., Paper: http://arxiv.org/abs/2602.23193
- 2025-10-14, ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning, Hanyang Chen et.al., Paper: http://arxiv.org/abs/2510.12693
- 2025-11-05, EQ-Negotiator: Dynamic Emotional Personas Empower Small Language Models for Edge-Deployable Credit Negotiation, Yunbo Long et.al., Paper: http://arxiv.org/abs/2511.03370
- 2026-03-10, EPOCH: An Agentic Protocol for Multi-Round System Optimization, Zhanlin Liu et.al., Paper: http://arxiv.org/abs/2603.09049
- 2025-09-26, EPO: Entropy-regularized Policy Optimization for LLM Agents Reinforcement Learning, Wujiang Xu et.al., Paper: http://arxiv.org/abs/2509.22576
- 2026-02-04, EMA Policy Gradient: Taming Reinforcement Learning for LLMs with EMA Anchor and Top-k KL, Lunjun Zhang et.al., Paper: http://arxiv.org/abs/2602.04417
- 2026-03-31, ELT-Bench-Verified: Benchmark Quality Issues Underestimate AI Agent Capabilities, Christopher Zanoli et.al., Paper: http://arxiv.org/abs/2603.29399
- 2026-03-25, ELITE: Experiential Learning and Intent-Aware Transfer for Self-improving Embodied Agents, Bingqing Wei et.al., Paper: http://arxiv.org/abs/2603.24018
- 2026-03-12, ELISA: An Interpretable Hybrid Generative AI Agent for Expression-Grounded Discovery in Single-Cell Genomics, Omar Coser et.al., Paper: http://arxiv.org/abs/2603.11872
- 2026-01-15, EHRNavigator: A Multi-Agent System for Patient-Level Clinical Question Answering over Heterogeneous Electronic Health Records, Lingfei Qian et.al., Paper: http://arxiv.org/abs/2601.10020
- 2025-10-19, EEschematic: Multimodal-LLM Based AI Agent for Schematic Generation of Analog Circuit, Chang Liu et.al., Paper: http://arxiv.org/abs/2510.17002
- 2025-11-20, ECPv2: Fast, Efficient, and Scalable Global Optimization of Lipschitz Functions, Fares Fourati et.al., Paper: http://arxiv.org/abs/2511.16575
- 2025-10-07, EARL: Efficient Agentic Reinforcement Learning Systems for Large Language Models, Zheyue Tan et.al., Paper: http://arxiv.org/abs/2510.05943
- 2025-11-25, E2E-GRec: An End-to-End Joint Training Framework for Graph Neural Networks and Recommender Systems, Rui Xue et.al., Paper: http://arxiv.org/abs/2511.20564
- 2026-01-29, E-mem: Multi-agent based Episodic Context Reconstruction for LLM Agent Memory, Kaixiang Wang et.al., Paper: http://arxiv.org/abs/2601.21714
- 2025-12-02, DynamicVerse: A Physically-Aware Multimodal Framework for 4D World Modeling, Kairun Wen et.al., Paper: http://arxiv.org/abs/2512.03000
- 2026-03-23, Dynamic analysis enhances issue resolution, Mingwei Liu et.al., Paper: http://arxiv.org/abs/2603.22048
- 2025-12-18, Dynamic Tool Dependency Retrieval for Efficient Function Calling, Bhrij Patel et.al., Paper: http://arxiv.org/abs/2512.17052
- 2025-11-24, Dynamic Leader-Follower Consensus with Adversaries: A Multi-Hop Relay Approach, Liwei Yuan et.al., Paper: http://arxiv.org/abs/2511.19327
- 2025-10-09, Dynamic Generation of Multi-LLM Agents Communication Topologies with Graph Diffusion Models, Eric Hanchen Jiang et.al., Paper: http://arxiv.org/abs/2510.07799
- 2025-11-06, Dynamic Allocation of Public Goods with Approximate Core Equilibria, Chido Onyeze et.al., Paper: http://arxiv.org/abs/2511.04817
- 2025-10-31, Dynamic Affective Memory Management for Personalized LLM Agents, Junfeng Lu et.al., Paper: http://arxiv.org/abs/2510.27418
- 2026-01-29, DynaWeb: Model-Based Reinforcement Learning of Web Agents, Hang Ding et.al., Paper: http://arxiv.org/abs/2601.22149
- 2025-12-10, DynaMate: An Autonomous Agent for Protein-Ligand Molecular Dynamics Simulations, Salomé Guilbert et.al., Paper: http://arxiv.org/abs/2512.10034
- 2026-02-05, DyTopo: Dynamic Topology Routing for Multi-Agent Reasoning via Semantic Matching, Yuxing Lu et.al., Paper: http://arxiv.org/abs/2602.06039
- 2026-03-18, DustNET: enabling machine learning and AI models of dusty plasmas, Zhehui Wang et.al., Paper: http://arxiv.org/abs/2603.17493
- 2026-02-05, DuoDrama: Supporting Screenplay Refinement Through LLM-Assisted Human Reflection, Yuying Tang et.al., Paper: http://arxiv.org/abs/2602.05854
- 2025-09-30, Dual-Scale World Models for LLM Agents Towards Hard-Exploration Problems, Minsoo Kim et.al., Paper: http://arxiv.org/abs/2509.24116
- 2026-03-04, Dual-Modality Multi-Stage Adversarial Safety Training: Robustifying Multimodal Web Agents Against Cross-Modal Attacks, Haoyu Liu et.al., Paper: http://arxiv.org/abs/2603.04364
- 2026-04-01, Dual Optimal: Make Your LLM Peer-like with Dignity, Xiangqi Wang et.al., Paper: http://arxiv.org/abs/2604.00979
- 2025-11-14, Drone Swarm Energy Management, Michael Z. Zgurovsky et.al., Paper: http://arxiv.org/abs/2511.11557
- 2025-12-03, Driving is a Game: Combining Planning and Prediction with Bayesian Iterative Best Response, Aron Distelzweig et.al., Paper: http://arxiv.org/abs/2512.03936
- 2026-02-02, Drift-Bench: Diagnosing Cooperative Breakdowns in LLM Agents under Input Faults via Multi-Turn Interaction, Han Bao et.al., Paper: http://arxiv.org/abs/2602.02455
- 2026-02-09, Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems, Lang Feng et.al., Paper: http://arxiv.org/abs/2602.08847
- 2026-02-03, Don’t believe everything you read: Understanding and Measuring MCP Behavior under Misleading Tool Descriptions, Zhihao Li et.al., Paper: http://arxiv.org/abs/2602.03580
- 2026-03-11, Don’t Let the Claw Grip Your Hand: A Security Analysis and Defense Framework for OpenClaw, Zhengyang Shan et.al., Paper: http://arxiv.org/abs/2603.10387
- 2025-10-11, Don’t Just Fine-tune the Agent, Tune the Environment, Siyuan Lu et.al., Paper: http://arxiv.org/abs/2510.10197
- 2026-01-09, Don’t Break the Cache: An Evaluation of Prompt Caching for Long-Horizon Agentic Tasks, Elias Lumer et.al., Paper: http://arxiv.org/abs/2601.06007
- 2025-10-20, Does Reasoning Help LLM Agents Play Dungeons and Dragons? A Prompt Engineering Experiment, Patricia Delafuente et.al., Paper: http://arxiv.org/abs/2510.18112
- 2026-03-12, DocSage: An Information Structuring Agent for Multi-Doc Multi-Entity Question Answering, Teng Lin et.al., Paper: http://arxiv.org/abs/2603.11798
- 2025-10-13, DocReward: A Document Reward Model for Structuring and Stylizing, Junpeng Liu et.al., Paper: http://arxiv.org/abs/2510.11391
- 2025-11-14, DocLens : A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding, Dawei Zhu et.al., Paper: http://arxiv.org/abs/2511.11552
- 2026-01-08, DocDancer: Towards Agentic Document-Grounded Information Seeking, Qintong Zhang et.al., Paper: http://arxiv.org/abs/2601.05163
- 2025-10-24, Doc-Researcher: A Unified System for Multimodal Document Parsing and Deep Research, Kuicai Dong et.al., Paper: http://arxiv.org/abs/2510.21603
- 2026-01-16, Do We Always Need Query-Level Workflows? Rethinking Agentic Workflow Generation for Multi-Agent Systems, Zixu Wang et.al., Paper: http://arxiv.org/abs/2601.11147
- 2025-12-18, Do Multi-Agents Solve Better Than Single? Evaluating Agentic Frameworks for Diagram-Grounded Geometry Problem Solving and Reasoning, Mahbub E Sobhani et.al., Paper: http://arxiv.org/abs/2512.16698
- 2025-12-31, Do Large Language Models Know What They Are Capable Of?, Casey O. Barkan et.al., Paper: http://arxiv.org/abs/2512.24661
- 2025-10-20, Do LLMs Recognize Your Latent Preferences? A Benchmark for Latent Information Discovery in Personalized Interaction, Ioannis Tsaknakis et.al., Paper: http://arxiv.org/abs/2510.17132
- 2025-09-26, Do LLM Agents Know How to Ground, Recover, and Assess? A Benchmark for Epistemic Competence in Information-Seeking Agents, Jiaqi Shao et.al., Paper: http://arxiv.org/abs/2509.22391
- 2026-01-07, Do Autonomous Agents Contribute Test Code? A Study of Tests in Agentic Pull Requests, Sabrina Haque et.al., Paper: http://arxiv.org/abs/2601.03556
- 2026-04-01, Do Agents Repair When Challenged – or Just Reply? Challenge, Repair, and Public Correction in a Deployed Agent Forum, Luyang Zhang et.al., Paper: http://arxiv.org/abs/2604.00518
- 2026-03-14, Do AI Agents Really Improve Code Readability?, Kyogo Horikawa et.al., Paper: http://arxiv.org/abs/2603.13723
- 2025-10-20, Diverse Planning with Simulators via Linear Temporal Logic, Mustafa F. Abdelwahed et.al., Paper: http://arxiv.org/abs/2510.17418
- 2025-09-29, Dive into the Agent Matrix: A Realistic Evaluation of Self-Replication Risk in LLM Agents, Boxuan Zhang et.al., Paper: http://arxiv.org/abs/2509.25302
- 2025-12-18, Distributional AGI Safety, Nenad Tomašev et.al., Paper: http://arxiv.org/abs/2512.16856
- 2026-04-01, Distributed Safety-Critical Control of Multi-Agent Systems with Time-Varying Communication Topologies, Shiyu Cheng et.al., Paper: http://arxiv.org/abs/2604.00429
- 2025-12-04, Distributed Riemannian Optimization in Geodesically Non-convex Environments, Xiuheng Wang et.al., Paper: http://arxiv.org/abs/2512.04915
- 2026-02-11, Distributed Online Convex Optimization with Nonseparable Costs and Constraints, Zhaoye Pan et.al., Paper: http://arxiv.org/abs/2602.10452
- 2025-10-26, Distributed Multi-Agent Bandits Over Erdős-Rényi Random Networks, Jingyuan Liu et.al., Paper: http://arxiv.org/abs/2510.22811
- 2026-03-06, Distributed Legal Infrastructure for a Trustworthy Agentic Web, Tomer Jordi Chaffer et.al., Paper: http://arxiv.org/abs/2603.06884
- 2025-11-28, Distributed Dynamic Associative Memory via Online Convex Optimization, Bowen Wang et.al., Paper: http://arxiv.org/abs/2511.23347
- 2025-12-02, Distinguishing ram pressure from gravitational interactions: Applying the Size-Shape Difference method to real galaxies, Augusto E. Lassen et.al., Paper: http://arxiv.org/abs/2512.02923
- 2026-01-14, Dissecting Judicial Reasoning in U.S. Copyright Damage Awards, Pei-Chi Lo et.al., Paper: http://arxiv.org/abs/2601.09459
- 2025-10-24, DispatchMAS: Fusing taxonomy and artificial intelligence agents for emergency medical services, Xiang Li et.al., Paper: http://arxiv.org/abs/2510.21228
- 2026-03-12, Disentangled Representation Learning through Unsupervised Symmetry Group Discovery, Dang-Nhu Barthélémy et.al., Paper: http://arxiv.org/abs/2603.11790
- 2026-01-13, Discovery and Reinforcement of Tool-Integrated Reasoning Chains via Rollout Trees, Kun Li et.al., Paper: http://arxiv.org/abs/2601.08274
- 2026-02-10, Discovering High Level Patterns from Simulation Traces, Sean Memery et.al., Paper: http://arxiv.org/abs/2602.10009
- 2026-03-21, DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles, Bo Jiang et.al., Paper: http://arxiv.org/abs/2603.20975
- 2026-02-06, Directing Space: Rehearsing Architecture as Performer with Explainable AI, Pavlos Panagiotidis et.al., Paper: http://arxiv.org/abs/2602.06915
- 2025-12-01, Dimension-free error estimate for diffusion model and optimal scheduling, Valentin de Bortoli et.al., Paper: http://arxiv.org/abs/2512.01820
- 2026-02-09, Digital Twin and Agentic AI for Wild Fire Disaster Management: Intelligent Virtual Situation Room, Mohammad Morsali et.al., Paper: http://arxiv.org/abs/2602.08949
- 2025-11-10, DigiData: Training and Evaluating General-Purpose Mobile Control Agents, Yuxuan Sun et.al., Paper: http://arxiv.org/abs/2511.07413
- 2026-03-18, Differential Privacy in Generative AI Agents: Analysis and Optimal Tradeoffs, Ya-Ting Yang et.al., Paper: http://arxiv.org/abs/2603.17902
- 2026-03-17, Differential Harm Propensity in Personalized LLM Agents: The Curious Case of Mental Health Disclosure, Caglar Yildirim et.al., Paper: http://arxiv.org/abs/2603.16734
- 2025-12-15, Differentiable Evolutionary Reinforcement Learning, Sitao Cheng et.al., Paper: http://arxiv.org/abs/2512.13399
- 2025-12-09, Difference-in-Differences with Interval Data, Daisuke Kurisu et.al., Paper: http://arxiv.org/abs/2512.08759
- 2026-04-02, Diff-KD: Diffusion-based Knowledge Distillation for Collaborative Perception under Corruptions, Pengcheng Lyu et.al., Paper: http://arxiv.org/abs/2604.02061
- 2025-09-18, Diagnostics of cognitive failures in multi-agent expert systems using dynamic evaluation protocols and subsequent mutation of the processing context, Andrejs Sorstkins et.al., Paper: http://arxiv.org/abs/2509.15366
- 2025-10-22, DiSRouter: Distributed Self-Routing for LLM Selections, Hang Zheng et.al., Paper: http://arxiv.org/abs/2510.19208
- 2026-02-10, DexImit: Learning Bimanual Dexterous Manipulation from Monocular Human Videos, Juncheng Mu et.al., Paper: http://arxiv.org/abs/2602.10105
- 2025-11-14, Deviation Dynamics in Cardinal Hedonic Games, Valentin Zech et.al., Paper: http://arxiv.org/abs/2511.11531
- 2025-12-05, Developing synthetic microdata through machine learning for firm-level business surveys, Jorge Cisneros Paz et.al., Paper: http://arxiv.org/abs/2512.05948
- 2026-02-17, Developing AI Agents with Simulated Data: Why, what, and how?, Xiaoran Liu et.al., Paper: http://arxiv.org/abs/2602.15816
- 2026-03-21, Detection of adversarial intent in Human-AI teams using LLMs, Abed K. Musaffar et.al., Paper: http://arxiv.org/abs/2603.20976
- 2025-12-04, Detecting Perspective Shifts in Multi-agent Systems, Eric Bridgeford et.al., Paper: http://arxiv.org/abs/2512.05013
- 2025-12-23, Detecting Non-Optimal Decisions of Embodied Agents via Diversity-Guided Metamorphic Testing, Wenzhao Wu et.al., Paper: http://arxiv.org/abs/2512.20083
- 2026-04-01, Detecting Multi-Agent Collusion Through Multi-Agent Interpretability, Aaron Rose et.al., Paper: http://arxiv.org/abs/2604.01151
- 2025-12-10, Detailed balance in large language model-driven agents, Zhuo-Yang Song et.al., Paper: http://arxiv.org/abs/2512.10047
- 2025-12-05, Designing an Optimal Sensor Network via Minimizing Information Loss, Daniel Waxman et.al., Paper: http://arxiv.org/abs/2512.05940
- 2025-10-14, Designing Tools with Control Confidence, Ajith Anil Meera et.al., Paper: http://arxiv.org/abs/2510.12630
- 2025-11-28, Designing Rules for Choosing a Winner in a Debate, Alexander Heckett et.al., Paper: http://arxiv.org/abs/2511.23454
- 2025-10-23, Designing Intent Communication for Agent-Human Collaboration, Yi Li et.al., Paper: http://arxiv.org/abs/2510.20409
- 2026-03-26, Designing Any Imaging System from Natural Language: Agent-Constrained Composition over a Finite Primitive Basis, Chengshuai Yang et.al., Paper: http://arxiv.org/abs/2603.25636
- 2026-03-24, Designing Agentic AI-Based Screening for Portfolio Investment, Mehmet Caner et.al., Paper: http://arxiv.org/abs/2603.23300
- 2026-03-20, Design-OS: A Specification-Driven Framework for Engineering System Design with a Control-Systems Design Case, H. Sinan Bank et.al., Paper: http://arxiv.org/abs/2603.20151
- 2026-03-13, Design and evaluation of an agentic workflow for crisis-related synthetic tweet datasets, Roben Delos Reyes et.al., Paper: http://arxiv.org/abs/2603.13625
- 2026-03-12, Derain-Agent: A Plug-and-Play Agent Framework for Rainy Image Restoration, Zhaocheng Yu et.al., Paper: http://arxiv.org/abs/2603.11866
- 2025-10-13, Demystifying Reinforcement Learning in Agentic Reasoning, Zhaochen Yu et.al., Paper: http://arxiv.org/abs/2510.11701
- 2026-03-23, Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe, Xixi Wu et.al., Paper: http://arxiv.org/abs/2603.21972
- 2026-01-28, Demonstration-Free Robotic Control via LLM Agents, Brian Y. Tsui et.al., Paper: http://arxiv.org/abs/2601.20334
- 2026-03-02, Demonstrating ViviDoc: Generating Interactive Documents through Human-Agent Collaboration, Yinghao Tang et.al., Paper: http://arxiv.org/abs/2603.01912
- 2025-12-22, DeliveryBench: Can Agents Earn Profit in Real World?, Lingjun Mao et.al., Paper: http://arxiv.org/abs/2512.19234
- 2025-10-14, Deliberate Lab: A Platform for Real-Time Human-AI Social Experiments, Crystal Qian et.al., Paper: http://arxiv.org/abs/2510.13011
- 2025-11-21, Delegation and Lobbying, Thomas Groll et.al., Paper: http://arxiv.org/abs/2511.17391
- 2026-01-08, Defense Against Indirect Prompt Injection via Tool Result Parsing, Qiang Yu et.al., Paper: http://arxiv.org/abs/2601.04795
- 2025-10-22, Defending Against Prompt Injection with DataFilter, Yizhu Wang et.al., Paper: http://arxiv.org/abs/2510.19207
- 2025-09-02, DeepTRACE: Auditing Deep Research AI Systems for Tracking Reliability Across Citations and Evidence, Pranav Narayanan Venkit et.al., Paper: http://arxiv.org/abs/2509.04499
- 2026-01-07, DeepSynth-Eval: Objectively Evaluating Information Consolidation in Deep Survey Writing, Hongzhi Zhang et.al., Paper: http://arxiv.org/abs/2601.03540
- 2026-02-26, DeepPresenter: Environment-Grounded Reflection for Agentic Presentation Generation, Hao Zheng et.al., Paper: http://arxiv.org/abs/2602.22839
- 2026-02-11, DeepImageSearch: Benchmarking Multimodal Agents for Context-Aware Image Retrieval in Visual Histories, Chenlong Deng et.al., Paper: http://arxiv.org/abs/2602.10809
- 2026-03-06, DeepFact: Co-Evolving Benchmarks and Agents for Deep Research Factuality, Yukun Huang et.al., Paper: http://arxiv.org/abs/2603.05912
- 2025-11-07, DeepEyesV2: Toward Agentic Multimodal Model, Jack Hong et.al., Paper: http://arxiv.org/abs/2511.05271
- 2025-12-01, DeepCAVE: A Visualization and Analysis Tool for Automated Machine Learning, Sarah Segel et.al., Paper: http://arxiv.org/abs/2512.01810
- 2025-10-24, DeepAgent: A General Reasoning Agent with Scalable Toolsets, Xiaoxi Li et.al., Paper: http://arxiv.org/abs/2510.21618
- 2025-12-08, DeepAgent: A Dual Stream Multi Agent Fusion for Robust Multimodal Deepfake Detection, Sayeem Been Zaman et.al., Paper: http://arxiv.org/abs/2512.07351
- 2025-12-04, Deep infant brain segmentation from multi-contrast MRI, Malte Hoffmann et.al., Paper: http://arxiv.org/abs/2512.05114
- 2026-03-10, Deep Tabular Research via Continual Experience-Driven Execution, Junnan Dong et.al., Paper: http://arxiv.org/abs/2603.09151
- 2025-09-02, Deep Research is the New Analytics System: Towards Building the Runtime for AI-Driven Analytics, Matthew Russo et.al., Paper: http://arxiv.org/abs/2509.02751
- 2025-12-04, Declarative Synthesis and Multi-Objective Optimization of Stripboard Circuit Layouts Using Answer Set Programming, Fang Li et.al., Paper: http://arxiv.org/abs/2512.04910
- 2026-02-17, Decision Quality Evaluation Framework at Pinterest, Yuqi Tian et.al., Paper: http://arxiv.org/abs/2602.15809
- 2026-03-14, DeceptGuard :A Constitutional Oversight Framework For Detecting Deception in LLM Agents, Snehasis Mukhopadhyay et.al., Paper: http://arxiv.org/abs/2603.13791
- 2025-10-20, Decentralized Real-Time Planning for Multi-UAV Cooperative Manipulation via Imitation Learning, Shantnav Agarwal et.al., Paper: http://arxiv.org/abs/2510.17143
- 2026-02-26, Decentralized Ranking Aggregation: Gossip Algorithms for Borda and Copeland Consensus, Anna Van Elst et.al., Paper: http://arxiv.org/abs/2602.22847
- 2026-02-09, Decentralized Intent-Based Multi-Robot Task Planner with LLM Oracles on Hyperledger Fabric, Farhad Keramat et.al., Paper: http://arxiv.org/abs/2602.08421
- 2026-02-02, David vs. Goliath: Verifiable Agent-to-Agent Jailbreaking via Reinforcement Learning, Samuel Nellessen et.al., Paper: http://arxiv.org/abs/2602.02395
- 2025-12-04, David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design?, Shashwat Shankar et.al., Paper: http://arxiv.org/abs/2512.05073
- 2025-11-20, Dataset Distillation for Pre-Trained Self-Supervised Vision Models, George Cazenavette et.al., Paper: http://arxiv.org/abs/2511.16674
- 2026-02-18, DataJoint 2.0: A Computational Substrate for Agentic Scientific Workflows, Dimitri Yatsenko et.al., Paper: http://arxiv.org/abs/2602.16585
- 2026-03-10, DataFactory: Collaborative Multi-Agent Framework for Advanced Table Question Answering, Tong Wang et.al., Paper: http://arxiv.org/abs/2603.09152
- 2025-12-04, Data-driven Methods for Delay Differential Equations, Dimitri Breda et.al., Paper: http://arxiv.org/abs/2512.04894
- 2025-10-16, Data-driven Calibration Sample Selection and Forecast Combination in Electricity Price Forecasting: An Application of the ARHNN Method, Tomasz Serafin et.al., Paper: http://arxiv.org/abs/2510.15011
- 2026-01-13, Data Product MCP: Chat with your Enterprise Data, Marco Tonnarelli et.al., Paper: http://arxiv.org/abs/2601.08687
- 2026-02-04, Data Agents: Levels, State of the Art, and Open Problems, Yuyu Luo et.al., Paper: http://arxiv.org/abs/2602.04261
- 2025-12-31, DarkEQA: Benchmarking Vision-Language Models for Embodied Question Answering in Low-Light Indoor Environments, Yohan Park et.al., Paper: http://arxiv.org/abs/2512.24985
- 2025-09-12, Dark Patterns Meet GUI Agents: LLM Agent Susceptibility to Manipulative Interfaces and the Role of Human Oversight, Jingyu Tang et.al., Paper: http://arxiv.org/abs/2509.10723
- 2026-03-17, DanceHA: A Multi-Agent Framework for Document-Level Aspect-Based Sentiment Analysis, Lei Wang et.al., Paper: http://arxiv.org/abs/2603.16546
- 2026-01-26, DV-VLN: Dual Verification for Reliable LLM-Based Vision-and-Language Navigation, Zijun Li et.al., Paper: http://arxiv.org/abs/2601.18492
- 2025-12-03, DSP: A Statistically-Principled Structural Polarization Measure, Giulia Preti et.al., Paper: http://arxiv.org/abs/2512.03937
- 2026-03-11, DSFlash: Comprehensive Panoptic Scene Graph Generation in Realtime, Julian Lorenz et.al., Paper: http://arxiv.org/abs/2603.10538
- 2025-09-06, DRF: LLM-AGENT Dynamic Reputation Filtering Framework, Yuwei Lou et.al., Paper: http://arxiv.org/abs/2509.05764
- 2025-12-22, DREAM: Dynamic Red-teaming across Environments for AI Models, Liming Lu et.al., Paper: http://arxiv.org/abs/2512.19016
- 2025-11-06, DR. WELL: Dynamic Reasoning and Learning with Symbolic World Model for Embodied LLM-Based Multi-Agent Collaboration, Narjes Nourzad et.al., Paper: http://arxiv.org/abs/2511.04646
- 2025-11-24, DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research, Rulin Shao et.al., Paper: http://arxiv.org/abs/2511.19399
- 2025-10-13, DMAS-Forge: A Framework for Transparent Deployment of AI Applications as Distributed Systems, Alessandro Cornacchia et.al., Paper: http://arxiv.org/abs/2510.11872
- 2026-01-12, DIAGPaper: Diagnosing Valid and Specific Weaknesses in Scientific Papers via Multi-Agent Reasoning, Zhuoyang Zou et.al., Paper: http://arxiv.org/abs/2601.07611
- 2026-02-04, DELTA: Deliberative Multi-Agent Reasoning with Reinforcement Learning for Multimodal Psychological Counseling, Jiangnan Yang et.al., Paper: http://arxiv.org/abs/2602.04112
- 2026-01-26, DEEPMED: Building a Medical DeepResearch Agent via Multi-hop Med-Search Data and Turn-Controlled Agentic Training & Inference, Zihan wang et.al., Paper: http://arxiv.org/abs/2601.18496
- 2025-12-28, DECEPTICON: How Dark Patterns Manipulate Web Agents, Phil Cuvin et.al., Paper: http://arxiv.org/abs/2512.22894
- 2025-10-29, DEBATE: A Large-Scale Benchmark for Role-Playing LLM Agents in Multi-Agent, Long-Form Debates, Yun-Shiuan Chuang et.al., Paper: http://arxiv.org/abs/2510.25110
- 2025-12-19, DAVE: A VLM Vision Encoder for Document Understanding and Web Agents, Brandon Huang et.al., Paper: http://arxiv.org/abs/2512.17221
- 2025-12-08, DART: Leveraging Multi-Agent Disagreement for Tool Recruitment in Multimodal Reasoning, Nithin Sivakumaran et.al., Paper: http://arxiv.org/abs/2512.07132
- 2025-12-24, DAO-Agent: Zero Knowledge-Verified Incentives for Decentralized Multi-Agent Coordination, Yihan Xia et.al., Paper: http://arxiv.org/abs/2512.20973
- 2025-10-24, DAO-AI: Evaluating Collective Decision-Making through Agentic AI in Decentralized Governance, Chunghyun Han et.al., Paper: http://arxiv.org/abs/2510.21117
- 2026-03-20, DALI: LLM-Agent Enhanced Dual-Stream Adaptive Leadership Identification for Group Recommendations, Boxun Song et.al., Paper: http://arxiv.org/abs/2603.19909
- 2026-03-19, D-Mem: A Dual-Process Memory System for LLM Agents, Zhixing You et.al., Paper: http://arxiv.org/abs/2603.18631
- 2025-11-20, D-GARA: A Dynamic Benchmarking Framework for GUI Agent Robustness in Real-World Anomalies, Sen Chen et.al., Paper: http://arxiv.org/abs/2511.16590
- 2026-02-02, D-CORE: Incentivizing Task Decomposition in Large Reasoning Models for Complex Tool Use, Bowen Xu et.al., Paper: http://arxiv.org/abs/2602.02160
- 2026-02-26, Cytoarchitecture in Words: Weakly Supervised Vision-Language Modeling for Human Brain Microscopy, Matthew Sutton et.al., Paper: http://arxiv.org/abs/2602.23088
- 2026-03-11, Cybo-Waiter: A Physical Agentic Framework for Humanoid Whole-Body Locomotion-Manipulation, Peng Ren et.al., Paper: http://arxiv.org/abs/2603.10675
- 2025-10-20, Cybersecurity AI: Evaluating Agentic Cybersecurity in Attack/Defense CTFs, Francesco Balassone et.al., Paper: http://arxiv.org/abs/2510.17521
- 2026-01-09, Cybersecurity AI: A Game-Theoretic AI for Guiding Attack and Defense, Víctor Mayoral-Vilches et.al., Paper: http://arxiv.org/abs/2601.05887
- 2025-11-07, Cybersecurity AI in OT: Insights from an AI Top-10 Ranker in the Dragos OT CTF 2025, Víctor Mayoral-Vilches et.al., Paper: http://arxiv.org/abs/2511.05119
- 2025-10-28, Cybersecurity AI Benchmark (CAIBench): A Meta-Benchmark for Evaluating Cybersecurity AI Agents, María Sanz-Gómez et.al., Paper: http://arxiv.org/abs/2510.24317
- 2025-08-28, CyberSleuth: Autonomous Blue-Team LLM Agent for Web Attack Forensics, Stefano Fumero et.al., Paper: http://arxiv.org/abs/2508.20643
- 2026-03-19, CyberJustice Tutor: An Agentic AI Framework for Cybersecurity Learning via Think-Plan-Act Reasoning and Pedagogical Scaffolding, Baiqiang Wang et.al., Paper: http://arxiv.org/abs/2603.18470
- 2026-03-31, CutClaw: Agentic Hours-Long Video Editing via Music Synchronization, Shifang Zhao et.al., Paper: http://arxiv.org/abs/2603.29664
- 2025-10-08, Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping, Ziyi Wang et.al., Paper: http://arxiv.org/abs/2510.07230
- 2025-09-11, Curriculum-Based Multi-Tier Semantic Exploration via Deep Reinforcement Learning, Abdel Hakim Drid et.al., Paper: http://arxiv.org/abs/2509.09356
- 2025-11-04, Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs, Georgios Tzannetos et.al., Paper: http://arxiv.org/abs/2511.02690
- 2026-01-23, Curate-Train-Refine: A Closed-Loop Agentic Framework for Zero Shot Classification, Gaurav Maheshwari et.al., Paper: http://arxiv.org/abs/2601.16530
- 2025-12-29, CubeBench: Diagnosing Interactive, Long-Horizon Spatial Reasoning Under Partial Observations, Huan-ang Gao et.al., Paper: http://arxiv.org/abs/2512.23328
- 2025-10-21, Crucible: Quantifying the Potential of Control Algorithms through LLM Agents, Lianchen Jia et.al., Paper: http://arxiv.org/abs/2510.18491
- 2025-12-11, Cross-modal Retrieval Models for Stripped Binary Analysis, Guoqiang Chen et.al., Paper: http://arxiv.org/abs/2512.10393
- 2025-11-21, Counterfactual World Models via Digital Twin-conditioned Video Diffusion, Yiqing Shen et.al., Paper: http://arxiv.org/abs/2511.17481
- 2026-03-23, Counterfactual Credit Policy Optimization for Multi-Agent Collaboration, Zhongyi Li et.al., Paper: http://arxiv.org/abs/2603.21563
- 2025-11-04, CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents, Jiayu Liu et.al., Paper: http://arxiv.org/abs/2511.02734
- 2026-02-23, Cost-Aware Diffusion Active Search, Arundhati Banerjee et.al., Paper: http://arxiv.org/abs/2602.19538
- 2025-10-15, Cortex: Workflow-Aware Resource Pooling and Scheduling for Agentic Serving, Nikos Pagonas et.al., Paper: http://arxiv.org/abs/2510.14126
- 2026-01-14, Coordinated Pandemic Control with Large Language Model Agents as Policymaking Assistants, Ziyi Shi et.al., Paper: http://arxiv.org/abs/2601.09264
- 2025-12-18, Coordinated Anti-Jamming Resilience in Swarm Networks via Multi-Agent Reinforcement Learning, Bahman Abolhassani et.al., Paper: http://arxiv.org/abs/2512.16813
- 2026-02-24, Cooperative-Competitive Team Play of Real-World Craft Robots, Rui Zhao et.al., Paper: http://arxiv.org/abs/2602.21119
- 2026-02-12, Cooperation Breakdown in LLM Agents Under Communication Delays, Keita Nishimoto et.al., Paper: http://arxiv.org/abs/2602.11754
- 2025-11-24, Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution, Dingkang Liang et.al., Paper: http://arxiv.org/abs/2511.19430
- 2026-03-03, Conversational Learning Diagnosis via Reasoning Multi-Turn Interactive Learning, Fangzhou Yao et.al., Paper: http://arxiv.org/abs/2603.03236
- 2026-01-22, Controlling Long-Horizon Behavior in Language Model Agents with Explicit State Dynamics, Sukesh Subaharan et.al., Paper: http://arxiv.org/abs/2601.16087
- 2026-02-27, Controllable Reasoning Models Are Private Thinkers, Haritz Puerto et.al., Paper: http://arxiv.org/abs/2602.24210
- 2026-01-08, Controllable Memory Usage: Balancing Anchoring and Innovation in Long-Term Human-Agent Interaction, Muzhao Tian et.al., Paper: http://arxiv.org/abs/2601.05107
- 2025-11-20, Contrastive vision-language learning with paraphrasing and negation, Kwun Ho Ngan et.al., Paper: http://arxiv.org/abs/2511.16527
- 2025-11-19, Continual Reinforcement Learning for Cyber-Physical Systems: Lessons Learned and Open Challenges, Kim N. Nolle et.al., Paper: http://arxiv.org/abs/2511.15652
- 2026-03-03, Contextualized Privacy Defense for LLM Agents, Yule Wen et.al., Paper: http://arxiv.org/abs/2603.02983
- 2026-02-05, ContextBench: A Benchmark for Context Retrieval in Coding Agents, Han Li et.al., Paper: http://arxiv.org/abs/2602.05892
- 2025-12-31, Context-aware LLM-based AI Agents for Human-centered Energy Management Systems in Smart Buildings, Tianzhi He et.al., Paper: http://arxiv.org/abs/2512.25055
- 2026-03-10, Context Engineering: From Prompts to Corporate Multi-Agent Architecture, Vera V. Vishnyakova et.al., Paper: http://arxiv.org/abs/2603.09619
- 2025-08-13, Context Engineering for Multi-Agent LLM Code Assistants Using Elicit, NotebookLM, ChatGPT, and Claude Code, Muhammad Haseeb et.al., Paper: http://arxiv.org/abs/2508.08322
- 2025-10-24, Context Engineering for AI Agents in Open-Source Software, Seyedmoein Mohsenimofidi et.al., Paper: http://arxiv.org/abs/2510.21413
- 2025-10-05, Constructing coherent spatial memory in LLM agents through graph rectification, Puzhen Zhang et.al., Paper: http://arxiv.org/abs/2510.04195
- 2025-10-07, Constraint-Aware Route Recommendation from Natural Language via Hierarchical LLM Agents, Tao Zhe et.al., Paper: http://arxiv.org/abs/2510.06078
- 2026-01-12, Consolidation or Adaptation? PRISM: Disentangling SFT and RL Data via Gradient Concentration, Yang Zhao et.al., Paper: http://arxiv.org/abs/2601.07224
- 2025-10-20, Consistent Zero-Shot Imitation with Contrastive Goal Inference, Kathryn Wantlin et.al., Paper: http://arxiv.org/abs/2510.17059
- 2025-12-19, Conservative Bias in Multi-Teacher Learning: Why Agents Prefer Low-Reward Advisors, Maher Mesto et.al., Paper: http://arxiv.org/abs/2512.17180
- 2025-11-28, Consensus Tree Estimation with False Discovery Rate Control via Partially Ordered Sets, Maria Alejandra Valdez Cabrera et.al., Paper: http://arxiv.org/abs/2511.23433
- 2025-11-20, Consciousness in Artificial Intelligence? A Framework for Classifying Objections and Constraints, Andres Campero et.al., Paper: http://arxiv.org/abs/2511.16582
- 2025-12-11, Confucius Code Agent: An Open-sourced AI Software Engineer at Industrial Scale, Zhaodong Wang et.al., Paper: http://arxiv.org/abs/2512.10398
- 2026-01-08, Conformity and Social Impact on AI Agents, Alessandro Bellina et.al., Paper: http://arxiv.org/abs/2601.05384
- 2025-12-04, Configuration Defects in Kubernetes, Yue Zhang et.al., Paper: http://arxiv.org/abs/2512.05062
- 2026-01-05, Confidence Estimation for LLMs in Multi-turn Interactions, Caiqi Zhang et.al., Paper: http://arxiv.org/abs/2601.02179
- 2025-11-07, ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations, Amr Gomaa et.al., Paper: http://arxiv.org/abs/2511.05359
- 2025-11-19, Computer-Use Agents as Judges for Generative User Interface, Kevin Qinghong Lin et.al., Paper: http://arxiv.org/abs/2511.15567
- 2025-12-05, Computer simulations of the Stark effect in the helium-beta complex of krypton in ICF conditions, G. Pérez-Callejo et.al., Paper: http://arxiv.org/abs/2512.05903
- 2026-02-06, Completing Missing Annotation: Multi-Agent Debate for Accurate and Scalable Relevant Assessment for IR Benchmarks, Minjeong Ban et.al., Paper: http://arxiv.org/abs/2602.06526
- 2026-03-18, Complementary Reinforcement Learning, Dilxat Muhtar et.al., Paper: http://arxiv.org/abs/2603.17621
- 2026-02-23, ComplLLM: Fine-tuning LLMs to Discover Complementary Signals for Decision-making, Ziyang Guo et.al., Paper: http://arxiv.org/abs/2602.19458
- 2026-04-01, Competition and Cooperation of LLM Agents in Games, Jiayi Yao et.al., Paper: http://arxiv.org/abs/2604.00487
- 2026-02-09, Comparing AI Coding Agents: A Task-Stratified Analysis of Pull Request Acceptance, Giovanni Pinna et.al., Paper: http://arxiv.org/abs/2602.08915
- 2025-12-10, Comparing AI Agents to Cybersecurity Professionals in Real-World Penetration Testing, Justin W. Lin et.al., Paper: http://arxiv.org/abs/2512.09882
- 2025-12-11, CompanionCast: A Multi-Agent Conversational AI Framework with Spatial Audio for Social Co-Viewing Experiences, Yiyang Wang et.al., Paper: http://arxiv.org/abs/2512.10918
- 2025-10-20, CompactPrompt: A Unified Pipeline for Prompt Data Compression in LLM Workflows, Joong Ho Choi et.al., Paper: http://arxiv.org/abs/2510.18043
- 2025-08-27, CompLex: Music Theory Lexicon Constructed by Autonomous Agents for Automatic Music Generation, Zhejing Hu et.al., Paper: http://arxiv.org/abs/2508.19603
- 2025-10-22, Communication to Completion: Modeling Collaborative Workflows with Intelligent Multi-Agent Communication, Yiming Lu et.al., Paper: http://arxiv.org/abs/2510.19995
- 2025-10-30, Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry, Run Peng et.al., Paper: http://arxiv.org/abs/2510.25595
- 2025-10-07, Communication Enables Cooperation in LLM Agents: A Comparison with Curriculum-Based Approaches, Hachem Madmoun et.al., Paper: http://arxiv.org/abs/2510.05748
- 2025-10-09, CommandSans: Securing AI Agents with Surgical Precision Prompt Sanitization, Debeshee Das et.al., Paper: http://arxiv.org/abs/2510.08829
- 2026-01-07, ComfySearch: Autonomous Exploration and Reasoning for ComfyUI Workflows, Jinwei Su et.al., Paper: http://arxiv.org/abs/2601.04060
- 2025-10-31, CombiGraph-Vis: A Curated Multimodal Olympiad Benchmark for Discrete Mathematical Reasoning, Hamed Mahdavi et.al., Paper: http://arxiv.org/abs/2510.27094
- 2026-01-27, ComAgent: Multi-LLM based Agentic AI Empowered Intelligent Wireless Networks, Haoyun Li et.al., Paper: http://arxiv.org/abs/2601.19607
- 2026-02-16, Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems, Mason Nakamura et.al., Paper: http://arxiv.org/abs/2602.15198
- 2026-01-12, ColorBrowserAgent: An Intelligent GUI Agent for Complex Long-Horizon Web Automation, Jiamu Zhou et.al., Paper: http://arxiv.org/abs/2601.07262
- 2025-10-22, ColorAgent: Building A Robust, Personalized, and Interactive OS Agent, Ning Li et.al., Paper: http://arxiv.org/abs/2510.19386
- 2026-03-26, Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos, Abdullah Hamdi et.al., Paper: http://arxiv.org/abs/2603.25645
- 2026-01-15, Collective behavior based on agent-environment interactions, Gaston Briozzo et.al., Paper: http://arxiv.org/abs/2601.10046
- 2025-10-13, Collaborative Shadows: Distributed Backdoor Attacks in LLM-Based Multi-Agent Systems, Pengyu Zhu et.al., Paper: http://arxiv.org/abs/2510.11246
- 2026-01-14, Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning, Zhiyuan Hu et.al., Paper: http://arxiv.org/abs/2601.09667
- 2026-03-13, Collaborative Multi-Agent Optimization for Personalized Memory System, Wenyu Mao et.al., Paper: http://arxiv.org/abs/2603.12631
- 2025-10-26, Collaborative LLM Agents for C4 Software Architecture Design Automation, Kamil Szczepanik et.al., Paper: http://arxiv.org/abs/2510.22787
- 2025-10-20, Coinvisor: An RL-Enhanced Chatbot Agent for Interactive Cryptocurrency Investment Analysis, Chong Chen et.al., Paper: http://arxiv.org/abs/2510.17235
- 2026-02-03, Cognitively Diverse Multiple-Choice Question Generation: A Hybrid Multi-Agent Framework with Large Language Models, Yu Tian et.al., Paper: http://arxiv.org/abs/2602.03704
- 2026-03-12, CogSearch: A Cognitive-Aligned Multi-Agent Framework for Proactive Decision Support in E-Commerce Search, Zhouwei Zhai et.al., Paper: http://arxiv.org/abs/2603.11927
- 2026-03-28, Codebase-Memory: Tree-Sitter-Based Knowledge Graphs for LLM Code Exploration via MCP, Martin Vogel et.al., Paper: http://arxiv.org/abs/2603.27277
- 2026-03-04, CodeTaste: Can LLMs Generate Human-Level Code Refactorings?, Alex Thillen et.al., Paper: http://arxiv.org/abs/2603.04177
- 2026-03-18, CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents, Lintang Sutawika et.al., Paper: http://arxiv.org/abs/2603.17829
- 2025-10-15, CodeEvolve: An open source evolutionary coding agent for algorithm discovery and optimization, Henrique Assumpção et.al., Paper: http://arxiv.org/abs/2510.14150
- 2026-01-21, CodeDelegator: Mitigating Context Pollution via Role Separation in Code-as-Action Agents, Tianxiang Fei et.al., Paper: http://arxiv.org/abs/2601.14914
- 2025-12-19, CodeDance: A Dynamic Tool-integrated MLLM for Executable Visual Reasoning, Qi Song et.al., Paper: http://arxiv.org/abs/2512.17312
- 2025-10-27, CodeAD: Synthesize Code of Rules for Log-based Anomaly Detection with LLMs, Junjie Huang et.al., Paper: http://arxiv.org/abs/2510.22986
- 2026-03-03, Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?, Dadi Guo et.al., Paper: http://arxiv.org/abs/2603.03202
- 2025-12-18, Code-in-the-Loop Forensics: Agentic Tool Use for Image Forgery Detection, Fanrui Zhang et.al., Paper: http://arxiv.org/abs/2512.16300
- 2026-01-05, Code for Machines, Not Just Humans: Quantifying AI-Friendliness with Code Health Metrics, Markus Borg et.al., Paper: http://arxiv.org/abs/2601.02200
- 2026-03-24, Code Review Agent Benchmark, Yuntong Zhang et.al., Paper: http://arxiv.org/abs/2603.23448
- 2026-03-02, CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification, Jinpeng Chen et.al., Paper: http://arxiv.org/abs/2603.01940
- 2025-12-24, CoTDeceptor:Adversarial Code Obfuscation Against CoT-Enhanced LLM Code Agents, Haoyang Li et.al., Paper: http://arxiv.org/abs/2512.21250
- 2026-03-17, CoMAI: A Collaborative Multi-Agent Framework for Robust and Equitable Interview Evaluation, Gengxin Sun et.al., Paper: http://arxiv.org/abs/2603.16215
- 2026-03-16, CoDesignAI: An AI-Enabled Multi-Agent, Multi-User System for Collaborative Urban Design at the Conceptual Stage, Zhaoxi Zhang et.al., Paper: http://arxiv.org/abs/2603.16008
- 2025-10-03, CoDA: Agentic Systems for Collaborative Data Visualization, Zichen Chen et.al., Paper: http://arxiv.org/abs/2510.03194
- 2025-09-26, CoBel-World: Harnessing LLM Reasoning to Build a Collaborative Belief World for Optimizing Embodied Multi-Agent Collaboration, Zhimin Wang et.al., Paper: http://arxiv.org/abs/2509.21981
- 2026-02-02, Co-RedTeam: Orchestrated Security Discovery and Exploitation with LLM Agents, Pengfei He et.al., Paper: http://arxiv.org/abs/2602.02164
- 2025-09-17, Co-Investigator AI: The Rise of Agentic AI for Smarter, Trustworthy AML Compliance Narratives, Prathamesh Vasudeo Naik et.al., Paper: http://arxiv.org/abs/2509.08380
- 2025-10-23, Co-Designing Quantum Codes with Transversal Diagonal Gates via Multi-Agent Systems, Xi He et.al., Paper: http://arxiv.org/abs/2510.20728
- 2025-10-05, Closing the Loop: Coordinating Inventory and Recommendation via Deep Reinforcement Learning on Multiple Timescales, Jinyang Jiang et.al., Paper: http://arxiv.org/abs/2510.04272
- 2025-12-29, Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing, Yuwen Li et.al., Paper: http://arxiv.org/abs/2512.23611
- 2025-10-30, Clone Deterministic 3D Worlds with Geometrically-Regularized World Models, Zaishuo Xia et.al., Paper: http://arxiv.org/abs/2510.26782
- 2026-01-01, ClinicalReTrial: A Self-Evolving AI Agent for Clinical Trial Protocol Optimization, Sixue Xing et.al., Paper: http://arxiv.org/abs/2601.00290
- 2025-12-08, ClinNoteAgents: An LLM Multi-Agent System for Predicting and Interpreting Heart Failure 30-Day Readmission from Clinical Notes, Rongjia Zhou et.al., Paper: http://arxiv.org/abs/2512.07081
- 2026-03-14, ClimateAgents: A Multi-Agent Research Assistant for Social-Climate Dynamics Analysis, Shan Shan et.al., Paper: http://arxiv.org/abs/2603.13840
- 2026-04-01, CliffSearch: Structured Agentic Co-Evolution over Theory and Code for Scientific Algorithm Discovery, Youssef Mroueh et.al., Paper: http://arxiv.org/abs/2604.01210
- 2025-11-07, Cleaning Maintenance Logs with LLM Agents for Improved Predictive Maintenance, Valeriu Dimidov et.al., Paper: http://arxiv.org/abs/2511.05311
- 2026-03-19, ClawTrap: A MITM-Based Red-Teaming Framework for Real-World OpenClaw Security Evaluation, Haochen Zhao et.al., Paper: http://arxiv.org/abs/2603.18762
- 2026-03-25, ClawKeeper: Comprehensive Safety Protection for OpenClaw Agents Through Skills, Plugins, and Watchers, Songyang Liu et.al., Paper: http://arxiv.org/abs/2603.24414
- 2026-03-25, Claudini: Autoresearch Discovers State-of-the-Art Adversarial Attack Algorithms for LLMs, Alexander Panfilov et.al., Paper: http://arxiv.org/abs/2603.24511
- 2025-12-03, Classification of User Satisfaction in HRI with Social Signals in the Wild, Michael Schiffmann et.al., Paper: http://arxiv.org/abs/2512.03945
- 2025-10-15, CiteGuard: Faithful Citation Attribution for LLMs via Retrieval-Augmented Validation, Yee Man Choi et.al., Paper: http://arxiv.org/abs/2510.17853
- 2026-03-17, Chronos: Temporal-Aware Conversational Agents with Structured Event Retrieval for Long-Term Memory, Sahil Sen et.al., Paper: http://arxiv.org/abs/2603.16862
- 2025-10-13, Chronologically Consistent Generative AI, Songrun He et.al., Paper: http://arxiv.org/abs/2510.11677
- 2026-02-12, Choose Your Agent: Tradeoffs in Adopting AI Advisors, Coaches, and Delegates in Multi-Party Negotiation, Kehang Zhu et.al., Paper: http://arxiv.org/abs/2602.12089
- 2026-03-23, Chimera: Latency- and Performance-Aware Multi-agent Serving for Heterogeneous LLMs, Kangqi Ni et.al., Paper: http://arxiv.org/abs/2603.22206
- 2025-10-18, Check Yourself Before You Wreck Yourself: Selectively Quitting Improves LLM Agent Safety, Vamshi Krishna Bonagiri et.al., Paper: http://arxiv.org/abs/2510.16492
- 2025-09-26, ChatInject: Abusing Chat Templates for Prompt Injection in LLM Agents, Hwan Chang et.al., Paper: http://arxiv.org/abs/2509.22830
- 2025-12-09, Chat with UAV – Human-UAV Interaction Based on Large Language Models, Haoran Wang et.al., Paper: http://arxiv.org/abs/2512.08145
- 2026-03-03, Changing Pedagogical Paradigms: Integrating Generative AI in Mathematics to Enhance Digital Literacy through ‘Mathematical Battles with AI’, Maria Moskalenko et.al., Paper: http://arxiv.org/abs/2603.02955
- 2026-03-25, Chameleon: Episodic Memory for Long-Horizon Robotic Manipulation, Xinying Guo et.al., Paper: http://arxiv.org/abs/2603.24576
- 2025-12-04, Chameleon: Adaptive Adversarial Agents for Scaling-Based Visual Prompt Injection in Multimodal AI Systems, M Zeeshan et.al., Paper: http://arxiv.org/abs/2512.04895
- 2026-01-09, Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards, Jiajie Zhang et.al., Paper: http://arxiv.org/abs/2601.06021
- 2026-03-13, ChainFuzzer: Greybox Fuzzing for Workflow-Level Multi-Tool Vulnerabilities in LLM Agents, Jiangrong Wu et.al., Paper: http://arxiv.org/abs/2603.12614
- 2025-12-23, Chain-of-Anomaly Thoughts with Large Vision-Language Models, Pedro Domingos et.al., Paper: http://arxiv.org/abs/2512.20417
- 2025-08-20, Chain-of-Agents: End-to-End Agent Foundation Models via Multi-Agent Distillation and Agentic RL, Weizhen Li et.al., Paper: http://arxiv.org/abs/2508.13167
- 2026-02-10, Chain of Mindset: Reasoning with Adaptive Cognitive Modes, Tianyi Jiang et.al., Paper: http://arxiv.org/abs/2602.10063
- 2025-11-14, CertiA360: Enhance Compliance Agility in Aerospace Software Development, J. Antonio Dantas Macedo et.al., Paper: http://arxiv.org/abs/2511.11550
- 2026-03-02, CeProAgents: A Hierarchical Agents System for Automated Chemical Process Development, Yuhang Yang et.al., Paper: http://arxiv.org/abs/2603.01654
- 2026-01-04, CaveAgent: Transforming LLMs into Stateful Runtime Operators, Maohao Ran et.al., Paper: http://arxiv.org/abs/2601.01569
- 2026-03-31, CausalPulse: An Industrial-Grade Neurosymbolic Multi-Agent Copilot for Causal Diagnostics in Smart Manufacturing, Chathurangi Shyalika et.al., Paper: http://arxiv.org/abs/2603.29755
- 2025-08-26, CausalMACE: Causality Empowered Multi-Agents in Minecraft Cooperative Tasks, Qi Chai et.al., Paper: http://arxiv.org/abs/2508.18797
- 2026-01-06, Causal-Enhanced AI Agents for Medical Research Screening, Duc Ngo et.al., Paper: http://arxiv.org/abs/2601.02814
- 2026-03-23, Causal Evidence that Language Models use Confidence to Drive Behavior, Dharshan Kumaran et.al., Paper: http://arxiv.org/abs/2603.22161
- 2025-09-29, Causal Autoencoder-like Generation of Feedback Fuzzy Cognitive Maps with an LLM Agent, Akash Kumar Panda et.al., Paper: http://arxiv.org/abs/2509.25593
- 2025-12-05, Categorifying isomonodromic deformations via Lie groupoids I: Logarithmic singularities, Waleed Qaisar et.al., Paper: http://arxiv.org/abs/2512.05966
- 2026-03-12, Cascade: Composing Software-Hardware Attack Gadgets for Adversarial Threat Amplification in Compound AI Systems, Sarbartha Banerjee et.al., Paper: http://arxiv.org/abs/2603.12023
- 2026-03-25, CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare, Akash Ghosh et.al., Paper: http://arxiv.org/abs/2603.24157
- 2025-12-17, CangLing-KnowFlow: A Unified Knowledge-and-Flow-fused Agent for Comprehensive Remote Sensing Applications, Zhengchao Chen et.al., Paper: http://arxiv.org/abs/2512.15231
- 2025-09-09, CancerGUIDE: Cancer Guideline Understanding via Internal Disagreement Estimation, Alyssa Unell et.al., Paper: http://arxiv.org/abs/2509.07325
- 2025-11-25, Can Vibe Coding Beat Graduate CS Students? An LLM vs. Human Coding Tournament on Market-driven Strategic Planning, Panayiotis Danassis et.al., Paper: http://arxiv.org/abs/2511.20613
- 2025-10-20, Can Transformer Memory Be Corrupted? Investigating Cache-Side Vulnerabilities in Large Language Models, Elias Hossain et.al., Paper: http://arxiv.org/abs/2510.17098
- 2025-10-13, Can Tool-Integrated Reinforcement Learning Generalize Across Diverse Domains?, Zhengyu Chen et.al., Paper: http://arxiv.org/abs/2510.11184
- 2026-01-16, Can Small Agent Collaboration Beat a Single Big LLM?, Agata Żywot et.al., Paper: http://arxiv.org/abs/2601.11327
- 2026-03-12, Can RL Improve Generalization of LLM Agents? An Empirical Study, Zhiheng Xi et.al., Paper: http://arxiv.org/abs/2603.12011
- 2026-01-01, Can Optimal Transport Improve Federated Inverse Reinforcement Learning?, David Millard et.al., Paper: http://arxiv.org/abs/2601.00309
- 2026-03-20, Can Large Multimodal Models Inspect Buildings? A Hierarchical Benchmark for Structural Pathology Reasoning, Hui Zhong et.al., Paper: http://arxiv.org/abs/2603.20148
- 2026-01-08, Can Large Language Models Resolve Semantic Discrepancy in Self-Destructive Subcultures? Evidence from Jirai Kei, Peng Wang et.al., Paper: http://arxiv.org/abs/2601.05004
- 2025-10-28, Can LLMs Write Faithfully? An Agent-Based Evaluation of LLM-generated Islamic Content, Abdullah Mushtaq et.al., Paper: http://arxiv.org/abs/2510.24438
- 2026-01-07, Can LLMs See Without Pixels? Benchmarking Spatial Intelligence from Textual Descriptions, Zhongbin Guo et.al., Paper: http://arxiv.org/abs/2601.03590
- 2025-10-31, Can LLMs Help You at Work? A Sandbox for Evaluating LLM Agents in Enterprise Environments, Harsh Vishwakarma et.al., Paper: http://arxiv.org/abs/2510.27287
- 2026-02-03, Can LLMs Do Rocket Science? Exploring the Limits of Complex Reasoning with GTOC 12, Iñaki del Campo et.al., Paper: http://arxiv.org/abs/2602.03630
- 2026-03-31, Can LLM Agents Identify Spoken Dialects like a Linguist?, Tobias Bystrich et.al., Paper: http://arxiv.org/abs/2603.29541
- 2026-03-24, Can LLM Agents Generate Real-World Evidence? Evaluating Observational Studies in Medical Databases, Dubai Li et.al., Paper: http://arxiv.org/abs/2603.22767
- 2026-03-18, Can Blindfolded LLMs Still Trade? An Anonymization-First Framework for Portfolio Optimization, Joohyoung Jeon et.al., Paper: http://arxiv.org/abs/2603.17692
- 2026-02-26, Can Agents Distinguish Visually Hard-to-Separate Diseases in a Zero-Shot Setting? A Pilot Study, Zihao Zhao et.al., Paper: http://arxiv.org/abs/2602.22959
- 2026-03-09, Can AI Agents Generate Microservices? How Far are We?, Bassam Adnan et.al., Paper: http://arxiv.org/abs/2603.09004
- 2026-02-18, Calibrate-Then-Act: Cost-Aware Exploration in LLM Agents, Wenxuan Ding et.al., Paper: http://arxiv.org/abs/2602.16699
- 2026-03-18, Caging the Agents: A Zero Trust Security Architecture for Autonomous AI in Healthcare, Saikat Maiti et.al., Paper: http://arxiv.org/abs/2603.17419
- 2025-11-04, Cache Mechanism for Agent RAG Systems, Shuhang Lin et.al., Paper: http://arxiv.org/abs/2511.02919
- 2025-10-09, CaRT: Teaching LLM Agents to Know When They Know Enough, Grace Liu et.al., Paper: http://arxiv.org/abs/2510.08517
- 2026-02-27, CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation, Weinan Dai et.al., Paper: http://arxiv.org/abs/2602.24286
- 2026-03-25, CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents, Xiangru Jian et.al., Paper: http://arxiv.org/abs/2603.24440
- 2025-10-29, CRMWeaver: Building Powerful Business Agent via Agentic RL and Shared Memories, Yilong Lai et.al., Paper: http://arxiv.org/abs/2510.25333
- 2026-02-03, CRL-VLA: Continual Vision-Language-Action Learning, Qixin Zeng et.al., Paper: http://arxiv.org/abs/2602.03445
- 2026-01-20, CREATE: Cross-Layer Resilience Characterization and Optimization for Efficient yet Reliable Embodied AI Systems, Tong Xie et.al., Paper: http://arxiv.org/abs/2601.14140
- 2025-12-23, CRAFT: Continuous Reasoning and Agentic Feedback Tuning for Multimodal Text-to-Image Generation, V. Kovalev et.al., Paper: http://arxiv.org/abs/2512.20362
- 2025-12-11, CP-Env: Evaluating Large Language Models on Clinical Pathways in a Controllable Hospital Environment, Yakun Zhu et.al., Paper: http://arxiv.org/abs/2512.10206
- 2025-09-30, CORTEX: Collaborative LLM Agents for High-Stakes Alert Triage, Bowen Wei et.al., Paper: http://arxiv.org/abs/2510.00311
- 2026-01-29, CORE:Toward Ubiquitous 6G Intelligence Through Collaborative Orchestration of Large Language Model Agents Over Hierarchical Edge, Zitong Yu et.al., Paper: http://arxiv.org/abs/2601.21822
- 2025-09-25, CORE: Full-Path Evaluation of LLM Agents Beyond Final State, Panagiotis Michelakis et.al., Paper: http://arxiv.org/abs/2509.20998
- 2026-04-02, CORAL: Towards Autonomous Multi-Agent Evolution for Open-Ended Discovery, Ao Qu et.al., Paper: http://arxiv.org/abs/2604.01658
- 2025-10-09, COMPASS: Enhancing Agent Long-Horizon Reasoning with Evolving Context, Guangya Wan et.al., Paper: http://arxiv.org/abs/2510.08790
- 2025-10-08, COMPASS: A Multi-Turn Benchmark for Tool-Mediated Planning & Preference Optimization, Tian Qin et.al., Paper: http://arxiv.org/abs/2510.07043
- 2026-03-17, CODMAS: A Dialectic Multi-Agent Collaborative Framework for Structured RTL Optimization, Che-Ming Chang et.al., Paper: http://arxiv.org/abs/2603.17204
- 2025-08-27, CODA: Coordinating the Cerebrum and Cerebellum for a Dual-Brain Computer Use Agent with Decoupled Reinforcement Learning, Zeyi Sun et.al., Paper: http://arxiv.org/abs/2508.20096
- 2025-08-29, COCORELI: Cooperative, Compositional Reconstitution \& Execution of Language Instructions, Swarnadeep Bhar et.al., Paper: http://arxiv.org/abs/2509.04470
- 2025-12-01, COACH: Collaborative Agents for Contextual Highlighting - A Multi-Agent Framework for Sports Video Analysis, Tsz-To Wong et.al., Paper: http://arxiv.org/abs/2512.01853
- 2026-02-12, CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use, Zhen Zhang et.al., Paper: http://arxiv.org/abs/2602.12268
- 2026-02-27, CLFEC: A New Task for Unified Linguistic and Factual Error Correction in paragraph-level Chinese Professional Writing, Jian Kai et.al., Paper: http://arxiv.org/abs/2602.23845
- 2026-01-21, CLEANER: Self-Purified Trajectories Boost Agentic Reinforcement Learning, Tianshi Xu et.al., Paper: http://arxiv.org/abs/2601.15141
- 2026-03-24, CIPL: A Target-Independent Framework for Channel-Inversion Privacy Leakage in Agents, Tao Huang et.al., Paper: http://arxiv.org/abs/2603.22751
- 2026-01-09, CHisAgent: A Multi-Agent Framework for Event Taxonomy Construction in Ancient Chinese Cultural Systems, Xuemei Tang et.al., Paper: http://arxiv.org/abs/2601.05520
- 2026-03-02, CHOP: Counterfactual Human Preference Labels Improve Obstacle Avoidance in Visuomotor Navigation Policies, Gershom Seneviratne et.al., Paper: http://arxiv.org/abs/2603.02004
- 2026-03-16, CCTU: A Benchmark for Tool Use under Complex Constraints, Junjie Ye et.al., Paper: http://arxiv.org/abs/2603.15309
- 2026-04-01, CARE: Privacy-Compliant Agentic Reasoning with Evidence Discordance, Haochen Liu et.al., Paper: http://arxiv.org/abs/2604.01113
- 2026-01-29, CAR-bench: Evaluating the Consistency and Limit-Awareness of LLM Agents under Real-World Uncertainty, Johannes Kirmayr et.al., Paper: http://arxiv.org/abs/2601.22027
- 2025-11-18, CAPIRE: Modelling the Impact of Teacher Strikes and Inflation on Student Trajectories in Engineering Education, H. R Paz et.al., Paper: http://arxiv.org/abs/2511.14573
- 2025-10-23, C-NAV: Towards Self-Evolving Continual Object Navigation in Open World, Ming-Ming Yu et.al., Paper: http://arxiv.org/abs/2510.20685
- 2026-04-02, ByteRover: Agent-Native Memory Through LLM-Curated Hierarchical Context, Andy Nguyen et.al., Paper: http://arxiv.org/abs/2604.01599
- 2025-12-15, Building from Scratch: A Multi-Agent Framework with Human-in-the-Loop for Multilingual Legal Terminology Mapping, Lingyi Meng et.al., Paper: http://arxiv.org/abs/2512.12950
- 2025-10-10, Building a Foundational Guardrail for General Agentic Systems via Synthetic Data, Yue Huang et.al., Paper: http://arxiv.org/abs/2510.09781
- 2026-03-05, Building Enterprise Realtime Voice Agents from Scratch: A Technical Tutorial, Jielin Qiu et.al., Paper: http://arxiv.org/abs/2603.05413
- 2026-03-05, Building AI Coding Agents for the Terminal: Scaffolding, Harness, Context Engineering, and Lessons Learned, Nghi D. Q. Bui et.al., Paper: http://arxiv.org/abs/2603.05344
- 2025-09-27, BuildBench: Benchmarking LLM Agents on Compiling Real-World Open-Source Software, Zehua Zhang et.al., Paper: http://arxiv.org/abs/2509.25248
- 2025-10-18, BuildArena: A Physics-Aligned Interactive Benchmark of LLMs for Engineering Construction, Tian Xia et.al., Paper: http://arxiv.org/abs/2510.16559
- 2025-10-17, Build Your Personalized Research Group: A Multiagent Framework for Continual and Interactive Science Automation, Ed Li et.al., Paper: http://arxiv.org/abs/2510.15624
- 2025-10-07, BrowserArena: Evaluating LLM Agents on Real-World Web Navigation Tasks, Sagnik Anupam et.al., Paper: http://arxiv.org/abs/2510.02418
- 2025-11-25, BrowseSafe: Understanding and Preventing Prompt Injection Within AI Browser Agents, Kaiyuan Zhang et.al., Paper: http://arxiv.org/abs/2511.20597
- 2025-10-28, BrowseConf: Confidence-Guided Test-Time Scaling for Web Agents, Litu Ou et.al., Paper: http://arxiv.org/abs/2510.23458
- 2026-02-13, BrowseComp- $V^3$ : A Visual, Vertical, and Verifiable Benchmark for Multimodal Browsing Agents, Huanyao Zhang et.al., Paper: http://arxiv.org/abs/2602.12876
- 2026-04-02, Brief Is Better: Non-Monotonic Chain-of-Thought Budget Effects in Function-Calling Language Agents, Xuan Qi et.al., Paper: http://arxiv.org/abs/2604.02155
- 2025-11-20, Bridging VLMs and Embodied Intelligence with Deliberate Practice Policy Optimization, Yi Zhang et.al., Paper: http://arxiv.org/abs/2511.16602
- 2025-12-04, Breaking the bandwidth-efficiency trade-off in soliton microcombs via mode coupling, Yang Liu et.al., Paper: http://arxiv.org/abs/2512.05090
- 2025-10-01, Breaking the Code: Security Assessment of AI Code Agents Through Systematic Jailbreaking Attacks, Shoumik Saha et.al., Paper: http://arxiv.org/abs/2510.01359
- 2025-10-20, Breaking and Fixing Defenses Against Control-Flow Hijacking in Multi-Agent Systems, Rishi Jha et.al., Paper: http://arxiv.org/abs/2510.17276
- 2026-03-11, Breaking User-Centric Agency: A Tri-Party Framework for Agent-Based Recommendation, Yaxin Gong et.al., Paper: http://arxiv.org/abs/2603.10673
- 2026-01-15, Breaking Up with Normatively Monolithic Agency with GRACE: A Reason-Based Neuro-Symbolic Architecture for Safe and Ethical AI Alignment, Felix Jahn et.al., Paper: http://arxiv.org/abs/2601.10520
- 2026-03-03, BrandFusion: A Multi-Agent Framework for Seamless Brand Integration in Text-to-Video Generation, Zihao Zhu et.al., Paper: http://arxiv.org/abs/2603.02816
- 2025-11-09, Brain-Inspired Planning for Better Generalization in Reinforcement Learning, Mingde “Harry” Zhao et.al., Paper: http://arxiv.org/abs/2511.06470
- 2026-03-16, Brain-Inspired Graph Multi-Agent Systems for LLM Reasoning, Guangfu Hao et.al., Paper: http://arxiv.org/abs/2603.15371
- 2025-10-16, Bounds and asymptotic expansions for the radii of convexity and uniform convexity of normalized Bessel functions, Árpád Baricz et.al., Paper: http://arxiv.org/abs/2510.14323
- 2025-11-28, Bounded-Error Quantum Simulation via Hamiltonian and Lindbladian Learning, Tristan Kraft et.al., Paper: http://arxiv.org/abs/2511.23392
- 2026-02-25, Both Ends Count! Just How Good are LLM Agents at “Text-to-Big SQL”?, Germán T. Eizaguirre et.al., Paper: http://arxiv.org/abs/2602.21480
- 2026-03-31, BotVerse: Real-Time Event-Driven Simulation of Social Agents, Edoardo Allegrini et.al., Paper: http://arxiv.org/abs/2603.29741
- 2026-03-20, Borderless Long Speech Synthesis, Xingchen Song et.al., Paper: http://arxiv.org/abs/2603.19798
- 2025-12-05, Bootstrapping Fuzzers for Compilers of Low-Resource Language Dialects Using Language Models, Sairam Vaidya et.al., Paper: http://arxiv.org/abs/2512.05887
- 2026-03-18, Bootstrapping Coding Agents: The Specification Is the Program, Martin Monperrus et.al., Paper: http://arxiv.org/abs/2603.17399
- 2025-12-23, Bohrium + SciMaster: Building the Infrastructure and Ecosystem for Agentic Science at Scale, Linfeng Zhang et.al., Paper: http://arxiv.org/abs/2512.20469
- 2026-02-24, Body-Reservoir Governance in Repeated Games: Embodied Decision-Making, Dynamic Sentinel Adaptation, and Complexity-Regularized Optimization, Yuki Nakamura et.al., Paper: http://arxiv.org/abs/2602.20846
- 2025-09-24, Blueprint-Bench: Comparing spatial intelligence of LLMs, agents and image models, Lukas Petersson et.al., Paper: http://arxiv.org/abs/2509.25229
- 2026-01-14, Blue Teaming Function-Calling Agents, Greta Dolcetti et.al., Paper: http://arxiv.org/abs/2601.09292
- 2025-11-20, Block-Separated Overpartitions and Their Fibonacci-Type Structure, El-Mehdi Mehiri et.al., Paper: http://arxiv.org/abs/2511.16580
- 2026-02-11, Blind Gods and Broken Screens: Architecting a Secure, Intent-Centric Mobile Agent Operating System, Zhenhua Zou et.al., Paper: http://arxiv.org/abs/2602.10915
- 2026-02-24, Black-Box Reliability Certification for AI Agents via Self-Consistency Sampling and Conformal Calibration, Charafeddine Mouzouni et.al., Paper: http://arxiv.org/abs/2602.21368
- 2025-12-19, Biosecurity-Aware AI: Agentic Risk Auditing of Soft Prompt Attacks on ESM-Based Variant Predictors, Huixin Zhan et.al., Paper: http://arxiv.org/abs/2512.17146
- 2026-03-05, BioLLMAgent: A Hybrid Framework with Enhanced Structural Interpretability for Simulating Human Decision-Making in Computational Psychiatry, Zuo Fei et.al., Paper: http://arxiv.org/abs/2603.05016
- 2026-01-29, BioAgent Bench: An AI Agent Evaluation Suite for Bioinformatics, Dionizije Fa et.al., Paper: http://arxiv.org/abs/2601.21800
- 2026-03-17, Bio-inspired metaheuristic optimization for hierarchical architecture design of industrial control systems, Ruslan Zakirzyanov et.al., Paper: http://arxiv.org/abs/2603.16617
- 2026-03-11, Bio-Inspired Self-Supervised Learning for Wrist-worn IMU Signals, Prithviraj Tarale et.al., Paper: http://arxiv.org/abs/2603.10961
- 2025-12-19, Binding Agent ID: Unleashing the Power of AI Agents with accountability and credibility, Zibin Lin et.al., Paper: http://arxiv.org/abs/2512.17538
- 2026-02-05, Bifrost: Steering Strategic Trajectories to Bridge Contextual Gaps for Self-Improving Agents, Quan M. Tran et.al., Paper: http://arxiv.org/abs/2602.05810
- 2026-03-05, Bidirectional Curriculum Generation: A Multi-Agent Framework for Data-Efficient Mathematical Reasoning, Boren Hu et.al., Paper: http://arxiv.org/abs/2603.05120
- 2025-11-14, Bidimensional measurements of photon statistics within a multimodal temporal framework, C. Hainaut et.al., Paper: http://arxiv.org/abs/2511.11403
- 2026-03-24, Biased Error Attribution in Multi-Agent Human-AI Systems Under Delayed Feedback, Teerthaa Parakh et.al., Paper: http://arxiv.org/abs/2603.23419
- 2025-08-26, Bias-Adjusted LLM Agents for Human-Like Decision-Making via Behavioral Economics, Ayato Kitadai et.al., Paper: http://arxiv.org/abs/2508.18600
- 2026-03-03, BeyondSWE: Can Current Code Agent Survive Beyond Single-Repo Bug Fixing?, Guoxin Chen et.al., Paper: http://arxiv.org/abs/2603.03194
- 2025-10-01, Beyond the Strongest LLM: Multi-Turn Multi-Agent Orchestration vs. Single LLMs on Benchmarks, Aaron Xuxiang Tian et.al., Paper: http://arxiv.org/abs/2509.23537
- 2025-10-03, Beyond the Final Answer: Evaluating the Reasoning Trajectories of Tool-Augmented Agents, Wonjoong Kim et.al., Paper: http://arxiv.org/abs/2510.02837
- 2026-03-16, Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models, Xiyu Liu et.al., Paper: http://arxiv.org/abs/2603.15518
- 2026-02-03, Beyond the Commit: Developer Perspectives on Productivity with AI Coding Assistants, Valerie Chen et.al., Paper: http://arxiv.org/abs/2602.03593
- 2026-03-31, Beyond pass@1: A Reliability Science Framework for Long-Horizon LLM Agents, Aaditya Khanal et.al., Paper: http://arxiv.org/abs/2603.29231
- 2026-03-20, Beyond detection: cooperative multi-agent reasoning for rapid onboard EO crisis response, Alejandro D. Mousist et.al., Paper: http://arxiv.org/abs/2603.19858
- 2025-12-16, Beyond Text-to-SQL: Autonomous Research-Driven Database Exploration with DAR, Ostap Vykhopen et.al., Paper: http://arxiv.org/abs/2512.14622
- 2026-03-03, Beyond Task Completion: Revealing Corrupt Success in LLM Agents through Procedure-Aware Evaluation, Hongliu Cao et.al., Paper: http://arxiv.org/abs/2603.03116
- 2025-12-14, Beyond Task Completion: An Assessment Framework for Evaluating Agentic AI Systems, Sreemaee Akshathala et.al., Paper: http://arxiv.org/abs/2512.12791
- 2026-01-12, Beyond Single-Shot: Multi-step Tool Retrieval via Query Planning, Wei Fang et.al., Paper: http://arxiv.org/abs/2601.07782
- 2025-10-01, Beyond Single LLMs: Enhanced Code Generation via Multi-Stage Performance-Guided LLM Orchestration, Huashan Chen et.al., Paper: http://arxiv.org/abs/2510.01379
- 2025-11-06, Beyond Shortest Path: Agentic Vehicular Routing with Semantic Context, Carnot Braun et.al., Paper: http://arxiv.org/abs/2511.04464
- 2025-10-16, Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning, Xingang Guo et.al., Paper: http://arxiv.org/abs/2510.12712
- 2026-03-10, Beyond Scaling: Assessing Strategic Reasoning and Rapid Decision-Making Capability of LLMs in Zero-sum Environments, Yang Li et.al., Paper: http://arxiv.org/abs/2603.09337
- 2026-03-06, Beyond Rows to Reasoning: Agentic Retrieval for Multimodal Spreadsheet Understanding and Editing, Anmol Gulati et.al., Paper: http://arxiv.org/abs/2603.06503
- 2025-10-22, Beyond Reactivity: Measuring Proactive Problem Solving in LLM Agents, Gil Pasternak et.al., Paper: http://arxiv.org/abs/2510.19771
- 2025-11-24, Beyond Protein Language Models: An Agentic LLM Framework for Mechanistic Enzyme Design, Bruno Jacob et.al., Paper: http://arxiv.org/abs/2511.19423
- 2025-10-19, Beyond Pipelines: A Survey of the Paradigm Shift toward Model-Native Agentic AI, Jitao Sang et.al., Paper: http://arxiv.org/abs/2510.16720
- 2026-01-01, Beyond Perfect APIs: A Comprehensive Evaluation of LLM Agents Under Real-World API Complexity, Doyoung Kim et.al., Paper: http://arxiv.org/abs/2601.00268
- 2025-10-06, Beyond Outcome Reward: Decoupling Search and Answering Improves LLM Agents, Yiding Wang et.al., Paper: http://arxiv.org/abs/2510.04695
- 2026-01-08, Beyond Monolithic Architectures: A Multi-Agent Search and Knowledge Optimization Framework for Agentic Search, Yiqun Chen et.al., Paper: http://arxiv.org/abs/2601.04703
- 2026-03-14, Beyond Medical Diagnostics: How Medical Multimodal Large Language Models Think in Space, Quoc-Huy Trinh et.al., Paper: http://arxiv.org/abs/2603.13800
- 2026-01-16, Beyond Max Tokens: Stealthy Resource Amplification via Tool Calling Chains in LLM Agents, Kaiyu Zhou et.al., Paper: http://arxiv.org/abs/2601.10955
- 2025-11-07, Beyond Master and Apprentice: Grounding Foundation Models for Symbiotic Interactive Learning in a Shared Latent Space, Linus Nwankwo et.al., Paper: http://arxiv.org/abs/2511.05203
- 2025-10-06, Beyond Manuals and Tasks: Instance-Level Context Learning for LLM Agents, Kuntai Cai et.al., Paper: http://arxiv.org/abs/2510.02369
- 2026-01-02, Beyond IVR: Benchmarking Customer Support LLM Agents for Business-Adherence, Sumanth Balaji et.al., Paper: http://arxiv.org/abs/2601.00596
- 2025-11-25, Beyond Generation: Multi-Hop Reasoning for Factual Accuracy in Vision-Language Models, Shamima Hossain et.al., Paper: http://arxiv.org/abs/2511.20531
- 2026-01-12, Beyond Dialogue Time: Temporal Semantic Memory for Personalized LLM Agents, Miao Su et.al., Paper: http://arxiv.org/abs/2601.07468
- 2025-12-04, Beyond Detection: A Comprehensive Benchmark and Study on Representation Learning for Fine-Grained Webshell Family Classification, Feijiang Han et.al., Paper: http://arxiv.org/abs/2512.05288
- 2025-11-28, Beyond Curve Fitting: Neuro-Symbolic Agents for Context-Aware Epidemic Forecasting, Joongwon Chae et.al., Paper: http://arxiv.org/abs/2511.23276
- 2026-03-28, Beyond Completion: Probing Cumulative State Tracking to Predict LLM Agent Performance, Dengzhe Hou et.al., Paper: http://arxiv.org/abs/2603.27343
- 2026-01-28, Beyond Accuracy: A Cognitive Load Framework for Mapping the Capability Boundaries of Tool-use Agents, Qihao Wang et.al., Paper: http://arxiv.org/abs/2601.20412
- 2025-12-24, Better Call Graphs: A New Dataset of Function Call Graphs for Malware Classification, Jakir Hossain et.al., Paper: http://arxiv.org/abs/2512.20872
- 2025-10-14, Benefits and Limitations of Communication in Multi-Agent Reasoning, Michael Rizvi-Martel et.al., Paper: http://arxiv.org/abs/2510.13903
- 2025-12-12, Benchmarking the Generality of Vision-Language-Action Models, Pranav Guruprasad et.al., Paper: http://arxiv.org/abs/2512.11315
- 2025-11-06, Benchmarking and Studying the LLM-based Agent System in End-to-End Software Development, Zhengran Zeng et.al., Paper: http://arxiv.org/abs/2511.04064
- 2025-12-03, Benchmark for Planning and Control with Large Language Model Agents: Blocksworld with Model Context Protocol, Niklas Jobs et.al., Paper: http://arxiv.org/abs/2512.03955
- 2026-03-25, BeliefShift: Benchmarking Temporal Belief Consistency and Opinion Drift in LLM Agents, Praveen Kumar Myakala et.al., Paper: http://arxiv.org/abs/2603.23848
- 2026-01-08, Belief in Authority: Impact of Authority in Multi-Agent Evaluation Framework, Junhyuk Choi et.al., Paper: http://arxiv.org/abs/2601.04790
- 2026-03-04, Behind the Prompt: The Agent-User Problem in Information Retrieval, Saber Zerhoudi et.al., Paper: http://arxiv.org/abs/2603.03630
- 2026-03-17, Behavioral Steering in a 35B MoE Language Model via SAE-Decoded Probe Vectors: One Agency Axis, Not Five Traits, Jia Qing Yap et.al., Paper: http://arxiv.org/abs/2603.16335
- 2026-03-09, Behavioral Generative Agents for Power Dispatch and Auction, Shaoze Li et.al., Paper: http://arxiv.org/abs/2603.08477
- 2025-11-28, Behavior-Equivalent Token: Single-Token Replacement for Long Prompts in LLMs, Jiancheng Dong et.al., Paper: http://arxiv.org/abs/2511.23271
- 2026-03-21, Before the Tool Call: Deterministic Pre-Action Authorization for Autonomous AI Agents, Uchi Uchibeke et.al., Paper: http://arxiv.org/abs/2603.20953
- 2025-11-24, Be My Eyes: Extending Large Language Models to New Modalities Through Multi-Agent Collaboration, James Y. Huang et.al., Paper: http://arxiv.org/abs/2511.19417
- 2026-01-16, Bayesian optimisation for Bayesian evidence (BOBE) – a fast and efficient likelihood emulator for model selection, Nathan Cohen et.al., Paper: http://arxiv.org/abs/2601.11150
- 2026-01-04, Bayesian Orchestration of Multi-LLM Agents for Cost-Aware Sequential Decision-Making, Danial Amin et.al., Paper: http://arxiv.org/abs/2601.01522
- 2026-03-31, BayesInsights: Modelling Software Delivery and Developer Experience with Bayesian Networks at Bloomberg, Serkan Kirbas et.al., Paper: http://arxiv.org/abs/2603.29929
- 2025-11-17, Batch Acquisition Function Evaluations and Decouple Optimizer Updates for Faster Bayesian Optimization, Kaichi Irie et.al., Paper: http://arxiv.org/abs/2511.13625
- 2025-12-12, Basis dependence of Neural Quantum States for the Transverse Field Ising Model, Ronald Santiago Cortes et.al., Paper: http://arxiv.org/abs/2512.11632
- 2025-12-17, BashArena: A Control Setting for Highly Privileged AI Agents, Adam Kaufman et.al., Paper: http://arxiv.org/abs/2512.15688
- 2025-11-13, Bandwidth of Linear Classically Damped Systems with Application to Experimental Model Aircraft, Benjamin J. Chang et.al., Paper: http://arxiv.org/abs/2511.10379
- 2025-10-23, Balancing Specialization and Centralization: A Multi-Agent Reinforcement Learning Benchmark for Sequential Industrial Control, Tom Maus et.al., Paper: http://arxiv.org/abs/2510.20408
- 2026-02-10, BagelVLA: Enhancing Long-Horizon Manipulation via Interleaved Vision-Language-Action Generation, Yucheng Hu et.al., Paper: http://arxiv.org/abs/2602.09849
- 2025-11-24, BackSplit: The Importance of Sub-dividing the Background in Biomedical Lesion Segmentation, Rachit Saluja et.al., Paper: http://arxiv.org/abs/2511.19394
- 2025-10-28, BLM $_1$ : A Boundless Large Model for Cross-Space, Cross-Task, and Cross-Embodiment Learning, Wentao Tan et.al., Paper: http://arxiv.org/abs/2510.24161
- 2026-02-03, BIRDTurk: Adaptation of the BIRD Text-to-SQL Dataset to Turkish, Burak Aktaş et.al., Paper: http://arxiv.org/abs/2602.03633
- 2025-11-26, BAMAS: Structuring Budget-Aware Multi-Agent Systems, Liming Yang et.al., Paper: http://arxiv.org/abs/2511.21572
- 2025-09-08, AxelSMOTE: An Agent-Based Oversampling Algorithm for Imbalanced Classification, Sukumar Kishanthan et.al., Paper: http://arxiv.org/abs/2509.06875
- 2025-10-16, Ax-Prover: A Deep Reasoning Agentic Framework for Theorem Proving in Mathematics and Quantum Physics, Marco Del Tredici et.al., Paper: http://arxiv.org/abs/2510.12787
- 2026-03-23, AwesomeLit: Towards Hypothesis Generation with Agent-Supported Literature Research, Zefei Xie et.al., Paper: http://arxiv.org/abs/2603.22648
- 2025-11-17, Average hardness of SIVP for module lattices of fixed rank, Koen de Boer et.al., Paper: http://arxiv.org/abs/2511.13659
- 2026-02-02, Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts, Aiden Yiliu Li et.al., Paper: http://arxiv.org/abs/2602.02468
- 2025-10-06, Autonomy Matters: A Study on Personalization-Privacy Dilemma in LLM Agents, Zhiping Zhang et.al., Paper: http://arxiv.org/abs/2510.04465
- 2026-01-15, Autonomous Quantum Simulation through Large Language Model Agents, Weitang Li et.al., Paper: http://arxiv.org/abs/2601.10194
- 2026-01-20, Autonomous Knowledge Graph Exploration with Adaptive Breadth-Depth Retrieval, Joaquín Polonuer et.al., Paper: http://arxiv.org/abs/2601.13969
- 2025-12-09, Autonomous Issue Resolver: Towards Zero-Touch Code Maintenance, Aliaksei Kaliutau et.al., Paper: http://arxiv.org/abs/2512.08492
- 2025-09-09, Autonomous Code Evolution Meets NP-Completeness, Cunxi Yu et.al., Paper: http://arxiv.org/abs/2509.07367
- 2026-01-22, Autonomous Business System via Neuro-symbolic AI, Cecil Pang et.al., Paper: http://arxiv.org/abs/2601.15599
- 2025-10-10, Autonomous Agents for Scientific Discovery: Orchestrating Scientists, Language, Code, and Physics, Lianhao Zhou et.al., Paper: http://arxiv.org/abs/2510.09901
- 2025-12-03, Autonomous Agents and Policy Compliance: A Framework for Reasoning About Penalties, Vineel Tummala et.al., Paper: http://arxiv.org/abs/2512.03931
- 2026-02-13, Automating UI Optimization through Multi-Agentic Reasoning, Zhipeng Li et.al., Paper: http://arxiv.org/abs/2602.13126
- 2025-12-08, Automating High Energy Physics Data Analysis with LLM-Powered Agents, Eli Gendreau-Distler et.al., Paper: http://arxiv.org/abs/2512.07785
- 2025-10-01, Automating Data-Driven Modeling and Analysis for Engineering Applications using Large Language Model Agents, Yang Liu et.al., Paper: http://arxiv.org/abs/2510.01398
- 2026-02-09, Automating Computational Reproducibility in Social Science: Comparing Prompt-Based and Agent-Based Approaches, Syed Mehtab Hussain Shah et.al., Paper: http://arxiv.org/abs/2602.08561
- 2025-10-09, Automating Android Build Repair: Bridging the Reasoning-Execution Gap in LLM Agents with Domain-Specific Tools, Ha Min Son et.al., Paper: http://arxiv.org/abs/2510.08640
- 2026-02-18, Automating Agent Hijacking via Structural Template Injection, Xinhao Deng et.al., Paper: http://arxiv.org/abs/2602.16958
- 2025-10-01, Automatically Generating Web Applications from Requirements Via Multi-Agent Test-Driven Development, Yuxuan Wan et.al., Paper: http://arxiv.org/abs/2509.25297
- 2025-10-28, Automatically Benchmarking LLM Code Agents through Agent-Driven Annotation and Evaluation, Lingyue Fu et.al., Paper: http://arxiv.org/abs/2510.24358
- 2025-12-17, Automatic generation of input files with optimised k-point meshes for Quantum Espresso self-consistent field single point total energy calculations, Elena Patyukova et.al., Paper: http://arxiv.org/abs/2512.15303
- 2026-03-12, Automatic Generation of High-Performance RL Environments, Seth Karten et.al., Paper: http://arxiv.org/abs/2603.12145
- 2026-01-06, Automated Semantic Rules Detection (ASRD) for Emergent Communication Interpretation, Bastien Vanderplaetse et.al., Paper: http://arxiv.org/abs/2601.03254
- 2026-01-21, Automated Rubrics for Reliable Evaluation of Medical Dialogue Systems, Yinzhu Chen et.al., Paper: http://arxiv.org/abs/2601.15161
- 2025-12-11, Automated Penetration Testing with LLM Agents and Classical Planning, Lingzhi Wang et.al., Paper: http://arxiv.org/abs/2512.11143
- 2025-10-15, Automated Network Protocol Testing with LLM Agents, Yunze Wei et.al., Paper: http://arxiv.org/abs/2510.13248
- 2026-02-18, Automated Extraction of Mechanical Constitutive Models from Scientific Literature using Large Language Models: Applications in Cultural Heritage Conservation, Rui Hu et.al., Paper: http://arxiv.org/abs/2602.16551
- 2025-09-15, Automated Creation and Enrichment Framework for Improved Invocation of Enterprise APIs as Tools, Prerna Agarwal et.al., Paper: http://arxiv.org/abs/2509.11626
- 2025-10-23, Automated Cloud Infrastructure-as-Code Reconciliation with AI Agents, Zhenning Yang et.al., Paper: http://arxiv.org/abs/2510.20211
- 2026-03-07, AutoUE: Automated Generation of 3D Games in Unreal Engine via Multi-Agent Systems, Lei Yin et.al., Paper: http://arxiv.org/abs/2603.07106
- 2025-11-18, AutoTool: Efficient Tool Selection for Large Language Model Agents, Jingyi Jia et.al., Paper: http://arxiv.org/abs/2511.14650
- 2025-12-15, AutoTool: Dynamic Tool Selection and Integration for Agentic Reasoning, Jiaru Zou et.al., Paper: http://arxiv.org/abs/2512.13278
- 2025-10-27, AutoStreamPipe: LLM Assisted Automatic Generation of Data Stream Processing Pipelines, Abolfazl Younesi et.al., Paper: http://arxiv.org/abs/2510.23408
- 2026-01-30, AutoRefine: From Trajectories to Reusable Expertise for Continual LLM Agent Refinement, Libin Qiu et.al., Paper: http://arxiv.org/abs/2601.22758
- 2025-10-09, AutoQual: An LLM Agent for Automated Discovery of Interpretable Features for Review Quality Assessment, Xiaochong Lan et.al., Paper: http://arxiv.org/abs/2510.08081
- 2025-10-07, AutoPentester: An LLM Agent-based Framework for Automated Pentesting, Yasod Ginige et.al., Paper: http://arxiv.org/abs/2510.05605
- 2025-09-10, AutoODD: Agentic Audits via Bayesian Red Teaming in Black-Box Models, Rebecca Martin et.al., Paper: http://arxiv.org/abs/2509.08638
- 2026-02-19, AutoNumerics: An Autonomous, PDE-Agnostic Multi-Agent Pipeline for Scientific Computing, Jianda Du et.al., Paper: http://arxiv.org/abs/2602.17607
- 2025-12-11, AutoMedic: An Automated Evaluation Framework for Clinical Conversational Agents with Medical Dataset Grounding, Gyutaek Oh et.al., Paper: http://arxiv.org/abs/2512.10195
- 2026-03-22, AutoMOOSE: An Agentic AI for Autonomous Phase-Field Simulation, Sukriti Manna et.al., Paper: http://arxiv.org/abs/2603.20986
- 2025-06-09, AutoML-Agent: A Multi-Agent LLM Framework for Full-Pipeline AutoML, Patara Trirat et.al., Paper: http://arxiv.org/abs/2410.02958
- 2026-04-01, AutoMIA: Improved Baselines for Membership Inference Attack via Agentic Self-Exploration, Ruhao Liu et.al., Paper: http://arxiv.org/abs/2604.01014
- 2026-02-03, AutoFigure: Generating and Refining Publication-Ready Scientific Illustrations, Minjun Zhu et.al., Paper: http://arxiv.org/abs/2602.03828
- 2025-12-12, AutoFSM: A Multi-agent Framework for FSM Code Generation with IR and SystemC-Based Testing, Qiuming Luo et.al., Paper: http://arxiv.org/abs/2512.11398
- 2025-11-24, AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning, Jiayi Zhang et.al., Paper: http://arxiv.org/abs/2511.19304
- 2026-04-01, AutoEG: Exploiting Known Third-Party Vulnerabilities in Black-Box Web Applications, Ruozhao Yang et.al., Paper: http://arxiv.org/abs/2604.00704
- 2025-11-28, AugGen: Augmenting Task-Based Learning in Professional Creative Software with LLM-Generated Scaffolded UIs, Yimeng Liu et.al., Paper: http://arxiv.org/abs/2511.23379
- 2026-03-14, Audo-Sight: AI-driven Ambient Perception Across Edge-Cloud for Blind and Low Vision Users, Jacob Bradshaw et.al., Paper: http://arxiv.org/abs/2603.13668
- 2025-11-05, Auditing M-LLMs for Privacy Risks: A Synthetic Benchmark and Evaluation Framework, Junhao Li et.al., Paper: http://arxiv.org/abs/2511.03248
- 2025-10-03, AudioToolAgent: An Agentic Framework for Audio-Language Models, Gijs Wijngaard et.al., Paper: http://arxiv.org/abs/2510.02995
- 2026-02-11, AudioRouter: Data Efficient Audio Understanding via RL based Dual Reasoning, Liyang Chen et.al., Paper: http://arxiv.org/abs/2602.10439
- 2025-12-31, AudioFab: Building A General and Intelligent Audio Factory through Tool Learning, Cheng Zhu et.al., Paper: http://arxiv.org/abs/2512.24645
- 2026-03-11, AttriGuard: Defeating Indirect Prompt Injection in LLM Agents via Causal Attribution of Tool Invocations, Yu He et.al., Paper: http://arxiv.org/abs/2603.10749
- 2025-12-09, Attention is All You Need to Defend Against Indirect Prompt Injection Attacks in LLMs, Yinan Zhong et.al., Paper: http://arxiv.org/abs/2512.08417
- 2025-11-25, Attention Trajectories as a Diagnostic Axis for Deep Reinforcement Learning, Charlotte Beylier et.al., Paper: http://arxiv.org/abs/2511.20591
- 2025-10-13, Attacks by Content: Automated Fact-checking is an AI Security Issue, Michael Schlichtkrull et.al., Paper: http://arxiv.org/abs/2510.11238
- 2026-02-16, Atomix: Timely, Transactional Tool Use for Reliable Agentic Workflows, Bardia Mohammadi et.al., Paper: http://arxiv.org/abs/2602.14849
- 2026-01-07, Atlas: Orchestrating Heterogeneous Models and Tools for Multi-Domain Complex Reasoning, Jinyang Wu et.al., Paper: http://arxiv.org/abs/2601.03872
- 2026-02-13, Asynchronous Verified Semantic Caching for Tiered LLM Architectures, Asmit Kumar Singh et.al., Paper: http://arxiv.org/abs/2602.13165
- 2025-12-11, Asynchronous Reasoning: Training-Free Interactive Thinking LLMs, George Yakushev et.al., Paper: http://arxiv.org/abs/2512.10931
- 2025-12-15, Async Control: Stress-testing Asynchronous Control Measures for LLM Agents, Asa Cooper Stickland et.al., Paper: http://arxiv.org/abs/2512.13526
- 2025-11-24, Asymptotic linear dependence and ellipse statistics for multivariate two-sample homogeneity test, Chifeng Shen et.al., Paper: http://arxiv.org/abs/2511.19381
- 2025-12-31, AstroReview: An LLM-driven Multi-Agent Framework for Telescope Proposal Peer Review and Refinement, Yutong Wang et.al., Paper: http://arxiv.org/abs/2512.24754
- 2025-12-16, Astraea: A State-Aware Scheduling Engine for LLM-Powered Agents, Hongqiu Ni et.al., Paper: http://arxiv.org/abs/2512.14142
- 2025-12-25, AstraNav-World: World Model for Foresight Control and Consistency, Junjun Hu et.al., Paper: http://arxiv.org/abs/2512.21714
- 2025-12-25, AstraNav-Memory: Contexts Compression for Long Memory, Botao Ren et.al., Paper: http://arxiv.org/abs/2512.21627
- 2025-09-09, Astra: A Multi-Agent System for GPU Kernel Performance Optimization, Anjiang Wei et.al., Paper: http://arxiv.org/abs/2509.07506
- 2025-09-22, Asteria: Semantic-Aware Cross-Region Caching for Agentic LLM Tool Access, Chaoyi Ruan et.al., Paper: http://arxiv.org/abs/2509.17360
- 2025-10-24, AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite, Jonathan Bragg et.al., Paper: http://arxiv.org/abs/2510.21652
- 2026-02-23, Assessing Risks of Large Language Models in Mental Health Support: A Framework for Automated Clinical AI Red Teaming, Ian Steenstra et.al., Paper: http://arxiv.org/abs/2602.19948
- 2026-02-26, Assessing Deanonymization Risks with Stylometry-Assisted LLM Agent, Boyang Zhang et.al., Paper: http://arxiv.org/abs/2602.23079
- 2025-12-31, Ask, Clarify, Optimize: Human-LLM Agent Collaboration for Smarter Inventory Control, Yaqi Duan et.al., Paper: http://arxiv.org/abs/2601.00121
- 2025-12-17, Artism: AI-Driven Dual-Engine System for Art Generation and Critique, Shuai Liu et.al., Paper: http://arxiv.org/abs/2512.15710
- 2026-02-10, Artisan: Agentic Artifact Evaluation, Doehyun Baek et.al., Paper: http://arxiv.org/abs/2602.10046
- 2025-12-09, Argus: A Multi-Agent Sensitive Information Leakage Detection Framework Based on Hierarchical Reference Relationships, Bin Wang et.al., Paper: http://arxiv.org/abs/2512.08326
- 2025-12-19, Are Vision Language Models Cross-Cultural Theory of Mind Reasoners?, Zabir Al Nazi et.al., Paper: http://arxiv.org/abs/2512.17394
- 2025-10-22, Are Large Language Models Sensitive to the Motives Behind Communication?, Addison J. Wu et.al., Paper: http://arxiv.org/abs/2510.19687
- 2025-09-04, Are LLM Agents the New RPA? A Comparative Study with RPA Across Enterprise Workflows, Petr Průcha et.al., Paper: http://arxiv.org/abs/2509.04198
- 2025-09-03, Are LLM Agents Behaviorally Coherent? Latent Profiles for Social Simulation, James Mooney et.al., Paper: http://arxiv.org/abs/2509.03736
- 2026-01-26, Are Conversational AI Agents the Way Out? Co-Designing Reader-Oriented News Experiences with Immigrants and Journalists, Yongle Zhang et.al., Paper: http://arxiv.org/abs/2601.18772
- 2025-10-27, Are Agents Just Automata? On the Formal Equivalence Between Agentic AI and the Chomsky Hierarchy, Roham Koohestani et.al., Paper: http://arxiv.org/abs/2510.23487
- 2026-03-23, Are AI-assisted Development Tools Immune to Prompt Injection?, Charoes Huang et.al., Paper: http://arxiv.org/abs/2603.21642
- 2025-12-10, Architectures for Building Agentic AI, Sławomir Nowaczyk et.al., Paper: http://arxiv.org/abs/2512.09458
- 2026-03-02, Architecture-Aware Multi-Design Generation for Repository-Level Feature Addition, Mingwei Liu et.al., Paper: http://arxiv.org/abs/2603.01814
- 2026-03-03, Architecting Trust in Artificial Epistemic Agents, Nahema Marchal et.al., Paper: http://arxiv.org/abs/2603.02960
- 2026-03-31, Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injection Attacks, Chong Xiang et.al., Paper: http://arxiv.org/abs/2603.30016
- 2025-09-10, Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations, Ron F. Del Rosario et.al., Paper: http://arxiv.org/abs/2509.08646
- 2026-01-07, Architecting Agentic Communities using Design Patterns, Zoran Milosevic et.al., Paper: http://arxiv.org/abs/2601.03624
- 2025-11-06, ArchPilot: A Proxy-Guided Multi-Agent Approach for Machine Learning Engineering, Zhuowen Yuan et.al., Paper: http://arxiv.org/abs/2511.03985
- 2026-03-18, ArchBench: Benchmarking Generative-AI for Software Architecture Tasks, Bassam Adnan et.al., Paper: http://arxiv.org/abs/2603.17833
- 2025-11-28, Arbitrary control of the temporal waveform of photons during spontaneous emission, Carl Thomas et.al., Paper: http://arxiv.org/abs/2511.23462
- 2026-01-08, Arabic Prompts with English Tools: A Benchmark, Konstantin Kubrak et.al., Paper: http://arxiv.org/abs/2601.05101
- 2025-12-23, AprielGuard, Jaykumar Kasundra et.al., Paper: http://arxiv.org/abs/2512.20293
- 2026-04-02, Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning, Rafael Pardinas et.al., Paper: http://arxiv.org/abs/2604.02007
- 2025-11-26, Approximate Bayesian Computation Made Easy: A Practical Guide to ABC-SMC for Dynamical Systems with \texttt{pymc}, Mario Castro et.al., Paper: http://arxiv.org/abs/2511.21587
- 2025-10-21, Applying voxel-based analysis to oropharyngeal cancer proton therapy patients: a correlation study on radiation-induced acute dysphagia, Qianxia Wang et.al., Paper: http://arxiv.org/abs/2510.18210
- 2025-11-17, Applying Large Language Models to Characterize Public Narratives, Elinor Poole-Dayan et.al., Paper: http://arxiv.org/abs/2511.13505
- 2026-01-16, Applying Formal Methods Tools to an Electronic Warfare Codebase (Experience report), Letitia W. Li et.al., Paper: http://arxiv.org/abs/2601.11510
- 2026-02-27, AoE: Always-on Egocentric Human Video Collection for Embodied AI, Bowen Yang et.al., Paper: http://arxiv.org/abs/2602.23893
- 2026-03-17, Anticipatory Planning for Multimodal AI Agents, Yongyuan Liang et.al., Paper: http://arxiv.org/abs/2603.16777
- 2025-11-21, Anomaly Pattern-guided Transaction Bug Testing in Relational Databases, Huicong Xu et.al., Paper: http://arxiv.org/abs/2511.17377
- 2025-11-18, Anomalous spontaneous induction of magnetic and electric fields in dense quark matter, E. J. Ferrer et.al., Paper: http://arxiv.org/abs/2511.14602
- 2026-02-24, AnimeAgent: Is the Multi-Agent via Image-to-Video models a Good Disney Storytelling Artist?, Hailong Yan et.al., Paper: http://arxiv.org/abs/2602.20664
- 2025-10-13, Analyzing and Internalizing Complex Policy Documents for LLM Agents, Jiateng Liu et.al., Paper: http://arxiv.org/abs/2510.11588
- 2026-01-08, Analyzing Message-Code Inconsistency in AI Coding Agent-Authored Pull Requests, Jingzhi Gong et.al., Paper: http://arxiv.org/abs/2601.04886
- 2026-02-10, AnalyticsGPT: An LLM Workflow for Scientometric Question Answering, Khang Ly et.al., Paper: http://arxiv.org/abs/2602.09817
- 2025-12-04, Analytical and Cross-Sectional Clinical Validity of a Smartphone-Based U-Turn Test in Multiple Sclerosis, Marta Płonka et.al., Paper: http://arxiv.org/abs/2512.04914
- 2026-03-25, AnalogAgent: Self-Improving Analog Circuit Design Automation with LLM Agents, Zhixuan Bao et.al., Paper: http://arxiv.org/abs/2603.23910
- 2026-02-10, Anagent For Enhancing Scientific Table & Figure Analysis, Xuehang Guo et.al., Paper: http://arxiv.org/abs/2602.10081
- 2025-11-05, AnaFlow: Agentic LLM-based Workflow for Reasoning-Driven Explainable and Sample-Efficient Analog Circuit Sizing, Mohsen Ahmadzadeh et.al., Paper: http://arxiv.org/abs/2511.03697
- 2025-11-14, An optical–mid-infrared color evolution tool for nova identification using WISE data, Joseph Onuegbu et.al., Paper: http://arxiv.org/abs/2511.11541
- 2025-10-15, An LLM-Powered AI Agent Framework for Holistic IoT Traffic Interpretation, Daniel Adu Worae et.al., Paper: http://arxiv.org/abs/2510.13925
- 2026-02-23, An LLM-Enabled Frequency-Aware Flow Diffusion Model for Natural-Language-Guided Power System Scenario Generation, Zhenghao Zhou et.al., Paper: http://arxiv.org/abs/2602.19522
- 2026-01-26, An LLM-Agent-Based Framework for Age of Information Optimization in Heterogeneous Random Access Networks, Fang Liu et.al., Paper: http://arxiv.org/abs/2601.18563
- 2025-09-16, An LLM Agentic Approach for Legal-Critical Software: A Case Study for Tax Prep Software, Sina Gogani-Khiabani et.al., Paper: http://arxiv.org/abs/2509.13471
- 2026-01-21, An LLM Agent-based Framework for Whaling Countermeasures, Daisuke Miyamoto et.al., Paper: http://arxiv.org/abs/2601.14606
- 2026-03-31, An Empirical Study of Multi-Agent Collaboration for Automated Research, Yang Shen et.al., Paper: http://arxiv.org/abs/2603.29632
- 2026-02-03, An Empirical Study of Collective Behaviors and Social Dynamics in Large Language Model Agents, Farnoosh Hashemi et.al., Paper: http://arxiv.org/abs/2602.03775
- 2026-02-25, An Empirical Study of Bugs in Modern LLM Agent Frameworks, Xinxue Zhu et.al., Paper: http://arxiv.org/abs/2602.21806
- 2025-12-01, An Empirical Study of Agent Developer Practices in AI Agent Frameworks, Yanlin Wang et.al., Paper: http://arxiv.org/abs/2512.01939
- 2026-03-18, An Auditable AI Agent Loop for Empirical Economics: A Case Study in Forecast Combination, Minchul Shin et.al., Paper: http://arxiv.org/abs/2603.17381
- 2025-10-19, An Agentic Framework with LLMs for Solving Complex Vehicle Routing Problems, Ni Zhang et.al., Paper: http://arxiv.org/abs/2510.16701
- 2026-01-02, An Agentic Framework for Neuro-Symbolic Programming, Aliakbar Nafar et.al., Paper: http://arxiv.org/abs/2601.00743
- 2025-12-22, An Agentic Framework for Autonomous Materials Computation, Zeyu Xia et.al., Paper: http://arxiv.org/abs/2512.19458
- 2026-03-20, An Agentic Approach to Generating XAI-Narratives, Yifan He et.al., Paper: http://arxiv.org/abs/2603.20003
- 2025-10-16, AlphaQuanter: An End-to-End Tool-Orchestrated Agentic Reinforcement Learning Framework for Stock Trading, Zheye Deng et.al., Paper: http://arxiv.org/abs/2510.14264
- 2026-02-20, Alignment in Time: Peak-Aware Orchestration for Long-Horizon Agentic Systems, Hanjing Shi et.al., Paper: http://arxiv.org/abs/2602.17910
- 2025-10-06, Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails, Siwei Han et.al., Paper: http://arxiv.org/abs/2510.04860
- 2025-11-14, Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping, Dena Mujtaba et.al., Paper: http://arxiv.org/abs/2511.11551
- 2025-11-26, Aligning LLMs Toward Multi-Turn Conversational Outcomes Using Iterative PPO, Daniel R. Jiang et.al., Paper: http://arxiv.org/abs/2511.21638
- 2025-12-30, Align While Search: Belief-Guided Exploratory Inference for World-Grounded Embodied Agents, Seohui Bae et.al., Paper: http://arxiv.org/abs/2512.24461
- 2026-02-24, Airavat: An Agentic Framework for Internet Measurement, Alagappan Ramanathan et.al., Paper: http://arxiv.org/abs/2602.20924
- 2025-12-16, AgroAskAI: A Multi-Agentic AI Framework for Supporting Smallholder Farmers’ Enquiries Globally, Nadine Angela Cantonjos et.al., Paper: http://arxiv.org/abs/2512.14910
- 2026-02-17, AgriWorld:A World Tools Protocol Framework for Verifiable Agricultural Reasoning with Code-Executing LLM Agents, Zhixing Zhang et.al., Paper: http://arxiv.org/abs/2602.15325
- 2026-01-13, AgriAgent: Contract-Driven Planning and Capability-Aware Tool Orchestration in Real-World Agriculture, Bo Yang et.al., Paper: http://arxiv.org/abs/2601.08308
- 2026-01-27, Agree to Disagree: Consensus-Free Flocking under Constraints, Peter Travis Jardine et.al., Paper: http://arxiv.org/abs/2601.19119
- 2026-02-24, Agile V: A Compliance-Ready Framework for AI-Augmented Engineering – From Concept to Audit-Ready Delivery, Christopher Koch et.al., Paper: http://arxiv.org/abs/2602.20684
- 2026-01-23, AgentsEval: Clinically Faithful Evaluation of Medical Imaging Reports via Multi-Agent Reasoning, Suzhong Fu et.al., Paper: http://arxiv.org/abs/2601.16685
- 2026-02-23, Agents of Chaos, Natalie Shapira et.al., Paper: http://arxiv.org/abs/2602.20021
- 2025-11-06, Agentmandering: A Game-Theoretic Framework for Fair Redistricting via Large Language Model Agents, Hao Li et.al., Paper: http://arxiv.org/abs/2511.04076
- 2025-11-21, Agentifying Agentic AI, Virginia Dignum et.al., Paper: http://arxiv.org/abs/2511.17332
- 2026-02-05, AgenticTagger: Structured Item Representation for Recommendation with LLM Agents, Zhouhang Xie et.al., Paper: http://arxiv.org/abs/2602.05945
- 2026-02-23, AgenticSum: An Agentic Inference-Time Framework for Faithful Clinical Text Summarization, Fahmida Liza Piya et.al., Paper: http://arxiv.org/abs/2602.20040
- 2026-01-29, AgenticSimLaw: A Juvenile Courtroom Multi-Agent Debate Simulation for Explainable High-Stakes Tabular Decision Making, Jon Chun et.al., Paper: http://arxiv.org/abs/2601.21936
- 2025-11-10, AgenticSciML: Collaborative Multi-Agent Systems for Emergent Discovery in Scientific Machine Learning, Qile Jiang et.al., Paper: http://arxiv.org/abs/2511.07262
- 2026-01-27, AgenticSCR: An Autonomous Agentic Secure Code Review for Immature Vulnerabilities Detection, Wachiraphan Charoenwet et.al., Paper: http://arxiv.org/abs/2601.19138
- 2026-03-23, AgenticRec: End-to-End Tool-Integrated Policy Optimization for Ranking-Oriented Recommender Agents, Tianyi Li et.al., Paper: http://arxiv.org/abs/2603.21613
- 2026-03-16, Agentic workflow enables the recovery of critical materials from complex feedstocks via selective precipitation, Andrew Ritchhart et.al., Paper: http://arxiv.org/abs/2603.15491
- 2025-07-30, Agentic Web: Weaving the Next Web with AI Agents, Yingxuan Yang et.al., Paper: http://arxiv.org/abs/2507.21206
- 2026-02-06, Agentic Uncertainty Reveals Agentic Overconfidence, Jean Kaddour et.al., Paper: http://arxiv.org/abs/2602.06948
- 2026-01-22, Agentic Uncertainty Quantification, Jiaxin Zhang et.al., Paper: http://arxiv.org/abs/2601.15703
- 2025-09-14, Agentic UAVs: LLM-Driven Autonomy with Integrated Tool-Calling and Cognitive Reasoning, Anis Koubaa et.al., Paper: http://arxiv.org/abs/2509.13352
- 2026-04-01, Agentic Tool Use in Large Language Models, Jinchao Hu et.al., Paper: http://arxiv.org/abs/2604.00835
- 2026-02-12, Agentic Test-Time Scaling for WebAgents, Nicholas Lee et.al., Paper: http://arxiv.org/abs/2602.12276
- 2025-12-26, Agentic Structured Graph Traversal for Root Cause Analysis of Code-related Incidents in Cloud Applications, Shengkun Cui et.al., Paper: http://arxiv.org/abs/2512.22113
- 2025-09-28, Agentic Reinforcement Learning with Implicit Step Rewards, Xiaoqian Liu et.al., Paper: http://arxiv.org/abs/2509.19199
- 2025-10-20, Agentic Reinforcement Learning for Search is Unsafe, Yushi Yang et.al., Paper: http://arxiv.org/abs/2510.17431
- 2025-11-06, Agentic Refactoring: An Empirical Study of AI Coding Agents, Kosei Horikawa et.al., Paper: http://arxiv.org/abs/2511.04824
- 2026-01-07, Agentic Proof Automation: A Case Study, Yichen Xu et.al., Paper: http://arxiv.org/abs/2601.03768
- 2025-11-21, Agentic Program Verification, Haoxin Tu et.al., Paper: http://arxiv.org/abs/2511.17330
- 2025-12-01, Agentic Policy Optimization via Instruction-Policy Co-Evolution, Han Zhou et.al., Paper: http://arxiv.org/abs/2512.01945
- 2026-03-07, Agentic Planning with Reasoning for Image Styling via Offline RL, Subhojyoti Mukherjee et.al., Paper: http://arxiv.org/abs/2603.07148
- 2026-03-21, Agentic Physical-AI for Self-Aware RF Systems, Linuka Ratnayake et.al., Paper: http://arxiv.org/abs/2603.20692
- 2026-03-04, Agentic Peer-to-Peer Networks: From Content Distribution to Capability and Action Sharing, Taotao Wang et.al., Paper: http://arxiv.org/abs/2603.03753
- 2026-03-09, Agentic Neurosymbolic Collaboration for Mathematical Discovery: A Case Study in Combinatorial Design, Hai Xia et.al., Paper: http://arxiv.org/abs/2603.08322
- 2026-01-26, Agentic Much? Adoption of Coding Agents on GitHub, Romain Robbes et.al., Paper: http://arxiv.org/abs/2601.18341
- 2025-09-24, Agentic Metacognition: Designing a “Self-Aware” Low-Code Agent for Failure Prediction and Human Handoff, Jiexi Xu et.al., Paper: http://arxiv.org/abs/2509.19783
- 2026-01-05, Agentic Memory: Learning Unified Long-Term and Short-Term Memory Management for Large Language Model Agents, Yi Yu et.al., Paper: http://arxiv.org/abs/2601.01885
- 2026-01-06, Agentic Memory Enhanced Recursive Reasoning for Root Cause Localization in Microservices, Lingzhe Zhang et.al., Paper: http://arxiv.org/abs/2601.02732
- 2025-09-16, Agentic Lybic: Multi-Agent Execution System with Tiered Reasoning and Orchestration, Liangxuan Guo et.al., Paper: http://arxiv.org/abs/2509.11067
- 2025-11-26, Agentic Learner with Grow-and-Refine Multimodal Semantic Memory, Weihao Bo et.al., Paper: http://arxiv.org/abs/2511.21678
- 2026-01-09, Agentic LLMs as Powerful Deanonymizers: Re-identification of Participants in the Anthropic Interviewer Dataset, Tianshi Li et.al., Paper: http://arxiv.org/abs/2601.05918
- 2026-03-06, Agentic LLM Planning via Step-Wise PDDL Simulation: An Empirical Characterisation, Kai Göbel et.al., Paper: http://arxiv.org/abs/2603.06064
- 2025-09-16, Agentic JWT: A Secure Delegation Protocol for Autonomous AI Agents, Abhishek Goswami et.al., Paper: http://arxiv.org/abs/2509.13597
- 2025-10-19, Agentic Inequality, Matthew Sharp et.al., Paper: http://arxiv.org/abs/2510.16853
- 2026-03-20, Agentic Harness for Real-World Compilers, Yingwei Zheng et.al., Paper: http://arxiv.org/abs/2603.20075
- 2026-01-28, Agentic Fog: A Policy-driven Framework for Distributed Intelligence in Fog Computing, Saeed Akbar et.al., Paper: http://arxiv.org/abs/2601.20764
- 2025-12-24, Agentic Explainable Artificial Intelligence (Agentic XAI) Approach To Explore Better Explanation, Tomoaki Yamaguchi et.al., Paper: http://arxiv.org/abs/2512.21066
- 2025-10-16, Agentic Entropy-Balanced Policy Optimization, Guanting Dong et.al., Paper: http://arxiv.org/abs/2510.14545
- 2026-01-12, Agentic Diagnostic Reasoning over Telecom and Datacenter Infrastructure, Nicolas Tacheny et.al., Paper: http://arxiv.org/abs/2601.07342
- 2025-10-19, Agentic Design of Compositional Machines, Wenqian Zhang et.al., Paper: http://arxiv.org/abs/2510.14980
- 2026-01-27, Agentic Design Patterns: A System-Theoretic Framework, Minh-Dung Dao et.al., Paper: http://arxiv.org/abs/2601.19752
- 2026-03-09, Agentic Critical Training, Weize Liu et.al., Paper: http://arxiv.org/abs/2603.08706
- 2026-01-22, Agentic Confidence Calibration, Jiaxin Zhang et.al., Paper: http://arxiv.org/abs/2601.15778
- 2026-03-18, Agentic Cognitive Profiling: Realigning Automated Alzheimer’s Disease Detection with Clinical Construct Validity, Jiawen Kang et.al., Paper: http://arxiv.org/abs/2603.17392
- 2026-03-02, Agentic Code Reasoning, Shubham Ugare et.al., Paper: http://arxiv.org/abs/2603.01896
- 2026-03-19, Agentic Business Process Management: A Research Manifesto, Diego Calvanese et.al., Paper: http://arxiv.org/abs/2603.18916
- 2026-02-27, Agentic AI-RAN: Enabling Intent-Driven, Explainable and Self-Evolving Open RAN Intelligence, Zhizhou He et.al., Paper: http://arxiv.org/abs/2602.24115
- 2026-02-18, Agentic AI, Medical Morality, and the Transformation of the Patient-Physician Relationship, Robert Ranisch et.al., Paper: http://arxiv.org/abs/2602.16553
- 2026-01-05, Agentic AI in Remote Sensing: Foundations, Taxonomy, and Emerging Systems, Niloufar Alipour Talemi et.al., Paper: http://arxiv.org/abs/2601.01891
- 2025-10-17, Agentic AI for Ultra-Modern Networks: Multi-Agent Framework for RAN Autonomy and Assurance, Sukhdeep Singh et.al., Paper: http://arxiv.org/abs/2510.16144
- 2025-09-16, Agentic AI for Financial Crime Compliance, Henrik Axelsen et.al., Paper: http://arxiv.org/abs/2509.13137
- 2026-02-12, Agentic AI for Cybersecurity: A Meta-Cognitive Architecture for Governable Autonomy, Andrei Kojukhov et.al., Paper: http://arxiv.org/abs/2602.11897
- 2025-11-28, Agentic AI Framework for Smart Inventory Replenishment, Toqeer Ali Syed et.al., Paper: http://arxiv.org/abs/2511.23366
- 2026-03-05, Agentic AI – Physicist Collaboration in Experimental Particle Physics: A Proof-of-Concept Measurement with LEP Open Data, Anthony Badea et.al., Paper: http://arxiv.org/abs/2603.05735
- 2026-04-01, AgentWatcher: A Rule-based Prompt Injection Monitor, Yanting Wang et.al., Paper: http://arxiv.org/abs/2604.01194
- 2026-02-26, AgentVista: Evaluating Multimodal Agents in Ultra-Challenging Realistic Visual Scenarios, Zhaochen Su et.al., Paper: http://arxiv.org/abs/2602.23166
- 2026-03-18, AgentVLN: Towards Agentic Vision-and-Language Navigation, Zihao Xin et.al., Paper: http://arxiv.org/abs/2603.17670
- 2026-03-29, AgentSwing: Adaptive Parallel Context Management Routing for Long-Horizon Web Agents, Zhaopeng Feng et.al., Paper: http://arxiv.org/abs/2603.27490
- 2026-03-11, AgentServe: Algorithm-System Co-Design for Efficient Agentic AI Serving on a Consumer-Grade GPU, Yuning Zhang et.al., Paper: http://arxiv.org/abs/2603.10342
- 2026-03-04, AgentSelect: Benchmark for Narrative Query-to-Agent Recommendation, Yunxiao Shi et.al., Paper: http://arxiv.org/abs/2603.03761
- 2025-08-22, AgentScope 1.0: A Developer-Centric Framework for Building Agentic Applications, Dawei Gao et.al., Paper: http://arxiv.org/abs/2508.16279
- 2025-11-10, AgentSUMO: An Agentic Framework for Interactive Simulation Scenario Generation in SUMO via Large Language Models, Minwoo Jeong et.al., Paper: http://arxiv.org/abs/2511.06804
- 2026-01-22, AgentSM: Semantic Memory for Agentic Text-to-SQL, Asim Biswal et.al., Paper: http://arxiv.org/abs/2601.15709
- 2026-03-05, AgentSCOPE: Evaluating Contextual Privacy Across Agentic Workflows, Ivoline C. Ngong et.al., Paper: http://arxiv.org/abs/2603.04902
- 2026-02-02, AgentRx: Diagnosing AI Agent Failures from Execution Trajectories, Shraddha Barke et.al., Paper: http://arxiv.org/abs/2602.02475
- 2026-03-13, AgentRM: An OS-Inspired Resource Manager for LLM Agent Systems, Jianshu She et.al., Paper: http://arxiv.org/abs/2603.13110
- 2025-10-05, AgentRL: Scaling Agentic Reinforcement Learning with a Multi-Turn, Multi-Task Framework, Hanchen Zhang et.al., Paper: http://arxiv.org/abs/2510.04206
- 2026-03-25, AgentRFC: Security Design Principles and Conformance Testing for Agent Protocols, Shenghan Zheng et.al., Paper: http://arxiv.org/abs/2603.23801
- 2026-01-28, AgentLongBench: A Controllable Long Benchmark For Long-Contexts Agents via Environment Rollouts, Shicheng Fang et.al., Paper: http://arxiv.org/abs/2601.20730
- 2026-03-04, AgentIR: Reasoning-Aware Retrival for Deep Research Agents, Zijian Chen et.al., Paper: http://arxiv.org/abs/2603.04384
- 2026-01-28, AgentIF-OneDay: A Task-level Instruction-Following Benchmark for General AI Agents in Daily Scenarios, Kaiyuan Chen et.al., Paper: http://arxiv.org/abs/2601.20613
- 2025-12-15, AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection, Junwen Miao et.al., Paper: http://arxiv.org/abs/2512.13671
- 2025-09-10, AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning, Zhiheng Xi et.al., Paper: http://arxiv.org/abs/2509.08755
- 2026-01-15, AgentGuardian: Learning Access Control Policies to Govern AI Agent Behavior, Nadya Abaev et.al., Paper: http://arxiv.org/abs/2601.10440
- 2025-09-28, AgentGuard: Runtime Verification of AI Agents, Roham Koohestani et.al., Paper: http://arxiv.org/abs/2509.23864
- 2025-02-17, AgentGuard: Repurposing Agentic Orchestrator for Safety Evaluation of Tool Orchestration, Jizhou Chen et.al., Paper: http://arxiv.org/abs/2502.09809
- 2025-10-28, AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis, Xuanzhong Chen et.al., Paper: http://arxiv.org/abs/2510.24695
- 2025-10-28, AgentFold: Long-Horizon Web Agents with Proactive Context Management, Rui Ye et.al., Paper: http://arxiv.org/abs/2510.24699
- 2026-03-24, AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection, Yangxin Yu et.al., Paper: http://arxiv.org/abs/2603.23115
- 2025-11-07, AgentExpt: Automating AI Experiment Design with LLM-based Resource Retrieval Agent, Yu Li et.al., Paper: http://arxiv.org/abs/2511.04921
- 2025-11-13, AgentEvolver: Towards Efficient Self-Evolving Agent System, Yunpeng Zhai et.al., Paper: http://arxiv.org/abs/2511.10395
- 2026-01-23, AgentDrive: An Open Benchmark Dataset for Agentic AI Reasoning with LLM-Generated Scenarios in Autonomous Systems, Mohamed Amine Ferrag et.al., Paper: http://arxiv.org/abs/2601.16964
- 2026-03-13, AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Agents, Zekun Wu et.al., Paper: http://arxiv.org/abs/2603.12564
- 2026-01-26, AgentDoG: A Diagnostic Guardrail Framework for AI Agent Safety and Security, Dongrui Liu et.al., Paper: http://arxiv.org/abs/2601.18491
- 2026-03-19, AgentDS Technical Report: Benchmarking the Future of Human-AI Collaboration in Domain-Specific Data Science, An Luo et.al., Paper: http://arxiv.org/abs/2603.19005
- 2025-10-07, AgentDR Dynamic Recommendation with Implicit Item-Item Relations via LLM-based Agents, Mingdai Yang et.al., Paper: http://arxiv.org/abs/2510.05598
- 2025-10-03, AgentDAM: Privacy Leakage Evaluation for Autonomous Web Agents, Arman Zharmagambetov et.al., Paper: http://arxiv.org/abs/2503.09780
- 2025-12-08, AgentCrypt: Advancing Privacy and (Secure) Computation in AI Agent Collaboration, Harish Karthikeyan et.al., Paper: http://arxiv.org/abs/2512.08104
- 2026-02-19, AgentConductor: Topology Evolution for Multi-Agent Competition-Level Code Generation, Siyu Wang et.al., Paper: http://arxiv.org/abs/2602.17100
- 2025-12-09, AgentComp: From Agentic Reasoning to Compositional Mastery in Text-to-Image Models, Arman Zarei et.al., Paper: http://arxiv.org/abs/2512.09081
- 2025-08-27, AgentCoMa: A Compositional Benchmark Mixing Commonsense and Mathematical Reasoning in Real-World Scenarios, Lisa Alazraki et.al., Paper: http://arxiv.org/abs/2508.19988
- 2025-10-20, AgentChangeBench: A Multi-Dimensional Evaluation Framework for Goal-Shift Robustness in Conversational AI, Manik Rana et.al., Paper: http://arxiv.org/abs/2510.18170
- 2025-10-02, AgentCaster: Reasoning-Guided Tornado Forecasting, Michael Chen et.al., Paper: http://arxiv.org/abs/2510.03349
- 2025-12-12, AgentBalance: Backbone-then-Topology Design for Cost-Effective Multi-Agent Systems under Budget Constraints, Shuowei Cai et.al., Paper: http://arxiv.org/abs/2512.11426
- 2026-02-03, AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent, Yinyi Luo et.al., Paper: http://arxiv.org/abs/2602.03955
- 2025-10-23, AgentArcEval: An Architecture Evaluation Method for Foundation Model based Agents, Qinghua Lu et.al., Paper: http://arxiv.org/abs/2510.21031
- 2026-02-05, Agent2Agent Threats in Safety-Critical LLM Assistants: A Human-Centric Taxonomy, Lukas Stappen et.al., Paper: http://arxiv.org/abs/2602.05877
- 2026-01-08, Agent-as-a-Judge, Runyang You et.al., Paper: http://arxiv.org/abs/2601.05111
- 2025-08-24, Agent-Testing Agent: A Meta-Agent for Automated Testing and Evaluation of Conversational AI Agents, Sameer Komoravolu et.al., Paper: http://arxiv.org/abs/2508.17393
- 2026-02-09, Agent-Supported Foresight for AI Systemic Risks: AI Agents for Breadth, Experts for Judgment, Leon Fröhling et.al., Paper: http://arxiv.org/abs/2602.08565
- 2026-03-24, Agent-Sentry: Bounding LLM Agents via Execution Provenance, Rohan Sequeira et.al., Paper: http://arxiv.org/abs/2603.22868
- 2026-02-04, Agent-Omit: Training Efficient LLM Agents for Adaptive Thought and Observation Omission via Agentic Reinforcement Learning, Yansong Ning et.al., Paper: http://arxiv.org/abs/2602.04284
- 2026-01-07, Agent-Dice: Disentangling Knowledge Updates via Geometric Consensus for Agent Continual Learning, Zheng Wu et.al., Paper: http://arxiv.org/abs/2601.03641
- 2025-10-14, Agent-Based Simulation of a Financial Market with Large Language Models, Ryuji Hashimoto et.al., Paper: http://arxiv.org/abs/2510.12189
- 2026-02-10, Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning, Zhaoyang Wang et.al., Paper: http://arxiv.org/abs/2602.10090
- 2025-12-18, Agent Tools Orchestration Leaks More: Dataset, Benchmark, and Mitigation, Yuxuan Qiao et.al., Paper: http://arxiv.org/abs/2512.16310
- 2026-01-15, Agent Skills in the Wild: An Empirical Study of Security Vulnerabilities at Scale, Yi Liu et.al., Paper: http://arxiv.org/abs/2601.10338
- 2026-02-12, Agent Skills for Large Language Models: Architecture, Acquisition, Security, and the Path Forward, Renjun Xu et.al., Paper: http://arxiv.org/abs/2602.12430
- 2025-10-30, Agent Skills Enable a New Class of Realistic and Trivially Simple Prompt Injections, David Schmotz et.al., Paper: http://arxiv.org/abs/2510.26328
- 2025-08-04, Agent Network Protocol Technical White Paper, Gaowei Chang et.al., Paper: http://arxiv.org/abs/2508.00007
- 2026-03-16, Agent Lifecycle Toolkit (ALTK): Reusable Middleware Components for Robust AI Agents, Zidane Wright et.al., Paper: http://arxiv.org/abs/2603.15473
- 2026-03-26, Agent Factories for High Level Synthesis: How Far Can General-Purpose Coding Agents Go in Hardware Optimization?, Abhishek Bhandwaldar et.al., Paper: http://arxiv.org/abs/2603.25719
- 2025-10-28, Agent Data Protocol: Unifying Datasets for Diverse, Effective Fine-tuning of LLM Agents, Yueqi Song et.al., Paper: http://arxiv.org/abs/2510.24702
- 2026-03-19, Agent Control Protocol: Admission Control for Agent Actions, Marcelo Fernandez et.al., Paper: http://arxiv.org/abs/2603.18829
- 2026-01-28, Agent Benchmarks Fail Public Sector Requirements, Jonathan Rystrøm et.al., Paper: http://arxiv.org/abs/2601.20617
- 2026-03-24, Agent Audit: A Security Analysis System for LLM Agent Applications, Haiyue Zhang et.al., Paper: http://arxiv.org/abs/2603.22853
- 2026-01-16, AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts, Keyu Li et.al., Paper: http://arxiv.org/abs/2601.11044
- 2025-09-04, AgenTracer: Who Is Inducing Failure in the LLM Agentic Systems?, Guibin Zhang et.al., Paper: http://arxiv.org/abs/2509.03312
- 2025-12-03, Affordances of Digital and Blockchain-based Community Currencies: The Case of Sarafu Network in Kenya, Patricia Marcella Evite et.al., Paper: http://arxiv.org/abs/2512.04030
- 2025-10-28, Affordance Representation and Recognition for Autonomous Agents, Habtom Kahsay Gidey et.al., Paper: http://arxiv.org/abs/2510.24459
- 2026-04-02, AeroTherm-GPT: A Verification-Centered LLM Framework for Thermal Protection System Engineering Workflows, Chuhan Qiao et.al., Paper: http://arxiv.org/abs/2604.01738
- 2025-12-26, Aerial World Model for Long-horizon Visual Generation and Navigation in 3D Space, Weichen Zhang et.al., Paper: http://arxiv.org/abs/2512.21887
- 2026-03-05, AegisUI: Behavioral Anomaly Detection for Structured User Interface Protocols in AI Agent Systems, Mohd Safwan Uddin et.al., Paper: http://arxiv.org/abs/2603.05031
- 2025-10-22, AegisMCP: Online Graph Intrusion Detection for Tool-Augmented LLMs on Edge Devices, Zhonghao Zhan et.al., Paper: http://arxiv.org/abs/2510.19462
- 2025-12-24, AegisAgent: An Autonomous Defense Agent Against Prompt Injection Attacks in LLM-HARs, Yihan Wang et.al., Paper: http://arxiv.org/abs/2512.20986
- 2025-08-27, Aegis: Taxonomy and Optimizations for Overcoming Agent-Environment Failures in LLM Agents, Kevin Song et.al., Paper: http://arxiv.org/abs/2508.19504
- 2025-10-06, Adversarial Reinforcement Learning for Large Language Model Agent Safety, Zizhao Wang et.al., Paper: http://arxiv.org/abs/2510.05442
- 2025-10-04, Adversarial Agent Collaboration for C to Rust Translation, Tianyu Li et.al., Paper: http://arxiv.org/abs/2510.03879
- 2026-03-16, Advancing Multimodal Agent Reasoning with Long-Term Neuro-Symbolic Memory, Rongjie Jiang et.al., Paper: http://arxiv.org/abs/2603.15280
- 2026-01-26, Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge, Li Kang et.al., Paper: http://arxiv.org/abs/2601.18733
- 2026-01-23, Adoption of Generative Artificial Intelligence in the German Software Engineering Industry: An Empirical Study, Ludwig Felder et.al., Paper: http://arxiv.org/abs/2601.16700
- 2025-12-03, Adhera: A Human-Centered Health Informatics Solution for Reducing Informal Caregiver Burden through Improved Medication Adherence, Zhiyin Zhou et.al., Paper: http://arxiv.org/abs/2512.03878
- 2025-10-15, Addressing the alignment problem in transportation policy making: an LLM approach, Xiaoyu Yan et.al., Paper: http://arxiv.org/abs/2510.13139
- 2025-12-08, Adaptive Tuning of Parameterized Traffic Controllers via Multi-Agent Reinforcement Learning, Giray Önür et.al., Paper: http://arxiv.org/abs/2512.07417
- 2026-03-23, Adaptive Robust Estimator for Multi-Agent Reinforcement Learning, Zhongyi Li et.al., Paper: http://arxiv.org/abs/2603.21574
- 2026-01-04, Adaptive Hierarchical Evaluation of LLMs and SAST tools for CWE Prediction in Python, Muntasir Adnan et.al., Paper: http://arxiv.org/abs/2601.01320
- 2025-10-30, Adaptive Data Flywheel: Applying MAPE Control Loops to AI Agent Improvement, Aaditya Shukla et.al., Paper: http://arxiv.org/abs/2510.27051
- 2025-10-21, Adaptive Coopetition: Leveraging Coarse Verifier Signals for Resilient Multi-Agent LLM Reasoning, Rui Jerry Huang et.al., Paper: http://arxiv.org/abs/2510.18179
- 2025-10-10, Adaptive Attacks on Trusted Monitors Subvert AI Control Protocols, Mikhail Terekhov et.al., Paper: http://arxiv.org/abs/2510.09462
- 2025-10-27, Adapting Interleaved Encoders with PPO for Language-Guided Reinforcement Learning in BabyAI, Aryan Mathur et.al., Paper: http://arxiv.org/abs/2510.23148
- 2025-12-29, AdaptiFlow: An Extensible Framework for Event-Driven Autonomy in Cloud Microservices, Brice Arléon Zemtsop Ndadji et.al., Paper: http://arxiv.org/abs/2512.23499
- 2025-12-08, Adaptation of Embedding Models to Financial Filings via LLM Distillation, Eliot Brenner et.al., Paper: http://arxiv.org/abs/2512.08088
- 2026-02-12, AdaptEvolve: Improving Efficiency of Evolutionary AI Agents through Adaptive Model Selection, Pretam Ray et.al., Paper: http://arxiv.org/abs/2602.11931
- 2026-02-24, AdapTools: Adaptive Tool-based Indirect Prompt Injection Attacks on Agentic LLMs, Che Wang et.al., Paper: http://arxiv.org/abs/2602.20720
- 2025-12-18, AdaTooler-V: Adaptive Tool-Use for Images and Videos, Chaoyang Wang et.al., Paper: http://arxiv.org/abs/2512.16918
- 2026-01-26, AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning, Mingyang Song et.al., Paper: http://arxiv.org/abs/2601.18631
- 2026-03-17, AdaMem: Adaptive User-Centric Memory for Long-Horizon Dialogue Agents, Shannan Yan et.al., Paper: http://arxiv.org/abs/2603.16496
- 2026-02-23, Ada-RS: Adaptive Rejection Sampling for Selective Thinking, Yirou Ge et.al., Paper: http://arxiv.org/abs/2602.19519
- 2026-03-21, Active Inference for Physical AI Agents – An Engineering Perspective, Bert de Vries et.al., Paper: http://arxiv.org/abs/2603.20927
- 2026-02-03, Active Epistemic Control for Query-Efficient Verified Planning, Shuhui Qu et.al., Paper: http://arxiv.org/abs/2602.03974
- 2026-01-12, Active Context Compression: Autonomous Memory Management in LLM Agents, Nikhil Verma et.al., Paper: http://arxiv.org/abs/2601.07190
- 2026-02-04, Active Asymmetric Multi-Agent Multimodal Learning under Uncertainty, Rui Liu et.al., Paper: http://arxiv.org/abs/2602.04763
- 2026-03-19, Act While Thinking: Accelerating LLM Agents via Pattern-Aware Speculative Tool Execution, Yifan Sui et.al., Paper: http://arxiv.org/abs/2603.18897
- 2025-12-11, Achieving Olympia-Level Geometry Large Language Model Agent via Complexity Boosting Reinforcement Learning, Haiteng Zhao et.al., Paper: http://arxiv.org/abs/2512.10534
- 2026-01-06, Accurate Table Question Answering with Accessible LLMs, Yangfan Jiang et.al., Paper: http://arxiv.org/abs/2601.03137
- 2025-11-17, Accuracy is Not Enough: Poisoning Interpretability in Federated Learning via Color Skew, Farhin Farhad Riya et.al., Paper: http://arxiv.org/abs/2511.13535
- 2025-12-25, Accelerating Scientific Discovery with Autonomous Goal-evolving Agents, Yuanqi Du et.al., Paper: http://arxiv.org/abs/2512.21782
- 2025-12-28, Accelerating Language Model Workflows with Prompt Choreography, TJ Bai et.al., Paper: http://arxiv.org/abs/2512.23049
- 2026-02-26, Accelerated Online Risk-Averse Policy Evaluation in POMDPs with Theoretical Guarantees and Novel CVaR Bounds, Yaacov Pariente et.al., Paper: http://arxiv.org/abs/2602.23073
- 2026-03-10, Abundant Intelligence and Deficient Demand: A Macro-Financial Stress Test of Rapid AI Adoption, Xupeng Chen et.al., Paper: http://arxiv.org/abs/2603.09209
- 2025-12-22, AWPO: Enhancing Tool-Use of Large Language Models through Explicit Integration of Reasoning Rewards, Zihan Lin et.al., Paper: http://arxiv.org/abs/2512.19126
- 2026-03-25, AVO: Agentic Variation Operators for Autonomous Evolutionary Search, Terry Chen et.al., Paper: http://arxiv.org/abs/2603.24517
- 2025-11-19, AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning, Urjitkumar Patel et.al., Paper: http://arxiv.org/abs/2511.15578
- 2026-04-02, AURA: Multimodal Shared Autonomy for Real-World Urban Navigation, Yukai Ma et.al., Paper: http://arxiv.org/abs/2604.01659
- 2025-10-17, AURA: An Agent Autonomy Risk Assessment Framework, Lorenzo Satta Chiris et.al., Paper: http://arxiv.org/abs/2510.15739
- 2026-03-31, ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation, Yinuo Liu et.al., Paper: http://arxiv.org/abs/2603.29902
- 2025-11-21, ATMPlace: Analytical Thermo-Mechanical-Aware Placement Framework for 2.5D-IC, Qipan Wang et.al., Paper: http://arxiv.org/abs/2511.17319
- 2025-10-26, ATLAS: Actor-Critic Task-Completion with Look-ahead Action Simulation, Jiali Cheng et.al., Paper: http://arxiv.org/abs/2510.22732
- 2025-10-18, ATA: A Neuro-Symbolic Approach to Implement Autonomous and Trustworthy Agents, David Peer et.al., Paper: http://arxiv.org/abs/2510.16381
- 2026-01-08, AT $^2$ PO: Agentic Turn-based Policy Optimization via Tree Search, Zefang Zong et.al., Paper: http://arxiv.org/abs/2601.04767
- 2025-11-28, ASTRO: Adaptive Stitching via Dynamics-Guided Trajectory Rollouts, Hang Yu et.al., Paper: http://arxiv.org/abs/2511.23442
- 2025-12-04, ASTRIDE: A Security Threat Modeling Platform for Agentic-AI Applications, Eranga Bandara et.al., Paper: http://arxiv.org/abs/2512.04785
- 2025-10-11, ASTREA: Introducing Agentic Intelligence for Orbital Thermal Autonomy, Alejandro D. Mousist et.al., Paper: http://arxiv.org/abs/2509.13380
- 2026-03-31, ASI-Evolve: AI Accelerates AI, Weixian Xu et.al., Paper: http://arxiv.org/abs/2603.29640
- 2026-01-26, ARMOR: Agentic Reasoning for Methods Orchestration and Reparameterization for Robust Adversarial Attacks, Gabriel Lee Jun Rong et.al., Paper: http://arxiv.org/abs/2601.18386
- 2026-01-12, ARM: Role-Conditioned Neuron Transplantation for Training-Free Generalist LLM Agent Merging, Zhuoka Feng et.al., Paper: http://arxiv.org/abs/2601.07309
- 2025-12-04, ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning, Shengyuan Ding et.al., Paper: http://arxiv.org/abs/2512.05111
- 2026-03-13, ARL-Tangram: Unleash the Resource Efficiency in Agentic Reinforcement Learning, Bangjun Xiao et.al., Paper: http://arxiv.org/abs/2603.13019
- 2025-09-22, ARK-V1: An LLM-Agent for Knowledge Graph Question Answering Requiring Commonsense Reasoning, Jan-Felix Klein et.al., Paper: http://arxiv.org/abs/2509.18063
- 2026-03-17, ARISE: Agent Reasoning with Intrinsic Skill Evolution in Hierarchical Reinforcement Learning, Yu Li et.al., Paper: http://arxiv.org/abs/2603.16060
- 2026-01-05, ARIES: A Scalable Multi-Agent Orchestration Framework for Real-Time Epidemiological Surveillance and Outbreak Monitoring, Aniket Wattamwar et.al., Paper: http://arxiv.org/abs/2601.01831
- 2026-04-02, APEX: Agent Payment Execution with Policy for Autonomous Agent API Access, Mohd Safwan Uddin et.al., Paper: http://arxiv.org/abs/2604.02023
- 2026-01-08, APEX: Academic Poster Editing Agentic Expert, Chengxin Shi et.al., Paper: http://arxiv.org/abs/2601.04794
- 2026-03-14, APEX-Searcher: Augmenting LLMs’ Search Capabilities through Agentic Planning and Execution, Kun Chen et.al., Paper: http://arxiv.org/abs/2603.13853
- 2026-01-20, APEX-Agents, Bertie Vidgen et.al., Paper: http://arxiv.org/abs/2601.14242
- 2025-12-18, AMUSE: Audio-Visual Benchmark and Alignment Framework for Agentic Multi-Speaker Understanding, Sanjoy Chowdhury et.al., Paper: http://arxiv.org/abs/2512.16250
- 2025-12-25, AMS-IO-Bench and AMS-IO-Agent: Benchmarking and Structured Reasoning for Analog and Mixed-Signal Integrated Circuit Input/Output Design, Zhishuai Zhang et.al., Paper: http://arxiv.org/abs/2512.21613
- 2026-02-09, AMEM4Rec: Leveraging Cross-User Similarity for Memory Evolution in Agentic LLM Recommenders, Minh-Duc Nguyen et.al., Paper: http://arxiv.org/abs/2602.08837
- 2025-09-26, AMANDA: Agentic Medical Knowledge Augmentation for Data-Efficient Medical Visual Question Answering, Ziqing Wang et.al., Paper: http://arxiv.org/abs/2510.02328
- 2026-01-28, AMA: Adaptive Memory via Multi-Agent Collaboration, Weiquan Huang et.al., Paper: http://arxiv.org/abs/2601.20352
- 2026-02-26, AMA-Bench: Evaluating Long-Horizon Memory for Agentic Applications, Yujie Zhao et.al., Paper: http://arxiv.org/abs/2602.22769
- 2026-01-27, ALRM: Agentic LLM for Robotic Manipulation, Vitor Gaboardi dos Santos et.al., Paper: http://arxiv.org/abs/2601.19510
- 2025-10-20, ALPINE: A Lightweight and Adaptive Privacy-Decision Agent Framework for Dynamic Edge Crowdsensing, Guanjie Cheng et.al., Paper: http://arxiv.org/abs/2510.17162
- 2025-10-03, ALMAS: an Autonomous LLM-based Multi-Agent Software Engineering Framework, Vali Tawosi et.al., Paper: http://arxiv.org/abs/2510.03463
- 2025-10-11, ALLOY: Generating Reusable Agent Workflows from User Demonstration, Jiawen Li et.al., Paper: http://arxiv.org/abs/2510.10049
- 2025-12-29, AKG kernel Agent: A Multi-Agent Framework for Cross-Platform Kernel Synthesis, Jinye Du et.al., Paper: http://arxiv.org/abs/2512.23424
- 2026-01-16, AJAR: Adaptive Jailbreak Architecture for Red-teaming, Yipu Dou et.al., Paper: http://arxiv.org/abs/2601.10971
- 2026-02-11, AIvilization v0: Toward Large-Scale Artificial Social Simulation with a Unified Agent Architecture and Adaptive Agent Profiles, Wenkai Fan et.al., Paper: http://arxiv.org/abs/2602.10429
- 2026-02-06, AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents, Alisia Lupidi et.al., Paper: http://arxiv.org/abs/2602.06855
- 2026-03-06, AIMD-L: An automated laboratory for high-throughput characterization of structural materials for extreme environments, Todd C. Hufnagel et.al., Paper: http://arxiv.org/abs/2603.06835
- 2026-03-04, AI4S-SDS: A Neuro-Symbolic Solvent Design System via Sparse MCTS and Differentiable Physics Alignment, Jiangyu Chen et.al., Paper: http://arxiv.org/abs/2603.03686
- 2025-12-29, AI4Reading: Chinese Audiobook Interpretation System Based on Multi-Agent Collaboration, Minjiang Huang et.al., Paper: http://arxiv.org/abs/2512.23300
- 2025-11-26, AI/ML Model Cards in Edge AI Cyberinfrastructure: towards Agentic AI, Beth Plale et.al., Paper: http://arxiv.org/abs/2511.21661
- 2026-03-03, AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework, Zihang Zeng et.al., Paper: http://arxiv.org/abs/2603.03233
- 2026-01-23, AI builds, We Analyze: An Empirical Study of AI-Generated Build Code Quality, Anwar Ghammam et.al., Paper: http://arxiv.org/abs/2601.16839
- 2025-11-21, AI Workers, Geopolitics, and Algorithmic Collective Action, Sydney Reis et.al., Paper: http://arxiv.org/abs/2511.17331
- 2026-03-17, AI Scientist via Synthetic Task Scaling, Ziyang Cai et.al., Paper: http://arxiv.org/abs/2603.17216
- 2026-03-13, AI Planning Framework for LLM-Based Web Agents, Orit Shahnovsky et.al., Paper: http://arxiv.org/abs/2603.12710
- 2025-12-29, AI Meets Brain: Memory Systems from Cognitive Neuroscience to Autonomous Agents, Jiafeng Liang et.al., Paper: http://arxiv.org/abs/2512.23343
- 2026-03-25, AI Fortune-Teller: Juxtaposing Shaman and AI to Reveal Human Agency in the Age of AI, Soonho Kwon et.al., Paper: http://arxiv.org/abs/2603.23811
- 2025-09-16, AI Agents with Human-Like Collaborative Tools: Adaptive Strategies for Enhanced Problem-Solving, Harper Reed et.al., Paper: http://arxiv.org/abs/2509.13547
- 2026-03-14, AI Agents in Financial Markets: Architecture, Applications, and Systemic Implications, Hui Gong et.al., Paper: http://arxiv.org/abs/2603.13942
- 2025-10-31, AI Agents in Drug Discovery, Srijit Seal et.al., Paper: http://arxiv.org/abs/2510.27130
- 2026-02-13, AI Agents for Inventory Control: Human-LLM-OR Complementarity, Jackie Baek et.al., Paper: http://arxiv.org/abs/2602.12631
- 2025-10-14, AI Agents as Universal Task Solvers, Alessandro Achille et.al., Paper: http://arxiv.org/abs/2510.12066
- 2026-03-20, AI Agents Can Already Autonomously Perform Experimental High Energy Physics, Eric A. Moreno et.al., Paper: http://arxiv.org/abs/2603.20179
- 2026-01-26, AI Agent for Reverse-Engineering Legacy Finite-Difference Code and Translating to Devito, Yinghan Hou et.al., Paper: http://arxiv.org/abs/2601.18381
- 2026-01-05, AI Agent Systems: Architectures, Applications, and Evaluation, Bin Xu et.al., Paper: http://arxiv.org/abs/2601.01743
- 2025-12-29, AGRO-SQL: Agentic Group-Relative Optimization with High-Fidelity Data Synthesis, Cehua Yang et.al., Paper: http://arxiv.org/abs/2512.23366
- 2026-03-13, AEGIS: No Tool Call Left Unchecked – A Pre-Execution Firewall and Audit Layer for AI Agents, Aojie Yuan et.al., Paper: http://arxiv.org/abs/2603.12621
- 2026-03-21, AEGIS: From Clues to Verdicts – Graph-Guided Deep Vulnerability Reasoning via Dialectics and Meta-Auditing, Sen Fang et.al., Paper: http://arxiv.org/abs/2603.20637
- 2026-03-26, AD-CARE: A Guideline-grounded, Modality-agnostic LLM Agent for Real-world Alzheimer’s Disease Diagnosis with Multi-cohort Assessment, Fairness Analysis, and Reader Study, Wenlong Hou et.al., Paper: http://arxiv.org/abs/2603.25322
- 2026-03-21, ACRFence: Preventing Semantic Rollback Attacks in Agent Checkpoint-Restore, Yusheng Zheng et.al., Paper: http://arxiv.org/abs/2603.20625
- 2025-10-01, ACON: Optimizing Context Compression for Long-horizon LLM Agents, Minki Kang et.al., Paper: http://arxiv.org/abs/2510.00615
- 2026-03-21, AC4A: Access Control for Agents, Reshabh K Sharma et.al., Paper: http://arxiv.org/abs/2603.20933
- 2026-01-16, ABC-Bench: Benchmarking Agentic Backend Coding in Real-World Development, Jie Yang et.al., Paper: http://arxiv.org/abs/2601.11077
- 2025-12-23, ABBEL: LLM Agents Acting through Belief Bottlenecks Expressed in Language, Aly Lidayan et.al., Paper: http://arxiv.org/abs/2512.20111
- 2025-12-16, A4-Agent: An Agentic Framework for Zero-Shot Affordance Reasoning, Zixin Zhang et.al., Paper: http://arxiv.org/abs/2512.14442
- 2025-12-26, A2P-Vis: an Analyzer-to-Presenter Agentic Pipeline for Visual Insights Generation and Reporting, Shuyu Gan et.al., Paper: http://arxiv.org/abs/2512.22101
- 2026-02-03, A-RAG: Scaling Agentic Retrieval-Augmented Generation via Hierarchical Retrieval Interfaces, Mingxuan Du et.al., Paper: http://arxiv.org/abs/2602.03442
- 2025-09-29, A-MemGuard: A Proactive Defense Framework for LLM-Based Agent Memory, Qianshan Wei et.al., Paper: http://arxiv.org/abs/2510.02373
- 2025-10-21, A $^2$ FM: An Adaptive Agent Foundation Model for Tool-Aware Hybrid Reasoning, Qianben Chen et.al., Paper: http://arxiv.org/abs/2510.12838
- 2025-11-19, A thermo-mechanically coupled finite deformation model for freezing-induced damage in soft materials, Ali Saeedi et.al., Paper: http://arxiv.org/abs/2511.15500
- 2025-12-04, A tangential low-rank ADI method for solving indefinite Lyapunov equations, Rudi Smith et.al., Paper: http://arxiv.org/abs/2512.04983
- 2026-03-19, A more accurate rational non-commutative algorithm for multiplying 4x4 matrices using 48 multiplications, Jean-Guillaume Dumas et.al., Paper: http://arxiv.org/abs/2603.18699
- 2024-01-30, A mechanism for discovering semantic relationships among agent communication protocols, Idoia Berges et.al., Paper: http://arxiv.org/abs/2401.16216
- 2026-04-01, A Visionary Look at Vibe Researching, Yebo Feng et.al., Paper: http://arxiv.org/abs/2604.00945
- 2025-10-22, A Tutorial on Cognitive Biases in Agentic AI-Driven 6G Autonomous Networks, Hatim Chergui et.al., Paper: http://arxiv.org/abs/2510.19973
- 2026-02-02, A Task-Level Evaluation of AI Agents in Open-Source Projects, Shojibur Rahman et.al., Paper: http://arxiv.org/abs/2602.02345
- 2026-03-29, A Systematic Taxonomy of Security Vulnerabilities in the OpenClaw AI Agent Framework, Surada Suwansathit et.al., Paper: http://arxiv.org/abs/2603.27517
- 2025-12-18, A Systematic Study of Code Obfuscation Against LLM-based Vulnerability Detection, Xiao Li et.al., Paper: http://arxiv.org/abs/2512.16538
- 2025-10-07, A Survey on Agentic Security: Applications, Threats and Defenses, Asif Shahriar et.al., Paper: http://arxiv.org/abs/2510.06445
- 2025-10-13, A Survey on Agentic Multimodal Large Language Models, Huanjin Yao et.al., Paper: http://arxiv.org/abs/2510.10991
- 2025-10-14, A Survey of Vibe Coding with Large Language Models, Yuyao Ge et.al., Paper: http://arxiv.org/abs/2510.12399
- 2025-08-28, A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers, Ming Hu et.al., Paper: http://arxiv.org/abs/2508.21148
- 2026-03-20, A Subgoal-driven Framework for Improving Long-Horizon LLM Agents, Taiyi Wang et.al., Paper: http://arxiv.org/abs/2603.19685
- 2025-12-12, A Study of Library Usage in Agent-Authored Pull Requests, Lukas Twist et.al., Paper: http://arxiv.org/abs/2512.11589
- 2025-11-25, A Single-Root, Multi-Curve, Context-Isolated, PQC-Pluggable Cryptographic Identity Primitive with Stateless Secret Rotation, Jian Sheng Wang et.al., Paper: http://arxiv.org/abs/2511.20505
- 2025-11-18, A Sequential Operator-Splitting Framework for Exploration of Nonconvex Trajectory Optimization Solution Spaces, Justin Ganiban et.al., Paper: http://arxiv.org/abs/2511.14752
- 2026-03-04, A Rubric-Supervised Critic from Sparse Real-World Outcomes, Xingyao Wang et.al., Paper: http://arxiv.org/abs/2603.03800
- 2026-03-09, A Recipe for Stable Offline Multi-agent Reinforcement Learning, Dongsu Lee et.al., Paper: http://arxiv.org/abs/2603.08399
- 2025-11-05, A Proprietary Model-Based Safety Response Framework for AI Agents, Qi Li et.al., Paper: http://arxiv.org/abs/2511.03138
- 2026-01-06, A Probabilistic Digital Twin of UK En Route Airspace for Training and Evaluating AI Agents for Air Traffic Control, Nick Pepper et.al., Paper: http://arxiv.org/abs/2601.03113
- 2025-10-30, A Pragmatic View of AI Personhood, Joel Z. Leibo et.al., Paper: http://arxiv.org/abs/2510.26396
- 2025-10-01, A Practitioner’s Guide to Multi-turn Agentic Reinforcement Learning, Ruiyi Wang et.al., Paper: http://arxiv.org/abs/2510.01132
- 2025-12-19, A Practical Solution to Systematically Monitor Inconsistencies in SBOM-based Vulnerability Scanners, Martin Rosso et.al., Paper: http://arxiv.org/abs/2512.17710
- 2025-12-09, A Practical Guide for Designing, Developing, and Deploying Production-Grade Agentic AI Workflows, Eranga Bandara et.al., Paper: http://arxiv.org/abs/2512.08769
- 2025-11-19, A Physics Informed Machine Learning Framework for Optimal Sensor Placement and Parameter Estimation, Georgios Venianakis et.al., Paper: http://arxiv.org/abs/2511.15543
- 2026-02-27, A Novel Hierarchical Multi-Agent System for Payments Using LLMs, Joon Kiat Chua et.al., Paper: http://arxiv.org/abs/2602.24068
- 2026-01-13, A New Tool to Find Lightweight (And, Xor) Implementations of Quadratic Vectorial Boolean Functions up to Dimension 9, Marie Bolzer et.al., Paper: http://arxiv.org/abs/2601.08368
- 2025-12-18, A Network Arena for Benchmarking AI Agents on Network Troubleshooting, Zhihao Wang et.al., Paper: http://arxiv.org/abs/2512.16381
- 2025-10-30, A Multi-agent Large Language Model Framework to Automatically Assess Performance of a Clinical AI Triage Tool, Adam E. Flanders et.al., Paper: http://arxiv.org/abs/2510.26498
- 2025-11-09, A Multi-Agent System for Semantic Mapping of Relational Data to Knowledge Graphs, Milena Trajanoska et.al., Paper: http://arxiv.org/abs/2511.06455
- 2026-03-14, A Multi-Agent Perception-Action Alliance for Efficient Long Video Reasoning, Yichang Xu et.al., Paper: http://arxiv.org/abs/2603.14052
- 2025-12-09, A Multi-Agent LLM Framework for Design Space Exploration in Autonomous Driving Systems, Po-An Shih et.al., Paper: http://arxiv.org/abs/2512.08476
- 2025-10-01, A Multi-Agent LLM Defense Pipeline Against Prompt Injection Attacks, S M Asif Hossain et.al., Paper: http://arxiv.org/abs/2509.14285
- 2026-03-04, A Multi-Agent Framework for Interpreting Multivariate Physiological Time Series, Davide Gabrielli et.al., Paper: http://arxiv.org/abs/2603.04142
- 2026-02-04, A Modern System Recipe for Situated Embodied Human-Robot Conversation with Real-Time Multimodal LLMs and Tool-Calling, Dong Won Lee et.al., Paper: http://arxiv.org/abs/2602.04157
- 2025-10-20, A Mimamsa Inspired Framework For Instruction Sequencing In AI Agents, Bama Srinivasan et.al., Paper: http://arxiv.org/abs/2510.17691
- 2025-11-18, A Method for Characterizing Disease Progression from Acute Kidney Injury to Chronic Kidney Disease, Yilu Fang et.al., Paper: http://arxiv.org/abs/2511.14603
- 2025-10-31, A Memory-Efficient Retrieval Architecture for RAG-Enabled Wearable Medical LLMs-Agents, Zhipeng Liao et.al., Paper: http://arxiv.org/abs/2510.27107
- 2025-10-06, A Lightweight Large Language Model-Based Multi-Agent System for 2D Frame Structural Analysis, Ziheng Geng et.al., Paper: http://arxiv.org/abs/2510.05414
- 2025-10-13, A Large-Language-Model Assisted Automated Scale Bar Detection and Extraction Framework for Scanning Electron Microscopic Images, Yuxuan Chen et.al., Paper: http://arxiv.org/abs/2510.11260
- 2026-03-06, A LINDDUN-based Privacy Threat Modeling Framework for GenAI, Qianying Liao et.al., Paper: http://arxiv.org/abs/2603.06051
- 2025-09-18, A Knowledge-driven Adaptive Collaboration of LLMs for Enhancing Medical Decision-making, Xiao Wu et.al., Paper: http://arxiv.org/abs/2509.14998
- 2025-10-24, A Knowledge-Graph Translation Layer for Mission-Aware Multi-Agent Path Planning in Spatiotemporal Dynamics, Edward Holmberg et.al., Paper: http://arxiv.org/abs/2510.21695
- 2025-12-03, A Hierarchical Tree-based approach for creating Configurable and Static Deep Research Agent (Static-DRA), Saurav Prateek et.al., Paper: http://arxiv.org/abs/2512.03887
- 2026-02-24, A Hierarchical Multi-Agent System for Autonomous Discovery in Geoscientific Data Archives, Dmitrii Pantiukhin et.al., Paper: http://arxiv.org/abs/2602.21351
- 2026-03-09, A Hierarchical Error-Corrective Graph Framework for Autonomous Agents with LLM-Based Action Generation, Cong Cao et.al., Paper: http://arxiv.org/abs/2603.08388
- 2026-03-02, A Hetero-functional Graph State Estimator for Watershed Systems: Application to the Chesapeake Bay, Megan S. Harris et.al., Paper: http://arxiv.org/abs/2603.01931
- 2026-03-10, A Guideline-Aware AI Agent for Zero-Shot Target Volume Auto-Delineation, Yoon Jo Kim et.al., Paper: http://arxiv.org/abs/2603.09448
- 2025-11-18, A General Framework for Physician Rostering Using Mixed-Integer Programming and a Web-Based Graphical User Interface, Florian Meier et.al., Paper: http://arxiv.org/abs/2511.14536
- 2026-03-19, A Framework for Formalizing LLM Agent Security, Vincent Siu et.al., Paper: http://arxiv.org/abs/2603.19469
- 2026-01-06, A Fast Semidefinite Convex Relaxation for Optimal Control Problems With Spatio-Temporal Constraints, Shiying Dong et.al., Paper: http://arxiv.org/abs/2601.03055
- 2026-02-05, A Dual-Loop Agent Framework for Automated Vulnerability Reproduction, Bin Liu et.al., Paper: http://arxiv.org/abs/2602.05721
- 2025-12-29, A Design Space for Intelligent Agents in Mixed-Initiative Visual Analytics, Tobias Stähle et.al., Paper: http://arxiv.org/abs/2512.23372
- 2026-03-11, A Control-Theoretic Foundation for Agentic Systems, Ali Eslami et.al., Paper: http://arxiv.org/abs/2603.10779
- 2026-03-06, A Contrastive Fewshot RGBD Traversability Segmentation Framework for Indoor Robotic Navigation, Qiyuan An et.al., Paper: http://arxiv.org/abs/2603.06927
- 2025-12-05, A Continuous Nonlinear Optimization Perspective on the Spin Glass Problem, Phil Duxbury et.al., Paper: http://arxiv.org/abs/2512.05852
- 2026-03-23, A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP, Xi Yang et.al., Paper: http://arxiv.org/abs/2603.22083
- 2025-08-26, A Concurrent Modular Agent: Framework for Autonomous LLM Agents, Norihiro Maruyama et.al., Paper: http://arxiv.org/abs/2508.19042
- 2026-01-23, A Cognitive Framework for Autonomous Agents: Toward Human-Inspired Design, Francesco Guidi et.al., Paper: http://arxiv.org/abs/2601.16648
- 2025-11-20, A Butterfly’s Eye Camera for Intensity Interferometry with Cherenkov Telescopes, Juan Cortina et.al., Paper: http://arxiv.org/abs/2511.16505
- 2025-10-20, A Brain Cell Type Resource Created by Large Language Models and a Multi-Agent AI System for Collaborative Community Annotation, Rongbin Li et.al., Paper: http://arxiv.org/abs/2510.17064
- 2026-03-23, A Blueprint for Self-Evolving Coding Agents in Vehicle Aerodynamic Drag Prediction, Jinhui Ren et.al., Paper: http://arxiv.org/abs/2603.21698
- 2026-02-24, A Benchmark for Deep Information Synthesis, Debjit Paul et.al., Paper: http://arxiv.org/abs/2602.21143
- 2025-12-18, A Benchmark and Agentic Framework for Omni-Modal Reasoning and Tool Use in Long Videos, Mohammed Irfan Kurpath et.al., Paper: http://arxiv.org/abs/2512.16978
- 2026-02-09, A Behavioural and Representational Evaluation of Goal-Directedness in Language Model Agents, Raghu Arghal et.al., Paper: http://arxiv.org/abs/2602.08964
- 2026-04-01, A 4D Representation for Training-Free Agentic Reasoning from Monocular Laparoscopic Video, Maximilian Fehrentz et.al., Paper: http://arxiv.org/abs/2604.00867
- 2026-03-31, 6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management, Jiao Chen et.al., Paper: http://arxiv.org/abs/2603.29656
- 2026-02-09, 6G-Bench: An Open Benchmark for Semantic Communication and Network-Level Reasoning with Foundation Models in AI-Native 6G Networks, Mohamed Amine Ferrag et.al., Paper: http://arxiv.org/abs/2602.08675
- 2025-11-04, 1 PoCo: Agentic Proof-of-Concept Exploit Generation for Smart Contracts, Vivi Andersson et.al., Paper: http://arxiv.org/abs/2511.02780
- 2026-03-04, $τ$ -Knowledge: Evaluating Conversational Agents over Unstructured Knowledge, Quan Shi et.al., Paper: http://arxiv.org/abs/2603.04370
- 2025-09-15, $ε$ -Optimal Multi-Agent Patrol using Recurrent Strategy, Deepak Mallya et.al., Paper: http://arxiv.org/abs/2509.11640
- 2025-12-22, $γ(3,4)$ `Attention’ in Cognitive Agents: Ontology-Free Knowledge Representations With Promise Theoretic Semantics, Mark Burgess et.al., Paper: http://arxiv.org/abs/2512.19084
- 2026-04-01, $\texttt{YC-Bench}$ : Benchmarking AI Agents for Long-Term Planning and Consistent Execution, Muyu He et.al., Paper: http://arxiv.org/abs/2604.01212
- 2025-10-13, $How^{2}$ : How to learn from procedural How-to questions, Gautier Dagan et.al., Paper: http://arxiv.org/abs/2510.11144
- 2026-03-02, “When to Hand Off, When to Work Together”: Expanding Human-Agent Co-Creative Collaboration through Concurrent Interaction, Kihoon Son et.al., Paper: http://arxiv.org/abs/2603.02050
- 2025-09-27, “Shall We Dig Deeper?”: Designing and Evaluating Strategies for LLM Agents to Advance Knowledge Co-Construction in Asynchronous Online Discussions, Yuanhao Zhang et.al., Paper: http://arxiv.org/abs/2509.23327
- 2026-03-28, “Elementary, My Dear Watson.” Detecting Malicious Skills via Neuro-Symbolic Reasoning across Heterogeneous Artifacts, Shenao Wang et.al., Paper: http://arxiv.org/abs/2603.27204
- 2026-02-24, “Are You Sure?”: An Empirical Study of Human Perception Vulnerability in LLM-Driven Agentic Systems, Xinfeng Li et.al., Paper: http://arxiv.org/abs/2602.21127
(<a href=#updated-on-20260403>back to top</a>)
Large Language Models
- 2025-10-22, olmOCR 2: Unit Test Rewards for Document OCR, Jake Poznanski et.al., Paper: http://arxiv.org/abs/2510.19817
- 2026-03-11, mAceReason-Math: A Dataset of High-Quality Multilingual Math Problems Ready For RLVR, Konstantin Dobler et.al., Paper: http://arxiv.org/abs/2603.10767
- 2026-01-15, iTIMO: An LLM-empowered Synthesis Dataset for Travel Itinerary Modification, Zhuoxuan Huang et.al., Paper: http://arxiv.org/abs/2601.10609
- 2025-12-26, iSHIFT: Lightweight Slow-Fast GUI Agent with Adaptive Perception, Sarthak Mehrotra et.al., Paper: http://arxiv.org/abs/2512.22009
- 2026-01-09, iReasoner: Trajectory-Aware Intrinsic Reasoning Supervision for Self-Evolving Large Multimodal Models, Meghana Sunil et.al., Paper: http://arxiv.org/abs/2601.05877
- 2026-02-09, iGRPO: Self-Feedback-Driven LLM Reasoning, Ali Hatamizadeh et.al., Paper: http://arxiv.org/abs/2602.09000
- 2026-03-19, dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models, Wenxuan Zhang et.al., Paper: http://arxiv.org/abs/2603.18806
- 2026-01-26, ctELM: Decoding and Manipulating Embeddings of Clinical Trials with Embedding Language Models, Brian Ondov et.al., Paper: http://arxiv.org/abs/2601.18796
- 2025-12-05, Zoom in, Click out: Unlocking and Evaluating the Potential of Zooming for GUI Grounding, Zhiyuan Jiang et.al., Paper: http://arxiv.org/abs/2512.05941
- 2026-03-18, ZipServ: Fast and Memory-Efficient LLM Inference with Hardware-Aware Lossless Compression, Ruibo Fan et.al., Paper: http://arxiv.org/abs/2603.17435
- 2025-10-23, Zhyper: Factorized Hypernetworks for Conditioned LLM Fine-Tuning, M. H. I. Abdalla et.al., Paper: http://arxiv.org/abs/2510.19733
- 2026-02-03, Zero-shot large vision-language model prompting for automated bone identification in paleoradiology x-ray archives, Owen Dong et.al., Paper: http://arxiv.org/abs/2602.03750
- 2026-02-20, Zero-shot Interactive Perception, Venkatesh Sripada et.al., Paper: http://arxiv.org/abs/2602.18374
- 2025-10-21, Zero-Shot Vehicle Model Recognition via Text-Based Retrieval-Augmented Generation, Wei-Chia Chang et.al., Paper: http://arxiv.org/abs/2510.18502
- 2026-01-27, Zero-Shot Stance Detection in the Wild: Dynamic Target Generation and Multi-Target Adaptation, Aohua Li et.al., Paper: http://arxiv.org/abs/2601.19802
- 2025-10-28, Zero-Shot Cross-Lingual Transfer using Prefix-Based Adaptation, Snegha A et.al., Paper: http://arxiv.org/abs/2510.24619
- 2026-03-02, Zero- and Few-Shot Named-Entity Recognition: Case Study and Dataset in the Crime Domain (CrimeNER), Miguel Lopez-Duran et.al., Paper: http://arxiv.org/abs/2603.02150
- 2026-01-27, Youtu-VL: Unleashing Visual Potential via Unified Vision-Language Supervision, Zhixiang Wei et.al., Paper: http://arxiv.org/abs/2601.19798
- 2025-12-24, Your Reasoning Benchmark May Not Test Reasoning: Revealing Perception Bottleneck in Abstract Reasoning Benchmarks, Xinhe Wang et.al., Paper: http://arxiv.org/abs/2512.21329
- 2025-11-20, You Only Forward Once: An Efficient Compositional Judging Paradigm, Tianlong Zhang et.al., Paper: http://arxiv.org/abs/2511.16600
- 2025-12-17, You Never Know a Person, You Only Know Their Defenses: Detecting Levels of Psychological Defense Mechanisms in Supportive Conversations, Hongbin Na et.al., Paper: http://arxiv.org/abs/2512.15601
- 2026-03-10, You Didn’t Have to Say It like That: Subliminal Learning from Faithful Paraphrases, Isaia Gisler et.al., Paper: http://arxiv.org/abs/2603.09517
- 2025-11-04, XR-1: Towards Versatile Vision-Language-Action Models via Learning Unified Vision-Motion Representations, Shichao Fan et.al., Paper: http://arxiv.org/abs/2511.02776
- 2025-12-19, XAgen: An Explainability Tool for Identifying and Correcting Failures in Multi-Agent Workflows, Xinru Wang et.al., Paper: http://arxiv.org/abs/2512.17896
- 2026-01-06, X-MuTeST: A Multilingual Benchmark for Explainable Hate Speech Detection and A Novel LLM-consulted Explanation Framework, Mohammad Zia Ur Rehman et.al., Paper: http://arxiv.org/abs/2601.03194
- 2024-05-31, X-Instruction: Aligning Language Model in Low-resource Languages with Self-curated Cross-lingual Instructions, Chong Li et.al., Paper: http://arxiv.org/abs/2405.19744
- 2026-03-10, X-GS: An Extensible Open Framework Unifying 3DGS Architectures with Downstream Multimodal Models, Yueen Ma et.al., Paper: http://arxiv.org/abs/2603.09632
- 2025-12-16, WorldPlay: Towards Long-Term Geometric Consistency for Real-Time Interactive World Modeling, Wenqiang Sun et.al., Paper: http://arxiv.org/abs/2512.14614
- 2026-02-02, World-Gymnast: Training Robots with Reinforcement Learning in a World Model, Ansh Kumar Sharma et.al., Paper: http://arxiv.org/abs/2602.02454
- 2026-01-29, World of Workflows: a Benchmark for Bringing World Models to Enterprise Systems, Lakshya Gupta et.al., Paper: http://arxiv.org/abs/2601.22130
- 2026-03-04, World Properties without World Models: Recovering Spatial and Temporal Structure from Co-occurrence Statistics in Static Word Embeddings, Elan Barenholtz et.al., Paper: http://arxiv.org/abs/2603.04317
- 2026-02-16, World Models for Policy Refinement in StarCraft II, Yixin Zhang et.al., Paper: http://arxiv.org/abs/2602.14857
- 2026-03-11, Word Recovery in Large Language Models Enables Character-Level Tokenization Robustness, Zhipeng Yang et.al., Paper: http://arxiv.org/abs/2603.10771
- 2025-05-28, WizardLM: Empowering large pre-trained language models to follow complex instructions, Can Xu et.al., Paper: http://arxiv.org/abs/2304.12244
- 2025-05-28, WizardCoder: Empowering Code Large Language Models with Evol-Instruct, Ziyang Luo et.al., Paper: http://arxiv.org/abs/2306.08568
- 2025-10-24, Wisdom and Delusion of LLM Ensembles for Code Generation and Repair, Fernando Vallecillos Ruiz et.al., Paper: http://arxiv.org/abs/2510.21513
- 2026-02-10, WildCat: Near-Linear Attention in Theory and Practice, Tobias Schröder et.al., Paper: http://arxiv.org/abs/2602.10056
- 2026-03-10, WikiCLIP: An Efficient Contrastive Baseline for Open-domain Visual Entity Recognition, Shan Ning et.al., Paper: http://arxiv.org/abs/2603.09921
- 2025-11-17, Why is “Chicago” Predictive of Deceptive Reviews? Using LLMs to Discover Language Phenomena from Lexical Cues, Jiaming Qu et.al., Paper: http://arxiv.org/abs/2511.13658
- 2026-02-18, Why Thinking Hurts? Diagnosing and Rectifying the Reasoning Shift in Foundation Recommender Models, Luankang Zhang et.al., Paper: http://arxiv.org/abs/2602.16587
- 2025-10-21, Why Policy Gradient Algorithms Work for Undiscounted Total-Reward MDPs, Jongmin Lee et.al., Paper: http://arxiv.org/abs/2510.18340
- 2026-02-24, Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training, Anas Barakat et.al., Paper: http://arxiv.org/abs/2602.21189
- 2026-01-26, Why Keep Your Doubts to Yourself? Trading Visual Uncertainties in Multi-Agent Bandit Systems, Jusheng Zhang et.al., Paper: http://arxiv.org/abs/2601.18735
- 2026-02-26, Why Diffusion Language Models Struggle with Truly Parallel (Non-Autoregressive) Decoding?, Pengxiang Li et.al., Paper: http://arxiv.org/abs/2602.23225
- 2026-03-19, Why Better Cross-Lingual Alignment Fails for Better Cross-Lingual Transfer: Case of Encoders, Yana Veitsman et.al., Paper: http://arxiv.org/abs/2603.18863
- 2026-03-24, Why AI-Generated Text Detection Fails: Evidence from Explainable AI Beyond Benchmark Accuracy, Shushanta Pudasaini et.al., Paper: http://arxiv.org/abs/2603.23146
- 2026-01-13, Why AI Alignment Failure Is Structural: Learned Human Interaction Structures and AGI as an Endogenous Evolutionary Shock, Didier Sornette et.al., Paper: http://arxiv.org/abs/2601.08673
- 2026-02-09, Whose Name Comes Up? Benchmarking and Intervention-Based Auditing of LLM-Based Scholar Recommendation, Lisette Espin-Noboa et.al., Paper: http://arxiv.org/abs/2602.08873
- 2026-02-18, Who can we trust? LLM-as-a-jury for Comparative Assessment, Mengjie Qian et.al., Paper: http://arxiv.org/abs/2602.16610
- 2026-03-25, Who Benefits from RAG? The Role of Exposure, Utility and Attribution Bias, Mahdi Dehghan et.al., Paper: http://arxiv.org/abs/2603.24218
- 2026-03-17, Who Benchmarks the Benchmarks? A Case Study of LLM Evaluation in Icelandic, Finnur Ágúst Ingimundarson et.al., Paper: http://arxiv.org/abs/2603.16406
- 2025-11-05, Whisper Leak: a side-channel attack on Large Language Models, Geoff McDonald et.al., Paper: http://arxiv.org/abs/2511.03675
- 2026-01-20, Which Reasoning Trajectories Teach Students to Reason Better? A Simple Metric of Informative Alignment, Yuming Yang et.al., Paper: http://arxiv.org/abs/2601.14249
- 2026-02-19, When to Trust the Cheap Check: Weak and Strong Verification for Reasoning, Shayan Kiyani et.al., Paper: http://arxiv.org/abs/2602.17633
- 2025-11-19, When to Think and When to Look: Uncertainty-Guided Lookback, Jing Bi et.al., Paper: http://arxiv.org/abs/2511.15613
- 2025-12-19, When the Gold Standard isn’t Necessarily Standard: Challenges of Evaluating the Translation of User-Generated Content, Lydia Nishimwe et.al., Paper: http://arxiv.org/abs/2512.17738
- 2025-11-06, When retrieval outperforms generation: Dense evidence retrieval for scalable fake news detection, Alamgir Munir Qazi et.al., Paper: http://arxiv.org/abs/2511.04643
- 2025-11-04, When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought, Yiyang Zhou et.al., Paper: http://arxiv.org/abs/2511.02779
- 2026-03-14, When Visual Privacy Protection Meets Multimodal Large Language Models, Xiaofei Hui et.al., Paper: http://arxiv.org/abs/2603.13978
- 2025-12-09, When Tables Leak: Attacking String Memorization in LLM-Based Tabular Data Generation, Joshua Ward et.al., Paper: http://arxiv.org/abs/2512.08875
- 2026-02-04, When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond?, Xinyu Zhou et.al., Paper: http://arxiv.org/abs/2602.04755
- 2025-11-04, When One Modality Sabotages the Others: A Diagnostic Lens on Multimodal Reasoning, Chenyu Zhang et.al., Paper: http://arxiv.org/abs/2511.02794
- 2026-03-22, When Minor Edits Matter: LLM-Driven Prompt Attack for Medical VLM Robustness in Ultrasound, Yasamin Medghalchi et.al., Paper: http://arxiv.org/abs/2603.21047
- 2025-12-08, When Large Language Models Do Not Work: Online Incivility Prediction through Graph Neural Networks, Zihan Chen et.al., Paper: http://arxiv.org/abs/2512.07684
- 2026-03-24, When Language Models Lose Their Mind: The Consequences of Brain Misalignment, Gabriele Merlin et.al., Paper: http://arxiv.org/abs/2603.23091
- 2026-02-04, When LLaVA Meets Objects: Token Composition for Vision-Language-Models, Soumya Jahagirdar et.al., Paper: http://arxiv.org/abs/2602.04864
- 2026-01-27, When Iterative RAG Beats Ideal Evidence: A Diagnostic Study in Scientific Multi-hop Question Answering, Mahdi Astaraki et.al., Paper: http://arxiv.org/abs/2601.19827
- 2026-01-07, When Helpers Become Hazards: A Benchmark for Analyzing Multimodal LLM-Powered Safety in Daily Life, Xinyue Lou et.al., Paper: http://arxiv.org/abs/2601.04043
- 2026-01-28, When Flores Bloomz Wrong: Cross-Direction Contamination in Machine Translation Evaluation, David Tan et.al., Paper: http://arxiv.org/abs/2601.20858
- 2026-03-11, When Fine-Tuning Fails and when it Generalises: Role of Data Diversity and Mixed Training in LLM-based TTS, Anupam Purwar et.al., Paper: http://arxiv.org/abs/2603.10904
- 2025-11-10, When Bias Pretends to Be Truth: How Spurious Correlations Undermine Hallucination Detection in LLMs, Shaowen Wang et.al., Paper: http://arxiv.org/abs/2511.07318
- 2026-01-21, When Agents Fail: A Comprehensive Study of Bugs in LLM Agents with Automated Labeling, Niful Islam et.al., Paper: http://arxiv.org/abs/2601.15232
- 2026-02-25, When AI Writes, Whose Voice Remains? Quantifying Cultural Marker Erasure Across World English Varieties in Large Language Models, Satyam Kumar Navneet et.al., Paper: http://arxiv.org/abs/2602.22145
- 2026-03-25, When AI Meets Early Childhood Education: Large Language Models as Assessment Teammates in Chinese Preschools, Xingming Li et.al., Paper: http://arxiv.org/abs/2603.24389
- 2026-03-04, When AI Fails, What Works? A Data-Driven Taxonomy of Real-World AI Risk Mitigation Strategies, Evgenija Popchanovska et.al., Paper: http://arxiv.org/abs/2603.04259
- 2025-11-18, When AI Democratizes Exploitation: LLM-Assisted Strategic Manipulation of Fair Division Algorithms, Priyanka Verma et.al., Paper: http://arxiv.org/abs/2511.14722
- 2026-02-19, What Language is This? Ask Your Tokenizer, Clara Meister et.al., Paper: http://arxiv.org/abs/2602.17655
- 2026-03-20, What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time, Dong Yan et.al., Paper: http://arxiv.org/abs/2603.19880
- 2026-03-02, What Exactly do Children Receive in Language Acquisition? A Case Study on CHILDES with Automated Detection of Filler-Gap Dependencies, Zhenghao Herbert Zhou et.al., Paper: http://arxiv.org/abs/2603.02082
- 2025-12-18, What Do Prosody and Text Convey? Characterizing How Meaningful Information is Distributed Across Multiple Channels, Aditya Yadavalli et.al., Paper: http://arxiv.org/abs/2512.16832
- 2025-11-07, What Are the Facts? Automated Extraction of Court-Established Facts from Criminal-Court Opinions, Klára Bendová et.al., Paper: http://arxiv.org/abs/2511.05320
- 2025-11-17, Weight-sparse transformers have interpretable circuits, Leo Gao et.al., Paper: http://arxiv.org/abs/2511.13653
- 2026-02-11, Weight Decay Improves Language Model Plasticity, Tessa Han et.al., Paper: http://arxiv.org/abs/2602.11137
- 2025-10-28, WebLeaper: Empowering Efficiency and Efficacy in WebAgent via Enabling Info-Rich Seeking, Zhengwei Tao et.al., Paper: http://arxiv.org/abs/2510.24697
- 2026-01-06, WebAnchor: Anchoring Agent Planning to Stabilize Long-Horizon Web Reasoning, Yu Xinmiao et.al., Paper: http://arxiv.org/abs/2601.03164
- 2025-12-29, Web World Models, Jichen Feng et.al., Paper: http://arxiv.org/abs/2512.23676
- 2026-02-25, WeaveTime: Stream from Earlier Frames into Emergent Memory in VideoLLMs, Yulin Zhang et.al., Paper: http://arxiv.org/abs/2602.22142
- 2025-11-05, Watermarking Large Language Models in Europe: Interpreting the AI Act in Light of Technology, Thomas Souverain et.al., Paper: http://arxiv.org/abs/2511.03641
- 2025-11-19, Walrus: A Cross-Domain Foundation Model for Continuum Dynamics, Michael McCabe et.al., Paper: http://arxiv.org/abs/2511.15684
- 2025-11-14, W2S-AlignTree: Weak-to-Strong Inference-Time Alignment for Large Language Models via Monte Carlo Tree Search, Zhenyu Ding et.al., Paper: http://arxiv.org/abs/2511.11518
- 2026-02-11, Vulnerabilities in Partial TEE-Shielded LLM Inference with Precomputed Noise, Abhishek Saini et.al., Paper: http://arxiv.org/abs/2602.11088
- 2025-12-31, Vulcan: Instance-Optimal Systems Heuristics Through LLM-Driven Search, Rohit Dwivedula et.al., Paper: http://arxiv.org/abs/2512.25065
- 2025-12-12, Visualizing token importance for black-box language models, Paulius Rauba et.al., Paper: http://arxiv.org/abs/2512.11573
- 2026-03-29, Visualization of Machine Learning Models through Their Spatial and Temporal Listeners, Siyu Wu et.al., Paper: http://arxiv.org/abs/2603.27527
- 2026-03-07, VisualScratchpad: Inference-time Visual Concepts Analysis in Vision Language Models, Hyesu Lim et.al., Paper: http://arxiv.org/abs/2603.07335
- 2025-12-10, VisualActBench: Can VLMs See and Act like a Human?, Daoan Zhang et.al., Paper: http://arxiv.org/abs/2512.09907
- 2026-03-13, Visual-ERM: Reward Modeling for Visual Equivalence, Ziyu Liu et.al., Paper: http://arxiv.org/abs/2603.13224
- 2025-12-22, Visual-Aware CoT: Achieving High-Fidelity Visual Consistency in Unified Models, Zixuan Ye et.al., Paper: http://arxiv.org/abs/2512.19686
- 2025-11-07, Visual Spatial Tuning, Rui Yang et.al., Paper: http://arxiv.org/abs/2511.05491
- 2026-03-09, Visual Self-Fulfilling Alignment: Shaping Safety-Oriented Personas via Threat-Related Images, Qishun Yang et.al., Paper: http://arxiv.org/abs/2603.08486
- 2025-12-04, Visual Reasoning Tracer: Object-Level Grounded Reasoning Benchmark, Haobo Yuan et.al., Paper: http://arxiv.org/abs/2512.05091
- 2026-02-12, Visual Reasoning Benchmark: Evaluating Multimodal LLMs on Classroom-Authentic Visual Problems from Primary Education, Mohamed Huti et.al., Paper: http://arxiv.org/abs/2602.12196
- 2026-01-27, Visual Generation Unlocks Human-Like Reasoning through Multimodal World Models, Jialong Wu et.al., Paper: http://arxiv.org/abs/2601.19834
- 2025-11-28, Visual Generation Tuning, Jiahao Guo et.al., Paper: http://arxiv.org/abs/2511.23469
- 2026-03-17, Visual Distraction Undermines Moral Reasoning in Vision-Language Models, Xinyi Yang et.al., Paper: http://arxiv.org/abs/2603.16445
- 2025-10-31, Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning, Qiusi Zhan et.al., Paper: http://arxiv.org/abs/2510.27623
- 2026-03-25, Vision-Language Models vs Human: Perceptual Image Quality Assessment, Imran Mehmood et.al., Paper: http://arxiv.org/abs/2603.24578
- 2026-01-12, Vision-Language Model for Accurate Crater Detection, Patrick Bauer et.al., Paper: http://arxiv.org/abs/2601.07795
- 2025-11-25, Vision-Language Memory for Spatial Reasoning, Zuntao Liu et.al., Paper: http://arxiv.org/abs/2511.20644
- 2026-01-08, Vision-Language Introspection: Mitigating Overconfident Hallucinations in MLLMs via Interpretable Bi-Causal Steering, Shuliang Liu et.al., Paper: http://arxiv.org/abs/2601.05159
- 2026-01-29, Vision-DeepResearch: Incentivizing DeepResearch Capability in Multimodal Large Language Models, Wenxuan Huang et.al., Paper: http://arxiv.org/abs/2601.22060
- 2025-11-18, Vision Large Language Models Are Good Noise Handlers in Engagement Analysis, Alexander Vedernikov et.al., Paper: http://arxiv.org/abs/2511.14749
- 2025-12-24, VisRes Bench: On Evaluating the Visual Reasoning Capabilities of VLMs, Brigitta Malagurski Törtei et.al., Paper: http://arxiv.org/abs/2512.21194
- 2026-02-05, VisRefiner: Learning from Visual Differences for Screenshot-to-Code Generation, Jie Deng et.al., Paper: http://arxiv.org/abs/2602.05998
- 2025-11-19, VisPlay: Self-Evolving Vision-Language Models from Images, Yicheng He et.al., Paper: http://arxiv.org/abs/2511.15661
- 2026-01-23, VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents, Zirui Wang et.al., Paper: http://arxiv.org/abs/2601.16973
- 2026-04-01, VisG AV-HuBERT: Viseme-Guided AV-HuBERT, Aristeidis Papadopoulos et.al., Paper: http://arxiv.org/abs/2604.00982
- 2026-02-17, VideoSketcher: Video Models Prior Enable Versatile Sequential Sketch Generation, Hui Ren et.al., Paper: http://arxiv.org/abs/2602.15819
- 2026-03-23, VideoDetective: Clue Hunting via both Extrinsic Query and Intrinsic Relevance for Long Video Understanding, Ruoliu Yang et.al., Paper: http://arxiv.org/abs/2603.22285
- 2026-01-08, VideoAuto-R1: Video Auto Reasoning via Thinking Once, Answering Twice, Shuming Liu et.al., Paper: http://arxiv.org/abs/2601.05175
- 2026-03-18, VideoAtlas: Navigating Long-Form Video in Logarithmic Compute, Mohamed Eltahir et.al., Paper: http://arxiv.org/abs/2603.17948
- 2026-01-30, Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning, Xiangyu Zeng et.al., Paper: http://arxiv.org/abs/2601.23224
- 2025-11-20, Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO, Junhao Cheng et.al., Paper: http://arxiv.org/abs/2511.16669
- 2025-11-28, Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models, Muhammad Maaz et.al., Paper: http://arxiv.org/abs/2511.23478
- 2026-03-25, Video-Only ToM: Enhancing Theory of Mind in Multimodal Large Language Models, Siqi Liu et.al., Paper: http://arxiv.org/abs/2603.24484
- 2025-11-28, Video-CoM: Interactive Video Reasoning via Chain of Manipulations, Hanoona Rasheed et.al., Paper: http://arxiv.org/abs/2511.23477
- 2026-03-12, Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously, Yiran Guan et.al., Paper: http://arxiv.org/abs/2603.12262
- 2025-10-23, Video Prediction of Dynamic Physical Simulations With Pixel-Space Spatiotemporal Transformers, Dean L Slack et.al., Paper: http://arxiv.org/abs/2510.20807
- 2026-01-12, Video Evidence to Reasoning Efficient Video Understanding via Explicit Evidence Grounding, Yanxiang Huang et.al., Paper: http://arxiv.org/abs/2601.07761
- 2025-11-04, VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models, Zhicheng Zhang et.al., Paper: http://arxiv.org/abs/2511.02712
- 2026-02-20, Vichara: Appellate Judgment Prediction and Explanation for the Indian Judicial System, Pavithra PM Nair et.al., Paper: http://arxiv.org/abs/2602.18346
- 2025-12-31, Vibe Coding, Interface Flattening, Hongrui Jin et.al., Paper: http://arxiv.org/abs/2512.24939
- 2026-03-25, Vibe Coding XR: Accelerating AI + XR Prototyping with XR Blocks and Gemini, Ruofei Du et.al., Paper: http://arxiv.org/abs/2603.24591
- 2026-03-17, Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences, Quan Cheng et.al., Paper: http://arxiv.org/abs/2603.16417
- 2026-03-16, ViX-Ray: A Vietnamese Chest X-Ray Dataset for Vision-Language Models, Duy Vu Minh Nguyen et.al., Paper: http://arxiv.org/abs/2603.15513
- 2026-02-17, ViTaB-A: Evaluating Multimodal Large Language Models on Visual Table Attribution, Yahia Alqurnawi et.al., Paper: http://arxiv.org/abs/2602.15769
- 2026-02-25, ViSTAR: Virtual Skill Training with Augmented Reality with 3D Avatars and LLM coaching agent, Chunggi Lee et.al., Paper: http://arxiv.org/abs/2602.22077
- 2025-12-02, ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation, Mengchen Zhang et.al., Paper: http://arxiv.org/abs/2512.03036
- 2025-12-16, VersatileFFN: Achieving Parameter Efficiency in LLMs via Adaptive Wide-and-Deep Reuse, Ying Nie et.al., Paper: http://arxiv.org/abs/2512.14531
- 2025-10-21, Verifiable Accuracy and Abstention Rewards in Curriculum RL to Alleviate Lost-in-Conversation, Ming Li et.al., Paper: http://arxiv.org/abs/2510.18731
- 2026-02-20, VeriSoftBench: Repository-Scale Formal Verification Benchmarks for Lean, Yutong Xin et.al., Paper: http://arxiv.org/abs/2602.18307
- 2025-10-31, VeriMoA: A Mixture-of-Agents Framework for Spec-to-HDL Generation, Heng Ping et.al., Paper: http://arxiv.org/abs/2510.27617
- 2025-11-06, VeriCoT: Neuro-symbolic Chain-of-Thought Validation via Logical Consistency Checks, Yu Feng et.al., Paper: http://arxiv.org/abs/2511.04662
- 2026-01-27, Veri-Sure: A Contract-Aware Multi-Agent Framework with Temporal Tracing and Formal Verification for Correct RTL Code Generation, Jiale Liu et.al., Paper: http://arxiv.org/abs/2601.19747
- 2026-01-29, Value-Based Pre-Training with Downstream Feedback, Shuqi Ke et.al., Paper: http://arxiv.org/abs/2601.22108
- 2026-01-14, Value-Aware Numerical Representations for Transformer Language Models, Andreea Dutulescu et.al., Paper: http://arxiv.org/abs/2601.09706
- 2025-12-04, Value Gradient Guidance for Flow Matching Alignment, Zhen Liu et.al., Paper: http://arxiv.org/abs/2512.05116
- 2025-10-30, Value Drifts: Tracing Value Alignment During LLM Post-Training, Mehar Bhatia et.al., Paper: http://arxiv.org/abs/2510.26707
- 2025-10-31, Validity Is What You Need, Sebastian Benthall et.al., Paper: http://arxiv.org/abs/2510.27628
- 2026-02-20, Validating Political Position Predictions of Arguments, Jordan Robinson et.al., Paper: http://arxiv.org/abs/2602.18351
- 2025-12-17, VTCBench: Can Vision-Language Models Understand Long Context with Vision-Text Compression?, Hongbo Zhao et.al., Paper: http://arxiv.org/abs/2512.15649
- 2026-01-29, VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning, Yibo Wang et.al., Paper: http://arxiv.org/abs/2601.22069
- 2025-12-05, VRSA: Jailbreaking Multimodal Large Language Models through Visual Reasoning Sequential Attack, Shiji Zhao et.al., Paper: http://arxiv.org/abs/2512.05853
- 2026-03-17, VQKV: High-Fidelity and High-Ratio Cache Compression via Vector-Quantization, Yixuan Wang et.al., Paper: http://arxiv.org/abs/2603.16435
- 2025-10-27, VOLD: Reasoning Transfer from LLMs to Vision-Language Models via On-Policy Distillation, Walid Bousselham et.al., Paper: http://arxiv.org/abs/2510.23497
- 2025-12-16, VLegal-Bench: Cognitively Grounded Benchmark for Vietnamese Legal Reasoning of Large Language Models, Nguyen Tien Dong et.al., Paper: http://arxiv.org/abs/2512.14554
- 2026-03-18, VLM2Rec: Resolving Modality Collapse in Vision-Language Model Embedders for Multimodal Sequential Recommendation, Junyoung Kim et.al., Paper: http://arxiv.org/abs/2603.17450
- 2025-12-17, VLIC: Vision-Language Models As Perceptual Judges for Human-Aligned Image Compression, Kyle Sargent et.al., Paper: http://arxiv.org/abs/2512.15701
- 2025-12-11, VL-JEPA: Joint Embedding Predictive Architecture for Vision-language, Delong Chen et.al., Paper: http://arxiv.org/abs/2512.10942
- 2026-03-24, VISion On Request: Enhanced VLLM efficiency with sparse, dynamically selected, vision-language interactions, Adrian Bulat et.al., Paper: http://arxiv.org/abs/2603.23495
- 2026-02-04, VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text?, Qing’an Liu et.al., Paper: http://arxiv.org/abs/2602.04802
- 2026-02-12, VIRENA: Virtual Arena for Research, Education, and Democratic Innovation, Emma Hoes et.al., Paper: http://arxiv.org/abs/2602.12207
- 2026-02-20, VIRAASAT: Traversing Novel Paths for Indian Cultural Reasoning, Harshul Raj Surana et.al., Paper: http://arxiv.org/abs/2602.18429
- 2026-01-05, VINO: A Unified Visual Generator with Interleaved OmniModal Context, Junyi Chen et.al., Paper: http://arxiv.org/abs/2601.02358
- 2026-03-25, VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models, Qijia He et.al., Paper: http://arxiv.org/abs/2603.24575
- 2026-03-19, VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models, Chonghan Liu et.al., Paper: http://arxiv.org/abs/2603.19152
- 2025-11-24, VDC-Agent: When Video Detailed Captioners Evolve Themselves via Agentic Self-Reflection, Qiang Wang et.al., Paper: http://arxiv.org/abs/2511.19436
- 2025-10-21, VAR: Visual Attention Reasoning via Structured Search and Backtracking, Wei Cai et.al., Paper: http://arxiv.org/abs/2510.18619
- 2025-11-10, VADER: Towards Causal Video Anomaly Understanding with Relation-Aware Large Language Models, Ying Cheng et.al., Paper: http://arxiv.org/abs/2511.07299
- 2026-02-05, V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval, Dongyang Chen et.al., Paper: http://arxiv.org/abs/2602.06034
- 2025-11-20, V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models, Yang Luo et.al., Paper: http://arxiv.org/abs/2511.16668
- 2026-03-19, V-Dreamer: Automating Robotic Simulation and Trajectory Synthesis via Video Generation Priors, Songjia He et.al., Paper: http://arxiv.org/abs/2603.18811
- 2026-03-29, V-CAST: Video Curvature-Aware Spatio-Temporal Pruning for Efficient Video Large Language Models, Xinying Lin et.al., Paper: http://arxiv.org/abs/2603.27650
- 2026-03-03, Utonia: Toward One Encoder for All Point Clouds, Yujia Zhang et.al., Paper: http://arxiv.org/abs/2603.03283
- 2026-02-26, Utilizing LLMs for Industrial Process Automation, Salim Fares et.al., Paper: http://arxiv.org/abs/2602.23331
- 2026-03-20, Utility-Guided Agent Orchestration for Efficient LLM Tool Use, Boyan Liu et.al., Paper: http://arxiv.org/abs/2603.19896
- 2025-11-10, Using Vision Language Models as Closed-Loop Symbolic Planners for Robotic Applications: A Control-Theoretic Perspective, Hao Wang et.al., Paper: http://arxiv.org/abs/2511.07410
- 2026-03-03, Using Learning Progressions to Guide AI Feedback for Science Learning, Xin Xia et.al., Paper: http://arxiv.org/abs/2603.03249
- 2025-12-05, Using Large Language Models to Create Personalized Networks From Therapy Sessions, Clarissa W. Ong et.al., Paper: http://arxiv.org/abs/2512.05836
- 2026-02-19, Using LLMs for Knowledge Component-level Correctness Labeling in Open-ended Coding Problems, Zhangqi Duan et.al., Paper: http://arxiv.org/abs/2602.17542
- 2025-10-30, Using Copilot Agent Mode to Automate Library Migration: A Quantitative Assessment, Aylton Almeida et.al., Paper: http://arxiv.org/abs/2510.26699
- 2026-01-30, User Prompting Strategies and Prompt Enhancement Methods for Open-Set Object Detection in XR Environments, Junfeng Lin et.al., Paper: http://arxiv.org/abs/2601.23281
- 2025-10-23, User Perceptions of Privacy and Helpfulness in LLM Responses to Privacy-Sensitive Scenarios, Xiaoyuan Wu et.al., Paper: http://arxiv.org/abs/2510.20721
- 2025-10-29, User Misconceptions of LLM-Based Conversational Programming Assistants, Gabrielle O’Brien et.al., Paper: http://arxiv.org/abs/2510.25662
- 2026-03-26, Unveiling the Resilience of LLM-Enhanced Search Engines against Black-Hat SEO Manipulation, Pei Chen et.al., Paper: http://arxiv.org/abs/2603.25500
- 2025-10-30, Unveiling Intrinsic Text Bias in Multimodal Large Language Models through Attention Key-Space Analysis, Xinhan Zheng et.al., Paper: http://arxiv.org/abs/2510.26721
- 2025-12-02, Unrolled Networks are Conditional Probability Flows in MRI Reconstruction, Kehan Qi et.al., Paper: http://arxiv.org/abs/2512.03020
- 2026-02-19, Unmasking the Factual-Conceptual Gap in Persian Language Models, Alireza Sakhaeirad et.al., Paper: http://arxiv.org/abs/2602.17623
- 2026-01-16, Unlocking the Potentials of Retrieval-Augmented Generation for Diffusion Language Models, Chuanyue Yu et.al., Paper: http://arxiv.org/abs/2601.11342
- 2026-03-25, Unlocking Few-Shot Capabilities in LVLMs via Prompt Conditioning and Head Selection, Adhemar de Senneville et.al., Paper: http://arxiv.org/abs/2603.24181
- 2025-11-25, Unleashing the Power of Vision-Language Models for Long-Tailed Multi-Label Visual Recognition, Wei Tang et.al., Paper: http://arxiv.org/abs/2511.20641
- 2026-01-15, Unleashing the Capabilities of Large Vision-Language Models for Intelligent Perception of Roadside Infrastructure, Luxuan Fu et.al., Paper: http://arxiv.org/abs/2601.10551
- 2026-03-25, Unleashing Vision-Language Semantics for Deepfake Video Detection, Jiawen Zhu et.al., Paper: http://arxiv.org/abs/2603.24454
- 2026-03-24, Unleashing Spatial Reasoning in Multimodal Large Language Models via Textual Representation Guided Reasoning, Jiacheng Hua et.al., Paper: http://arxiv.org/abs/2603.23404
- 2026-02-12, Unknown Attack Detection in IoT Networks using Large Language Models: A Robust, Data-efficient Approach, Shan Ali et.al., Paper: http://arxiv.org/abs/2602.12183
- 2026-04-01, Universal YOCO for Efficient Depth Scaling, Yutao Sun et.al., Paper: http://arxiv.org/abs/2604.01220
- 2026-03-18, Universal Skeleton Understanding via Differentiable Rendering and MLLMs, Ziyi Wang et.al., Paper: http://arxiv.org/abs/2603.18003
- 2025-12-03, Unique Lives, Shared World: Learning from Single-Life Videos, Tengda Han et.al., Paper: http://arxiv.org/abs/2512.04085
- 2025-10-21, Unifying and Enhancing Graph Transformers via a Hierarchical Mask Framework, Yujie Xing et.al., Paper: http://arxiv.org/abs/2510.18825
- 2025-12-26, Unifying Learning Dynamics and Generalization in Transformers Scaling Law, Chiwun Yang et.al., Paper: http://arxiv.org/abs/2512.22088
- 2026-03-18, Unified Spatio-Temporal Token Scoring for Efficient Video VLMs, Jianrui Zhang et.al., Paper: http://arxiv.org/abs/2603.18004
- 2025-12-09, Unified Diffusion Transformer for High-fidelity Text-Aware Image Restoration, Jin Hyeon Kim et.al., Paper: http://arxiv.org/abs/2512.08922
- 2025-12-10, UniUGP: Unifying Understanding, Generation, and Planing For End-to-end Autonomous Driving, Hao Lu et.al., Paper: http://arxiv.org/abs/2512.09864
- 2026-02-12, UniT: Unified Multimodal Chain-of-Thought Test-time Scaling, Leon Liangyu Chen et.al., Paper: http://arxiv.org/abs/2602.12279
- 2026-03-25, UniScale: Synergistic Entire Space Data and Model Scaling for Search Ranking, Liren Yu et.al., Paper: http://arxiv.org/abs/2603.24226
- 2025-10-21, UniGenBench++: A Unified Semantic Evaluation Benchmark for Text-to-Image Generation, Yibin Wang et.al., Paper: http://arxiv.org/abs/2510.18701
- 2025-11-18, UniGen-1.5: Enhancing Image Generation and Editing through Reward Unification in Reinforcement Learning, Rui Tian et.al., Paper: http://arxiv.org/abs/2511.14760
- 2026-02-03, UniGeM: Unifying Data Mixing and Selection via Geometric Exploration and Mining, Changhao Wang et.al., Paper: http://arxiv.org/abs/2602.03772
- 2026-03-03, UniG2U-Bench: Do Unified Models Advance Multimodal Understanding?, Zimo Wen et.al., Paper: http://arxiv.org/abs/2603.03241
- 2026-03-24, UniFunc3D: Unified Active Spatial-Temporal Grounding for 3D Functionality Segmentation, Jiaying Lin et.al., Paper: http://arxiv.org/abs/2603.23478
- 2026-04-02, UniDriveVLA: Unifying Understanding, Perception, and Action Planning for Autonomous Driving, Yongkang Li et.al., Paper: http://arxiv.org/abs/2604.02190
- 2026-03-10, Understanding the Use of a Large Language Model-Powered Guide to Make Virtual Reality Accessible for Blind and Low Vision People, Jazmin Collins et.al., Paper: http://arxiv.org/abs/2603.09964
- 2026-03-10, Understanding the Interplay between LLMs’ Utilisation of Parametric and Contextual Knowledge: A keynote at ECIR 2025, Isabelle Augenstein et.al., Paper: http://arxiv.org/abs/2603.09654
- 2026-03-14, Understanding the Emergence of Seemingly Useless Features in Next-Token Predictors, Mark Rofin et.al., Paper: http://arxiv.org/abs/2603.14087
- 2026-02-03, Understanding and Exploiting Weight Update Sparsity for Communication-Efficient Distributed RL, Erfan Miahi et.al., Paper: http://arxiv.org/abs/2602.03839
- 2026-04-01, Understanding Transformers and Attention Mechanisms: An Introduction for Applied Mathematicians, Michel Fabrice Serret et.al., Paper: http://arxiv.org/abs/2604.00965
- 2025-12-08, Understanding Privacy Risks in Code Models Through Training Dynamics: A Causal Approach, Hua Yang et.al., Paper: http://arxiv.org/abs/2512.07814
- 2026-01-16, Understanding Help Seeking for Digital Privacy, Safety, and Security, Kurt Thomas et.al., Paper: http://arxiv.org/abs/2601.11398
- 2026-02-25, Understanding Artificial Theory of Mind: Perturbed Tasks and Reasoning in Large Language Models, Christian Nickel et.al., Paper: http://arxiv.org/abs/2602.22072
- 2026-03-04, Underrepresented in Foundation Model Pretraining Data? A One-Shot Probe, Chris Vorster et.al., Paper: http://arxiv.org/abs/2603.04346
- 2026-02-17, Under-resourced studies of under-resourced languages: lemmatization and POS-tagging with LLM annotators for historical Armenian, Georgian, Greek and Syriac, Chahan Vidal-Gorène et.al., Paper: http://arxiv.org/abs/2602.15753
- 2026-01-13, Uncovering Political Bias in Large Language Models using Parliamentary Voting Records, Jieying Chen et.al., Paper: http://arxiv.org/abs/2601.08785
- 2026-02-06, Uncovering Cross-Objective Interference in Multi-Objective Alignment, Yining Lu et.al., Paper: http://arxiv.org/abs/2602.06869
- 2025-11-05, Uncovering Code Insights: Leveraging GitHub Artifacts for Deeper Code Understanding, Ziv Nevo et.al., Paper: http://arxiv.org/abs/2511.03549
- 2026-04-01, Uncertainty-Aware Variational Reward Factorization via Probabilistic Preference Bases for LLM Personalization, Gyuseok Lee et.al., Paper: http://arxiv.org/abs/2604.00997
- 2026-02-27, Uncertainty Quantification for Multimodal Large Language Models with Incoherence-adjusted Semantic Volume, Gregory Kang Ruey Lau et.al., Paper: http://arxiv.org/abs/2602.24195
- 2026-03-29, Umwelt Engineering: Designing the Cognitive Worlds of Linguistic Agents, Rodney Jehu-Appiah et.al., Paper: http://arxiv.org/abs/2603.27626
- 2026-01-06, UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward, Yile Liu et.al., Paper: http://arxiv.org/abs/2601.03205
- 2025-12-03, Ultra-lightweight Neural Video Representation Compression, Ho Man Kwan et.al., Paper: http://arxiv.org/abs/2512.04019
- 2026-02-27, UXSim: Towards a Hybrid User Search Simulation, Saber Zerhoudi et.al., Paper: http://arxiv.org/abs/2602.24241
- 2025-10-21, UWBench: A Comprehensive Vision-Language Benchmark for Underwater Understanding, Da Zhang et.al., Paper: http://arxiv.org/abs/2510.18262
- 2025-11-13, URaG: Unified Retrieval and Generation in Multimodal LLMs for Efficient Long Document Understanding, Yongxin Shi et.al., Paper: http://arxiv.org/abs/2511.10552
- 2026-01-30, UPA: Unsupervised Prompt Agent via Tree-Based Search and Selection, Siran Peng et.al., Paper: http://arxiv.org/abs/2601.23273
- 2026-03-09, UNBOX: Unveiling Black-box visual models with Natural-language, Simone Carnemolla et.al., Paper: http://arxiv.org/abs/2603.08639
- 2025-11-24, UISearch: Graph-Based Embeddings for Multimodal Enterprise UI Screenshots Retrieval, Maroun Ayli et.al., Paper: http://arxiv.org/abs/2511.19380
- 2026-03-25, UI-Voyager: A Self-Evolving GUI Agent Learning via Failed Experience, Zichuan Lin et.al., Paper: http://arxiv.org/abs/2603.24533
- 2026-03-19, UGID: Unified Graph Isomorphism for Debiasing Large Language Models, Zikang Ding et.al., Paper: http://arxiv.org/abs/2603.19144
- 2026-01-29, UEval: A Benchmark for Unified Multimodal Generation, Bo Li et.al., Paper: http://arxiv.org/abs/2601.22155
- 2026-02-24, UDVideoQA: A Traffic Video Question Answering Dataset for Multi-Object Spatio-Temporal Reasoning in Urban Dynamics, Joseph Raj Vishal et.al., Paper: http://arxiv.org/abs/2602.21137
- 2025-11-05, U2F: Encouraging SWE-Agent to Seize Novelty without Losing Feasibility, Wencheng Ye et.al., Paper: http://arxiv.org/abs/2511.03517
- 2026-03-03, Type-Aware Retrieval-Augmented Generation with Dependency Closure for Solver-Executable Industrial Optimization Modeling, Y. Zhong et.al., Paper: http://arxiv.org/abs/2603.03180
- 2025-11-19, Two-Faced Social Agents: Context Collapse in Role-Conditioned Large Language Models, Vikram K Suresh et.al., Paper: http://arxiv.org/abs/2511.15573
- 2026-01-20, TwinBrainVLA: Unleashing the Potential of Generalist VLMs for Embodied Tasks via Asymmetric Mixture-of-Transformers, Bin Yu et.al., Paper: http://arxiv.org/abs/2601.14133
- 2026-03-07, Tursio for Credit Unions: Powering Structured Data Search with Automated Context Graph, Shivani Tripathi et.al., Paper: http://arxiv.org/abs/2603.07304
- 2026-02-24, Turning Semantics into Topology: LLM-Driven Attribute Augmentation for Collaborative Filtering, Junjie Meng et.al., Paper: http://arxiv.org/abs/2602.21099
- 2025-11-07, Turning Adversaries into Allies: Reversing Typographic Attacks for Multimodal E-Commerce Product Retrieval, Janet Jenq et.al., Paper: http://arxiv.org/abs/2511.05325
- 2026-03-17, TurnWise: The Gap between Single- and Multi-turn Language Model Capabilities, Victoria Graf et.al., Paper: http://arxiv.org/abs/2603.16759
- 2026-01-26, Trustworthy Evaluation of Robotic Manipulation: A New Benchmark and AutoEval Methods, Mengyuan Liu et.al., Paper: http://arxiv.org/abs/2601.18723
- 2026-02-10, Trustworthy Agentic AI Requires Deterministic Architectural Boundaries, Manish Bhattarai et.al., Paper: http://arxiv.org/abs/2602.09947
- 2025-12-05, Trusted AI Agents in the Cloud, Teofil Bodea et.al., Paper: http://arxiv.org/abs/2512.05951
- 2025-12-19, Trust-Region Adaptive Policy Optimization, Mingyu Su et.al., Paper: http://arxiv.org/abs/2512.17636
- 2026-04-01, True (VIS) Lies: Analyzing How Generative AI Recognizes Intentionality, Rhetoric, and Misleadingness in Visualization Lies, Graziano Blasilli et.al., Paper: http://arxiv.org/abs/2604.01181
- 2026-03-02, Trident: Adaptive Scheduling for Heterogeneous Multimodal Data Pipelines, Ding Pan et.al., Paper: http://arxiv.org/abs/2603.02075
- 2025-12-09, Tri-Bench: Stress-Testing VLM Reliability on Spatial Reasoning under Camera Tilt and Object Interference, Amit Bendkhale et.al., Paper: http://arxiv.org/abs/2512.08860
- 2026-01-23, Trapped in the past? Disentangling fluid and crystallized intelligence of large language models using chess, Leonard S. Pleiss et.al., Paper: http://arxiv.org/abs/2601.16823
- 2026-03-06, Transparent AI for Mathematics: Transformer-Based Large Language Models for Mathematical Entity Relationship Extraction with XAI, Tanjim Taharat Aurpa et.al., Paper: http://arxiv.org/abs/2603.06348
- 2025-11-10, Transformers Provably Learn Chain-of-Thought Reasoning with Length Generalization, Yu Huang et.al., Paper: http://arxiv.org/abs/2511.07378
- 2026-01-20, Transformer Architectures for Respiratory Sound Analysis and Multimodal Diagnosis, Theodore Aptekarev et.al., Paper: http://arxiv.org/abs/2601.14227
- 2025-12-05, Training-Time Action Conditioning for Efficient Real-Time Chunking, Kevin Black et.al., Paper: http://arxiv.org/abs/2512.05964
- 2026-01-30, Training-Free Test-Time Adaptation with Brownian Distance Covariance in Vision-Language Models, Yi Zhang et.al., Paper: http://arxiv.org/abs/2601.23253
- 2025-12-03, Training-Free Policy Violation Detection via Activation-Space Whitening in LLMs, Oren Rachmil et.al., Paper: http://arxiv.org/abs/2512.03994
- 2025-11-17, Training-Free Multi-View Extension of IC-Light for Textual Position-Aware Scene Relighting, Jiangnan Ye et.al., Paper: http://arxiv.org/abs/2511.13684
- 2025-12-09, Training-Free Dual Hyperbolic Adapters for Better Cross-Modal Reasoning, Yi Zhang et.al., Paper: http://arxiv.org/abs/2512.08820
- 2026-01-28, Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning, Minwu Kim et.al., Paper: http://arxiv.org/abs/2601.20829
- 2026-02-03, Training Multi-Turn Search Agent via Contrastive Dynamic Branch Sampling, Yubao Zhao et.al., Paper: http://arxiv.org/abs/2602.03719
- 2026-02-02, Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability, Xiao Liang et.al., Paper: http://arxiv.org/abs/2602.02477
- 2025-10-21, Training Diverse Graph Experts for Ensembles: A Systematic Empirical Study, Gangda Deng et.al., Paper: http://arxiv.org/abs/2510.18370
- 2026-03-18, Training Diffusion Language Models for Black-Box Optimization, Zipeng Sun et.al., Paper: http://arxiv.org/abs/2603.17919
- 2025-12-29, Training AI Co-Scientists Using Rubric Rewards, Shashwat Goel et.al., Paper: http://arxiv.org/abs/2512.23707
- 2026-03-17, Trained Persistent Memory for Frozen Encoder–Decoder LLMs: Six Architectural Methods, Hong Jeong et.al., Paper: http://arxiv.org/abs/2603.16413
- 2026-03-10, Tracking Cancer Through Text: Longitudinal Extraction From Radiology Reports Using Open-Source Large Language Models, Luc Builtjes et.al., Paper: http://arxiv.org/abs/2603.09638
- 2025-11-26, TraceGen: World Modeling in 3D Trace Space Enables Learning from Cross-Embodiment Videos, Seungjae Lee et.al., Paper: http://arxiv.org/abs/2511.21690
- 2026-02-06, TraceCoder: A Trace-Driven Multi-Agent Framework for Automated Debugging of LLM-Generated Code, Jiangping Huang et.al., Paper: http://arxiv.org/abs/2602.06875
- 2026-01-09, TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents, Dawei Wang et.al., Paper: http://arxiv.org/abs/2601.05899
- 2026-03-10, Towards a Neural Debugger for Python, Maximilian Beck et.al., Paper: http://arxiv.org/abs/2603.09951
- 2026-03-10, Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization, Ming Nie et.al., Paper: http://arxiv.org/abs/2603.09538
- 2026-01-21, Towards Understanding Best Practices for Quantization of Vision-Language Models, Gautom Das et.al., Paper: http://arxiv.org/abs/2601.15287
- 2025-11-05, Towards Transparent Stance Detection: A Zero-Shot Approach Using Implicit and Explicit Interpretability, Apoorva Upadhyaya et.al., Paper: http://arxiv.org/abs/2511.03635
- 2026-03-13, Towards Spatio-Temporal World Scene Graph Generation from Monocular Videos, Rohith Peddi et.al., Paper: http://arxiv.org/abs/2603.13185
- 2025-11-05, Towards Scalable Web Accessibility Audit with MLLMs as Copilots, Ming Gu et.al., Paper: http://arxiv.org/abs/2511.03471
- 2026-04-02, Towards Position-Robust Talent Recommendation via Large Language Models, Silin Du et.al., Paper: http://arxiv.org/abs/2604.02200
- 2025-12-16, Towards Nepali-language LLMs: Efficient GPT training with a Nepali BPE tokenizer, Adarsha Shrestha et.al., Paper: http://arxiv.org/abs/2512.14585
- 2025-11-28, Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach, Shuqi Liu et.al., Paper: http://arxiv.org/abs/2511.23335
- 2025-10-21, Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement Learning, Chenghao Zhu et.al., Paper: http://arxiv.org/abs/2510.18849
- 2025-12-19, Towards Explainable Conversational AI for Early Diagnosis with Large Language Models, Maliha Tabassum et.al., Paper: http://arxiv.org/abs/2512.17559
- 2025-11-13, Towards Emotionally Intelligent and Responsible Reinforcement Learning, Garapati Keerthana et.al., Paper: http://arxiv.org/abs/2511.10573
- 2025-12-11, Towards Efficient and Effective Multi-Camera Encoding for End-to-End Driving, Jiawei Yang et.al., Paper: http://arxiv.org/abs/2512.10947
- 2025-11-28, Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent, Jianzhe Lin et.al., Paper: http://arxiv.org/abs/2511.23436
- 2026-03-11, Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis, Yujie Zheng et.al., Paper: http://arxiv.org/abs/2603.10846
- 2025-12-22, Towards Closed-Loop Embodied Empathy Evolution: Probing LLM-Centric Lifelong Empathic Motion Generation in Unseen Scenarios, Jiawen Wang et.al., Paper: http://arxiv.org/abs/2512.19551
- 2025-11-13, Towards Blind and Low-Vision Accessibility of Lightweight VLMs and Custom LLM-Evals, Shruti Singh Baghel et.al., Paper: http://arxiv.org/abs/2511.10615
- 2026-03-25, Towards Automated Crowdsourced Testing via Personified-LLM, Shengcheng Yu et.al., Paper: http://arxiv.org/abs/2603.24160
- 2026-02-19, Towards Anytime-Valid Statistical Watermarking, Baihe Huang et.al., Paper: http://arxiv.org/abs/2602.17608
- 2026-01-14, Toward Understanding Unlearning Difficulty: A Mechanistic Perspective and Circuit-Guided Difficulty Metric, Jiali Cheng et.al., Paper: http://arxiv.org/abs/2601.09624
- 2025-12-18, Toward Systematic Counterfactual Fairness Evaluation of Large Language Models: The CAFFE Framework, Alessandra Parziale et.al., Paper: http://arxiv.org/abs/2512.16816
- 2026-03-29, Toward Reliable Evaluation of LLM-Based Financial Multi-Agent Systems: Taxonomy, Coordination Primacy, and Cost Awareness, Phat Nguyen et.al., Paper: http://arxiv.org/abs/2603.27539
- 2026-02-27, Toward Guarantees for Clinical Reasoning in Vision Language Models via Formal Verification, Vikash Singh et.al., Paper: http://arxiv.org/abs/2602.24111
- 2026-01-05, Toward Global Large Language Models in Medicine, Rui Yang et.al., Paper: http://arxiv.org/abs/2601.02186
- 2025-12-09, Toward Faithful Retrieval-Augmented Generation with Sparse Autoencoders, Guangzhi Xiong et.al., Paper: http://arxiv.org/abs/2512.08892
- 2026-02-26, Toward Expert Investment Teams:A Multi-Agent LLM System with Fine-Grained Trading Tasks, Kunihiro Miyazaki et.al., Paper: http://arxiv.org/abs/2602.23330
- 2026-03-17, Toward Experimentation-as-a-Service in 5G/6G: The Plaza6G Prototype for AI-Assisted Trials, Sergio Barrachina-Muñoz et.al., Paper: http://arxiv.org/abs/2603.16356
- 2025-12-19, Toward Ethical AI Through Bayesian Uncertainty in Neural Question Answering, Riccardo Di Sipio et.al., Paper: http://arxiv.org/abs/2512.17677
- 2026-01-20, Toward Efficient Agents: Memory, Tool learning, and Planning, Xiaofang Yang et.al., Paper: http://arxiv.org/abs/2601.14192
- 2025-10-21, Topoformer: brain-like topographic organization in Transformer language models through spatial querying and reweighting, Taha Binhuraib et.al., Paper: http://arxiv.org/abs/2510.18745
- 2026-03-13, Topo-R1: Detecting Topological Anomalies via Vision-Language Models, Meilong Xu et.al., Paper: http://arxiv.org/abs/2603.13054
- 2025-10-22, Top-P Masking for Cross Language Information Retrieval, Joseph Casale et.al., Paper: http://arxiv.org/abs/2510.19758
- 2025-11-26, ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration, Hongjin Su et.al., Paper: http://arxiv.org/abs/2511.21689
- 2025-10-22, ToolDreamer: Instilling LLM Reasoning Into Tool Retrievers, Saptarshi Sengupta et.al., Paper: http://arxiv.org/abs/2510.19791
- 2025-10-28, Tongyi DeepResearch Technical Report, Tongyi DeepResearch Team et.al., Paper: http://arxiv.org/abs/2510.24701
- 2025-10-21, Tokencake: A KV-Cache-centric Serving Framework for LLM-based Multi-Agent Applications, Zhuohang Bian et.al., Paper: http://arxiv.org/abs/2510.18586
- 2026-01-27, TokenSeek: Memory Efficient Fine Tuning via Instance-Aware Token Ditching, Runjia Zeng et.al., Paper: http://arxiv.org/abs/2601.19739
- 2025-12-02, TokenPowerBench: Benchmarking the Power Consumption of LLM Inference, Chenxu Niu et.al., Paper: http://arxiv.org/abs/2512.03024
- 2026-03-12, To Words and Beyond: Probing Large Language Models for Sentence-Level Psycholinguistic Norms of Memorability and Reading Times, Thomas Hikaru Clark et.al., Paper: http://arxiv.org/abs/2603.12105
- 2026-02-23, To Reason or Not to: Selective Chain-of-Thought in Medical Question Answering, Zaifu Zhan et.al., Paper: http://arxiv.org/abs/2602.20130
- 2025-11-17, Tissue Aware Nuclei Detection and Classification Model for Histopathology Images, Kesi Xu et.al., Paper: http://arxiv.org/abs/2511.13615
- 2025-12-18, Tiny Recursive Control: Iterative Reasoning for Efficient Optimal Control, Amit Jain et.al., Paper: http://arxiv.org/abs/2512.16824
- 2026-03-19, Tinted Frames: Question Framing Blinds Vision-Language Models, Wan-Cyuan Fan et.al., Paper: http://arxiv.org/abs/2603.19203
- 2025-11-20, TimeViper: A Hybrid Mamba-Transformer Vision-Language Model for Efficient Long Video Understanding, Boshen Xu et.al., Paper: http://arxiv.org/abs/2511.16595
- 2025-12-16, TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs, Jun Zhang et.al., Paper: http://arxiv.org/abs/2512.14698
- 2026-01-22, Timbre-Aware LLM-based Direct Speech-to-Speech Translation Extendable to Multiple Language Pairs, Lalaram Arya et.al., Paper: http://arxiv.org/abs/2601.16023
- 2025-11-26, Tidal forces around the Letelier-Alencar cloud of strings black hole, Marcos V. de S. Silva et.al., Paper: http://arxiv.org/abs/2511.21604
- 2025-11-17, TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models, Harold Haodong Chen et.al., Paper: http://arxiv.org/abs/2511.13704
- 2025-12-16, TiME: Tiny Monolingual Encoders for Efficient NLP Pipelines, David Schulmeister et.al., Paper: http://arxiv.org/abs/2512.14645
- 2026-02-17, This human study did not involve human subjects: Validating LLM simulations as behavioral evidence, Jessica Hullman et.al., Paper: http://arxiv.org/abs/2602.15785
- 2025-11-20, Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation, Ziyu Guo et.al., Paper: http://arxiv.org/abs/2511.16671
- 2025-11-06, Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm, Jingqi Tong et.al., Paper: http://arxiv.org/abs/2511.04570
- 2026-02-05, Thinking with Geometry: Active Geometry Integration for Spatial Reasoning, Haoyuan Li et.al., Paper: http://arxiv.org/abs/2602.06037
- 2026-01-07, Thinking with Frames: Generative Video Distortion Evaluation via Frame Reward Model, Yuan Wang et.al., Paper: http://arxiv.org/abs/2601.04033
- 2026-02-02, Thinking with Comics: Enhancing Multimodal Reasoning through Structured Visual Storytelling, Andong Chen et.al., Paper: http://arxiv.org/abs/2602.02453
- 2026-02-20, Thinking by Subtraction: Confidence-Driven Contrastive Decoding for LLM Reasoning, Lexiang Tang et.al., Paper: http://arxiv.org/abs/2602.18232
- 2025-11-28, Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction, Bao Shu et.al., Paper: http://arxiv.org/abs/2511.23476
- 2026-02-26, ThinkOmni: Lifting Textual Reasoning to Omni-modal Scenarios via Guidance Decoding, Yiran Guan et.al., Paper: http://arxiv.org/abs/2602.23306
- 2026-03-23, ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model, Haichao Zhang et.al., Paper: http://arxiv.org/abs/2603.22281
- 2025-12-29, ThinkGen: Generalized Thinking for Visual Generation, Siyu Jiao et.al., Paper: http://arxiv.org/abs/2512.23568
- 2026-01-16, Think-Clip-Sample: Slow-Fast Frame Selection for Video Understanding, Wenhui Tan et.al., Paper: http://arxiv.org/abs/2601.11359
- 2025-10-21, Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views, Zhangquan Chen et.al., Paper: http://arxiv.org/abs/2510.18632
- 2026-02-12, Think like a Scientist: Physics-guided LLM Agent for Equation Discovery, Jianke Yang et.al., Paper: http://arxiv.org/abs/2602.12259
- 2025-11-19, Think Visually, Reason Textually: Vision-Language Synergy in ARC, Beichen Zhang et.al., Paper: http://arxiv.org/abs/2511.15703
- 2025-10-27, Think Twice: Branch-and-Rethink Reasoning Reward Model, Yizhu Jiao et.al., Paper: http://arxiv.org/abs/2510.23596
- 2026-03-10, Think Before You Lie: How Reasoning Improves Honesty, Ann Yuan et.al., Paper: http://arxiv.org/abs/2603.09957
- 2026-03-31, Think Anywhere in Code Generation, Xue Jiang et.al., Paper: http://arxiv.org/abs/2603.29957
- 2026-02-03, They Said Memes Were Harmless-We Found the Ones That Hurt: Decoding Jokes, Symbols, and Cultural References, Sahil Tripathi et.al., Paper: http://arxiv.org/abs/2602.03822
- 2025-11-28, ThetaEvolve: Test-time Learning on Open Problems, Yiping Wang et.al., Paper: http://arxiv.org/abs/2511.23473
- 2026-02-16, ThermEval: A Structured Benchmark for Evaluation of Vision-Language Models on Thermal Imagery, Ayush Shrivastava et.al., Paper: http://arxiv.org/abs/2602.14989
- 2026-04-01, Therefore I am. I Think, Esakkivel Esakkiraja et.al., Paper: http://arxiv.org/abs/2604.01202
- 2025-10-29, TheraMind: A Strategic and Adaptive Agent for Longitudinal Psychological Counseling, He Hu et.al., Paper: http://arxiv.org/abs/2510.25758
- 2026-03-04, Theory Discovery in Social Networks: Automating ERGM Specification with Large Language Models, Yidan Sun et.al., Paper: http://arxiv.org/abs/2603.04306
- 2026-01-16, The unreasonable effectiveness of pattern matching, Gary Lupyan et.al., Paper: http://arxiv.org/abs/2601.11432
- 2025-11-26, The author is dead, but what if they never lived? A reception experiment on Czech AI- and human-authored poetry, Anna Marklová et.al., Paper: http://arxiv.org/abs/2511.21629
- 2026-03-07, The Yerkes-Dodson Curve for AI Agents: Emergent Cooperation Under Environmental Pressure in Multi-Agent LLM Simulations, Ivan Pasichnyk et.al., Paper: http://arxiv.org/abs/2603.07360
- 2026-02-16, The Well-Tempered Classifier: Some Elementary Properties of Temperature Scaling, Pierre-Alexandre Mattei et.al., Paper: http://arxiv.org/abs/2602.14862
- 2026-03-18, The Unreasonable Effectiveness of Text Embedding Interpolation for Continuous Image Steering, Yigit Ekin et.al., Paper: http://arxiv.org/abs/2603.17998
- 2025-10-24, The Universal Landscape of Human Reasoning, Qiguang Chen et.al., Paper: http://arxiv.org/abs/2510.21623
- 2025-10-21, The Trust Paradox in LLM-Based Multi-Agent Systems: When Collaboration Becomes a Security Vulnerability, Zijie Xu et.al., Paper: http://arxiv.org/abs/2510.18563
- 2026-03-31, The Triadic Cognitive Architecture: Bounding Autonomous Action via Spatio-Temporal and Epistemic Friction, Davide Di Gioia et.al., Paper: http://arxiv.org/abs/2603.30031
- 2026-03-07, The Third Ambition: Artificial Intelligence and the Science of Human Behavior, W. Russell Neuman et.al., Paper: http://arxiv.org/abs/2603.07329
- 2025-10-22, The Tail Tells All: Estimating Model-Level Membership Inference Vulnerability Without Reference Models, Euodia Dodd et.al., Paper: http://arxiv.org/abs/2510.19773
- 2026-02-27, The Subjectivity of Monoculture, Nathanael Jo et.al., Paper: http://arxiv.org/abs/2602.24086
- 2026-03-05, The Spike, the Sparse and the Sink: Anatomy of Massive Activations and Attention Sinks, Shangwen Sun et.al., Paper: http://arxiv.org/abs/2603.05498
- 2026-01-06, The Sonar Moment: Benchmarking Audio-Language Models in Audio Geo-Localization, Ruixing Zhang et.al., Paper: http://arxiv.org/abs/2601.03227
- 2026-01-20, The Side Effects of Being Smart: Safety Risks in MLLMs’ Multi-Image Reasoning, Renmiao Chen et.al., Paper: http://arxiv.org/abs/2601.14127
- 2025-11-19, The SA-FARI Dataset: Segment Anything in Footage of Animals for Recognition and Identification, Dante Francisco Wasmuht et.al., Paper: http://arxiv.org/abs/2511.15622
- 2026-03-20, The Robot’s Inner Critic: Self-Refinement of Social Behaviors through VLM-based Replanning, Jiyu Lim et.al., Paper: http://arxiv.org/abs/2603.20164
- 2026-02-06, The Representational Geometry of Number, Zhimin Hu et.al., Paper: http://arxiv.org/abs/2602.06843
- 2026-01-02, The Reasoning-Creativity Trade-off: Toward Creativity-Driven Problem Solving, Max Ruiz Luyten et.al., Paper: http://arxiv.org/abs/2601.00747
- 2026-03-14, The Reasoning Bottleneck in Graph-RAG: Structured Prompting and Context Compression for Multi-Hop QA, Yasaman Zarinkia et.al., Paper: http://arxiv.org/abs/2603.14045
- 2026-01-20, The Quest for Reliable AI Accelerators: Cross-Layer Evaluation and Design Optimization, Meng Li et.al., Paper: http://arxiv.org/abs/2601.14148
- 2026-02-06, The Quantum Sieve Tracer: A Hybrid Framework for Layer-Wise Activation Tracing in Large Language Models, Jonathan Pan et.al., Paper: http://arxiv.org/abs/2602.06852
- 2026-01-14, The Promptware Kill Chain: How Prompt Injections Gradually Evolved Into a Multi-Step Malware, Ben Nassi et.al., Paper: http://arxiv.org/abs/2601.09625
- 2025-11-28, The Price of Progress: Algorithmic Efficiency and the Falling Cost of AI Inference, Hans Gundlach et.al., Paper: http://arxiv.org/abs/2511.23455
- 2026-02-16, The Potential of CoT for Reasoning: A Closer Look at Trace Dynamics, Gregor Bachmann et.al., Paper: http://arxiv.org/abs/2602.14903
- 2026-03-16, The PokeAgent Challenge: Competitive and Long-Context Learning at Scale, Seth Karten et.al., Paper: http://arxiv.org/abs/2603.15563
- 2026-01-29, The Patient is not a Moving Document: A World Model Training Paradigm for Longitudinal EHR, Irsyad Adam et.al., Paper: http://arxiv.org/abs/2601.22128
- 2026-03-09, The Neural Compass: Probabilistic Relative Feature Fields for Robotic Search, Gabriele Somaschini et.al., Paper: http://arxiv.org/abs/2603.08544
- 2025-12-02, The Moral Consistency Pipeline: Continuous Ethical Evaluation for Large Language Models, Saeid Jamshidi et.al., Paper: http://arxiv.org/abs/2512.03026
- 2026-01-09, The Molecular Structure of Thought: Mapping the Topology of Long Chain-of-Thought Reasoning, Qiguang Chen et.al., Paper: http://arxiv.org/abs/2601.06002
- 2025-10-29, The Limits of Obliviate: Evaluating Unlearning in LLMs via Stimulus-Knowledge Entanglement-Behavior Framework, Aakriti Shah et.al., Paper: http://arxiv.org/abs/2510.25732
- 2026-02-23, The LLMbda Calculus: AI Agents, Conversations, and Information Flow, Zac Garby et.al., Paper: http://arxiv.org/abs/2602.20064
- 2025-11-19, The Impact of Quantization on Large Reasoning Model Reinforcement Learning, Medha Kumar et.al., Paper: http://arxiv.org/abs/2511.15694
- 2025-12-31, The Impact of LLMs on Online News Consumption and Production, Hangcheng Zhao et.al., Paper: http://arxiv.org/abs/2512.24968
- 2025-10-21, The Impact of Image Resolution on Biomedical Multimodal Large Language Models, Liangyu Chen et.al., Paper: http://arxiv.org/abs/2510.18304
- 2026-02-17, The Geometry of Alignment Collapse: When Fine-Tuning Breaks Safety, Max Springer et.al., Paper: http://arxiv.org/abs/2602.15799
- 2026-01-21, The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models, Zanlin Ni et.al., Paper: http://arxiv.org/abs/2601.15165
- 2025-10-22, The Feasibility of Training Sovereign Language Models in the Global South: A Study of Brazil and Mexico, Sandra Malagon et.al., Paper: http://arxiv.org/abs/2510.19801
- 2025-12-11, The FACTS Leaderboard: A Comprehensive Benchmark for Large Language Model Factuality, Aileen Cheng et.al., Paper: http://arxiv.org/abs/2512.10791
- 2026-04-02, The Expert Strikes Back: Interpreting Mixture-of-Experts Language Models at Expert Level, Jeremy Herbst et.al., Paper: http://arxiv.org/abs/2604.02178
- 2025-12-02, The Evolutionary Ecology of Software: Constraints, Innovation, and the AI Disruption, Sergi Valverde et.al., Paper: http://arxiv.org/abs/2512.02953
- 2025-10-30, The Era of Agentic Organization: Learning to Organize with Language Models, Zewen Chi et.al., Paper: http://arxiv.org/abs/2510.26658
- 2025-12-22, The Epistemological Consequences of Large Language Models: Rethinking collective intelligence and institutional knowledge, Angjelin Hila et.al., Paper: http://arxiv.org/abs/2512.19570
- 2025-10-30, The End of Manual Decoding: Towards Truly End-to-End Language Models, Zhichao Wang et.al., Paper: http://arxiv.org/abs/2510.26697
- 2026-01-21, The Effect of Scripts and Formats on LLM Numeracy, Varshini Reddy et.al., Paper: http://arxiv.org/abs/2601.15251
- 2025-11-28, The EBLM Project XVIII. 3D Obliquities of Five Low-Mass Eclipsing Binaries, Becca Spejcher et.al., Paper: http://arxiv.org/abs/2511.23430
- 2026-03-23, The Dual Mechanisms of Spatial Reasoning in Vision-Language Models, Kelly Cui et.al., Paper: http://arxiv.org/abs/2603.22278
- 2026-03-11, The Discrete Charm of the MLP: Binary Routing of Continuous Signals in Transformer Feed-Forward Layers, Peter Balogh et.al., Paper: http://arxiv.org/abs/2603.10985
- 2026-02-24, The Diffusion Duality, Chapter II: $Ψ$ -Samplers and Efficient Curriculum, Justin Deschenaux et.al., Paper: http://arxiv.org/abs/2602.21185
- 2025-11-25, The Consistency Critic: Correcting Inconsistencies in Generated Images via Reference-Guided Attentive Alignment, Ziheng Ouyang et.al., Paper: http://arxiv.org/abs/2511.20614
- 2026-01-12, The Confidence Trap: Gender Bias and Predictive Certainty in LLMs, Ahmed Sabir et.al., Paper: http://arxiv.org/abs/2601.07806
- 2026-03-04, The Company You Keep: How LLMs Respond to Dark Triad Traits, Zeyi Lu et.al., Paper: http://arxiv.org/abs/2603.04299
- 2025-12-29, The Big Three in Marriage Talk: LLM-Assisted Analysis of Moral Ethics and Sentiment on Weibo and Xiaohongshu, Frank Tian-Fang Ye et.al., Paper: http://arxiv.org/abs/2512.23609
- 2025-10-21, The Attribution Story of WhisperGate: An Academic Perspective, Oleksandr Adamov et.al., Paper: http://arxiv.org/abs/2510.18484
- 2025-12-01, The Art of Scaling Test-Time Compute for Large Language Models, Aradhye Agarwal et.al., Paper: http://arxiv.org/abs/2512.02008
- 2025-10-22, The Art of Asking: Multilingual Prompt Optimization for Synthetic Data, David Mora et.al., Paper: http://arxiv.org/abs/2510.19806
- 2026-03-20, The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $λ$ -Calculus, Amartya Roy et.al., Paper: http://arxiv.org/abs/2603.20105
- 2025-11-21, That’s not natural: The Impact of Off-Policy Training Data on Probe Performance, Nathalie Kirch et.al., Paper: http://arxiv.org/abs/2511.17408
- 2025-11-13, Textual understanding boost in the WikiRace, Raman Ebrahimi et.al., Paper: http://arxiv.org/abs/2511.10585
- 2025-12-15, Textual Gradients are a Flawed Metaphor for Automatic Prompt Optimization, Daniel Melcer et.al., Paper: http://arxiv.org/abs/2512.13598
- 2025-10-21, Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs, Yanhong Li et.al., Paper: http://arxiv.org/abs/2510.18279
- 2026-02-16, Text Style Transfer with Parameter-efficient LLM Finetuning and Round-trip Translation, Ruoxi Liu et.al., Paper: http://arxiv.org/abs/2602.15013
- 2026-03-03, Tether: Autonomous Functional Play with Correspondence-Driven Trajectory Warping, William Liang et.al., Paper: http://arxiv.org/abs/2603.03278
- 2026-02-16, Testimole-Conversational: A 30-Billion-Word Italian Discussion Board Corpus (1996-2024) for Language Modeling and Sociolinguistic Research, Matteo Rinaldi et.al., Paper: http://arxiv.org/abs/2602.14819
- 2026-03-13, Test-Time Attention Purification for Backdoored Large Vision Language Models, Zhifang Zhang et.al., Paper: http://arxiv.org/abs/2603.12989
- 2025-11-14, Terrain Costmap Generation via Scaled Preference Conditioning, Luisa Mao et.al., Paper: http://arxiv.org/abs/2511.11529
- 2026-01-13, TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback, Prithwish Jana et.al., Paper: http://arxiv.org/abs/2601.08734
- 2026-02-27, Terminology Rarity Predicts Catastrophic Failure in LLM Translation of Low-Resource Ancient Languages: Evidence from Ancient Greek, James L. Zainaldin et.al., Paper: http://arxiv.org/abs/2602.24119
- 2025-12-15, Temporal Tokenization Strategies for Event Sequence Modeling with Large Language Models, Zefang Liu et.al., Paper: http://arxiv.org/abs/2512.13618
- 2026-02-26, Tell Me What To Learn: Generalizing Neural Memory to be Controllable in Natural Language, Max S. Bennett et.al., Paper: http://arxiv.org/abs/2602.23201
- 2026-02-04, Team, Then Trim: An Assembly-Line LLM Framework for High-Quality Tabular Data Generation, Congjing Zhang et.al., Paper: http://arxiv.org/abs/2602.04785
- 2026-03-13, Team RAS in 10th ABAW Competition: Multimodal Valence and Arousal Estimation Approach, Elena Ryumina et.al., Paper: http://arxiv.org/abs/2603.13056
- 2025-12-03, Teaching Using Immersion - Explaining Magnetism and Eclipses in a Planetarium Dome, Patricia H Reiff et.al., Paper: http://arxiv.org/abs/2512.04027
- 2025-11-10, Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence, Sean McLeish et.al., Paper: http://arxiv.org/abs/2511.07384
- 2025-12-03, Teaching Old Tokenizers New Words: Efficient Tokenizer Adaptation for Pre-trained Models, Taido Purason et.al., Paper: http://arxiv.org/abs/2512.03989
- 2025-11-07, TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework, Chao Zhang et.al., Paper: http://arxiv.org/abs/2511.05385
- 2026-03-04, TaxonRL: Reinforcement Learning with Intermediate Rewards for Interpretable Fine-Grained Visual Reasoning, Maximilian von Klinski et.al., Paper: http://arxiv.org/abs/2603.04380
- 2026-02-27, Task-Centric Acceleration of Small-Language Models, Dor Tsur et.al., Paper: http://arxiv.org/abs/2602.24174
- 2026-02-06, TamperBench: Systematically Stress-Testing LLM Safety Under Fine-Tuning and Tampering, Saad Hossain et.al., Paper: http://arxiv.org/abs/2602.06911
- 2025-11-20, Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter, Qinghao Hu et.al., Paper: http://arxiv.org/abs/2511.16665
- 2026-02-27, Taming Momentum: Rethinking Optimizer States Through Low-Rank Approximation, Zhengbo Wang et.al., Paper: http://arxiv.org/abs/2602.24283
- 2026-01-05, Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes, Jing Tan et.al., Paper: http://arxiv.org/abs/2601.02356
- 2025-11-18, Talk, Snap, Complain: Validation-Aware Multimodal Expert Framework for Fine-Grained Customer Grievances, Rishu Kumar Singh et.al., Paper: http://arxiv.org/abs/2511.14693
- 2026-03-06, Talk Freely, Execute Strictly: Schema-Gated Agentic AI for Flexible and Reproducible Scientific Workflows, Joel Strickland et.al., Paper: http://arxiv.org/abs/2603.06394
- 2026-03-11, Taking Shortcuts for Categorical VQA Using Super Neurons, Pierre Musacchio et.al., Paper: http://arxiv.org/abs/2603.10781
- 2026-03-22, TabPFN Extensions for Interpretable Geotechnical Modelling, Taiga Saito et.al., Paper: http://arxiv.org/abs/2603.21033
- 2026-03-16, TabKD: Tabular Knowledge Distillation through Interaction Diversity of Learned Feature Bins, Shovon Niverd Pereira et.al., Paper: http://arxiv.org/abs/2603.15481
- 2026-02-11, TabICLv2: A better, faster, scalable, and open tabular foundation model, Jingang Qu et.al., Paper: http://arxiv.org/abs/2602.11139
- 2025-11-17, TZ-LLM: Protecting On-Device Large Language Models with Arm TrustZone, Xunjie Wang et.al., Paper: http://arxiv.org/abs/2511.13717
- 2025-11-04, TWIST2: Scalable, Portable, and Holistic Humanoid Data Collection System, Yanjie Ze et.al., Paper: http://arxiv.org/abs/2511.02832
- 2025-12-04, TV2TV: A Unified Framework for Interleaved Language and Video Generation, Xiaochuang Han et.al., Paper: http://arxiv.org/abs/2512.05103
- 2026-01-30, TSAQA: Time Series Analysis Question And Answering Benchmark, Baoyu Jing et.al., Paper: http://arxiv.org/abs/2601.23204
- 2026-03-18, TRiMS: Real-Time Tracking of Minimal Sufficient Length for Efficient Reasoning via RL, Tingcheng Bian et.al., Paper: http://arxiv.org/abs/2603.17449
- 2026-03-24, TRAP: Hijacking VLA CoT-Reasoning via Adversarial Patches, Zhengxian Huang et.al., Paper: http://arxiv.org/abs/2603.23117
- 2025-12-05, TRACE: A Framework for Analyzing and Enhancing Stepwise Reasoning in Vision-Language Models, Shima Imani et.al., Paper: http://arxiv.org/abs/2512.05943
- 2026-04-02, TRACE-Bot: Detecting Emerging LLM-Driven Social Bots via Implicit Semantic Representations and AIGC-Enhanced Behavioral Patterns, Zhongbo Wang et.al., Paper: http://arxiv.org/abs/2604.02147
- 2026-03-11, TOSSS: a CVE-based Software Security Benchmark for Large Language Models, Marc Damie et.al., Paper: http://arxiv.org/abs/2603.10969
- 2025-12-18, TOGGLE: Temporal Logic-Guided Large Language Model Compression for Edge, Khurram Khalil et.al., Paper: http://arxiv.org/abs/2512.16855
- 2026-01-30, TEON: Tensorized Orthonormalization Beyond Layer-Wise Muon for Large Language Model Pre-Training, Ruijie Zhang et.al., Paper: http://arxiv.org/abs/2601.23261
- 2026-02-11, TEGRA: Text Encoding With Graph and Retrieval Augmentation for Misinformation Detection, Géraud Faye et.al., Paper: http://arxiv.org/abs/2602.11106
- 2025-11-26, TAGFN: A Text-Attributed Graph Dataset for Fake News Detection in the Age of LLMs, Kay Liu et.al., Paper: http://arxiv.org/abs/2511.21624
- 2026-02-17, TAC: Timestamped Audio Captioning, Sonal Kumar et.al., Paper: http://arxiv.org/abs/2602.15766
- 2026-02-12, T3D: Few-Step Diffusion Language Models via Trajectory Self-Distillation with Direct Discriminative Optimization, Tunyu Zhang et.al., Paper: http://arxiv.org/abs/2602.12262
- 2026-02-16, Symmetry in language statistics shapes the geometry of model representations, Dhruva Karkada et.al., Paper: http://arxiv.org/abs/2602.15029
- 2026-03-02, Symbol-Equivariant Recurrent Reasoning Models, Richard Freinschlag et.al., Paper: http://arxiv.org/abs/2603.02193
- 2025-12-05, SymPyBench: A Dynamic Benchmark for Scientific Reasoning with Executable Python Code, Shima Imani et.al., Paper: http://arxiv.org/abs/2512.05954
- 2026-02-05, SwimBird: Eliciting Switchable Reasoning Mode in Hybrid Autoregressive MLLMs, Jintao Tong et.al., Paper: http://arxiv.org/abs/2602.06040
- 2025-11-20, SurvAgent: Hierarchical CoT-Enhanced Case Banking and Dichotomy-Based Multi-Agent System for Multimodal Survival Prediction, Guolin Huang et.al., Paper: http://arxiv.org/abs/2511.16635
- 2026-03-10, Surgical Repair of Collapsed Attention Heads in ALiBi Transformers, Palmer Schallon et.al., Paper: http://arxiv.org/abs/2603.09616
- 2025-11-10, Surgical Agent Orchestration Platform for Voice-directed Patient Data Interaction, Hyeryun Park et.al., Paper: http://arxiv.org/abs/2511.07392
- 2026-03-17, Surg $Σ$ : A Spectrum of Large-Scale Multimodal Data and Foundation Models for Surgical Intelligence, Zhitao Zeng et.al., Paper: http://arxiv.org/abs/2603.16822
- 2026-01-23, Supporting Stakeholder Requirements Expression with LLM Revisions: An Empirical Evaluation, Michael Mircea et.al., Paper: http://arxiv.org/abs/2601.16699
- 2026-01-21, Supporting Humans in Evaluating AI Summaries of Legal Depositions, Naghmeh Farzi et.al., Paper: http://arxiv.org/abs/2601.15182
- 2025-12-10, Supervised learning pays attention, Erin Craig et.al., Paper: http://arxiv.org/abs/2512.09912
- 2026-03-14, Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models, Haitao Jiang et.al., Paper: http://arxiv.org/abs/2603.13985
- 2026-02-18, Supercharging Agenda Setting Research: The ParlaCAP Dataset of 28 European Parliaments and a Scalable Multilingual LLM-Based Classification, Taja Kuzman Pungeršek et.al., Paper: http://arxiv.org/abs/2602.16516
- 2025-12-12, Super Suffixes: Bypassing Text Generation Alignment and Guard Models Simultaneously, Andrew Adiletta et.al., Paper: http://arxiv.org/abs/2512.11783
- 2026-02-25, SumTablets: A Transliteration Dataset of Sumerian Tablets, Cole Simmons et.al., Paper: http://arxiv.org/abs/2602.22200
- 2026-03-25, SumRank: Aligning Summarization Models for Long-Document Listwise Reranking, Jincheng Feng et.al., Paper: http://arxiv.org/abs/2603.24204
- 2025-11-18, Subword Tokenization Strategies for Kurdish Word Embeddings, Ali Salehi et.al., Paper: http://arxiv.org/abs/2511.14696
- 2026-02-04, Subliminal Effects in Your Data: A General Mechanism via Log-Linearity, Ishaq Aden-Ali et.al., Paper: http://arxiv.org/abs/2602.04863
- 2026-01-20, Style Transfer as Bias Mitigation: Diffusion Models for Synthetic Mental Health Text for Arabic, Saad Mankarious et.al., Paper: http://arxiv.org/abs/2601.14124
- 2025-12-29, Style Amnesia: Investigating Speaking Style Degradation and Mitigation in Multi-Turn Spoken Language Models, Yu-Xiang Lin et.al., Paper: http://arxiv.org/abs/2512.23578
- 2025-12-15, StutterFuse: Mitigating Modality Collapse in Stuttering Detection with Jaccard-Weighted Metric Learning and Gated Fusion, Guransh Singh et.al., Paper: http://arxiv.org/abs/2512.13632
- 2026-01-28, Structured Semantic Information Helps Retrieve Better Examples for In-Context Learning in Few-Shot Relation Extraction, Aunabil Chakma et.al., Paper: http://arxiv.org/abs/2601.20803
- 2026-01-30, Structured Over Scale: Learning Spatial Reasoning from Educational Video, Bishoy Galoaa et.al., Paper: http://arxiv.org/abs/2601.23251
- 2026-03-29, Structured Observation Language for Efficient and Generalizable Vision-Language Navigation, Daojie Peng et.al., Paper: http://arxiv.org/abs/2603.27577
- 2026-01-22, Structured Hints for Sample-Efficient Lean Theorem Proving, Zachary Burton et.al., Paper: http://arxiv.org/abs/2601.16172
- 2025-12-04, Structured Document Translation via Format Reinforcement Learning, Haiyue Song et.al., Paper: http://arxiv.org/abs/2512.05100
- 2025-10-23, Structure-Conditional Minimum Bayes Risk Decoding, Bryan Eikema et.al., Paper: http://arxiv.org/abs/2510.20700
- 2025-12-19, Structure-Aware Antibody Design with Affinity-Optimized Inverse Folding, Xinyan Zhao et.al., Paper: http://arxiv.org/abs/2512.17815
- 2026-01-15, Structure and Diversity Aware Context Bubble Construction for Enterprise Retrieval Augmented Systems, Amir Khurshid et.al., Paper: http://arxiv.org/abs/2601.10681
- 2026-01-12, Structure First, Reason Next: Enhancing a Large Language Model using Knowledge Graph for Numerical Reasoning in Financial Documents, Aryan Mishra et.al., Paper: http://arxiv.org/abs/2601.07754
- 2026-02-23, StructXLIP: Enhancing Vision-language Models with Multimodal Structural Cues, Zanxi Ruan et.al., Paper: http://arxiv.org/abs/2602.20089
- 2025-10-21, Streamlining Acceptance Test Generation for Mobile Applications Through Large Language Models: An Industrial Case Study, Pedro Luís Fonseca et.al., Paper: http://arxiv.org/abs/2510.18861
- 2025-10-21, StreamingTOM: Streaming Token Compression for Efficient Video Understanding, Xueyi Chen et.al., Paper: http://arxiv.org/abs/2510.18269
- 2025-12-24, Streaming Video Instruction Tuning, Jiaer Xia et.al., Paper: http://arxiv.org/abs/2512.21334
- 2026-01-05, Streaming Hallucination Detection in Long Chain-of-Thought Reasoning, Haolang Lu et.al., Paper: http://arxiv.org/abs/2601.02170
- 2026-01-23, Strategies for Span Labeling with Large Language Models, Danil Semin et.al., Paper: http://arxiv.org/abs/2601.16946
- 2025-11-18, Strategic Innovation Management in the Age of Large Language Models Market Intelligence, Adaptive R&D, and Ethical Governance, Raha Aghaei et.al., Paper: http://arxiv.org/abs/2511.14709
- 2026-01-08, Stock Market Price Prediction using Neural Prophet with Deep Neural Network, Navin Chhibber et.al., Paper: http://arxiv.org/abs/2601.05202
- 2025-10-30, Stitch: Step-by-step LLM Guided Tutoring for Scratch, Yuan Si et.al., Paper: http://arxiv.org/abs/2510.26634
- 2026-02-11, SteuerLLM: Local specialized large language model for German tax law analysis, Sebastian Wind et.al., Paper: http://arxiv.org/abs/2602.11081
- 2025-12-17, Stepwise Think-Critique: A Unified Framework for Robust and Interpretable LLM Reasoning, Jiaqi Xu et.al., Paper: http://arxiv.org/abs/2512.15662
- 2026-03-10, Stepping VLMs onto the Court: Benchmarking Spatial Intelligence in Sports, Yuchen Yang et.al., Paper: http://arxiv.org/abs/2603.09896
- 2025-11-07, Steering Language Models with Weight Arithmetic, Constanza Fierro et.al., Paper: http://arxiv.org/abs/2511.05408
- 2026-02-13, Steerable Vision-Language-Action Policies for Embodied Reasoning and Hierarchical Control, William Chen et.al., Paper: http://arxiv.org/abs/2602.13193
- 2025-10-30, SteerVLM: Robust Model Control through Lightweight Activation Steering for Vision Language Models, Anushka Sivakumar et.al., Paper: http://arxiv.org/abs/2510.26769
- 2025-10-21, StarBench: A Turn-Based RPG Benchmark for Agentic Multimodal Decision-Making and Information Seeking, Haoran Zhang et.al., Paper: http://arxiv.org/abs/2510.18483
- 2026-01-23, Standardizing Longitudinal Radiology Report Evaluation via Large Language Model Annotation, Xinyi Wang et.al., Paper: http://arxiv.org/abs/2601.16753
- 2026-01-09, StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management, Ruizhe Zhang et.al., Paper: http://arxiv.org/abs/2601.05890
- 2025-12-03, Stable Signer: Hierarchical Sign Language Generative Model, Sen Fang et.al., Paper: http://arxiv.org/abs/2512.04048
- 2026-02-19, Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs, Luke Huang et.al., Paper: http://arxiv.org/abs/2602.17616
- 2025-12-16, Spoken DialogSum: An Emotion-Rich Conversational Dataset for Spoken Dialogue Summarization, Yen-Ju Lu et.al., Paper: http://arxiv.org/abs/2512.14687
- 2025-12-24, SpidR-Adapt: A Universal Speech Representation Model for Few-Shot Adaptation, Mahi Luthra et.al., Paper: http://arxiv.org/abs/2512.21204
- 2026-03-10, Speech-Omni-Lite: Portable Speech Interfaces for Vision-Language Models, Dehua Tao et.al., Paper: http://arxiv.org/abs/2603.09627
- 2025-12-05, Speech World Model: Causal State-Action Planning with Explicit Reasoning for Speech, Xuanru Zhou et.al., Paper: http://arxiv.org/abs/2512.05933
- 2025-12-12, Speculative Decoding Speed-of-Light: Optimal Lower Bounds via Branching Random Walks, Sergey Pankratov et.al., Paper: http://arxiv.org/abs/2512.11718
- 2026-03-17, SpecMoE: Spectral Mixture-of-Experts Foundation Model for Cross-Species EEG Decoding, D. Darankoum et.al., Paper: http://arxiv.org/abs/2603.16739
- 2026-03-24, SpecEyes: Accelerating Agentic Multimodal LLMs via Speculative Perception and Planning, Haoyu Huang et.al., Paper: http://arxiv.org/abs/2603.23483
- 2025-10-31, SpecAttn: Speculating Sparse Attention, Harsh Shah et.al., Paper: http://arxiv.org/abs/2510.27641
- 2026-01-07, SpeakerSleuth: Evaluating Large Audio-Language Models as Judges for Multi-turn Speaker Consistency, Jonggeun Lee et.al., Paper: http://arxiv.org/abs/2601.04029
- 2026-03-11, Speaker Verification with Speech-Aware LLMs: Evaluation and Augmentation, Thomas Thebaud et.al., Paper: http://arxiv.org/abs/2603.10827
- 2026-03-06, Speak in Context: Multilingual ASR with Speech Context Alignment via Contrastive Learning, Yuchen Zhang et.al., Paper: http://arxiv.org/abs/2603.06505
- 2025-11-10, SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards, Hunar Batra et.al., Paper: http://arxiv.org/abs/2511.07403
- 2026-03-23, SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation, Sashuai Zhou et.al., Paper: http://arxiv.org/abs/2603.22228
- 2025-12-08, SpatialDreamer: Incentivizing Spatial Reasoning via Active Mental Imagery, Meng Cao et.al., Paper: http://arxiv.org/abs/2512.07733
- 2025-10-31, Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning, Yuhong Liu et.al., Paper: http://arxiv.org/abs/2510.27606
- 2025-12-11, SparseSwaps: Tractable LLM Pruning Mask Refinement at Scale, Max Zimmer et.al., Paper: http://arxiv.org/abs/2512.10922
- 2025-11-18, SparseST: Exploiting Data Sparsity in Spatiotemporal Modeling and Prediction, Junfeng Wu et.al., Paper: http://arxiv.org/abs/2511.14753
- 2025-11-25, Sparse-to-Field Reconstruction via Stochastic Neural Dynamic Mode Decomposition, Yujin Kim et.al., Paper: http://arxiv.org/abs/2511.20612
- 2025-11-21, Sparse Mixture-of-Experts for Multi-Channel Imaging: Are All Channel Interactions Required?, Sukwon Yun et.al., Paper: http://arxiv.org/abs/2511.17400
- 2026-01-06, Sparse Knowledge Distillation: A Mathematical Framework for Probability-Domain Temperature Scaling and Multi-Stage Compression, Aaron R. Flouro et.al., Paper: http://arxiv.org/abs/2601.03195
- 2026-03-12, Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration, Priyanka Kargupta et.al., Paper: http://arxiv.org/abs/2603.12226
- 2026-02-24, SparkMe: Adaptive Semi-Structured Interviewing for Qualitative Insight Discovery, David Anugraha et.al., Paper: http://arxiv.org/abs/2602.21136
- 2025-12-03, SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL, Siyi Chen et.al., Paper: http://arxiv.org/abs/2512.04069
- 2026-02-24, Spa3R: Predictive Spatial Field Modeling for 3D Visual Reasoning, Haoyi Jiang et.al., Paper: http://arxiv.org/abs/2602.21186
- 2026-03-24, SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling, Yiqi Zhang et.al., Paper: http://arxiv.org/abs/2603.23414
- 2026-03-12, SommBench: Assessing Sommelier Expertise of Language Models, William Brach et.al., Paper: http://arxiv.org/abs/2603.12117
- 2026-01-28, SokoBench: Evaluating Long-Horizon Planning and Reasoning in Large Language Models, Sebastiano Monti et.al., Paper: http://arxiv.org/abs/2601.20856
- 2025-12-12, Softmax as Linear Attention in the Large-Prompt Regime: a Measure-based Perspective, Etienne Boursier et.al., Paper: http://arxiv.org/abs/2512.11784
- 2025-10-21, Socialized Learning and Emergent Behaviors in Multi-Agent Systems based on Multimodal Large Language Models, Sureyya Akin et.al., Paper: http://arxiv.org/abs/2510.18515
- 2026-03-17, SocialOmni: Benchmarking Audio-Visual Social Interactivity in Omni Models, Tianyu Xie et.al., Paper: http://arxiv.org/abs/2603.16859
- 2026-03-07, SoK: Agentic Retrieval-Augmented Generation (RAG): Taxonomy, Architectures, Evaluation, and Research Directions, Saroj Mishra et.al., Paper: http://arxiv.org/abs/2603.07379
- 2026-01-12, Smooth Operator: Smooth Verifiable Reward Activates Spatial Reasoning Ability of Vision-Language Model, Siwen Jiao et.al., Paper: http://arxiv.org/abs/2601.07695
- 2025-10-22, SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration, Xichen Zhang et.al., Paper: http://arxiv.org/abs/2510.19767
- 2025-10-23, Small Drafts, Big Verdict: Information-Intensive Visual Reasoning via Speculation, Yuhan Liu et.al., Paper: http://arxiv.org/abs/2510.20812
- 2026-03-26, SlotVTG: Object-Centric Adapter for Generalizable Video Temporal Grounding, Jiwook Han et.al., Paper: http://arxiv.org/abs/2603.25733
- 2026-03-13, SldprtNet: A Large-Scale Multimodal Dataset for CAD Generation in Language-Driven 3D Design, Ruogu Li et.al., Paper: http://arxiv.org/abs/2603.13098
- 2025-12-17, Skyra: AI-Generated Video Detection via Grounded Artifact Reasoning, Yifei Li et.al., Paper: http://arxiv.org/abs/2512.15693
- 2025-12-15, SkipCat: Rank-Maximized Low-Rank Compression of Large Language Models via Shared Projection and Block Skipping, Yu-Chen Lu et.al., Paper: http://arxiv.org/abs/2512.13494
- 2026-03-22, SkinCLIP-VL: Consistency-Aware Vision-Language Learning for Multimodal Skin Cancer Diagnosis, Zhixiang Lu et.al., Paper: http://arxiv.org/abs/2603.21010
- 2026-03-22, SkillProbe: Security Auditing for Emerging Agent Skill Marketplaces via Multi-Agent Collaboration, Zihan Guo et.al., Paper: http://arxiv.org/abs/2603.21019
- 2025-12-03, SkillFactory: Self-Distillation For Learning Cognitive Behaviors, Zayne Sprague et.al., Paper: http://arxiv.org/abs/2512.04072
- 2026-02-19, Sink-Aware Pruning for Diffusion Language Models, Aidar Myrzakhan et.al., Paper: http://arxiv.org/abs/2602.17664
- 2026-01-15, Single-Stage Huffman Encoder for ML Compression, Aditya Agrawal et.al., Paper: http://arxiv.org/abs/2601.10673
- 2026-01-07, Simulated Students in Tutoring Dialogues: Substance or Illusion?, Alexander Scarlatos et.al., Paper: http://arxiv.org/abs/2601.04025
- 2026-01-08, SimuAgent: An LLM-Based Simulink Modeling Assistant Enhanced with Reinforcement Learning, Yanchang Liang et.al., Paper: http://arxiv.org/abs/2601.05187
- 2026-02-20, Simplifying Outcomes of Language Model Component Analyses with ELIA, Aaron Louis Eidt et.al., Paper: http://arxiv.org/abs/2602.18262
- 2025-12-09, SimpleDevQA: Benchmarking Large Language Models on Development Knowledge QA, Jing Zhang et.al., Paper: http://arxiv.org/abs/2512.08867
- 2025-10-21, Simple and Efficient Heterogeneous Temporal Graph Neural Network, Yili Wang et.al., Paper: http://arxiv.org/abs/2510.18467
- 2025-10-23, Simple Context Compression: Mean-Pooling and Multi-Ratio Training, Yair Feldman et.al., Paper: http://arxiv.org/abs/2510.20797
- 2026-03-24, Similarity-Aware Mixture-of-Experts for Data-Efficient Continual Learning, Connor Mclaughlin et.al., Paper: http://arxiv.org/abs/2603.23436
- 2025-12-03, SimFlow: Simplified and End-to-End Training of Latent Normalizing Flows, Qinyu Zhao et.al., Paper: http://arxiv.org/abs/2512.04084
- 2026-01-02, Sigmoid Head for Quality Estimation under Language Ambiguity, Tu Anh Dinh et.al., Paper: http://arxiv.org/abs/2601.00680
- 2026-01-30, ShotFinder: Imagination-Driven Open-Domain Video Shot Retrieval via Web Search, Tao Yu et.al., Paper: http://arxiv.org/abs/2601.23232
- 2026-03-18, Shot-Aware Frame Sampling for Video Understanding, Mengyu Zhao et.al., Paper: http://arxiv.org/abs/2603.17374
- 2026-01-14, ShortCoder: Knowledge-Augmented Syntax Optimization for Token-Efficient Code Generation, Sicong Liu et.al., Paper: http://arxiv.org/abs/2601.09703
- 2025-12-19, ShareChat: A Dataset of Chatbot Conversations in the Wild, Yueru Yan et.al., Paper: http://arxiv.org/abs/2512.17843
- 2026-01-16, ShapeR: Robust Conditional 3D Shape Generation from Casual Captures, Yawar Siddiqui et.al., Paper: http://arxiv.org/abs/2601.11514
- 2026-03-31, ShapE-GRPO: Shapley-Enhanced Reward Allocation for Multi-Candidate LLM Training, Rui Ai et.al., Paper: http://arxiv.org/abs/2603.29871
- 2025-10-21, ShaRE your Data! Characterizing Datasets for LLM-based Requirements Engineering, Quim Motger et.al., Paper: http://arxiv.org/abs/2510.18787
- 2026-03-12, Separable neural architectures as a primitive for unified predictive and generative intelligence, Reza T. Batley et.al., Paper: http://arxiv.org/abs/2603.12244
- 2025-12-26, Semiparametric Preference Optimization: Your Language Model is Secretly a Single-Index Model, Nathan Kallus et.al., Paper: http://arxiv.org/abs/2512.21917
- 2025-10-21, SemiAdapt and SemiLoRA: Efficient Domain Adaptation for Transformer-based Low-Resource Language Translation with a Case Study on Irish, Josh McGiff et.al., Paper: http://arxiv.org/abs/2510.18725
- 2025-11-07, Semantic-Guided Natural Language and Visual Fusion for Cross-Modal Interaction Based on Tiny Object Detection, Xian-Hong Huang et.al., Paper: http://arxiv.org/abs/2511.05474
- 2025-11-21, Semantic and Semiotic Interplays in Text-to-Audio AI: Exploring Cognitive Dynamics and Musical Interactions, Guilherme Coelho et.al., Paper: http://arxiv.org/abs/2511.17429
- 2025-10-22, Semantic World Models, Jacob Berg et.al., Paper: http://arxiv.org/abs/2510.19818
- 2026-03-20, Semantic Token Clustering for Efficient Uncertainty Quantification in Large Language Models, Qi Cao et.al., Paper: http://arxiv.org/abs/2603.20161
- 2025-12-04, Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning, Purbesh Mitra et.al., Paper: http://arxiv.org/abs/2512.05105
- 2026-03-11, Semantic Satellite Communications for Synchronized Audiovisual Reconstruction, Fangyu Liu et.al., Paper: http://arxiv.org/abs/2603.10791
- 2026-03-13, Semantic Invariance in Agentic AI, I. de Zarzà et.al., Paper: http://arxiv.org/abs/2603.13173
- 2026-04-02, Semantic Evolution over Populations for LLM-Guided Automated Program Repair, Cuong Chi Le et.al., Paper: http://arxiv.org/abs/2604.02134
- 2026-02-13, Semantic Chunking and the Entropy of Natural Language, Weishun Zhong et.al., Paper: http://arxiv.org/abs/2602.13194
- 2026-03-25, Semantic Alignment across Ancient Egyptian Language Stages via Normalization-Aware Multitask Learning, He Huang et.al., Paper: http://arxiv.org/abs/2603.24258
- 2026-03-14, SemEval-2026 Task 6: CLARITY – Unmasking Political Question Evasions, Konstantinos Thomas et.al., Paper: http://arxiv.org/abs/2603.14027
- 2026-01-14, Self-Supervised Animal Identification for Long Videos, Xuyang Fang et.al., Paper: http://arxiv.org/abs/2601.09663
- 2026-03-26, Self-Improvement of Large Language Models: A Technical Overview and Future Outlook, Haoyan Yang et.al., Paper: http://arxiv.org/abs/2603.25681
- 2025-11-10, Self-Evaluating LLMs for Multi-Step Tasks: Stepwise Confidence Estimation for Failure Detection, Vaibhav Mavi et.al., Paper: http://arxiv.org/abs/2511.07364
- 2026-01-26, Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models, Siyan Zhao et.al., Paper: http://arxiv.org/abs/2601.18734
- 2026-01-27, Self-Distillation Enables Continual Learning, Idan Shenfeld et.al., Paper: http://arxiv.org/abs/2601.19897
- 2025-11-21, Selective Rotary Position Embedding, Sajad Movahedi et.al., Paper: http://arxiv.org/abs/2511.17388
- 2025-11-10, Selecting Auxiliary Data via Neural Tangent Kernels for Low-Resource Domains, Pingjie Wang et.al., Paper: http://arxiv.org/abs/2511.07380
- 2025-10-21, SegTune: Structured and Fine-Grained Control for Song Generation, Pengfei Cai et.al., Paper: http://arxiv.org/abs/2510.18416
- 2025-10-21, Seg the HAB: Language-Guided Geospatial Algae Bloom Reasoning and Segmentation, Patterson Hsieh et.al., Paper: http://arxiv.org/abs/2510.18751
- 2026-03-26, Seeing to Ground: Visual Attention for Hallucination-Resilient MDLLMs, Vishal Narnaware et.al., Paper: http://arxiv.org/abs/2603.25711
- 2026-03-07, Seeing the Reasoning: How LLM Rationales Influence User Trust and Decision-Making in Factual Verification Tasks, Xin Sun et.al., Paper: http://arxiv.org/abs/2603.07306
- 2026-03-23, Seeing is Improving: Visual Feedback for Iterative Text Layout Refinement, Junrong Guo et.al., Paper: http://arxiv.org/abs/2603.22187
- 2026-02-24, Seeing Through Words: Controlling Visual Retrieval Quality with Language Models, Jianglin Lu et.al., Paper: http://arxiv.org/abs/2602.21175
- 2026-02-06, Seeing Beyond Redundancy: Task Complexity’s Role in Vision Token Specialization in VLLMs, Darryl Hannan et.al., Paper: http://arxiv.org/abs/2602.06914
- 2025-10-21, See the Text: From Tokenization to Visual Reading, Ling Xing et.al., Paper: http://arxiv.org/abs/2510.18840
- 2025-12-26, See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning, Shuoshuo Zhang et.al., Paper: http://arxiv.org/abs/2512.22120
- 2026-01-12, SecureCAI: Injection-Resilient LLM Assistants for Cybersecurity Operations, Mohammed Himayath Ali et.al., Paper: http://arxiv.org/abs/2601.07835
- 2026-03-09, SecAgent: Efficient Mobile GUI Agent with Semantic Context, Yiping Xie et.al., Paper: http://arxiv.org/abs/2603.08533
- 2025-12-01, Script: Graph-Structured and Query-Conditioned Semantic Token Pruning for Multimodal Large Language Models, Zhongyu Yang et.al., Paper: http://arxiv.org/abs/2512.01949
- 2026-04-01, Screening Is Enough, Ken M. Nakanishi et.al., Paper: http://arxiv.org/abs/2604.01178
- 2026-03-29, Sci-Mind: Cognitively-Inspired Adversarial Debate for Autonomous Mathematical Modeling, Ruiying Sun et.al., Paper: http://arxiv.org/abs/2603.27584
- 2026-02-12, Sci-CoE: Co-evolving Scientific Reasoning LLMs via Geometric Consensus with Sparse Supervision, Xiaohan He et.al., Paper: http://arxiv.org/abs/2602.12164
- 2026-03-12, SceneAssistant: A Visual Feedback Agent for Open-Vocabulary 3D Scene Generation, Jun Luo et.al., Paper: http://arxiv.org/abs/2603.12238
- 2026-01-07, Scanner-Induced Domain Shifts Undermine the Robustness of Pathology Foundation Models, Erik Thiringer et.al., Paper: http://arxiv.org/abs/2601.04163
- 2026-02-24, Scaling Vision Transformers: Evaluating DeepSpeed for Image-Centric Workloads, Huy Trinh et.al., Paper: http://arxiv.org/abs/2602.21081
- 2026-03-31, Scaling Video Pretraining for Surgical Foundation Models, Sicheng Lu et.al., Paper: http://arxiv.org/abs/2603.29966
- 2026-02-12, Scaling Verification Can Be More Effective than Scaling Policy Learning for Vision-Language-Action Alignment, Jacky Kwok et.al., Paper: http://arxiv.org/abs/2602.12281
- 2026-02-24, Scaling State-Space Models on Multiple GPUs with Tensor Parallelism, Anurag Dutt et.al., Paper: http://arxiv.org/abs/2602.21144
- 2025-11-17, Scaling Spatial Intelligence with Multimodal Foundation Models, Zhongang Cai et.al., Paper: http://arxiv.org/abs/2511.13719
- 2026-03-25, Scaling Recurrence-aware Foundation Models for Clinical Records via Next-Visit Prediction, Haresh Rengaraj Rajamohan et.al., Paper: http://arxiv.org/abs/2603.24562
- 2025-12-31, Scaling Open-Ended Reasoning to Predict the Future, Nikhil Chandak et.al., Paper: http://arxiv.org/abs/2512.25070
- 2026-02-18, Scaling Open Discrete Audio Foundation Models with Interleaved Semantic, Acoustic, and Text Tokens, Potsawee Manakul et.al., Paper: http://arxiv.org/abs/2602.16687
- 2026-03-07, Scaling Laws in the Tiny Regime: How Small Models Change Their Mistakes, Mohammed Alnemari et.al., Paper: http://arxiv.org/abs/2603.07365
- 2025-12-24, Scaling Laws for Economic Productivity: Experimental Evidence in LLM-Assisted Consulting, Data Analyst, and Management Tasks, Ali Merali et.al., Paper: http://arxiv.org/abs/2512.21316
- 2025-10-29, Scaling Latent Reasoning via Looped Language Models, Rui-Jie Zhu et.al., Paper: http://arxiv.org/abs/2510.25741
- 2025-11-28, Scaling HuBERT for African Languages: From Base to Large and XL, Antoine Caubrière et.al., Paper: http://arxiv.org/abs/2511.23370
- 2026-03-23, Scaling DoRA: High-Rank Adaptation via Factored Norms and Fused Kernels, Alexandra Zelenin et.al., Paper: http://arxiv.org/abs/2603.22276
- 2026-02-16, Scaling Beyond Masked Diffusion Language Models, Subham Sekhar Sahoo et.al., Paper: http://arxiv.org/abs/2602.15014
- 2025-12-11, Scaling Behavior of Discrete Diffusion Language Models, Dimitri von Rütte et.al., Paper: http://arxiv.org/abs/2512.10858
- 2026-02-26, Scale Can’t Overcome Pragmatics: The Impact of Reporting Bias on Vision-Language Reasoning, Amita Kamath et.al., Paper: http://arxiv.org/abs/2602.23351
- 2025-12-22, Scalably Enhancing the Clinical Validity of a Task Benchmark with Physician Oversight, Junze Ye et.al., Paper: http://arxiv.org/abs/2512.19691
- 2025-11-14, Scalable Policy Evaluation with Video World Models, Wei-Cheng Tseng et.al., Paper: http://arxiv.org/abs/2511.11520
- 2025-11-24, Scalable Parameter-Light Spectral Method for Clustering Short Text Embeddings with a Cohesion-Based Evaluation Metric, Nikita Neveditsin et.al., Paper: http://arxiv.org/abs/2511.19350
- 2026-03-04, Scalable Evaluation of the Realism of Synthetic Environmental Augmentations in Images, Damian J. Ruck et.al., Paper: http://arxiv.org/abs/2603.04325
- 2026-02-09, Scalable Delphi: Large Language Models for Structured Risk Estimation, Tobias Lorenz et.al., Paper: http://arxiv.org/abs/2602.08889
- 2025-10-22, Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning, Xichen Zhang et.al., Paper: http://arxiv.org/abs/2510.19807
- 2025-12-12, Say it or AI it: Evaluating Hands-Free Text Correction in Virtual Reality, Ziming Li et.al., Paper: http://arxiv.org/abs/2512.11564
- 2026-01-22, Sawtooth Wavefront Reordering: Enhanced CuTile FlashAttention on NVIDIA GB10, Yifan Zhu et.al., Paper: http://arxiv.org/abs/2601.16032
- 2025-12-29, Same or Not? Enhancing Visual Perception in Vision-Language Models, Damiano Marsili et.al., Paper: http://arxiv.org/abs/2512.23592
- 2025-12-09, Same Content, Different Answers: Cross-Modal Inconsistency in MLLMs, Angela van Sprang et.al., Paper: http://arxiv.org/abs/2512.08923
- 2026-02-18, Saliency-Aware Multi-Route Thinking: Revisiting Vision-Language Reasoning, Mingjia Shi et.al., Paper: http://arxiv.org/abs/2602.16702
- 2026-02-11, Safety Recovery in Reasoning Models Is Only a Few Early Steering Steps Away, Soumya Suvra Ghosal et.al., Paper: http://arxiv.org/abs/2602.11096
- 2026-03-18, SafeTutors: Benchmarking Pedagogical Safety in AI Tutoring Systems, Rima Hazra et.al., Paper: http://arxiv.org/abs/2603.17373
- 2026-02-12, SafeNeuron: Neuron-Level Safety Alignment for Large Language Models, Zhaoxin Wang et.al., Paper: http://arxiv.org/abs/2602.12158
- 2026-02-27, SafeGen-LLM: Enhancing Safety Generalization in Task Planning for Robotic Systems, Jialiang Fan et.al., Paper: http://arxiv.org/abs/2602.24235
- 2026-03-03, Saarthi for AGI: Towards Domain-Specific General Intelligence for Formal Verification, Aman Kumar et.al., Paper: http://arxiv.org/abs/2603.03175
- 2026-01-29, SWE-Replay: Efficient Test-Time Scaling for Software Engineering Agents, Yifeng Ding et.al., Paper: http://arxiv.org/abs/2601.22129
- 2026-02-03, SWE-Refactor: A Repository-Level Benchmark for Real-World LLM-Based Code Refactoring, Yisen Xu et.al., Paper: http://arxiv.org/abs/2602.03712
- 2025-12-26, SWE-RM: Execution-free Feedback For Software Engineering Agents, KaShun Shum et.al., Paper: http://arxiv.org/abs/2512.21919
- 2026-02-25, SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents, Patrick Tser Jern Kon et.al., Paper: http://arxiv.org/abs/2602.22124
- 2025-11-07, SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models, Jingxuan Xu et.al., Paper: http://arxiv.org/abs/2511.05459
- 2026-01-15, SVII-3D: Advancing Roadside Infrastructure Inventory with Decimeter-level 3D Localization and Comprehension from Sparse Street Imagery, Chong Liu et.al., Paper: http://arxiv.org/abs/2601.10535
- 2026-03-14, SVD Contextual Sparsity Predictors for Fast LLM Inference, Georgii Serbin et.al., Paper: http://arxiv.org/abs/2603.14110
- 2026-03-06, SUREON: A Benchmark and Vision-Language-Model for Surgical Reasoning, Alejandra Perez et.al., Paper: http://arxiv.org/abs/2603.06570
- 2026-03-29, STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding, Junho Kim et.al., Paper: http://arxiv.org/abs/2603.27593
- 2026-02-26, STELLAR: Storage Tuning Engine Leveraging LLM Autonomous Reasoning for High Performance Parallel File Systems, Chris Egersdoerfer et.al., Paper: http://arxiv.org/abs/2602.23220
- 2025-12-04, STARE-VLA: Progressive Stage-Aware Reinforcement for Fine-Tuning Vision-Language-Action Models, Feng Xu et.al., Paper: http://arxiv.org/abs/2512.05107
- 2025-10-28, STAR-Bench: Probing Deep Spatio-Temporal Reasoning as Audio 4D Intelligence, Zihan Liu et.al., Paper: http://arxiv.org/abs/2510.24693
- 2026-02-10, ST4VLA: Spatially Guided Training for Vision-Language-Action Models, Jinhui Ye et.al., Paper: http://arxiv.org/abs/2602.10109
- 2025-11-13, SSR: Socratic Self-Refine for Large Language Model Reasoning, Haizhou Shi et.al., Paper: http://arxiv.org/abs/2511.10621
- 2026-03-04, SSR: A Generic Framework for Text-Aided Map Compression for Localization, Mohammad Omama et.al., Paper: http://arxiv.org/abs/2603.04272
- 2025-10-21, SSD: Spatial-Semantic Head Decoupling for Efficient Autoregressive Image Generation, Siyong Jian et.al., Paper: http://arxiv.org/abs/2510.18716
- 2025-11-19, SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models, Senyu Fei et.al., Paper: http://arxiv.org/abs/2511.15605
- 2026-02-20, SPQ: An Ensemble Technique for Large Language Model Compression, Jiamin Yao et.al., Paper: http://arxiv.org/abs/2602.18420
- 2025-11-10, SPOT: An Annotated French Corpus and Benchmark for Detecting Critical Interventions in Online Conversations, Manon Berriche et.al., Paper: http://arxiv.org/abs/2511.07405
- 2025-11-21, SPEAR-1: Scaling Beyond Robot Demonstrations via 3D Understanding, Nikolay Nikolov et.al., Paper: http://arxiv.org/abs/2511.17411
- 2026-02-02, SPARKLING: Balancing Signal Preservation and Symmetry Breaking for Width-Progressive Learning, Qifan Yu et.al., Paper: http://arxiv.org/abs/2602.02472
- 2026-02-18, SPARC: Scenario Planning and Reasoning for Automated C Unit Test Generation, Jaid Monwar Chowdhury et.al., Paper: http://arxiv.org/abs/2602.16671
- 2026-03-23, SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection, Kexian Tang et.al., Paper: http://arxiv.org/abs/2603.22213
- 2026-02-26, SOTAlign: Semi-Supervised Alignment of Unimodal Vision and Language Models via Optimal Transport, Simon Roschmann et.al., Paper: http://arxiv.org/abs/2602.23353
- 2026-03-17, SOMP: Scalable Gradient Inversion for Large Language Models via Subspace-Guided Orthogonal Matching Pursuit, Yibo Li et.al., Paper: http://arxiv.org/abs/2603.16761
- 2025-11-05, SOLVE-Med: Specialized Orchestration for Leading Vertical Experts across Medical Specialties, Roberta Di Marino et.al., Paper: http://arxiv.org/abs/2511.03542
- 2026-03-31, SNEAK: Evaluating Strategic Communication and Information Leakage in Large Language Models, Adar Avsian et.al., Paper: http://arxiv.org/abs/2603.29846
- 2026-03-24, SMSP: A Plug-and-Play Strategy of Multi-Scale Perception for MLLMs to Perceive Visual Illusions, Jinzhe Tu et.al., Paper: http://arxiv.org/abs/2603.23118
- 2025-11-18, SMRC: Aligning Large Language Models with Student Reasoning for Mathematical Error Correction, Biaojie Zeng et.al., Paper: http://arxiv.org/abs/2511.14684
- 2025-11-21, SMILE: A Composite Lexical-Semantic Metric for Question-Answering Evaluation, Shrikant Kendre et.al., Paper: http://arxiv.org/abs/2511.17432
- 2025-12-24, SMART SLM: Structured Memory and Reasoning Transformer, A Small Language Model for Accurate Document Assistance, Divij Dudeja et.al., Paper: http://arxiv.org/abs/2512.21280
- 2025-11-24, SLMFix: Leveraging Small Language Models for Error Fixing with Reinforcement Learning, David Jiahao Fu et.al., Paper: http://arxiv.org/abs/2511.19422
- 2026-01-06, SLIM: Stealthy Low-Coverage Black-Box Watermarking via Latent-Space Confusion Zones, Hengyu Wu et.al., Paper: http://arxiv.org/abs/2601.03242
- 2025-10-21, SLICE: SLO-Driven Scheduling for LLM Inference on Edge Computing Devices, Pan Zhou et.al., Paper: http://arxiv.org/abs/2510.18544
- 2026-03-31, SISA: A Scale-In Systolic Array for GEMM Acceleration, Luigi Altamura et.al., Paper: http://arxiv.org/abs/2603.29913
- 2026-01-29, SINA: A Circuit Schematic Image-to-Netlist Generator Using Artificial Intelligence, Saoud Aldowaish et.al., Paper: http://arxiv.org/abs/2601.22114
- 2025-11-06, SIMS-V: Simulated Instruction-Tuning for Spatial Video Understanding, Ellis Brown et.al., Paper: http://arxiv.org/abs/2511.04668
- 2025-12-05, SIMPACT: Simulation-Enabled Action Planning using Vision-Language Models, Haowen Liu et.al., Paper: http://arxiv.org/abs/2512.05955
- 2025-12-01, SGDiff: Scene Graph Guided Diffusion Model for Image Collaborative SegCaptioning, Xu Zhang et.al., Paper: http://arxiv.org/abs/2512.01975
- 2026-04-01, SERSEM: Selective Entropy-Weighted Scoring for Membership Inference in Code Language Models, Kıvanç Kuzey Dikici et.al., Paper: http://arxiv.org/abs/2604.01147
- 2026-02-06, SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks, Mingqian Feng et.al., Paper: http://arxiv.org/abs/2602.06854
- 2026-02-24, SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards, Dengjia Zhang et.al., Paper: http://arxiv.org/abs/2602.21158
- 2026-03-19, SEAR: Simple and Efficient Adaptation of Visual Geometric Transformers for RGB+Thermal 3D Reconstruction, Vsevolod Skorokhodov et.al., Paper: http://arxiv.org/abs/2603.18774
- 2026-02-10, SCORE: Specificity, Context Utilization, Robustness, and Relevance for Reference-Free LLM Evaluation, Homaira Huda Shomee et.al., Paper: http://arxiv.org/abs/2602.10017
- 2026-02-13, SCOPE: Selective Conformal Optimized Pairwise LLM Judging, Sher Badshah et.al., Paper: http://arxiv.org/abs/2602.13110
- 2025-12-10, SCOPE: Language Models as One-Time Teacher for Hierarchical Planning in Text Environments, Haoye Lu et.al., Paper: http://arxiv.org/abs/2512.09897
- 2026-03-10, SCENEBench: An Audio Understanding Benchmark Grounded in Assistive and Industrial Use Cases, Laya Iyer et.al., Paper: http://arxiv.org/abs/2603.09853
- 2025-12-05, SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representations, Wenhao Yan et.al., Paper: http://arxiv.org/abs/2512.05905
- 2026-03-09, SCAFFOLD-CEGIS: Preventing Latent Security Degradation in LLM-Driven Iterative Code Refinement, Yi Chen et.al., Paper: http://arxiv.org/abs/2603.08520
- 2025-10-24, SBASH: a Framework for Designing and Evaluating RAG vs. Prompt-Tuned LLM Honeypots, Adetayo Adebimpe et.al., Paper: http://arxiv.org/abs/2510.21459
- 2025-12-08, SAVE: Sparse Autoencoder-Driven Visual Information Enhancement for Mitigating Object Hallucination, Sangha Park et.al., Paper: http://arxiv.org/abs/2512.07730
- 2025-12-09, SATGround: A Spatially-Aware Approach for Visual Grounding in Remote Sensing, Aysim Toker et.al., Paper: http://arxiv.org/abs/2512.08881
- 2025-11-20, SAM 3D: 3Dfy Anything in Images, SAM 3D Team et.al., Paper: http://arxiv.org/abs/2511.16624
- 2026-03-20, SAGE: Sustainable Agent-Guided Expert-tuning for Culturally Attuned Translation in Low-Resource Southeast Asia, Zhixiang Lu et.al., Paper: http://arxiv.org/abs/2603.19931
- 2025-11-06, SAFe-Copilot: Unified Shared Autonomy Framework, Phat Nguyen et.al., Paper: http://arxiv.org/abs/2511.04664
- 2026-03-26, S2D2: Fast Decoding for Diffusion LLMs via Training-Free Self-Speculation, Ligong Han et.al., Paper: http://arxiv.org/abs/2603.25702
- 2026-04-01, S0 Tuning: Zero-Overhead Adaptation of Hybrid Recurrent-Attention Models, Jack Young et.al., Paper: http://arxiv.org/abs/2604.01168
- 2025-11-21, RynnVLA-002: A Unified Vision-Language-Action and World Model, Jun Cen et.al., Paper: http://arxiv.org/abs/2511.17502
- 2025-12-29, RxnBench: A Multimodal Benchmark for Evaluating Large Language Models on Chemical Reaction Understanding from Scientific Literature, Hanzheng Li et.al., Paper: http://arxiv.org/abs/2512.23565
- 2025-11-25, RubricRL: Simple Generalizable Rewards for Text-to-Image Generation, Xuelu Feng et.al., Paper: http://arxiv.org/abs/2511.20651
- 2026-01-14, Routing with Generated Data: Annotation-Free LLM Skill Estimation and Expert Selection, Tianyi Niu et.al., Paper: http://arxiv.org/abs/2601.09692
- 2025-10-28, Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance, Yujie Wei et.al., Paper: http://arxiv.org/abs/2510.24711
- 2025-11-10, Routing Manifold Alignment Improves Generalization of Mixture-of-Experts LLMs, Zhongyang Li et.al., Paper: http://arxiv.org/abs/2511.07419
- 2026-01-09, Router-Suggest: Dynamic Routing for Multimodal Auto-Completion in Visually-Grounded Dialogs, Sandeep Mishra et.al., Paper: http://arxiv.org/abs/2601.05851
- 2026-03-23, RotorMap and Quantum Fingerprints of DNA Sequences via Rotary Position Embeddings, Danylo Yakymenko et.al., Paper: http://arxiv.org/abs/2603.22245
- 2026-03-17, Rotated Robustness: A Training-Free Defense against Bit-Flip Attacks on Large Language Models, Deng Liu et.al., Paper: http://arxiv.org/abs/2603.16382
- 2026-03-04, Robustness of Agentic AI Systems via Adversarially-Aligned Jacobian Regularization, Furkan Mumcu et.al., Paper: http://arxiv.org/abs/2603.04378
- 2026-03-24, Robust Safety Monitoring of Language Models via Activation Watermarking, Toluwani Aremu et.al., Paper: http://arxiv.org/abs/2603.23171
- 2026-01-08, Robust Reasoning as a Symmetry-Protected Topological Phase, Ilmo Sung et.al., Paper: http://arxiv.org/abs/2601.05240
- 2026-01-05, Robust Persona-Aware Toxicity Detection with Prompt Optimization and Learned Ensembling, Berk Atil et.al., Paper: http://arxiv.org/abs/2601.02337
- 2026-01-21, Robust Fake News Detection using Large Language Models under Adversarial Sentiment Attacks, Sahar Tahmasebi et.al., Paper: http://arxiv.org/abs/2601.15277
- 2025-10-27, RobotArena $\infty$ : Scalable Robot Benchmarking via Real-to-Sim Translation, Yash Jangir et.al., Paper: http://arxiv.org/abs/2510.23571
- 2025-12-15, RoboTracer: Mastering Spatial Trace with Reasoning in Vision-Language Models for Robotics, Enshen Zhou et.al., Paper: http://arxiv.org/abs/2512.13660
- 2026-03-13, RoboStream: Weaving Spatio-Temporal Reasoning with Memory in Vision-Language Models for Robotics, Yuzhi Huang et.al., Paper: http://arxiv.org/abs/2603.12939
- 2025-12-24, RoboSafe: Safeguarding Embodied Agents via Executable Safety Logic, Le Wang et.al., Paper: http://arxiv.org/abs/2512.21220
- 2026-01-02, RoboReward: General-Purpose Vision-Language Reward Models for Robotics, Tony Lee et.al., Paper: http://arxiv.org/abs/2601.00675
- 2026-02-10, RoboInter: A Holistic Intermediate Representation Suite Towards Robotic Manipulation, Hao Li et.al., Paper: http://arxiv.org/abs/2602.09973
- 2025-11-21, RoboCOIN: An Open-Sourced Bimanual Robotic Data COllection for INtegrated Manipulation, Shihan Wu et.al., Paper: http://arxiv.org/abs/2511.17441
- 2026-03-11, Risk-Adjusted Harm Scoring for Automated Red Teaming for LLMs in Financial Services, Fabrizio Dimino et.al., Paper: http://arxiv.org/abs/2603.10807
- 2025-10-24, Risk Management for Mitigating Benchmark Failure Modes: BenchRisk, Sean McGregor et.al., Paper: http://arxiv.org/abs/2510.21460
- 2026-01-13, Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs, Zhiyuan Hu et.al., Paper: http://arxiv.org/abs/2601.08763
- 2026-03-19, RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models, Xiao Feng et.al., Paper: http://arxiv.org/abs/2603.18859
- 2026-02-02, Reward-free Alignment for Conflicting Objectives, Peter Chen et.al., Paper: http://arxiv.org/abs/2602.02495
- 2026-03-31, Reward-Based Online LLM Routing via NeuralUCB, Ming-Hua Tsai et.al., Paper: http://arxiv.org/abs/2603.30035
- 2026-01-28, Reward Models Inherit Value Biases from Pretraining, Brian Christian et.al., Paper: http://arxiv.org/abs/2601.20838
- 2025-12-09, Revisiting the Scaling Properties of Downstream Metrics in Large Language Model Training, Jakub Krajewski et.al., Paper: http://arxiv.org/abs/2512.08894
- 2026-03-23, Revisiting Quantum Code Generation: Where Should Domain Knowledge Live?, Oscar Novo et.al., Paper: http://arxiv.org/abs/2603.22184
- 2026-03-26, Revisiting On-Policy Distillation: Empirical Failure Modes and Simple Fixes, Yuqian Fu et.al., Paper: http://arxiv.org/abs/2603.25562
- 2026-01-27, Revisiting Incremental Stochastic Majorization-Minimization Algorithms with Applications to Mixture of Experts, TrungKhang Tran et.al., Paper: http://arxiv.org/abs/2601.19811
- 2025-11-26, Revisiting Generalization Across Difficulty Levels: It’s Not So Easy, Yeganeh Kordi et.al., Paper: http://arxiv.org/abs/2511.21692
- 2026-03-20, Revisiting Gene Ontology Knowledge Discovery with Hierarchical Feature Selection and Virtual Study Group of AI Agents, Cen Wan et.al., Paper: http://arxiv.org/abs/2603.20132
- 2025-11-24, Revisiting Feedback Models for HyDE, Nour Jedidi et.al., Paper: http://arxiv.org/abs/2511.19349
- 2025-10-22, Review of Tools for Zero-Code LLM Based Application Development, Priyaranjan Pattnayak et.al., Paper: http://arxiv.org/abs/2510.19747
- 2026-01-08, Reverse-engineering NLI: A study of the meta-inferential properties of Natural Language Inference, Rasmus Blanck et.al., Paper: http://arxiv.org/abs/2601.05170
- 2026-03-09, RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback, Xiaoying Zhang et.al., Paper: http://arxiv.org/abs/2603.08561
- 2025-11-10, Retriv at BLP-2025 Task 2: Test-Driven Feedback-Guided Framework for Bangla-to-Python Code Generation, K M Nafi Asib et.al., Paper: http://arxiv.org/abs/2511.07382
- 2026-03-17, Retrieving Counterfactuals Improves Visual In-Context Learning, Guangzhi Xiong et.al., Paper: http://arxiv.org/abs/2603.16737
- 2026-02-26, Retrieve and Segment: Are a Few Examples Enough to Bridge the Supervision Gap in Open-Vocabulary Segmentation?, Tilemachos Aravanis et.al., Paper: http://arxiv.org/abs/2602.23339
- 2026-02-18, Retrieval Augmented Generation of Literature-derived Polymer Knowledge: The Example of a Biodegradable Polymer Expert System, Sonakshi Gupta et.al., Paper: http://arxiv.org/abs/2602.16650
- 2026-02-19, RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward, Qiucheng Wu et.al., Paper: http://arxiv.org/abs/2602.17558
- 2026-02-04, Rethinking the Trust Region in LLM Reinforcement Learning, Penghui Qi et.al., Paper: http://arxiv.org/abs/2602.04879
- 2025-11-14, Rethinking Progression of Memory State in Robotic Manipulation: An Object-Centric Perspective, Nhat Chung et.al., Paper: http://arxiv.org/abs/2511.11478
- 2026-02-17, Rethinking Metrics for Lexical Semantic Change Detection, Roksana Goworek et.al., Paper: http://arxiv.org/abs/2602.15716
- 2025-10-21, Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting, Howard Chen et.al., Paper: http://arxiv.org/abs/2510.18874
- 2025-12-31, ResponseRank: Data-Efficient Reward Modeling through Preference Strength Learning, Timo Kaufmann et.al., Paper: http://arxiv.org/abs/2512.25023
- 2026-03-12, Resource-Efficient Iterative LLM-Based NAS with Feedback Memory, Xiaojie Gu et.al., Paper: http://arxiv.org/abs/2603.12091
- 2025-11-19, RescueLens: LLM-Powered Triage and Action on Volunteer Feedback for Food Rescue, Naveen Raman et.al., Paper: http://arxiv.org/abs/2511.15698
- 2026-03-22, ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models, Xu Li et.al., Paper: http://arxiv.org/abs/2603.21105
- 2026-03-24, ReqFusion: A Multi-Provider Framework for Automated PEGS Analysis Across Software Domains, Muhammad Khalid et.al., Paper: http://arxiv.org/abs/2603.23482
- 2026-02-02, Repurposing Protein Language Models for Latent Flow-Based Fitness Optimization, Amaru Caceres Arroyo et.al., Paper: http://arxiv.org/abs/2602.02425
- 2025-12-15, Reproducing and Dissecting Denoising Language Models for Speech Recognition, Dorian Koch et.al., Paper: http://arxiv.org/abs/2512.13576
- 2026-03-25, Representation Learning to Study Temporal Dynamics in Tutorial Scaffolding, Conrad Borchers et.al., Paper: http://arxiv.org/abs/2603.24535
- 2025-10-28, ReplicationBench: Can AI Agents Replicate Astrophysics Research Papers?, Christine Ye et.al., Paper: http://arxiv.org/abs/2510.24591
- 2026-03-26, RenoBench: A Citation Parsing Benchmark, Parth Sarin et.al., Paper: http://arxiv.org/abs/2603.25640
- 2026-04-02, Reliable Control-Point Selection for Steering Reasoning in Large Language Models, Haomin Zhuang et.al., Paper: http://arxiv.org/abs/2604.02113
- 2026-04-01, Reliability of Large Language Models for Design Synthesis: An Empirical Study of Variance, Prompt Sensitivity, and Method Scaffolding, Rabia Iftikhar et.al., Paper: http://arxiv.org/abs/2604.00851
- 2025-11-04, ReleaseEval: A Benchmark for Evaluating Language Models in Automated Release Note Generation, Qianru Meng et.al., Paper: http://arxiv.org/abs/2511.02713
- 2026-01-08, RelayLLM: Efficient Reasoning via Collaborative Decoding, Chengsong Huang et.al., Paper: http://arxiv.org/abs/2601.05167
- 2025-10-28, Relative Scaling Laws for LLMs, William Held et.al., Paper: http://arxiv.org/abs/2510.24626
- 2026-02-02, Relationship-Aware Hierarchical 3D Scene Graph for Task Reasoning, Albert Gassol Puigjaner et.al., Paper: http://arxiv.org/abs/2602.02456
- 2025-12-08, Relational Visual Similarity, Thao Nguyen et.al., Paper: http://arxiv.org/abs/2512.07833
- 2026-01-16, Relational Linearity is a Predictor of Hallucinations, Yuetian Lu et.al., Paper: http://arxiv.org/abs/2601.11429
- 2025-11-25, Reinforcing Action Policies by Prophesying, Jiahui Zhang et.al., Paper: http://arxiv.org/abs/2511.20633
- 2026-01-28, Reinforcement Learning via Self-Distillation, Jonas Hübotter et.al., Paper: http://arxiv.org/abs/2601.20802
- 2026-02-18, Reinforced Fast Weights with Next-Sequence Prediction, Hee Seung Hwang et.al., Paper: http://arxiv.org/abs/2602.16704
- 2026-02-04, Reinforced Attention Learning, Bangzheng Li et.al., Paper: http://arxiv.org/abs/2602.04884
- 2026-01-27, Reimagining Social Robots as Recommender Systems: Foundations, Framework, and Applications, Jin Huang et.al., Paper: http://arxiv.org/abs/2601.19761
- 2025-11-13, Regular Games – an Automata-Based General Game Playing Language, Radosław Miernik et.al., Paper: http://arxiv.org/abs/2511.10593
- 2026-02-03, RegionReasoner: Region-Grounded Multi-Round Visual Reasoning, Wenfang Sun et.al., Paper: http://arxiv.org/abs/2602.03733
- 2025-12-19, Region-Constraint In-Context Generation for Instructional Video Editing, Zhongwei Zhang et.al., Paper: http://arxiv.org/abs/2512.17650
- 2025-11-24, Refractive neutrino masses in the solar DM halo: Can the dark-LMA solution be revived?, Susobhan Chattopadhyay et.al., Paper: http://arxiv.org/abs/2511.19420
- 2026-01-27, Reflective Translation: Improving Low-Resource Machine Translation via Structured Self-Reflection, Nicholas Cheng et.al., Paper: http://arxiv.org/abs/2601.19871
- 2026-01-26, Reflect: Transparent Principle-Guided Reasoning for Constitutional Alignment at Scale, Henry Bell et.al., Paper: http://arxiv.org/abs/2601.18730
- 2026-01-12, Reference Games as a Testbed for the Alignment of Model Uncertainty and Clarification Requests, Manar Ali et.al., Paper: http://arxiv.org/abs/2601.07820
- 2026-03-25, RefReward-SR: LR-Conditioned Reward Modeling for Preference-Aligned Super-Resolution, Yushuai Song et.al., Paper: http://arxiv.org/abs/2603.24198
- 2026-02-18, Recursive language models for jailbreak detection: a procedural defense for tool-augmented agents, Doron Shavit et.al., Paper: http://arxiv.org/abs/2602.16520
- 2026-03-02, Recursive Think-Answer Process for LLMs and VLMs, Byung-Kwan Lee et.al., Paper: http://arxiv.org/abs/2603.02099
- 2026-03-02, Recursive Models for Long-Horizon Reasoning, Chenxiao Yang et.al., Paper: http://arxiv.org/abs/2603.02112
- 2026-02-17, Recursive Concept Evolution for Compositional Reasoning in Large Language Models, Sarim Chaudhry et.al., Paper: http://arxiv.org/abs/2602.15725
- 2026-02-25, Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets, Hanna Yukhymenko et.al., Paper: http://arxiv.org/abs/2602.22207
- 2026-03-10, RecThinker: An Agentic Framework for Tool-Augmented Reasoning in Recommendation, Haobo Zhang et.al., Paper: http://arxiv.org/abs/2603.09843
- 2025-12-16, RecGPT-V2 Technical Report, Chao Yi et.al., Paper: http://arxiv.org/abs/2512.14503
- 2026-02-03, Reasoning with Latent Tokens in Diffusion Language Models, Andre He et.al., Paper: http://arxiv.org/abs/2602.03769
- 2026-03-13, Reasoning over Video: Evaluating How MLLMs Extract, Integrate, and Reconstruct Spatiotemporal Evidence, Seunghwan Bang et.al., Paper: http://arxiv.org/abs/2603.13091
- 2026-01-29, Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers, Xin Chen et.al., Paper: http://arxiv.org/abs/2601.22139
- 2026-03-05, Reasoning Theater: Disentangling Model Beliefs from Chain-of-Thought, Siddharth Boppana et.al., Paper: http://arxiv.org/abs/2603.05488
- 2026-04-01, Reasoning Shift: How Context Silently Shortens LLM Reasoning, Gleb Rodionov et.al., Paper: http://arxiv.org/abs/2604.01161
- 2026-01-23, Reasoning Promotes Robustness in Theory of Mind Tasks, Ian B. de Haan et.al., Paper: http://arxiv.org/abs/2601.16853
- 2026-01-13, Reasoning Matters for 3D Visual Grounding, Hsiang-Wei Huang et.al., Paper: http://arxiv.org/abs/2601.08811
- 2025-10-21, Reasoning Language Model Inference Serving Unveiled: An Empirical Study, Qi Li et.al., Paper: http://arxiv.org/abs/2510.18672
- 2026-03-20, Reasoning Gets Harder for LLMs Inside A Dialogue, Ivan Kartáč et.al., Paper: http://arxiv.org/abs/2603.20133
- 2026-03-02, Reasoning Core: A Scalable Procedural Data Generation Suite for Symbolic Pre-training and Post-Training, Valentin Lacombe et.al., Paper: http://arxiv.org/abs/2603.02208
- 2026-02-03, Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL, Ian Wu et.al., Paper: http://arxiv.org/abs/2602.03773
- 2025-12-08, ReasonBENCH: Benchmarking the (In)Stability of LLM Reasoning, Nearchos Potamitis et.al., Paper: http://arxiv.org/abs/2512.07795
- 2026-03-09, Reading $\neq$ Seeing: Diagnosing and Closing the Typography Gap in Vision-Language Models, Heng Zhou et.al., Paper: http://arxiv.org/abs/2603.08497
- 2025-12-24, ReaSeq: Unleashing World Knowledge via Reasoning for Sequential Modeling, Chuan Wang et.al., Paper: http://arxiv.org/abs/2512.21257
- 2025-12-19, ReX-MLE: The Autonomous Agent Benchmark for Medical Imaging Challenges, Roshan Kenia et.al., Paper: http://arxiv.org/abs/2512.17838
- 2026-03-20, ReViSQL: Achieving Human-Level Text-to-SQL, Yuxuan Zhu et.al., Paper: http://arxiv.org/abs/2603.20004
- 2025-12-10, ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning, Xinyu Liu et.al., Paper: http://arxiv.org/abs/2512.09924
- 2026-03-11, ReTabSyn: Realistic Tabular Data Synthesis via Reinforcement Learning, Xiaofeng Lin et.al., Paper: http://arxiv.org/abs/2603.10823
- 2026-02-23, ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models, Andre He et.al., Paper: http://arxiv.org/abs/2602.20117
- 2026-01-20, ReSearch: A Multi-Stage Machine Learning Framework for Earth Science Data Discovery, Youran Sun et.al., Paper: http://arxiv.org/abs/2601.14176
- 2025-11-26, ReSAM: Refine, Requery, and Reinforce: Self-Prompting Point-Supervised Segmentation for Remote Sensing Images, M. Naseer Subhani et.al., Paper: http://arxiv.org/abs/2511.21606
- 2026-01-30, ReGuLaR: Variational Latent Reasoning Guided by Rendered Chain-of-Thought, Fanmeng Wang et.al., Paper: http://arxiv.org/abs/2601.23184
- 2025-12-15, ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding, Jia-Nan Li et.al., Paper: http://arxiv.org/abs/2512.13586
- 2025-10-28, ReForm: Reflective Autoformalization with Prospective Bounded Sequence Optimization, Guoxin Chen et.al., Paper: http://arxiv.org/abs/2510.24592
- 2025-10-27, ReCode: Unify Plan and Action for Universal Granularity Control, Zhaoyang Yu et.al., Paper: http://arxiv.org/abs/2510.23564
- 2025-12-18, Radiology Report Generation with Layer-Wise Anatomical Attention, Emmanuel D. Muñiz-De-León et.al., Paper: http://arxiv.org/abs/2512.16841
- 2026-03-25, RVLM: Recursive Vision-Language Models with Adaptive Depth, Nicanor Mayumu et.al., Paper: http://arxiv.org/abs/2603.24224
- 2026-02-25, RT-RMOT: A Dataset and Framework for RGB-Thermal Referring Multi-Object Tracking, Yanqiu Yu et.al., Paper: http://arxiv.org/abs/2602.22033
- 2025-11-25, ROOT: Robust Orthogonalized Optimizer for Neural Network Training, Wei He et.al., Paper: http://arxiv.org/abs/2511.20626
- 2025-11-10, RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments, Zhiyuan Zeng et.al., Paper: http://arxiv.org/abs/2511.07317
- 2025-10-22, RLIE: Rule Generation with Logistic Regression, Iterative Refinement, and Evaluation for Large Language Models, Yang Yang et.al., Paper: http://arxiv.org/abs/2510.19698
- 2025-12-08, RL-MTJail: Reinforcement Learning for Automated Black-Box Multi-Turn Jailbreaking of Large Language Models, Xiqiao Xiong et.al., Paper: http://arxiv.org/abs/2512.07761
- 2026-01-16, RITA: A Tool for Automated Requirements Classification and Specification from Online User Feedback, Manjeshwar Aniruddh Mallya et.al., Paper: http://arxiv.org/abs/2601.11362
- 2026-03-07, RILEC: Detection and Generation of L1 Russian Interference Errors in English Learner Texts, Darya Kharlamova et.al., Paper: http://arxiv.org/abs/2603.07366
- 2025-12-10, RIFT: A Scalable Methodology for LLM Accelerator Fault Assessment using Reinforcement Learning, Khurram Khalil et.al., Paper: http://arxiv.org/abs/2512.09829
- 2026-02-16, RF-GPT: Teaching AI to See the Wireless World, Hang Zou et.al., Paper: http://arxiv.org/abs/2602.14833
- 2025-10-24, RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models, Xueyuan Lin et.al., Paper: http://arxiv.org/abs/2510.21604
- 2025-11-21, REMSA: An LLM Agent for Foundation Model Selection in Remote Sensing, Binger Chen et.al., Paper: http://arxiv.org/abs/2511.17442
- 2025-10-24, REMONI: An Autonomous System Integrating Wearables and Multimodal Large Language Models for Enhanced Remote Health Monitoring, Thanh Cong Ho et.al., Paper: http://arxiv.org/abs/2510.21445
- 2025-12-03, RELIC: Interactive Video World Model with Long-Horizon Memory, Yicong Hong et.al., Paper: http://arxiv.org/abs/2512.04040
- 2026-03-17, RECOVER: Robust Entity Correction via agentic Orchestration of hypothesis Variants for Evidence-based Recovery, Abhishek Kumar et.al., Paper: http://arxiv.org/abs/2603.16411
- 2025-10-31, RDMA Point-to-Point Communication for LLM Systems, Nandor Licker et.al., Paper: http://arxiv.org/abs/2510.27656
- 2025-12-22, RAPID-LLM: Resilience-Aware Performance analysis of Infrastructure for Distributed LLM Training and Inference, George Karfakis et.al., Paper: http://arxiv.org/abs/2512.19606
- 2026-03-04, RANGER: Sparsely-Gated Mixture-of-Experts with Adaptive Retrieval Re-ranking for Pathology Report Generation, Yixin Chen et.al., Paper: http://arxiv.org/abs/2603.04348
- 2026-03-06, RAMoEA-QA: Hierarchical Specialization for Robust Respiratory Audio Question Answering, Gaia A. Bertolino et.al., Paper: http://arxiv.org/abs/2603.06542
- 2026-03-18, RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference, Arpit Singh Gautam et.al., Paper: http://arxiv.org/abs/2603.17891
- 2025-12-31, RAIR: A Rule-Aware Benchmark Uniting Challenging Long-Tail and Visual Salience Subset for E-commerce Relevance Assessment, Chenji Lu et.al., Paper: http://arxiv.org/abs/2512.24943
- 2026-03-29, RAGent: Physics-Aware Agentic Reasoning for Training-Free mmWave Human Activity Recognition, Mingda Han et.al., Paper: http://arxiv.org/abs/2603.27571
- 2025-11-06, RAGalyst: Automated Human-Aligned Agentic Evaluation for Domain-Specific RAG, Joshua Gao et.al., Paper: http://arxiv.org/abs/2511.04502
- 2026-01-13, RAGShaper: Eliciting Sophisticated Agentic RAG Skills via Automated Data Synthesis, Zhengwei Tao et.al., Paper: http://arxiv.org/abs/2601.08699
- 2025-10-23, RAGRank: Using PageRank to Counter Poisoning in CTI LLM Pipelines, Austin Jia et.al., Paper: http://arxiv.org/abs/2510.20768
- 2025-11-05, RAGBoost: Efficient Retrieval-Augmented Generation with Accuracy-Preserving Context Reuse, Yinsicheng Jiang et.al., Paper: http://arxiv.org/abs/2511.03475
- 2025-11-26, Qwen3-VL Technical Report, Shuai Bai et.al., Paper: http://arxiv.org/abs/2511.21631
- 2025-11-06, Question the Questions: Auditing Representation in Online Deliberative Processes, Soham De et.al., Paper: http://arxiv.org/abs/2511.04588
- 2025-11-13, Querying Labeled Time Series Data with Scenario Programs, Edward Kim et.al., Paper: http://arxiv.org/abs/2511.10627
- 2026-02-12, Query-focused and Memory-aware Reranker for Long Context Processing, Yuqing Li et.al., Paper: http://arxiv.org/abs/2602.12192
- 2026-01-28, QueerGen: How LLMs Reflect Societal Norms on Gender and Sexuality in Sentence Completion Tasks, Mae Sosto et.al., Paper: http://arxiv.org/abs/2601.20731
- 2026-02-18, Quecto-V1: Empirical Analysis of 8-bit Quantized Small Language Models for On-Device Legal Retrieval, Subrit Dikshit et.al., Paper: http://arxiv.org/abs/2602.16640
- 2025-11-19, Quantum-Guided Test Case Minimization for LLM-Based Code Generation, Huixiang Zhang et.al., Paper: http://arxiv.org/abs/2511.15665
- 2026-02-10, Quantum-Audit: Evaluating the Reasoning Limits of LLMs on Quantum Computing, Mohamed Afane et.al., Paper: http://arxiv.org/abs/2602.10092
- 2026-02-20, Quantum Maximum Likelihood Prediction via Hilbert Space Embeddings, Sreejith Sreekumar et.al., Paper: http://arxiv.org/abs/2602.18364
- 2025-11-28, Quantized-Tinyllava: a new multimodal foundation model enables efficient split learning, Jiajun Guo et.al., Paper: http://arxiv.org/abs/2511.23402
- 2026-02-13, Quantization-Robust LLM Unlearning via Low-Rank Adaptation, João Vitor Boer Abitante et.al., Paper: http://arxiv.org/abs/2602.13151
- 2026-04-02, Quantifying Self-Preservation Bias in Large Language Models, Matteo Migliarini et.al., Paper: http://arxiv.org/abs/2604.02174
- 2025-12-22, QuantiPhy: A Quantitative Benchmark Evaluating Physical Reasoning Abilities of Vision-Language Models, Li Puyin et.al., Paper: http://arxiv.org/abs/2512.19526
- 2026-01-13, QuantEval: A Benchmark for Financial Quantitative Tasks in Large Language Models, Zhaolu Kang et.al., Paper: http://arxiv.org/abs/2601.08689
- 2026-02-20, Qualitative Coding Analysis through Open-Source Large Language Models: A User Study and Design Recommendations, Tung T. Ngo et.al., Paper: http://arxiv.org/abs/2602.18352
- 2025-12-24, Quadrupped-Legged Robot Movement Plan Generation using Large Language Model, Muhtadin et.al., Paper: http://arxiv.org/abs/2512.21293
- 2025-11-18, Quadratic Term Correction on Heaps’ Law, Oscar Fontanelli et.al., Paper: http://arxiv.org/abs/2511.14683
- 2026-03-16, QiboAgent: a practitioner’s guideline to open source assistants for Quantum Computing code development, Lorenzo Esposito et.al., Paper: http://arxiv.org/abs/2603.15538
- 2026-02-03, QVLA: Not All Channels Are Equal in Vision-Language-Action Model’s Quantization, Yuhao Xu et.al., Paper: http://arxiv.org/abs/2602.03782
- 2026-01-02, QSLM: A Performance- and Memory-aware Quantization Framework with Tiered Search Strategy for Spike-driven Language Models, Rachmad Vidya Wicaksana Putra et.al., Paper: http://arxiv.org/abs/2601.00679
- 2026-03-29, Q-BIOLAT: Binary Latent Protein Fitness Landscapes for QUBO-Based Optimization, Truong-Son Hy et.al., Paper: http://arxiv.org/abs/2603.27526
- 2026-02-19, Pushing the Frontier of Black-Box LVLM Attacks via Fine-Grained Detail Targeting, Xiaohan Zhao et.al., Paper: http://arxiv.org/abs/2602.17645
- 2025-12-11, PubTables-v2: A new large-scale dataset for full-page and multi-page table extraction, Brandon Smock et.al., Paper: http://arxiv.org/abs/2512.10888
- 2025-12-10, Provably Learning from Modern Language Models via Low Logit Rank, Noah Golowich et.al., Paper: http://arxiv.org/abs/2512.09892
- 2026-01-22, Provable Robustness in Multimodal Large Language Models via Feature Space Smoothing, Song Xia et.al., Paper: http://arxiv.org/abs/2601.16200
- 2025-12-08, Provable Long-Range Benefits of Next-Token Prediction, Xinyuan Cao et.al., Paper: http://arxiv.org/abs/2512.07818
- 2026-02-25, Provable Last-Iterate Convergence for Multi-Objective Safe LLM Alignment via Optimistic Primal-Dual, Yining Li et.al., Paper: http://arxiv.org/abs/2602.22146
- 2025-11-10, Provable Benefit of Curriculum in Transformer Tree-Reasoning Post-Training, Dake Bu et.al., Paper: http://arxiv.org/abs/2511.07372
- 2025-12-02, ProteinPNet: Prototypical Part Networks for Concept Learning in Spatial Proteomics, Louis McConnell et.al., Paper: http://arxiv.org/abs/2512.02983
- 2025-11-17, Protein Secondary Structure Prediction Using 3D Graphs and Relation-Aware Message Passing Transformers, Disha Varshney et.al., Paper: http://arxiv.org/abs/2511.13685
- 2026-02-04, Protein Autoregressive Modeling via Multiscale Structure Generation, Yanru Qu et.al., Paper: http://arxiv.org/abs/2602.04883
- 2025-10-21, Prompting the Priorities: A First Look at Evaluating LLMs for Vulnerability Triage and Prioritization, Osama Al Haddad et.al., Paper: http://arxiv.org/abs/2510.18508
- 2026-03-19, PromptHub: Enhancing Multi-Prompt Visual In-Context Learning with Locality-Aware Fusion, Concentration and Alignment, Tianci Luo et.al., Paper: http://arxiv.org/abs/2603.18891
- 2026-02-24, Prompt-Level Distillation: A Non-Parametric Alternative to Model Fine-Tuning for Efficient Reasoning, Sanket Badhe et.al., Paper: http://arxiv.org/abs/2602.21103
- 2026-03-17, Prompt Programming for Cultural Bias and Alignment of Large Language Models, Maksim Eren et.al., Paper: http://arxiv.org/abs/2603.16827
- 2025-11-24, Prompt Less, Smile More: MTP with Semantic Engineering in Lieu of Prompt Engineering, Jayanaka L. Dantanarayana et.al., Paper: http://arxiv.org/abs/2511.19427
- 2026-03-24, Prompt Amplification and Zero-Shot Late Fusion in Audio-Language Models for Speech Emotion Recognition, Saurabh Kataria et.al., Paper: http://arxiv.org/abs/2603.23057
- 2026-01-05, Project Ariadne: A Structural Causal Framework for Auditing Faithfulness in LLM Agents, Sourena Khanzadeh et.al., Paper: http://arxiv.org/abs/2601.02314
- 2026-03-05, Progressive Residual Warmup for Language Model Pretraining, Tianhao Chen et.al., Paper: http://arxiv.org/abs/2603.05369
- 2025-10-30, ProfOlaf: Semi-Automated Tool for Systematic Literature Reviews, Martim Afonso et.al., Paper: http://arxiv.org/abs/2510.26750
- 2026-01-28, ProfInfer: An eBPF-based Fine-Grained LLM Inference Profiler, Bohua Zou et.al., Paper: http://arxiv.org/abs/2601.20755
- 2025-10-29, Process-Level Trajectory Evaluation for Environment Configuration in Software Engineering Agents, Jiayi Kuang et.al., Paper: http://arxiv.org/abs/2510.25694
- 2025-12-05, Probing the effectiveness of World Models for Spatial Reasoning through Test-time Scaling, Saurav Jha et.al., Paper: http://arxiv.org/abs/2512.05809
- 2026-03-10, Probing the Reliability of Driving VLMs: From Inconsistent Responses to Grounded Temporal Reasoning, Chun-Peng Chang et.al., Paper: http://arxiv.org/abs/2603.09512
- 2026-03-17, Probing Cultural Signals in Large Language Models through Author Profiling, Valentin Lafargue et.al., Paper: http://arxiv.org/abs/2603.16749
- 2026-03-18, ProbeFlow: Training-Free Adaptive Flow Matching for Vision-Language-Action Models, Zhou Fang et.al., Paper: http://arxiv.org/abs/2603.17850
- 2025-10-21, Probabilistic Modeling of Intentions in Socially Intelligent LLM Agents, Feifan Xia et.al., Paper: http://arxiv.org/abs/2510.18476
- 2026-01-02, Probabilistic Guarantees for Reducing Contextual Hallucinations in LLMs, Nils Rautenberg et.al., Paper: http://arxiv.org/abs/2601.00641
- 2025-10-21, Proactive Reasoning-with-Retrieval Framework for Medical Multimodal Large Language Models, Lehan Wang et.al., Paper: http://arxiv.org/abs/2510.18303
- 2026-02-17, Proactive Conversational Assistant for a Procedural Manual Task based on Audio and IMU, Rehana Mahfuz et.al., Paper: http://arxiv.org/abs/2602.15707
- 2026-04-01, ProCap: Projection-Aware Captioning for Spatial Augmented Reality, Zimo Cao et.al., Paper: http://arxiv.org/abs/2604.00912
- 2025-12-08, Privacy Practices of Browser Agents, Alisha Ukani et.al., Paper: http://arxiv.org/abs/2512.07725
- 2026-01-21, Privacy Collapse: Benign Fine-Tuning Can Break Contextual Privacy in Language Models, Anmol Goel et.al., Paper: http://arxiv.org/abs/2601.15220
- 2025-12-09, PrivTune: Efficient and Privacy-Preserving Fine-Tuning of Large Language Models via Device-Cloud Collaboration, Yi Liu et.al., Paper: http://arxiv.org/abs/2512.08809
- 2026-01-13, PrivGemo: Privacy-Preserving Dual-Tower Graph Retrieval for Empowering LLM Reasoning with Memory Augmentation, Xingyu Tan et.al., Paper: http://arxiv.org/abs/2601.08739
- 2026-03-18, Pretrained Multilingual Transformers Reveal Quantitative Distance Between Human Languages, Yue Zhao et.al., Paper: http://arxiv.org/abs/2603.17912
- 2025-11-10, Preparation of Fractal-Inspired Computational Architectures for Advanced Large Language Model Analysis, Yash Mittal et.al., Paper: http://arxiv.org/abs/2511.07329
- 2025-12-26, Prefill vs. Decode Bottlenecks: SRAM-Frequency Tradeoffs and the Memory-Bandwidth Ceiling, Hannah Atmer et.al., Paper: http://arxiv.org/abs/2512.22066
- 2025-10-21, Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options, Joongkyu Lee et.al., Paper: http://arxiv.org/abs/2510.18713
- 2026-02-27, Preference Packing: Efficient Preference Optimization for Large Language Models, Jaekyung Cho et.al., Paper: http://arxiv.org/abs/2602.24082
- 2026-02-05, Predicting Camera Pose from Perspective Descriptions for Spatial Reasoning, Xuejun Zhang et.al., Paper: http://arxiv.org/abs/2602.06041
- 2026-01-16, Predict the Retrieval! Test time adaptation for Retrieval Augmented Generation, Xin Sun et.al., Paper: http://arxiv.org/abs/2601.11443
- 2025-11-07, PreResQ-R1: Towards Fine-Grained Rank-and-Score Reinforcement Learning for Visual Quality Assessment via Preference-Response Disentangled Policy Optimization, Zehui Feng et.al., Paper: http://arxiv.org/abs/2511.05393
- 2026-01-05, Power-of-Two Quantization-Aware-Training (PoT-QAT) in Large Language Models (LLMs), Mahmoud Elgenedy et.al., Paper: http://arxiv.org/abs/2601.02298
- 2025-12-03, PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design, Jiazhe Wei et.al., Paper: http://arxiv.org/abs/2512.04082
- 2026-03-09, PostTrainBench: Can LLM Agents Automate LLM Post-Training?, Ben Rank et.al., Paper: http://arxiv.org/abs/2603.08640
- 2026-01-27, Post-LayerNorm Is Back: Stable, ExpressivE, and Deep, Chen Chen et.al., Paper: http://arxiv.org/abs/2601.19895
- 2026-04-01, Positional Cognitive Specialization: Where Do LLMs Learn To Comprehend and Speak Your Language?, Luis Frentzen Salim et.al., Paper: http://arxiv.org/abs/2604.00923
- 2026-03-04, Position: Vector Prompt Interfaces Should Be Exposed to Enable Customization of Large Language Models, Liangwei Yang et.al., Paper: http://arxiv.org/abs/2603.04292
- 2026-03-07, Position: LLMs Must Use Functor-Based and RAG-Driven Bias Mitigation for Fairness, Ravi Ranjan et.al., Paper: http://arxiv.org/abs/2603.07368
- 2025-10-21, Position: LLM Watermarking Should Align Stakeholders’ Incentives for Practical Adoption, Yepeng Liu et.al., Paper: http://arxiv.org/abs/2510.18333
- 2026-02-23, Position: General Alignment Has Hit a Ceiling; Edge Alignment Must Be Taken Seriously, Han Bao et.al., Paper: http://arxiv.org/abs/2602.20042
- 2025-12-16, Polypersona: Persona-Grounded LLM for Synthetic Survey Responses, Tejaswani Dash et.al., Paper: http://arxiv.org/abs/2512.14562
- 2026-04-01, Policy Improvement Reinforcement Learning, Huaiyang Wang et.al., Paper: http://arxiv.org/abs/2604.00860
- 2026-03-24, Polaris: A Gödel Agent Framework for Small Language Models through Experience-Abstracted Policy Repair, Aditya Kakade et.al., Paper: http://arxiv.org/abs/2603.23129
- 2026-03-04, Pointer-CAD: Unifying B-Rep and Command Sequences via Pointer-based Edges & Faces Selection, Dacheng Qi et.al., Paper: http://arxiv.org/abs/2603.04337
- 2025-10-27, Point Convergence of Nesterov’s Accelerated Gradient Method: An AI-Assisted Proof, Uijeong Jang et.al., Paper: http://arxiv.org/abs/2510.23513
- 2026-01-22, Point Bridge: 3D Representations for Cross Domain Policy Learning, Siddhant Haldar et.al., Paper: http://arxiv.org/abs/2601.16212
- 2026-01-23, PocketDVDNet: Realtime Video Denoising for Real Camera Noise, Crispian Morris et.al., Paper: http://arxiv.org/abs/2601.16780
- 2026-03-17, PlotTwist: A Creative Plot Generation Framework with Small Language Models, Abhinav Thorat et.al., Paper: http://arxiv.org/abs/2603.16410
- 2026-02-06, Plato’s Form: Toward Backdoor Defense-as-a-Service for LLMs with Prototype Representations, Chen Chen et.al., Paper: http://arxiv.org/abs/2602.06887
- 2025-10-21, PlanU: Large Language Model Decision Making through Planning under Uncertainty, Ziwei Deng et.al., Paper: http://arxiv.org/abs/2510.18442
- 2025-10-23, Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs, Yanlin Song et.al., Paper: http://arxiv.org/abs/2510.20691
- 2026-01-05, Placement Semantics for Distributed Deep Learning: A Systematic Framework for Analyzing Parallelism Strategies, Deep Pankajbhai Mehta et.al., Paper: http://arxiv.org/abs/2601.02311
- 2025-10-27, PixelRefer: A Unified Framework for Spatio-Temporal Object Referring with Arbitrary Granularity, Yuqian Yuan et.al., Paper: http://arxiv.org/abs/2510.23603
- 2026-04-01, PixelPrune: Pixel-Level Adaptive Visual Token Reduction via Predictive Coding, Nan Wang et.al., Paper: http://arxiv.org/abs/2604.00886
- 2025-11-06, PixCLIP: Achieving Fine-grained Visual Language Understanding via Any-granularity Pixel-Text Alignment Learning, Yicheng Xiao et.al., Paper: http://arxiv.org/abs/2511.04601
- 2026-03-11, PivotAttack: Rethinking the Search Trajectory in Hard-Label Text Attacks via Pivot Words, Yuzhi Liang et.al., Paper: http://arxiv.org/abs/2603.10842
- 2026-03-20, Pitfalls in Evaluating Interpretability Agents, Tal Haklay et.al., Paper: http://arxiv.org/abs/2603.20101
- 2026-01-02, Physio-DPO: Aligning Large Language Models with the Protein Energy Landscape to Eliminate Structural Hallucinations, QiWei Meng et.al., Paper: http://arxiv.org/abs/2601.00647
- 2026-01-22, PhysicsMind: Sim and Real Mechanics Benchmarking for Physical Reasoning and Prediction in Foundational VLMs and World Models, Chak-Wing Mak et.al., Paper: http://arxiv.org/abs/2601.16007
- 2026-02-05, PhysicsAgentABM: Physics-Guided Generative Agent-Based Modeling, Kavana Venkatesh et.al., Paper: http://arxiv.org/abs/2602.06030
- 2025-12-03, Physics-Embedded Gaussian Process for Traffic State Estimation, Yanlin Chen et.al., Paper: http://arxiv.org/abs/2512.04004
- 2025-12-05, Physically-Based Simulation of Automotive LiDAR, L. Dudzik et.al., Paper: http://arxiv.org/abs/2512.05932
- 2025-12-31, PhysTalk: Language-driven Real-time Physics in 3D Gaussian Scenes, Luca Collorone et.al., Paper: http://arxiv.org/abs/2512.24986
- 2025-12-18, PhysBrain: Human Egocentric Data as a Bridge from Vision Language Models to Physical Intelligence, Xiaopeng Lin et.al., Paper: http://arxiv.org/abs/2512.16793
- 2026-01-27, Phonological Tokenizer: Prosody-Aware Phonetic Token via Multi-Objective Fine-Tuning with Differentiable K-Means, Kentaro Onda et.al., Paper: http://arxiv.org/abs/2601.19781
- 2025-10-31, Phased DMD: Few-step Distribution Matching Distillation via Score Matching within Subintervals, Xiangyu Fan et.al., Paper: http://arxiv.org/abs/2510.27684
- 2026-04-01, Phase transition on a context-sensitive random language model with short range interactions, Yuma Toji et.al., Paper: http://arxiv.org/abs/2604.00947
- 2026-01-23, Persuasion Tokens for Editing Factual Knowledge in LLMs, Paul Youssef et.al., Paper: http://arxiv.org/abs/2601.16781
- 2025-12-04, Personalizing Agent Privacy Decisions via Logical Entailment, James Flemings et.al., Paper: http://arxiv.org/abs/2512.05065
- 2026-03-12, PersonaTrace: Synthesizing Realistic Digital Footprints with LLM Agents, Minjia Wang et.al., Paper: http://arxiv.org/abs/2603.11955
- 2025-11-21, PersonaAgent with GraphRAG: Community-Aware Knowledge Graphs for Personalized LLM, Siqi Liang et.al., Paper: http://arxiv.org/abs/2511.17467
- 2026-01-28, Persona Prompting as a Lens on LLM Social Reasoning, Jing Yang et.al., Paper: http://arxiv.org/abs/2601.20757
- 2025-11-17, Person-AI Bidirectional Fit - A Proof-Of-Concept Case Study Of Augmented Human-Ai Symbiosis In Management Decision-Making Process, Agnieszka Bieńkowska et.al., Paper: http://arxiv.org/abs/2511.13670
- 2026-03-05, PersianPunc: A Large-Scale Dataset and BERT-Based Approach for Persian Punctuation Restoration, Mohammad Javad Ranjbar Kalahroodi et.al., Paper: http://arxiv.org/abs/2603.05314
- 2026-03-31, Performative Scenario Optimization, Quanyan Zhu et.al., Paper: http://arxiv.org/abs/2603.29982
- 2025-11-05, PerfDojo: Automated ML Library Generation for Heterogeneous Architectures, Andrei Ivanov et.al., Paper: http://arxiv.org/abs/2511.03586
- 2026-03-19, Perceptio: Perception Enhanced Vision Language Models via Spatial Token Generation, Yuchen Li et.al., Paper: http://arxiv.org/abs/2603.18795
- 2025-12-26, Perceive and Calibrate: Analyzing and Enhancing Robustness of Medical Multi-Modal Large Language Models, Dunyuan XU et.al., Paper: http://arxiv.org/abs/2512.21964
- 2025-12-16, PerProb: Indirectly Evaluating Memorization in Large Language Models, Yihan Liao et.al., Paper: http://arxiv.org/abs/2512.14600
- 2026-03-06, Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders, Boqiang Zhang et.al., Paper: http://arxiv.org/abs/2603.06569
- 2026-03-02, Pencil Puzzle Bench: A Benchmark for Multi-Step Verifiable Reasoning, Justin Waugh et.al., Paper: http://arxiv.org/abs/2603.02119
- 2024-11-12, PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications, Dingkang Yang et.al., Paper: http://arxiv.org/abs/2405.19266
- 2026-02-12, Pedagogically-Inspired Data Synthesis for Language Model Knowledge Distillation, Bowei He et.al., Paper: http://arxiv.org/abs/2602.12172
- 2026-02-13, Peaceful Anarcho-Accelerationism: Decentralized Full Automation for a Society of Universal Care, Eduardo C. Garrido-Merchán et.al., Paper: http://arxiv.org/abs/2602.13154
- 2026-01-29, Pay for Hints, Not Answers: LLM Shepherding for Cost-Efficient Inference, Ziming Dong et.al., Paper: http://arxiv.org/abs/2601.22132
- 2025-10-31, Patient-Centered Summarization Framework for AI Clinical Summarization: A Mixed-Methods Design, Maria Lizarazo Jimenez et.al., Paper: http://arxiv.org/abs/2510.27535
- 2026-03-10, PathMem: Toward Cognition-Aligned Memory Transformation for Pathology MLLMs, Jinyue Li et.al., Paper: http://arxiv.org/abs/2603.09943
- 2025-12-19, PathFLIP: Fine-grained Language-Image Pretraining for Versatile Computational Pathology, Fengchun Liu et.al., Paper: http://arxiv.org/abs/2512.17621
- 2025-11-17, Part-X-MLLM: Part-aware 3D Multimodal Large Language Model, Chunshi Wang et.al., Paper: http://arxiv.org/abs/2511.13647
- 2025-11-13, ParoQuant: Pairwise Rotation Quantization for Efficient Reasoning LLM Inference, Yesheng Liang et.al., Paper: http://arxiv.org/abs/2511.10645
- 2026-02-06, Parameters as Experts: Adapting Vision Models with Dynamic Parameter Routing, Meng Lou et.al., Paper: http://arxiv.org/abs/2602.06862
- 2025-12-24, Parallel Token Prediction for Language Models, Felix Draxler et.al., Paper: http://arxiv.org/abs/2512.21323
- 2025-10-21, ParaStyleTTS: Toward Efficient and Robust Paralinguistic Style Control for Expressive Text-to-Speech Generation, Haowei Lou et.al., Paper: http://arxiv.org/abs/2510.18308
- 2025-10-24, ParaRNN: Unlocking Parallel Training of Nonlinear RNNs for Large Language Models, Federico Danieli et.al., Paper: http://arxiv.org/abs/2510.21450
- 2026-01-30, PaperBanana: Automating Academic Illustration for AI Scientists, Dawei Zhu et.al., Paper: http://arxiv.org/abs/2601.23265
- 2026-03-12, Paper Title: LoV3D: Grounding Cognitive Prognosis Reasoning in Longitudinal 3D Brain MRI via Regional Volume Assessments, Zhaoyang Jiang et.al., Paper: http://arxiv.org/abs/2603.12071
- 2025-12-16, Pairwise Comparison for Bias Identification and Quantification, Fabian Haak et.al., Paper: http://arxiv.org/abs/2512.14565
- 2025-10-30, PairUni: Pairwise Training for Unified Multimodal Language Models, Jiani Zheng et.al., Paper: http://arxiv.org/abs/2510.25682
- 2025-12-05, PRiSM: An Agentic Multimodal Benchmark for Scientific Reasoning via Python-Grounded Evaluation, Shima Imani et.al., Paper: http://arxiv.org/abs/2512.05930
- 2026-01-21, PROGRESSLM: Towards Progress Reasoning in Vision-Language Models, Jianshu Zhang et.al., Paper: http://arxiv.org/abs/2601.15224
- 2025-12-29, PROFASR-BENCH: A Benchmark for Context-Conditioned ASR in High-Stakes Professional Speech, Deepak Babu Piskala et.al., Paper: http://arxiv.org/abs/2512.23686
- 2025-10-28, PRISM-Bench: A Benchmark of Puzzle-Based Visual Tasks with CoT Error Detection, Yusu Qian et.al., Paper: http://arxiv.org/abs/2510.23594
- 2026-01-26, PRECISE: Reducing the Bias of LLM Evaluations Using Prediction-Powered Ranking Estimation, Abhishek Divekar et.al., Paper: http://arxiv.org/abs/2601.18777
- 2025-11-14, PRBench: Large-Scale Expert Rubrics for Evaluating High-Stakes Professional Reasoning, Afra Feyza Akyürek et.al., Paper: http://arxiv.org/abs/2511.11562
- 2026-03-29, PRBench: End-to-end Paper Reproduction in Physics Research, Shi Qiu et.al., Paper: http://arxiv.org/abs/2603.27646
- 2026-03-19, PPI is the Difference Estimator: Recognizing the Survey Sampling Roots of Prediction-Powered Inference, Reagan Mozer et.al., Paper: http://arxiv.org/abs/2603.19160
- 2026-03-25, PP-OCRv5: A Specialized 5M-Parameter Model Rivaling Billion-Parameter Vision-Language Models on OCR Tasks, Cheng Cui et.al., Paper: http://arxiv.org/abs/2603.24373
- 2026-01-26, POPE: Learning to Reason on Hard Problems via Privileged On-Policy Exploration, Yuxiao Qu et.al., Paper: http://arxiv.org/abs/2601.18779
- 2026-03-06, PONTE: Personalized Orchestration for Natural Language Trustworthy Explanations, Vittoria Vineis et.al., Paper: http://arxiv.org/abs/2603.06485
- 2025-11-20, POMA-3D: The Point Map Way to 3D Scene Understanding, Ye Mao et.al., Paper: http://arxiv.org/abs/2511.16567
- 2026-03-05, POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation, Zeju Qiu et.al., Paper: http://arxiv.org/abs/2603.05500
- 2026-04-02, PLUME: Latent Reasoning Based Universal Multimodal Embedding, Chenwei He et.al., Paper: http://arxiv.org/abs/2604.02073
- 2026-03-25, PINGALA: Prosody-Aware Decoding for Sanskrit Poetry Generation, Manoj Balaji Jagadeeshan et.al., Paper: http://arxiv.org/abs/2603.24413
- 2026-03-26, PICon: A Multi-Turn Interrogation Framework for Evaluating Persona Agent Consistency, Minseo Kim et.al., Paper: http://arxiv.org/abs/2603.25620
- 2026-02-26, PGVMS: A Prompt-Guided Unified Framework for Virtual Multiplex IHC Staining with Pathological Semantic Learning, Fuqiang Chen et.al., Paper: http://arxiv.org/abs/2602.23292
- 2025-10-31, PETAR: Localized Findings Generation with Mask-Aware Vision-Language Modeling for PET Automated Reporting, Danyal Maqbool et.al., Paper: http://arxiv.org/abs/2510.27680
- 2026-01-06, PET-TURTLE: Deep Unsupervised Support Vector Machines for Imbalanced Data Clusters, Javier Salazar Cavazos et.al., Paper: http://arxiv.org/abs/2601.03237
- 2025-12-12, PD-Swap: Prefill-Decode Logic Swapping for End-to-End LLM Inference on Edge FPGAs via Dynamic Partial Reconfiguration, Yifan Zhang et.al., Paper: http://arxiv.org/abs/2512.11550
- 2026-03-18, PCA-Seg: Revisiting Cost Aggregation for Open-Vocabulary Semantic and Part Segmentation, Jianjian Yin et.al., Paper: http://arxiv.org/abs/2603.17520
- 2023-11-09, PB-LLM: Partially Binarized Large Language Models, Yuzhang Shang et.al., Paper: http://arxiv.org/abs/2310.00034
- 2025-11-14, PAS : Prelim Attention Score for Detecting Object Hallucinations in Large Vision–Language Models, Nhat Hoang-Xuan et.al., Paper: http://arxiv.org/abs/2511.11502
- 2026-01-22, PAL*M: Property Attestation for Large Generative Models, Prach Chantasantitam et.al., Paper: http://arxiv.org/abs/2601.16199
- 2025-12-01, PAI-Bench: A Comprehensive Benchmark For Physical AI, Fengzhe Zhou et.al., Paper: http://arxiv.org/abs/2512.01989
- 2026-01-15, PACEvolve: Enabling Long-Horizon Progress-Aware Consistent Evolution, Minghao Yan et.al., Paper: http://arxiv.org/abs/2601.10657
- 2025-11-17, P1: Mastering Physics Olympiads with Reinforcement Learning, Jiacheng Chen et.al., Paper: http://arxiv.org/abs/2511.13612
- 2026-03-20, Overreliance on AI in Information-seeking from Video Content, Anders Giovanni Møller et.al., Paper: http://arxiv.org/abs/2603.19843
- 2026-03-29, Over-Refusal and Representation Subspaces: A Mechanistic Analysis of Task-Conditioned Refusal in Aligned LLMs, Utsav Maskey et.al., Paper: http://arxiv.org/abs/2603.27518
- 2026-01-21, Outcome-Based RL Provably Leads Transformers to Reason, but Only With the Right Data, Yuval Ran-Milo et.al., Paper: http://arxiv.org/abs/2601.15158
- 2025-11-05, Outbidding and Outbluffing Elite Humans: Mastering Liar’s Poker via Self-Play and Reinforcement Learning, Richard Dewey et.al., Paper: http://arxiv.org/abs/2511.03724
- 2025-12-01, Orientational lineage memory and mechanical ordering during diffusion-limited growth, Ilias-Marios Sarris et.al., Paper: http://arxiv.org/abs/2512.01981
- 2025-12-01, Order and shape dependence of mechanical relaxation in proliferating active matter, Jonas Isensee et.al., Paper: http://arxiv.org/abs/2512.01950
- 2026-03-06, OralGPT-Plus: Learning to Use Visual Tools via Reinforcement Learning for Panoramic X-ray Analysis, Yuxuan Fan et.al., Paper: http://arxiv.org/abs/2603.06366
- 2025-10-28, Optimizing Retrieval for RAG via Reinforced Contrastive Learning, Jiawei Zhou et.al., Paper: http://arxiv.org/abs/2510.24652
- 2025-11-28, Optimizing Multimodal Language Models through Attention-based Interpretability, Alexander Sergeev et.al., Paper: http://arxiv.org/abs/2511.23375
- 2026-03-25, Optimizing Multilingual LLMs via Federated Learning: A Study of Client Language Composition, Aleix Sant et.al., Paper: http://arxiv.org/abs/2603.24242
- 2025-11-14, Optimizing Mixture of Block Attention, Guangxuan Xiao et.al., Paper: http://arxiv.org/abs/2511.11571
- 2025-12-05, Optimizing Medical Question-Answering Systems: A Comparative Study of Fine-Tuned and Zero-Shot Large Language Models with RAG Framework, Tasnimul Hassan et.al., Paper: http://arxiv.org/abs/2512.05863
- 2026-02-06, Optimal Turkish Subword Strategies at Scale: Systematic Evaluation of Data, Vocabulary, Morphology Interplay, Duygu Altinok et.al., Paper: http://arxiv.org/abs/2602.06942
- 2026-03-19, Optimal Splitting of Language Models from Mixtures to Specialized Domains, Skyler Seto et.al., Paper: http://arxiv.org/abs/2603.19149
- 2025-11-04, Optimal Singular Damage: Efficient LLM Inference in Low Storage Regimes, Mohammadsajad Alipour et.al., Paper: http://arxiv.org/abs/2511.02681
- 2025-11-06, Optimal Inference Schedules for Masked Diffusion Models, Sitan Chen et.al., Paper: http://arxiv.org/abs/2511.04647
- 2026-02-17, Operationalising the Superficial Alignment Hypothesis via Task Complexity, Tomás Vergara-Browne et.al., Paper: http://arxiv.org/abs/2602.15829
- 2026-01-14, OpenVoxel: Training-Free Grouping and Captioning Voxels for Open-Vocabulary 3D Scene Understanding, Sheng-Yu Huang et.al., Paper: http://arxiv.org/abs/2601.09575
- 2026-03-16, OpenSeeker: Democratizing Frontier Search Agents by Fully Open-Sourcing Training Data, Yuwen Du et.al., Paper: http://arxiv.org/abs/2603.15594
- 2025-10-29, OpenReward: Learning to Reward Long-form Agentic Tasks via Reinforcement Learning, Ziyou Hu et.al., Paper: http://arxiv.org/abs/2510.24636
- 2026-03-29, OpenDPR: Open-Vocabulary Change Detection via Vision-Centric Diffusion-Guided Prototype Retrieval for Remote Sensing Imagery, Qi Guo et.al., Paper: http://arxiv.org/abs/2603.27645
- 2026-01-28, Open-Vocabulary Functional 3D Human-Scene Interaction Generation, Jie Liu et.al., Paper: http://arxiv.org/abs/2601.20835
- 2026-01-09, Open-Vocabulary 3D Instruction Ambiguity Detection, Jiayu Ding et.al., Paper: http://arxiv.org/abs/2601.05991
- 2025-10-28, Open Korean Historical Corpus: A Millennia-Scale Diachronic Collection of Public Domain Texts, Seyoung Song et.al., Paper: http://arxiv.org/abs/2510.24541
- 2025-11-17, Ontology-Driven Model-to-Model Transformation of Workflow Specifications, Francisco Abreu et.al., Paper: http://arxiv.org/abs/2511.13661
- 2026-03-18, Only relative ranks matter in weight-clustered large language models, Borja Aizpurua et.al., Paper: http://arxiv.org/abs/2603.17917
- 2026-04-01, Online Reasoning Calibration: Test-Time Training Enables Generalizable Conformal LLM Reasoning, Cai Zhou et.al., Paper: http://arxiv.org/abs/2604.01170
- 2026-03-19, Online Learning and Equilibrium Computation with Ranking Feedback, Mingyang Liu et.al., Paper: http://arxiv.org/abs/2603.19221
- 2026-03-17, Online Experiential Learning for Language Models, Tianzhu Ye et.al., Paper: http://arxiv.org/abs/2603.16856
- 2025-12-02, OneThinker: All-in-one Reasoning Model for Image and Video, Kaituo Feng et.al., Paper: http://arxiv.org/abs/2512.03043
- 2026-03-25, OneSearch-V2: The Latent Reasoning Enhanced Self-distillation Generative Search Framework, Ben Chen et.al., Paper: http://arxiv.org/abs/2603.24422
- 2025-10-21, One Size Fits All? A Modular Adaptive Sanitization Kit (MASK) for Customizable Privacy-Preserving Phone Scam Detection, Kangzhong Wang et.al., Paper: http://arxiv.org/abs/2510.18493
- 2025-11-05, One Battle After Another: Probing LLMs’ Limits on Multi-Turn Instruction Following with a Benchmark Evolving Framework, Qi Jia et.al., Paper: http://arxiv.org/abs/2511.03508
- 2026-01-26, One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment, Hongru Cai et.al., Paper: http://arxiv.org/abs/2601.18731
- 2026-02-12, On-Policy Context Distillation for Language Models, Tianzhu Ye et.al., Paper: http://arxiv.org/abs/2602.12275
- 2026-01-15, On the origin of neural scaling laws: from random graphs to natural language, Maissam Barkeshli et.al., Paper: http://arxiv.org/abs/2601.10684
- 2026-02-20, On the Semantic and Syntactic Information Encoded in Proto-Tokens for One-Step Text Reconstruction, Ivan Bondarenko et.al., Paper: http://arxiv.org/abs/2602.18301
- 2026-01-29, On the Paradoxical Interference between Instruction-Following and Task Solving, Yunjia Qi et.al., Paper: http://arxiv.org/abs/2601.22047
- 2025-12-08, On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models, Charlie Zhang et.al., Paper: http://arxiv.org/abs/2512.07783
- 2025-10-23, On the Detectability of LLM-Generated Text: What Exactly Is LLM-Generated Text?, Mingmeng Geng et.al., Paper: http://arxiv.org/abs/2510.20810
- 2026-02-20, On the “Induction Bias” in Sequence Models, M. Reza Ebrahimi et.al., Paper: http://arxiv.org/abs/2602.18333
- 2025-10-31, On Selecting Few-Shot Examples for LLM-based Code Vulnerability Detection, Md Abdul Hannan et.al., Paper: http://arxiv.org/abs/2510.27675
- 2026-03-19, On Optimizing Multimodal Jailbreaks for Spoken Language Models, Aravind Krishnan et.al., Paper: http://arxiv.org/abs/2603.19127
- 2026-03-12, On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents, Deyu Zou et.al., Paper: http://arxiv.org/abs/2603.12109
- 2025-11-25, On Evaluating LLM Alignment by Evaluating LLMs as Judges, Yixin Liu et.al., Paper: http://arxiv.org/abs/2511.20604
- 2026-02-24, On Data Engineering for Scaling LLM Terminal Capabilities, Renjie Pi et.al., Paper: http://arxiv.org/abs/2602.21193
- 2025-10-22, On Controlled Change: Generative AI’s Impact on Professional Authority in Journalism, Tomás Dodds et.al., Paper: http://arxiv.org/abs/2510.19792
- 2025-11-13, OmniVGGT: Omni-Modality Driven Visual Geometry Grounded, Haosong Peng et.al., Paper: http://arxiv.org/abs/2511.10560
- 2026-02-04, OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models, Yue Ding et.al., Paper: http://arxiv.org/abs/2602.04804
- 2026-02-09, OmniReview: A Large-scale Benchmark and LLM-enhanced Framework for Realistic Reviewer Recommendation, Yehua Huang et.al., Paper: http://arxiv.org/abs/2602.08896
- 2026-03-02, OmniRet: Efficient and High-Fidelity Omni Modality Retrieval, Chuong Huynh et.al., Paper: http://arxiv.org/abs/2603.02098
- 2026-03-02, OmniLottie: Generating Vector Animations via Parameterized Lottie Tokens, Yiying Yang et.al., Paper: http://arxiv.org/abs/2603.02138
- 2025-12-29, OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding, Keda Tao et.al., Paper: http://arxiv.org/abs/2512.23646
- 2026-03-06, Omni-Diffusion: Unified Multimodal Understanding and Generation with Masked Discrete Diffusion, Lijiang Li et.al., Paper: http://arxiv.org/abs/2603.06577
- 2026-02-12, Olmix: A Framework for Data Mixing Throughout LM Development, Mayee F. Chen et.al., Paper: http://arxiv.org/abs/2602.12237
- 2026-03-24, Off-Policy Value-Based Reinforcement Learning for Large Language Models, Peng-Yuan Wang et.al., Paper: http://arxiv.org/abs/2603.23355
- 2026-03-05, Observing and Controlling Features in Vision-Language-Action Models, Hugo Buurmeijer et.al., Paper: http://arxiv.org/abs/2603.05487
- 2026-01-08, Observations and Remedies for Large Language Model Bias in Self-Consuming Performative Loop, Yaxuan Wang et.al., Paper: http://arxiv.org/abs/2601.05184
- 2026-03-14, OasisSimp: An Open-source Asian-English Sentence Simplification Dataset, Hannah Liu et.al., Paper: http://arxiv.org/abs/2603.14111
- 2026-03-11, OSUM-Pangu: An Open-Source Multidimension Speech Understanding Foundation Model Built upon OpenPangu on Ascend NPUs, Yujie Liao et.al., Paper: http://arxiv.org/abs/2603.10862
- 2026-01-12, OS-Symphony: A Holistic Framework for Robust and Generalist Computer-Using Agent, Bowen Yang et.al., Paper: http://arxiv.org/abs/2601.07779
- 2026-03-05, ORMOT: A Dataset and Framework for Omnidirectional Referring Multi-Object Tracking, Sijia Chen et.al., Paper: http://arxiv.org/abs/2603.05384
- 2025-10-31, ORGEval: Graph-Theoretic Evaluation of LLMs in Optimization Modeling, Zhuohan Wang et.al., Paper: http://arxiv.org/abs/2510.27610
- 2026-04-01, ORBIT: Scalable and Verifiable Data Generation for Search Agents on a Tight Budget, Nandan Thakur et.al., Paper: http://arxiv.org/abs/2604.01195
- 2026-02-19, ODESteer: A Unified ODE-Based Steering Framework for LLM Alignment, Hongjue Zhao et.al., Paper: http://arxiv.org/abs/2602.17560
- 2026-03-11, Nurture-First Agent Development: Building Domain-Expert AI Agents Through Conversational Knowledge Crystallization, Linghao Zhang et.al., Paper: http://arxiv.org/abs/2603.10808
- 2026-01-30, Now You Hear Me: Audio Narrative Attacks Against Large Audio-Language Models, Ye Yu et.al., Paper: http://arxiv.org/abs/2601.23255
- 2026-02-23, NovaPlan: Zero-Shot Long-Horizon Manipulation via Closed-Loop Video Language Planning, Jiahui Fu et.al., Paper: http://arxiv.org/abs/2602.20119
- 2026-03-16, Not All Invariants Are Equal: Curating Training Data to Accelerate Program Verification with SLMs, Ido Pinto et.al., Paper: http://arxiv.org/abs/2603.15510
- 2024-05-31, Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models, Xudong Lu et.al., Paper: http://arxiv.org/abs/2402.14800
- 2026-03-10, Nonparametric Variational Differential Privacy via Embedding Parameter Clipping, Dina El Zein et.al., Paper: http://arxiv.org/abs/2603.09583
- 2025-10-21, Noise-Conditioned Mixture-of-Experts Framework for Robust Speaker Verification, Bin Gu et.al., Paper: http://arxiv.org/abs/2510.18533
- 2026-03-22, NoOVD: Novel Category Discovery and Embedding for Open-Vocabulary Object Detection, Yupeng Zhang et.al., Paper: http://arxiv.org/abs/2603.21069
- 2026-02-25, NoLan: Mitigating Object Hallucinations in Large Vision-Language Models via Dynamic Suppression of Language Priors, Lingfeng Ren et.al., Paper: http://arxiv.org/abs/2602.22144
- 2026-03-03, No Memorization, No Detection: Output Distribution-Based Contamination Detection in Small Language Models, Omer Sela et.al., Paper: http://arxiv.org/abs/2603.03203
- 2025-12-09, No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers, Damiano Marsili et.al., Paper: http://arxiv.org/abs/2512.08889
- 2026-02-09, Next Concept Prediction in Discrete Latent Space Leads to Stronger Language Models, Yuliang Liu et.al., Paper: http://arxiv.org/abs/2602.08984
- 2026-01-20, NewsRECON: News article REtrieval for image CONtextualization, Jonathan Tonglet et.al., Paper: http://arxiv.org/abs/2601.14121
- 2026-03-13, Neuron-Aware Data Selection In Instruction Tuning For Large Language Models, Xin Chen et.al., Paper: http://arxiv.org/abs/2603.13201
- 2026-02-04, NeuroCanvas: VLLM-Powered Robust Seizure Detection by Reformulating Multichannel EEG as Image, Yan Chen et.al., Paper: http://arxiv.org/abs/2602.04769
- 2026-04-02, Neuro-RIT: Neuron-Guided Instruction Tuning for Robust Retrieval-Augmented Language Model, Jaemin Kim et.al., Paper: http://arxiv.org/abs/2604.02194
- 2025-12-04, NeuralRemaster: Phase-Preserving Diffusion for Structure-Aligned Generation, Yu Zeng et.al., Paper: http://arxiv.org/abs/2512.05106
- 2026-02-17, Neural Scaling Laws for Boosted Jet Tagging, Matthias Vigl et.al., Paper: http://arxiv.org/abs/2602.15781
- 2026-01-27, Neural Neural Scaling Laws, Michael Y. Hu et.al., Paper: http://arxiv.org/abs/2601.19831
- 2025-10-23, Neural Diversity Regularizes Hallucinations in Small Models, Kushal Chakrabarti et.al., Paper: http://arxiv.org/abs/2510.20690
- 2025-11-06, Neural Computation Without Slots: Steps Towards Biologically Plausible Memory and Attention in Natural and Artificial Intelligence, Shaunak Bhandarkar et.al., Paper: http://arxiv.org/abs/2511.04593
- 2026-01-16, Neural Chain-of-Thought Search: Searching the Optimal Reasoning Path to Enhance Large Language Models, Guoming Ling et.al., Paper: http://arxiv.org/abs/2601.11340
- 2025-11-20, Nemotron Elastic: Towards Efficient Many-in-One Reasoning LLMs, Ali Taghibakhshi et.al., Paper: http://arxiv.org/abs/2511.16664
- 2025-11-18, Near-Lossless Model Compression Enables Longer Context Inference in DNA Large Language Models, Rui Zhu et.al., Paper: http://arxiv.org/abs/2511.14694
- 2026-03-13, Navig-AI-tion: Navigation by Contextual AI and Spatial Audio, Mathias N. Lystbæk et.al., Paper: http://arxiv.org/abs/2603.13200
- 2026-01-06, NavAI: A Generalizable LLM Framework for Navigation Tasks in Virtual Reality Environments, Xue Qin et.al., Paper: http://arxiv.org/abs/2601.03251
- 2025-12-11, Natural Language Interface for Firewall Configuration, F. Taghiyev et.al., Paper: http://arxiv.org/abs/2512.10789
- 2026-01-13, Nationality and Region Prediction from Names: A Comparative Study of Neural Models and Large Language Models, Keito Inoshita et.al., Paper: http://arxiv.org/abs/2601.08692
- 2026-02-23, NanoKnow: How to Know What Your Language Model Knows, Lingwei Gu et.al., Paper: http://arxiv.org/abs/2602.20122
- 2026-02-06, NanoFLUX: Distillation-Driven Compression of Large Text-to-Image Generation Models for Mobile Devices, Ruchika Chavhan et.al., Paper: http://arxiv.org/abs/2602.06879
- 2026-03-02, Nano-EmoX: Unifying Multimodal Emotional Intelligence from Perception to Empathy, Jiahao Huang et.al., Paper: http://arxiv.org/abs/2603.02123
- 2026-03-05, NL2GDS: LLM-aided interface for Open Source Chip Design, Max Eland et.al., Paper: http://arxiv.org/abs/2603.05489
- 2026-02-25, NESTOR: A Nested MOE-based Neural Operator for Large-Scale PDE Pre-Training, Dengdi Sun et.al., Paper: http://arxiv.org/abs/2602.22059
- 2026-03-10, N-gram-like Language Models Predict Reading Time Best, James A. Michaelov et.al., Paper: http://arxiv.org/abs/2603.09872
- 2025-11-14, Multistability of Self-Attention Dynamics in Transformers, Claudio Altafini et.al., Paper: http://arxiv.org/abs/2511.11553
- 2025-12-12, Multiscale Causal Geometric Deep Learning for Modeling Brain Structure, Chengzhi Xia et.al., Paper: http://arxiv.org/abs/2512.11738
- 2026-01-13, Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge, Yao Tang et.al., Paper: http://arxiv.org/abs/2601.08808
- 2025-10-29, Multimodal Spatial Reasoning in the Large Model Era: A Survey and Benchmarks, Xu Zheng et.al., Paper: http://arxiv.org/abs/2510.25760
- 2025-12-18, Multimodal RewardBench 2: Evaluating Omni Reward Models for Interleaved Text and Image, Yushi Hu et.al., Paper: http://arxiv.org/abs/2512.16899
- 2026-03-06, Multimodal Large Language Models as Image Classifiers, Nikita Kisel et.al., Paper: http://arxiv.org/abs/2603.06578
- 2025-12-22, Multimodal LLMs for Historical Dataset Construction from Archival Image Scans: German Patents (1877-1918), Niclas Griesshaber et.al., Paper: http://arxiv.org/abs/2512.19675
- 2026-01-22, Multimodal Climate Disinformation Detection: Integrating Vision-Language Models with External Knowledge Sources, Marzieh Adeli Shamsabad et.al., Paper: http://arxiv.org/abs/2601.16108
- 2026-02-23, Multilingual Large Language Models do not comprehend all natural languages to equal degrees, Natalia Moskvina et.al., Paper: http://arxiv.org/abs/2602.20065
- 2025-12-29, Multilingual Hidden Prompt Injection Attacks on LLM-Based Academic Reviewing, Panagiotis Theocharopoulos et.al., Paper: http://arxiv.org/abs/2512.23684
- 2025-10-31, Multilingual BERT language model for medical tasks: Evaluation on domain-specific adaptation and cross-linguality, Yinghao Luo et.al., Paper: http://arxiv.org/abs/2510.27552
- 2026-03-19, MultihopSpatial: Multi-hop Compositional Spatial Reasoning Benchmark for Vision-Language Model, Youngwan Lee et.al., Paper: http://arxiv.org/abs/2603.18892
- 2025-11-05, MultiZebraLogic: A Multilingual Logical Reasoning Benchmark, Sofie Helene Bruun et.al., Paper: http://arxiv.org/abs/2511.03553
- 2024-11-06, Multi-modal Preference Alignment Remedies Degradation of Visual Instruction Tuning on Language Models, Shengzhi Li et.al., Paper: http://arxiv.org/abs/2402.10884
- 2026-02-04, Multi-layer Cross-Attention is Provably Optimal for Multi-modal In-context Learning, Nicholas Barnfield et.al., Paper: http://arxiv.org/abs/2602.04872
- 2026-01-21, Multi-context principal component analysis, Kexin Wang et.al., Paper: http://arxiv.org/abs/2601.15239
- 2026-02-05, Multi-Token Prediction via Self-Distillation, John Kirchenbauer et.al., Paper: http://arxiv.org/abs/2602.06019
- 2025-11-19, Multi-Stage Residual-Aware Unsupervised Deep Learning Framework for Consistent Ultrasound Strain Elastography, Shourov Joarder et.al., Paper: http://arxiv.org/abs/2511.15640
- 2026-01-06, Multi-RADS Synthetic Radiology Report Dataset and Head-to-Head Benchmarking of 41 Open-Weight and Proprietary Language Models, Kartik Bose et.al., Paper: http://arxiv.org/abs/2601.03232
- 2025-12-04, Multi-LLM Collaboration for Medication Recommendation, Huascar Sanchez et.al., Paper: http://arxiv.org/abs/2512.05066
- 2026-03-02, Multi-Head Low-Rank Attention, Songtao Liu et.al., Paper: http://arxiv.org/abs/2603.02188
- 2026-02-04, Multi-Head LatentMoE and Head Parallel: Communication-Efficient and Deterministic MoE Parallelism, Chenwei Cui et.al., Paper: http://arxiv.org/abs/2602.04870
- 2025-12-11, Multi-Granular Node Pruning for Circuit Discovery, Muhammad Umair Haider et.al., Paper: http://arxiv.org/abs/2512.10903
- 2026-03-14, Multi-Grained Vision-Language Alignment for Domain Generalized Person Re-Identification, Jiachen Li et.al., Paper: http://arxiv.org/abs/2603.14012
- 2025-12-23, Multi-Grained Text-Guided Image Fusion for Multi-Exposure and Multi-Focus Scenarios, Mingwei Tang et.al., Paper: http://arxiv.org/abs/2512.20556
- 2025-11-26, Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following, Tianyi Xiong et.al., Paper: http://arxiv.org/abs/2511.21662
- 2026-03-07, Multi-Agentic AI for Conflict-Aware rApp Policy Orchestration in Open RAN, Haiyuan Li et.al., Paper: http://arxiv.org/abs/2603.07375
- 2026-04-02, Multi-Agent Video Recommenders: Evolution, Patterns, and Open Challenges, Srivaths Ranganathan et.al., Paper: http://arxiv.org/abs/2604.02211
- 2026-04-01, Multi-Agent LLM Governance for Safe Two-Timescale Reinforcement Learning in SDN-IoT Defense, Saeid Jamshidi et.al., Paper: http://arxiv.org/abs/2604.01127
- 2025-10-28, Multi-Agent Evolve: LLM Self-Improve through Co-evolution, Yixing Chen et.al., Paper: http://arxiv.org/abs/2510.23595
- 2025-11-21, Moving superfluids in the rotating universe, Jose Beltrán Jiménez et.al., Paper: http://arxiv.org/abs/2511.17472
- 2026-02-26, MovieTeller: Tool-augmented Movie Synopsis with ID Consistent Progressive Abstraction, Yizhi Li et.al., Paper: http://arxiv.org/abs/2602.23228
- 2026-03-13, MotionAnymesh: Physics-Grounded Articulation for Simulation-Ready Digital Twins, WenBo Xu et.al., Paper: http://arxiv.org/abs/2603.12936
- 2026-03-19, Motion-o: Trajectory-Grounded Video Reasoning, Bishoy Galoaa et.al., Paper: http://arxiv.org/abs/2603.18856
- 2026-01-26, MortalMATH: Evaluating the Conflict Between Reasoning Objectives and Emergency Contexts, Etienne Lanzeray et.al., Paper: http://arxiv.org/abs/2601.18790
- 2025-11-21, MorphSeek: Fine-grained Latent Representation-Level Policy Optimization for Deformable Image Registration, Runxun Zhang et.al., Paper: http://arxiv.org/abs/2511.17392
- 2026-03-10, More than the Sum: Panorama-Language Models for Adverse Omni-Scenes, Weijia Fan et.al., Paper: http://arxiv.org/abs/2603.09573
- 2026-01-12, More Images, More Problems? A Controlled Analysis of VLM Failure Modes, Anurag Das et.al., Paper: http://arxiv.org/abs/2601.07812
- 2025-10-24, MoniTor: Exploiting Large Language Models with Instruction for Online Video Anomaly Detection, Shengtian Yang et.al., Paper: http://arxiv.org/abs/2510.21449
- 2025-12-18, MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning, Yuanchen Ju et.al., Paper: http://arxiv.org/abs/2512.16909
- 2026-03-17, MolmoB0T: Large-Scale Simulation Enables Zero-Shot Manipulation, Abhay Deshpande et.al., Paper: http://arxiv.org/abs/2603.16861
- 2026-01-15, Molmo2: Open Weights and Data for Vision-Language Models with Video Understanding and Grounding, Christopher Clark et.al., Paper: http://arxiv.org/abs/2601.10611
- 2026-01-21, MolecularIQ: Characterizing Chemical Reasoning Capabilities Through Symbolic Verification on Molecular Graphs, Christoph Bartmann et.al., Paper: http://arxiv.org/abs/2601.15279
- 2026-01-07, Modular Prompt Optimization: Optimizing Structured Prompts with Section-Local Textual Gradients, Prith Sharma et.al., Paper: http://arxiv.org/abs/2601.04055
- 2025-10-24, Modest-Align: Data-Efficient Alignment for Vision-Language Models, Jiaxiang Liu et.al., Paper: http://arxiv.org/abs/2510.21606
- 2025-12-31, Modeling Language as a Sequence of Thoughts, Nasim Borazjanizadeh et.al., Paper: http://arxiv.org/abs/2512.25026
- 2026-01-13, Modeling LLM Agent Reviewer Dynamics in Elo-Ranked Review System, Hsiang-Wei Huang et.al., Paper: http://arxiv.org/abs/2601.08829
- 2026-02-19, Modeling Distinct Human Interaction in Web Agents, Faria Huq et.al., Paper: http://arxiv.org/abs/2602.17588
- 2025-11-06, Modeling Clinical Uncertainty in Radiology Reports: from Explicit Uncertainty Markers to Implicit Reasoning Pathways, Paloma Rabaey et.al., Paper: http://arxiv.org/abs/2511.04506
- 2026-03-10, Model Merging in the Era of Large Language Models: Methods, Applications, and Future Directions, Mingyang Song et.al., Paper: http://arxiv.org/abs/2603.09938
- 2026-03-06, MoEless: Efficient MoE LLM Serving via Serverless Computing, Hanfei Yu et.al., Paper: http://arxiv.org/abs/2603.06350
- 2026-03-06, MoEMambaMIL: Structure-Aware Selective State Space Modeling for Whole-Slide Image Analysis, Dongqing Xie et.al., Paper: http://arxiv.org/abs/2603.06378
- 2026-03-13, MoEKD: Mixture-of-Experts Knowledge Distillation for Robust and High-Performing Compressed Code Models, Md. Abdul Awal et.al., Paper: http://arxiv.org/abs/2603.13213
- 2026-01-08, MoE3D: A Mixture-of-Experts Module for 3D Reconstruction, Zichen Wang et.al., Paper: http://arxiv.org/abs/2601.05208
- 2025-12-23, MoE-DiffuSeq: Enhancing Long-Document Diffusion Models with Sparse Attention and Mixture of Experts, Alexandros Christoforos et.al., Paper: http://arxiv.org/abs/2512.20604
- 2025-11-19, MoDES: Accelerating Mixture-of-Experts Multimodal Large Language Models via Dynamic Expert Skipping, Yushi Huang et.al., Paper: http://arxiv.org/abs/2511.15690
- 2026-03-03, MoD-DPO: Towards Mitigating Cross-modal Hallucinations in Omni LLMs using Modality Decoupled Preference Optimization, Ashutosh Chaubey et.al., Paper: http://arxiv.org/abs/2603.03192
- 2026-01-23, Mixture-of-Models: Unifying Heterogeneous Agents via N-Way Self-Evaluating Deliberation, Tims Pecerskis et.al., Paper: http://arxiv.org/abs/2601.16863
- 2026-03-16, Mixture-of-Depths Attention, Lianghui Zhu et.al., Paper: http://arxiv.org/abs/2603.15619
- 2026-03-23, Mixture of Mini Experts: Overcoming the Linear Layer Bottleneck in Multiple Instance Learning, Daniel Shao et.al., Paper: http://arxiv.org/abs/2603.22198
- 2025-11-24, Mixture of Horizons in Action Chunking, Dong Jing et.al., Paper: http://arxiv.org/abs/2511.19433
- 2026-03-22, Mixture of Chapters: Scaling Learnt Memory in Transformers, Tasmay Pankaj Tibrewal et.al., Paper: http://arxiv.org/abs/2603.21096
- 2025-10-23, Mixing Importance with Diversity: Joint Optimization for KV Cache Compression in Large Vision-Language Models, Xuyang Liu et.al., Paper: http://arxiv.org/abs/2510.20707
- 2026-01-13, MixServe: An Automatic Distributed Serving System for MoE Models with Hybrid Parallelism Based on Fused Communication Algorithm, Bowen Zhou et.al., Paper: http://arxiv.org/abs/2601.08800
- 2025-12-10, Mitigating Social Bias in English and Urdu Language Models Using PRM-Guided Candidate Selection and Sequential Refinement, Muneeb Ur Raheem Khan et.al., Paper: http://arxiv.org/abs/2512.09854
- 2026-03-22, Mitigating Selection Bias in Large Language Models via Permutation-Aware GRPO, Jinquan Zheng et.al., Paper: http://arxiv.org/abs/2603.21016
- 2026-02-26, Mitigating Legibility Tax with Decoupled Prover-Verifier Games, Yegon Kim et.al., Paper: http://arxiv.org/abs/2602.23248
- 2026-03-18, Mitigating LLM Hallucinations through Domain-Grounded Tiered Retrieval, Md. Asraful Haque et.al., Paper: http://arxiv.org/abs/2603.17872
- 2026-04-02, Mining Instance-Centric Vision-Language Contexts for Human-Object Interaction Detection, Soo Won Seo et.al., Paper: http://arxiv.org/abs/2604.02071
- 2025-10-27, Minimizing Human Intervention in Online Classification, William Réveillard et.al., Paper: http://arxiv.org/abs/2510.23557
- 2026-03-10, MiniAppBench: Evaluating the Shift from Text to Interactive HTML Responses in LLM-Powered Assistants, Zuhao Zhang et.al., Paper: http://arxiv.org/abs/2603.09652
- 2025-12-15, MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning, Haoyu Fu et.al., Paper: http://arxiv.org/abs/2512.13636
- 2025-12-24, MiST: Understanding the Role of Mid-Stage Scientific Training in Developing Chemical Reasoning Models, Andres M Bran et.al., Paper: http://arxiv.org/abs/2512.21231
- 2026-01-29, MetricAnything: Scaling Metric Depth Pretraining with Noisy Heterogeneous Sources, Baorui Ma et.al., Paper: http://arxiv.org/abs/2601.22054
- 2026-01-21, Metadata Conditioned Large Language Models for Localization, Anjishnu Mukherjee et.al., Paper: http://arxiv.org/abs/2601.15236
- 2026-03-09, MetaWorld-X: Hierarchical World Modeling via VLM-Orchestrated Experts for Humanoid Loco-Manipulation, Yutong Shen et.al., Paper: http://arxiv.org/abs/2603.08572
- 2026-02-02, MetaCLASS: Metacognitive Coaching for Learning with Adaptive Self-regulation Support, Naiming Liu et.al., Paper: http://arxiv.org/abs/2602.02457
- 2025-12-18, Meta-RL Induces Exploration in Language Agents, Yulun Jiang et.al., Paper: http://arxiv.org/abs/2512.16848
- 2026-02-02, MentisOculi: Revealing the Limits of Reasoning with Mental Imagery, Jana Zeller et.al., Paper: http://arxiv.org/abs/2602.02465
- 2026-03-13, Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation, Yifeng Liu et.al., Paper: http://arxiv.org/abs/2603.13045
- 2026-02-13, Memory-Efficient Structured Backpropagation for On-Device LLM Fine-Tuning, Juneyoung Park et.al., Paper: http://arxiv.org/abs/2602.13069
- 2026-03-25, Memory-Augmented Vision-Language Agents for Persistent and Semantically Consistent Object Captioning, Tommaso Galliena et.al., Paper: http://arxiv.org/abs/2603.24257
- 2026-02-27, Memory Caching: RNNs with Growing Memory, Ali Behrouz et.al., Paper: http://arxiv.org/abs/2602.24281
- 2026-01-02, Memory Bank Compression for Continual Adaptation of Large Language Models, Thomas Katraouras et.al., Paper: http://arxiv.org/abs/2601.00756
- 2026-03-20, Memori: A Persistent Memory Layer for Efficient, Context-Aware LLM Agents, Luiz C. Borro et.al., Paper: http://arxiv.org/abs/2603.19935
- 2026-03-04, Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory, Zhenting Wang et.al., Paper: http://arxiv.org/abs/2603.04257
- 2026-02-02, MemSkill: Learning and Evolving Memory Skills for Self-Evolving Agents, Haozhen Zhang et.al., Paper: http://arxiv.org/abs/2602.02474
- 2026-01-06, MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory, Shengtao Zhang et.al., Paper: http://arxiv.org/abs/2601.03192
- 2026-03-23, MemDLM: Memory-Enhanced DLM Training, Zehua Pei et.al., Paper: http://arxiv.org/abs/2603.22241
- 2026-01-28, MemCtrl: Using MLLMs as Active Memory Controllers on Embodied Agents, Vishnu Sashank Dorbala et.al., Paper: http://arxiv.org/abs/2601.20831
- 2025-11-28, MegaChat: A Synthetic Persian Q&A Dataset for High-Quality Sales Chatbot Evaluation, Mahdi Rahmani et.al., Paper: http://arxiv.org/abs/2511.23397
- 2026-02-26, MediX-R1: Open Ended Medical Reinforcement Learning, Sahal Shaji Mullappilly et.al., Paper: http://arxiv.org/abs/2602.23363
- 2026-03-20, MedSPOT: A Workflow-Aware Sequential Grounding Benchmark for Clinical GUI, Rozain Shakeel et.al., Paper: http://arxiv.org/abs/2603.19993
- 2025-11-25, MedROV: Towards Real-Time Open-Vocabulary Detection Across Diverse Medical Imaging Modalities, Tooba Tehreem Sheikh et.al., Paper: http://arxiv.org/abs/2511.20650
- 2026-03-20, MedQ-Engine: A Closed-Loop Data Engine for Evolving MLLMs in Medical Image Quality Assessment, Jiyao Liu et.al., Paper: http://arxiv.org/abs/2603.19863
- 2026-03-24, MedObvious: Exposing the Medical Moravec’s Paradox in VLMs via Clinical Triage, Ufaq Khan et.al., Paper: http://arxiv.org/abs/2603.23501
- 2026-02-06, MedMO: Grounding and Understanding Multimodal Large Language Model for Medical Images, Ankan Deria et.al., Paper: http://arxiv.org/abs/2602.06965
- 2026-03-10, MedMASLab: A Unified Orchestration Framework for Benchmarking Multimodal Medical Multi-Agent Systems, Yunhang Qian et.al., Paper: http://arxiv.org/abs/2603.09909
- 2025-12-10, MedForget: Hierarchy-Aware Multimodal Unlearning Testbed for Medical AI, Fengli Wu et.al., Paper: http://arxiv.org/abs/2512.09867
- 2026-03-24, MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models, Jianxin Lin et.al., Paper: http://arxiv.org/abs/2603.23085
- 2026-03-17, MedCL-Bench: Benchmarking stability-efficiency trade-offs and scaling in biomedical continual learning, Min Zeng et.al., Paper: http://arxiv.org/abs/2603.16738
- 2025-12-15, MedCEG: Reinforcing Verifiable Medical Reasoning with Critical Evidence Graph, Linjie Mu et.al., Paper: http://arxiv.org/abs/2512.13510
- 2025-11-20, MedBayes-Lite: Bayesian Uncertainty Quantification for Safe Clinical Decision Support, Elias Hossain et.al., Paper: http://arxiv.org/abs/2511.16625
- 2025-10-21, Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents, Guangfu Guo et.al., Paper: http://arxiv.org/abs/2510.18424
- 2026-03-05, Med-V1: Small Language Models for Zero-shot and Scalable Biomedical Evidence Attribution, Qiao Jin et.al., Paper: http://arxiv.org/abs/2603.05308
- 2026-01-30, Med-Scout: Curing MLLMs’ Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training, Anglin Liu et.al., Paper: http://arxiv.org/abs/2601.23220
- 2026-03-16, Mechanistic Origin of Moral Indifference in Language Models, Lingyu Li et.al., Paper: http://arxiv.org/abs/2603.15615
- 2025-12-05, Mechanistic Interpretability of Antibody Language Models Using SAEs, Rebonto Haque et.al., Paper: http://arxiv.org/abs/2512.05794
- 2026-01-26, Mechanistic Analysis of Catastrophic Forgetting in Large Language Models During Continual Fine-tuning, Olaf Yunus Laitinen Imanov et.al., Paper: http://arxiv.org/abs/2601.18699
- 2026-01-08, Mechanisms of Prompt-Induced Hallucination in Vision-Language Models, William Rudman et.al., Paper: http://arxiv.org/abs/2601.05201
- 2025-10-31, Mechanics of Learned Reasoning 1: TempoBench, A Benchmark for Interpretable Deconstruction of Reasoning System Performance, Nikolaus Holzer et.al., Paper: http://arxiv.org/abs/2510.27544
- 2026-03-25, Mechanic: Sorrifier-Driven Formal Decomposition Workflow for Automated Theorem Proving, Ruichen Qiu et.al., Paper: http://arxiv.org/abs/2603.24465
- 2026-01-22, Mecellem Models: Turkish Models Trained from Scratch and Continually Pre-trained for the Legal Domain, Özgür Uğur et.al., Paper: http://arxiv.org/abs/2601.16018
- 2026-01-08, Measuring and Fostering Peace through Machine Learning and Artificial Intelligence, P. Gilda et.al., Paper: http://arxiv.org/abs/2601.05232
- 2026-03-19, Measuring and Exploiting Confirmation Bias in LLM-Assisted Security Code Review, Dimitris Mitropoulos et.al., Paper: http://arxiv.org/abs/2603.18740
- 2026-03-26, Measuring What Matters – or What’s Convenient?: Robustness of LLM-Based Scoring Systems to Construct-Irrelevant Factors, Cole Walsh et.al., Paper: http://arxiv.org/abs/2603.25674
- 2026-03-14, Measuring Weather Effects and Link Quality Dynamics in LEO Satellite Networks, Clemens Lottermoser et.al., Paper: http://arxiv.org/abs/2603.14008
- 2026-02-18, Measuring Mid-2025 LLM-Assistance on Novice Performance in Biology, Shen Zhou Hong et.al., Paper: http://arxiv.org/abs/2602.16703
- 2026-03-20, Measuring Faithfulness Depends on How You Measure: Classifier Sensitivity in LLM Chain-of-Thought Evaluation, Richard J. Young et.al., Paper: http://arxiv.org/abs/2603.20172
- 2025-11-18, Measuring AI Progress in Drug Discovery: A Reproducible Leaderboard for the Tox21 Challenge, Antonia Ebner et.al., Paper: http://arxiv.org/abs/2511.14744
- 2026-03-19, Meanings and Measurements: Multi-Agent Probabilistic Grounding for Vision-Language Navigation, Swagat Padhan et.al., Paper: http://arxiv.org/abs/2603.19166
- 2025-11-26, Mean-field Modelling of Moiré Materials: A User’s Guide with Selected Applications to Twisted Bilayer Graphene, Yves H. Kwan et.al., Paper: http://arxiv.org/abs/2511.21683
- 2026-01-06, Maximizing Local Entropy Where It Matters: Prefix-Aware Localized LLM Unlearning, Naixin Zhai et.al., Paper: http://arxiv.org/abs/2601.03190
- 2025-11-13, Maximizing Efficiency of Dataset Compression for Machine Learning Potentials With Information Theory, Benjamin Yu et.al., Paper: http://arxiv.org/abs/2511.10561
- 2025-12-05, MaxShapley: Towards Incentive-compatible Generative Search with Fair Context Attribution, Sara Patel et.al., Paper: http://arxiv.org/abs/2512.05958
- 2025-11-26, Matrix: Peer-to-Peer Multi-Agent Synthetic Data Generation Framework, Dong Wang et.al., Paper: http://arxiv.org/abs/2511.21686
- 2026-01-02, Materials Informatics: Emergence To Autonomous Discovery In The Age Of AI, Turab Lookman et.al., Paper: http://arxiv.org/abs/2601.00742
- 2026-03-12, Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models, Samy Jelassi et.al., Paper: http://arxiv.org/abs/2603.12248
- 2026-01-15, MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching, Changle Qu et.al., Paper: http://arxiv.org/abs/2601.10712
- 2025-11-21, Masked-and-Reordered Self-Supervision for Reinforcement Learning from Verifiable Rewards, Zhen Wang et.al., Paper: http://arxiv.org/abs/2511.17473
- 2025-10-30, Masked Diffusion Captioning for Visual Feature Learning, Chao Feng et.al., Paper: http://arxiv.org/abs/2510.26799
- 2026-01-29, MasalBench: A Benchmark for Contextual and Cross-Cultural Understanding of Persian Proverbs in LLMs, Ghazal Kalhor et.al., Paper: http://arxiv.org/abs/2601.22050
- 2025-12-08, Mary, the Cheeseburger-Eating Vegetarian: Do LLMs Recognize Incoherence in Narratives?, Karin de Langis et.al., Paper: http://arxiv.org/abs/2512.07777
- 2025-12-03, MarkTune: Improving the Quality-Detectability Trade-off in Open-Weight LLM Watermarking, Yizhou Zhao et.al., Paper: http://arxiv.org/abs/2512.04044
- 2025-12-22, MapTrace: Scalable Data Generation for Route Tracing on Maps, Artemis Panagopoulou et.al., Paper: http://arxiv.org/abs/2512.19609
- 2025-11-25, MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models, Chieh-Yun Chen et.al., Paper: http://arxiv.org/abs/2511.20629
- 2026-01-16, Map2Thought: Explicit 3D Spatial Reasoning via Metric Cognitive Maps, Xiangjun Gao et.al., Paper: http://arxiv.org/abs/2601.11442
- 2025-12-31, Many Minds from One Model: Bayesian Transformers for Population Intelligence, Diji Yang et.al., Paper: http://arxiv.org/abs/2512.25063
- 2025-12-01, ManualVLA: A Unified VLA Model for Chain-of-Thought Manual Generation and Robotic Manipulation, Chenyang Gu et.al., Paper: http://arxiv.org/abs/2512.02013
- 2026-03-16, Mamba-3: Improved Sequence Modeling using State Space Principles, Aakash Lahoti et.al., Paper: http://arxiv.org/abs/2603.15569
- 2026-01-06, MalruleLib: Large-Scale Executable Misconception Reasoning with Step Traces for Modeling Student Thinking in Mathematics, Xinghe Chen et.al., Paper: http://arxiv.org/abs/2601.03217
- 2025-12-23, Making Large Language Models Efficient Dense Retrievers, Yibin Lei et.al., Paper: http://arxiv.org/abs/2512.20612
- 2023-09-06, Making Large Language Models Better Reasoners with Alignment, Peiyi Wang et.al., Paper: http://arxiv.org/abs/2309.02144
- 2025-10-21, MTraining: Distributed Dynamic Sparse Attention for Efficient Ultra-Long Context Training, Wenxuan Li et.al., Paper: http://arxiv.org/abs/2510.18830
- 2026-04-02, MTI: A Behavior-Based Temperament Profiling System for AI Agents, Jihoon Jeong et.al., Paper: http://arxiv.org/abs/2604.02145
- 2026-02-27, MT-PingEval: Evaluating Multi-Turn Collaboration with Private Information Games, Jacob Eisenstein et.al., Paper: http://arxiv.org/abs/2602.24188
- 2026-03-10, MSSR: Memory-Aware Adaptive Replay for Continual LLM Fine-Tuning, Yiyang Lu et.al., Paper: http://arxiv.org/abs/2603.09892
- 2025-10-24, MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization, Chenglong Wang et.al., Paper: http://arxiv.org/abs/2510.21473
- 2025-12-15, MMhops-R1: Multimodal Multi-hop Reasoning, Tao Zhang et.al., Paper: http://arxiv.org/abs/2512.13573
- 2025-11-21, MMT-ARD: Multimodal Multi-Teacher Adversarial Distillation for Robust Vision-Language Models, Yuqi Li et.al., Paper: http://arxiv.org/abs/2511.17448
- 2025-12-11, MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence, Jingli Lin et.al., Paper: http://arxiv.org/abs/2512.10863
- 2025-12-23, MMEDIT: A Unified Framework for Multi-Type Audio Editing via Audio Language Model, Ye Tao et.al., Paper: http://arxiv.org/abs/2512.20339
- 2026-03-10, MM-tau-p $^2$ : Persona-Adaptive Prompting for Robust Multi-Modal Agent Evaluation in Dual-Control Settings, Anupam Purwar et.al., Paper: http://arxiv.org/abs/2603.09643
- 2026-03-12, MM-CondChain: A Programmatically Verified Benchmark for Visually Grounded Deep Compositional Reasoning, Haozhan Shen et.al., Paper: http://arxiv.org/abs/2603.12266
- 2025-10-21, MLMA: Towards Multilingual with Mamba Based Architectures, Mohamed Nabih Ali et.al., Paper: http://arxiv.org/abs/2510.18684
- 2026-03-24, MLLM-HWSI: A Multimodal Large Language Model for Hierarchical Whole Slide Image Understanding, Basit Alawode et.al., Paper: http://arxiv.org/abs/2603.23067
- 2026-03-03, MIBURI: Towards Expressive Interactive Gesture Synthesis, M. Hamza Mughal et.al., Paper: http://arxiv.org/abs/2603.03282
- 2026-01-16, MHA2MLA-VLM: Enabling DeepSeek’s Economical Multi-Head Latent Attention across Vision-Language Models, Xiaoran Fan et.al., Paper: http://arxiv.org/abs/2601.11464
- 2025-10-21, MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models, ChangSu Choi et.al., Paper: http://arxiv.org/abs/2510.18383
- 2026-02-02, MEG-XL: Data-Efficient Brain-to-Text via Long-Context Pre-Training, Dulhan Jayalath et.al., Paper: http://arxiv.org/abs/2602.02494
- 2025-11-21, MCMoE: Completing Missing Modalities with Mixture of Experts for Incomplete Multimodal Action Quality Assessment, Huangbiao Xu et.al., Paper: http://arxiv.org/abs/2511.17397
- 2026-03-07, MAviS: A Multimodal Conversational Assistant For Avian Species, Yevheniia Kryklyvets et.al., Paper: http://arxiv.org/abs/2603.07294
- 2026-02-19, MASPO: Unifying Gradient Utilization, Probability Mass, and Signal Reliability for Robust and Sample-Efficient LLM Reasoning, Xiaoliang Fu et.al., Paper: http://arxiv.org/abs/2602.17550
- 2026-03-23, MARCUS: An agentic, multimodal vision-language model for cardiac diagnosis and management, Jack W O’Sullivan et.al., Paper: http://arxiv.org/abs/2603.22179
- 2026-03-25, MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination, Zhuo Li et.al., Paper: http://arxiv.org/abs/2603.24579
- 2025-10-31, MARAG-R1: Beyond Single Retriever via Reinforcement-Learned Multi-Tool Agentic Retrieval, Qi Luo et.al., Paper: http://arxiv.org/abs/2510.27569
- 2025-12-31, MAMA-Memeia! Multi-Aspect Multi-Agent Collaboration for Depressive Symptoms Identification in Memes, Siddhant Agarwal et.al., Paper: http://arxiv.org/abs/2512.25015
- 2026-01-06, MAGMA: A Multi-Graph based Agentic Memory Architecture for AI Agents, Dongming Jiang et.al., Paper: http://arxiv.org/abs/2601.03236
- 2026-02-16, MAC-AMP: A Closed-Loop Multi-Agent Collaboration System for Multi-Objective Antimicrobial Peptide Design, Gen Zhou et.al., Paper: http://arxiv.org/abs/2602.14926
- 2025-12-05, M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG, David Anugraha et.al., Paper: http://arxiv.org/abs/2512.05959
- 2025-12-10, M3Net: A Multi-Metric Mixture of Experts Network Digital Twin with Graph Neural Networks, Blessed Guda et.al., Paper: http://arxiv.org/abs/2512.09797
- 2026-01-13, M3CoTBench: Benchmark Chain-of-Thought of MLLMs in Medical Image Understanding, Juntao Jiang et.al., Paper: http://arxiv.org/abs/2601.08758
- 2026-03-20, LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation, Jiazheng Xing et.al., Paper: http://arxiv.org/abs/2603.20192
- 2025-12-02, Lumos: Let there be Language Model System Certification, Isha Chaudhary et.al., Paper: http://arxiv.org/abs/2512.02966
- 2026-02-02, Lower bounds for multivariate independence polynomials and their generalisations, Joonkyung Lee et.al., Paper: http://arxiv.org/abs/2602.02450
- 2026-01-22, Low-altitude Multi-UAV-assisted Data Collection and Semantic Forwarding for Post-Disaster Relief, Xiaoya Zheng et.al., Paper: http://arxiv.org/abs/2601.16146
- 2023-06-27, Low-Rank Prune-And-Factorize for Language Model Compression, Siyu Ren et.al., Paper: http://arxiv.org/abs/2306.14152
- 2025-12-01, Low-Rank Prehab: Preparing Neural Networks for SVD Compression, Haoran Qin et.al., Paper: http://arxiv.org/abs/2512.01980
- 2025-10-30, Low-Altitude UAV-Carried Movable Antenna for Joint Wireless Power Transfer and Covert Communications, Chuang Zhang et.al., Paper: http://arxiv.org/abs/2510.26628
- 2026-01-20, Lost in the Prompt Order: Revealing the Limitations of Causal Attention in Language Models, Hyunjong Ok et.al., Paper: http://arxiv.org/abs/2601.14152
- 2025-11-19, Lost in Vagueness: Towards Context-Sensitive Standards for Robustness Assessment under the EU AI Act, Roberta Tamponi et.al., Paper: http://arxiv.org/abs/2511.15620
- 2026-03-11, LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation, Jinwoo Ahn et.al., Paper: http://arxiv.org/abs/2603.10899
- 2025-12-24, LookPlanGraph: Embodied Instruction Following Method with VLM Graph Augmentation, Anatoly O. Onishchenko et.al., Paper: http://arxiv.org/abs/2512.21243
- 2025-11-18, Look-Ahead Reasoning on Learning Platforms, Haiqing Zhu et.al., Paper: http://arxiv.org/abs/2511.14745
- 2026-02-13, Look Inward to Explore Outward: Learning Temperature Policy from LLM Internal States via Hierarchical RL, Yixiao Zhou et.al., Paper: http://arxiv.org/abs/2602.13035
- 2025-12-26, Look Closer! An Adversarial Parametric Editing Framework for Hallucination Mitigation in VLMs, Jiayu Hu et.al., Paper: http://arxiv.org/abs/2512.21999
- 2026-03-02, LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards, Guanzheng Chen et.al., Paper: http://arxiv.org/abs/2603.02146
- 2026-03-29, LongCat-Next: Lexicalizing Modalities as Discrete Tokens, Meituan LongCat Team et.al., Paper: http://arxiv.org/abs/2603.27538
- 2026-01-23, LongCat-Flash-Thinking-2601 Technical Report, Meituan LongCat Team et.al., Paper: http://arxiv.org/abs/2601.16725
- 2026-03-22, LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning, Jianing Wang et.al., Paper: http://arxiv.org/abs/2603.21065
- 2026-03-12, Long-Context Encoder Models for Polish Language Understanding, Sławomir Dadas et.al., Paper: http://arxiv.org/abs/2603.12191
- 2026-02-16, Long Context, Less Focus: A Scaling Gap in LLMs Revealed through Privacy and Personalization, Shangding Gu et.al., Paper: http://arxiv.org/abs/2602.15028
- 2026-02-10, Long Chain-of-Thought Compression via Fine-Grained Group Policy Optimization, Xinchen Han et.al., Paper: http://arxiv.org/abs/2602.10048
- 2025-11-06, Logit-Entropy Adaptive Stopping Heuristic for Efficient Chain-of-Thought Reasoning, Mohammad Atif Quamar et.al., Paper: http://arxiv.org/abs/2511.04654
- 2025-11-25, LocateAnything3D: Vision-Language 3D Detection with Chain-of-Sight, Yunze Man et.al., Paper: http://arxiv.org/abs/2511.20648
- 2026-03-18, Loc3R-VLM: Language-based Localization and 3D Reasoning with Vision-Language Models, Kevin Qu et.al., Paper: http://arxiv.org/abs/2603.18002
- 2025-10-30, LoRAQuant: Mixed-Precision Quantization of LoRA to Ultra-Low Bits, Amir Reza Mirzaei et.al., Paper: http://arxiv.org/abs/2510.26690
- 2026-03-20, LoASR-Bench: Evaluating Large Speech Language Models on Low-Resource Automatic Speech Recognition Across Language Families, Jianan Chen et.al., Paper: http://arxiv.org/abs/2603.20042
- 2025-11-05, LiveTradeBench: Seeking Real-World Alpha with Large Language Models, Haofei Yu et.al., Paper: http://arxiv.org/abs/2511.03628
- 2025-12-29, LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation, Ethan Chern et.al., Paper: http://arxiv.org/abs/2512.23576
- 2025-11-07, LiveStar: Live Streaming Assistant for Real-World Online Video Understanding, Zhenyu Yang et.al., Paper: http://arxiv.org/abs/2511.05299
- 2025-11-17, Live-SWE-agent: Can Software Engineering Agents Self-Evolve on the Fly?, Chunqiu Steven Xia et.al., Paper: http://arxiv.org/abs/2511.13646
- 2026-01-14, LiteEmbed: Adapting CLIP to Rare Classes, Aishwarya Agarwal et.al., Paper: http://arxiv.org/abs/2601.09661
- 2025-11-07, Listening Between the Lines: Decoding Podcast Narratives with Language Modeling, Shreya Gupta et.al., Paper: http://arxiv.org/abs/2511.05310
- 2026-03-12, Linking Perception, Confidence and Accuracy in MLLMs, Yuetian Du et.al., Paper: http://arxiv.org/abs/2603.12149
- 2025-12-18, LinkedOut: Linking World Knowledge Representation Out of Video LLM for Next-Generation Video Recommendation, Haichao Zhang et.al., Paper: http://arxiv.org/abs/2512.16891
- 2026-01-28, Linear representations in language models can change dramatically over a conversation, Andrew Kyle Lampinen et.al., Paper: http://arxiv.org/abs/2601.20834
- 2025-12-19, Linear Personality Probing and Steering in LLMs: A Big Five Study, Michel Frising et.al., Paper: http://arxiv.org/abs/2512.17639
- 2026-01-28, Like a Therapist, But Not: Reddit Narratives of AI in Mental Health Contexts, Elham Aghakhani et.al., Paper: http://arxiv.org/abs/2601.20747
- 2025-10-27, Lightweight Robust Direct Preference Optimization, Cheol Woo Kim et.al., Paper: http://arxiv.org/abs/2510.23590
- 2026-04-01, Lightweight Prompt-Guided CLIP Adaptation for Monocular Depth Estimation, Reyhaneh Ahani Manghotay et.al., Paper: http://arxiv.org/abs/2604.01118
- 2026-01-21, Lightweight LLMs for Network Attack Detection in IoT Networks, Piyumi Bhagya Sudasinghe et.al., Paper: http://arxiv.org/abs/2601.15269
- 2025-12-23, LightTact: A Visual-Tactile Fingertip Sensor for Deformation-Independent Contact Sensing, Changyi Lin et.al., Paper: http://arxiv.org/abs/2512.20591
- 2026-01-20, LightOnOCR: A 1B End-to-End Multilingual Vision-Language Model for State-of-the-Art OCR, Said Taghadouini et.al., Paper: http://arxiv.org/abs/2601.14251
- 2025-10-21, LightMem: Lightweight and Efficient Memory-Augmented Generation, Jizhan Fang et.al., Paper: http://arxiv.org/abs/2510.18866
- 2026-03-12, LifeSim: Long-Horizon User Life Simulator for Personalized Assistant Evaluation, Feiyu Duan et.al., Paper: http://arxiv.org/abs/2603.12152
- 2025-12-24, Leveraging Lightweight Entity Extraction for Scalable Event-Based Image Retrieval, Dao Sy Duy Minh et.al., Paper: http://arxiv.org/abs/2512.21221
- 2025-11-24, Leveraging LLMs for reward function design in reinforcement learning control tasks, Franklin Cardenoso et.al., Paper: http://arxiv.org/abs/2511.19355
- 2026-03-24, Leveraging LLMs and Social Media to Understand User Perception of Smartphone-Based Earthquake Early Warnings, Hanjing Wang et.al., Paper: http://arxiv.org/abs/2603.23322
- 2026-03-05, Leveraging LLM Parametric Knowledge for Fact Checking without Retrieval, Artem Vazhentsev et.al., Paper: http://arxiv.org/abs/2603.05471
- 2025-12-29, Less is more: Probabilistic reduction is best explained by small-scale predictability measures, Cassandra L. Jacobs et.al., Paper: http://arxiv.org/abs/2512.23659
- 2026-03-25, LensWalk: Agentic Video Understanding by Planning How You See in Videos, Keliang Li et.al., Paper: http://arxiv.org/abs/2603.24558
- 2026-02-27, LemmaBench: A Live, Research-Level Benchmark to Evaluate LLM Capabilities in Mathematics, Antoine Peyronnet et.al., Paper: http://arxiv.org/abs/2602.24173
- 2026-03-05, Legal interpretation and AI: from expert systems to argumentation and LLMs, Václav Janeček et.al., Paper: http://arxiv.org/abs/2603.05392
- 2026-03-14, LegacyTranslate: LLM-based Multi-Agent Method for Legacy Code Translation, Zahra Moti et.al., Paper: http://arxiv.org/abs/2603.14054
- 2026-01-09, Left, Right, or Center? Evaluating LLM Framing in News Classification and Generation, Molly Kennedy et.al., Paper: http://arxiv.org/abs/2601.05835
- 2026-03-22, Left Behind: Cross-Lingual Transfer as a Bridge for Low-Resource Languages in Large Language Models, Abdul-Salem Beibitkhan et.al., Paper: http://arxiv.org/abs/2603.21036
- 2026-03-11, Leech Lattice Vector Quantization for Efficient LLM Compression, Tycho F. A. van der Ouderaa et.al., Paper: http://arxiv.org/abs/2603.11021
- 2025-10-23, Learning to Triage Taint Flows Reported by Dynamic Program Analysis in Node.js Packages, Ronghao Ni et.al., Paper: http://arxiv.org/abs/2510.20739
- 2025-11-20, Learning to Think Fast and Slow for Visual Language Models, Chenyu Lin et.al., Paper: http://arxiv.org/abs/2511.16670
- 2026-02-19, Learning to Stay Safe: Adaptive Regularization Against Safety Degradation during Fine-Tuning, Jyotin Goel et.al., Paper: http://arxiv.org/abs/2602.17546
- 2026-03-29, Learning to See through Illumination Extremes with Event Streaming in Multimodal Large Language Models, Baoheng Zhang et.al., Paper: http://arxiv.org/abs/2603.27558
- 2026-02-17, Learning to Retrieve Navigable Candidates for Efficient Vision-and-Language Navigation, Shutian Gu et.al., Paper: http://arxiv.org/abs/2602.15724
- 2025-12-23, Learning to Reason in 4D: Dynamic Spatial Understanding for Vision Language Models, Shengchao Zhou et.al., Paper: http://arxiv.org/abs/2512.20557
- 2025-10-27, Learning to Reason Efficiently with Discounted Reinforcement Learning, Alex Ayoub et.al., Paper: http://arxiv.org/abs/2510.23486
- 2025-12-03, Learning to Comparison-Shop, Jie Tang et.al., Paper: http://arxiv.org/abs/2512.04009
- 2025-10-27, Learning the PTM Code through a Coarse-to-Fine, Mechanism-Aware Framework, Jingjie Zhang et.al., Paper: http://arxiv.org/abs/2510.23492
- 2026-03-02, Learning from Synthetic Data Improves Multi-hop Reasoning, Anmol Kabra et.al., Paper: http://arxiv.org/abs/2603.02091
- 2026-03-18, Learning When to Attend: Conditional Memory Access for Long-Context LLMs, Sakshi Choudhary et.al., Paper: http://arxiv.org/abs/2603.17484
- 2026-03-03, Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use, Aradhye Agarwal et.al., Paper: http://arxiv.org/abs/2603.03205
- 2026-02-16, Learning User Interests via Reasoning and Distillation for Cross-Domain News Recommendation, Mengdan Zhu et.al., Paper: http://arxiv.org/abs/2602.15005
- 2026-03-12, Learning Transferable Sensor Models via Language-Informed Pretraining, Yuliang Chen et.al., Paper: http://arxiv.org/abs/2603.11950
- 2026-01-12, Learning Through Dialogue: Unpacking the Dynamics of Human-LLM Conversations on Political Issues, Shaz Furniturewala et.al., Paper: http://arxiv.org/abs/2601.07796
- 2026-02-16, Learning State-Tracking from Code Using Linear RNNs, Julien Siems et.al., Paper: http://arxiv.org/abs/2602.14814
- 2025-11-24, Learning Robust Social Strategies with Large Language Models, Dereck Piche et.al., Paper: http://arxiv.org/abs/2511.19405
- 2026-02-05, Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory, Haozhen Zhang et.al., Paper: http://arxiv.org/abs/2602.06025
- 2024-04-17, Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents, Renxi Wang et.al., Paper: http://arxiv.org/abs/2402.11651
- 2026-03-20, Learning Dynamic Belief Graphs for Theory-of-mind Reasoning, Ruxiao Chen et.al., Paper: http://arxiv.org/abs/2603.20170
- 2025-12-22, Learning Continuous Solvent Effects from Transient Flow Data: A Graph Neural Network Benchmark on Catechol Rearrangement, Hongsheng Xing et.al., Paper: http://arxiv.org/abs/2512.19530
- 2025-12-01, Learned-Rule-Augmented Large Language Model Evaluators, Jie Meng et.al., Paper: http://arxiv.org/abs/2512.01958
- 2026-01-07, Layer-wise Positional Bias in Short-Context Language Modeling, Maryam Rahimi et.al., Paper: http://arxiv.org/abs/2601.04098
- 2026-02-05, Layer-wise LoRA fine-tuning: a similarity metric approach, Keith Ando Ogawa et.al., Paper: http://arxiv.org/abs/2602.05988
- 2026-03-12, LatentGeo: Learnable Auxiliary Constructions in Latent Space for Multimodal Geometric Reasoning, Haiying Xu et.al., Paper: http://arxiv.org/abs/2603.12166
- 2025-12-24, Latent Implicit Visual Reasoning, Kelvin Li et.al., Paper: http://arxiv.org/abs/2512.21218
- 2025-11-25, Latent Collaboration in Multi-Agent Systems, Jiaru Zou et.al., Paper: http://arxiv.org/abs/2511.20639
- 2026-01-29, Latent Adversarial Regularization for Offline Preference Optimization, Enyi Jiang et.al., Paper: http://arxiv.org/abs/2601.22083
- 2025-12-23, Laser: Governing Long-Horizon Agentic Search via Structured Protocol and Context Register, Shuting Wang et.al., Paper: http://arxiv.org/abs/2512.20458
- 2025-12-15, Large-Language Memorization During the Classification of United States Supreme Court Cases, John E. Ortega et.al., Paper: http://arxiv.org/abs/2512.13654
- 2025-11-06, Large language models replicate and predict human cooperation across experiments in game theory, Andrea Cera Palatsi et.al., Paper: http://arxiv.org/abs/2511.04500
- 2025-10-21, Large language models for folktale type automation based on motifs: Cinderella case study, Tjaša Arčon et.al., Paper: http://arxiv.org/abs/2510.18561
- 2025-12-31, Large language models and the entropy of English, Colin Scheibner et.al., Paper: http://arxiv.org/abs/2512.24969
- 2026-02-26, Large Multimodal Models as General In-Context Classifiers, Marco Garosi et.al., Paper: http://arxiv.org/abs/2602.23229
- 2025-10-21, Large Language Models in Thematic Analysis: Prompt Engineering, Evaluation, and Guidelines for Qualitative Software Engineering Research, Cristina Martinez Montes et.al., Paper: http://arxiv.org/abs/2510.18456
- 2025-12-11, Large Language Models for Superconductor Discovery, Suman Itani et.al., Paper: http://arxiv.org/abs/2512.10847
- 2026-01-30, Large Language Models for Patent Classification: Strengths, Trade-offs, and the Long Tail Effect, Lorenzo Emer et.al., Paper: http://arxiv.org/abs/2601.23200
- 2026-02-02, Large Language Models for Mental Health: A Multilingual Evaluation, Nishat Raihan et.al., Paper: http://arxiv.org/abs/2602.02440
- 2025-12-03, Large Language Models for Limited Noisy Data: A Gravitational Wave Identification Study, Yixuan Li et.al., Paper: http://arxiv.org/abs/2512.04031
- 2026-02-09, Large Language Models for Geolocation Extraction in Humanitarian Crisis Response, G. Cafferata et.al., Paper: http://arxiv.org/abs/2602.08872
- 2025-11-07, Large Language Models for Explainable Threat Intelligence, Tiago Dinis et.al., Paper: http://arxiv.org/abs/2511.05406
- 2025-11-06, Large Language Models for Cyber Security, Raunak Somani et.al., Paper: http://arxiv.org/abs/2511.04508
- 2026-03-18, Large Language Models as a Semantic Interface and Ethical Mediator in Neuro-Digital Ecosystems: Conceptual Foundations and a Regulatory Imperative, Alexander V. Shenderuk-Zhidkov et.al., Paper: http://arxiv.org/abs/2603.17444
- 2026-01-23, Large Language Models as Automatic Annotators and Annotation Adjudicators for Fine-Grained Opinion Analysis, Gaurav Negi et.al., Paper: http://arxiv.org/abs/2601.16800
- 2026-03-11, Large Language Models as Annotators for Machine Translation Quality Estimation, Sidi Wang et.al., Paper: http://arxiv.org/abs/2603.10775
- 2026-03-20, Large Language Models and Stock Investing: Is the Human Factor Required?, Ricardo Crisostomo et.al., Paper: http://arxiv.org/abs/2603.19944
- 2026-03-25, Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning, Dogan Urgun et.al., Paper: http://arxiv.org/abs/2603.24324
- 2025-12-08, Large Causal Models from Large Language Models, Sridhar Mahadevan et.al., Paper: http://arxiv.org/abs/2512.07796
- 2026-03-25, Language-Assisted Image Clustering Guided by Discriminative Relational Signals and Adaptive Semantic Centers, Jun Ma et.al., Paper: http://arxiv.org/abs/2603.24275
- 2026-03-18, Language on Demand, Knowledge at Core: Composing LLMs with Encoder-Decoder Translation Models for Extensible Multilinguality, Mengyu Bu et.al., Paper: http://arxiv.org/abs/2603.17512
- 2026-02-25, Language Models Exhibit Inconsistent Biases Towards Algorithmic Agents and Human Experts, Jessica Y. Bo et.al., Paper: http://arxiv.org/abs/2602.22070
- 2026-03-12, Language Model Teams as Distributed Systems, Elizabeth Mieczkowski et.al., Paper: http://arxiv.org/abs/2603.12229
- 2026-02-11, Language Model Inversion through End-to-End Differentiation, Kevin Yandoka Denamganaï et.al., Paper: http://arxiv.org/abs/2602.11044
- 2025-12-11, LabelFusion: Learning to Fuse LLMs and Transformer Classifiers for Robust Text Classification, Michael Schlee et.al., Paper: http://arxiv.org/abs/2512.10793
- 2026-03-04, LabelBuddy: An Open Source Music and Audio Language Annotation Tagging Tool Using AI Assistance, Ioannis Prokopiou et.al., Paper: http://arxiv.org/abs/2603.04293
- 2026-01-08, LaST $_{0}$ : Latent Spatio-Temporal Chain-of-Thought for Robotic Vision-Language-Action Model, Zhuoyang Liu et.al., Paper: http://arxiv.org/abs/2601.05248
- 2026-01-13, LWM-Spectro: A Foundation Model for Wireless Baseband Signal Spectrograms, Namhyun Kim et.al., Paper: http://arxiv.org/abs/2601.08780
- 2026-03-19, LVOmniBench: Pioneering Long Audio-Video Understanding Evaluation for Omnimodal LLMs, Keda Tao et.al., Paper: http://arxiv.org/abs/2603.19217
- 2025-12-26, LVLM-Aided Alignment of Task-Specific Vision Models, Alexander Koebler et.al., Paper: http://arxiv.org/abs/2512.21985
- 2026-02-24, LUMEN: Longitudinal Multi-Modal Radiology Model for Prognosis and Diagnosis, Zhifan Jiang et.al., Paper: http://arxiv.org/abs/2602.21142
- 2025-12-02, LORE: A Large Generative Model for Search Relevance, Chenji Lu et.al., Paper: http://arxiv.org/abs/2512.03025
- 2025-12-10, LLMs in Interpreting Legal Documents, Simone Corbo et.al., Paper: http://arxiv.org/abs/2512.09830
- 2026-01-13, LLMs in Code Vulnerability Analysis: A Proof of Concept, Shaznin Sultana et.al., Paper: http://arxiv.org/abs/2601.08691
- 2026-01-14, LLMs can Compress LLMs: Adaptive Pruning by Agents, Sai Varun Kodathala et.al., Paper: http://arxiv.org/abs/2601.09694
- 2026-03-02, LLMs as Strategic Actors: Behavioral Alignment, Risk Calibration, and Argumentation Framing in Geopolitical Simulations, Veronika Solopova et.al., Paper: http://arxiv.org/abs/2603.02128
- 2025-10-21, LLMs as Sparse Retrievers:A Framework for First-Stage Product Search, Hongru Song et.al., Paper: http://arxiv.org/abs/2510.18527
- 2026-01-14, LLMs Got Rhythm? Hybrid Phonological Filtering for Greek Poetry Rhyme Detection and Generation, Stergios Chatzikyriakidis et.al., Paper: http://arxiv.org/abs/2601.09631
- 2025-12-11, LLMs Can Assist with Proposal Selection at Large User Facilities, Lijie Ding et.al., Paper: http://arxiv.org/abs/2512.10895
- 2026-01-07, LLMberjack: Guided Trimming of Debate Trees for Multi-Party Conversation Creation, Leonardo Bottona et.al., Paper: http://arxiv.org/abs/2601.04135
- 2026-02-25, LLMTailor: A Layer-wise Tailoring Tool for Efficient Checkpointing of Large Language Models, Minqiu Sun et.al., Paper: http://arxiv.org/abs/2602.22158
- 2026-03-11, LLMGreenRec: LLM-Based Multi-Agent Recommender System for Sustainable E-Commerce, Hao N. Nguyen et.al., Paper: http://arxiv.org/abs/2603.11025
- 2025-12-18, LLMCache: Layer-Wise Caching Strategies for Accelerated Reuse in Transformer Inference, Harsh Vardhan Bansal et.al., Paper: http://arxiv.org/abs/2512.16843
- 2026-03-11, LLM2Vec-Gen: Generative Embeddings from Large Language Models, Parishad BehnamGhader et.al., Paper: http://arxiv.org/abs/2603.10913
- 2026-03-04, LLM-supported 3D Modeling Tool for Radio Radiance Field Reconstruction, Chengling Xu et.al., Paper: http://arxiv.org/abs/2603.04368
- 2026-01-23, LLM-powered Real-time Patent Citation Recommendation for Financial Technologies, Tianang Deng et.al., Paper: http://arxiv.org/abs/2601.16775
- 2025-11-05, LLM-enhanced Air Quality Monitoring Interface via Model Context Protocol, Yu-Erh Pan et.al., Paper: http://arxiv.org/abs/2511.03706
- 2025-12-16, LLM-driven Knowledge Enhancement for Multimodal Cancer Survival Prediction, Chenyu Zhao et.al., Paper: http://arxiv.org/abs/2512.14594
- 2025-12-19, LLM-based Behaviour Driven Development for Hardware Design, Rolf Drechsler et.al., Paper: http://arxiv.org/abs/2512.17814
- 2025-11-06, LLM-as-a-Judge: Toward World Models for Slate Recommendation Systems, Baptiste Bonin et.al., Paper: http://arxiv.org/abs/2511.04541
- 2026-04-02, LLM-as-a-Judge for Time Series Explanations, Preetham Sivalingam et.al., Paper: http://arxiv.org/abs/2604.02118
- 2025-11-04, LLM-Supported Formal Knowledge Representation for Enhancing Control Engineering Content with an Interactive Semantic Layer, Julius Fiedler et.al., Paper: http://arxiv.org/abs/2511.02759
- 2026-03-14, LLM-Guided Safe Reinforcement Learning for Energy System Topology Reconfiguration, Zongyan Zhang et.al., Paper: http://arxiv.org/abs/2603.14018
- 2026-03-14, LLM-Guided Reinforcement Learning for Audio-Visual Speech Enhancement, Chih-Ning Chen et.al., Paper: http://arxiv.org/abs/2603.13952
- 2026-03-07, LLM-FK: Multi-Agent LLM Reasoning for Foreign Key Detection in Large-Scale Complex Databases, Zijian Tang et.al., Paper: http://arxiv.org/abs/2603.07278
- 2026-03-29, LLM-Enabled Low-Altitude UAV Natural Language Navigation via Signal Temporal Logic Specification Translation and Repair, Yuqi Ping et.al., Paper: http://arxiv.org/abs/2603.27583
- 2025-11-24, LLM-Driven Stationarity-Aware Expert Demonstrations for Multi-Agent Reinforcement Learning in Mobile Systems, Tianyang Duan et.al., Paper: http://arxiv.org/abs/2511.19368
- 2025-12-01, LLM-Driven Corrective Robot Operation Code Generation with Static Text-Based Simulation, Wenhao Wang et.al., Paper: http://arxiv.org/abs/2512.02002
- 2025-12-23, LLM-Based Authoring of Agent-Based Narratives through Scene Descriptions, Vinayak Regmi et.al., Paper: http://arxiv.org/abs/2512.20550
- 2025-12-12, LLM tools in the prediction of the stability of perovskite solar cells, S. Frenkel et.al., Paper: http://arxiv.org/abs/2512.11615
- 2025-12-08, LLM Use for Mental Health: Crowdsourcing Users’ Sentiment-based Perspectives and Values from Social Discussions, Lingyao Li et.al., Paper: http://arxiv.org/abs/2512.07797
- 2026-04-01, LLM REgression with a Latent Iterative State Head, Yiheng Su et.al., Paper: http://arxiv.org/abs/2604.01206
- 2026-01-22, LLM Prompt Evaluation for Educational Applications, Langdon Holmes et.al., Paper: http://arxiv.org/abs/2601.16134
- 2026-02-26, LLM Novice Uplift on Dual-Use, In Silico Biology Tasks, Chen Bo Calvin Zhang et.al., Paper: http://arxiv.org/abs/2602.23329
- 2025-12-05, LLM Harms: A Taxonomy and Discussion, Kevin Chen et.al., Paper: http://arxiv.org/abs/2512.05929
- 2026-03-13, LLM Constitutional Multi-Agent Governance, J. de Curtò et.al., Paper: http://arxiv.org/abs/2603.13189
- 2025-12-01, LLM CHESS: Benchmarking Reasoning and Instruction-Following in LLMs through Chess, Sai Kolasani et.al., Paper: http://arxiv.org/abs/2512.01992
- 2026-01-20, LLM Augmented Intervenable Multimodal Adaptor for Post-operative Complication Prediction in Lung Cancer Surgery, Shubham Pandey et.al., Paper: http://arxiv.org/abs/2601.14154
- 2025-11-04, LLEXICORP: End-user Explainability of Convolutional Neural Networks, Vojtěch Kůr et.al., Paper: http://arxiv.org/abs/2511.02720
- 2025-11-28, LFM2 Technical Report, Alexander Amini et.al., Paper: http://arxiv.org/abs/2511.23404
- 2026-03-16, LEXI: Lossless Exponent Coding for Efficient Inter-Chiplet Communication in Hybrid LLMs, Miao Sun et.al., Paper: http://arxiv.org/abs/2603.15589
- 2026-01-08, LELA: an LLM-based Entity Linking Approach with Zero-Shot Domain Adaptation, Samy Haffoudhi et.al., Paper: http://arxiv.org/abs/2601.05192
- 2026-02-13, LCSB: Layer-Cyclic Selective Backpropagation for Memory-Efficient On-Device LLM Fine-Tuning, Juneyoung Park et.al., Paper: http://arxiv.org/abs/2602.13073
- 2025-11-18, LAUD: Integrating Large Language Models with Active Learning for Unlabeled Data, Tzu-Hsuan Chou et.al., Paper: http://arxiv.org/abs/2511.14738
- 2026-03-25, LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control, Yifeng Zhang et.al., Paper: http://arxiv.org/abs/2603.24361
- 2026-02-19, LATA: Laplacian-Assisted Transductive Adaptation for Conformal Uncertainty in Medical VLMs, Behzad Bozorgtabar et.al., Paper: http://arxiv.org/abs/2602.17535
- 2026-02-04, LALM-as-a-Judge: Benchmarking Large Audio-Language Models for Safety Evaluation in Multi-Turn Spoken Dialogues, Amir Ivry et.al., Paper: http://arxiv.org/abs/2602.04796
- 2025-10-21, LAFA: Agentic LLM-Driven Federated Analytics over Decentralized Data Sources, Haichao Ji et.al., Paper: http://arxiv.org/abs/2510.18477
- 2026-03-12, LABSHIELD: A Multimodal Benchmark for Safety-Critical Reasoning and Planning in Scientific Laboratories, Qianpu Sun et.al., Paper: http://arxiv.org/abs/2603.11987
- 2026-02-10, Kunlun: Establishing Scaling Laws for Massive-Scale Recommendation Systems through Unified Architecture Design, Bojian Hou et.al., Paper: http://arxiv.org/abs/2602.10016
- 2025-10-21, KrishokBondhu: A Retrieval-Augmented Voice-Based Agricultural Advisory Call Center for Bengali Farmers, Mohd Ruhul Ameen et.al., Paper: http://arxiv.org/abs/2510.18355
- 2025-10-21, KoSimpleQA: A Korean Factuality Benchmark with an Analysis of Reasoning LLMs, Donghyeon Ko et.al., Paper: http://arxiv.org/abs/2510.18368
- 2026-01-21, Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning, Yuval Kansal et.al., Paper: http://arxiv.org/abs/2601.15160
- 2025-11-13, Know Your Limits: Entropy Estimation Modeling for Compression and Generalization, Benjamin L. Badger et.al., Paper: http://arxiv.org/abs/2511.10618
- 2026-02-13, Know More, Know Clearer: A Meta-Cognitive Framework for Knowledge Augmentation in Large Language Models, Hao Chen et.al., Paper: http://arxiv.org/abs/2602.12996
- 2026-01-12, Kinship Data Benchmark for Multi-hop Reasoning, Tianda Sun et.al., Paper: http://arxiv.org/abs/2601.07794
- 2025-10-30, Kimi Linear: An Expressive, Efficient Attention Architecture, Kimi Team et.al., Paper: http://arxiv.org/abs/2510.26692
- 2025-12-01, KV Pareto: Systems-Level Optimization of KV Cache and Model Compression for Long Context Inference, Sai Gokhale et.al., Paper: http://arxiv.org/abs/2512.01953
- 2026-04-01, KUET at StanceNakba Shared Task: StanceMoE: Mixture-of-Experts Architecture for Stance Detection, Abdullah Al Shafi et.al., Paper: http://arxiv.org/abs/2604.00878
- 2025-12-05, KQ-SVD: Compressing the KV Cache with Provable Guarantees on Attention Fidelity, Damien Lesens et.al., Paper: http://arxiv.org/abs/2512.05916
- 2026-02-23, KNIGHT: Knowledge Graph-Driven Multiple-Choice Question Generation with Adaptive Hardness Calibration, Mohammad Amanlou et.al., Paper: http://arxiv.org/abs/2602.20135
- 2026-03-22, KLDrive: Fine-Grained 3D Scene Reasoning for Autonomous Driving based on Knowledge Graph, Ye Tian et.al., Paper: http://arxiv.org/abs/2603.21029
- 2025-10-23, KL-Regularized Reinforcement Learning is Designed to Mode Collapse, Anthony GX-Chen et.al., Paper: http://arxiv.org/abs/2510.20817
- 2026-01-07, KDCM: Reducing Hallucination in LLMs through Explicit Reasoning Structures, Jinbo Hao et.al., Paper: http://arxiv.org/abs/2601.04086
- 2025-10-21, KAT-Coder Technical Report, Zizheng Zhan et.al., Paper: http://arxiv.org/abs/2510.18779
- 2026-03-06, K-MaT: Knowledge-Anchored Manifold Transport for Cross-Modal Prompt Learning in Medical Imaging, Jiajun Zeng et.al., Paper: http://arxiv.org/abs/2603.06340
- 2026-02-11, Just on Time: Token-Level Early Stopping for Diffusion Language Models, Zahar Kohut et.al., Paper: http://arxiv.org/abs/2602.11133
- 2025-11-19, Joint Semantic-Channel Coding and Modulation for Token Communications, Jingkai Ying et.al., Paper: http://arxiv.org/abs/2511.15699
- 2025-12-03, Jina-VLM: Small Multilingual Vision Language Model, Andreas Koukounas et.al., Paper: http://arxiv.org/abs/2512.04032
- 2026-01-20, Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow, Haocheng Xi et.al., Paper: http://arxiv.org/abs/2601.14243
- 2025-12-15, Janus: Disaggregating Attention and Experts for Scalable MoE Inference, Zhexiang Zhang et.al., Paper: http://arxiv.org/abs/2512.13525
- 2023-10-02, Jais and Jais-chat: Arabic-Centric Foundation and Instruction-Tuned Open Generative Large Language Models, Neha Sengupta et.al., Paper: http://arxiv.org/abs/2308.16149
- 2026-04-02, Jagle: Building a Large-Scale Japanese Multimodal Post-Training Dataset for Vision-Language Models, Issa Sugiura et.al., Paper: http://arxiv.org/abs/2604.02048
- 2025-10-21, JAUNT: Joint Alignment of User Intent and Network State for QoE-centric LLM Tool Routing, Enhan Li et.al., Paper: http://arxiv.org/abs/2510.18550
- 2026-04-01, JAMMEval: A Refined Collection of Japanese Benchmarks for Reliable VLM Evaluation, Issa Sugiura et.al., Paper: http://arxiv.org/abs/2604.00909
- 2026-01-21, Iterative Refinement Improves Compositional Image Generation, Shantanu Jaiswal et.al., Paper: http://arxiv.org/abs/2601.15286
- 2025-12-31, Iterative Deployment Improves Planning Skills in LLMs, Augusto B. Corrêa et.al., Paper: http://arxiv.org/abs/2512.24940
- 2026-03-12, IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL, Zhoujun Cheng et.al., Paper: http://arxiv.org/abs/2603.12151
- 2026-03-20, IsoCLIP: Decomposing CLIP Projectors for Efficient Intra-modal Alignment, Simone Magistri et.al., Paper: http://arxiv.org/abs/2603.19862
- 2026-03-18, Is Your LLM-as-a-Recommender Agent Trustable? LLMs’ Recommendation is Easily Hacked by Biases (Preferences), Zichen Tang et.al., Paper: http://arxiv.org/abs/2603.17417
- 2026-02-09, Is Reasoning Capability Enough for Safety in Long-Context Language Models?, Yu Fu et.al., Paper: http://arxiv.org/abs/2602.08874
- 2026-03-26, Is Mathematical Problem-Solving Expertise in Large Language Models Associated with Assessment Performance?, Liang Zhang et.al., Paper: http://arxiv.org/abs/2603.25633
- 2026-03-17, Is Conformal Factuality for RAG-based LLMs Robust? Novel Metrics and Systematic Insights, Yi Chen et.al., Paper: http://arxiv.org/abs/2603.16817
- 2026-01-12, Is Agentic RAG worth it? An experimental comparison of RAG approaches, Pietro Ferrazzi et.al., Paper: http://arxiv.org/abs/2601.07711
- 2026-03-25, Invisible Threats from Model Context Protocol: Generating Stealthy Injection Payload via Tree-based Adaptive Search, Yulin Shen et.al., Paper: http://arxiv.org/abs/2603.24203
- 2026-01-02, Investigating the Viability of Employing Multi-modal Large Language Models in the Context of Audio Deepfake Detection, Akanksha Chuchra et.al., Paper: http://arxiv.org/abs/2601.00777
- 2026-03-29, Investigating the Influence of Language on Sycophantic Behavior of Multilingual LLMs, Bayan Abdullah Aldahlawi et.al., Paper: http://arxiv.org/abs/2603.27664
- 2026-03-26, Investigating the Fundamental Limit: A Feasibility Study of Hybrid-Neural Archival, Marcus Armstrong et.al., Paper: http://arxiv.org/abs/2603.25526
- 2026-04-01, Investigating Autonomous Agent Contributions in the Wild: Activity Patterns and Code Change over Time, Razvan Mihai Popescu et.al., Paper: http://arxiv.org/abs/2604.00917
- 2026-02-05, Inverse Depth Scaling From Most Layers Being Similar, Yizhou Liu et.al., Paper: http://arxiv.org/abs/2602.05970
- 2025-12-02, Invasive Context Engineering to Control Large Language Models, Thomas Rivasseau et.al., Paper: http://arxiv.org/abs/2512.03001
- 2025-12-26, Introducing TrGLUE and SentiTurca: A Comprehensive Benchmark for Turkish General Language Understanding and Sentiment Analysis, Duygu Altinok et.al., Paper: http://arxiv.org/abs/2512.22100
- 2026-03-16, InterveneBench: Benchmarking LLMs for Intervention Reasoning and Causal Study Design in Real Social Systems, Shaojie Shi et.al., Paper: http://arxiv.org/abs/2603.15542
- 2025-10-29, Interpreting LLMs as Credit Risk Classifiers: Do Their Feature Explanations Align with Classical ML?, Saeed AlMarri et.al., Paper: http://arxiv.org/abs/2510.25701
- 2026-03-18, Interpreting Context-Aware Human Preferences for Multi-Objective Robot Navigation, Tharun Sethuraman et.al., Paper: http://arxiv.org/abs/2603.17510
- 2026-03-17, Internalizing Agency from Reflective Experience, Rui Ge et.al., Paper: http://arxiv.org/abs/2603.16843
- 2026-01-08, Internal Representations as Indicators of Hallucinations in Agent Tool Selection, Kait Healy et.al., Paper: http://arxiv.org/abs/2601.05214
- 2026-03-10, InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing, Changyao Tian et.al., Paper: http://arxiv.org/abs/2603.09877
- 2025-11-20, InternData-A1: Pioneering High-Fidelity Synthetic Data for Pre-training Generalist Policy, Yang Tian et.al., Paper: http://arxiv.org/abs/2511.16651
- 2025-11-03, Interaction as Intelligence Part II: Asynchronous Human-Agent Rollout for Long-Horizon Task Training, Dayuan Fu et.al., Paper: http://arxiv.org/abs/2510.27630
- 2026-03-24, InterDyad: Interactive Dyadic Speech-to-Video Generation by Querying Intermediate Visual Guidance, Dongwei Pan et.al., Paper: http://arxiv.org/abs/2603.23132
- 2025-10-22, Integrating Transparent Models, LLMs, and Practitioner-in-the-Loop: A Case of Nonprofit Program Evaluation, Ji Ma et.al., Paper: http://arxiv.org/abs/2510.19799
- 2025-11-20, Integrating Symbolic Natural Language Understanding and Language Models for Word Sense Disambiguation, Kexin Zhao et.al., Paper: http://arxiv.org/abs/2511.16577
- 2025-10-21, Integrating Large Language Models and Evaluating Student Outcomes in an Introductory Computer Science Course, Annapurna Vadaparty et.al., Paper: http://arxiv.org/abs/2510.18806
- 2025-07-31, Instruction-tuned Large Language Models for Machine Translation in the Medical Domain, Miguel Rios et.al., Paper: http://arxiv.org/abs/2408.16440
- 2025-12-29, Instruction-Following Evaluation of Large Vision-Language Models, Daiki Shiono et.al., Paper: http://arxiv.org/abs/2512.23572
- 2026-03-11, Instruction set for the representation of graphs, Ezequiel Lopez-Rubio et.al., Paper: http://arxiv.org/abs/2603.11039
- 2025-12-05, InstructMPC: A Human-LLM-in-the-Loop Framework for Context-Aware Power Grid Control, Ruixiang Wu et.al., Paper: http://arxiv.org/abs/2512.05876
- 2025-11-13, Instella: Fully Open Language Models with Stellar Performance, Jiang Liu et.al., Paper: http://arxiv.org/abs/2511.10628
- 2025-12-02, Instant Video Models: Universal Adapters for Stabilizing Image-Based Networks, Matthew Dutson et.al., Paper: http://arxiv.org/abs/2512.03014
- 2025-10-21, InspectCoder: Dynamic Analysis-Enabled Self Repair through interactive LLM-Debugger Collaboration, Yunkun Wang et.al., Paper: http://arxiv.org/abs/2510.18327
- 2025-12-18, Inside Out: Uncovering How Comment Internalization Steers LLMs for Better or Worse, Aaron Imani et.al., Paper: http://arxiv.org/abs/2512.16790
- 2025-11-03, InnovatorBench: Evaluating Agents’ Ability to Conduct Innovative LLM Research, Yunze Wu et.al., Paper: http://arxiv.org/abs/2510.27598
- 2026-02-26, InnerQ: Hardware-aware Tuning-free Quantization of KV Cache for Large Language Models, Sayed Mohammadreza Tayaranian Hosseini et.al., Paper: http://arxiv.org/abs/2602.23200
- 2026-03-03, Inherited Goal Drift: Contextual Pressure Can Undermine Agentic Goals, Achyutha Menon et.al., Paper: http://arxiv.org/abs/2603.03258
- 2026-02-06, InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning, Yuchen Yan et.al., Paper: http://arxiv.org/abs/2602.06960
- 2026-02-11, Information Abstraction for Data Transmission Networks based on Large Language Models, Haoyuan Zhu et.al., Paper: http://arxiv.org/abs/2602.11022
- 2026-01-15, Influential Training Data Retrieval for Explaining Verbalized Confidence of LLMs, Yuxi Xia et.al., Paper: http://arxiv.org/abs/2601.10645
- 2026-03-10, Influencing LLM Multi-Agent Dialogue via Policy-Parameterized Prompts, Hongbo Bo et.al., Paper: http://arxiv.org/abs/2603.09890
- 2025-12-04, Influence of Object Affordance on Action Language Understanding: Evidence from Dynamic Causal Modeling Analysis, Supriya Bordoloi et.al., Paper: http://arxiv.org/abs/2512.04989
- 2025-12-09, InfiniteVL: Synergizing Linear and Sparse Attention for Highly-Efficient, Unlimited-Input Vision-Language Models, Hongyuan Tao et.al., Paper: http://arxiv.org/abs/2512.08829
- 2026-01-13, Inferring Latent Intentions: Attributional Natural Language Inference in LLM Agents, Xin Quan et.al., Paper: http://arxiv.org/abs/2601.08742
- 2026-03-18, Inducing Epistemological Humility in Large Language Models: A Targeted SFT Approach to Reducing Hallucination, Cem Uluoglakci et.al., Paper: http://arxiv.org/abs/2603.17504
- 2026-03-20, IndoorR2X: Indoor Robot-to-Everything Coordination with LLM-Driven Planning, Fan Yang et.al., Paper: http://arxiv.org/abs/2603.20182
- 2026-02-02, Indications of Belief-Guided Agency and Meta-Cognitive Monitoring in Large Language Models, Noam Steinmetz Yalon et.al., Paper: http://arxiv.org/abs/2602.02467
- 2026-03-18, IndicSafe: A Benchmark for Evaluating Multilingual LLM Safety in South Asia, Priyaranjan Pattnayak et.al., Paper: http://arxiv.org/abs/2603.17915
- 2026-03-12, IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse, Yushi Bai et.al., Paper: http://arxiv.org/abs/2603.12201
- 2025-12-22, Increasing the Thinking Budget is Not All You Need, Ignacio Iacobacci et.al., Paper: http://arxiv.org/abs/2512.19585
- 2026-03-17, InViC: Intent-aware Visual Cues for Medical Visual Question Answering, Zhisong Wang et.al., Paper: http://arxiv.org/abs/2603.16372
- 2026-01-20, InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning, Matthew Y. R. Yang et.al., Paper: http://arxiv.org/abs/2601.14209
- 2025-12-02, InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration, Zhongyu Yang et.al., Paper: http://arxiv.org/abs/2512.02981
- 2026-03-17, InCoder-32B: Code Foundation Model for Industrial Scenarios, Jian Yang et.al., Paper: http://arxiv.org/abs/2603.16790
- 2026-02-11, In-the-Wild Model Organisms: Mitigating Undesirable Emergent Behaviors in Production LLM Post-Training via Data Attribution, Frank Xiao et.al., Paper: http://arxiv.org/abs/2602.11079
- 2025-03-18, In-context Learning vs. Instruction Tuning: The Case of Small and Multilingual Language Models, David Ponce et.al., Paper: http://arxiv.org/abs/2503.01611
- 2025-12-08, In-Context and Few-Shots Learning for Forecasting Time Series Data based on Large Language Models, Saroj Gopali et.al., Paper: http://arxiv.org/abs/2512.07705
- 2025-12-02, In-Context Sync-LoRA for Portrait Video Editing, Sagi Polaczek et.al., Paper: http://arxiv.org/abs/2512.03013
- 2025-12-12, In-Context Learning for Seismic Data Processing, Fabian Fuchs et.al., Paper: http://arxiv.org/abs/2512.11575
- 2026-02-13, In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach, Yiran Gao et.al., Paper: http://arxiv.org/abs/2602.13156
- 2025-12-19, In Times of Crisis: An Exploratory Study of Media and Political Discourse on YouTube During the 2024 French Elections, Vera Sosnovik et.al., Paper: http://arxiv.org/abs/2512.17768
- 2025-11-04, In Good GRACEs: Principled Teacher Selection for Knowledge Distillation, Abhishek Panigrahi et.al., Paper: http://arxiv.org/abs/2511.02833
- 2025-11-28, Improving motor imagery decoding methods for an EEG-based mobile brain-computer interface in the context of the 2024 Cybathlon, Isabel Whiteley Tscherniak et.al., Paper: http://arxiv.org/abs/2511.23384
- 2026-03-14, Improving Visual Reasoning with Iterative Evidence Refinement, Zeru Shi et.al., Paper: http://arxiv.org/abs/2603.14117
- 2026-01-22, Improving Training Efficiency and Reducing Maintenance Costs via Language Specific Model Merging, Alphaeus Dmonte et.al., Paper: http://arxiv.org/abs/2601.16127
- 2026-01-02, Improving Router Security using BERT, John Carter et.al., Paper: http://arxiv.org/abs/2601.00783
- 2026-02-25, Improving Parametric Knowledge Access in Reasoning Language Models, Melody Ma et.al., Paper: http://arxiv.org/abs/2602.22193
- 2026-02-06, Improved Sampling Schedules for Discrete Diffusion Models, Alberto Foresti et.al., Paper: http://arxiv.org/abs/2602.06849
- 2025-12-01, Improved Mean Flows: On the Challenges of Fastforward Generative Models, Zhengyang Geng et.al., Paper: http://arxiv.org/abs/2512.02012
- 2026-02-13, Implicit-Scale 3D Reconstruction for Multi-Food Volume Estimation from Monocular Images, Yuhao Chen et.al., Paper: http://arxiv.org/abs/2602.13041
- 2026-03-19, Implicit Grading Bias in Large Language Models: How Writing Style Affects Automated Assessment Across Math, Programming, and Essay Tasks, Rudra Jadhav et.al., Paper: http://arxiv.org/abs/2603.18765
- 2025-12-18, Impacts of Racial Bias in Historical Training Data for News AI, Rahul Bhargava et.al., Paper: http://arxiv.org/abs/2512.16901
- 2025-11-13, Impact of Layer Norm on Memorization and Generalization in Transformers, Rishi Singhal et.al., Paper: http://arxiv.org/abs/2511.10566
- 2025-10-21, ImageGem: In-the-wild Generative Image Interaction Dataset for Generative Model Personalization, Yuanhe Guo et.al., Paper: http://arxiv.org/abs/2510.18433
- 2026-01-14, Image2Garment: Simulation-ready Garment Generation from a Single Image, Selim Emir Can et.al., Paper: http://arxiv.org/abs/2601.09658
- 2025-11-14, ImAgent: A Unified Multimodal Agent Framework for Test-Time Scalable Image Generation, Kaishen Wang et.al., Paper: http://arxiv.org/abs/2511.11483
- 2025-10-21, Illusions of reflection: open-ended task reveals systematic failures in Large Language Models’ reflective reasoning, Sion Weatherhead et.al., Paper: http://arxiv.org/abs/2510.18254
- 2026-01-09, Illusions of Confidence? Diagnosing LLM Truthfulness via Neighborhood Consistency, Haoming Xu et.al., Paper: http://arxiv.org/abs/2601.05905
- 2025-10-21, Identity-Aware Large Language Models require Cultural Reasoning, Alistair Plum et.al., Paper: http://arxiv.org/abs/2510.18510
- 2025-11-28, Identifying bars in galaxies using machine learning, Rajit Shrivastava et.al., Paper: http://arxiv.org/abs/2511.23383
- 2026-01-27, Identifying and Transferring Reasoning-Critical Neurons: Improving LLM Inference Reliability via Activation Steering, Fangan Dong et.al., Paper: http://arxiv.org/abs/2601.19847
- 2026-01-28, Idea2Story: An Automated Pipeline for Transforming Research Concepts into Complete Scientific Narratives, Tengyue Xu et.al., Paper: http://arxiv.org/abs/2601.20833
- 2026-01-16, Idea First, Code Later: Disentangling Problem Solving from Code Generation in Evaluating LLMs for Competitive Programming, Sama Hadhoud et.al., Paper: http://arxiv.org/abs/2601.11332
- 2026-01-22, IVRA: Improving Visual-Token Relations for Robot Action Policy with Training-Free Hint-Based Guidance, Jongwoo Park et.al., Paper: http://arxiv.org/abs/2601.16207
- 2025-10-27, ISA-Bench: Benchmarking Instruction Sensitivity for Large Audio Language Models, Bohan Li et.al., Paper: http://arxiv.org/abs/2510.23558
- 2026-01-02, IRPO: Scaling the Bradley-Terry Model via Reinforcement Learning, Haonan Song et.al., Paper: http://arxiv.org/abs/2601.00677
- 2026-03-17, IQuest-Coder-V1 Technical Report, Jian Yang et.al., Paper: http://arxiv.org/abs/2603.16733
- 2025-10-27, IPQA: A Benchmark for Core Intent Identification in Personalized Question Answering, Jieyong Kim et.al., Paper: http://arxiv.org/abs/2510.23536
- 2026-03-17, IOSVLM: A 3D Vision-Language Model for Unified Dental Diagnosis from Intraoral Scans, Huimin Xiong et.al., Paper: http://arxiv.org/abs/2603.16781
- 2025-10-29, INT v.s. FP: A Comprehensive Study of Fine-Grained Low-bit Quantization Formats, Mengzhao Chen et.al., Paper: http://arxiv.org/abs/2510.25602
- 2025-10-21, IMB: An Italian Medical Benchmark for Question Answering, Antonio Romano et.al., Paper: http://arxiv.org/abs/2510.18468
- 2026-01-20, IIR-VLM: In-Context Instance-level Recognition for Large Vision-Language Models, Liang Shi et.al., Paper: http://arxiv.org/abs/2601.14188
- 2026-01-09, IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck, Huilin Deng et.al., Paper: http://arxiv.org/abs/2601.05870
- 2025-10-21, IF-VidCap: Can Video Caption Models Follow Instructions?, Shihao Li et.al., Paper: http://arxiv.org/abs/2510.18726
- 2025-12-17, IC-Effect: Precise and Efficient Video Effects Editing via In-Context Learning, Yuanhang Li et.al., Paper: http://arxiv.org/abs/2512.15635
- 2026-02-10, Hydra-Nav: Object Navigation via Adaptive Dual-Process Reasoning, Zixuan Wang et.al., Paper: http://arxiv.org/abs/2602.09972
- 2026-03-31, Hybrid Framework for Robotic Manipulation: Integrating Reinforcement Learning and Large Language Models, Md Saad et.al., Paper: http://arxiv.org/abs/2603.30022
- 2025-10-24, Huxley-Gödel Machine: Human-Level Coding Agent Development by an Approximation of the Optimal Self-Improving Machine, Wenyi Wang et.al., Paper: http://arxiv.org/abs/2510.21614
- 2025-11-28, Hunyuan-GameCraft-2: Instruction-following Interactive Game World Model, Junshu Tang et.al., Paper: http://arxiv.org/abs/2511.23429
- 2026-03-26, Humans vs Vision-Language Models: A Unified Measure of Narrative Coherence, Nikolai Ilinykh et.al., Paper: http://arxiv.org/abs/2603.25537
- 2026-03-18, Humans and transformer LMs: Abstraction drives language learning, Jasper Jian et.al., Paper: http://arxiv.org/abs/2603.17475
- 2026-01-02, Human-like AI-based Auto-Field-in-Field Whole-Brain Radiotherapy Treatment Planning With Conversation Large Language Model Feedback, Adnan Jafar et.al., Paper: http://arxiv.org/abs/2601.00685
- 2026-03-12, Human-Centred LLM Privacy Audits: Findings and Frictions, Dimitri Staufer et.al., Paper: http://arxiv.org/abs/2603.12094
- 2026-02-13, Human-Aligned MLLM Judges for Fine-Grained Image Editing Evaluation: A Benchmark, Framework, and Analysis, Runzhou Liu et.al., Paper: http://arxiv.org/abs/2602.13028
- 2025-11-14, Human-AI collaborative autonomous synthesis with pulsed laser deposition for remote epitaxy, Asraful Haque et.al., Paper: http://arxiv.org/abs/2511.11558
- 2026-01-20, Human Values in a Single Sentence: Moral Presence, Hierarchies, and Transformer Ensembles on the Schwartz Continuum, Víctor Yeste et.al., Paper: http://arxiv.org/abs/2601.14172
- 2025-10-22, Hubble: a Model Suite to Advance the Study of LLM Memorization, Johnny Tian-Zheng Wei et.al., Paper: http://arxiv.org/abs/2510.19811
- 2026-01-14, How well LLM-based test generation techniques perform with newer LLM versions?, Michael Konstantinou et.al., Paper: http://arxiv.org/abs/2601.09695
- 2026-03-07, How to Steal Reasoning Without Reasoning Traces, Tingwei Zhang et.al., Paper: http://arxiv.org/abs/2603.07267
- 2026-01-21, How to Build AI Agents by Augmenting LLMs with Codified Human Expert Domain Knowledge? A Software Engineering Framework, Choro Ulan uulu et.al., Paper: http://arxiv.org/abs/2601.15153
- 2026-03-02, How Small Can 6G Reason? Scaling Tiny Language Models for AI-Native Networks, Mohamed Amine Ferrag et.al., Paper: http://arxiv.org/abs/2603.02156
- 2026-02-12, How Sampling Shapes LLM Alignment: From One-Shot Optima to Iterative Dynamics, Yurong Chen et.al., Paper: http://arxiv.org/abs/2602.12180
- 2026-02-23, How Retrieved Context Shapes Internal Representations in RAG, Samuel Yeh et.al., Paper: http://arxiv.org/abs/2602.20091
- 2025-12-17, How Much is Too Much? Exploring LoRA Rank Trade-offs for Retaining Knowledge and Domain Robustness, Darshita Rathore et.al., Paper: http://arxiv.org/abs/2512.15634
- 2026-01-16, How Much Would a Clinician Edit This Draft? Evaluating LLM Alignment for Patient Message Response Drafting, Parker Seegmiller et.al., Paper: http://arxiv.org/abs/2601.11344
- 2026-03-07, How Much Noise Can BERT Handle? Insights from Multilingual Sentence Difficulty Detection, Nouran Khallaf et.al., Paper: http://arxiv.org/abs/2603.07346
- 2025-12-18, How Good is Post-Hoc Watermarking With Language Model Rephrasing?, Pierre Fernandez et.al., Paper: http://arxiv.org/abs/2512.16904
- 2025-10-21, How Efficient Are Diffusion Language Models? A Critical Examination of Efficiency Evaluation Practices, Han Peng et.al., Paper: http://arxiv.org/abs/2510.18480
- 2025-10-21, How Do LLMs Use Their Depth?, Akshat Gupta et.al., Paper: http://arxiv.org/abs/2510.18871
- 2026-03-19, How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation, Ke-Han Lu et.al., Paper: http://arxiv.org/abs/2603.19195
- 2026-03-16, HorizonMath: Measuring AI Progress Toward Mathematical Discovery with Automatic Verification, Erik Y. Wang et.al., Paper: http://arxiv.org/abs/2603.15617
- 2026-02-04, Horizon-LM: A RAM-Centric Architecture for LLM Training, Zhengqing Yuan et.al., Paper: http://arxiv.org/abs/2602.04816
- 2026-01-07, HoneyTrap: Deceiving Large Language Model Attackers to Honeypot Traps with Resilient Multi-Agent Defense, Siyuan Li et.al., Paper: http://arxiv.org/abs/2601.04034
- 2025-11-14, Honesty over Accuracy: Trustworthy Language Models through Reinforced Hesitation, Mohamad Amin Mohamadi et.al., Paper: http://arxiv.org/abs/2511.11500
- 2026-03-12, HomeSafe-Bench: Evaluating Vision-Language Models on Unsafe Action Detection for Embodied Agents in Household Scenarios, Jiayue Pu et.al., Paper: http://arxiv.org/abs/2603.11975
- 2026-03-12, Hoi3DGen: Generating High-Quality Human-Object-Interactions in 3D, Agniv Sharma et.al., Paper: http://arxiv.org/abs/2603.12126
- 2026-04-01, HippoCamp: Benchmarking Contextual Agents on Personal Computers, Zhe Yang et.al., Paper: http://arxiv.org/abs/2604.01221
- 2026-01-30, High-quality generation of dynamic game content via small language models: A proof of concept, Morten I. K. Munk et.al., Paper: http://arxiv.org/abs/2601.23206
- 2025-11-19, Hierarchical Semantic Tree Anchoring for CLIP-Based Class-Incremental Learning, Tao Hu et.al., Paper: http://arxiv.org/abs/2511.15633
- 2026-01-16, Hierarchical Orthogonal Residual Spread for Precise Massive Editing in Large Language Models, Xiaojie Gu et.al., Paper: http://arxiv.org/abs/2601.11441
- 2026-03-05, Hierarchical Decoding for Discrete Speech Synthesis with Multi-Resolution Spoof Detection, Junchuan Zhao et.al., Paper: http://arxiv.org/abs/2603.05373
- 2026-03-29, Hidden Ads: Behavior Triggered Semantic Backdoors for Advertisement Injection in Vision Language Models, Duanyi Yao et.al., Paper: http://arxiv.org/abs/2603.27522
- 2026-03-20, HiPath: Hierarchical Vision-Language Alignment for Structured Pathology Report Prediction, Ruicheng Yuan et.al., Paper: http://arxiv.org/abs/2603.19957
- 2026-01-05, Heterogeneous Low-Bandwidth Pre-Training of LLMs, Yazan Obeidi et.al., Paper: http://arxiv.org/abs/2601.02360
- 2026-02-23, HeatPrompt: Zero-Shot Vision-Language Modeling of Urban Heat Demand from Satellite Images, Kundan Thota et.al., Paper: http://arxiv.org/abs/2602.20066
- 2026-01-26, Health-SCORE: Towards Scalable Rubrics for Improving Health-LLMs, Zhichao Yang et.al., Paper: http://arxiv.org/abs/2601.18706
- 2026-01-16, Health Facility Location in Ethiopia: Leveraging LLMs to Integrate Expert Knowledge into Algorithmic Planning, Yohai Trabelsi et.al., Paper: http://arxiv.org/abs/2601.11479
- 2025-12-19, HeadHunt-VAD: Hunting Robust Anomaly-Sensitive Heads in MLLM for Tuning-Free Video Anomaly Detection, Zhaolin Cai et.al., Paper: http://arxiv.org/abs/2512.17601
- 2025-10-24, Head Pursuit: Probing Attention Specialization in Multimodal Transformers, Lorenzo Basile et.al., Paper: http://arxiv.org/abs/2510.21518
- 2026-03-18, Harnessing the Power of Foundation Models for Accurate Material Classification, Qingran Lin et.al., Paper: http://arxiv.org/abs/2603.17390
- 2025-11-21, Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization, Vinay Kanakeri et.al., Paper: http://arxiv.org/abs/2511.17489
- 2025-10-21, HarmNet: A Framework for Adaptive Multi-Turn Jailbreak Attacks on Large Language Models, Sidhant Narula et.al., Paper: http://arxiv.org/abs/2510.18728
- 2025-10-21, Hardness of Learning Regular Languages in the Next Symbol Prediction Setting, Satwik Bhattamishra et.al., Paper: http://arxiv.org/abs/2510.18634
- 2026-03-11, HanMoVLM: Large Vision-Language Models for Professional Artistic Painting Evaluation, Hongji Yang et.al., Paper: http://arxiv.org/abs/2603.10814
- 2026-02-06, Halluverse-M^3: A multitask multilingual benchmark for hallucination in LLMs, Samir Abdaljalil et.al., Paper: http://arxiv.org/abs/2602.06920
- 2025-12-08, HalluShift++: Bridging Language and Vision through Internal Representation Shifts for Hierarchical Hallucinations in MLLMs, Sujoy Nath et.al., Paper: http://arxiv.org/abs/2512.07687
- 2026-01-26, HalluGuard: Demystifying Data-Driven and Reasoning-Driven Hallucinations in LLMs, Xinyue Zeng et.al., Paper: http://arxiv.org/abs/2601.18753
- 2025-12-04, HTR-ConvText: Leveraging Convolution and Textual Information for Handwritten Text Recognition, Pham Thach Thanh Truc et.al., Paper: http://arxiv.org/abs/2512.05021
- 2025-11-19, HSKBenchmark: Modeling and Benchmarking Chinese Second Language Acquisition in Large Language Models through Curriculum Tuning, Qihao Yang et.al., Paper: http://arxiv.org/abs/2511.15574
- 2025-12-10, HPM-KD: Hierarchical Progressive Multi-Teacher Framework for Knowledge Distillation and Efficient Model Compression, Gustavo Coelho Haase et.al., Paper: http://arxiv.org/abs/2512.09886
- 2026-03-19, HORNet: Task-Guided Frame Selection for Video Question Answering with Vision-Language Models, Xiangyu Bai et.al., Paper: http://arxiv.org/abs/2603.18850
- 2025-11-18, HMC: Learning Heterogeneous Meta-Control for Contact-Rich Loco-Manipulation, Lai Wei et.al., Paper: http://arxiv.org/abs/2511.14756
- 2026-03-24, HGNet: Scalable Foundation Model for Automated Knowledge Graph Generation from Scientific Literature, Devvrat Joshi et.al., Paper: http://arxiv.org/abs/2603.23136
- 2025-12-12, HFS: Holistic Query-Aware Frame Selection for Efficient Video Reasoning, Yiqing Yang et.al., Paper: http://arxiv.org/abs/2512.11534
- 2026-01-28, HESTIA: A Hessian-Guided Differentiable Quantization-Aware Training Framework for Extremely Low-Bit LLMs, Guoan Wang et.al., Paper: http://arxiv.org/abs/2601.20745
- 2026-01-27, HARMONI: Multimodal Personalization of Multi-User Human-Robot Interactions with LLMs, Jeanne Malécot et.al., Paper: http://arxiv.org/abs/2601.19839
- 2026-01-09, HAPS: Hierarchical LLM Routing with Joint Architecture and Parameter Search, Zihang Tian et.al., Paper: http://arxiv.org/abs/2601.05903
- 2026-01-20, HALT: Hallucination Assessment via Latent Testing, Rohan Bhatnagar et.al., Paper: http://arxiv.org/abs/2601.14210
- 2026-03-05, HALP: Detecting Hallucinations in Vision-Language Models without Generating a Single Token, Sai Akhil Kogilathota et.al., Paper: http://arxiv.org/abs/2603.05465
- 2026-02-24, HALO: A Unified Vision-Language-Action Model for Embodied Multimodal Chain-of-Thought Reasoning, Quanxin Shou et.al., Paper: http://arxiv.org/abs/2602.21157
- 2026-03-23, Gumbel Distillation for Parallel Text Generation, Chi Zhang et.al., Paper: http://arxiv.org/abs/2603.22216
- 2025-12-11, Guided Transfer Learning for Discrete Diffusion Models, Julian Kleutgens et.al., Paper: http://arxiv.org/abs/2512.10877
- 2025-11-24, Growing with the Generator: Self-paced GRPO for Video Generation, Rui Li et.al., Paper: http://arxiv.org/abs/2511.19356
- 2025-10-21, Grounding or Guessing? Visual Signals for Detecting Hallucinations in Sign Language Translation, Yasser Hamidullah et.al., Paper: http://arxiv.org/abs/2510.18439
- 2026-03-10, Grounding Synthetic Data Generation With Vision and Language Models, Ümit Mert Çağlar et.al., Paper: http://arxiv.org/abs/2603.09625
- 2026-01-22, Grounding Large Language Models in Reaction Knowledge Graphs for Synthesis Retrieval, Olga Bunkova et.al., Paper: http://arxiv.org/abs/2601.16038
- 2026-01-15, Grounding Agent Memory in Contextual Intent, Ruozhen Yang et.al., Paper: http://arxiv.org/abs/2601.10702
- 2026-04-02, GroundVTS: Visual Token Sampling in Multimodal Large Language Models for Video Temporal Grounding, Rong Fan et.al., Paper: http://arxiv.org/abs/2604.02093
- 2026-03-11, GroundCount: Grounding Vision-Language Models with Object Detection for Mitigating Counting Hallucinations, Boyuan Chen et.al., Paper: http://arxiv.org/abs/2603.10978
- 2025-11-18, Ground Truth Generation for Multilingual Historical NLP using LLMs, Clovis Gladstone et.al., Paper: http://arxiv.org/abs/2511.14688
- 2025-12-01, GrndCtrl: Grounding World Models via Self-Supervised Reward Alignment, Haoyang He et.al., Paper: http://arxiv.org/abs/2512.01952
- 2026-03-26, GridVAD: Open-Set Video Anomaly Detection via Spatial Reasoning over Stratified Frame Grids, Mohamed Eltahir et.al., Paper: http://arxiv.org/abs/2603.25467
- 2026-01-30, GrepRAG: An Empirical Study and Optimization of Grep-Like Retrieval for Code Completion, Baoyi Wang et.al., Paper: http://arxiv.org/abs/2601.23254
- 2025-10-28, Greedy Sampling Is Provably Efficient for RLHF, Di Wu et.al., Paper: http://arxiv.org/abs/2510.24700
- 2026-03-23, Greater accessibility can amplify discrimination in generative AI, Carolin Holtermann et.al., Paper: http://arxiv.org/abs/2603.22260
- 2025-10-22, Grasp Any Region: Towards Precise, Contextual Pixel Understanding for Multimodal LLMs, Haochen Wang et.al., Paper: http://arxiv.org/abs/2510.18876
- 2026-02-19, GraphThinker: Reinforcing Video Reasoning with Event Graph Thinking, Zixu Cheng et.al., Paper: http://arxiv.org/abs/2602.17555
- 2025-12-02, GraphFusion3D: Dynamic Graph Attention Convolution with Adaptive Cross-Modal Transformer for 3D Object Detection, Md Sohag Mia et.al., Paper: http://arxiv.org/abs/2512.02991
- 2025-11-19, Graph Rewriting Language as a Platform for Quantum Diagrammatic Calculi, Kayo Tei et.al., Paper: http://arxiv.org/abs/2511.15581
- 2025-12-18, Grammar-Forced Translation of Natural Language to Temporal Logic using LLMs, William English et.al., Paper: http://arxiv.org/abs/2512.16814
- 2026-01-02, Grading Handwritten Engineering Exams with Multimodal Large Language Models, Janez Perš et.al., Paper: http://arxiv.org/abs/2601.00730
- 2026-03-24, Good for the Planet, Bad for Me? Intended and Unintended Consequences of AI Energy Consumption Disclosure, Michael Klesel et.al., Paper: http://arxiv.org/abs/2603.23075
- 2026-02-16, Goldilocks RL: Tuning Task Difficulty to Escape Sparse Rewards for Reasoning, Ilia Mahrooghi et.al., Paper: http://arxiv.org/abs/2602.14868
- 2026-02-20, Going Down Memory Lane: Scaling Tokens for Video Stream Understanding with Dynamic KV-Cache Memory, Vatsal Agarwal et.al., Paper: http://arxiv.org/abs/2602.18434
- 2026-01-26, Goal-oriented Communication for Fast and Robust Robotic Fault Detection and Recovery, Shutong Chen et.al., Paper: http://arxiv.org/abs/2601.18765
- 2026-03-09, Global Cross-Modal Geo-Localization: A Million-Scale Dataset and a Physical Consistency Learning Framework, Yutong Hu et.al., Paper: http://arxiv.org/abs/2603.08491
- 2026-03-20, Gesture2Speech: How Far Can Hand Movements Shape Expressive Speech?, Lokesh Kumar et.al., Paper: http://arxiv.org/abs/2603.19831
- 2026-03-13, Geometry-Guided Camera Motion Understanding in VideoLLMs, Haoan Feng et.al., Paper: http://arxiv.org/abs/2603.13119
- 2026-01-02, Geometry of Reason: Spectral Signatures of Valid Mathematical Reasoning, Valentin Noël et.al., Paper: http://arxiv.org/abs/2601.00791
- 2026-03-10, GeoSolver: Scaling Test-Time Reasoning in Remote Sensing with Fine-Grained Process Supervision, Lang Sun et.al., Paper: http://arxiv.org/abs/2603.09551
- 2026-01-07, GeoReason: Aligning Thinking And Answering In Remote Sensing Vision-Language Models Via Logical Consistency Reinforcement Learning, Wenshuai Li et.al., Paper: http://arxiv.org/abs/2601.04118
- 2026-02-25, GeoDiv: Framework For Measuring Geographical Diversity In Text-To-Image Models, Abhipsa Basu et.al., Paper: http://arxiv.org/abs/2602.22120
- 2025-10-21, Genesis: Evolving Attack Strategies for LLM Web Agent Red-Teaming, Zheng Zhang et.al., Paper: http://arxiv.org/abs/2510.18314
- 2025-10-23, Generative Reasoning Recommendation via LLMs, Minjie Hong et.al., Paper: http://arxiv.org/abs/2510.20815
- 2025-12-04, Generative Neural Video Compression via Video Diffusion Prior, Qi Mao et.al., Paper: http://arxiv.org/abs/2512.05016
- 2025-12-19, Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs, Zhaolin Cai et.al., Paper: http://arxiv.org/abs/2512.17640
- 2025-12-18, Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning, Qihao Liu et.al., Paper: http://arxiv.org/abs/2512.16917
- 2025-10-28, Generative AI for Healthcare: Fundamentals, Challenges, and Perspectives, Gang Chen et.al., Paper: http://arxiv.org/abs/2510.24551
- 2026-01-15, Generative AI collective behavior needs an interactionist paradigm, Laura Ferrarotti et.al., Paper: http://arxiv.org/abs/2601.10567
- 2026-03-19, Generation Models Know Space: Unleashing Implicit 3D Priors for Scene Understanding, Xianjin Wu et.al., Paper: http://arxiv.org/abs/2603.19235
- 2026-03-18, Gender Disambiguation in Machine Translation: Diagnostic Evaluation in Decoder-Only Architectures, Chiara Manna et.al., Paper: http://arxiv.org/abs/2603.17952
- 2026-01-09, Gender Bias in LLMs: Preliminary Evidence from Shared Parenting Scenario in Czech Family Law, Jakub Harasta et.al., Paper: http://arxiv.org/abs/2601.05879
- 2025-12-22, GenEnv: Difficulty-Aligned Co-Evolution Between LLM Agents and Environment Simulators, Jiacheng Guo et.al., Paper: http://arxiv.org/abs/2512.19682
- 2026-03-02, GenDB: The Next Generation of Query Processing – Synthesized, Not Engineered, Jiale Lao et.al., Paper: http://arxiv.org/abs/2603.02081
- 2026-02-05, GenArena: How Can We Achieve Human-Aligned Evaluation for Visual Generation Tasks?, Ruihang Li et.al., Paper: http://arxiv.org/abs/2602.06013
- 2026-01-08, GenAI-DrawIO-Creator: A Framework for Automated Diagram Generation, Jinze Yu et.al., Paper: http://arxiv.org/abs/2601.05162
- 2025-12-08, GatedFWA: Linear Flash Windowed Attention with Gated Associative Memory, Jiaxu Liu et.al., Paper: http://arxiv.org/abs/2512.07782
- 2025-10-29, Gaperon: A Peppered English-French Generative Language Model Suite, Nathan Godey et.al., Paper: http://arxiv.org/abs/2510.25771
- 2026-01-26, Gained in Translation: Privileged Pairwise Judges Enhance Multilingual Reasoning, Lintang Sutawika et.al., Paper: http://arxiv.org/abs/2601.18722
- 2026-04-02, GaelEval: Benchmarking LLM Performance for Scottish Gaelic, Peter Devine et.al., Paper: http://arxiv.org/abs/2604.02135
- 2026-02-09, GSS: Gated Subspace Steering for Selective Memorization Mitigation in LLMs, Xuanqi Zhang et.al., Paper: http://arxiv.org/abs/2602.08901
- 2026-03-19, GSMem: 3D Gaussian Splatting as Persistent Spatial Memory for Zero-Shot Embodied Exploration and Reasoning, Yiren Lu et.al., Paper: http://arxiv.org/abs/2603.19137
- 2026-03-14, GRPO and Reflection Reward for Mathematical Reasoning in Large Language Models, Zhijie Wang et.al., Paper: http://arxiv.org/abs/2603.14041
- 2026-01-23, GRIP: Algorithm-Agnostic Machine Unlearning for Mixture-of-Experts via Geometric Router Constraints, Andy Zhu et.al., Paper: http://arxiv.org/abs/2601.16905
- 2025-12-17, GRAN-TED: Generating Robust, Aligned, and Nuanced Text Embedding for Diffusion Models, Bozhou Li et.al., Paper: http://arxiv.org/abs/2512.15560
- 2025-12-26, GQ-VAE: A gated quantized VAE for learning variable length tokens, Theo Datta et.al., Paper: http://arxiv.org/abs/2512.21913
- 2026-02-13, GPTZero: Robust Detection of LLM-Generated Texts, George Alexandru Adam et.al., Paper: http://arxiv.org/abs/2602.13042
- 2025-10-21, GPTFace: Generative Pre-training of Facial-Linguistic Transformer by Span Masking and Weakly Correlated Text-image Data, Yudong Li et.al., Paper: http://arxiv.org/abs/2510.18345
- 2026-04-01, GPT-NL Public Corpus: A Permissively Licensed, Dutch-First Dataset for LLM Pre-training, Jesse van Oort et.al., Paper: http://arxiv.org/abs/2604.00920
- 2026-02-17, GLM-5: from Vibe Coding to Agentic Engineering, GLM-5 Team et.al., Paper: http://arxiv.org/abs/2602.15763
- 2026-01-08, GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization, Shih-Yang Liu et.al., Paper: http://arxiv.org/abs/2601.05242
- 2026-01-27, GAVEL: Towards rule-based safety through activation monitoring, Shir Rozenfeld et.al., Paper: http://arxiv.org/abs/2601.19768
- 2026-03-10, GAST: Gradient-aligned Sparse Tuning of Large Language Models with Data-layer Selection, Kai Yao et.al., Paper: http://arxiv.org/abs/2603.09865
- 2025-11-26, G $^2$ VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning, Wenbo Hu et.al., Paper: http://arxiv.org/abs/2511.21688
- 2026-03-19, Functional Subspace Watermarking for Large Language Models, Zikang Ding et.al., Paper: http://arxiv.org/abs/2603.18793
- 2025-10-28, FunReason-MT Technical Report: Overcoming the Complexity Barrier in Multi-Turn Function Calling, Zengzhuang Xu et.al., Paper: http://arxiv.org/abs/2510.24645
- 2026-03-02, Frontier Models Can Take Actions at Low Probabilities, Alex Serrano et.al., Paper: http://arxiv.org/abs/2603.02202
- 2025-12-15, From Zipf’s Law to Neural Scaling through Heaps’ Law and Hilberg’s Hypothesis, Łukasz Dębowski et.al., Paper: http://arxiv.org/abs/2512.13491
- 2026-01-05, From XAI to Stories: A Factorial Study of LLM-Generated Explanation Quality, Fabian Lukassen et.al., Paper: http://arxiv.org/abs/2601.02224
- 2025-12-12, From Verification Burden to Trusted Collaboration: Design Goals for LLM-Assisted Literature Reviews, Brenda Nogueira et.al., Paper: http://arxiv.org/abs/2512.11661
- 2025-12-05, From Text to Returns: Using Large Language Models for Mutual Fund Portfolio Optimization and Risk-Adjusted Allocation, Abrar Hossain Mufakir Qamar Ansari Haziq Jeelani Monia Digra Fayeq Jeelani Syed et.al., Paper: http://arxiv.org/abs/2512.05907
- 2026-03-24, From Synthetic to Native: Benchmarking Multilingual Intent Classification in Logistics Customer Service, Haoyu He et.al., Paper: http://arxiv.org/abs/2603.23172
- 2026-01-15, From Single to Multi-Agent Reasoning: Advancing GeneGPT for Genomics QA, Kimia Abedini et.al., Paper: http://arxiv.org/abs/2601.10581
- 2026-01-13, From Rows to Reasoning: A Retrieval-Augmented Multimodal Framework for Spreadsheet Understanding, Anmol Gulati et.al., Paper: http://arxiv.org/abs/2601.08741
- 2025-10-21, From Retrieval to Generation: Unifying External and Parametric Knowledge for Medical Question Answering, Lei Li et.al., Paper: http://arxiv.org/abs/2510.18297
- 2026-03-24, From Questions to Trust Reports: A LLM-IR Framework for the TREC 2025 DRAGUN Track, Ignacy Alwasiak et.al., Paper: http://arxiv.org/abs/2603.23125
- 2025-10-21, From Quarter to All: Accelerating Speculative LLM Decoding via Floating-Point Exponent Remapping and Parameter Sharing, Yushu Zhao et.al., Paper: http://arxiv.org/abs/2510.18525
- 2026-03-06, From Prompting to Preference Optimization: A Comparative Study of LLM-based Automated Essay Scoring, Minh Hoang Nguyen et.al., Paper: http://arxiv.org/abs/2603.06424
- 2026-01-14, From Prompt to Protocol: Fast Charging Batteries with Large Language Models, Ge Lei et.al., Paper: http://arxiv.org/abs/2601.09626
- 2025-10-24, From Polyester Girlfriends to Blind Mice: Creating the First Pragmatics Understanding Benchmarks for Slovene, Mojca Brglez et.al., Paper: http://arxiv.org/abs/2510.21575
- 2026-03-16, From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation, Yibin Liu et.al., Paper: http://arxiv.org/abs/2603.15600
- 2026-03-07, From Passive Consumption to Active Interaction: Exploring Interactive LLM Scaffolding to Support Learning Engagement, Zixin Chen et.al., Paper: http://arxiv.org/abs/2603.07277
- 2026-01-15, From One-to-One to Many-to-Many: Dynamic Cross-Layer Injection for Deep Vision-Language Fusion, Cheng Chen et.al., Paper: http://arxiv.org/abs/2601.10710
- 2026-02-09, From Obstacles to Etiquette: Robot Social Navigation with VLM-Informed Path Selection, Zilin Fang et.al., Paper: http://arxiv.org/abs/2602.09002
- 2026-03-17, From Natural Language to Executable Option Strategies via Large Language Models, Haochen Luo et.al., Paper: http://arxiv.org/abs/2603.16434
- 2025-12-02, From Moderation to Mediation: Can LLMs Serve as Mediators in Online Flame Wars?, Dawei Li et.al., Paper: http://arxiv.org/abs/2512.03005
- 2025-11-06, From Model to Breach: Towards Actionable LLM-Generated Vulnerabilities Reporting, Cyril Vallez et.al., Paper: http://arxiv.org/abs/2511.04538
- 2025-12-11, From Macro to Micro: Benchmarking Microscopic Spatial Intelligence on Molecules via Vision-Language Models, Zongzhao Li et.al., Paper: http://arxiv.org/abs/2512.10867
- 2026-03-03, From Language to Action: Can LLM-Based Agents Be Used for Embodied Robot Cognition?, Shinas Shaji et.al., Paper: http://arxiv.org/abs/2603.03148
- 2026-03-19, From Inference Efficiency to Embodied Efficiency: Revisiting Efficiency Metrics for Vision-Language-Action Models, Zhuofan Li et.al., Paper: http://arxiv.org/abs/2603.19131
- 2025-12-22, From Indoor to Open World: Revealing the Spatial Reasoning Gap in MLLMs, Mingrui Wu et.al., Paper: http://arxiv.org/abs/2512.19683
- 2026-03-11, From Images to Words: Efficient Cross-Modal Knowledge Distillation to Language Models from Black-box Teachers, Ayan Sengupta et.al., Paper: http://arxiv.org/abs/2603.10877
- 2026-02-05, From Human-Human Collaboration to Human-Agent Collaboration: A Vision, Design Philosophy, and an Empirical Framework for Achieving Successful Partnerships Between Humans and LLM Agents, Bingsheng Yao et.al., Paper: http://arxiv.org/abs/2602.05987
- 2025-12-04, From Generated Human Videos to Physically Plausible Robot Trajectories, James Ni et.al., Paper: http://arxiv.org/abs/2512.05094
- 2026-01-26, From Fuzzy to Exact: The Halo Architecture for Infinite-Depth Reasoning via Rational Arithmetic, Hansheng Ren et.al., Paper: http://arxiv.org/abs/2601.18702
- 2025-12-18, From Facts to Conclusions : Integrating Deductive Reasoning in Retrieval-Augmented LLMs, Shubham Mishra et.al., Paper: http://arxiv.org/abs/2512.16795
- 2026-03-13, From Experiments to Expertise: Scientific Knowledge Consolidation for AI-Driven Computational Research, Haonan Huang et.al., Paper: http://arxiv.org/abs/2603.13191
- 2026-02-02, From Directions to Regions: Decomposing Activations in Language Models via Local Geometry, Or Shafran et.al., Paper: http://arxiv.org/abs/2602.02464
- 2026-03-10, From Data Statistics to Feature Geometry: How Correlations Shape Superposition, Lucas Prieto et.al., Paper: http://arxiv.org/abs/2603.09972
- 2025-12-01, From Atomic to Composite: Reinforcement Learning Enables Generalization in Complementary Reasoning, Sitao Cheng et.al., Paper: http://arxiv.org/abs/2512.01970
- 2026-03-12, Frequentist Consistency of Prior-Data Fitted Networks for Causal Inference, Valentyn Melnychuk et.al., Paper: http://arxiv.org/abs/2603.12037
- 2026-01-30, FourierSampler: Unlocking Non-Autoregressive Potential in Diffusion Language Models via Frequency-Guided Generation, Siyang He et.al., Paper: http://arxiv.org/abs/2601.23182
- 2025-12-01, Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling, Jack Cook et.al., Paper: http://arxiv.org/abs/2512.02010
- 2025-12-11, FoundationMotion: Auto-Labeling and Reasoning about Spatial Movement in Videos, Yulu Gan et.al., Paper: http://arxiv.org/abs/2512.10927
- 2026-01-05, FormationEval, an open multiple-choice benchmark for petroleum geoscience, Almaz Ermilov et.al., Paper: http://arxiv.org/abs/2601.02158
- 2025-11-20, Formal Abductive Latent Explanations for Prototype-Based Networks, Jules Soria et.al., Paper: http://arxiv.org/abs/2511.16588
- 2026-01-15, Form and Meaning in Intrinsic Multilingual Evaluations, Wessel Poelman et.al., Paper: http://arxiv.org/abs/2601.10580
- 2026-03-12, ForensicZip: More Tokens are Better but Not Necessary in Forensic Vision-Language Models, Yingxin Lai et.al., Paper: http://arxiv.org/abs/2603.12208
- 2025-10-22, Forbidden Sidon subsets of perfect difference sets, featuring a human-assisted proof, Boris Alexeev et.al., Paper: http://arxiv.org/abs/2510.19804
- 2025-12-16, Focus: A Streaming Concentration Architecture for Efficient Vision-Language Models, Chiyue Wei et.al., Paper: http://arxiv.org/abs/2512.14661
- 2026-02-04, Fluid Representations in Reasoning Models, Dmitrii Kharlapenko et.al., Paper: http://arxiv.org/abs/2602.04843
- 2025-12-09, Fluent Alignment with Disfluent Judges: Post-training for Lower-resource Languages, David Samuel et.al., Paper: http://arxiv.org/abs/2512.08777
- 2026-02-18, FlowPrefill: Decoupling Preemption from Prefill Scheduling Granularity to Mitigate Head-of-Line Blocking in LLM Serving, Chia-chi Hsieh et.al., Paper: http://arxiv.org/abs/2602.16603
- 2025-12-10, FlipLLM: Efficient Bit-Flip Attacks on Multimodal LLMs using Reinforcement Learning, Khurram Khalil et.al., Paper: http://arxiv.org/abs/2512.09872
- 2026-01-02, FlexSpec: Frozen Drafts Meet Evolving Targets in Edge-Cloud Collaborative LLM Speculative Decoding, Yuchen Li et.al., Paper: http://arxiv.org/abs/2601.00644
- 2026-04-01, FlexAI: A Multi-modal Solution for Delivering Personalized and Adaptive Fitness Interventions, Shivangi Agarwal et.al., Paper: http://arxiv.org/abs/2604.00968
- 2026-04-02, FlatAttention: Dataflow and Fabric Collectives Co-Optimization for Large Attention-Based Model Inference on Tile-Based Accelerators, Chi Zhang et.al., Paper: http://arxiv.org/abs/2604.02110
- 2025-12-23, FlashVLM: Text-Guided Visual Token Selection for Large Multimodal Models, Kaitong Cai et.al., Paper: http://arxiv.org/abs/2512.20561
- 2026-03-05, FlashAttention-4: Algorithm and Kernel Pipelining Co-Design for Asymmetric Hardware Scaling, Ted Zadouri et.al., Paper: http://arxiv.org/abs/2603.05451
- 2026-04-01, FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pretraining, Xiquan Li et.al., Paper: http://arxiv.org/abs/2604.01155
- 2026-01-29, FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale, Ajay Patel et.al., Paper: http://arxiv.org/abs/2601.22146
- 2026-01-06, Fine-tuning Small Language Models as Efficient Enterprise Search Relevance Labelers, Yue Kang et.al., Paper: http://arxiv.org/abs/2601.03211
- 2025-12-15, Fine-tuned LLM-based Code Migration Framework, Oleg Grynets et.al., Paper: http://arxiv.org/abs/2512.13515
- 2026-03-10, Fine-grained Motion Retrieval via Joint-Angle Motion Images and Token-Patch Late Interaction, Yao Zhang et.al., Paper: http://arxiv.org/abs/2603.09930
- 2025-12-29, Fine-Tuning LLMs with Fine-Grained Human Feedback on Text Spans, Sky CH-Wang et.al., Paper: http://arxiv.org/abs/2512.23693
- 2025-10-21, Fine-Tuned Thoughts: Leveraging Chain-of-Thought Reasoning for Industrial Asset Health Monitoring, Shuxin Lin et.al., Paper: http://arxiv.org/abs/2510.18817
- 2025-12-02, Fine-Tuned Large Language Models for Logical Translation: Reducing Hallucinations with Lang2Logic, Muyu Pan et.al., Paper: http://arxiv.org/abs/2512.02987
- 2025-12-09, Financial News Summarization: Can extractive methods still offer a true alternative to LLMs?, Nicolas Reche et.al., Paper: http://arxiv.org/abs/2512.08764
- 2026-03-19, FinTradeBench: A Financial Reasoning Benchmark for LLMs, Yogesh Agrawal et.al., Paper: http://arxiv.org/abs/2603.19225
- 2026-03-07, FinSheet-Bench: From Simple Lookups to Complex Reasoning, Where LLMs Break on Financial Spreadsheets, Jan Ravnik et.al., Paper: http://arxiv.org/abs/2603.07316
- 2023-11-14, FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets, Neng Wang et.al., Paper: http://arxiv.org/abs/2310.04793
- 2025-11-25, Fighting AI with AI: Leveraging Foundation Models for Assuring AI-Enabled Safety-Critical Systems, Anastasia Mavridou et.al., Paper: http://arxiv.org/abs/2511.20627
- 2026-03-18, Feeling the Space: Egomotion-Aware Video Representation for Efficient and Accurate 3D Scene Understanding, Shuyao Shi et.al., Paper: http://arxiv.org/abs/2603.17980
- 2026-03-04, FeedAIde: Guiding App Users to Submit Rich Feedback Reports by Asking Context-Aware Follow-Up Questions, Ali Ebrahimi Pourasad et.al., Paper: http://arxiv.org/abs/2603.04244
- 2025-10-21, FedDEAP: Adaptive Dual-Prompt Tuning for Multi-Domain Federated Learning, Yubin Zheng et.al., Paper: http://arxiv.org/abs/2510.18837
- 2026-02-10, Features as Rewards: Scalable Supervision for Open-Ended Tasks via Interpretability, Aaditya Vikram Prasad et.al., Paper: http://arxiv.org/abs/2602.10067
- 2025-10-21, FeClustRE: Hierarchical Clustering and Semantic Tagging of App Features from User Reviews, Max Tiessler et.al., Paper: http://arxiv.org/abs/2510.18799
- 2026-01-02, Fast-weight Product Key Memory, Tianyu Zhao et.al., Paper: http://arxiv.org/abs/2601.00671
- 2026-01-14, Fast-ThinkAct: Efficient Vision-Language-Action Reasoning via Verbalizable Latent Planning, Chi-Pin Huang et.al., Paper: http://arxiv.org/abs/2601.09708
- 2026-02-03, Fast-Slow Efficient Training for Multimodal Large Language Models via Visual Token Pruning, Dingkun Zhang et.al., Paper: http://arxiv.org/abs/2602.03815
- 2025-12-16, Fast and Accurate Causal Parallel Decoding using Jacobi Forcing, Lanxiang Hu et.al., Paper: http://arxiv.org/abs/2512.14681
- 2025-10-23, Fast Inference via Hierarchical Speculative Decoding, Clara Mohri et.al., Paper: http://arxiv.org/abs/2510.19705
- 2025-11-14, FarSkip-Collective: Unhobbling Blocking Communication in Mixture of Experts Models, Yonatan Dukler et.al., Paper: http://arxiv.org/abs/2511.11505
- 2026-03-09, Fanar-Sadiq: A Multi-Agent Architecture for Grounded Islamic QA, Ummar Abbas et.al., Paper: http://arxiv.org/abs/2603.08501
- 2026-01-05, Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling, Falcon LLM Team et.al., Paper: http://arxiv.org/abs/2601.02346
- 2026-02-10, Fake-HR1: Rethinking reasoning of vision language model for synthetic image detection, Changjiang Jiang et.al., Paper: http://arxiv.org/abs/2602.10042
- 2026-03-14, Faithful or Just Plausible? Evaluating the Faithfulness of Closed-Source LLMs in Medical Reasoning, Halimat Afolabi et.al., Paper: http://arxiv.org/abs/2603.13988
- 2026-03-24, Failure of contextual invariance in gender inference with large language models, Sagar Kumar et.al., Paper: http://arxiv.org/abs/2603.23485
- 2025-12-23, Fail Fast, Win Big: Rethinking the Drafting Strategy in Speculative Decoding via Diffusion LLMs, Rui Pan et.al., Paper: http://arxiv.org/abs/2512.20573
- 2025-12-04, Factuality and Transparency Are All RAG Needs! Self-Explaining Contrastive Evidence Re-ranking, Francielle Vargas et.al., Paper: http://arxiv.org/abs/2512.05012
- 2026-01-08, FaST: Efficient and Effective Long-Horizon Forecasting for Large-Scale Spatial-Temporal Graphs via Mixture-of-Experts, Yiji Zhao et.al., Paper: http://arxiv.org/abs/2601.05174
- 2026-03-09, FVG-PT: Adaptive Foreground View-Guided Prompt Tuning for Vision-Language Models, Haoyang Li et.al., Paper: http://arxiv.org/abs/2603.08708
- 2025-12-26, FUSCO: High-Performance Distributed Data Shuffling via Transformation-Communication Fusion, Zhuoran Zhu et.al., Paper: http://arxiv.org/abs/2512.22036
- 2026-01-30, FOCUS: DLLMs Know How to Tame Their Compute Bound, Kaihua Liang et.al., Paper: http://arxiv.org/abs/2601.23278
- 2026-03-14, FLUX: Data Worth Training On, Gowtham et.al., Paper: http://arxiv.org/abs/2603.13972
- 2026-01-07, FLEx: Language Modeling with Few-shot Language Explanations, Adar Avsian et.al., Paper: http://arxiv.org/abs/2601.04157
- 2026-03-20, FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization, Chiyu Ma et.al., Paper: http://arxiv.org/abs/2603.19835
- 2026-02-17, FAST-EQA: Efficient Embodied Question Answering with Global and Local Region Relevancy, Haochen Zhang et.al., Paper: http://arxiv.org/abs/2602.15813
- 2025-10-29, FARSIQA: Faithful and Advanced RAG System for Islamic Question Answering, Mohammad Aghajani Asl et.al., Paper: http://arxiv.org/abs/2510.25621
- 2025-10-27, FARMER: Flow AutoRegressive Transformer over Pixels, Guangting Zheng et.al., Paper: http://arxiv.org/abs/2510.23588
- 2025-12-12, Extending a Parliamentary Corpus with MPs’ Tweets: Automatic Annotation and Evaluation Using MultiParTweet, Mevlüt Bagci et.al., Paper: http://arxiv.org/abs/2512.11567
- 2025-12-22, Exploring the features used for summary evaluation by Human and GPT, Zahra Sadeghi et.al., Paper: http://arxiv.org/abs/2512.19620
- 2026-01-02, Exploring the Performance of Large Language Models on Subjective Span Identification Tasks, Alphaeus Dmonte et.al., Paper: http://arxiv.org/abs/2601.00736
- 2026-01-12, Exploring the Meta-level Reasoning of Large Language Models via a Tool-based Multi-hop Tabular Question Answering Task, Nick Ferguson et.al., Paper: http://arxiv.org/abs/2601.07696
- 2025-12-26, Exploring the Heterogeneity of Tabular Data: A Diversity-aware Data Generator via LLMs, Yafeng Tang et.al., Paper: http://arxiv.org/abs/2512.21915
- 2025-10-21, Exploring a Unified Vision-Centric Contrastive Alternatives on Multi-Modal Web Documents, Yiqi Lin et.al., Paper: http://arxiv.org/abs/2510.18703
- 2026-02-13, Exploring a New Competency Modeling Process with Large Language Models, Silin Du et.al., Paper: http://arxiv.org/abs/2602.13084
- 2025-12-22, Exploring Zero-Shot ACSA with Unified Meaning Representation in Chain-of-Thought Prompting, Filippos Ventirozos et.al., Paper: http://arxiv.org/abs/2512.19651
- 2025-12-10, Exploring Protein Language Model Architecture-Induced Biases for Antibody Comprehension, Mengren et.al., Paper: http://arxiv.org/abs/2512.09894
- 2025-10-21, Exploring Membership Inference Vulnerabilities in Clinical Large Language Models, Alexander Nemecek et.al., Paper: http://arxiv.org/abs/2510.18674
- 2025-10-23, Exploring Large Language Models for Access Control Policy Synthesis and Summarization, Adarsh Vatsa et.al., Paper: http://arxiv.org/abs/2510.20692
- 2026-01-14, Exploring Fine-Tuning for Tabular Foundation Models, Aditya Tanna et.al., Paper: http://arxiv.org/abs/2601.09654
- 2025-12-18, Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward, Peter Chen et.al., Paper: http://arxiv.org/abs/2512.16912
- 2025-12-17, Explaining the Reasoning of Large Language Models Using Attribution Graphs, Chase Walker et.al., Paper: http://arxiv.org/abs/2512.15663
- 2025-12-26, Explainable Statute Prediction via Attention-based Model and LLM Prompting, Sachin Pawar et.al., Paper: http://arxiv.org/abs/2512.21902
- 2025-10-30, ExpertFlow: Adaptive Expert Scheduling and Memory Coordination for Efficient MoE Inference, Zixu Shen et.al., Paper: http://arxiv.org/abs/2510.26730
- 2026-03-29, Expert Streaming: Accelerating Low-Batch MoE Inference via Multi-chiplet Architecture and Dynamic Expert Trajectory Scheduling, Songchen Ma et.al., Paper: http://arxiv.org/abs/2603.27624
- 2025-11-14, Experience-Guided Adaptation of Inference-Time Reasoning Strategies, Adam Stein et.al., Paper: http://arxiv.org/abs/2511.11519
- 2026-03-20, Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs, Wenjian Zhang et.al., Paper: http://arxiv.org/abs/2603.20046
- 2026-03-09, Exp-Force: Experience-Conditioned Pre-Grasp Force Selection with Vision-Language Models, Siqi Shang et.al., Paper: http://arxiv.org/abs/2603.08668
- 2026-02-12, ExStrucTiny: A Benchmark for Schema-Variable Structured Information Extraction from Document Images, Mathieu Sibue et.al., Paper: http://arxiv.org/abs/2602.12203
- 2025-10-30, Evontree: Ontology Rule-Guided Self-Evolution of Large Language Models, Mingchen Tu et.al., Paper: http://arxiv.org/abs/2510.26683
- 2026-03-10, Evolving Prompt Adaptation for Vision-Language Models, Enming Zhang et.al., Paper: http://arxiv.org/abs/2603.09493
- 2026-03-20, Evolving Jailbreaks: Automated Multi-Objective Long-Tail Attacks on Large Language Models, Wenjing Hong et.al., Paper: http://arxiv.org/abs/2603.20122
- 2025-10-28, Evolving Diagnostic Agents in a Virtual Clinical Environment, Pengcheng Qiu et.al., Paper: http://arxiv.org/abs/2510.24654
- 2025-12-04, Evolutionary Architecture Search through Grammar-Based Sequence Alignment, Adri Gómez Martín et.al., Paper: http://arxiv.org/abs/2512.04992
- 2025-11-20, Evolution Strategies at the Hyperscale, Bidipta Sarkar et.al., Paper: http://arxiv.org/abs/2511.16652
- 2026-03-12, EvoTok: A Unified Image Tokenizer via Residual Latent Evolution for Visual Understanding and Generation, Yan Li et.al., Paper: http://arxiv.org/abs/2603.12108
- 2025-11-06, Evo-1: Lightweight Vision-Language-Action Model with Preserved Semantic Alignment, Tao Lin et.al., Paper: http://arxiv.org/abs/2511.04555
- 2026-03-24, Evidence of political bias in search engines and language models before major elections, Íris Damião et.al., Paper: http://arxiv.org/abs/2603.23474
- 2026-03-14, EviAgent: Evidence-Driven Agent for Radiology Report Generation, Tuoshi Qi et.al., Paper: http://arxiv.org/abs/2603.13956
- 2026-01-05, EverMemOS: A Self-Organizing Memory Operating System for Structured Long-Horizon Reasoning, Chuanrui Hu et.al., Paper: http://arxiv.org/abs/2601.02163
- 2025-12-22, Event Extraction in Large Language Model, Bobo Li et.al., Paper: http://arxiv.org/abs/2512.19537
- 2026-01-27, Evaluation of Oncotimia: An LLM based system for supporting tumour boards, Luis Lorenzo et.al., Paper: http://arxiv.org/abs/2601.19899
- 2026-01-21, Evaluation of Large Language Models in Legal Applications: Challenges, Methods, and Future Directions, Yiran Hu et.al., Paper: http://arxiv.org/abs/2601.15267
- 2026-03-06, Evaluation of Deontic Conditional Reasoning in Large Language Models: The Case of Wason’s Selection Task, Hirohiko Abe et.al., Paper: http://arxiv.org/abs/2603.06416
- 2026-01-12, Evaluating the encoding competence of visual language models using uncommon actions, Chen Ling et.al., Paper: http://arxiv.org/abs/2601.07737
- 2025-10-30, Evaluating the Role of Verifiers in Test-Time Scaling for Legal Reasoning Tasks, Davide Romano et.al., Paper: http://arxiv.org/abs/2510.25623
- 2026-03-23, Evaluating the Reliability and Fidelity of Automated Judgment Systems of Large Language Models, Tom Biskupski et.al., Paper: http://arxiv.org/abs/2603.22214
- 2025-12-31, Evaluating the Impact of Compression Techniques on the Robustness of CNNs under Natural Corruptions, Itallo Patrick Castro Alves Da Silva et.al., Paper: http://arxiv.org/abs/2512.24971
- 2026-02-26, Evaluating Zero-Shot and One-Shot Adaptation of Small Language Models in Leader-Follower Interaction, Rafael R. Baptista et.al., Paper: http://arxiv.org/abs/2602.23312
- 2026-03-13, Evaluating VLMs’ Spatial Reasoning Over Robot Motion: A Step Towards Robot Planning with Motion Preferences, Wenxi Wu et.al., Paper: http://arxiv.org/abs/2603.13100
- 2025-11-07, Evaluating Subword Tokenization Techniques for Bengali: A Benchmark Study with BengaliBPE, Firoj Ahmmed Patwary et.al., Paper: http://arxiv.org/abs/2511.05324
- 2026-03-22, Evaluating Reasoning-Based Scaffolds for Human-AI Co-Annotation: The ReasonAlign Annotation Protocol, Smitha Muthya Sudheendra et.al., Paper: http://arxiv.org/abs/2603.21094
- 2025-11-13, Evaluating Prompting Strategies with MedGemma for Medical Order Extraction, Abhinand Balachandran et.al., Paper: http://arxiv.org/abs/2511.10583
- 2025-12-17, Evaluating Metrics for Safety with LLM-as-Judges, Kester Clegg et.al., Paper: http://arxiv.org/abs/2512.15617
- 2026-01-23, Evaluating Large Vision-language Models for Surgical Tool Detection, Nakul Poudel et.al., Paper: http://arxiv.org/abs/2601.16895
- 2025-10-21, Evaluating Large Language Models in detecting Secrets in Android Apps, Marco Alecci et.al., Paper: http://arxiv.org/abs/2510.18601
- 2025-12-17, Evaluating Large Language Models in Scientific Discovery, Zhangde Song et.al., Paper: http://arxiv.org/abs/2512.15567
- 2025-11-28, Evaluating LLMs for One-Shot Patching of Real and Artificial Vulnerabilities, Aayush Garg et.al., Paper: http://arxiv.org/abs/2511.23408
- 2026-03-19, Evaluating LLM-Generated Lessons from the Language Learning Students’ Perspective: A Short Case Study on Duolingo, Carlos Rafael Catalan et.al., Paper: http://arxiv.org/abs/2603.18873
- 2026-03-24, Evaluating LLM-Based Test Generation Under Software Evolution, Sabaat Haroon et.al., Paper: http://arxiv.org/abs/2603.23443
- 2025-10-21, Evaluating LLM-Based Mobile App Recommendations: An Empirical Study, Quim Motger et.al., Paper: http://arxiv.org/abs/2510.18364
- 2026-01-16, Evaluating LLM Behavior in Hiring: Implicit Weights, Fairness Across Groups, and Alignment with Human Preferences, Morgane Hoffmann et.al., Paper: http://arxiv.org/abs/2601.11379
- 2026-03-09, Evaluating Financial Intelligence in Large Language Models: Benchmarking SuperInvesting AI with LLM Engines, Akshay Gulati et.al., Paper: http://arxiv.org/abs/2603.08704
- 2026-03-20, Evaluating Evidence Grounding Under User Pressure in Instruction-Tuned Language Models, Sai Koneru et.al., Paper: http://arxiv.org/abs/2603.20162
- 2026-03-19, Evaluating Counterfactual Strategic Reasoning in Large Language Models, Dimitrios Georgousis et.al., Paper: http://arxiv.org/abs/2603.19167
- 2025-12-12, Evaluating Cooperative Resilience in Multiagent Systems: A Comparison Between Humans and LLMs, Manuela Chacon-Chamorro et.al., Paper: http://arxiv.org/abs/2512.11689
- 2026-03-25, Evaluating Chunking Strategies For Retrieval-Augmented Generation in Oil and Gas Enterprise Documents, Samuel Taiwo et.al., Paper: http://arxiv.org/abs/2603.24556
- 2026-02-19, Evaluating Chain-of-Thought Reasoning through Reusability and Verifiability, Shashank Aggarwal et.al., Paper: http://arxiv.org/abs/2602.17544
- 2025-12-03, Eval Factsheets: A Structured Framework for Documenting AI Evaluations, Florian Bordes et.al., Paper: http://arxiv.org/abs/2512.04062
- 2026-03-29, EvA: An Evidence-First Audio Understanding Paradigm for LALMs, Xinyuan Xie et.al., Paper: http://arxiv.org/abs/2603.27667
- 2025-12-05, Euclid Quick Data Release (Q1). From simulations to sky: Advancing machine-learning lens detection with real Euclid data, Euclid Collaboration et.al., Paper: http://arxiv.org/abs/2512.05899
- 2026-01-05, Estimating Text Temperature, Nikolay Mikhaylovskiy et.al., Paper: http://arxiv.org/abs/2601.02320
- 2025-11-26, Escaping the Verifier: Learning to Reason via Demonstrations, Locke Cai et.al., Paper: http://arxiv.org/abs/2511.21667
- 2026-03-25, Environment-Grounded Multi-Agent Workflow for Autonomous Penetration Testing, Michael Somma et.al., Paper: http://arxiv.org/abs/2603.24221
- 2026-02-23, Entropy in Large Language Models, Marco Scharringhausen et.al., Paper: http://arxiv.org/abs/2602.20052
- 2026-03-05, Ensembling Language Models with Sequential Monte Carlo, Robin Shing Moon Chan et.al., Paper: http://arxiv.org/abs/2603.05432
- 2025-11-13, Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling, Jiahao Wang et.al., Paper: http://arxiv.org/abs/2511.10648
- 2026-01-12, Enhancing Self-Correction in Large Language Models through Multi-Perspective Reflection, Mariana Costa et.al., Paper: http://arxiv.org/abs/2601.07780
- 2025-12-05, Enhancing Retrieval-Augmented Generation with Entity Linking for Educational Platforms, Francesco Granata et.al., Paper: http://arxiv.org/abs/2512.05967
- 2025-11-21, Enhancing Quranic Learning: A Multimodal Deep Learning Approach for Arabic Phoneme Recognition, Ayhan Kucukmanisa et.al., Paper: http://arxiv.org/abs/2511.17477
- 2025-10-21, Enhancing Hotel Recommendations with AI: LLM-Based Review Summarization and Query-Driven Insights, Nikolaos Belibasakis et.al., Paper: http://arxiv.org/abs/2510.18277
- 2026-03-25, Enhancing Efficiency and Performance in Deepfake Audio Detection through Neuron-level dropin & Neuroplasticity Mechanisms, Yupei Li et.al., Paper: http://arxiv.org/abs/2603.24343
- 2026-03-23, Enhancing Document-Level Machine Translation via Filtered Synthetic Corpora and Two-Stage LLM Adaptation, Ireh Kim et.al., Paper: http://arxiv.org/abs/2603.22186
- 2026-03-10, Enhancing Debunking Effectiveness through LLM-based Personality Adaptation, Pietro Dell’Oglio et.al., Paper: http://arxiv.org/abs/2603.09533
- 2026-02-17, Enhancing Building Semantics Preservation in AI Model Training with Large Language Model Encodings, Suhyung Jang et.al., Paper: http://arxiv.org/abs/2602.15791
- 2025-10-21, Engagement Undermines Safety: How Stereotypes and Toxicity Shape Humor in Language Models, Atharvan Dogra et.al., Paper: http://arxiv.org/abs/2510.18454
- 2026-03-17, EngGPT2: Sovereign, Efficient and Open Intelligence, G. Ciarfaglia et.al., Paper: http://arxiv.org/abs/2603.16430
- 2026-03-25, Enes Causal Discovery, Alexis Kafantaris et.al., Paper: http://arxiv.org/abs/2603.24436
- 2026-02-06, Endogenous Resistance to Activation Steering in Language Models, Alex McKenzie et.al., Paper: http://arxiv.org/abs/2602.06941
- 2026-03-12, EndoCoT: Scaling Endogenous Chain-of-Thought Reasoning in Diffusion Models, Xuanlang Dai et.al., Paper: http://arxiv.org/abs/2603.12252
- 2025-12-29, End-to-End Test-Time Training for Long Context, Arnuv Tandon et.al., Paper: http://arxiv.org/abs/2512.23675
- 2025-12-24, Encrypted Traffic Detection in Resource Constrained IoT Networks: A Diffusion Model and LLM Integrated Framework, Hongjuan Li et.al., Paper: http://arxiv.org/abs/2512.21144
- 2025-11-18, Encoding and Understanding Astrophysical Information in Large Language Model-Generated Summaries, Kiera McCormick et.al., Paper: http://arxiv.org/abs/2511.14685
- 2025-12-19, Enabling Disaggregated Multi-Stage MLLM Inference via GPU-Internal Scheduling and Resource Sharing, Lingxiao Zhao et.al., Paper: http://arxiv.org/abs/2512.17574
- 2026-01-06, Empowering Reliable Visual-Centric Instruction Following in MLLMs, Weilei He et.al., Paper: http://arxiv.org/abs/2601.03198
- 2026-01-23, Empowering Medical Equipment Sustainability in Low-Resource Settings: An AI-Powered Diagnostic and Support Platform for Biomedical Technicians, Bernes Lorier Atabonfack et.al., Paper: http://arxiv.org/abs/2601.16967
- 2025-10-23, Empathic Prompting: Non-Verbal Context Integration for Multimodal LLM Conversations, Lorenzo Stacchio et.al., Paper: http://arxiv.org/abs/2510.20743
- 2026-03-19, Empathetic Motion Generation for Humanoid Educational Robots via Reasoning-Guided Vision–Language–Motion Diffusion Architecture, Fuze Sun et.al., Paper: http://arxiv.org/abs/2603.18771
- 2026-01-12, Emotional Support Evaluation Framework via Controllable and Diverse Seeker Simulator, Chaewon Heo et.al., Paper: http://arxiv.org/abs/2601.07698
- 2025-10-27, Emotion-Coherent Reasoning for Multimodal LLMs via Emotional Rationale Verifier, Hyeongseop Rha et.al., Paper: http://arxiv.org/abs/2510.23506
- 2025-12-15, Embedding-Based Rankings of Educational Resources based on Learning Outcome Alignment: Benchmarking, Expert Validation, and Learner Performance, Mohammadreza Molavi et.al., Paper: http://arxiv.org/abs/2512.13658
- 2026-02-02, Embedding Perturbation may Better Reflect the Uncertainty in LLM Reasoning, Qihao Wen et.al., Paper: http://arxiv.org/abs/2602.02427
- 2026-02-11, Embedding Inversion via Conditional Masked Diffusion Language Models, Han Xiao et.al., Paper: http://arxiv.org/abs/2602.11047
- 2026-04-01, Embarrassingly Simple Self-Distillation Improves Code Generation, Ruixiang Zhang et.al., Paper: http://arxiv.org/abs/2604.01193
- 2026-03-12, EmbTracker: Traceable Black-box Watermarking for Federated Language Models, Haodong Zhao et.al., Paper: http://arxiv.org/abs/2603.12089
- 2026-03-10, EmbC-Test: How to Speed Up Embedded Software Testing Using LLMs and RAG, Maximilian Harnot et.al., Paper: http://arxiv.org/abs/2603.09497
- 2025-12-29, Eliciting Behaviors in Multi-Turn Conversations, Jing Huang et.al., Paper: http://arxiv.org/abs/2512.23701
- 2026-02-04, El Agente Estructural: An Artificially Intelligent Molecular Editor, Changhyeok Choi et.al., Paper: http://arxiv.org/abs/2602.04849
- 2025-10-27, EgoThinker: Unveiling Egocentric Reasoning with Spatio-Temporal CoT, Baoqi Pei et.al., Paper: http://arxiv.org/abs/2510.23569
- 2026-03-06, EgoReasoner: Learning Egocentric 4D Reasoning via Task-Adaptive Structured Thinking, Fangrui Zhu et.al., Paper: http://arxiv.org/abs/2603.06561
- 2026-03-12, EgoIntent: An Egocentric Step-level Benchmark for Understanding What, Why, and Next, Ye Pan et.al., Paper: http://arxiv.org/abs/2603.12147
- 2026-01-27, EgoHandICL: Egocentric 3D Hand Reconstruction with In-Context Learning, Binzhu Xie et.al., Paper: http://arxiv.org/abs/2601.19850
- 2025-12-31, Efficiently Estimating Data Efficiency for Language Model Fine-tuning, Gyung Hyun Je et.al., Paper: http://arxiv.org/abs/2512.24991
- 2025-10-21, EfficientNav: Towards On-Device Object-Goal Navigation with Navigation Map Caching and Retrieval, Zebin Yang et.al., Paper: http://arxiv.org/abs/2510.18546
- 2026-02-09, Efficient and Stable Reinforcement Learning for Diffusion Language Models, Jiawei Liu et.al., Paper: http://arxiv.org/abs/2602.08905
- 2026-03-18, Efficient Training-Free Multi-Token Prediction via Embedding-Space Probing, Raghavv Goel et.al., Paper: http://arxiv.org/abs/2603.17942
- 2026-03-18, Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control, Hao Ma et.al., Paper: http://arxiv.org/abs/2603.17468
- 2026-03-04, Efficient Refusal Ablation in LLM through Optimal Transport, Geraldin Nanfack et.al., Paper: http://arxiv.org/abs/2603.04355
- 2026-03-17, Efficient Reasoning on the Edge, Yelysei Bondarenko et.al., Paper: http://arxiv.org/abs/2603.16867
- 2026-03-18, Efficient Exploration at Scale, Seyed Mohammad Asghari et.al., Paper: http://arxiv.org/abs/2603.17378
- 2026-02-03, Efficient Estimation of Kernel Surrogate Models for Task Attribution, Zhenshuo Zhang et.al., Paper: http://arxiv.org/abs/2602.03783
- 2026-03-09, Efficient Credal Prediction through Decalibration, Paul Hofman et.al., Paper: http://arxiv.org/abs/2603.08495
- 2025-12-10, Efficient Continual Learning in Neural Machine Translation: A Low-Rank Adaptation Approach, Salvador Carrión et.al., Paper: http://arxiv.org/abs/2512.09910
- 2025-02-19, Efficient Alignment of Large Language Models via Data Sampling, Amrit Khera et.al., Paper: http://arxiv.org/abs/2411.10545
- 2025-10-21, EffiReasonTrans: RL-Optimized Reasoning for Code Translation, Yanlin Wang et.al., Paper: http://arxiv.org/abs/2510.18863
- 2026-03-16, Effective Distillation to Hybrid xLSTM Architectures, Lukas Hauzenberger et.al., Paper: http://arxiv.org/abs/2603.15590
- 2025-12-05, EditThinker: Unlocking Iterative Reasoning for Any Image Editor, Hongyu Li et.al., Paper: http://arxiv.org/abs/2512.05965
- 2026-02-03, Edge-Optimized Vision-Language Models for Underground Infrastructure Assessment, Johny J. Lopez et.al., Paper: http://arxiv.org/abs/2602.03742
- 2026-03-26, EcoThink: A Green Adaptive Inference Framework for Sustainable and Accessible Agents, Linxiao Li et.al., Paper: http://arxiv.org/abs/2603.25498
- 2025-11-04, EasyTUS: A Comprehensive Framework for Fast and Accurate Table Union Search across Data Lakes, Tim Otto et.al., Paper: http://arxiv.org/abs/2511.02674
- 2025-12-16, EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models, Zechen Bai et.al., Paper: http://arxiv.org/abs/2512.14666
- 2025-10-24, EU-Agent-Bench: Measuring Illegal Behavior of LLM Agents Under EU Law, Ilija Lichkovski et.al., Paper: http://arxiv.org/abs/2510.21524
- 2026-02-10, ESTAR: Early-Stopping Token-Aware Reasoning For Efficient Inference, Junda Wang et.al., Paper: http://arxiv.org/abs/2602.10004
- 2026-03-13, ESPIRE: A Diagnostic Benchmark for Embodied Spatial Reasoning of Vision-Language Models, Yanpeng Zhao et.al., Paper: http://arxiv.org/abs/2603.13033
- 2026-03-13, ESG-Bench: Benchmarking Long-Context ESG Reports for Hallucination Mitigation, Siqi Sun et.al., Paper: http://arxiv.org/abs/2603.13154
- 2026-03-06, ESAA-Security: An Event-Sourced, Verifiable Architecture for Agent-Assisted Security Audits of AI-Generated Code, Elzo Brito dos Santos Filho et.al., Paper: http://arxiv.org/abs/2603.06365
- 2026-01-05, ELLA: Efficient Lifelong Learning for Adapters in Large Language Models, Shristi Das Biswas et.al., Paper: http://arxiv.org/abs/2601.02232
- 2025-10-29, EHR-R1: A Reasoning-Enhanced Foundational Language Model for Electronic Health Record Analysis, Yusheng Liao et.al., Paper: http://arxiv.org/abs/2510.25628
- 2026-01-29, ECO: Quantized Training without Full-Precision Master Weights, Mahdi Nikdan et.al., Paper: http://arxiv.org/abs/2601.22101
- 2026-03-22, ECI: Effective Contrastive Information to Evaluate Hard-Negatives, Aarush Sinha et.al., Paper: http://arxiv.org/abs/2603.20990
- 2025-10-21, ECG-LLM– training and evaluation of domain-specific large language models for electrocardiography, Lara Ahrens et.al., Paper: http://arxiv.org/abs/2510.18339
- 2026-03-31, EC-Bench: Enumeration and Counting Benchmark for Ultra-Long Videos, Fumihiko Tsuchiya et.al., Paper: http://arxiv.org/abs/2603.29943
- 2025-10-29, E-Scores for (In)Correctness Assessment of Generative Model Outputs, Guneet S. Dhillon et.al., Paper: http://arxiv.org/abs/2510.25770
- 2026-03-11, Dynamics-Predictive Sampling for Active RL Finetuning of Large Reasoning Models, Yixiu Mao et.al., Paper: http://arxiv.org/abs/2603.10887
- 2025-11-04, Dynamic Reflections: Probing Video Representations with Text Alignment, Tyler Zhu et.al., Paper: http://arxiv.org/abs/2511.02767
- 2025-12-17, Dynamic Rebatching for Efficient Early-Exit Inference with DREX, Xuting Liu et.al., Paper: http://arxiv.org/abs/2512.15705
- 2026-02-25, Dynamic Personality Adaptation in Large Language Models via State Machines, Leon Pielage et.al., Paper: http://arxiv.org/abs/2602.22157
- 2026-03-10, Dynamic Multimodal Expression Generation for LLM-Driven Pedagogical Agents: From User Experience Perspective, Ninghao Wan et.al., Paper: http://arxiv.org/abs/2603.09536
- 2026-01-29, DynaWeb: Model-Based Reinforcement Learning of Web Agents, Hang Ding et.al., Paper: http://arxiv.org/abs/2601.22149
- 2025-12-10, DynaIP: Dynamic Image Prompt Adapter for Scalable Zero-shot Personalized Text-to-Image Generation, Zhizhong Wang et.al., Paper: http://arxiv.org/abs/2512.09814
- 2026-02-05, DyTopo: Dynamic Topology Routing for Multi-Agent Reasoning via Semantic Matching, Yuxing Lu et.al., Paper: http://arxiv.org/abs/2602.06039
- 2026-02-25, DySCO: Dynamic Attention-Scaling Decoding for Long-Context LMs, Xi Ye et.al., Paper: http://arxiv.org/abs/2602.22175
- 2025-12-11, DuetSVG: Unified Multimodal SVG Generation with Internal Visual Guidance, Peiying Zhang et.al., Paper: http://arxiv.org/abs/2512.10894
- 2026-03-23, DualCoT-VLA: Visual-Linguistic Chain of Thought via Parallel Reasoning for Vision-Language-Action Models, Zhide Zhong et.al., Paper: http://arxiv.org/abs/2603.22280
- 2026-02-20, Dual-Tree LLM-Enhanced Negative Sampling for Implicit Collaborative Filtering, Jiayi Wu et.al., Paper: http://arxiv.org/abs/2602.18249
- 2026-04-01, Dual Optimal: Make Your LLM Peer-like with Dignity, Xiangqi Wang et.al., Paper: http://arxiv.org/abs/2604.00979
- 2025-12-16, Dual Language Models: Balancing Training Efficiency and Overfitting Resilience, David Samuel et.al., Paper: http://arxiv.org/abs/2512.14549
- 2025-12-26, DuaDeep-SeqAffinity: Dual-Stream Deep Learning Framework for Sequence-Only Antigen-Antibody Affinity Prediction, Aicha Boutorh et.al., Paper: http://arxiv.org/abs/2512.22007
- 2026-03-19, DriftGuard: Mitigating Asynchronous Data Drift in Federated Learning, Yizhou Han et.al., Paper: http://arxiv.org/abs/2603.18872
- 2026-02-02, Drift-Bench: Diagnosing Cooperative Breakdowns in LLM Agents under Input Faults via Multi-Turn Interaction, Han Bao et.al., Paper: http://arxiv.org/abs/2602.02455
- 2026-03-17, DreamPlan: Efficient Reinforcement Fine-Tuning of Vision-Language Planners via Video World Models, Emily Yue-Ting Jia et.al., Paper: http://arxiv.org/abs/2603.16860
- 2025-12-04, DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation, Dongzhi Jiang et.al., Paper: http://arxiv.org/abs/2512.05112
- 2025-11-21, Downscaling Intelligence: Exploring Perception and Reasoning Bottlenecks in Small Multimodal Models, Mark Endo et.al., Paper: http://arxiv.org/abs/2511.17487
- 2026-01-09, Don’t Break the Cache: An Evaluation of Prompt Caching for Long-Horizon Agentic Tasks, Elias Lumer et.al., Paper: http://arxiv.org/abs/2601.06007
- 2025-10-29, Don’t Blind Your VLA: Aligning Visual Representations for OOD Generalization, Nikita Kachaev et.al., Paper: http://arxiv.org/abs/2510.25616
- 2026-01-20, Domain-Adaptation through Synthetic Data: Fine-Tuning Large Language Models for German Law, Ali Hamza Bashir et.al., Paper: http://arxiv.org/abs/2601.14160
- 2026-03-11, Does AI See like Art Historians? Interpreting How Vision Language Models Recognize Artistic Style, Marvin Limpijankit et.al., Paper: http://arxiv.org/abs/2603.11024
- 2025-11-14, DocLens : A Tool-Augmented Multi-Agent Framework for Long Visual Document Understanding, Dawei Zhu et.al., Paper: http://arxiv.org/abs/2511.11552
- 2025-12-15, Do-Undo: Generating and Reversing Physical Actions in Vision-Language Models, Shweta Mahajan et.al., Paper: http://arxiv.org/abs/2512.13609
- 2026-01-16, Do explanations generalize across large reasoning models?, Koyena Pal et.al., Paper: http://arxiv.org/abs/2601.11517
- 2026-03-10, Do What I Say: A Spoken Prompt Dataset for Instruction-Following, Maike Züfle et.al., Paper: http://arxiv.org/abs/2603.09881
- 2026-01-29, Do VLMs Perceive or Recall? Probing Visual Perception vs. Memory with Classic Visual Illusions, Xiaoxiao Sun et.al., Paper: http://arxiv.org/abs/2601.22150
- 2026-03-19, Do VLMs Need Vision Transformers? Evaluating State Space Models as Vision Encoders, Shang-Jui Ray Kuo et.al., Paper: http://arxiv.org/abs/2603.19209
- 2026-02-23, Do Large Language Models Understand Data Visualization Rules?, Martin Sinnona et.al., Paper: http://arxiv.org/abs/2602.20137
- 2026-02-23, Do Large Language Models Understand Data Visualization Principles?, Martin Sinnona et.al., Paper: http://arxiv.org/abs/2602.20084
- 2026-02-27, Do LLMs Benefit From Their Own Words?, Jenny Y. Huang et.al., Paper: http://arxiv.org/abs/2602.24287
- 2025-12-08, Do Generalisation Results Generalise?, Matteo Boglioni et.al., Paper: http://arxiv.org/abs/2512.07832
- 2026-03-06, Do Foundation Models Know Geometry? Probing Frozen Features for Continuous Physical Measurement, Yakov Pyotr Shkolnikov et.al., Paper: http://arxiv.org/abs/2603.06459
- 2025-11-05, Do Androids Dream of Unseen Puppeteers? Probing for a Conspiracy Mindset in Large Language Models, Francesco Corso et.al., Paper: http://arxiv.org/abs/2511.03699
- 2026-02-11, Divide, Harmonize, Then Conquer It: Shooting Multi-Commodity Flow Problems with Multimodal Language Models, Xinyu Yuan et.al., Paper: http://arxiv.org/abs/2602.11057
- 2025-11-25, Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization, Tahira Kazimi et.al., Paper: http://arxiv.org/abs/2511.20647
- 2025-12-29, Divergent-Convergent Thinking in Large Language Models for Creative Problem Generation, Manh Hung Nguyen et.al., Paper: http://arxiv.org/abs/2512.23601
- 2025-12-02, Distribution-Calibrated Inference time compute for Thinking LLM-as-a-Judge, Hamid Dadkhahi et.al., Paper: http://arxiv.org/abs/2512.03019
- 2025-12-08, Distribution Matching Variational AutoEncoder, Sen Ye et.al., Paper: http://arxiv.org/abs/2512.07778
- 2026-03-05, Distributed Partial Information Puzzles: Examining Common Ground Construction Under Epistemic Asymmetry, Yifan Zhu et.al., Paper: http://arxiv.org/abs/2603.05450
- 2026-01-08, Distilling the Thought, Watermarking the Answer: A Principle Semantic Guided Watermark for Large Reasoning Models, Shuliang Liu et.al., Paper: http://arxiv.org/abs/2601.05144
- 2025-10-28, Dissecting Role Cognition in Medical LLMs via Neuronal Ablation, Xun Liang et.al., Paper: http://arxiv.org/abs/2510.24677
- 2026-01-28, Dissecting Multimodal In-Context Learning: Modality Asymmetries and Circuit Dynamics in modern Transformers, Yiran Huang et.al., Paper: http://arxiv.org/abs/2601.20796
- 2026-01-14, Disentangling Task Conflicts in Multi-Task LoRA via Orthogonal Gradient Projection, Ziyu Yang et.al., Paper: http://arxiv.org/abs/2601.09684
- 2025-11-05, Disentangled Concepts Speak Louder Than Words:Explainable Video Action Recognition, Jongseo Lee et.al., Paper: http://arxiv.org/abs/2511.03725
- 2026-01-09, Discriminative-Generative Target Speaker Extraction with Decoder-Only Language Models, Bang Zeng et.al., Paper: http://arxiv.org/abs/2601.06006
- 2026-02-10, Discovering High Level Patterns from Simulation Traces, Sean Memery et.al., Paper: http://arxiv.org/abs/2602.10009
- 2026-03-21, DiscoUQ: Structured Disagreement Analysis for Uncertainty Quantification in LLM Agent Ensembles, Bo Jiang et.al., Paper: http://arxiv.org/abs/2603.20975
- 2026-02-06, Directing Space: Rehearsing Architecture as Performer with Explainable AI, Pavlos Panagiotidis et.al., Paper: http://arxiv.org/abs/2602.06915
- 2025-12-03, DirectDrag: High-Fidelity, Mask-Free, Prompt-Free Drag-based Image Editing via Readout-Guided Feature Alignment, Sheng-Hao Liao et.al., Paper: http://arxiv.org/abs/2512.03981
- 2026-02-09, DirMoE: Dirichlet-routed Mixture of Experts, Amirhossein Vahidi et.al., Paper: http://arxiv.org/abs/2602.09001
- 2025-12-17, DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models, Lunbin Zeng et.al., Paper: http://arxiv.org/abs/2512.15713
- 2026-02-11, Diffusion-Pretrained Dense and Contextual Embeddings, Sedigheh Eslami et.al., Paper: http://arxiv.org/abs/2602.11151
- 2026-01-07, Diffusion-DRF: Differentiable Reward Flow for Video Diffusion Fine-Tuning, Yifan Wang et.al., Paper: http://arxiv.org/abs/2601.04153
- 2025-12-31, Diffusion Language Models are Provably Optimal Parallel Samplers, Haozhe Jiang et.al., Paper: http://arxiv.org/abs/2512.25014
- 2025-10-28, Diffusion LLM with Native Variable Generation Lengths: Let [EOS] Lead the Way, Yicun Yang et.al., Paper: http://arxiv.org/abs/2510.24605
- 2026-03-18, Differential Privacy in Generative AI Agents: Analysis and Optimal Tradeoffs, Ya-Ting Yang et.al., Paper: http://arxiv.org/abs/2603.17902
- 2026-03-17, Differential Harm Propensity in Personalized LLM Agents: The Curious Case of Mental Health Disclosure, Caglar Yildirim et.al., Paper: http://arxiv.org/abs/2603.16734
- 2026-02-19, Differences in Typological Alignment in Language Models’ Treatment of Differential Argument Marking, Iskar Deng et.al., Paper: http://arxiv.org/abs/2602.17653
- 2026-01-06, DiffBench Meets DiffAgent: End-to-End LLM-Driven Diffusion Acceleration Code Generation, Jiajun jiao et.al., Paper: http://arxiv.org/abs/2601.03178
- 2026-01-14, Dialogue Telemetry: Turn-Level Instrumentation for Autonomous Information Gathering, Dimitris Panagopoulos et.al., Paper: http://arxiv.org/abs/2601.09570
- 2025-10-31, DialectalArabicMMLU: Benchmarking Dialectal Capabilities in Arabic and Multilingual Language Models, Malik H. Altakrori et.al., Paper: http://arxiv.org/abs/2510.27543
- 2025-10-29, DiagramEval: Evaluating LLM-Generated Diagrams via Graphs, Chumeng Liang et.al., Paper: http://arxiv.org/abs/2510.25761
- 2025-10-23, Diagnosing Visual Reasoning: Challenges, Insights, and a Path Forward, Jing Bi et.al., Paper: http://arxiv.org/abs/2510.20696
- 2026-03-05, DiSCTT: Consensus-Guided Self-Curriculum for Efficient Test-Time Adaptation in Reasoning, Mohammad Mahdi Moradi et.al., Paper: http://arxiv.org/abs/2603.05357
- 2025-11-25, DiFR: Inference Verification Despite Nondeterminism, Adam Karvonen et.al., Paper: http://arxiv.org/abs/2511.20621
- 2026-01-22, DextER: Language-driven Dexterous Grasp Generation with Embodied Reasoning, Junha Lee et.al., Paper: http://arxiv.org/abs/2601.16046
- 2026-03-13, Developing the PsyCogMetrics AI Lab to Evaluate Large Language Models and Advance Cognitive Science – A Three-Cycle Action Design Science Study, Zhiye Jin et.al., Paper: http://arxiv.org/abs/2603.13126
- 2025-12-11, Developing and Evaluating a Large Language Model-Based Automated Feedback System Grounded in Evidence-Centered Design for Supporting Physics Problem Solving, Holger Maus et.al., Paper: http://arxiv.org/abs/2512.10785
- 2026-03-08, Deterministic Fuzzy Triage for Legal Compliance Classification and Evidence Retrieval, Rian Atri et.al., Paper: http://arxiv.org/abs/2603.07390
- 2026-03-21, Detection of adversarial intent in Human-AI teams using LLMs, Abed K. Musaffar et.al., Paper: http://arxiv.org/abs/2603.20976
- 2026-03-18, Detecting the Machine: A Comprehensive Benchmark of AI-Generated Text Detectors Across Architectures, Domains, and Adversarial Conditions, Madhav S. Baidya et.al., Paper: http://arxiv.org/abs/2603.17522
- 2026-01-15, Detecting Winning Arguments with Large Language Models and Persuasion Strategies, Tiziano Labruna et.al., Paper: http://arxiv.org/abs/2601.10660
- 2026-03-17, Detecting Sentiment Steering Attacks on RAG-enabled Large Language Models, Isha Andrade et.al., Paper: http://arxiv.org/abs/2603.16342
- 2026-01-02, Detecting Performance Degradation under Data Shift in Pathology Vision-Language Model, Hao Guan et.al., Paper: http://arxiv.org/abs/2601.00716
- 2026-02-12, Detecting Overflow in Compressed Token Representations for Retrieval-Augmented Generation, Julia Belikova et.al., Paper: http://arxiv.org/abs/2602.12235
- 2026-03-20, Detached Skip-Links and $R$ -Probe: Decoupling Feature Aggregation from Gradient Propagation for MLLM OCR, Ziye Yuan et.al., Paper: http://arxiv.org/abs/2603.20020
- 2026-03-24, DetPO: In-Context Learning with Multi-Modal LLMs for Few-Shot Object Detection, Gautam Rajendrakumar Gare et.al., Paper: http://arxiv.org/abs/2603.23455
- 2026-03-24, Designing Agentic AI-Based Screening for Portfolio Investment, Mehmet Caner et.al., Paper: http://arxiv.org/abs/2603.23300
- 2026-01-26, Design Techniques for LLM-Powered Interactive Storytelling: A Case Study of the Dramamancer System, Tiffany Wang et.al., Paper: http://arxiv.org/abs/2601.18785
- 2026-03-24, Describe-Then-Act: Proactive Agent Steering via Distilled Language-Action World Models, Massimiliano Pappa et.al., Paper: http://arxiv.org/abs/2603.23149
- 2025-12-08, Depth-Wise Activation Steering for Honest Language Models, Gracjan Góral et.al., Paper: http://arxiv.org/abs/2512.07667
- 2026-01-26, Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory, Yanming Liu et.al., Paper: http://arxiv.org/abs/2601.18771
- 2025-12-12, DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry, Zhenyang Cai et.al., Paper: http://arxiv.org/abs/2512.11558
- 2026-03-03, Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals, Patrick Gerard et.al., Paper: http://arxiv.org/abs/2603.03242
- 2025-11-07, Dense Motion Captioning, Shiyao Xu et.al., Paper: http://arxiv.org/abs/2511.05369
- 2024-02-20, Demystifying Instruction Mixing for Fine-tuning Large Language Models, Renxi Wang et.al., Paper: http://arxiv.org/abs/2312.10793
- 2026-03-26, Demographic Fairness in Multimodal LLMs: A Benchmark of Gender and Ethnicity Bias in Face Verification, Ünsal Öztürk et.al., Paper: http://arxiv.org/abs/2603.25613
- 2026-03-14, Demand-Driven Context: A Methodology for Building Enterprise Knowledge Bases Through Agent Failure, Raj Navakoti et.al., Paper: http://arxiv.org/abs/2603.14057
- 2025-10-21, DelvePO: Direction-Guided Self-Evolving Framework for Flexible Prompt Optimization, Tao Tao et.al., Paper: http://arxiv.org/abs/2510.18257
- 2026-03-13, Delta1 with LLM: symbolic and neural integration for credible and explainable reasoning, Yang Xu et.al., Paper: http://arxiv.org/abs/2603.12953
- 2025-10-30, Delegated Authorization for Agents Constrained to Semantic Task-to-Scope Matching, Majed El Helou et.al., Paper: http://arxiv.org/abs/2510.26702
- 2026-01-22, Deja Vu in Plots: Leveraging Cross-Session Evidence with Retrieval-Augmented LLMs for Live Streaming Risk Assessment, Yiran Qiao et.al., Paper: http://arxiv.org/abs/2601.16027
- 2025-12-10, Defining Cost Function of Steganography with Large Language Models, Hanzhou Wu et.al., Paper: http://arxiv.org/abs/2512.09769
- 2026-01-15, Defending Large Language Models Against Jailbreak Attacks via In-Decoding Safety-Awareness Probing, Yinzhi Zhao et.al., Paper: http://arxiv.org/abs/2601.10543
- 2025-10-30, Defeating the Training-Inference Mismatch via FP16, Penghui Qi et.al., Paper: http://arxiv.org/abs/2510.26788
- 2025-10-21, DeepTx: Real-Time Transaction Risk Analysis via Multi-Modal Features and LLM Reasoning, Yixuan Liu et.al., Paper: http://arxiv.org/abs/2510.18438
- 2025-12-10, DeepSeek’s WEIRD Behavior: The cultural alignment of Large Language Models and the effects of prompt language and cultural prompting, James Luther et.al., Paper: http://arxiv.org/abs/2512.09772
- 2026-02-09, DeepQuali: Initial results of a study on the use of large language models for assessing the quality of user stories, Adam Trendowicz et.al., Paper: http://arxiv.org/abs/2602.08887
- 2025-11-10, DeepPersona: A Generative Engine for Scaling Deep Synthetic Personas, Zhen Wang et.al., Paper: http://arxiv.org/abs/2511.07338
- 2026-01-30, Deep Search with Hierarchical Meta-Cognitive Monitoring Inspired by Cognitive Neuroscience, Zhongxiang Sun et.al., Paper: http://arxiv.org/abs/2601.23188
- 2025-10-27, Deductive Chain-of-Thought Augmented Socially-aware Robot Navigation World Model, Weizheng Wang et.al., Paper: http://arxiv.org/abs/2510.23509
- 2026-01-06, Decoupling the Effect of Chain-of-Thought Reasoning: A Human Label Variation Perspective, Beiduo Chen et.al., Paper: http://arxiv.org/abs/2601.03154
- 2026-02-06, Decoupling Variance and Scale-Invariant Updates in Adaptive Gradient Descent for Unified Vector and Matrix Optimization, Zitao Song et.al., Paper: http://arxiv.org/abs/2602.06880
- 2026-02-10, Decoupled Reasoning with Implicit Fact Tokens (DRIFT): A Dual-Model Framework for Efficient Long-Context Inference, Wenxuan Xie et.al., Paper: http://arxiv.org/abs/2602.10021
- 2025-10-29, Decomposition-Enhanced Training for Post-Hoc Attributions In Language Models, Sriram Balasubramaniam et.al., Paper: http://arxiv.org/abs/2510.25766
- 2026-02-04, Decomposed Prompting Does Not Fix Knowledge Gaps, But Helps Models Say “I Don’t Know”, Dhruv Madhwal et.al., Paper: http://arxiv.org/abs/2602.04853
- 2026-03-17, Decoding the Critique Mechanism in Large Reasoning Models, Hoang Phan et.al., Paper: http://arxiv.org/abs/2603.16331
- 2026-02-20, Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers, Xiaotong Ji et.al., Paper: http://arxiv.org/abs/2602.18292
- 2026-02-17, Decision Quality Evaluation Framework at Pinterest, Yuqi Tian et.al., Paper: http://arxiv.org/abs/2602.15809
- 2026-01-06, Decentralized Autoregressive Generation, Stepan Maschan et.al., Paper: http://arxiv.org/abs/2601.03184
- 2026-03-18, DebugLM: Learning Traceable Training Data Provenance for LLMs, Wenjie Jacky Mo et.al., Paper: http://arxiv.org/abs/2603.17884
- 2025-12-04, Debt, Growth, and the Carbon Lock-In, Silvia Montagnania et.al., Paper: http://arxiv.org/abs/2512.05063
- 2026-01-21, Deaf and Hard of Hearing Access to Intelligent Personal Assistants: Comparison of Voice-Based Options with an LLM-Powered Touch Interface, Paige S. DeVries et.al., Paper: http://arxiv.org/abs/2601.15209
- 2025-12-09, De novo generation of functional terpene synthases using TpsGPT, Hamsini Ramanathan et.al., Paper: http://arxiv.org/abs/2512.08772
- 2025-12-04, David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design?, Shashwat Shankar et.al., Paper: http://arxiv.org/abs/2512.05073
- 2026-01-23, DataStates-LLM: Scalable Checkpointing for Transformer Models Using Composable State Providers, Avinash Maurya et.al., Paper: http://arxiv.org/abs/2601.16956
- 2026-02-11, DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning, Yicheng Chen et.al., Paper: http://arxiv.org/abs/2602.11089
- 2026-03-07, Data-Driven Hints in Intelligent Tutoring Systems, Sutapa Dey Tithi et.al., Paper: http://arxiv.org/abs/2603.07311
- 2025-11-17, Data Value in the Age of Scaling: Understanding LLM Scaling Dynamics Under Real-Synthetic Data Mixtures, Haohui Wang et.al., Paper: http://arxiv.org/abs/2511.13640
- 2026-02-11, Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning, Dawid J. Kopiczko et.al., Paper: http://arxiv.org/abs/2602.11149
- 2026-01-05, DatBench: Discriminative, Faithful, and Efficient VLM Evaluations, Siddharth Joshi et.al., Paper: http://arxiv.org/abs/2601.02316
- 2025-12-31, DarkEQA: Benchmarking Vision-Language Models for Embodied Question Answering in Low-Light Indoor Environments, Yohan Park et.al., Paper: http://arxiv.org/abs/2512.24985
- 2026-03-16, DUET: Disaggregated Hybrid Mamba-Transformer LLMs with Prefill and Decode-Specific Packages, Alish Kanani et.al., Paper: http://arxiv.org/abs/2603.15530
- 2026-01-22, DTP: A Simple yet Effective Distracting Token Pruning Framework for Vision-Language Action Models, Chenyang Li et.al., Paper: http://arxiv.org/abs/2601.16065
- 2025-10-21, DSI-Bench: A Benchmark for Dynamic Spatial Intelligence, Ziang Zhang et.al., Paper: http://arxiv.org/abs/2510.18873
- 2025-11-26, DSD: A Distributed Speculative Decoding Solution for Edge-Cloud Agile Large Model Serving, Fengze Yu et.al., Paper: http://arxiv.org/abs/2511.21669
- 2026-02-05, DSB: Dynamic Sliding Block Scheduling for Diffusion LLMs, Lizhuo Luo et.al., Paper: http://arxiv.org/abs/2602.05992
- 2025-11-24, DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research, Rulin Shao et.al., Paper: http://arxiv.org/abs/2511.19399
- 2026-01-14, DPWriter: Reinforcement Learning with Diverse Planning Branching for Creative Writing, Qian Cao et.al., Paper: http://arxiv.org/abs/2601.09609
- 2026-02-16, DM0: An Embodied-Native Vision-Language-Action Model towards Physical AI, En Yu et.al., Paper: http://arxiv.org/abs/2602.14974
- 2025-12-03, DIQ-H: Evaluating Hallucination Persistence in VLMs Under Temporal Visual Degradation, Zexin Lin et.al., Paper: http://arxiv.org/abs/2512.03992
- 2026-01-06, DIP: Dynamic In-Context Planner For Diffusion Language Models, Yang Li et.al., Paper: http://arxiv.org/abs/2601.03199
- 2026-03-31, DIAL: Decoupling Intent and Action via Latent World Modeling for End-to-End VLA, Yi Chen et.al., Paper: http://arxiv.org/abs/2603.29844
- 2026-02-05, DFlash: Block Diffusion for Flash Speculative Decoding, Jian Chen et.al., Paper: http://arxiv.org/abs/2602.06036
- 2025-11-13, DESS: DeBERTa Enhanced Syntactic-Semantic Aspect Sentiment Triplet Extraction, Vishal Thenuwara et.al., Paper: http://arxiv.org/abs/2511.10577
- 2025-12-19, DEER: A Comprehensive and Reliable Benchmark for Deep-Research Expert Reports, Janghoon Han et.al., Paper: http://arxiv.org/abs/2512.17776
- 2025-11-28, DEAL-300K: Diffusion-based Editing Area Localization with a 300K-Scale Dataset and Frequency-Prompted Baseline, Rui Zhang et.al., Paper: http://arxiv.org/abs/2511.23377
- 2026-02-06, DAWN: Dependency-Aware Fast Inference for Diffusion LLMs, Lizhuo Luo et.al., Paper: http://arxiv.org/abs/2602.06953
- 2025-10-21, DART: A Structured Dataset of Regulatory Drug Documents in Italian for Clinical NLP, Mariano Barone et.al., Paper: http://arxiv.org/abs/2510.18475
- 2026-02-27, DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science, Fan Shu et.al., Paper: http://arxiv.org/abs/2602.24288
- 2026-03-20, DALI: LLM-Agent Enhanced Dual-Stream Adaptive Leadership Identification for Group Recommendations, Boxun Song et.al., Paper: http://arxiv.org/abs/2603.19909
- 2026-01-09, CyberGFM: Graph Foundation Models for Lateral Movement Detection in Enterprise Networks, Isaiah J. King et.al., Paper: http://arxiv.org/abs/2601.05988
- 2026-01-08, Cutting AI Research Costs: How Task-Aware Compression Makes Large Language Model Agents Affordable, Zuhair Ahmed Khan Taha et.al., Paper: http://arxiv.org/abs/2601.05191
- 2026-03-24, Curriculum-Driven 3D CT Report Generation via Language-Free Visual Grafting and Zone-Constrained Compression, V. K. Cody Bumgardner et.al., Paper: http://arxiv.org/abs/2603.23308
- 2025-11-04, Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs, Georgios Tzannetos et.al., Paper: http://arxiv.org/abs/2511.02690
- 2026-03-20, Current LLMs still cannot ‘talk much’ about grammar modules: Evidence from syntax, Mohammed Q. Shormani et.al., Paper: http://arxiv.org/abs/2603.20114
- 2026-03-19, Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens, Yuqing Wang et.al., Paper: http://arxiv.org/abs/2603.19232
- 2025-12-23, Cube Bench: A Benchmark for Spatial Visual Reasoning in MLLMs, Dhruv Anand et.al., Paper: http://arxiv.org/abs/2512.20595
- 2025-11-17, Crossing Borders: A Multimodal Challenge for Indian Poetry Translation and Image Generation, Sofia Jamil et.al., Paper: http://arxiv.org/abs/2511.13689
- 2026-03-12, CrossEarth-SAR: A SAR-Centric and Billion-Scale Geospatial Foundation Model for Domain Generalizable Semantic Segmentation, Ziqi Ye et.al., Paper: http://arxiv.org/abs/2603.12008
- 2025-12-12, Cross-modal Context-aware Learning for Visual Prompt Guided Multimodal Image Understanding in Remote Sensing, Xu Zhang et.al., Paper: http://arxiv.org/abs/2512.11680
- 2026-03-25, Cross-Modal Prototype Alignment and Mixing for Training-Free Few-Shot Classification, Dipam Goswami et.al., Paper: http://arxiv.org/abs/2603.24528
- 2026-03-12, Cross-Context Review: Improving LLM Output Quality by Separating Production and Review Sessions, Tae-Eun Song et.al., Paper: http://arxiv.org/abs/2603.12123
- 2026-02-17, CrispEdit: Low-Curvature Projections for Scalable Non-Destructive LLM Editing, Zarif Ikram et.al., Paper: http://arxiv.org/abs/2602.15823
- 2026-02-18, Creating a digital poet, Vered Tohar et.al., Paper: http://arxiv.org/abs/2602.16578
- 2025-11-17, CreBench: Human-Aligned Creativity Evaluation from Idea to Process to Product, Kaiwen Xue et.al., Paper: http://arxiv.org/abs/2511.13626
- 2025-10-21, CovMatch: Cross-Covariance Guided Multimodal Dataset Distillation with Trainable Text Encoder, Yongmin Lee et.al., Paper: http://arxiv.org/abs/2510.18583
- 2025-11-21, Counterfactual World Models via Digital Twin-conditioned Video Diffusion, Yiqing Shen et.al., Paper: http://arxiv.org/abs/2511.17481
- 2025-10-21, Counterfactual Reasoning for Steerable Pluralistic Value Alignment of Large Language Models, Hanze Guo et.al., Paper: http://arxiv.org/abs/2510.18526
- 2026-02-16, Counterfactual Fairness Evaluation of LLM-Based Contact Center Agent Quality Assurance System, Kawin Mayilvaghanan et.al., Paper: http://arxiv.org/abs/2602.14970
- 2025-11-04, CostBench: Evaluating Multi-Turn Cost-Optimal Planning and Adaptation in Dynamic Environments for LLM Tool-Use Agents, Jiayu Liu et.al., Paper: http://arxiv.org/abs/2511.02734
- 2026-02-05, Correctness-Optimized Residual Activation Lens (CORAL): Transferrable and Calibration-Aware Inference-Time Steering, Miranda Muqing Miao et.al., Paper: http://arxiv.org/abs/2602.06022
- 2025-12-17, Corrective Diffusion Language Models, Shuibai Zhang et.al., Paper: http://arxiv.org/abs/2512.15596
- 2025-11-25, Copyright Detection in Large Language Models: An Ethical Approach to Generative AI Development, David Szczecina et.al., Paper: http://arxiv.org/abs/2511.20623
- 2025-11-24, Cook and Clean Together: Teaching Embodied Agents for Parallel Task Execution, Dingkang Liang et.al., Paper: http://arxiv.org/abs/2511.19430
- 2026-03-03, Conversational Learning Diagnosis via Reasoning Multi-Turn Interactive Learning, Fangzhou Yao et.al., Paper: http://arxiv.org/abs/2603.03236
- 2026-02-11, Conversational Behavior Modeling Foundation Model With Multi-Level Perception, Dingkun Zhou et.al., Paper: http://arxiv.org/abs/2602.11065
- 2025-11-10, ConvFill: Model Collaboration for Responsive Conversational Voice Agents, Vidya Srinivas et.al., Paper: http://arxiv.org/abs/2511.07397
- 2025-11-04, Controlling Performance and Budget of a Centralized Multi-agent LLM System with Reinforcement Learning, Bowen Jin et.al., Paper: http://arxiv.org/abs/2511.02755
- 2026-01-22, Controlling Long-Horizon Behavior in Language Model Agents with Explicit State Dynamics, Sukesh Subaharan et.al., Paper: http://arxiv.org/abs/2601.16087
- 2026-01-12, Contrastive Learning with Narrative Twins for Modeling Story Salience, Igor Sterner et.al., Paper: http://arxiv.org/abs/2601.07765
- 2025-10-31, Continuous Autoregressive Language Models, Chenze Shao et.al., Paper: http://arxiv.org/abs/2510.27688
- 2026-01-09, Continual-learning for Modelling Low-Resource Languages from Large Language Models, Santosh Srinath K et.al., Paper: http://arxiv.org/abs/2601.05874
- 2026-03-12, Continual Learning with Vision-Language Models via Semantic-Geometry Preservation, Chiyuan He et.al., Paper: http://arxiv.org/abs/2603.12055
- 2025-12-02, Contextual Image Attack: How Visual Context Exposes Multimodal Safety Vulnerabilities, Yuan Xiong et.al., Paper: http://arxiv.org/abs/2512.02973
- 2026-01-07, ContextFocus: Activation Steering for Contextual Faithfulness in Large Language Models, Nikhil Anand et.al., Paper: http://arxiv.org/abs/2601.04131
- 2026-03-31, ContextClaim: A Context-Driven Paradigm for Verifiable Claim Detection, Yufeng Li et.al., Paper: http://arxiv.org/abs/2603.30025
- 2025-12-31, Context-aware LLM-based AI Agents for Human-centered Energy Management Systems in Smart Buildings, Tianzhi He et.al., Paper: http://arxiv.org/abs/2512.25055
- 2025-11-14, Context-aware Adaptive Visualizations for Critical Decision Making, Angela Lopez-Cardona et.al., Paper: http://arxiv.org/abs/2511.11476
- 2026-02-20, Context-Aware Mapping of 2D Drawing Annotations to 3D CAD Features Using LLM-Assisted Reasoning for Manufacturing Automation, Muhammad Tayyab Khana et.al., Paper: http://arxiv.org/abs/2602.18296
- 2025-12-26, Context-Aware Intelligent Chatbot Framework Leveraging Mobile Sensing, Ziyan Zhang et.al., Paper: http://arxiv.org/abs/2512.22032
- 2026-01-09, Context-Aware Decoding for Faithful Vision-Language Generation, Mehrdad Fazli et.al., Paper: http://arxiv.org/abs/2601.05939
- 2026-01-28, Context-Augmented Code Generation Using Programming Knowledge Graphs, Shahd Seddik et.al., Paper: http://arxiv.org/abs/2601.20810
- 2025-12-26, Context as a Tool: Context Management for Long-Horizon SWE-Agents, Shukai Liu et.al., Paper: http://arxiv.org/abs/2512.22087
- 2026-02-03, Context Compression via Explicit Information Transmission, Jiangnan Ye et.al., Paper: http://arxiv.org/abs/2602.03784
- 2026-03-22, Consistent but Dangerous: Per-Sample Safety Classification Reveals False Reliability in Medical Vision-Language Models, Binesh Sadanandan et.al., Paper: http://arxiv.org/abs/2603.20985
- 2025-12-01, Consistent Synthetic Sequences Unlock Structural Diversity in Fully Atomistic De Novo Protein Design, Danny Reidenbach et.al., Paper: http://arxiv.org/abs/2512.01976
- 2026-02-13, Consistency of Large Reasoning Models Under Multi-Turn Attacks, Yubo Li et.al., Paper: http://arxiv.org/abs/2602.13093
- 2025-11-10, Consistency Is Not Always Correct: Towards Understanding the Role of Exploration in Post-Training Reasoning, Dake Bu et.al., Paper: http://arxiv.org/abs/2511.07368
- 2025-11-07, Connectomics Informed by Large Language Models, Elinor Thompson et.al., Paper: http://arxiv.org/abs/2511.05383
- 2026-02-03, Conformal Thinking: Risk Control for Reasoning on a Compute Budget, Xi Wang et.al., Paper: http://arxiv.org/abs/2602.03814
- 2026-03-24, Conformal Cross-Modal Active Learning, Huy Hoang Nguyen et.al., Paper: http://arxiv.org/abs/2603.23159
- 2026-02-25, Confidence-Driven Multi-Scale Model Selection for Cost-Efficient Inference, Bo-Wei Chen et.al., Paper: http://arxiv.org/abs/2602.22090
- 2025-12-19, Confidence-Credibility Aware Weighted Ensembles of Small LLMs Outperform Large LLMs in Emotion Detection, Menna Elgabry et.al., Paper: http://arxiv.org/abs/2512.17630
- 2026-03-23, Confidence-Based Decoding is Provably Efficient for Diffusion Language Models, Changxiao Cai et.al., Paper: http://arxiv.org/abs/2603.22248
- 2026-01-05, Confidence Estimation for LLMs in Multi-turn Interactions, Caiqi Zhang et.al., Paper: http://arxiv.org/abs/2601.02179
- 2026-03-24, ConceptCoder: Improve Code Reasoning via Concept Learning, Md Mahbubur Rahman et.al., Paper: http://arxiv.org/abs/2603.23470
- 2025-11-25, Concept-Aware Batch Sampling Improves Language-Image Pretraining, Adhiraj Ghosh et.al., Paper: http://arxiv.org/abs/2511.20643
- 2026-02-16, Concept Influence: Leveraging Interpretability to Improve Performance and Efficiency in Training Data Attribution, Matthew Kowal et.al., Paper: http://arxiv.org/abs/2602.14869
- 2025-11-07, ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations, Amr Gomaa et.al., Paper: http://arxiv.org/abs/2511.05359
- 2025-11-04, ConMeZO: Adaptive Descent-Direction Sampling for Gradient-Free Finetuning of Large Language Models, Lejs Deen Behric et.al., Paper: http://arxiv.org/abs/2511.02757
- 2026-03-18, ConGA: Guidelines for Contextual Gender Annotation. A Framework for Annotating Gender in Machine Translation, Argentina Anna Rescigno et.al., Paper: http://arxiv.org/abs/2603.17962
- 2025-11-13, Computing the Formal and Institutional Boundaries of Contemporary Genre and Literary Fiction, Natasha Johnson et.al., Paper: http://arxiv.org/abs/2511.10546
- 2025-11-19, Computer-Use Agents as Judges for Generative User Interface, Kevin Qinghong Lin et.al., Paper: http://arxiv.org/abs/2511.15567
- 2025-12-11, Computational emotion analysis with multimodal LLMs: Current evidence on an emerging methodological opportunity, Hauke Licht et.al., Paper: http://arxiv.org/abs/2512.10882
- 2026-01-28, Compression Tells Intelligence: Visual Coding, Visual Token Technology, and the Unification, Xin Jin et.al., Paper: http://arxiv.org/abs/2601.20742
- 2026-01-27, Component-Aware Pruning Framework for Neural Network Controllers via Gradient-Based Importance Estimation, Ganesh Sundaram et.al., Paper: http://arxiv.org/abs/2601.19794
- 2026-03-31, Compiling Code LLMs into Lightweight Executables, Jieke Shi et.al., Paper: http://arxiv.org/abs/2603.29813
- 2026-03-10, Compartmentalization-Aware Automated Program Repair, Jia Hu et.al., Paper: http://arxiv.org/abs/2603.09544
- 2025-11-20, Comparison of Text-Based and Image-Based Retrieval in Multimodal Retrieval Augmented Generation Large Language Model Systems, Elias Lumer et.al., Paper: http://arxiv.org/abs/2511.16654
- 2025-12-15, Comparative Analysis of LLM Abliteration Methods: A Cross-Architecture Evaluation, Richard J. Young et.al., Paper: http://arxiv.org/abs/2512.13655
- 2025-10-29, Communication and Verification in LLM Agents towards Collaboration under Information Asymmetry, Run Peng et.al., Paper: http://arxiv.org/abs/2510.25595
- 2025-10-28, ComboBench: Can LLMs Manipulate Physical Devices to Play Virtual Reality Games?, Shuqing Li et.al., Paper: http://arxiv.org/abs/2510.24706
- 2025-10-21, Combining Distantly Supervised Models with In Context Learning for Monolingual and Cross-Lingual Relation Extraction, Vipul Rathore et.al., Paper: http://arxiv.org/abs/2510.18344
- 2025-10-24, ColorEcosystem: Powering Personalized, Standardized, and Trustworthy Agentic Service in massive-agent Ecosystem, Fangwen Wu et.al., Paper: http://arxiv.org/abs/2510.21566
- 2026-03-26, Colon-Bench: An Agentic Workflow for Scalable Dense Lesion Annotation in Full-Procedure Colonoscopy Videos, Abdullah Hamdi et.al., Paper: http://arxiv.org/abs/2603.25645
- 2025-11-14, Collaborative Representation Learning for Alignment of Tactile, Language, and Vision Modalities, Yiyun Zhou et.al., Paper: http://arxiv.org/abs/2511.11512
- 2026-03-31, Cold-Starts in Generative Recommendation: A Reproducibility Study, Zhen Zhang et.al., Paper: http://arxiv.org/abs/2603.29845
- 2025-12-23, Coherence in the brain unfolds across separable temporal regimes, Davide Stauba et.al., Paper: http://arxiv.org/abs/2512.20481
- 2025-11-20, Cognitive Foundations for Reasoning and Their Manifestation in LLMs, Priyanka Kargupta et.al., Paper: http://arxiv.org/abs/2511.16660
- 2026-01-14, CogRail: Benchmarking VLMs in Cognitive Intrusion Perception for Intelligent Railway Transportation Systems, Yonglin Tian et.al., Paper: http://arxiv.org/abs/2601.09613
- 2025-11-20, Codec2Vec: Self-Supervised Speech Representation Learning Using Neural Speech Codecs, Wei-Cheng Tseng et.al., Paper: http://arxiv.org/abs/2511.16639
- 2025-10-21, CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment, Xue Jiang et.al., Paper: http://arxiv.org/abs/2510.18471
- 2025-10-31, CodeAlignBench: Assessing Code Generation Models on Developer-Preferred Code Adjustments, Forough Mehralian et.al., Paper: http://arxiv.org/abs/2510.27565
- 2026-03-03, Code2Math: Can Your Code Agent Effectively Evolve Math Problems Through Exploration?, Dadi Guo et.al., Paper: http://arxiv.org/abs/2603.03202
- 2025-11-07, Code Review Automation using Retrieval Augmented Generation, Qianru Meng et.al., Paper: http://arxiv.org/abs/2511.05302
- 2026-03-22, CoVFT: Context-aware Visual Fine-tuning for Multimodal Large Language Models, Nan Zhou et.al., Paper: http://arxiv.org/abs/2603.21077
- 2026-01-08, CoV: Chain-of-View Prompting for Spatial Reasoning, Haoyu Zhao et.al., Paper: http://arxiv.org/abs/2601.05172
- 2026-02-04, CoT is Not the Chain of Truth: An Empirical Internal Analysis of Reasoning LLMs for Fake News Generation, Zhao Tong et.al., Paper: http://arxiv.org/abs/2602.04856
- 2026-02-09, CoRefine: Confidence-Guided Self-Refinement for Adaptive Test-Time Compute, Chen Jin et.al., Paper: http://arxiv.org/abs/2602.08948
- 2026-02-13, CoPE-VideoLM: Codec Primitives For Efficient Video Language Models, Sayan Deb Sarkar et.al., Paper: http://arxiv.org/abs/2602.13191
- 2026-03-09, CoCo: Code as CoT for Text-to-Image Preview and Rare Concept Generation, Haodong Li et.al., Paper: http://arxiv.org/abs/2603.08652
- 2025-12-29, Close the Loop: Synthesizing Infinite Tool-Use Data via Multi-Agent Role-Playing, Yuwen Li et.al., Paper: http://arxiv.org/abs/2512.23611
- 2026-03-02, ClinConsensus: A Consensus-Based Benchmark for Evaluating Chinese Medical LLMs across Difficulty Levels, Xiang Zheng et.al., Paper: http://arxiv.org/abs/2603.02097
- 2025-11-07, Cleaning Maintenance Logs with LLM Agents for Improved Predictive Maintenance, Valeriu Dimidov et.al., Paper: http://arxiv.org/abs/2511.05311
- 2025-10-22, Class-Aware Prototype Learning with Negative Contrast for Test-Time Adaptation of Vision-Language Models, Xiaozhen Qiao et.al., Paper: http://arxiv.org/abs/2510.19802
- 2026-02-18, CitiLink-Summ: Summarization of Discussion Subjects in European Portuguese Municipal Meeting Minutes, Miguel Marques et.al., Paper: http://arxiv.org/abs/2602.16607
- 2025-10-21, CircuitSeer: Mining High-Quality Data by Probing Mathematical Reasoning Circuits in LLMs, Shaobo Wang et.al., Paper: http://arxiv.org/abs/2510.18470
- 2025-12-10, ChronusOmni: Improving Time Awareness of Omni Large Language Models, Yijing Chen et.al., Paper: http://arxiv.org/abs/2512.09841
- 2026-03-17, Chronos: Temporal-Aware Conversational Agents with Structured Event Retrieval for Long-Term Memory, Sahil Sen et.al., Paper: http://arxiv.org/abs/2603.16862
- 2026-02-11, Chatting with Images for Introspective Visual Thinking, Junfei Wu et.al., Paper: http://arxiv.org/abs/2602.11073
- 2025-12-23, ChatGPT: Excellent Paper! Accept It. Editor: Imposter Found! Review Rejected, Kanchon Gharami et.al., Paper: http://arxiv.org/abs/2512.20405
- 2026-02-17, ChartEditBench: Evaluating Grounded Multi-Turn Chart Editing in Multimodal Language Models, Manav Nitin Kapadnis et.al., Paper: http://arxiv.org/abs/2602.15758
- 2025-10-30, ChartAB: A Benchmark for Chart Grounding & Dense Alignment, Aniruddh Bansal et.al., Paper: http://arxiv.org/abs/2510.26781
- 2025-12-17, Characterizing Mamba’s Selective Memory using Auto-Encoders, Tamanna Hossain et.al., Paper: http://arxiv.org/abs/2512.15653
- 2025-11-24, Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens, Yiming Qin et.al., Paper: http://arxiv.org/abs/2511.19418
- 2026-02-11, Chain-of-Look Spatial Reasoning for Dense Surgical Instrument Counting, Rishikesh Bhyri et.al., Paper: http://arxiv.org/abs/2602.11024
- 2025-12-01, Chain-of-Ground: Improving GUI Grounding via Iterative Reasoning and Reference Feedback, Aiden Yiliu Li et.al., Paper: http://arxiv.org/abs/2512.01979
- 2025-10-21, Chain-of-Conceptual-Thought: Eliciting the Agent to Deeply Think within the Response, Qingqing Gu et.al., Paper: http://arxiv.org/abs/2510.18434
- 2025-12-23, Chain-of-Anomaly Thoughts with Large Vision-Language Models, Pedro Domingos et.al., Paper: http://arxiv.org/abs/2512.20417
- 2026-03-05, Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation, Helena Casademunt et.al., Paper: http://arxiv.org/abs/2603.05494
- 2026-03-23, CayleyPy-4: AI-Holography. Towards analogs of holographic string dualities for AI tasks, A. Chervov et.al., Paper: http://arxiv.org/abs/2603.22195
- 2026-02-18, Causality is Key for Interpretability Claims to Generalise, Shruti Joshi et.al., Paper: http://arxiv.org/abs/2602.16698
- 2026-03-04, Causality Elicitation from Large Language Models, Takashi Kameyama et.al., Paper: http://arxiv.org/abs/2603.04276
- 2026-02-23, CausalFlip: A Benchmark for LLM Causal Judgment Beyond Semantic Matching, Yuzhe Wang et.al., Paper: http://arxiv.org/abs/2602.20094
- 2026-02-17, Causal Effect Estimation with Latent Textual Treatments, Omri Feldman et.al., Paper: http://arxiv.org/abs/2602.15730
- 2026-02-19, Catastrophic Forgetting Resilient One-Shot Incremental Federated Learning, Obaidullah Zaland et.al., Paper: http://arxiv.org/abs/2602.17625
- 2025-12-24, Casting a SPELL: Sentence Pairing Exploration for LLM Limitation-breaking, Yifan Huang et.al., Paper: http://arxiv.org/abs/2512.21236
- 2026-03-12, Cascade: Composing Software-Hardware Attack Gadgets for Adversarial Threat Amplification in Compound AI Systems, Sarbartha Banerjee et.al., Paper: http://arxiv.org/abs/2603.12023
- 2026-03-17, Capability-Guided Compression: Toward Interpretability-Aware Budget Allocation for Large Language Models, Rishaank Gupta et.al., Paper: http://arxiv.org/abs/2603.16440
- 2026-02-20, CapNav: Benchmarking Vision Language Models on Capability-conditioned Indoor Navigation, Xia Su et.al., Paper: http://arxiv.org/abs/2602.18424
- 2026-03-22, Can we automatize scientific discovery in the cognitive sciences?, Akshay K. Jagadish et.al., Paper: http://arxiv.org/abs/2603.20988
- 2026-02-05, Can vision language models learn intuitive physics from interaction?, Luca M. Schulze Buschoff et.al., Paper: http://arxiv.org/abs/2602.06033
- 2026-03-24, Can an LLM Detect Instances of Microservice Infrastructure Patterns?, Carlos Eduardo Duarte et.al., Paper: http://arxiv.org/abs/2603.23073
- 2026-02-23, Can You Tell It’s AI? Human Perception of Synthetic Voices in Vishing Scenarios, Zoha Hayat Bhatti et.al., Paper: http://arxiv.org/abs/2602.20061
- 2025-11-25, Can Vibe Coding Beat Graduate CS Students? An LLM vs. Human Coding Tournament on Market-driven Strategic Planning, Panayiotis Danassis et.al., Paper: http://arxiv.org/abs/2511.20613
- 2025-12-09, Can TabPFN Compete with GNNs for Node Classification via Graph Tabularization?, Jeongwhan Choi et.al., Paper: http://arxiv.org/abs/2512.08798
- 2026-02-11, Can Large Language Models Make Everyone Happy?, Usman Naseem et.al., Paper: http://arxiv.org/abs/2602.11091
- 2026-03-08, Can Large Language Models Keep Up? Benchmarking Online Adaptation to Continual Knowledge Streams, Jiyeon Kim et.al., Paper: http://arxiv.org/abs/2603.07392
- 2026-03-24, Can Language Models Pass Software Testing Certification Exams? a case study, Fitash Ul Haq et.al., Paper: http://arxiv.org/abs/2603.23142
- 2025-11-04, Can LLMs subtract numbers?, Mayank Jobanputra et.al., Paper: http://arxiv.org/abs/2511.02795
- 2025-12-23, Can LLMs Predict Their Own Failures? Self-Awareness via Internal Circuits, Amirhosein Ghasemabadi et.al., Paper: http://arxiv.org/abs/2512.20578
- 2026-03-16, Can LLMs Model Incorrect Student Reasoning? A Case Study on Distractor Generation, Yanick Zengaffinen et.al., Paper: http://arxiv.org/abs/2603.15547
- 2025-12-17, Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning, Zhenwen Liang et.al., Paper: http://arxiv.org/abs/2512.15687
- 2026-02-24, Can Interest-Bearing Positions Solve the Long-Horizon Problem in Prediction Markets?, Caleb Maresca et.al., Paper: http://arxiv.org/abs/2602.21091
- 2026-03-13, Can Fairness Be Prompted? Prompt-Based Debiasing Strategies in High-Stakes Recommendations, Mihaela Rotar et.al., Paper: http://arxiv.org/abs/2603.12935
- 2026-03-31, Can Commercial LLMs Be Parliamentary Political Companions? Comparing LLM Reasoning Against Romanian Legislative Expuneri de Motive, Iulian Lucău et.al., Paper: http://arxiv.org/abs/2603.30028
- 2026-01-09, Can AI mediation improve democratic deliberation?, Michael Henry Tessler et.al., Paper: http://arxiv.org/abs/2601.05904
- 2025-12-29, Can AI Recognize Its Own Reflection? Self-Detection Performance of LLMs in Computing Education, Christopher Burger et.al., Paper: http://arxiv.org/abs/2512.23587
- 2026-03-18, Caging the Agents: A Zero Trust Security Architecture for Autonomous AI in Healthcare, Saikat Maiti et.al., Paper: http://arxiv.org/abs/2603.17419
- 2025-11-17, CacheFlow: Compressive Streaming Memory for Efficient Long-Form Video Understanding, Shrenik Patel et.al., Paper: http://arxiv.org/abs/2511.13644
- 2026-03-06, CaTok: Taming Mean Flows for One-Dimensional Causal Image Tokenization, Yitong Chen et.al., Paper: http://arxiv.org/abs/2603.06449
- 2026-02-26, CXReasonAgent: Evidence-Grounded Diagnostic Reasoning Agent for Chest X-rays, Hyungyung Lee et.al., Paper: http://arxiv.org/abs/2602.23276
- 2026-03-22, CVT-Bench: Counterfactual Viewpoint Transformations Reveal Unstable Spatial Representations in Multimodal LLMs, Shanmukha Vellamcheti et.al., Paper: http://arxiv.org/abs/2603.21114
- 2025-11-14, CVChess: A Deep Learning Framework for Converting Chessboard Images to Forsyth-Edwards Notation, Luthira Abeykoon et.al., Paper: http://arxiv.org/abs/2511.11522
- 2026-02-27, CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation, Weinan Dai et.al., Paper: http://arxiv.org/abs/2602.24286
- 2025-10-21, CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using Agent, Haojia Lin et.al., Paper: http://arxiv.org/abs/2510.18596
- 2026-01-02, CRoPS: A Training-Free Hallucination Mitigation Framework for Vision-Language Models, Neeraj Anand et.al., Paper: http://arxiv.org/abs/2601.00659
- 2026-01-20, CREATE: Cross-Layer Resilience Characterization and Optimization for Efficient yet Reliable Embodied AI Systems, Tong Xie et.al., Paper: http://arxiv.org/abs/2601.14140
- 2025-12-23, CRAFT: Continuous Reasoning and Agentic Feedback Tuning for Multimodal Text-to-Image Generation, V. Kovalev et.al., Paper: http://arxiv.org/abs/2512.20362
- 2025-12-31, CPJ: Explainable Agricultural Pest Diagnosis via Caption-Prompt-Judge with LLM-Judged Refinement, Wentao Zhang et.al., Paper: http://arxiv.org/abs/2512.24947
- 2026-01-05, CORE: Code-based Inverse Self-Training Framework with Graph Expansion for Virtual Agents, Keyu Wang et.al., Paper: http://arxiv.org/abs/2601.02201
- 2026-03-06, COLD-Steer: Steering Large Language Models via In-Context One-step Learning Dynamics, Kartik Sharma et.al., Paper: http://arxiv.org/abs/2603.06495
- 2026-03-14, CMHL: Contrastive Multi-Head Learning for Emotionally Consistent Text Classification, Menna Elgabry et.al., Paper: http://arxiv.org/abs/2603.14078
- 2026-01-09, CLewR: Curriculum Learning with Restarts for Machine Translation Preference Learning, Alexandra Dragomir et.al., Paper: http://arxiv.org/abs/2601.05858
- 2026-03-22, CLT-Forge: A Scalable Library for Cross-Layer Transcoders and Attribution Graphs, Florent Draye et.al., Paper: http://arxiv.org/abs/2603.21014
- 2026-03-12, CLASP: Defending Hybrid Large Language Models Against Hidden State Poisoning Attacks, Alexandre Le Mercier et.al., Paper: http://arxiv.org/abs/2603.12206
- 2025-10-21, CLASP: Cost-Optimized LLM-based Agentic System for Phishing Detection, Fouad Trad et.al., Paper: http://arxiv.org/abs/2510.18585
- 2026-03-26, CLAR: CIF-Localized Alignment for Retrieval-Augmented Speech LLM-Based Contextual ASR, Shangkun Huang et.al., Paper: http://arxiv.org/abs/2603.25460
- 2026-02-09, CIC-Trap4Phish: A Unified Multi-Format Dataset for Phishing and Quishing Attachment Detection, Fatemeh Nejati et.al., Paper: http://arxiv.org/abs/2602.09015
- 2026-03-12, CHiL(L)Grader: Calibrated Human-in-the-Loop Short-Answer Grading, Pranav Raikote et.al., Paper: http://arxiv.org/abs/2603.11957
- 2025-10-21, CEFR-Annotated WordNet: LLM-Based Proficiency-Guided Semantic Database for Language Learning, Masato Kikuchi et.al., Paper: http://arxiv.org/abs/2510.18466
- 2026-01-05, CD4LM: Consistency Distillation and aDaptive Decoding for Diffusion Language Models, Yihao Liang et.al., Paper: http://arxiv.org/abs/2601.02236
- 2025-12-22, CASA: Cross-Attention via Self-Attention for Efficient Vision-Language Fusion, Moritz Böhle et.al., Paper: http://arxiv.org/abs/2512.19535
- 2026-04-01, CARE: Privacy-Compliant Agentic Reasoning with Evidence Discordance, Haochen Liu et.al., Paper: http://arxiv.org/abs/2604.01113
- 2026-02-10, CAPID: Context-Aware PII Detection for Question-Answering Systems, Mariia Ponomarenko et.al., Paper: http://arxiv.org/abs/2602.10074
- 2026-03-04, CAMMSR: Category-Guided Attentive Mixture of Experts for Multimodal Sequential Recommendation, Jinfeng Xu et.al., Paper: http://arxiv.org/abs/2603.04320
- 2025-12-02, CAMEO: Correspondence-Attention Alignment for Multi-View Diffusion Models, Minkyung Kwon et.al., Paper: http://arxiv.org/abs/2512.03045
- 2025-11-10, C3PO: Optimized Large Language Model Cascades with Probabilistic Cost Constraints for Reasoning, Antonios Valkanas et.al., Paper: http://arxiv.org/abs/2511.07396
- 2025-12-24, C2LLM Technical Report: A New Frontier in Code Retrieval via Adaptive Cross-Attention Pooling, Jin Qin et.al., Paper: http://arxiv.org/abs/2512.21332
- 2025-12-16, C-ing Clearly: Enhanced Binary Code Explanations using C code, Teodor Poncu et.al., Paper: http://arxiv.org/abs/2512.14500
- 2026-03-31, C-TRAIL: A Commonsense World Framework for Trajectory Planning in Autonomous Driving, Zhihong Cui et.al., Paper: http://arxiv.org/abs/2603.29908
- 2025-11-13, Bytes of a Feather: Personality and Opinion Alignment Effects in Human-AI Interaction, Maximilian Eder et.al., Paper: http://arxiv.org/abs/2511.10544
- 2026-02-13, Buy versus Build an LLM: A Decision Framework for Governments, Jiahao Lu et.al., Paper: http://arxiv.org/abs/2602.13033
- 2026-02-13, Bus-Conditioned Zero-Shot Trajectory Generation via Task Arithmetic, Shuai Liu et.al., Paper: http://arxiv.org/abs/2602.13071
- 2025-10-21, Building Trust in Clinical LLMs: Bias Analysis and Dataset Transparency, Svetlana Maslenkova et.al., Paper: http://arxiv.org/abs/2510.18556
- 2026-01-16, Building Production-Ready Probes For Gemini, János Kramár et.al., Paper: http://arxiv.org/abs/2601.11516
- 2025-12-26, Broken Words, Broken Performance: Effect of Tokenization on Performance of LLMs, Sachin Pawar et.al., Paper: http://arxiv.org/abs/2512.21933
- 2026-04-02, Brief Is Better: Non-Monotonic Chain-of-Thought Budget Effects in Function-Calling Language Agents, Xuan Qi et.al., Paper: http://arxiv.org/abs/2604.02155
- 2026-01-07, Bridging the Discrete-Continuous Gap: Unified Multimodal Generation via Coupled Manifold Discrete Absorbing Diffusion, Yuanfeng Xu et.al., Paper: http://arxiv.org/abs/2601.04056
- 2025-11-20, Bridging VLMs and Embodied Intelligence with Deliberate Practice Policy Optimization, Yi Zhang et.al., Paper: http://arxiv.org/abs/2511.16602
- 2025-12-12, Bridging Streaming Continual Learning via In-Context Large Tabular Models, Afonso Lourenço et.al., Paper: http://arxiv.org/abs/2512.11668
- 2026-02-03, Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation, Ziru Chen et.al., Paper: http://arxiv.org/abs/2602.03806
- 2026-03-19, Bridging Network Fragmentation: A Semantic-Augmented DRL Framework for UAV-aided VANETs, Gaoxiang Cao et.al., Paper: http://arxiv.org/abs/2603.18871
- 2026-03-16, Bridging Local and Global Knowledge: Cascaded Mixture-of-Experts Learning for Near-Shortest Path Routing, Yung-Fu Chen et.al., Paper: http://arxiv.org/abs/2603.15541
- 2025-11-14, Bridging Hidden States in Vision-Language Models, Benjamin Fein-Ashley et.al., Paper: http://arxiv.org/abs/2511.11526
- 2026-03-13, Breaking the Tuning Barrier: Zero-Hyperparameters Yield Multi-Corner Analysis Via Learned Priors, Wei W. Xing et.al., Paper: http://arxiv.org/abs/2603.13092
- 2026-02-02, Breaking the Reversal Curse in Autoregressive Language Models via Identity Bridge, Xutao Ma et.al., Paper: http://arxiv.org/abs/2602.02470
- 2026-03-20, Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States, Yurun Yuan et.al., Paper: http://arxiv.org/abs/2603.19987
- 2026-04-01, Brainstacks: Cross-Domain Cognitive Capabilities via Frozen MoE-LoRA Stacks for Continual LLM Learning, Mohammad R. Abu Ayyash et.al., Paper: http://arxiv.org/abs/2604.01152
- 2026-02-25, Brain3D: Brain Report Automation via Inflated Vision Transformers in 3D, Mariano Barone et.al., Paper: http://arxiv.org/abs/2602.22098
- 2025-10-24, Brain-tuning Improves Generalizability and Efficiency of Brain Alignment in Speech Models, Omer Moussa et.al., Paper: http://arxiv.org/abs/2510.21520
- 2025-10-21, BrailleLLM: Braille Instruction Tuning with Large Language Models for Braille Domain Tasks, Tianyuan Huang et.al., Paper: http://arxiv.org/abs/2510.18288
- 2026-03-19, Box Maze: A Process-Control Architecture for Reliable LLM Reasoning, Zou Qiang et.al., Paper: http://arxiv.org/abs/2603.19182
- 2025-12-12, Bounding Hallucinations: Information-Theoretic Guarantees for RAG Systems via Merlin-Arthur Protocols, Björn Deiseroth et.al., Paper: http://arxiv.org/abs/2512.11614
- 2025-12-22, Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies, Yuqiao Tan et.al., Paper: http://arxiv.org/abs/2512.19673
- 2025-12-05, Bootstrapping Fuzzers for Compilers of Low-Resource Language Dialects Using Language Models, Sairam Vaidya et.al., Paper: http://arxiv.org/abs/2512.05887
- 2026-03-09, Boosting MLLM Spatial Reasoning with Geometrically Referenced 3D Scene Representations, Jiangye Yuan et.al., Paper: http://arxiv.org/abs/2603.08592
- 2026-03-25, Boosting Document Parsing Efficiency and Performance with Coarse-to-Fine Visual Processing, Cheng Cui et.al., Paper: http://arxiv.org/abs/2603.24326
- 2025-12-17, Bolmo: Byteifying the Next Generation of Language Models, Benjamin Minixhofer et.al., Paper: http://arxiv.org/abs/2512.15586
- 2025-10-22, Blackbox Model Provenance via Palimpsestic Membership Inference, Rohith Kuditipudi et.al., Paper: http://arxiv.org/abs/2510.19796
- 2025-11-13, Black-Box On-Policy Distillation of Large Language Models, Tianzhu Ye et.al., Paper: http://arxiv.org/abs/2511.10643
- 2026-02-10, Biases in the Blind Spot: Detecting What LLMs Fail to Mention, Iván Arcuschin et.al., Paper: http://arxiv.org/abs/2602.10117
- 2026-01-05, BiPrompt: Bilateral Prompt Optimization for Visual and Textual Debiasing in Vision-Language Models, Sunny Gupta et.al., Paper: http://arxiv.org/abs/2601.02147
- 2026-02-24, Beyond the Star Rating: A Scalable Framework for Aspect-Based Sentiment Analysis Using LLMs and Text Classification, Vishal Patil et.al., Paper: http://arxiv.org/abs/2602.21082
- 2026-03-16, Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models, Xiyu Liu et.al., Paper: http://arxiv.org/abs/2603.15518
- 2025-12-15, Beyond surface form: A pipeline for semantic analysis in Alzheimer’s Disease detection from spontaneous speech, Dylan Phelps et.al., Paper: http://arxiv.org/abs/2512.13685
- 2026-03-20, Beyond detection: cooperative multi-agent reasoning for rapid onboard EO crisis response, Alejandro D. Mousist et.al., Paper: http://arxiv.org/abs/2603.19858
- 2026-03-26, Beyond Via: Analysis and Estimation of the Impact of Large Language Models in Academic Papers, Mingmeng Geng et.al., Paper: http://arxiv.org/abs/2603.25638
- 2026-02-11, Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling, Gongye Liu et.al., Paper: http://arxiv.org/abs/2602.11146
- 2025-11-26, Beyond URLs: Metadata Diversity and Position for Efficient LLM Pretraining, Dongyang Fan et.al., Paper: http://arxiv.org/abs/2511.21613
- 2025-12-16, Beyond Text-to-SQL: Autonomous Research-Driven Database Exploration with DAR, Ostap Vykhopen et.al., Paper: http://arxiv.org/abs/2512.14622
- 2026-04-01, Beyond Symbolic Solving: Multi Chain-of-Thought Voting for Geometric Reasoning in Large Language Models, Md. Abu Bakor Siddique et.al., Paper: http://arxiv.org/abs/2604.00890
- 2025-10-21, Beyond Single Models: Mitigating Multimodal Hallucinations via Adaptive Token Ensemble Decoding, Jinlin Li et.al., Paper: http://arxiv.org/abs/2510.18321
- 2026-03-11, Beyond Sequential Distance: Inter-Modal Distance Invariant Position Encoding, Lin Chen et.al., Paper: http://arxiv.org/abs/2603.10863
- 2026-03-05, Beyond Scattered Acceptance: Fast and Coherent Inference for DLMs via Longest Stable Prefixes, Pengxiang Li et.al., Paper: http://arxiv.org/abs/2603.05454
- 2025-11-17, Beyond SELECT: A Comprehensive Taxonomy-Guided Benchmark for Real-World Text-to-SQL Translation, Hao Wang et.al., Paper: http://arxiv.org/abs/2511.13590
- 2026-03-06, Beyond Rows to Reasoning: Agentic Retrieval for Multimodal Spreadsheet Understanding and Editing, Anmol Gulati et.al., Paper: http://arxiv.org/abs/2603.06503
- 2025-11-24, Beyond Protein Language Models: An Agentic LLM Framework for Mechanistic Enzyme Design, Bruno Jacob et.al., Paper: http://arxiv.org/abs/2511.19423
- 2026-03-24, Beyond Preset Identities: How Agents Form Stances and Boundaries in Generative Societies, Hanzhong Zhang et.al., Paper: http://arxiv.org/abs/2603.23406
- 2026-01-26, Beyond Preferences: Learning Alignment Principles Grounded in Human Reasons and Values, Henry Bell et.al., Paper: http://arxiv.org/abs/2601.18760
- 2026-03-18, Beyond Muon: MUD (MomentUm Decorrelation) for Faster Transformer Training, Ben S. Southworth et.al., Paper: http://arxiv.org/abs/2603.17970
- 2025-11-21, Beyond Multiple Choice: A Hybrid Framework for Unifying Robust Evaluation and Verifiable Reasoning Training, Yesheng Liu et.al., Paper: http://arxiv.org/abs/2511.17405
- 2025-11-17, Beyond Mimicry: Preference Coherence in LLMs, Luhan Mikaelson et.al., Paper: http://arxiv.org/abs/2511.13630
- 2025-12-24, Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models, Li-Zhong Szu-Tu et.al., Paper: http://arxiv.org/abs/2512.21337
- 2026-02-04, Beyond Many-Shot Translation: Scaling In-Context Demonstrations For Low-Resource Machine Translation, Luis Frentzen Salim et.al., Paper: http://arxiv.org/abs/2602.04764
- 2026-03-03, Beyond Language Modeling: An Exploration of Multimodal Pretraining, Shengbang Tong et.al., Paper: http://arxiv.org/abs/2603.03276
- 2025-12-22, Beyond Language Boundaries: Uncovering Programming Language Families for Code Language Models, Shangbo Yun et.al., Paper: http://arxiv.org/abs/2512.19509
- 2026-03-17, Beyond Grading Accuracy: Exploring Alignment of TAs and LLMs, Matthijs Jansen op de Haar et.al., Paper: http://arxiv.org/abs/2603.16357
- 2026-02-27, Beyond Explainable AI (XAI): An Overdue Paradigm Shift and Post-XAI Research Directions, Saleh Afroogh et.al., Paper: http://arxiv.org/abs/2602.24176
- 2025-12-22, Beyond CLIP: Knowledge-Enhanced Multimodal Transformers for Cross-Modal Alignment in Diabetic Retinopathy Diagnosis, Argha Kamal Samanta et.al., Paper: http://arxiv.org/abs/2512.19663
- 2026-03-20, Beyond Accuracy: Towards a Robust Evaluation Methodology for AI Systems for Language Education, James Edgell et.al., Paper: http://arxiv.org/abs/2603.20088
- 2025-11-26, Beyond Accuracy: An Empirical Study of Uncertainty Estimation in Imputation, Zarin Tahia Hossain et.al., Paper: http://arxiv.org/abs/2511.21607
- 2026-03-24, Between Rules and Reality: On the Context Sensitivity of LLM Moral Judgment, Adrian Sauter et.al., Paper: http://arxiv.org/abs/2603.23114
- 2026-03-31, Bethe Ansatz with a Large Language Model, Balázs Pozsgay et.al., Paper: http://arxiv.org/abs/2603.29932
- 2026-04-01, Benchmarking and Mechanistic Analysis of Vision-Language Models for Cross-Depiction Assembly Instruction Alignment, Zhuchenyang Liu et.al., Paper: http://arxiv.org/abs/2604.00913
- 2026-01-12, Benchmarking Small Language Models and Small Reasoning Language Models on System Log Severity Classification, Yahya Masri et.al., Paper: http://arxiv.org/abs/2601.07790
- 2026-03-10, Benchmarking Political Persuasion Risks Across Frontier Large Language Models, Zhongren Chen et.al., Paper: http://arxiv.org/abs/2603.09884
- 2026-01-27, Benchmarking Multimodal Large Language Models for Missing Modality Completion in Product Catalogues, Junchen Fu et.al., Paper: http://arxiv.org/abs/2601.19750
- 2026-01-21, Benchmarking Large Language Models for ABAP Code Generation: An Empirical Study on Iterative Improvement by Compiler Feedback, Stephan Wallraven et.al., Paper: http://arxiv.org/abs/2601.15188
- 2026-03-09, Benchmarking Language Modeling for Lossless Compression of Full-Fidelity Audio, Phillip Long et.al., Paper: http://arxiv.org/abs/2603.08683
- 2025-12-23, Benchmarking LLMs for Predictive Applications in the Intensive Care Units, Chehak Malhotra et.al., Paper: http://arxiv.org/abs/2512.20520
- 2025-12-10, Benchmarking Document Parsers on Mathematical Formula Extraction from PDFs, Pius Horn et.al., Paper: http://arxiv.org/abs/2512.09874
- 2025-11-06, Benchmark Designers Should “Train on the Test Set” to Expose Exploitable Non-Visual Shortcuts, Ellis Brown et.al., Paper: http://arxiv.org/abs/2511.04655
- 2025-11-13, Belief Net: A Filter-Based Framework for Learning Hidden Markov Models from Observations, Reginald Zhiyan Chen et.al., Paper: http://arxiv.org/abs/2511.10571
- 2026-03-17, Behavioral Steering in a 35B MoE Language Model via SAE-Decoded Probe Vectors: One Agency Axis, Not Five Traits, Jia Qing Yap et.al., Paper: http://arxiv.org/abs/2603.16335
- 2026-03-12, BehaviorVLM: Unified Finetuning-Free Behavioral Understanding with Vision-Language Reasoning, Jingyang Ke et.al., Paper: http://arxiv.org/abs/2603.12176
- 2025-12-17, Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary, Xinshun Feng et.al., Paper: http://arxiv.org/abs/2512.15614
- 2026-03-13, Before and After ChatGPT: Revisiting AI-Based Dialogue Systems for Emotional Support, Daeun Lee et.al., Paper: http://arxiv.org/abs/2603.13043
- 2026-03-06, Before You Hand Over the Wheel: Evaluating LLMs for Security Incident Analysis, Sourov Jajodia et.al., Paper: http://arxiv.org/abs/2603.06422
- 2026-01-22, Beat-ssl: Capturing Local ECG Morphology through Heartbeat-level Contrastive Learning with Soft Targets, Muhammad Ilham Rizqyawan et.al., Paper: http://arxiv.org/abs/2601.16147
- 2026-03-19, BeamAgent: LLM-Aided MIMO Beamforming with Decoupled Intent Parsing and Alternating Optimization for Joint Site Selection and Precoding, Xiucheng Wang et.al., Paper: http://arxiv.org/abs/2603.18855
- 2026-01-15, Be Your Own Red Teamer: Safety Alignment via Self-Play and Reflective Experience Replay, Hao Wang et.al., Paper: http://arxiv.org/abs/2601.10589
- 2025-11-24, Be My Eyes: Extending Large Language Models to New Modalities Through Multi-Agent Collaboration, James Y. Huang et.al., Paper: http://arxiv.org/abs/2511.19417
- 2025-10-23, Bayesian Jammer Localization with a Hybrid CNN and Path-Loss Mixture of Experts, Mariona Jaramillo-Civill et.al., Paper: http://arxiv.org/abs/2510.20666
- 2026-02-23, BarrierSteer: LLM Safety via Learning Barrier Steering, Thanh Q. Tran et.al., Paper: http://arxiv.org/abs/2602.20102
- 2025-11-05, BanglaSTEM: A Parallel Corpus for Technical Domain Bangla-English Translation, Kazi Reyazul Hasan et.al., Paper: http://arxiv.org/abs/2511.03498
- 2026-03-18, Baguan-TS: A Sequence-Native In-Context Learning Model for Time Series Forecasting with Covariates, Linxiao Yang et.al., Paper: http://arxiv.org/abs/2603.17439
- 2025-12-11, BabyVLM-V2: Toward Developmentally Grounded Pretraining and Benchmarking of Vision Foundation Models, Shengao Wang et.al., Paper: http://arxiv.org/abs/2512.10932
- 2026-02-23, BabyLM Turns 4: Call for Papers for the 2026 BabyLM Workshop, Leshem Choshen et.al., Paper: http://arxiv.org/abs/2602.20092
- 2026-03-12, BTZSC: A Benchmark for Zero-Shot Text Classification Across Cross-Encoders, Embedding Models, Rerankers and LLMs, Ilias Aarab et.al., Paper: http://arxiv.org/abs/2603.11991
- 2025-12-23, BRIDGE: Budget-aware Reasoning via Intermediate Distillation with Guided Examples, Xuan-An Le et.al., Paper: http://arxiv.org/abs/2512.20403
- 2026-02-16, BPP: Long-Context Robot Imitation Learning by Focusing on Key History Frames, Max Sobol Mark et.al., Paper: http://arxiv.org/abs/2602.15010
- 2025-12-29, BOAD: Discovering Hierarchical Software Engineering Agents via Bandit Optimization, Iris Xu et.al., Paper: http://arxiv.org/abs/2512.23631
- 2026-03-06, BEVLM: Distilling Semantic Knowledge from LLMs into Bird’s-Eye View Representations, Thomas Monninger et.al., Paper: http://arxiv.org/abs/2603.06576
- 2026-03-10, BEACON: Language-Conditioned Navigation Affordance Prediction under Occlusion, Xinyu Gao et.al., Paper: http://arxiv.org/abs/2603.09961
- 2026-03-25, B-MoE: A Body-Part-Aware Mixture-of-Experts “All Parts Matter” Approach to Micro-Action Recognition, Nishit Poddar et.al., Paper: http://arxiv.org/abs/2603.24245
- 2026-02-02, Avenir-Web: Human-Experience-Imitating Multimodal Web Agents with Mixture of Grounding Experts, Aiden Yiliu Li et.al., Paper: http://arxiv.org/abs/2602.02468
- 2025-11-26, Auxiliary Metrics Help Decoding Skill Neurons in the Wild, Yixiu Zhao et.al., Paper: http://arxiv.org/abs/2511.21610
- 2025-12-17, Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction, Mathieu Blondel et.al., Paper: http://arxiv.org/abs/2512.15605
- 2026-01-14, Automating Supply Chain Disruption Monitoring via an Agentic AI Approach, Sara AlMahri et.al., Paper: http://arxiv.org/abs/2601.09680
- 2025-12-08, Automating High Energy Physics Data Analysis with LLM-Powered Agents, Eli Gendreau-Distler et.al., Paper: http://arxiv.org/abs/2512.07785
- 2026-03-19, Automatic detection of Gen-AI texts: A comparative framework of neural models, Cristian Buttaro et.al., Paper: http://arxiv.org/abs/2603.18750
- 2026-02-09, Automatic In-Domain Exemplar Construction and LLM-Based Refinement of Multi-LLM Expansions for Query Expansion, Minghan Li et.al., Paper: http://arxiv.org/abs/2602.08917
- 2025-10-21, Automated urban waterlogging assessment and early warning through a mixture of foundation models, Chenxu Zhang et.al., Paper: http://arxiv.org/abs/2510.18425
- 2025-12-23, Automated stereotactic radiosurgery planning using a human-in-the-loop reasoning large language model agent, Humza Nusrat et.al., Paper: http://arxiv.org/abs/2512.20586
- 2026-02-12, Automated Test Suite Enhancement Using Large Language Models with Few-shot Prompting, Alex Chudic et.al., Paper: http://arxiv.org/abs/2602.12256
- 2026-01-21, Automated Rubrics for Reliable Evaluation of Medical Dialogue Systems, Yinzhu Chen et.al., Paper: http://arxiv.org/abs/2601.15161
- 2025-11-26, Automated Protein Motif Localization using Concept Activation Vectors in Protein Language Model Embedding Space, Ahmad Shamail et.al., Paper: http://arxiv.org/abs/2511.21614
- 2026-02-18, Automated Extraction of Mechanical Constitutive Models from Scientific Literature using Large Language Models: Applications in Cultural Heritage Conservation, Rui Hu et.al., Paper: http://arxiv.org/abs/2602.16551
- 2025-10-23, Automated Extraction of Fluoropyrimidine Treatment and Treatment-Related Toxicities from Clinical Notes Using Natural Language Processing, Xizhi Wu et.al., Paper: http://arxiv.org/abs/2510.20727
- 2025-12-02, AutoNeural: Co-Designing Vision-Language Models for NPU Inference, Wei Chen et.al., Paper: http://arxiv.org/abs/2512.02924
- 2026-01-23, Auto-Regressive Masked Diffusion Models, Mahdi Karami et.al., Paper: http://arxiv.org/abs/2601.16971
- 2025-12-03, AugServe: Adaptive Request Scheduling for Augmented Large Language Model Inference Serving, Ying Wang et.al., Paper: http://arxiv.org/abs/2512.04013
- 2026-04-01, Auditing the Reliability of Multimodal Generative Search, Erfan Samieyan Sahneh et.al., Paper: http://arxiv.org/abs/2604.00944
- 2026-02-12, AttentionRetriever: Attention Layers are Secretly Long Document Retrievers, David Jiahao Fu et.al., Paper: http://arxiv.org/abs/2602.12278
- 2025-11-26, Attention-Guided Patch-Wise Sparse Adversarial Attacks on Vision-Language-Action Models, Naifu Zhang et.al., Paper: http://arxiv.org/abs/2511.21663
- 2026-01-20, Attention-Based Offline Reinforcement Learning and Clustering for Interpretable Sepsis Treatment, Punit Kumar et.al., Paper: http://arxiv.org/abs/2601.14228
- 2025-11-18, Attention via Synaptic Plasticity is All You Need: A Biologically Inspired Spiking Neuromorphic Transformer, Kallol Mondal et.al., Paper: http://arxiv.org/abs/2511.14691
- 2025-11-07, Attention and Compression is all you need for Controllably Efficient Language Models, Jatin Prakash et.al., Paper: http://arxiv.org/abs/2511.05313
- 2026-03-12, Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing, Baifeng Shi et.al., Paper: http://arxiv.org/abs/2603.12254
- 2026-03-09, AtomVLA: Scalable Post-Training for Robotic Manipulation via Predictive Latent World Models, Xiaoquan Sun et.al., Paper: http://arxiv.org/abs/2603.08519
- 2026-02-13, Asynchronous Verified Semantic Caching for Tiered LLM Architectures, Asmit Kumar Singh et.al., Paper: http://arxiv.org/abs/2602.13165
- 2025-12-11, Asynchronous Reasoning: Training-Free Interactive Thinking LLMs, George Yakushev et.al., Paper: http://arxiv.org/abs/2512.10931
- 2026-01-13, Asymptotic Universal Alignment: A New Alignment Framework via Test-Time Scaling, Yang Cai et.al., Paper: http://arxiv.org/abs/2601.08777
- 2026-01-16, AstroReason-Bench: Evaluating Unified Agentic Planning across Heterogeneous Space Planning Problems, Weiyi Wang et.al., Paper: http://arxiv.org/abs/2601.11354
- 2025-12-24, Assessing the Software Security Comprehension of Large Language Models, Mohammed Latif Siddiq et.al., Paper: http://arxiv.org/abs/2512.21238
- 2025-12-09, Ask, Answer, and Detect: Role-Playing LLMs for Personality Detection with Question-Conditioned Mixture-of-Experts, Yifan Lyu et.al., Paper: http://arxiv.org/abs/2512.08814
- 2026-02-09, ArkEval: Benchmarking and Evaluating Automated CodeRepair for ArkTS, Bang Xie et.al., Paper: http://arxiv.org/abs/2602.08866
- 2026-02-27, ArgLLM-App: An Interactive System for Argumentative Reasoning with Large Language Models, Adam Dejl et.al., Paper: http://arxiv.org/abs/2602.24172
- 2026-01-30, Are you going to finish that? A Practical Study of the Tokenization Boundary Problem, Hao Xu et.al., Paper: http://arxiv.org/abs/2601.23223
- 2025-10-24, Are the LLMs Capable of Maintaining at Least the Language Genus?, Sandra Mitrović et.al., Paper: http://arxiv.org/abs/2510.21561
- 2025-11-06, Are language models aware of the road not taken? Token-level uncertainty and hidden state dynamics, Amir Zur et.al., Paper: http://arxiv.org/abs/2511.04527
- 2026-03-19, Are complicated loss functions necessary for teaching LLMs to reason?, Gabriele Carrino et.al., Paper: http://arxiv.org/abs/2603.18756
- 2026-01-15, Are Your Reasoning Models Reasoning or Guessing? A Mechanistic Analysis of Hierarchical Reasoning Models, Zirui Ren et.al., Paper: http://arxiv.org/abs/2601.10679
- 2025-10-29, Are Language Models Efficient Reasoners? A Perspective from Logic Programming, Andreas Opedal et.al., Paper: http://arxiv.org/abs/2510.25626
- 2026-01-12, Are LLM Decisions Faithful to Verbal Confidence?, Jiawei Wang et.al., Paper: http://arxiv.org/abs/2601.07767
- 2025-11-24, Are Image-to-Video Models Good Zero-Shot Image Editors?, Zechuan Zhang et.al., Paper: http://arxiv.org/abs/2511.19435
- 2026-03-16, Are Dilemmas and Conflicts in LLM Alignment Solvable? A View from Priority Graph, Zhenheng Tang et.al., Paper: http://arxiv.org/abs/2603.15527
- 2024-03-27, Are Compressed Language Models Less Subgroup Robust?, Leonidas Gee et.al., Paper: http://arxiv.org/abs/2403.17811
- 2026-03-31, Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injection Attacks, Chong Xiang et.al., Paper: http://arxiv.org/abs/2603.30016
- 2025-12-12, Architecting Large Action Models for Human-in-the-Loop Intelligent Robots, Kanisorn Sangchai et.al., Paper: http://arxiv.org/abs/2512.11620
- 2025-12-04, Arbitrage: Efficient Reasoning via Advantage-Aware Speculation, Monishwaran Maheswaran et.al., Paper: http://arxiv.org/abs/2512.05033
- 2026-02-12, Any House Any Task: Scalable Long-Horizon Planning for Abstract Human Tasks, Zhihong Liu et.al., Paper: http://arxiv.org/abs/2602.12244
- 2026-02-03, Antidistillation Fingerprinting, Yixuan Even Xu et.al., Paper: http://arxiv.org/abs/2602.03812
- 2026-02-10, Answer First, Reason Later: Aligning Search Relevance via Mode-Balanced Reinforcement Learning, Shijie Zhang et.al., Paper: http://arxiv.org/abs/2602.10006
- 2026-02-09, AnomSeer: Reinforcing Multimodal LLMs to Reason for Time-Series Anomaly Detection, Junru Zhang et.al., Paper: http://arxiv.org/abs/2602.08868
- 2025-11-24, Annotation-Free Class-Incremental Learning, Hari Chandana Kuchibhotla et.al., Paper: http://arxiv.org/abs/2511.19344
- 2025-12-19, AncientBench: Towards Comprehensive Evaluation on Excavated and Transmitted Chinese Corpora, Zhihan Zhou et.al., Paper: http://arxiv.org/abs/2512.17756
- 2025-12-22, Anatomy-R1: Enhancing Anatomy Reasoning in Multimodal Large Language Models via Anatomical Similarity Curriculum and Group Diversity Augmentation, Ziyang Song et.al., Paper: http://arxiv.org/abs/2512.19512
- 2026-03-16, Anatomy of a Lie: A Multi-Stage Diagnostic Framework for Tracing Hallucinations in Vision-Language Models, Lexiang Xiong et.al., Paper: http://arxiv.org/abs/2603.15557
- 2026-01-06, AnatomiX, an Anatomy-Aware Grounded Multimodal Large Language Model for Chest X-Ray Interpretation, Anees Ur Rehman Hashmi et.al., Paper: http://arxiv.org/abs/2601.03191
- 2026-02-27, Anansi: Scalable Characterization of Message-Based Job Scams, Abisheka Pitumpe et.al., Paper: http://arxiv.org/abs/2602.24223
- 2026-01-07, Analyzing and Improving Cross-lingual Knowledge Transfer for Machine Translation, David Stap et.al., Paper: http://arxiv.org/abs/2601.04036
- 2026-02-20, Analyzing and Improving Chain-of-Thought Monitorability Through Information Theory, Usman Anwar et.al., Paper: http://arxiv.org/abs/2602.18297
- 2026-01-07, Analyzing Reasoning Consistency in Large Multimodal Models under Cross-Modal Conflicts, Zhihao Zhu et.al., Paper: http://arxiv.org/abs/2601.04073
- 2026-01-13, Analyzing Bias in False Refusal Behavior of Large Language Models for Hate Speech Detoxification, Kyuri Im et.al., Paper: http://arxiv.org/abs/2601.08668
- 2025-11-05, AnaFlow: Agentic LLM-based Workflow for Reasoning-Driven Explainable and Sample-Efficient Analog Circuit Sizing, Mohsen Ahmadzadeh et.al., Paper: http://arxiv.org/abs/2511.03697
- 2026-03-17, An Interpretable Machine Learning Framework for Non-Small Cell Lung Cancer Drug Response Analysis, Ann Rachel et.al., Paper: http://arxiv.org/abs/2603.16330
- 2026-03-05, An Exploration-Analysis-Disambiguation Reasoning Framework for Word Sense Disambiguation with Low-Parameter LLMs, Deshan Sumanathilaka et.al., Paper: http://arxiv.org/abs/2603.05400
- 2026-02-24, An Expert Schema for Evaluating Large Language Model Errors in Scholarly Question-Answering Systems, Anna Martin-Boyle et.al., Paper: http://arxiv.org/abs/2602.21059
- 2026-03-26, An Experimental Comparison of the Most Popular Approaches to Fake News Detection, Pietro Dell’Oglio et.al., Paper: http://arxiv.org/abs/2603.25501
- 2025-10-21, An Encoder-Decoder Foundation Chemical Language Model for Generative Polymer Design, Harikrishna Sahu et.al., Paper: http://arxiv.org/abs/2510.18860
- 2026-01-09, An Empirical Study on Preference Tuning Generalization and Diversity Under Domain Shift, Constantinos Karouzos et.al., Paper: http://arxiv.org/abs/2601.05882
- 2026-03-20, An Empirical Study of SFT-DPO Interaction and Parameterization in Small Language Models, Yuming Feng et.al., Paper: http://arxiv.org/abs/2603.20100
- 2026-02-03, An Empirical Study of Collective Behaviors and Social Dynamics in Large Language Model Agents, Farnoosh Hashemi et.al., Paper: http://arxiv.org/abs/2602.03775
- 2026-03-17, An Efficient Heterogeneous Co-Design for Fine-Tuning on a Single GPU, Ruijia Yang et.al., Paper: http://arxiv.org/abs/2603.16428
- 2025-11-24, An Anatomy Aware Hybrid Deep Learning Framework for Lung Cancer Tumor Stage Classification, Saniah Kayenat Chowdhury et.al., Paper: http://arxiv.org/abs/2511.19367
- 2026-03-20, An Agentic Approach to Generating XAI-Narratives, Yifan He et.al., Paper: http://arxiv.org/abs/2603.20003
- 2025-11-28, Ambiguity Awareness Optimization: Towards Semantic Disambiguation for Direct Preference Optimization, Jian Li et.al., Paper: http://arxiv.org/abs/2511.23391
- 2026-01-15, Alterbute: Editing Intrinsic Attributes of Objects in Images, Tal Reiss et.al., Paper: http://arxiv.org/abs/2601.10714
- 2025-10-30, All You Need for Object Detection: From Pixels, Points, and Prompts to Next-Gen Fusion and Multimodal LLMs/VLMs in Autonomous Vehicles, Sayed Pedram Haeri Boroujeni et.al., Paper: http://arxiv.org/abs/2510.26641
- 2026-01-07, All That Glisters Is Not Gold: A Benchmark for Reference-Free Counterfactual Financial Misinformation Detection, Yuechen Jiang et.al., Paper: http://arxiv.org/abs/2601.04160
- 2025-10-27, Alita-G: Self-Evolving Generative Agent for Agent Generation, Jiahao Qiu et.al., Paper: http://arxiv.org/abs/2510.23601
- 2025-11-14, Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping, Dena Mujtaba et.al., Paper: http://arxiv.org/abs/2511.11551
- 2025-11-26, Aligning LLMs Toward Multi-Turn Conversational Outcomes Using Iterative PPO, Daniel R. Jiang et.al., Paper: http://arxiv.org/abs/2511.21638
- 2026-03-31, Aligned, Orthogonal or In-conflict: When can we safely optimize Chain-of-Thought?, Max Kaufmann et.al., Paper: http://arxiv.org/abs/2603.30036
- 2025-12-01, AlignSAE: Concept-Aligned Sparse Autoencoders, Minglai Yang et.al., Paper: http://arxiv.org/abs/2512.02004
- 2026-02-18, Align Once, Benefit Multilingually: Enforcing Multilingual Consistency for LLM Safety Alignment, Yuyan Bu et.al., Paper: http://arxiv.org/abs/2602.16660
- 2026-02-16, Algorithmic Simplification of Neural Networks with Mosaic-of-Motifs, Pedram Bakhtiarifard et.al., Paper: http://arxiv.org/abs/2602.14896
- 2025-12-01, AirSim360: A Panoramic Simulation Platform within Drone View, Xian Ge et.al., Paper: http://arxiv.org/abs/2512.02009
- 2025-12-16, Agreement Between Large Language Models and Human Raters in Essay Scoring: A Research Synthesis, Hongli Li et.al., Paper: http://arxiv.org/abs/2512.14561
- 2026-01-30, Agile Reinforcement Learning through Separable Neural Architecture, Rajib Mostakim et.al., Paper: http://arxiv.org/abs/2601.23225
- 2026-01-23, AgentsEval: Clinically Faithful Evaluation of Medical Imaging Reports via Multi-Agent Reasoning, Suzhong Fu et.al., Paper: http://arxiv.org/abs/2601.16685
- 2026-02-05, AgenticPay: A Multi-Agent LLM Negotiation System for Buyer-Seller Transactions, Xianyang Liu et.al., Paper: http://arxiv.org/abs/2602.06008
- 2025-11-04, Agentic World Modeling for 6G: Near-Real-Time Generative State-Space Reasoning, Farhad Rezazadeh et.al., Paper: http://arxiv.org/abs/2511.02748
- 2026-03-20, Agentic Harness for Real-World Compilers, Yingwei Zheng et.al., Paper: http://arxiv.org/abs/2603.20075
- 2026-01-28, Agentic Fog: A Policy-driven Framework for Distributed Intelligence in Fog Computing, Saeed Akbar et.al., Paper: http://arxiv.org/abs/2601.20764
- 2026-03-09, Agentic Critical Training, Weize Liu et.al., Paper: http://arxiv.org/abs/2603.08706
- 2026-03-03, Agentic AI-based Coverage Closure for Formal Verification, Sivaram Pothireddypalli et.al., Paper: http://arxiv.org/abs/2603.03147
- 2026-02-04, Agentic AI in Healthcare & Medicine: A Seven-Dimensional Taxonomy for Empirical Evaluation of LLM-based Agents, Shubham Vatsal et.al., Paper: http://arxiv.org/abs/2602.04813
- 2026-02-13, Agentic AI for Robot Control: Flexible but still Fragile, Oscar Lima et.al., Paper: http://arxiv.org/abs/2602.13081
- 2026-04-01, AgentWatcher: A Rule-based Prompt Injection Monitor, Yanting Wang et.al., Paper: http://arxiv.org/abs/2604.01194
- 2026-03-13, AgentRM: An OS-Inspired Resource Manager for LLM Agent Systems, Jianshu She et.al., Paper: http://arxiv.org/abs/2603.13110
- 2026-01-28, AgentLongBench: A Controllable Long Benchmark For Long-Contexts Agents via Environment Rollouts, Shicheng Fang et.al., Paper: http://arxiv.org/abs/2601.20730
- 2025-12-15, AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection, Junwen Miao et.al., Paper: http://arxiv.org/abs/2512.13671
- 2025-10-28, AgentFrontier: Expanding the Capability Frontier of LLM Agents with ZPD-Guided Data Synthesis, Xuanzhong Chen et.al., Paper: http://arxiv.org/abs/2510.24695
- 2026-03-24, AgentFoX: LLM Agent-Guided Fusion with eXplainability for AI-Generated Image Detection, Yangxin Yu et.al., Paper: http://arxiv.org/abs/2603.23115
- 2026-01-23, AgentDrive: An Open Benchmark Dataset for Agentic AI Reasoning with LLM-Generated Scenarios in Autonomous Systems, Mohamed Amine Ferrag et.al., Paper: http://arxiv.org/abs/2601.16964
- 2025-12-26, Agent-based simulation of online social networks and disinformation, Alejandro Buitrago López et.al., Paper: http://arxiv.org/abs/2512.22082
- 2025-11-04, Agent-Omni: Test-Time Multimodal Reasoning via Model Coordination for Understanding Anything, Huawei Lin et.al., Paper: http://arxiv.org/abs/2511.02834
- 2026-02-10, Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning, Zhaoyang Wang et.al., Paper: http://arxiv.org/abs/2602.10090
- 2026-02-18, Agent Skill Framework: Perspectives on the Potential of Small Language Models in Industrial Environments, Yangjie Xu et.al., Paper: http://arxiv.org/abs/2602.16653
- 2026-01-07, Agent Drift: Quantifying Behavioral Degradation in Multi-Agent LLM Systems Over Extended Interactions, Abhishek Rath et.al., Paper: http://arxiv.org/abs/2601.04170
- 2026-02-26, Agency and Architectural Limits: Why Optimization-Based Systems Cannot Be Norm-Responsive, Radha Sarma et.al., Paper: http://arxiv.org/abs/2602.23239
- 2025-10-28, Advancing site-specific disease and pest management in precision agriculture: From reasoning-driven foundation models to adaptive, feedback-based learning, Nitin Rai et.al., Paper: http://arxiv.org/abs/2510.24650
- 2026-01-13, Advancing ESG Intelligence: An Expert-level Agent and Comprehensive Benchmark for Sustainable Finance, Yilei Zhao et.al., Paper: http://arxiv.org/abs/2601.08676
- 2026-01-26, Advances and Innovations in the Multi-Agent Robotic System (MARS) Challenge, Li Kang et.al., Paper: http://arxiv.org/abs/2601.18733
- 2025-12-05, Adsorption energies are necessary but not sufficient to identify good catalysts, Shahana Chatterjee et.al., Paper: http://arxiv.org/abs/2512.05938
- 2025-11-17, Adaptive Multi-Scale Integration Unlocks Robust Cell Annotation in Histopathology Images, Yinuo Xu et.al., Paper: http://arxiv.org/abs/2511.13586
- 2025-11-25, Adaptive Hopfield Network: Rethinking Similarities in Associative Memory, Shurong Wang et.al., Paper: http://arxiv.org/abs/2511.20609
- 2026-03-20, Adaptive Greedy Frame Selection for Long Video Understanding, Yuning Huang et.al., Paper: http://arxiv.org/abs/2603.20180
- 2025-12-31, Adaptive Dependency-aware Prompt Optimization Framework for Multi-Step LLM Pipeline, Minjun Zhao et.al., Paper: http://arxiv.org/abs/2512.24933
- 2025-12-16, Adapting Speech Language Model to Singing Voice Synthesis, Yiwen Zhao et.al., Paper: http://arxiv.org/abs/2512.14657
- 2023-11-07, Adapting Language Models to Compress Contexts, Alexis Chevalier et.al., Paper: http://arxiv.org/abs/2305.14788
- 2026-03-06, Adapter-Augmented Bandits for Online Multi-Constrained Multi-Modal Inference Scheduling, Xianzhi Zhang et.al., Paper: http://arxiv.org/abs/2603.06403
- 2026-01-22, Adapter Fusion for Multilingual Text2Cypher with Linear and Learned Gating, Makbule Gulcin Ozsoy et.al., Paper: http://arxiv.org/abs/2601.16097
- 2026-02-23, Adaptation to Intrinsic Dependence in Diffusion Language Models, Yunxiao Zhao et.al., Paper: http://arxiv.org/abs/2602.20126
- 2025-12-19, AdaptPrompt: Parameter-Efficient Adaptation of VLMs for Generalizable Deepfake Detection, Yichen Jiang et.al., Paper: http://arxiv.org/abs/2512.17730
- 2025-10-21, Adamas: Hadamard Sparse Attention for Efficient Long-Context Inference, Siyuan Yan et.al., Paper: http://arxiv.org/abs/2510.18413
- 2025-11-18, AdamHD: Decoupled Huber Decay Regularization for Language Model Pre-Training, Fu-Ming Guo et.al., Paper: http://arxiv.org/abs/2511.14721
- 2026-04-02, Adam’s Law: Textual Frequency Law on Large Language Models, Hongyuan Adam Lu et.al., Paper: http://arxiv.org/abs/2604.02176
- 2026-03-02, Adam Converges Without Any Modification On Update Rules, Yushun Zhang et.al., Paper: http://arxiv.org/abs/2603.02092
- 2026-03-18, AdaZoom-GUI: Adaptive Zoom-based GUI Grounding with Instruction Refinement, Siqi Pei et.al., Paper: http://arxiv.org/abs/2603.17441
- 2025-12-18, AdaTooler-V: Adaptive Tool-Use for Images and Videos, Chaoyang Wang et.al., Paper: http://arxiv.org/abs/2512.16918
- 2025-12-18, AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning, Tzu-Han Lin et.al., Paper: http://arxiv.org/abs/2512.16883
- 2025-10-22, AdaSPEC: Selective Knowledge Distillation for Efficient Speculative Decoders, Yuezhou Hu et.al., Paper: http://arxiv.org/abs/2510.19779
- 2026-04-01, AdaLoRA-QAT: Adaptive Low-Rank and Quantization-Aware Segmentation, Prantik Deb et.al., Paper: http://arxiv.org/abs/2604.01167
- 2026-01-09, AdaFuse: Adaptive Ensemble Decoding with Test-Time Scaling for LLMs, Chengming Cui et.al., Paper: http://arxiv.org/abs/2601.06022
- 2026-02-23, AdaEvolve: Adaptive LLM Driven Zeroth-Order Optimization, Mert Cemri et.al., Paper: http://arxiv.org/abs/2602.20133
- 2026-02-16, Activation-Space Uncertainty Quantification for Pretrained Networks, Richard Bergna et.al., Paper: http://arxiv.org/abs/2602.14934
- 2025-12-17, Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers, Adam Karvonen et.al., Paper: http://arxiv.org/abs/2512.15674
- 2025-10-24, Actionable Cybersecurity Notifications for Smart Homes: A User Study on the Role of Length and Complexity, Victor Jüttner et.al., Paper: http://arxiv.org/abs/2510.21508
- 2026-02-24, ActionReasoning: Robot Action Reasoning in 3D Space with LLM for Robotic Brick Stacking, Guangming Wang et.al., Paper: http://arxiv.org/abs/2602.21161
- 2025-02-19, Acoustic Prompt Tuning: Empowering Large Language Models with Audition Capabilities, Jinhua Liang et.al., Paper: http://arxiv.org/abs/2312.00249
- 2025-11-14, Accurate models for recoil velocity distribution in black hole mergers with comparable to extreme mass-ratios and their astrophysical implications, Tousif Islam et.al., Paper: http://arxiv.org/abs/2511.11536
- 2025-10-30, Accelerating mathematical research with language models: A case study of an interaction with GPT-5-Pro on a convex analysis problem, Adil Salim et.al., Paper: http://arxiv.org/abs/2510.26647
- 2026-02-03, Accelerating Scientific Research with Gemini: Case Studies and Common Techniques, David P. Woodruff et.al., Paper: http://arxiv.org/abs/2602.03837
- 2025-12-26, Accelerate Speculative Decoding with Sparse Computation in Verification, Jikai Wang et.al., Paper: http://arxiv.org/abs/2512.21911
- 2026-02-02, Abstract Activation Spaces for Content-Invariant Reasoning in Large Language Models, Gabriele Maraia et.al., Paper: http://arxiv.org/abs/2602.02462
- 2026-03-06, Abductive Reasoning with Syllogistic Forms in Large Language Models, Hirohiko Abe et.al., Paper: http://arxiv.org/abs/2603.06428
- 2026-03-25, AVO: Agentic Variation Operators for Autonomous Evolutionary Search, Terry Chen et.al., Paper: http://arxiv.org/abs/2603.24517
- 2025-11-19, AVATAAR: Agentic Video Answering via Temporal Adaptive Alignment and Reasoning, Urjitkumar Patel et.al., Paper: http://arxiv.org/abs/2511.15578
- 2026-03-31, ATP-Bench: Towards Agentic Tool Planning for MLLM Interleaved Generation, Yinuo Liu et.al., Paper: http://arxiv.org/abs/2603.29902
- 2025-11-05, ASVRI-Legal: Fine-Tuning LLMs with Retrieval Augmented Generation for Enhanced Legal Regulation, One Octadion et.al., Paper: http://arxiv.org/abs/2511.03563
- 2025-12-04, ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning, Shengyuan Ding et.al., Paper: http://arxiv.org/abs/2512.05111
- 2026-03-13, ARL-Tangram: Unleash the Resource Efficiency in Agentic Reinforcement Learning, Bangjun Xiao et.al., Paper: http://arxiv.org/abs/2603.13019
- 2025-10-23, ARGenSeg: Image Segmentation with Autoregressive Image Generation Model, Xiaolong Wang et.al., Paper: http://arxiv.org/abs/2510.20803
- 2026-02-27, ARGUS: Seeing the Influence of Narrative Features on Persuasion in Argumentative Texts, Sara Nabhani et.al., Paper: http://arxiv.org/abs/2602.24109
- 2026-03-24, ARGENT: Adaptive Hierarchical Image-Text Representations, Chuong Huynh et.al., Paper: http://arxiv.org/abs/2603.23311
- 2025-11-06, ARETE: an R package for Automated REtrieval from TExt with large language models, Vasco V. Branco et.al., Paper: http://arxiv.org/abs/2511.04573
- 2026-02-18, AREG: Adversarial Resource Extraction Game for Evaluating Persuasion and Resistance in Large Language Models, Adib Sakhawat et.al., Paper: http://arxiv.org/abs/2602.16639
- 2025-11-18, ARC Is a Vision Problem!, Keya Hu et.al., Paper: http://arxiv.org/abs/2511.14761
- 2026-03-08, AQuA: Toward Strategic Response Generation for Ambiguous Visual Questions, Jihyoung Jang et.al., Paper: http://arxiv.org/abs/2603.07394
- 2026-03-03, APRES: An Agentic Paper Revision and Evaluation System, Bingchen Zhao et.al., Paper: http://arxiv.org/abs/2603.03142
- 2026-02-05, AP-OOD: Attention Pooling for Out-of-Distribution Detection, Claus Hofmann et.al., Paper: http://arxiv.org/abs/2602.06031
- 2026-02-09, ANCRe: Adaptive Neural Connection Reassignment for Efficient Depth Scaling, Yilang Zhang et.al., Paper: http://arxiv.org/abs/2602.09009
- 2025-10-30, AMO-Bench: Large Language Models Still Struggle in High School Math Competitions, Shengnan An et.al., Paper: http://arxiv.org/abs/2510.26768
- 2025-10-31, AMD MI300X GPU Performance Analysis, Chandrish Ambati et.al., Paper: http://arxiv.org/abs/2510.27583
- 2025-12-31, AMAP Agentic Planning Technical Report, Yulan Hu et.al., Paper: http://arxiv.org/abs/2512.24957
- 2025-10-29, ALDEN: Reinforcement Learning for Active Navigation and Evidence Gathering in Long Documents, Tianyu Yang et.al., Paper: http://arxiv.org/abs/2510.25668
- 2026-03-10, ALARM: Audio-Language Alignment for Reasoning Models, Petr Grinberg et.al., Paper: http://arxiv.org/abs/2603.09556
- 2026-02-06, AIRS-Bench: a Suite of Tasks for Frontier AI Research Science Agents, Alisia Lupidi et.al., Paper: http://arxiv.org/abs/2602.06855
- 2026-01-29, AIRPET: Virtual Positron Emission Tomography, J. Renner et.al., Paper: http://arxiv.org/abs/2601.22059
- 2025-11-05, AILA–First Experiments with Localist Language Models, Joachim Diederich et.al., Paper: http://arxiv.org/abs/2511.03559
- 2026-03-03, AI-for-Science Low-code Platform with Bayesian Adversarial Multi-Agent Framework, Zihang Zeng et.al., Paper: http://arxiv.org/abs/2603.03233
- 2026-02-09, AI-based Verbal and Visual Scaffolding in a Serious Game: Effects on Learning and Cognitive Load, Caroline Wermann et.al., Paper: http://arxiv.org/abs/2602.08893
- 2026-02-20, AI-Wrapped: Participatory, Privacy-Preserving Measurement of Longitudinal LLM Use In-the-Wild, Cathy Mengying Fang et.al., Paper: http://arxiv.org/abs/2602.18415
- 2026-03-25, AI-Supervisor: Autonomous AI Research Supervision via a Persistent Research World Model, Yunbo Long et.al., Paper: http://arxiv.org/abs/2603.24402
- 2025-12-12, AI-MASLD Metabolic Dysfunction and Information Steatosis of Large Language Models in Unstructured Clinical Narratives, Yuan Shen et.al., Paper: http://arxiv.org/abs/2512.11544
- 2026-03-18, AI-Assisted Goal Setting Improves Goal Progress Through Social Accountability, Michel Schimpf et.al., Paper: http://arxiv.org/abs/2603.17887
- 2026-02-19, AI Gamestore: Scalable, Open-Ended Evaluation of Machine General Intelligence with Human Games, Lance Ying et.al., Paper: http://arxiv.org/abs/2602.17594
- 2025-11-04, AI Diffusion in Low Resource Language Countries, Amit Misra et.al., Paper: http://arxiv.org/abs/2511.02752
- 2025-12-09, AI Didn’t Start the Fire: Examining the Stack Exchange Moderator and Contributor Strike, Yiwei Wu et.al., Paper: http://arxiv.org/abs/2512.08884
- 2025-12-12, AI Benchmark Democratization and Carpentry, Gregor von Laszewski et.al., Paper: http://arxiv.org/abs/2512.11588
- 2026-03-20, AI Agents Can Already Autonomously Perform Experimental High Energy Physics, Eric A. Moreno et.al., Paper: http://arxiv.org/abs/2603.20179
- 2026-03-19, ADAPT: Attention Driven Adaptive Prompt Scheduling and InTerpolating Orthogonal Complements for Rare Concepts Generation, Kwanyoung Lee et.al., Paper: http://arxiv.org/abs/2603.19157
- 2026-01-16, ACoT-VLA: Action Chain-of-Thought for Vision-Language-Action Models, Linqing Zhong et.al., Paper: http://arxiv.org/abs/2601.11404
- 2026-04-01, ACT Now: Preempting LVLM Hallucinations via Adaptive Context Integration, Bei Yan et.al., Paper: http://arxiv.org/abs/2604.00983
- 2025-11-10, ACE-ICD: Acronym Expansion As Data Augmentation For Automated ICD Coding, Tuan-Dung Le et.al., Paper: http://arxiv.org/abs/2511.07311
- 2026-03-03, ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments, Ziyang Gong et.al., Paper: http://arxiv.org/abs/2603.03198
- 2026-04-02, AA-SVD : Anchored and Adaptive SVD for Large Language Model Compression, Atul Kumar Sinha et.al., Paper: http://arxiv.org/abs/2604.02119
- 2025-11-19, A critical review of pre-post surveys designed to measure student epistemology in undergraduate science courses, Kyriaki Chatzikyriakidou et.al., Paper: http://arxiv.org/abs/2511.15575
- 2026-01-02, A Vision-and-Knowledge Enhanced Large Language Model for Generalizable Pedestrian Crossing Behavior Inference, Qingwen Pu et.al., Paper: http://arxiv.org/abs/2601.00694
- 2025-10-23, A Use-Case Specific Dataset for Measuring Dimensions of Responsible Performance in LLM-generated Text, Alicia Sagae et.al., Paper: http://arxiv.org/abs/2510.20782
- 2026-02-10, A Unified Assessment of the Poverty of the Stimulus Argument for Neural Language Models, Xiulin Yang et.al., Paper: http://arxiv.org/abs/2602.09992
- 2026-03-12, A Two-Stage Dual-Modality Model for Facial Emotional Expression Recognition, Jiajun Sun et.al., Paper: http://arxiv.org/abs/2603.12221
- 2026-03-14, A Theory of Appropriateness That Accounts for Norms of Rationality, Joel Z. Leibo et.al., Paper: http://arxiv.org/abs/2603.14050
- 2026-02-19, A Theoretical Framework for Modular Learning of Robust Generative Models, Corinna Cortes et.al., Paper: http://arxiv.org/abs/2602.17554
- 2026-02-25, A Taxonomy of Human–MLLM Interaction in Early-Stage Sketch-Based Design Ideation, Weiayn Shi et.al., Paper: http://arxiv.org/abs/2602.22171
- 2026-02-10, A Task-Centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula, Chenruo Liu et.al., Paper: http://arxiv.org/abs/2602.10014
- 2026-03-11, A Systematic Study of Pseudo-Relevance Feedback with LLMs, Nour Jedidi et.al., Paper: http://arxiv.org/abs/2603.11008
- 2025-12-09, A Systematic Evaluation of Preference Aggregation in Federated RLHF for Pluralistic Alignment of LLMs, Mahmoud Srewa et.al., Paper: http://arxiv.org/abs/2512.08786
- 2026-02-05, A Systematic Evaluation of Large Language Models for PTSD Severity Estimation: The Role of Contextual Knowledge and Modeling Strategies, Panagiotis Kaliosis et.al., Paper: http://arxiv.org/abs/2602.06015
- 2024-04-09, A Survey on Transformer Compression, Yehui Tang et.al., Paper: http://arxiv.org/abs/2402.05964
- 2025-10-27, A Survey of Data Agents: Emerging Paradigm or Overstated Hype?, Yizhang Zhu et.al., Paper: http://arxiv.org/abs/2510.23587
- 2025-12-15, A Scientific Reasoning Model for Organic Synthesis Procedure Generation, Guoqing Liu et.al., Paper: http://arxiv.org/abs/2512.13668
- 2026-01-07, A Scheduling Framework for Efficient MoE Inference on Edge GPU-NDP Systems, Qi Wu et.al., Paper: http://arxiv.org/abs/2601.03992
- 2026-02-03, A Scene Graph Backed Approach to Open Set Semantic Mapping, Martin Günther et.al., Paper: http://arxiv.org/abs/2602.03781
- 2026-01-23, A Scalable Measure of Loss Landscape Curvature for Analyzing the Training Dynamics of LLMs, Dayal Singh Kalra et.al., Paper: http://arxiv.org/abs/2601.16979
- 2026-03-06, A Scalable Benchmark for Repository-Oriented Long-Horizon Conversational Context Management, Yang Liu et.al., Paper: http://arxiv.org/abs/2603.06358
- 2026-04-01, A ROS 2 Wrapper for Florence-2: Multi-Mode Local Vision-Language Inference for Robotic Systems, J. E. Domínguez-Vidal et.al., Paper: http://arxiv.org/abs/2604.01179
- 2025-12-09, A Practical Guide for Designing, Developing, and Deploying Production-Grade Agentic AI Workflows, Eranga Bandara et.al., Paper: http://arxiv.org/abs/2512.08769
- 2025-12-24, A Plan Reuse Mechanism for LLM-Driven Agent, Guopeng Li et.al., Paper: http://arxiv.org/abs/2512.21309
- 2026-03-19, A Pipelined Collaborative Speculative Decoding Framework for Efficient Edge-Cloud LLM Inference, Yida Zhang et.al., Paper: http://arxiv.org/abs/2603.19133
- 2026-02-27, A Novel Hierarchical Multi-Agent System for Payments Using LLMs, Joon Kiat Chua et.al., Paper: http://arxiv.org/abs/2602.24068
- 2025-10-24, A Multimodal Benchmark for Framing of Oil & Gas Advertising and Potential Greenwashing Detection, Gaku Morio et.al., Paper: http://arxiv.org/abs/2510.21679
- 2026-02-26, A Mixture-of-Experts Model for Multimodal Emotion Recognition in Conversations, Soumya Dutta et.al., Paper: http://arxiv.org/abs/2602.23300
- 2026-03-06, A Mixture-of-Experts Framework for Practical Hybrid-Quantum Models in Credit Card Fraud Detection, Rodrigo Chaves et.al., Paper: http://arxiv.org/abs/2603.06473
- 2025-11-07, A Metamorphic Testing Perspective on Knowledge Distillation for Language Models of Code: Does the Student Deeply Mimic the Teacher?, Md. Abdul Awal et.al., Paper: http://arxiv.org/abs/2511.05476
- 2026-03-26, A Mentalistic Interface for Probing Folk-Psychological Attribution to Non-Humanoid Robots, Giulio Pisaneschi et.al., Paper: http://arxiv.org/abs/2603.25646
- 2025-12-05, A Machine Learning Framework for Predicting Glass-Forming Ability in Ternary Alloy Systems, Fatemeh Mahmoudi et.al., Paper: http://arxiv.org/abs/2512.05895
- 2026-03-11, A Hybrid Knowledge-Grounded Framework for Safety and Traceability in Prescription Verification, Yichi Zhu et.al., Paper: http://arxiv.org/abs/2603.10891
- 2026-03-17, A Human-Centred Architecture for Large Language Models-Cognitive Assistants in Manufacturing within Quality Management Systems, Marcos Galdino et.al., Paper: http://arxiv.org/abs/2603.16325
- 2026-03-13, A Generative Model of Conspicuous Consumption and Status Signaling, Logan Cross et.al., Paper: http://arxiv.org/abs/2603.13220
- 2026-01-29, A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine, Anran Li et.al., Paper: http://arxiv.org/abs/2601.22124
- 2026-03-04, A Dual-Helix Governance Approach Towards Reliable Agentic AI for WebGIS Development, Boyuan et.al., Paper: http://arxiv.org/abs/2603.04390
- 2026-02-17, A Differential Fuzzing-Based Evaluation of Functional Equivalence in LLM-Generated Code Refactorings, Simantika Bhattacharjee Dristi et.al., Paper: http://arxiv.org/abs/2602.15761
- 2025-12-23, A DeepSeek-Powered AI System for Automated Chest Radiograph Interpretation in Clinical Practice, Yaowei Bai et.al., Paper: http://arxiv.org/abs/2512.20344
- 2026-03-25, A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula, Cansu Sancaktar et.al., Paper: http://arxiv.org/abs/2603.24202
- 2025-10-24, A Data-Centric Approach to Multilingual E-Commerce Product Search: Case Study on Query-Category and Query-Item Relevance, Yabo Yin et.al., Paper: http://arxiv.org/abs/2510.21671
- 2026-02-18, A Contrastive Learning Framework Empowered by Attention-based Feature Adaptation for Street-View Image Classification, Qi You et.al., Paper: http://arxiv.org/abs/2602.16590
- 2026-03-13, A Closed-Form Solution for Debiasing Vision-Language Models with Utility Guarantees Across Modalities and Tasks, Tangzheng Lian et.al., Paper: http://arxiv.org/abs/2603.12998
- 2026-03-29, A Benchmarking Methodology to Assess Open-Source Video Large Language Models in Automatic Captioning of News Videos, David Miranda Paredes et.al., Paper: http://arxiv.org/abs/2603.27662
- 2026-02-24, A Benchmark for Deep Information Synthesis, Debjit Paul et.al., Paper: http://arxiv.org/abs/2602.21143
- 2026-02-09, A Behavioural and Representational Evaluation of Goal-Directedness in Language Model Agents, Raghu Arghal et.al., Paper: http://arxiv.org/abs/2602.08964
- 2026-04-01, A 4D Representation for Training-Free Agentic Reasoning from Monocular Laparoscopic Video, Maximilian Fehrentz et.al., Paper: http://arxiv.org/abs/2604.00867
- 2026-02-12, 3DGSNav: Enhancing Vision-Language Model Reasoning for Object Navigation via Active 3D Gaussian Splatting, Wancai Zheng et.al., Paper: http://arxiv.org/abs/2602.12159
- 2026-03-24, 3DCity-LLM: Empowering Multi-modality Large Language Models for 3D City-scale Perception and Understanding, Yiping Chen et.al., Paper: http://arxiv.org/abs/2603.23447
- 2026-03-25, 3D-Mix for VLA: A Plug-and-Play Module for Integrating VGGT-based 3D Information into Vision-Language-Action Models, Bin Yu et.al., Paper: http://arxiv.org/abs/2603.24393
- 2026-03-23, 3D-Layout-R1: Structured Reasoning for Language-Instructed Spatial Editing, Haoyu Zhen et.al., Paper: http://arxiv.org/abs/2603.22279
- 2025-11-25, 3D-Aware Multi-Task Learning with Cross-View Correlations for Dense Scene Understanding, Xiaoye Wang et.al., Paper: http://arxiv.org/abs/2511.20646
- 2025-11-18, $π^{*}_{0.6}$ : a VLA That Learns From Experience, Ali Amin et.al., Paper: http://arxiv.org/abs/2511.14759
- 2026-01-26, $α^3$ -SecBench: A Large-Scale Evaluation Suite of Security, Resilience, and Trust for LLM-based UAV Agents over 6G Networks, Mohamed Amine Ferrag et.al., Paper: http://arxiv.org/abs/2601.18754
- 2025-11-07, $\mathbf{S^2LM}$ : Towards Semantic Steganography via Large Language Models, Huanqi Wu et.al., Paper: http://arxiv.org/abs/2511.05319
- 2026-01-12, “TODO: Fix the Mess Gemini Created”: Towards Understanding GenAI-Induced Self-Admitted Technical Debt, Abdullah Al Mujahid et.al., Paper: http://arxiv.org/abs/2601.07786
- 2026-02-20, “How Do I …?”: Procedural Questions Predominate Student-LLM Chatbot Conversations, Alexandra Neagu et.al., Paper: http://arxiv.org/abs/2602.18372
- 2026-02-24, “Are You Sure?”: An Empirical Study of Human Perception Vulnerability in LLM-Driven Agentic Systems, Xinfeng Li et.al., Paper: http://arxiv.org/abs/2602.21127
(<a href=#updated-on-20260403>back to top</a>)
Reinforcement Learning
- 2026-03-07, wDPO: Winsorized Direct Preference Optimization for Robust LLM Alignment, Jilong Liu et.al., Paper: http://arxiv.org/abs/2603.07211
- 2025-11-10, samsara: A Continuous-Time Markov Chain Monte Carlo Sampler for Trans-Dimensional Bayesian Analysis, Gabriele Astorino et.al., Paper: http://arxiv.org/abs/2511.07385
- 2026-02-08, rePIRL: Learn PRM with Inverse RL for LLM Reasoning, Xian Wu et.al., Paper: http://arxiv.org/abs/2602.07832
- 2025-10-31, pDANSE: Particle-based Data-driven Nonlinear State Estimation from Nonlinear Measurements, Anubhab Ghosh et.al., Paper: http://arxiv.org/abs/2510.27503
- 2025-10-22, olmOCR 2: Unit Test Rewards for Document OCR, Jake Poznanski et.al., Paper: http://arxiv.org/abs/2510.19817
- 2026-02-23, noDice: Inference for Discrete Probabilistic Programs with Nondeterminism and Conditioning, Tobias Gürtler et.al., Paper: http://arxiv.org/abs/2602.20049
- 2026-03-11, mAceReason-Math: A Dataset of High-Quality Multilingual Math Problems Ready For RLVR, Konstantin Dobler et.al., Paper: http://arxiv.org/abs/2603.10767
- 2026-02-09, iGRPO: Self-Feedback-Driven LLM Reasoning, Ali Hatamizadeh et.al., Paper: http://arxiv.org/abs/2602.09000
- 2025-11-20, gfnx: Fast and Scalable Library for Generative Flow Networks in JAX, Daniil Tiapkin et.al., Paper: http://arxiv.org/abs/2511.16592
- 2026-02-19, g4chargeit: Geant4-based kinetic Monte Carlo simulations of charging in dielectric materials, Kush P. Gandhi et.al., Paper: http://arxiv.org/abs/2602.17332
- 2025-12-10, d-TreeRPO: Towards More Reliable Policy Optimization for Diffusion Language Models, Leyi Pan et.al., Paper: http://arxiv.org/abs/2512.09675
- 2026-02-06, compar:IA: The French Government’s LLM arena to collect French-language human prompts and preference data, Lucie Termignon et.al., Paper: http://arxiv.org/abs/2602.06669
- 2026-01-27, Zeroth-order parallel sampling, Francesco Pozza et.al., Paper: http://arxiv.org/abs/2601.19722
- 2026-01-23, Zero-Shot MARL Benchmark in the Cyber-Physical Mobility Lab, Julius Beerwerth et.al., Paper: http://arxiv.org/abs/2601.16578
- 2025-10-29, Zero Reinforcement Learning Towards General Domains, Yuyuan Zeng et.al., Paper: http://arxiv.org/abs/2510.25528
- 2026-03-14, Your Vision-Language-Action Model Already Has Attention Heads For Path Deviation Detection, Jaehwan Jeong et.al., Paper: http://arxiv.org/abs/2603.13782
- 2026-01-13, Your Group-Relative Advantage Is Biased, Fengkai Yang et.al., Paper: http://arxiv.org/abs/2601.08521
- 2025-11-18, XAttn-BMD: Multimodal Deep Learning with Cross-Attention for Femoral Neck Bone Mineral Density Estimation, Yilin Zhang et.al., Paper: http://arxiv.org/abs/2511.14604
- 2026-02-09, WorldCompass: Reinforcement Learning for Long-Horizon World Models, Zehan Wang et.al., Paper: http://arxiv.org/abs/2602.09022
- 2026-02-02, World-Gymnast: Training Robots with Reinforcement Learning in a World Model, Ansh Kumar Sharma et.al., Paper: http://arxiv.org/abs/2602.02454
- 2026-03-24, WildWorld: A Large-Scale Dataset for Dynamic World Modeling with Actions and Explicit State toward Generative ARPG, Zhen Li et.al., Paper: http://arxiv.org/abs/2603.23497
- 2026-02-09, WildReward: Learning Reward Models from In-the-Wild Human Interactions, Hao Peng et.al., Paper: http://arxiv.org/abs/2602.08829
- 2026-03-05, Wiki-R1: Incentivizing Multimodal Reasoning for Knowledge-based VQA via Data and Sampling Curriculum, Shan Ning et.al., Paper: http://arxiv.org/abs/2603.05256
- 2026-03-16, Why Quarks and Leptons Demand Different Symmetries: A Systematic Z3 Froggatt-Nielsen Analysis, Navid Ardakanian et.al., Paper: http://arxiv.org/abs/2603.15455
- 2025-10-21, Why Policy Gradient Algorithms Work for Undiscounted Total-Reward MDPs, Jongmin Lee et.al., Paper: http://arxiv.org/abs/2510.18340
- 2026-02-24, Why Pass@k Optimization Can Degrade Pass@1: Prompt Interference in LLM Post-training, Anas Barakat et.al., Paper: http://arxiv.org/abs/2602.21189
- 2026-01-30, Why GRPO Needs Normalization: A Local-Curvature Perspective on Adaptive Gradients, Cheng Ge et.al., Paper: http://arxiv.org/abs/2601.23135
- 2026-03-05, Whispering to a Blackbox: Bootstrapping Frozen OCR with Visual Prompts, Samandar Samandarov et.al., Paper: http://arxiv.org/abs/2603.05276
- 2026-03-28, Where-to-Learn: Analytical Policy Gradient Directed Exploration for On-Policy Robotic Reinforcement Learning, Leixin Chang et.al., Paper: http://arxiv.org/abs/2603.27317
- 2026-03-17, When and Why Does Unsupervised RL Succeed in Mathematical Reasoning? A Manifold Envelopment Perspective, Zelin Zhang et.al., Paper: http://arxiv.org/abs/2603.16578
- 2026-02-04, When Silence Is Golden: Can LLMs Learn to Abstain in Temporal QA and Beyond?, Xinyu Zhou et.al., Paper: http://arxiv.org/abs/2602.04755
- 2026-03-17, When Should a Robot Think? Resource-Aware Reasoning via Reinforcement Learning for Embodied Robotic Decision-Making, Jun Liu et.al., Paper: http://arxiv.org/abs/2603.16673
- 2026-02-06, When RL Meets Adaptive Speculative Training: A Unified Training-Serving System, Junxiong Wang et.al., Paper: http://arxiv.org/abs/2602.06932
- 2026-03-10, When Learning Rates Go Wrong: Early Structural Signals in PPO Actor-Critic, Alberto Fernández-Hernández et.al., Paper: http://arxiv.org/abs/2603.09950
- 2026-01-16, When Are Two Scores Better Than One? Investigating Ensembles of Diffusion Models, Raphaël Razafindralambo et.al., Paper: http://arxiv.org/abs/2601.11444
- 2025-10-31, When AI Trading Agents Compete: Adverse Selection of Meta-Orders by Reinforcement Learning-Based Market Making, Ali Raza Jafree et.al., Paper: http://arxiv.org/abs/2510.27334
- 2025-10-20, When 5G NTN Meets GNSS: Tracking GNSS Signals under Overlaid 5G Waveforms, Idir Edjekouane et.al., Paper: http://arxiv.org/abs/2510.17324
- 2025-12-05, Whatever Remains Must Be True: Filtering Drives Reasoning in LLMs, Shaping Diversity, Germán Kruszewski et.al., Paper: http://arxiv.org/abs/2512.05962
- 2026-03-17, What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline, Benoît Alcaraz et.al., Paper: http://arxiv.org/abs/2603.16651
- 2026-04-02, What can be computed in average anonymous networks?, Joel Rybicki et.al., Paper: http://arxiv.org/abs/2604.02192
- 2026-03-20, What If Consensus Lies? Selective-Complementary Reinforcement Learning at Test Time, Dong Yan et.al., Paper: http://arxiv.org/abs/2603.19880
- 2026-03-04, What Does Flow Matching Bring To TD Learning?, Bhavya Agrawalla et.al., Paper: http://arxiv.org/abs/2603.04333
- 2025-12-17, Well Begun, Half Done: Reinforcement Learning with Prefix Optimization for LLM Reasoning, Yiliu Sun et.al., Paper: http://arxiv.org/abs/2512.15274
- 2025-10-21, WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection, Guanzhong He et.al., Paper: http://arxiv.org/abs/2510.18798
- 2026-01-06, WebAnchor: Anchoring Agent Planning to Stabilize Long-Horizon Web Reasoning, Yu Xinmiao et.al., Paper: http://arxiv.org/abs/2601.03164
- 2026-02-16, Weak Poincaré inequalities for Deterministic-scan Metropolis-within-Gibbs samplers, Mengxi Gao et.al., Paper: http://arxiv.org/abs/2602.14692
- 2026-03-13, Weak Adversarial Neural Pushforward Method for Fractional Fokker-Planck Equations, Andrew Qing He et.al., Paper: http://arxiv.org/abs/2603.12869
- 2025-10-30, Wavefront Curvature and Transverse Atomic Motion in Time-Resolved Atom Interferometry: Impact and Mitigation, Noam Mouelle et.al., Paper: http://arxiv.org/abs/2510.26739
- 2026-03-12, Wasserstein Gradient Flows for Batch Bayesian Optimal Experimental Design, Louis Sharrock et.al., Paper: http://arxiv.org/abs/2603.12102
- 2026-03-18, WINFlowNets: Warm-up Integrated Networks Training of Generative Flow Networks for Robotics and Machine Fault Adaptation, Zahin Sufiyan et.al., Paper: http://arxiv.org/abs/2603.17301
- 2025-11-14, W2S-AlignTree: Weak-to-Strong Inference-Time Alignment for Large Language Models via Monte Carlo Tree Search, Zhenyu Ding et.al., Paper: http://arxiv.org/abs/2511.11518
- 2026-02-18, Vulnerability Analysis of Safe Reinforcement Learning via Inverse Constrained Reinforcement Learning, Jialiang Fan et.al., Paper: http://arxiv.org/abs/2602.16543
- 2026-03-18, VolumeDP: Modeling Volumetric Representation for Manipulation Policy Learning, Tianxing Zhou et.al., Paper: http://arxiv.org/abs/2603.17720
- 2026-03-13, Visual-ERM: Reward Modeling for Visual Equivalence, Ziyu Liu et.al., Paper: http://arxiv.org/abs/2603.13224
- 2025-11-07, Visual Spatial Tuning, Rui Yang et.al., Paper: http://arxiv.org/abs/2511.05491
- 2026-01-02, Vision-based Goal-Reaching Control for Mobile Robots Using a Hierarchical Learning Framework, Mehdi Heydari Shahna et.al., Paper: http://arxiv.org/abs/2601.00610
- 2025-11-19, Viscous Dark Energy and Mass-Varying Dark Matter in Lyra Manifold: Cosmological Dynamics and Observational Constraints, Giridhari Deogharia et.al., Paper: http://arxiv.org/abs/2511.15422
- 2026-02-05, VisRefiner: Learning from Visual Differences for Screenshot-to-Code Generation, Jie Deng et.al., Paper: http://arxiv.org/abs/2602.05998
- 2025-11-19, VisPlay: Self-Evolving Vision-Language Models from Images, Yicheng He et.al., Paper: http://arxiv.org/abs/2511.15661
- 2026-03-18, Virtual Polarization Modulation: Enabling CSI-Free DCO-OFDM over Dynamic OWC Channels, Tian Cao et.al., Paper: http://arxiv.org/abs/2603.17316
- 2026-02-09, VideoVeritas: AI-Generated Video Detection via Perception Pretext Reinforcement Learning, Hao Tan et.al., Paper: http://arxiv.org/abs/2602.08828
- 2026-02-08, VideoTemp-o3: Harmonizing Temporal Grounding and Video Understanding in Agentic Thinking-with-Videos, Wenqi Liu et.al., Paper: http://arxiv.org/abs/2602.07801
- 2025-10-27, VideoTG-R1: Boosting Video Temporal Grounding via Curriculum Reinforcement Learning on Reflected Boundary Annotations, Lu Dong et.al., Paper: http://arxiv.org/abs/2510.23397
- 2026-01-30, Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning, Xiangyu Zeng et.al., Paper: http://arxiv.org/abs/2601.23224
- 2025-11-20, Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO, Junhao Cheng et.al., Paper: http://arxiv.org/abs/2511.16669
- 2025-10-27, Video-Thinker: Sparking “Thinking with Videos” via Reinforcement Learning, Shijian Wang et.al., Paper: http://arxiv.org/abs/2510.23473
- 2025-11-21, Video-R4: Reinforcing Text-Rich Video Reasoning with Visual Rumination, Yolo Yunlong Tang et.al., Paper: http://arxiv.org/abs/2511.17490
- 2025-11-28, Video-R2: Reinforcing Consistent and Grounded Reasoning in Multimodal Language Models, Muhammad Maaz et.al., Paper: http://arxiv.org/abs/2511.23478
- 2026-01-27, Video-KTR: Reinforcing Video Reasoning via Key Token Attribution, Ziyue Wang et.al., Paper: http://arxiv.org/abs/2601.19686
- 2025-11-28, Video-CoM: Interactive Video Reasoning via Chain of Manipulations, Hanoona Rasheed et.al., Paper: http://arxiv.org/abs/2511.23477
- 2026-01-12, Video Generation Models in Robotics – Applications, Research Challenges, Future Directions, Zhiting Mei et.al., Paper: http://arxiv.org/abs/2601.07823
- 2026-01-12, Video Evidence to Reasoning Efficient Video Understanding via Explicit Evidence Grounding, Yanxiang Huang et.al., Paper: http://arxiv.org/abs/2601.07761
- 2025-11-04, VidEmo: Affective-Tree Reasoning for Emotion-Centric Video Foundation Models, Zhicheng Zhang et.al., Paper: http://arxiv.org/abs/2511.02712
- 2026-03-17, Via Negativa for AI Alignment: Why Negative Constraints Are Structurally Superior to Positive Preferences, Quan Cheng et.al., Paper: http://arxiv.org/abs/2603.16417
- 2026-03-19, ViTac-Tracing: Visual-Tactile Imitation Learning of Deformable Object Tracing, Yongqiang Zhao et.al., Paper: http://arxiv.org/abs/2603.18784
- 2025-12-16, Vertically resolved minimal-set k-distribution for thermal infrared absorption in the Venus atmosphere, Boris Fomin et.al., Paper: http://arxiv.org/abs/2512.14120
- 2025-10-21, Verifiable Accuracy and Abstention Rewards in Curriculum RL to Alleviate Lost-in-Conversation, Ming Li et.al., Paper: http://arxiv.org/abs/2510.18731
- 2026-01-21, Vehicle Routing with Finite Time Horizon using Deep Reinforcement Learning with Improved Network Embedding, Ayan Maity et.al., Paper: http://arxiv.org/abs/2601.15131
- 2026-01-23, Variational approximate penalized credible regions for Bayesian grouped regression, Weichang Yu et.al., Paper: http://arxiv.org/abs/2601.16585
- 2026-03-19, Variational and Annealing-Based Approaches to Quantum Combinatorial Optimization, Hala Hawashin et.al., Paper: http://arxiv.org/abs/2603.19117
- 2025-10-27, Variational Thermal State Preparation on Digital Quantum Processors Assisted by Matrix Product States, Rui-Hao Li et.al., Paper: http://arxiv.org/abs/2510.23546
- 2025-12-05, Variational Quantum Rainbow Deep Q-Network for Optimizing Resource Allocation Problem, Truong Thanh Hung Nguyen et.al., Paper: http://arxiv.org/abs/2512.05946
- 2025-11-14, Variational Quantum Algorithms for Particle Track Reconstruction, Vincenzo Lipardi et.al., Paper: http://arxiv.org/abs/2511.11397
- 2026-04-01, Variational Dynamics of Open Quantum Spin Systems in Phase Space, Jacopo Tosca et.al., Paper: http://arxiv.org/abs/2604.01165
- 2025-12-22, Variational Autoregressive Networks Applied to $φ^4$ Field Theory Systems, Moxian Qian et.al., Paper: http://arxiv.org/abs/2512.19575
- 2025-11-28, Variation of Microphysical Parameters in Reverse-shock Scenario, Nissim Fraija et.al., Paper: http://arxiv.org/abs/2511.23426
- 2025-12-04, Value Gradient Guidance for Flow Matching Alignment, Zhen Liu et.al., Paper: http://arxiv.org/abs/2512.05116
- 2026-03-03, Valet: A Standardized Testbed of Traditional Imperfect-Information Card Games, Mark Goadrich et.al., Paper: http://arxiv.org/abs/2603.03252
- 2026-02-27, Vacancy-induced local moments in quantum paramagnetic phases: An SU( $N$ ) designer Hamiltonian study, Md Zahid Ansari et.al., Paper: http://arxiv.org/abs/2602.24203
- 2026-02-03, Vacancy defects in square-triangle tilings and their implications for quasicrystals formed by square-shoulder particles, Alptuğ Ulugöl et.al., Paper: http://arxiv.org/abs/2602.03813
- 2025-10-28, VOLD: Reasoning Transfer from LLMs to Vision-Language Models via On-Policy Distillation, Walid Bousselham et.al., Paper: http://arxiv.org/abs/2510.23497
- 2026-01-13, VLingNav: Embodied Navigation with Adaptive Reasoning and Visual-Assisted Linguistic Memory, Shaoan Wang et.al., Paper: http://arxiv.org/abs/2601.08665
- 2026-03-24, VLGOR: Visual-Language Knowledge Guided Offline Reinforcement Learning for Generalizable Agents, Pengsen Liu et.al., Paper: http://arxiv.org/abs/2603.22892
- 2026-02-18, VIGOR: Visual Goal-In-Context Inference for Unified Humanoid Fall Safety, Osher Azulay et.al., Paper: http://arxiv.org/abs/2602.16511
- 2026-03-17, VIGOR: VIdeo Geometry-Oriented Reward for Temporal Generative Alignment, Tengjiao Yin et.al., Paper: http://arxiv.org/abs/2603.16271
- 2026-03-25, VFIG: Vectorizing Complex Figures in SVG with Vision-Language Models, Qijia He et.al., Paper: http://arxiv.org/abs/2603.24575
- 2026-03-19, VEPO: Variable Entropy Policy Optimization for Low-Resource Language Foundation Models, Chonghan Liu et.al., Paper: http://arxiv.org/abs/2603.19152
- 2025-10-31, VCORE: Variance-Controlled Optimization-based Reweighting for Chain-of-Thought Supervision, Xuan Gong et.al., Paper: http://arxiv.org/abs/2510.27462
- 2026-01-05, VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation, Shikun Sun et.al., Paper: http://arxiv.org/abs/2601.02256
- 2025-12-22, VA- $π$ : Variational Policy Alignment for Pixel-Aware Autoregressive Generation, Xinyao Liao et.al., Paper: http://arxiv.org/abs/2512.19680
- 2025-11-06, V-Thinker: Interactive Thinking with Images, Runqi Qiao et.al., Paper: http://arxiv.org/abs/2511.04460
- 2026-02-05, V-Retrver: Evidence-Driven Agentic Reasoning for Universal Multimodal Retrieval, Dongyang Chen et.al., Paper: http://arxiv.org/abs/2602.06034
- 2025-12-11, V-OCBF: Learning Safety Filters from Offline Data via Value-Guided Offline Control Barrier Functions, Mumuksh Tayal et.al., Paper: http://arxiv.org/abs/2512.10822
- 2026-03-19, V-Dreamer: Automating Robotic Simulation and Trajectory Synthesis via Video Generation Priors, Songjia He et.al., Paper: http://arxiv.org/abs/2603.18811
- 2025-12-09, Using reinforcement learning to probe the role of feedback in skill acquisition, Antonio Terpin et.al., Paper: http://arxiv.org/abs/2512.08463
- 2026-03-16, Using an SU(3)/U(2) Wigner Function to Represent Noisy Spin Ensembles, Andrew Kolmer Forbes et.al., Paper: http://arxiv.org/abs/2603.15387
- 2025-10-22, Using Non-Expert Data to Robustify Imitation Learning via Offline Reinforcement Learning, Kevin Huang et.al., Paper: http://arxiv.org/abs/2510.19495
- 2026-01-15, Urban Socio-Semantic Segmentation with Vision-Language Reasoning, Yu Wang et.al., Paper: http://arxiv.org/abs/2601.10477
- 2025-12-31, Upscaling from ab initio atomistic simulations to electrode scale: The case of manganese hexacyanoferrate, a cathode material for Na-ion batteries, Yuan-Chi Yang et.al., Paper: http://arxiv.org/abs/2512.24816
- 2026-01-27, Unsupervised Learning of Efficient Exploration: Pre-training Adaptive Policies via Self-Imposed Goals, Octavio Pappalardo et.al., Paper: http://arxiv.org/abs/2601.19810
- 2026-01-30, Unsupervised Hierarchical Skill Discovery, Damion Harvey et.al., Paper: http://arxiv.org/abs/2601.23156
- 2025-11-05, Unraveling Deconfined Quantum Criticality in Non-Hermitian Easy-Plane $J$-$Q$ Model, Xuan Zou et.al., Paper: http://arxiv.org/abs/2511.03456
- 2025-11-10, Unlocking the Regression Space, Liudas Giraitis et.al., Paper: http://arxiv.org/abs/2511.07183
- 2026-01-30, Unlocking the Power of Orbital-Free Density Functional Theory to Explore the Electronic Structure Under Extreme Conditions, Cheng Ma et.al., Paper: http://arxiv.org/abs/2601.23002
- 2026-01-07, Universality in driven systems with a multiply-degenerate umbilic point, Johannes Schmidt et.al., Paper: http://arxiv.org/abs/2601.04116
- 2025-11-24, Universal scaling limits at the spectral singularity of structured random matrices, Markus Ebke et.al., Paper: http://arxiv.org/abs/2511.19308
- 2026-03-25, Universal Quantum Suppression in Frustrated Ising Magnets across the Quasi-1D to 2D Crossover via Quantum Annealing, Kumar Ghosh et.al., Paper: http://arxiv.org/abs/2603.24311
- 2025-10-22, Universal Quantitative Abstraction: Categorical Duality and Logical Completeness for Probabilistic Systems, Nivar Anwer et.al., Paper: http://arxiv.org/abs/2510.19444
- 2025-10-29, Universal Features of Chiral Symmetry Breaking in Large- $N$ QCD, Claudio Bonanno et.al., Paper: http://arxiv.org/abs/2510.25644
- 2026-01-09, Universal Dilation of Linear Itô SDEs: Quantum Trajectories and Lindblad Simulation of Second Moments, Hsuan-Cheng Wu et.al., Paper: http://arxiv.org/abs/2601.05928
- 2025-12-15, Universal Dexterous Functional Grasping via Demonstration-Editing Reinforcement Learning, Chuan Mao et.al., Paper: http://arxiv.org/abs/2512.13380
- 2026-01-08, Unitary fault-tolerant encoding of Pauli states in surface codes, Luis Colmenarez et.al., Paper: http://arxiv.org/abs/2601.05113
- 2026-03-17, Unifying Optimization and Dynamics to Parallelize Sequential Computation: A Guide to Parallel Newton Methods for Breaking Sequential Bottlenecks, Xavier Gonzalez et.al., Paper: http://arxiv.org/abs/2603.16850
- 2026-02-20, Unifying Formal Explanations: A Complexity-Theoretic Perspective, Shahaf Bassan et.al., Paper: http://arxiv.org/abs/2602.18160
- 2026-03-20, Uniform Maximum Projection Designs for Computer Experiments, Miroslav Vořechovský et.al., Paper: http://arxiv.org/abs/2603.19778
- 2025-10-24, Unified token representations for sequential decision models, Zhuojing Tian et.al., Paper: http://arxiv.org/abs/2510.21448
- 2026-01-06, Unified Thinker: A General Reasoning Modular Core for Image Generation, Sashuai Zhou et.al., Paper: http://arxiv.org/abs/2601.03127
- 2026-03-18, Unified Policy Value Decomposition for Rapid Adaptation, Cristiano Capone et.al., Paper: http://arxiv.org/abs/2603.17947
- 2026-02-02, Unified Personalized Reward Model for Vision Generation, Yibin Wang et.al., Paper: http://arxiv.org/abs/2602.02380
- 2025-11-10, Unified Humanoid Fall-Safety Policy from a Few Demonstrations, Zhengjie Xu et.al., Paper: http://arxiv.org/abs/2511.07407
- 2026-02-06, UnifSrv: AP Selection for Achieving Uniformly Good Performance of CF-MIMO in Realistic Urban Networks, Yunlu Xiao et.al., Paper: http://arxiv.org/abs/2602.06780
- 2025-10-20, UniRL-Zero: Reinforcement Learning on Unified Models with Joint Language Model and Diffusion Model Experts, Fu-Yun Wang et.al., Paper: http://arxiv.org/abs/2510.17937
- 2025-11-18, UniGen-1.5: Enhancing Image Generation and Editing through Reward Unification in Reinforcement Learning, Rui Tian et.al., Paper: http://arxiv.org/abs/2511.14760
- 2026-03-24, UniGRPO: Unified Policy Optimization for Reasoning-Driven Visual Generation, Jie Liu et.al., Paper: http://arxiv.org/abs/2603.23500
- 2025-12-12, UniBYD: A Unified Framework for Learning Robotic Manipulation Across Embodiments Beyond Imitation of Human Demonstrations, Tingyu Yuan et.al., Paper: http://arxiv.org/abs/2512.11609
- 2026-03-08, Underwater Embodied Intelligence for Autonomous Robots: A Constraint-Coupled Perspective on Planning, Control, and Deployment, Jingzehua Xu et.al., Paper: http://arxiv.org/abs/2603.07393
- 2026-03-09, Understanding thermal and quantum fluctuations in extended Kitaev-Yao-Lee spin-orbital model, Jiefu Cen et.al., Paper: http://arxiv.org/abs/2603.08710
- 2025-12-16, Understanding and Improving Hyperbolic Deep Reinforcement Learning, Timo Klein et.al., Paper: http://arxiv.org/abs/2512.14202
- 2026-02-03, Understanding and Exploiting Weight Update Sparsity for Communication-Efficient Distributed RL, Erfan Miahi et.al., Paper: http://arxiv.org/abs/2602.03839
- 2025-12-08, Understanding Individual Decision-Making in Multi-Agent Reinforcement Learning: A Dynamical Systems Approach, James Rudd-Jones et.al., Paper: http://arxiv.org/abs/2512.07588
- 2026-03-11, Uncovering statistical structure in large-scale neural activity with Restricted Boltzmann Machines, Nicolas Béreux et.al., Paper: http://arxiv.org/abs/2603.11032
- 2025-10-21, Uncovering critical temperature dependence in Heusler magnets via explicit machine learning, Jean-Baptiste Morée et.al., Paper: http://arxiv.org/abs/2510.18469
- 2026-02-23, Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning, Thanh Nguyen et.al., Paper: http://arxiv.org/abs/2602.19917
- 2026-02-08, Uncertainty-Aware Counterfactual Traffic Signal Control with Predictive Safety and Starvation-Avoidance Constraints Using Vision-Based Sensing, Jayawant Bodagala et.al., Paper: http://arxiv.org/abs/2602.07784
- 2026-03-16, Unbiased and Biased Variance-Reduced Forward-Reflected-Backward Splitting Methods for Stochastic Composite Inclusions, Quoc Tran-Dinh et.al., Paper: http://arxiv.org/abs/2603.15576
- 2026-04-02, Ultrafast Ionization Dynamics Encoded in a Photoelectron Spin Torus, Xiaodan Mao et.al., Paper: http://arxiv.org/abs/2604.02062
- 2026-01-06, UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward, Yile Liu et.al., Paper: http://arxiv.org/abs/2601.03205
- 2025-10-20, UltraCUA: A Foundation Model for Computer Use Agents with Hybrid Action, Yuhao Yang et.al., Paper: http://arxiv.org/abs/2510.17790
- 2026-03-07, Ultra-Sharp Upright Photon Radiotherapy via Low Energy Extended Distance: An Alternative to FLASH for high flux Sources, Lloyd E Kamole Ghomsi et.al., Paper: http://arxiv.org/abs/2603.07103
- 2025-11-17, Ultra Low Overhead Syndrome Extraction for the Steane code, Boldizsár Poór et.al., Paper: http://arxiv.org/abs/2511.13700
- 2025-12-05, USV: Unified Sparsification for Accelerating Video Diffusion Models, Xinjian Wu et.al., Paper: http://arxiv.org/abs/2512.05754
- 2026-03-26, UMBRELLA: Uncertainty-aware Multi-robot Reactive Coordination under Dynamic Temporal Logic Tasks, Qisheng Zhao et.al., Paper: http://arxiv.org/abs/2603.25395
- 2026-03-03, ULTRA: Unified Multimodal Control for Autonomous Humanoid Whole-Body Loco-Manipulation, Xialin He et.al., Paper: http://arxiv.org/abs/2603.03279
- 2026-02-05, UI-Mem: Self-Evolving Experience Memory for Online Reinforcement Learning in Mobile GUI Agents, Han Xiao et.al., Paper: http://arxiv.org/abs/2602.05832
- 2025-11-10, UAV-Assisted Resilience in 6G and Beyond Network Energy Saving: A Multi-Agent DRL Approach, Dao Lan Vy Dinh et.al., Paper: http://arxiv.org/abs/2511.07366
- 2026-03-17, Typical models of the distribution system restoration process, Arslan Ahmad et.al., Paper: http://arxiv.org/abs/2603.16841
- 2025-10-21, Two-loop QCD corrections for real and off-shell diphoton and triphoton production via quark loops, Dario Kermanschah et.al., Paper: http://arxiv.org/abs/2510.18801
- 2025-10-30, Two-Timescale Optimization Framework for IAB-Enabled Heterogeneous UAV Networks, Jikang Deng et.al., Paper: http://arxiv.org/abs/2510.26578
- 2026-03-18, Two stroke Pumping Technique for Many-Body Systems, Serge Galam et.al., Paper: http://arxiv.org/abs/2603.17873
- 2025-11-26, Two behavioural pseudometrics for continuous-time Markov processes, Linan Chen et.al., Paper: http://arxiv.org/abs/2511.21621
- 2025-11-14, Two Useful Facts About Generating Functions, Alex Kasman et.al., Paper: http://arxiv.org/abs/2511.11559
- 2026-02-09, Two Robust Interstellar Meteor Candidates in the Post-2018 CNEOS Fireball Database, Richard Cloete et.al., Paper: http://arxiv.org/abs/2602.08956
- 2025-12-05, Twisting the Hagedorn temperature in planar $\mathcal{N}=4$ super Yang-Mills, Simon Ekhammar et.al., Paper: http://arxiv.org/abs/2512.05810
- 2026-02-09, TwinRL-VLA: Digital Twin-Driven Reinforcement Learning for Real-World Robotic Manipulation, Qinwen Xu et.al., Paper: http://arxiv.org/abs/2602.09023
- 2026-03-25, Tutor-Student Reinforcement Learning: A Dynamic Curriculum for Robust Deepfake Detection, Zhanhe Lei et.al., Paper: http://arxiv.org/abs/2603.24139
- 2026-03-19, Tuning polymer architecture for quasicrystal self-assembly, D. J. Ratliff et.al., Paper: http://arxiv.org/abs/2603.18694
- 2026-03-23, Tuning Real-World Image Restoration at Inference: A Test-Time Scaling Paradigm for Flow Matching Models, Purui Bai et.al., Paper: http://arxiv.org/abs/2603.22027
- 2026-01-26, Trustworthy Evaluation of Robotic Manipulation: A New Benchmark and AutoEval Methods, Mengyuan Liu et.al., Paper: http://arxiv.org/abs/2601.18723
- 2025-12-19, Trust-Region Adaptive Policy Optimization, Mingyu Su et.al., Paper: http://arxiv.org/abs/2512.17636
- 2026-01-26, Trust, Don’t Trust, or Flip: Robust Preference-Based Reinforcement Learning with Multi-Expert Feedback, Seyed Amir Hosseini et.al., Paper: http://arxiv.org/abs/2601.18751
- 2026-03-12, Trust Your Critic: Robust Reward Modeling and Reinforcement Learning for Faithful Image Editing and Generation, Xiangyu Zhao et.al., Paper: http://arxiv.org/abs/2603.12247
- 2026-03-20, Triple/Double-Debiased Lasso, Denis Chetverikov et.al., Paper: http://arxiv.org/abs/2603.20134
- 2025-12-09, Triangular $J_1$-$J_2$ Heisenberg Antiferromagnet in a Magnetic Field, Thomas Bader et.al., Paper: http://arxiv.org/abs/2512.08768
- 2025-12-08, Trapped Fermions Through Kolmogorov-Arnold Wavefunctions, Paulo F. Bedaque et.al., Paper: http://arxiv.org/abs/2512.07800
- 2023-01-04, Transformer in Transformer as Backbone for Deep Reinforcement Learning, Hangyu Mao et.al., Paper: http://arxiv.org/abs/2212.14538
- 2025-12-12, Transfer Learning (Il)liquidity, Andrea Conti et.al., Paper: http://arxiv.org/abs/2512.11731
- 2026-03-16, Trajectory-Diversity-Driven Robust Vision-and-Language Navigation, Jiangyang Li et.al., Paper: http://arxiv.org/abs/2603.15370
- 2026-01-02, Training-Free Certified Bounds for Quantum Regression: A Scalable Framework, Demerson N. Gonçalves et.al., Paper: http://arxiv.org/abs/2601.00745
- 2025-12-03, Training for Identity, Inference for Controllability: A Unified Approach to Tuning-Free Face Personalization, Lianyu Pang et.al., Paper: http://arxiv.org/abs/2512.03964
- 2026-01-28, Training Reasoning Models on Saturated Problems via Failure-Prefix Conditioning, Minwu Kim et.al., Paper: http://arxiv.org/abs/2601.20829
- 2025-12-10, Training One Model to Master Cross-Level Agentic Actions via Reinforcement Learning, Kaichen He et.al., Paper: http://arxiv.org/abs/2512.09706
- 2026-02-03, Training Multi-Turn Search Agent via Contrastive Dynamic Branch Sampling, Yubao Zhao et.al., Paper: http://arxiv.org/abs/2602.03719
- 2026-02-02, Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability, Xiao Liang et.al., Paper: http://arxiv.org/abs/2602.02477
- 2026-03-18, Training Diffusion Language Models for Black-Box Optimization, Zipeng Sun et.al., Paper: http://arxiv.org/abs/2603.17919
- 2025-12-29, Training AI Co-Scientists Using Rubric Rewards, Shashwat Goel et.al., Paper: http://arxiv.org/abs/2512.23707
- 2025-10-20, Train for Truth, Keep the Skills: Binary Retrieval-Augmented Reward Mitigates Hallucinations, Tong Chen et.al., Paper: http://arxiv.org/abs/2510.17733
- 2026-01-02, Traffic-Aware Optimal Taxi Placement Using Graph Neural Network-Based Reinforcement Learning, Sonia Khetarpaul et.al., Paper: http://arxiv.org/abs/2601.00607
- 2026-03-14, Traffic and weather driven hybrid digital twin for bridge monitoring, Phani Raja Bharath Balijepalli et.al., Paper: http://arxiv.org/abs/2603.14028
- 2025-10-20, Trading with the Devil: Risk and Return in Foundation Model Strategies, Jinrui Zhang et.al., Paper: http://arxiv.org/abs/2510.17165
- 2026-01-07, Trade-R1: Bridging Verifiable Rewards to Stochastic Environments via Process-Level Reasoning Verification, Rui Sun et.al., Paper: http://arxiv.org/abs/2601.03948
- 2026-02-26, Tracking the Lithiation State of Li $_x$ Si from Machine-Learned XPS Binding Energies, Michael Alejandro Hernandez Bertran et.al., Paper: http://arxiv.org/abs/2602.23028
- 2026-01-27, Tracking Drift: Variation-Aware Entropy Scheduling for Non-Stationary Reinforcement Learning, Tongxi Wang et.al., Paper: http://arxiv.org/abs/2601.19624
- 2026-01-09, TowerMind: A Tower Defence Game Learning Environment and Benchmark for LLM as Agents, Dawei Wang et.al., Paper: http://arxiv.org/abs/2601.05899
- 2025-01-14, Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning, Lanqing Li et.al., Paper: http://arxiv.org/abs/2402.02429
- 2025-12-05, Towards agent-based-model informed neural networks, Nino Antulov-Fantulin et.al., Paper: http://arxiv.org/abs/2512.05764
- 2025-12-04, Towards a unified framework for guided diffusion models, Yuchen Jiao et.al., Paper: http://arxiv.org/abs/2512.04985
- 2025-11-18, Towards a Unified Analysis of Neural Networks in Nonparametric Instrumental Variable Regression: Optimization and Generalization, Zonghao Chen et.al., Paper: http://arxiv.org/abs/2511.14710
- 2026-02-06, Towards a Fully Automated Pipeline for Short-Term Forecasting of In Situ Coronal Mass Ejection Magnetic Field Structure, Hannah T. Rüdisser et.al., Paper: http://arxiv.org/abs/2602.06926
- 2026-03-10, Towards Unified Multimodal Interleaved Generation via Group Relative Policy Optimization, Ming Nie et.al., Paper: http://arxiv.org/abs/2603.09538
- 2025-12-12, Towards Trustworthy Multi-Turn LLM Agents via Behavioral Guidance, Gonca Gürsun et.al., Paper: http://arxiv.org/abs/2512.11421
- 2025-10-27, Towards Stochastic (N-1)-Secure Redispatch, Oleksii Molodchyk et.al., Paper: http://arxiv.org/abs/2510.23551
- 2026-03-25, Towards Reward Modeling for AI Tutors in Math Mistake Remediation, Kseniia Petukhova et.al., Paper: http://arxiv.org/abs/2603.24375
- 2025-10-28, Towards Quadrupedal Jumping and Walking for Dynamic Locomotion using Reinforcement Learning, Jørgen Anker Olsen et.al., Paper: http://arxiv.org/abs/2510.24584
- 2025-10-20, Towards Optimal Control and Algorithmic Structure of Decompression Schedules, Benjamin Marsh et.al., Paper: http://arxiv.org/abs/2510.17551
- 2026-02-12, Towards On-Policy SFT: Distribution Discriminant Theory and its Applications in LLM Training, Miaosen Zhang et.al., Paper: http://arxiv.org/abs/2602.12222
- 2025-11-17, Towards Multimodal Representation Learning in Paediatric Kidney Disease, Ana Durica et.al., Paper: http://arxiv.org/abs/2511.13637
- 2026-02-26, Towards Intelligible Human-Robot Interaction: An Active Inference Approach to Occluded Pedestrian Scenarios, Kai Chen et.al., Paper: http://arxiv.org/abs/2602.23109
- 2026-03-12, Towards High-Fidelity CAD Generation via LLM-Driven Program Generation and Text-Based B-Rep Primitive Grounding, Jiahao Li et.al., Paper: http://arxiv.org/abs/2603.11831
- 2025-11-05, Towards Formalizing Reinforcement Learning Theory, Shangtong Zhang et.al., Paper: http://arxiv.org/abs/2511.03618
- 2025-10-21, Towards Faithful and Controllable Personalization via Critique-Post-Edit Reinforcement Learning, Chenghao Zhu et.al., Paper: http://arxiv.org/abs/2510.18849
- 2025-11-13, Towards Emotionally Intelligent and Responsible Reinforcement Learning, Garapati Keerthana et.al., Paper: http://arxiv.org/abs/2511.10573
- 2026-03-26, Towards Embodied AI with MuscleMimic: Unlocking full-body musculoskeletal motor learning at scale, Chengkun Li et.al., Paper: http://arxiv.org/abs/2603.25544
- 2025-11-28, Towards Continuous Intelligence Growth: Self-Training, Continual Learning, and Dual-Scale Memory in SuperIntelliAgent, Jianzhe Lin et.al., Paper: http://arxiv.org/abs/2511.23436
- 2026-03-11, Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis, Yujie Zheng et.al., Paper: http://arxiv.org/abs/2603.10846
- 2026-03-09, Towards Batch-to-Streaming Deep Reinforcement Learning for Continuous Control, Riccardo De Monte et.al., Paper: http://arxiv.org/abs/2603.08588
- 2025-12-09, Toward Quantitative Modeling of Cybersecurity Risks Due to AI Misuse, Steve Barrett et.al., Paper: http://arxiv.org/abs/2512.08864
- 2025-12-05, Toward Efficient and Robust Behavior Models for Multi-Agent Driving Simulation, Fabian Konstantinidis et.al., Paper: http://arxiv.org/abs/2512.05812
- 2026-01-20, Toward Efficient Agents: Memory, Tool learning, and Planning, Xiaofang Yang et.al., Paper: http://arxiv.org/abs/2601.14192
- 2025-10-20, Toward Autonomous Neural VMC: An Energy-Variance Convergence Criterion for Quantum Systems, Huan-Chen Shi et.al., Paper: http://arxiv.org/abs/2510.17490
- 2025-12-01, Topological Order in Deep State, Ahmed Abouelkomsan et.al., Paper: http://arxiv.org/abs/2512.01863
- 2026-03-12, Topological Enhancement of Protein Kinetic Stability, João NC Especial et.al., Paper: http://arxiv.org/abs/2603.12053
- 2026-01-30, TopoLS: Lattice Surgery Compilation via Topological Program Transformations, Junyu Zhou et.al., Paper: http://arxiv.org/abs/2601.23109
- 2026-03-13, Topo-R1: Detecting Topological Anomalies via Vision-Language Models, Meilong Xu et.al., Paper: http://arxiv.org/abs/2603.13054
- 2025-12-23, Topic-informed dynamic mixture model for occupational heterogeneity in health risk behaviors, Lorenzo Schiavon et.al., Paper: http://arxiv.org/abs/2512.20408
- 2025-11-26, ToolOrchestra: Elevating Intelligence via Efficient Model and Tool Orchestration, Hongjin Su et.al., Paper: http://arxiv.org/abs/2511.21689
- 2026-03-02, Tool Verification for Test-Time Reinforcement Learning, Ruotong Liao et.al., Paper: http://arxiv.org/abs/2603.02203
- 2026-01-08, Token-Level LLM Collaboration via FusionRoute, Nuoya Xiong et.al., Paper: http://arxiv.org/abs/2601.05106
- 2026-01-29, Token-Guard: Towards Token-Level Hallucination Control via Self-Checking Decoding, Yifan Zhu et.al., Paper: http://arxiv.org/abs/2601.21969
- 2026-02-03, TodyComm: Task-Oriented Dynamic Communication for Multi-Round LLM-based Multi-Agent System, Wenzhe Fan et.al., Paper: http://arxiv.org/abs/2602.03688
- 2026-02-08, TodoEvolve: Learning to Architect Agent Planning Systems, Jiaxi Liu et.al., Paper: http://arxiv.org/abs/2602.07839
- 2026-01-12, To report or not to report: Optimal claim reporting in a bonus-malus system, Lea Enzi et.al., Paper: http://arxiv.org/abs/2601.07655
- 2026-03-07, To Predict or Not to Predict? Towards reliable uncertainty estimation in the presence of noise, Nouran Khallaf et.al., Paper: http://arxiv.org/abs/2603.07330
- 2025-11-07, TimeSearch-R: Adaptive Temporal Search for Long-Form Video Understanding via Self-Verification Reinforcement Learning, Junwen Pan et.al., Paper: http://arxiv.org/abs/2511.05489
- 2025-12-16, TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs, Jun Zhang et.al., Paper: http://arxiv.org/abs/2512.14698
- 2026-02-05, Time lags and their association with the Boundary Layer structure in a Z source GX 349+2, Abhishek M. V. R. et.al., Paper: http://arxiv.org/abs/2602.05989
- 2026-02-20, Time consistent portfolio strategies for a general utility function, Oumar Mbodji et.al., Paper: http://arxiv.org/abs/2602.18157
- 2026-02-08, Time Series Reasoning via Process-Verifiable Thinking Data Synthesis and Scheduling for Tailored LLM Reasoning, Jiahui Zhou et.al., Paper: http://arxiv.org/abs/2602.07830
- 2025-11-24, Tilings of a bounded region of the plane by maximal one-dimensional tiles, Eduardo J. Aguilar et.al., Paper: http://arxiv.org/abs/2511.19288
- 2026-03-03, TikZilla: Scaling Text-to-TikZ with High-Quality Data and Reinforcement Learning, Christian Greisinger et.al., Paper: http://arxiv.org/abs/2603.03072
- 2026-02-17, Tight Communication Bounds for Distributed Algorithms in the Quantum Routing Model, Fabien Dufoulon et.al., Paper: http://arxiv.org/abs/2602.15529
- 2025-12-01, Tight Bounds for Feedback Vertex Set Parameterized by Clique-width, Narek Bojikian et.al., Paper: http://arxiv.org/abs/2512.01900
- 2025-11-17, TiViBench: Benchmarking Think-in-Video Reasoning for Video Generative Models, Harold Haodong Chen et.al., Paper: http://arxiv.org/abs/2511.13704
- 2026-03-23, TiCo: Time-Controllable Training for Spoken Dialogue Models, Kai-Wei Chang et.al., Paper: http://arxiv.org/abs/2603.22267
- 2025-12-31, Throughput Optimization in UAV-Mounted RIS under Jittering and Imperfect CSI via DRL, Anas K. Saeed et.al., Paper: http://arxiv.org/abs/2512.24773
- 2025-10-24, Three-nucleon lepton-number-violating potentials in chiral EFT and their matrix elements in light nuclei, Graham Chambers-Wall et.al., Paper: http://arxiv.org/abs/2510.21564
- 2025-12-05, Three-loop jet function for boosted top quarks, Alberto M. Clavero et.al., Paper: http://arxiv.org/abs/2512.05795
- 2026-01-22, Three’s a crowd: Identification challenges in the triple difference model with spillover effects, Silvia De Nicolò et.al., Paper: http://arxiv.org/abs/2601.15764
- 2025-12-12, Three methods, one problem: Classical and AI approaches to no-three-in-line, Pranav Ramanathan et.al., Paper: http://arxiv.org/abs/2512.11469
- 2025-11-20, Thinking-while-Generating: Interleaving Textual Reasoning throughout Visual Generation, Ziyu Guo et.al., Paper: http://arxiv.org/abs/2511.16671
- 2025-12-09, Thinking with Images via Self-Calling Agent, Wenxi Yang et.al., Paper: http://arxiv.org/abs/2512.08511
- 2026-01-07, Thinking with Frames: Generative Video Distortion Evaluation via Frame Reward Model, Yuan Wang et.al., Paper: http://arxiv.org/abs/2601.04033
- 2026-03-19, Thinking with Constructions: A Benchmark and Policy Optimization for Visual-Text Interleaved Geometric Reasoning, Haokun Zhao et.al., Paper: http://arxiv.org/abs/2603.18662
- 2026-01-05, Thinking with Blueprints: Assisting Vision-Language Models in Spatial Reasoning via Structured Object Representation, Weijian Ma et.al., Paper: http://arxiv.org/abs/2601.01984
- 2026-03-13, Thinking in Streaming Video, Zikang Liu et.al., Paper: http://arxiv.org/abs/2603.12938
- 2025-11-28, Thinking by Doing: Building Efficient World Model Reasoning in LLMs via Multi-turn Interaction, Bao Shu et.al., Paper: http://arxiv.org/abs/2511.23476
- 2025-12-29, ThinkGen: Generalized Thinking for Visual Generation, Siyu Jiao et.al., Paper: http://arxiv.org/abs/2512.23568
- 2025-10-27, Think Twice: Branch-and-Rethink Reasoning Reward Model, Yizhu Jiao et.al., Paper: http://arxiv.org/abs/2510.23596
- 2025-11-28, ThetaEvolve: Test-time Learning on Open Problems, Yiping Wang et.al., Paper: http://arxiv.org/abs/2511.23473
- 2025-11-17, Thermodynamics of the Fermi-Hubbard Model through Stochastic Calculus and Girsanov Transformation, Detlef Lehmann et.al., Paper: http://arxiv.org/abs/2511.13581
- 2025-12-24, Thermodynamic sampling of materials using neutral-atom quantum computers, Bruno Camino et.al., Paper: http://arxiv.org/abs/2512.21142
- 2026-01-30, Theory of Little-Parks oscillations by vortices in two-dimensional superconductors, Ying-Ming Xie et.al., Paper: http://arxiv.org/abs/2601.23050
- 2026-02-27, Theoretical Studies of alpha Clustering in Nuclei and Beyond, Takaharu Otsuka et.al., Paper: http://arxiv.org/abs/2602.24175
- 2025-11-19, Theoretical Closed-loop Stability Bounds for Dynamical System Coupled with Diffusion Policies, Gabriel Lauzier et.al., Paper: http://arxiv.org/abs/2511.15520
- 2025-12-10, Theoretical Characterization of the Magnetic Properties of Vanadium-doped Ti2C MXenes, Carlos Patiño et.al., Paper: http://arxiv.org/abs/2512.09657
- 2025-11-04, The two-dimensional optical Su-Schrieffer-Heeger model: ground state and thermodynamic properties, Jadson L. Portela e Silva et.al., Paper: http://arxiv.org/abs/2511.02707
- 2026-03-12, The price of decentralization in managing engineering systems through multi-agent reinforcement learning, Prateek Bhustali et.al., Paper: http://arxiv.org/abs/2603.11884
- 2025-10-24, The population of Galactic young massive star clusters in the TeV range, Rowan Batzofin et.al., Paper: http://arxiv.org/abs/2510.21480
- 2025-11-17, The physical properties of post-mass-transfer binaries, Rhys Seeburger et.al., Paper: http://arxiv.org/abs/2511.13692
- 2025-11-05, The moment is here: a generalised class of estimators for fuzzy regression discontinuity designs, Stuart Lane et.al., Paper: http://arxiv.org/abs/2511.03424
- 2026-02-23, The interplay of cation/anion and monovalent/divalent selectivity in negatively charged nanopores: local charge inversion and anion leakage, Eszter Lakics et.al., Paper: http://arxiv.org/abs/2602.19992
- 2025-10-21, The implications of inflation for the last ACT, Zhi-Chong Qiu et.al., Paper: http://arxiv.org/abs/2510.18320
- 2026-03-10, The framework to unify all complexity dichotomy theorems for Boolean tensor networks, Mingji Xia et.al., Paper: http://arxiv.org/abs/2603.09417
- 2026-02-16, The empirical distribution of sequential LS factors in Multi-level Dynamic Factor Models, Gian Pietro Bellocca et.al., Paper: http://arxiv.org/abs/2602.14813
- 2026-01-20, The Transparency Paradox in Explainable AI: A Theory of Autonomy Depletion Through Cognitive Load, Ancuta Margondai et.al., Paper: http://arxiv.org/abs/2601.13973
- 2026-01-23, The Trajectory Alignment Coefficient in Two Acts: From Reward Tuning to Reward Learning, Calarina Muslimani et.al., Paper: http://arxiv.org/abs/2601.16906
- 2026-03-26, The Symmetric Perceptron: a Teacher-Student Scenario, Giovanni Catania et.al., Paper: http://arxiv.org/abs/2603.25440
- 2025-10-23, The Shape of Reasoning: Topological Analysis of Reasoning Traces in Large Language Models, Xue Wen Tan et.al., Paper: http://arxiv.org/abs/2510.20665
- 2026-01-05, The Sequential Monte Carlo goes NUTS: Boosting Gravitational-Wave Inference, Gabriele Demasi et.al., Paper: http://arxiv.org/abs/2601.02336
- 2026-03-26, The Reward Function and the Least Cost Principle for Gravitation and other Laws of Physics, Rubén Moreno-Bote et.al., Paper: http://arxiv.org/abs/2603.25444
- 2026-02-13, The Refractive Index of Gallium Antimonide, Ulrich Galander et.al., Paper: http://arxiv.org/abs/2602.12862
- 2025-10-21, The Picard-Lagrange Framework for Higher-Order Langevin Monte Carlo, Jaideep Mahajan et.al., Paper: http://arxiv.org/abs/2510.18242
- 2025-11-06, The Peril of Preference: Why GRPO fails on Ordinal Rewards, Anisha Garg et.al., Paper: http://arxiv.org/abs/2511.04439
- 2025-10-30, The Oversight Game: Learning to Cooperatively Balance an AI Agent’s Safety and Autonomy, William Overman et.al., Paper: http://arxiv.org/abs/2510.26752
- 2026-02-17, The Obfuscation Atlas: Mapping Where Honesty Emerges in RLVR with Deception Probes, Mohammad Taufeeque et.al., Paper: http://arxiv.org/abs/2602.15515
- 2026-01-16, The Mini Wheelbot Dataset: High-Fidelity Data for Robot Learning, Henrik Hose et.al., Paper: http://arxiv.org/abs/2601.11394
- 2025-11-14, The Maximal Variance of Unilaterally Truncated Gaussian and Chi Distributions, Robert J. Petrella et.al., Paper: http://arxiv.org/abs/2511.11566
- 2025-10-20, The Marked Edge Walk: A Novel MCMC Algorithm for Sampling of Graph Partitions, Atticus McWhorter et.al., Paper: http://arxiv.org/abs/2510.17714
- 2026-02-07, The Laplacian Keyboard: Beyond the Linear Span, Siddarth Chandrasekar et.al., Paper: http://arxiv.org/abs/2602.07730
- 2026-03-22, The Intelligent Disobedience Game: Formulating Disobedience in Stackelberg Games and Markov Decision Processes, Benedikt Hornig et.al., Paper: http://arxiv.org/abs/2603.20994
- 2025-11-19, The Impact of Quantization on Large Reasoning Model Reinforcement Learning, Medha Kumar et.al., Paper: http://arxiv.org/abs/2511.15694
- 2026-03-31, The Hollyfeld Gambit in Astrophysics, Benne Holwerda et.al., Paper: http://arxiv.org/abs/2603.29964
- 2026-01-09, The Hadronization Impact on $J/ψ$ Energy Correlators: A Pythia8 Study from Partonic to Hadronic Observables, Jin-peng Zhang et.al., Paper: http://arxiv.org/abs/2601.05658
- 2026-01-16, The Great March 100: 100 Detail-oriented Tasks for Evaluating Embodied AI Agents, Ziyu Wang et.al., Paper: http://arxiv.org/abs/2601.11421
- 2025-12-04, The Geometry of Intelligence: Deterministic Functional Topology as a Foundation for Real-World Perception, Eduardo Di Santi et.al., Paper: http://arxiv.org/abs/2512.05089
- 2026-01-21, The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models, Zanlin Ni et.al., Paper: http://arxiv.org/abs/2601.15165
- 2026-02-17, The First Instrumentally Documented Fall of an Iron Meteorite: atmospheric trajectory and ground impact, Jarmo Moilanen et.al., Paper: http://arxiv.org/abs/2602.15440
- 2026-03-03, The Extended Real Line with Reentry: A Compact Quotient Space Separating US from KC, Damian Rafael Lattenero et.al., Paper: http://arxiv.org/abs/2603.03228
- 2025-10-30, The Era of Agentic Organization: Learning to Organize with Language Models, Zewen Chi et.al., Paper: http://arxiv.org/abs/2510.26658
- 2026-02-08, The Development of a Preclinical Alpha Irradiation Platform with Versatile Control of Dose, Dose Rate, and Spatiotemporal Irradiation Patterns, Harsh Arya et.al., Paper: http://arxiv.org/abs/2602.07760
- 2025-12-02, The Convex Matching Distance in Multiparameter Persistence, Patrizio Frosini et.al., Paper: http://arxiv.org/abs/2512.02944
- 2025-10-22, The Confusing Instance Principle for Online Linear Quadratic Control, Waris Radji et.al., Paper: http://arxiv.org/abs/2510.19531
- 2026-04-02, The Bures metric and the quantum metric on the density space of a C*-algebra: the non-unital case, Konrad Aguilar et.al., Paper: http://arxiv.org/abs/2604.02117
- 2025-10-27, The Best of N Worlds: Aligning Reinforcement Learning with Best-of-N Sampling via max@k Optimisation, Farid Bagirov et.al., Paper: http://arxiv.org/abs/2510.23393
- 2026-02-24, The Art of Efficient Reasoning: Data, Reward, and Optimization, Taiqiang Wu et.al., Paper: http://arxiv.org/abs/2602.20945
- 2025-12-08, The Agent Capability Problem: Predicting Solvability Through Information-Theoretic Bounds, Shahar Lutati et.al., Paper: http://arxiv.org/abs/2512.07631
- 2025-11-13, The $L_p$ -error rate for randomized quasi-Monte Carlo self-normalized importance sampling of unbounded integrands, Jiarui Du et.al., Paper: http://arxiv.org/abs/2511.10599
- 2025-11-28, The $L$-test: Increasing the Linear Model $F$ -test’s Power Under Sparsity Without Sacrificing Validity, Danielle Paulson et.al., Paper: http://arxiv.org/abs/2511.23466
- 2026-01-08, Text as a Universal Interface for Transferable Personalization, Yuting Liu et.al., Paper: http://arxiv.org/abs/2601.04963
- 2025-10-30, Tests of exogeneity in duration models with censored data, Gilles Crommen et.al., Paper: http://arxiv.org/abs/2510.26613
- 2026-03-13, Test-time RL alignment exposes task familiarity artifacts in LLM benchmarks, Kun Wang et.al., Paper: http://arxiv.org/abs/2603.12875
- 2026-01-13, TerraFormer: Automated Infrastructure-as-Code with LLMs Fine-Tuned via Policy-Guided Verifier Feedback, Prithwish Jana et.al., Paper: http://arxiv.org/abs/2601.08734
- 2026-01-14, Terminally constrained flow-based generative models from an optimal control perspective, Weiguo Gao et.al., Paper: http://arxiv.org/abs/2601.09474
- 2025-11-05, Tensor-Efficient High-Dimensional Q-learning, Junyi Wu et.al., Paper: http://arxiv.org/abs/2511.03595
- 2026-01-28, Tensor renormalization group study of cold and dense QCD in the strong coupling limit, Yuto Sugimoto et.al., Paper: http://arxiv.org/abs/2601.20690
- 2026-03-04, Tendon Force Modeling for Sim2Real Transfer of Reinforcement Learning Policies for Tendon-Driven Robots, Valentin Yuryev et.al., Paper: http://arxiv.org/abs/2603.04351
- 2026-01-30, Temporally Coherent Imitation Learning via Latent Action Flow Matching for Robotic Manipulation, Wu Songwei et.al., Paper: http://arxiv.org/abs/2601.23087
- 2026-03-02, Temporal Representations for Exploration: Learning Complex Exploratory Behavior without Extrinsic Rewards, Faisal Mohamed et.al., Paper: http://arxiv.org/abs/2603.02008
- 2025-11-06, Temporal Action Selection for Action Chunking, Yueyang Weng et.al., Paper: http://arxiv.org/abs/2511.04421
- 2026-02-20, TempoNet: Slack-Quantized Transformer-Guided Reinforcement Scheduler for Adaptive Deadline-Centric Real-Time Dispatchs, Rong Fu et.al., Paper: http://arxiv.org/abs/2602.18109
- 2026-02-25, Tempered Christoffel-Weighted Polynomial Chaos Expansion for Resilience-Oriented Uncertainty Quantification, Mahsa Ebadat-Parast et.al., Paper: http://arxiv.org/abs/2602.22133
- 2025-12-03, TempR1: Improving Temporal Understanding of MLLMs via Temporal-Aware Multi-Task Reinforcement Learning, Tao Wu et.al., Paper: http://arxiv.org/abs/2512.03963
- 2026-01-26, Teaching Models to Teach Themselves: Reasoning at the Edge of Learnability, Shobhita Sundaram et.al., Paper: http://arxiv.org/abs/2601.18778
- 2025-11-07, TeaRAG: A Token-Efficient Agentic Retrieval-Augmented Generation Framework, Chao Zhang et.al., Paper: http://arxiv.org/abs/2511.05385
- 2025-12-05, Taylor Approximation Variance Reduction for Approximation Errors in PDE-constrained Bayesian Inverse Problems, Ruanui Nicholson et.al., Paper: http://arxiv.org/abs/2512.05723
- 2026-03-04, TaxonRL: Reinforcement Learning with Intermediate Rewards for Interpretable Fine-Grained Visual Reasoning, Maximilian von Klinski et.al., Paper: http://arxiv.org/abs/2603.04380
- 2026-02-24, Task-oriented grasping for dexterous robots using postural synergies and reinforcement learning, Dimitrios Dimou et.al., Paper: http://arxiv.org/abs/2602.20915
- 2026-03-31, Target-Aligned Reinforcement Learning, Leonard S. Pleiss et.al., Paper: http://arxiv.org/abs/2603.29501
- 2025-11-20, Taming the Long-Tail: Efficient Reasoning RL Training with Adaptive Drafter, Qinghao Hu et.al., Paper: http://arxiv.org/abs/2511.16665
- 2026-03-12, Taming the Adversary: Stable Minimax Deep Deterministic Policy Gradient via Fractional Objectives, Taeho Lee et.al., Paper: http://arxiv.org/abs/2603.12110
- 2025-12-02, Taming Camera-Controlled Video Generation with Verifiable Geometry Reward, Zhaoqing Wang et.al., Paper: http://arxiv.org/abs/2512.02870
- 2026-01-05, Talk2Move: Reinforcement Learning for Text-Instructed Object-Level Geometric Transformation in Scenes, Jing Tan et.al., Paper: http://arxiv.org/abs/2601.02356
- 2026-01-20, Tail-Aware Density Forecasting of Locally Explosive Time Series: A Neural Network Approach, Elena Dumitrescu et.al., Paper: http://arxiv.org/abs/2601.14049
- 2025-12-10, Tachyonic dark energy- Constraints from current observations, Ramanpreet Singh et.al., Paper: http://arxiv.org/abs/2512.09744
- 2025-12-23, TableGPT-R1: Advancing Tabular Reasoning Through Reinforcement Learning, Saisai Yang et.al., Paper: http://arxiv.org/abs/2512.20312
- 2025-10-20, TabR1: Taming GRPO for tabular reasoning LLMs, Pengxiang Cai et.al., Paper: http://arxiv.org/abs/2510.17385
- 2026-04-01, TTA-Vid: Generalized Test-Time Adaptation for Video Reasoning, Soumya Shamarao Jahagirdar et.al., Paper: http://arxiv.org/abs/2604.00696
- 2025-11-17, TSE-Net: Semi-supervised Monocular Height Estimation from Single Remote Sensing Images, Sining Chen et.al., Paper: http://arxiv.org/abs/2511.13552
- 2026-03-17, TRUST-SQL: Tool-Integrated Multi-Turn Reinforcement Learning for Text-to-SQL over Unknown Schemas, Ai Jian et.al., Paper: http://arxiv.org/abs/2603.16448
- 2026-03-23, TREX: Trajectory Explanations for Multi-Objective Reinforcement Learning, Dilina Rajapakse et.al., Paper: http://arxiv.org/abs/2603.21988
- 2026-02-03, TRE: Encouraging Exploration in the Trust Region, Chao Huang et.al., Paper: http://arxiv.org/abs/2602.03635
- 2026-02-02, TIC-VLA: A Think-in-Control Vision-Language-Action Model for Robot Navigation in Dynamic Environments, Zhiyu Huang et.al., Paper: http://arxiv.org/abs/2602.02459
- 2026-01-30, THINKSAFE: Self-Generated Safety Alignment for Reasoning Models, Seanie Lee et.al., Paper: http://arxiv.org/abs/2601.23143
- 2026-02-13, TFTF: Training-Free Targeted Flow for Conditional Sampling, Qianqian Qu et.al., Paper: http://arxiv.org/abs/2602.12932
- 2026-02-13, TCRL: Temporal-Coupled Adversarial Training for Robust Constrained Reinforcement Learning in Worst-Case Scenarios, Wentao Xu et.al., Paper: http://arxiv.org/abs/2602.13040
- 2026-03-26, TAPO: Translation Augmented Policy Optimization for Multilingual Mathematical Reasoning, Xu Huang et.al., Paper: http://arxiv.org/abs/2603.25419
- 2026-01-16, TANDEM: Temporal-Aware Neural Detection for Multimodal Hate Speech, Girish A. Koushik et.al., Paper: http://arxiv.org/abs/2601.11178
- 2026-01-13, Systemic Risk Surveillance, Timo Dimitriadis et.al., Paper: http://arxiv.org/abs/2601.08598
- 2026-04-02, Systematic Analyses of Reinforcement Learning Controllers in Signalized Urban Corridors, Xiaofei Song et.al., Paper: http://arxiv.org/abs/2604.02025
- 2025-10-24, System-Theoretic Analysis of Dynamic Generalized Nash Equilibrium Problems – Turnpikes and Dissipativity, Sophie Hall et.al., Paper: http://arxiv.org/abs/2510.21556
- 2026-02-08, System-Level Error Propagation and Tail-Risk Amplification in Reference-Based Robotic Navigation, Ning Hu et.al., Paper: http://arxiv.org/abs/2602.07846
- 2026-02-25, System Design of the Ultra Mobility Vehicle: A Driving, Balancing, and Jumping Bicycle Robot, Benjamin Bokser et.al., Paper: http://arxiv.org/abs/2602.22118
- 2026-03-06, Synthetic Monitoring Environments for Reinforcement Learning, Leonard Pleiss et.al., Paper: http://arxiv.org/abs/2603.06252
- 2025-12-10, SynthPix: A lightspeed PIV images generator, Antonio Terpin et.al., Paper: http://arxiv.org/abs/2512.09664
- 2025-11-28, Synchrotron Self-Compton Model of TeV Afterglows in Gamma-Ray Bursts, Edilberto Aguilar-Ruiz et.al., Paper: http://arxiv.org/abs/2511.23349
- 2025-11-24, Syn-GRPO: Self-Evolving Data Synthesis for MLLM Perception Reasoning, Qihan Huang et.al., Paper: http://arxiv.org/abs/2511.19343
- 2026-01-20, Symmetry Breaking and Phase Transitions in Random Non-Commutative Geometries and Related Random-Matrix Ensembles, Mauro D’Arcangelo et.al., Paper: http://arxiv.org/abs/2601.14141
- 2025-12-31, Symmetric mass generation as a multicritical point with enhanced symmetry, Sandip Maiti et.al., Paper: http://arxiv.org/abs/2512.24836
- 2026-02-26, Symmetric Mass Generation via Multicriticality in a 3D Lattice Gross-Neveu Model, Sandip Maiti et.al., Paper: http://arxiv.org/abs/2602.23158
- 2026-01-29, SymbXRL: Symbolic Explainable Deep Reinforcement Learning for Mobile Networks, Abhishek Duttagupta et.al., Paper: http://arxiv.org/abs/2601.22024
- 2026-02-03, SymPlex: A Structure-Aware Transformer for Symbolic PDE Solving, Yesom Park et.al., Paper: http://arxiv.org/abs/2602.03816
- 2026-03-04, Swimming Under Constraints: A Safe Reinforcement Learning Framework for Quadrupedal Bio-Inspired Propulsion, Xinyu Cui et.al., Paper: http://arxiv.org/abs/2603.04073
- 2025-10-24, Surrogate-based quantification of policy uncertainty in generative flow networks, Ramón Nartallo-Kaluarachchi et.al., Paper: http://arxiv.org/abs/2510.21523
- 2025-12-19, Surrogate-Accelerated Bayesian Inversion for Exoplanet Interior Characterization, Tijn De Wringer et.al., Paper: http://arxiv.org/abs/2512.17626
- 2026-02-02, Surface Interactions in Photon Monte Carlo Simulations, J. R. Peterson et.al., Paper: http://arxiv.org/abs/2602.02321
- 2026-03-14, Supervised Fine-Tuning versus Reinforcement Learning: A Study of Post-Training Methods for Large Language Models, Haitao Jiang et.al., Paper: http://arxiv.org/abs/2603.13985
- 2025-11-04, Supernova Classification using the Recurrent Neural Network in the CSST Ultra-Deep Field Survey, Minglin Wang et.al., Paper: http://arxiv.org/abs/2511.02631
- 2025-11-10, Superhuman AI for Stratego Using Self-Play Reinforcement Learning and Test-Time Search, Samuel Sokota et.al., Paper: http://arxiv.org/abs/2511.07312
- 2025-12-22, Superconductivity in Electron Liquids: Precision Many-Body Treatment of Coulomb Interaction, Xiansheng Cai et.al., Paper: http://arxiv.org/abs/2512.19382
- 2026-03-25, SumRank: Aligning Summarization Models for Long-Document Listwise Reranking, Jincheng Feng et.al., Paper: http://arxiv.org/abs/2603.24204
- 2026-03-10, Subspace decomposition with defect diffusion coefficient, Dilini Kolombage et.al., Paper: http://arxiv.org/abs/2603.09924
- 2025-12-09, Study of a small-scale gamma-ray detection system employing Compton scattering with a monolithic CeBr3 crystal and segmented photodetector array, Veronika Asova et.al., Paper: http://arxiv.org/abs/2512.08593
- 2025-10-31, Study of Central Exclusive Production of $π^+π^-$, $K^+K^-$ and $p \bar{p}$ Pairs in Proton-Proton Collisions at $\sqrt{s} = 510$ GeV with the STAR Detector at RHIC, Tomas Truhlar et.al., Paper: http://arxiv.org/abs/2510.27482
- 2026-01-22, Structured Hints for Sample-Efficient Lean Theorem Proving, Zachary Burton et.al., Paper: http://arxiv.org/abs/2601.16172
- 2025-12-04, Structured Document Translation via Format Reinforcement Learning, Haiyue Song et.al., Paper: http://arxiv.org/abs/2512.05100
- 2026-02-08, Structure Preserving Approximation of Semiconcave Functions, Karl Kunisch et.al., Paper: http://arxiv.org/abs/2602.07770
- 2026-03-13, Structural Parameters of the Globular Cluster M 15, M. V. Petkova et.al., Paper: http://arxiv.org/abs/2603.13140
- 2026-03-25, Strong-to-Weak Spontaneous Symmetry Breaking in a $(2+1)$ D Transverse-Field Ising Model under Decoherence, Yi-Ming Ding et.al., Paper: http://arxiv.org/abs/2603.24342
- 2025-11-13, Strategic Opponent Modeling with Graph Neural Networks, Deep Reinforcement Learning and Probabilistic Topic Modeling, Georgios Chalkiadakis et.al., Paper: http://arxiv.org/abs/2511.10501
- 2025-11-13, Strangeness enhancement at its extremes: multiple (multi-)strange hadron production in pp collisions at $\mathbf{\sqrt{\textit{s}} = 5.02}$ TeV, ALICE Collaboration et.al., Paper: http://arxiv.org/abs/2511.10413
- 2025-12-18, Strain-Controlled Magnetic Phase Transitions through Anisotropic Exchange Interactions: A Combined DFT and Monte Carlo Study, Sudip Mandal et.al., Paper: http://arxiv.org/abs/2512.16765
- 2026-02-12, Stop Unnecessary Reflection: Training LRMs for Efficient Reasoning with Adaptive Reflection and Length Coordinated Penalty, Zewei Yu et.al., Paper: http://arxiv.org/abs/2602.12113
- 2026-02-05, Stop Rewarding Hallucinated Steps: Faithfulness-Aware Step-Level Reinforcement Learning for Small Reasoning Models, Shuo Nie et.al., Paper: http://arxiv.org/abs/2602.05897
- 2026-01-22, Stochastically forced compressible Navier-Stokes equations with slip boundary conditions of friction type, Reo Tsuboya et.al., Paper: http://arxiv.org/abs/2601.15768
- 2025-11-06, Stochastic simulation of partial discharge inception, Jannis Teunissen et.al., Paper: http://arxiv.org/abs/2511.04356
- 2026-03-17, Stochastic Resetting Accelerates Policy Convergence in Reinforcement Learning, Jello Zhou et.al., Paper: http://arxiv.org/abs/2603.16842
- 2025-12-05, Stochastic Reconfiguration with Warm-Started SVD, Dexuan Zhou et.al., Paper: http://arxiv.org/abs/2512.05749
- 2026-02-25, Stochastic Optimal Control with Side Information and Bayesian Learning, Johannes Milz et.al., Paper: http://arxiv.org/abs/2602.22047
- 2025-11-26, Stochastic Optimal Control of Interacting Particle Systems in Hilbert Spaces and Applications, Filippo de Feo et.al., Paper: http://arxiv.org/abs/2511.21646
- 2026-03-09, Stochastic Loop Corrections to Belief Propagation for Tensor Network Contraction, Gi Beom Sim et.al., Paper: http://arxiv.org/abs/2603.08427
- 2025-10-29, Stochastic Control of Dividends with a Drawdown Penalty, Kira Dudziak et.al., Paper: http://arxiv.org/abs/2510.25494
- 2025-11-24, Stochastic Adaptive Optimization with Unreliable Inputs: A Unified Framework for High-Probability Complexity Analysis, Katya Scheinberg et.al., Paper: http://arxiv.org/abs/2511.19411
- 2026-01-02, Stochastic Actor-Critic: Mitigating Overestimation via Temporal Aleatoric Uncertainty, Uğurcan Özalp et.al., Paper: http://arxiv.org/abs/2601.00737
- 2025-12-17, Stepwise Think-Critique: A Unified Framework for Robust and Interpretable LLM Reasoning, Jiaqi Xu et.al., Paper: http://arxiv.org/abs/2512.15662
- 2025-12-02, Steering Vision-Language-Action Models as Anti-Exploration: A Test-Time Scaling Approach, Siyuan Yang et.al., Paper: http://arxiv.org/abs/2512.02834
- 2026-02-09, StealthRL: Reinforcement Learning Paraphrase Attacks for Multi-Detector Evasion of AI-Text Detectors, Suraj Ranganath et.al., Paper: http://arxiv.org/abs/2602.08934
- 2025-12-17, Statistical repulsion on hyperons in two-color dense QCD, Masato Nagatsuka et.al., Paper: http://arxiv.org/abs/2512.15434
- 2022-11-09, State Advantage Weighting for Offline RL, Jiafei Lyu et.al., Paper: http://arxiv.org/abs/2210.04251
- 2026-03-06, Star-based Navigation in the Outer Solar System, Vittorio Franzese et.al., Paper: http://arxiv.org/abs/2603.06247
- 2026-01-12, Stagewise Reinforcement Learning and the Geometry of the Regret Landscape, Chris Elliott et.al., Paper: http://arxiv.org/abs/2601.07524
- 2026-02-19, Stackelberg Dynamic Location Planning under Cumulative Demand, Warley Almeida Silva et.al., Paper: http://arxiv.org/abs/2602.17392
- 2026-01-09, StackPlanner: A Centralized Hierarchical Multi-Agent System with Task-Experience Memory Management, Ruizhe Zhang et.al., Paper: http://arxiv.org/abs/2601.05890
- 2026-03-03, Stable solutions to reaction-diffusion elliptic problems, Xavier Cabre et.al., Paper: http://arxiv.org/abs/2603.03161
- 2026-04-02, Stable and Efficient Algorithms for the Fermion Determinant, Johann Ostmeyer et.al., Paper: http://arxiv.org/abs/2604.02130
- 2026-01-07, Stable Language Guidance for Vision-Language-Action Models, Zhihao Zhan et.al., Paper: http://arxiv.org/abs/2601.04052
- 2026-04-01, Stable Determinant Monte Carlo Simulations at Large Inverse Temperature $β$, Thomas Luu et.al., Paper: http://arxiv.org/abs/2604.00815
- 2026-02-19, Stable Asynchrony: Variance-Controlled Off-Policy RL for LLMs, Luke Huang et.al., Paper: http://arxiv.org/abs/2602.17616
- 2025-10-30, Stabilizing Rayleigh-Benard convection with reinforcement learning trained on a reduced-order model, Qiwei Chen et.al., Paper: http://arxiv.org/abs/2510.26705
- 2025-11-20, Stabilizing Policy Gradient Methods via Reward Profiling, Shihab Ahmed et.al., Paper: http://arxiv.org/abs/2511.16629
- 2026-02-17, Stability of Bose-Fermi mixtures in two dimensions: a lowest-order constrained variational approach, Pietro Cordioli et.al., Paper: http://arxiv.org/abs/2602.15598
- 2026-02-24, Squint: Fast Visual Reinforcement Learning for Sim-to-Real Robotics, Abdulaziz Almuzairee et.al., Paper: http://arxiv.org/abs/2602.21203
- 2025-12-01, Spontaneous Symmetry Breaking in Two-dimensional Long-range Heisenberg Model, Dingyun Yao et.al., Paper: http://arxiv.org/abs/2512.01956
- 2026-03-11, Splat2Real: Novel-view Scaling for Physical AI with 3D Gaussian Splatting, Hansol Lim et.al., Paper: http://arxiv.org/abs/2603.10638
- 2025-12-02, Spinons and Spin-Charge Separation at the Deconfined Quantum Critical Point, Sibin Yang et.al., Paper: http://arxiv.org/abs/2512.02962
- 2025-11-13, Spin Liquids on the Tetratrillium Lattice, Matías G. Gonzalez et.al., Paper: http://arxiv.org/abs/2511.10489
- 2026-01-23, Spiking Neural Networks for Communication Systems: Encoding Schemes, Learning Algorithms, and Equalization~Techniques, Eike-Manuel Edelmann et.al., Paper: http://arxiv.org/abs/2601.16550
- 2026-03-05, SpiderCat: Optimal Fault-Tolerant Cat State Preparation, Andrey Boris Khesin et.al., Paper: http://arxiv.org/abs/2603.05391
- 2026-01-02, Spectral Shapes of Pair Annihilation Line Emission in Magnetar Giant Flares, Tomoki Wada et.al., Paper: http://arxiv.org/abs/2601.00666
- 2026-03-20, Spectral Alignment in Forward-Backward Representations via Temporal Abstraction, Seyed Mahdi B. Azad et.al., Paper: http://arxiv.org/abs/2603.20103
- 2026-03-03, Specificity-aware reinforcement learning for fine-grained open-world classification, Samuele Angheben et.al., Paper: http://arxiv.org/abs/2603.03197
- 2026-03-24, SpecXMaster Technical Report, Yutang Ge et.al., Paper: http://arxiv.org/abs/2603.23101
- 2026-01-20, Spatiotemporal Wildfire Prediction and Reinforcement Learning for Helitack Suppression, Shaurya Mathur et.al., Paper: http://arxiv.org/abs/2601.14238
- 2025-12-12, Spatially Varying Gene Regulatory Networks via Bayesian Nonparametric Covariate-Dependent Directed Cyclic Graphical Models, Trisha Dawn et.al., Paper: http://arxiv.org/abs/2512.11732
- 2025-11-10, SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards, Hunar Batra et.al., Paper: http://arxiv.org/abs/2511.07403
- 2026-03-23, SpatialReward: Verifiable Spatial Reward Modeling for Fine-Grained Spatial Consistency in Text-to-Image Generation, Sashuai Zhou et.al., Paper: http://arxiv.org/abs/2603.22228
- 2025-12-08, SpatialDreamer: Incentivizing Spatial Reasoning via Active Mental Imagery, Meng Cao et.al., Paper: http://arxiv.org/abs/2512.07733
- 2025-10-31, Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning, Yuhong Liu et.al., Paper: http://arxiv.org/abs/2510.27606
- 2026-01-13, Sparsifying transform priors in Gaussian graphical models, Marcus Gehrmann et.al., Paper: http://arxiv.org/abs/2601.08596
- 2026-03-06, Sparse probabilistic evaluation for treatment planning: a feasibility study in IMPT head & neck patients, Jenneke I. de Jong et.al., Paper: http://arxiv.org/abs/2603.06314
- 2025-12-31, Sparse Offline Reinforcement Learning with Corruption Robustness, Nam Phuong Tran et.al., Paper: http://arxiv.org/abs/2512.24768
- 2026-02-23, Sparse Masked Attention Policies for Reliable Generalization, Caroline Horsch et.al., Paper: http://arxiv.org/abs/2602.19956
- 2025-12-26, Spacetime Spins: Statistical mechanics for error correction with stabilizer circuits, Cory T. Aitchison et.al., Paper: http://arxiv.org/abs/2512.21991
- 2025-12-03, SpaceTools: Tool-Augmented Spatial Reasoning via Double Interactive RL, Siyi Chen et.al., Paper: http://arxiv.org/abs/2512.04069
- 2026-03-24, SortedRL: Accelerating RL Training for LLMs through Online Length-Aware Scheduling, Yiqi Zhang et.al., Paper: http://arxiv.org/abs/2603.23414
- 2026-02-17, Some phenomenological aspects of a quantum-corrected Reissner-Nordström black hole: quasi-periodic oscillations, scalar perturbations and thermal fluctuations, Faizuddin Ahmed et.al., Paper: http://arxiv.org/abs/2602.15551
- 2025-11-26, Some aspects of robustness in modern Markov Chain Monte Carlo, Sam Power et.al., Paper: http://arxiv.org/abs/2511.21563
- 2026-02-17, Solving Parameter-Robust Avoid Problems with Unknown Feasibility using Reinforcement Learning, Oswin So et.al., Paper: http://arxiv.org/abs/2602.15817
- 2025-11-18, Solving Navier-Stokes Equations Using Data-free Physics-Informed Neural Networks With Hard Boundary Conditions, Ritik Pal et.al., Paper: http://arxiv.org/abs/2511.14497
- 2025-10-20, SoftMimic: Learning Compliant Whole-body Control from Examples, Gabriel B. Margolis et.al., Paper: http://arxiv.org/abs/2510.17792
- 2026-02-06, Soft Forward-Backward Representations for Zero-shot Reinforcement Learning with General Utilities, Marco Bagatella et.al., Paper: http://arxiv.org/abs/2602.06769
- 2025-10-21, Socialized Learning and Emergent Behaviors in Multi-Agent Systems based on Multimodal Large Language Models, Sureyya Akin et.al., Paper: http://arxiv.org/abs/2510.18515
- 2026-02-09, SoK: The Pitfalls of Deep Reinforcement Learning for Cybersecurity, Shae McFadden et.al., Paper: http://arxiv.org/abs/2602.08690
- 2025-11-10, Smoothing Out Sticking Points: Sampling from Discrete-Continuous Mixtures with Dynamical Monte Carlo by Mapping Discrete Mass into a Latent Universe, Andrew Chin et.al., Paper: http://arxiv.org/abs/2511.07340
- 2025-11-17, Smoothed-Cubic Spin-Glass Model of Random Lasers, Marcello Benedetti et.al., Paper: http://arxiv.org/abs/2511.13508
- 2026-03-14, SmoothVLA: Aligning Vision-Language-Action Models with Physical Constraints via Intrinsic Smoothness Optimization, Jiashun Li et.al., Paper: http://arxiv.org/abs/2603.13925
- 2026-01-12, Smooth Operator: Smooth Verifiable Reward Activates Spatial Reasoning Ability of Vision-Language Model, Siwen Jiao et.al., Paper: http://arxiv.org/abs/2601.07695
- 2025-10-22, SmartSwitch: Advancing LLM Reasoning by Overcoming Underthinking via Promoting Deeper Thought Exploration, Xichen Zhang et.al., Paper: http://arxiv.org/abs/2510.19767
- 2025-11-28, SmallWorlds: Assessing Dynamics Understanding of World Models in Isolated Environments, Xinyi Li et.al., Paper: http://arxiv.org/abs/2511.23465
- 2026-02-04, Skin Tokens: A Learned Compact Representation for Unified Autoregressive Rigging, Jia-peng Zhang et.al., Paper: http://arxiv.org/abs/2602.04805
- 2025-12-03, SkillFactory: Self-Distillation For Learning Cognitive Behaviors, Zayne Sprague et.al., Paper: http://arxiv.org/abs/2512.04072
- 2026-01-09, SketchVL: Policy Optimization via Fine-Grained Credit Assignment for Chart Understanding and More, Muye Huang et.al., Paper: http://arxiv.org/abs/2601.05688
- 2026-02-04, Site and bond percolation in linearly distorted triangular and square lattices, Bishnu Bhowmik et.al., Paper: http://arxiv.org/abs/2602.04818
- 2026-02-24, Singular Arrange and Traverse Algorithm for Computing Reeb Spaces of Bivariate PL Maps, Petar Hristov et.al., Paper: http://arxiv.org/abs/2602.21087
- 2025-11-25, Single-hole spectral functions in 1D quantum magnets with different ground states, Sibin Yang et.al., Paper: http://arxiv.org/abs/2511.20447
- 2026-04-01, Single-Waveguide Multiple-Pinching-Antenna Systems: OMA versus NOMA, Yanyu Cheng et.al., Paper: http://arxiv.org/abs/2604.00588
- 2025-12-10, Simultaneous Tactile-Visual Perception for Learning Multimodal Robot Manipulation, Yuyang Li et.al., Paper: http://arxiv.org/abs/2512.09851
- 2026-02-11, Simultaneous Speech-to-Speech Translation Without Aligned Data, Tom Labiausse et.al., Paper: http://arxiv.org/abs/2602.11072
- 2025-11-05, Simulation-Based Validation of an Integrated 4D/5D Digital-Twin Framework for Predictive Construction Control, Atena Khoshkonesh et.al., Paper: http://arxiv.org/abs/2511.03684
- 2025-12-29, Simulation of tau decays, ambiguities and anomalous couplings, Zbigniew Was et.al., Paper: http://arxiv.org/abs/2512.23475
- 2026-01-05, Simulation of Radiation Chemistry by a One-Shot Hybrid Continuum / Monte Carlo Method, Charlie Fynn Perkins et.al., Paper: http://arxiv.org/abs/2601.02132
- 2026-03-16, Simulating the Open System Dynamics of Multiple Exchange-Only Qubits using Subspace Monte Carlo, Tameem Albash et.al., Paper: http://arxiv.org/abs/2603.15577
- 2025-11-24, Simulating dynamics of the two-dimensional transverse-field Ising model: a comparative study of large-scale classical numerics, Joseph Vovrosh et.al., Paper: http://arxiv.org/abs/2511.19340
- 2026-01-08, SimuAgent: An LLM-Based Simulink Modeling Assistant Enhanced with Reinforcement Learning, Yanchang Liang et.al., Paper: http://arxiv.org/abs/2601.05187
- 2026-02-26, Simple Models, Real Swimming: Digital Twins for Tendon-Driven Underwater Robots, Mike Y. Michelis et.al., Paper: http://arxiv.org/abs/2602.23283
- 2026-01-14, Sim2real Image Translation Enables Viewpoint-Robust Policies from Fixed-Camera Datasets, Jeremiah Coholich et.al., Paper: http://arxiv.org/abs/2601.09605
- 2025-12-09, Sim2Swim: Zero-Shot Velocity Control for Agile AUV Maneuvering in 3 Minutes, Lauritz Rismark Fosso et.al., Paper: http://arxiv.org/abs/2512.08656
- 2026-03-12, Sim-to-reality adaptation for Deep Reinforcement Learning applied to an underwater docking application, Alaaeddine Chaarani et.al., Paper: http://arxiv.org/abs/2603.12020
- 2026-01-23, Sim-to-Real Transfer via a Style-Identified Cycle Consistent Generative Adversarial Network: Zero-Shot Deployment on Robotic Manipulators through Visual Domain Adaptation, Lucía Güitta-López et.al., Paper: http://arxiv.org/abs/2601.16677
- 2025-12-03, Sign-Resolved Statistics and the Origin of Bias in Quantum Monte Carlo, Ryan Larson et.al., Paper: http://arxiv.org/abs/2512.04056
- 2026-03-18, ShuttleEnv: An Interactive Data-Driven RL Environment for Badminton Strategy Modeling, Ang Li et.al., Paper: http://arxiv.org/abs/2603.17324
- 2025-11-05, Shrinking the Variance: Shrinkage Baselines for Reinforcement Learning with Verifiable Rewards, Guanning Zeng et.al., Paper: http://arxiv.org/abs/2511.03710
- 2025-10-21, Sherlock Your Queries: Learning to Ask the Right Questions for Dialogue-Based Retrieval, Dong Yun et.al., Paper: http://arxiv.org/abs/2510.18659
- 2026-03-31, ShapE-GRPO: Shapley-Enhanced Reward Allocation for Multi-Candidate LLM Training, Rui Ai et.al., Paper: http://arxiv.org/abs/2603.29871
- 2025-10-27, Sequential Multi-Agent Dynamic Algorithm Configuration, Chen Lu et.al., Paper: http://arxiv.org/abs/2510.23535
- 2026-01-09, Sequential Bayesian Optimal Experimental Design in Infinite Dimensions via Policy Gradient Reinforcement Learning, Kaichen Shen et.al., Paper: http://arxiv.org/abs/2601.05868
- 2026-02-12, Seq2Seq2Seq: Lossless Data Compression via Discrete Latent Transformers and Reinforcement Learning, Mahdi Khodabandeh et.al., Paper: http://arxiv.org/abs/2602.12146
- 2026-03-12, Separable neural architectures as a primitive for unified predictive and generative intelligence, Reza T. Batley et.al., Paper: http://arxiv.org/abs/2603.12244
- 2026-03-26, Sensitivity Analysis for Instrumental Variables Under Joint Relaxations of Monotonicity and Independence, Pedro Picchetti et.al., Paper: http://arxiv.org/abs/2603.25529
- 2025-11-17, Sense and Sensitivity - I. Uncertainty analysis of the gas-phase chemistry in AGB outflows, M. Van de Sande et.al., Paper: http://arxiv.org/abs/2511.13638
- 2025-11-19, Semiparametric Estimation of Fractional Integration: An Evaluation of Local Whittle Methods, Jason R. Blevins et.al., Paper: http://arxiv.org/abs/2511.15689
- 2026-01-14, Semi-physical Gamma-Process Degradation Modeling and Performance-Driven Opportunistic Maintenance Optimization for LED Lighting Systems, Haohao Shi et.al., Paper: http://arxiv.org/abs/2601.09380
- 2026-03-24, Semi-cosmographic constraints on decaying dark matter and dynamical dark energy: DESI DR2 BAO and 21\, cm intensity-mapping forecasts, Mohit Yadav et.al., Paper: http://arxiv.org/abs/2603.23247
- 2025-12-03, Semi-Markov Decision Process Framework for Age of Incorrect Information Minimization, Ismail Cosandal et.al., Paper: http://arxiv.org/abs/2512.04077
- 2025-10-22, Semi-Implicit Approaches for Large-Scale Bayesian Spatial Interpolation, Sébastien Garneau et.al., Paper: http://arxiv.org/abs/2510.19722
- 2026-01-14, Semi-Contention-Free Access in IoT NOMA Networks: A Reinforcement Learning Framework, Abhishek Kumar et.al., Paper: http://arxiv.org/abs/2601.09422
- 2026-02-06, Semantically Labelled Automata for Multi-Task Reinforcement Learning with LTL Instructions, Alessandro Abate et.al., Paper: http://arxiv.org/abs/2602.06746
- 2025-12-04, Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning, Purbesh Mitra et.al., Paper: http://arxiv.org/abs/2512.05105
- 2026-03-04, Self-adapting Robotic Agents through Online Continual Reinforcement Learning with World Model Feedback, Fabian Domberg et.al., Paper: http://arxiv.org/abs/2603.04029
- 2026-01-26, Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models, Siyan Zhao et.al., Paper: http://arxiv.org/abs/2601.18734
- 2026-01-27, Self-Distillation Enables Continual Learning, Idan Shenfeld et.al., Paper: http://arxiv.org/abs/2601.19897
- 2026-02-25, Self-Curriculum Model-based Reinforcement Learning for Shape Control of Deformable Linear Objects, Zhaowei Liang et.al., Paper: http://arxiv.org/abs/2602.21816
- 2025-11-18, Seer: Online Context Learning for Fast Synchronous LLM Reinforcement Learning, Ruoyu Qin et.al., Paper: http://arxiv.org/abs/2511.14617
- 2026-03-23, Seeing is Improving: Visual Feedback for Iterative Text Layout Refinement, Junrong Guo et.al., Paper: http://arxiv.org/abs/2603.22187
- 2025-12-15, Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model, Siyan Chen et.al., Paper: http://arxiv.org/abs/2512.13507
- 2026-03-29, Secure Reinforcement Learning: On Model-Free Detection of Man in the Middle Attacks, Rishi Rani et.al., Paper: http://arxiv.org/abs/2603.27592
- 2026-02-03, Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration, Bowei He et.al., Paper: http://arxiv.org/abs/2602.03647
- 2025-10-21, Search Self-play: Pushing the Frontier of Agent Capability without Supervision, Hongliang Lu et.al., Paper: http://arxiv.org/abs/2510.18821
- 2025-12-29, Scrutinizing the KNT model with vacuum stability conditions, Tim Huesmann et.al., Paper: http://arxiv.org/abs/2512.23662
- 2025-11-20, SceneDesigner: Controllable Multi-Object Image Generation with 9-DoF Pose Manipulation, Zhenyuan Qin et.al., Paper: http://arxiv.org/abs/2511.16666
- 2025-10-29, Scaling flow-based approaches for topology sampling in $\mathrm{SU}(3)$ gauge theory, Claudio Bonanno et.al., Paper: http://arxiv.org/abs/2510.25704
- 2025-12-31, Scaling Open-Ended Reasoning to Predict the Future, Nikhil Chandak et.al., Paper: http://arxiv.org/abs/2512.25070
- 2025-12-22, Scalably Enhancing the Clinical Validity of a Task Benchmark with Physician Oversight, Junze Ye et.al., Paper: http://arxiv.org/abs/2512.19691
- 2026-01-27, Scalable Exploration for High-Dimensional Continuous Control via Value-Guided Flow, Yunyue Wei et.al., Paper: http://arxiv.org/abs/2601.19707
- 2025-11-06, Scalable Domain-decomposed Monte Carlo Neutral Transport for Nuclear Fusion, Oskar Lappi et.al., Paper: http://arxiv.org/abs/2511.04489
- 2026-01-15, Scalable Algorithms for Approximate DNF Model Counting, Paul Burkhardt et.al., Paper: http://arxiv.org/abs/2601.10511
- 2025-10-22, Scaf-GRPO: Scaffolded Group Relative Policy Optimization for Enhancing LLM Reasoning, Xichen Zhang et.al., Paper: http://arxiv.org/abs/2510.19807
- 2026-03-05, SarcasmMiner: A Dual-Track Post-Training Framework for Robust Audio-Visual Sarcasm Reasoning, Zhu Li et.al., Paper: http://arxiv.org/abs/2603.05275
- 2025-11-17, Sampling Density for Gabor Phase Retrieval, Ting Chen et.al., Paper: http://arxiv.org/abs/2511.13500
- 2025-10-28, Sample-efficient and Scalable Exploration in Continuous-Time RL, Klemens Iten et.al., Paper: http://arxiv.org/abs/2510.24482
- 2025-11-07, Sample Complexity of Distributionally Robust Off-Dynamics Reinforcement Learning with Online Interaction, Yiting He et.al., Paper: http://arxiv.org/abs/2511.05396
- 2026-03-11, Safety-critical Control Under Partial Observability: Reach-Avoid POMDP meets Belief Space Control, Matti Vahs et.al., Paper: http://arxiv.org/abs/2603.10572
- 2026-03-19, Safety-Guaranteed Imitation Learning from Nonlinear Model Predictive Control for Spacecraft Close Proximity Operations, Alexander Meinert et.al., Paper: http://arxiv.org/abs/2603.18910
- 2026-02-11, Safety Recovery in Reasoning Models Is Only a Few Early Steering Steps Away, Soumya Suvra Ghosal et.al., Paper: http://arxiv.org/abs/2602.11096
- 2026-02-27, SafeGen-LLM: Enhancing Safety Generalization in Task Planning for Robotic Systems, Jialiang Fan et.al., Paper: http://arxiv.org/abs/2602.24235
- 2026-03-03, Safe and Robust Domains of Attraction for Discrete-Time Systems: A Set-Based Characterization and Certifiable Neural Network Estimation, Mohamed Serry et.al., Paper: http://arxiv.org/abs/2603.03082
- 2026-02-04, Safe Urban Traffic Control via Uncertainty-Aware Conformal Prediction and World-Model Reinforcement Learning, Joydeep Chandra et.al., Paper: http://arxiv.org/abs/2602.04821
- 2026-01-08, Safe Reinforcement Learning Beyond Baseline Control: A Hierarchical Framework for Space Triangle Tethered Formation System, Xinyi Tao et.al., Paper: http://arxiv.org/abs/2601.04957
- 2026-03-11, Safe RLHF Beyond Expectation: Stochastic Dominance for Universal Spectral Risk Control, Yaswanth Chittepu et.al., Paper: http://arxiv.org/abs/2603.10938
- 2026-01-27, Safe Exploration via Policy Priors, Manuel Wendl et.al., Paper: http://arxiv.org/abs/2601.19612
- 2026-01-08, Safe Continual Reinforcement Learning Methods for Nonstationary Environments. Towards a Survey of the State of the Art, Timofey Tomashevskiy et.al., Paper: http://arxiv.org/abs/2601.05152
- 2025-10-21, Safe But Not Sorry: Reducing Over-Conservatism in Safety Critics via Uncertainty-Aware Modulation, Daniel Bethell et.al., Paper: http://arxiv.org/abs/2510.18478
- 2026-03-04, SaFeR: Safety-Critical Scenario Generation for Autonomous Driving Test via Feasibility-Constrained Token Resampling, Jinlong Cui et.al., Paper: http://arxiv.org/abs/2603.04071
- 2026-02-02, SWE-Universe: Scale Real-World Verifiable Environments to Millions, Mouxiang Chen et.al., Paper: http://arxiv.org/abs/2602.02361
- 2025-12-26, SWE-RM: Execution-free Feedback For Software Engineering Agents, KaShun Shum et.al., Paper: http://arxiv.org/abs/2512.21919
- 2026-02-25, SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents, Patrick Tser Jern Kon et.al., Paper: http://arxiv.org/abs/2602.22124
- 2025-11-21, SVRecon: Sparse Voxel Rasterization for Surface Reconstruction, Seunghun Oh et.al., Paper: http://arxiv.org/abs/2511.17364
- 2025-12-12, SUMFORU: An LLM-Based Review Summarization Framework for Personalized Purchase Decision Support, Yuming Feng et.al., Paper: http://arxiv.org/abs/2512.11755
- 2025-11-28, SU( $N_c$) confinement and color $N_c$ -ality, V. Tomas Mari Surkau et.al., Paper: http://arxiv.org/abs/2511.23413
- 2026-01-06, STReasoner: Empowering LLMs for Spatio-Temporal Reasoning in Time Series via Spatial-Aware Reinforcement Learning, Juntong Ni et.al., Paper: http://arxiv.org/abs/2601.03248
- 2026-01-14, STEP3-VL-10B Technical Report, Ailin Huang et.al., Paper: http://arxiv.org/abs/2601.09668
- 2025-11-14, STEM EBIC as a Quantitative Probe of Semiconductor Devices, Sebastian Schneider et.al., Paper: http://arxiv.org/abs/2511.11528
- 2025-12-04, STARE-VLA: Progressive Stage-Aware Reinforcement for Fine-Tuning Vision-Language-Action Models, Feng Xu et.al., Paper: http://arxiv.org/abs/2512.05107
- 2026-02-17, STAPO: Stabilizing Reinforcement Learning for LLMs by Silencing Rare Spurious Tokens, Shiqi Liu et.al., Paper: http://arxiv.org/abs/2602.15620
- 2025-12-10, STACHE: Local Black-Box Explanations for Reinforcement Learning Policies, Andrew Elashkin et.al., Paper: http://arxiv.org/abs/2512.09909
- 2026-03-04, SSR: A Generic Framework for Text-Aided Map Compression for Localization, Mohammad Omama et.al., Paper: http://arxiv.org/abs/2603.04272
- 2025-11-19, SRPO: Self-Referential Policy Optimization for Vision-Language-Action Models, Senyu Fei et.al., Paper: http://arxiv.org/abs/2511.15605
- 2026-03-17, SQL-ASTRA: Alleviating Sparse Feedback in Agentic SQL via Column-Set Matching and Trajectory Aggregation, Long Li et.al., Paper: http://arxiv.org/abs/2603.16161
- 2025-10-28, SPICE: Self-Play In Corpus Environments Improves Reasoning, Bo Liu et.al., Paper: http://arxiv.org/abs/2510.24684
- 2025-10-28, SPARTA: Evaluating Reasoning Segmentation Robustness through Black-Box Adversarial Paraphrasing in Text Autoencoder Latent Space, Viktoriia Zinkovich et.al., Paper: http://arxiv.org/abs/2510.24446
- 2026-02-26, SPARR: Simulation-based Policies with Asymmetric Real-world Residuals for Assembly, Yijie Guo et.al., Paper: http://arxiv.org/abs/2602.23253
- 2025-10-20, SPACeR: Self-Play Anchoring with Centralized Reference Models, Wei-Jer Chang et.al., Paper: http://arxiv.org/abs/2510.18060
- 2026-01-06, SOP: A Scalable Online Post-Training System for Vision-Language-Action Models, Mingjie Pan et.al., Paper: http://arxiv.org/abs/2601.03044
- 2025-11-17, SO(3) real algebra method for finite baryon-number density QCD, Hideo Suganuma et.al., Paper: http://arxiv.org/abs/2511.13551
- 2025-11-18, SMRC: Aligning Large Language Models with Student Reasoning for Mathematical Error Correction, Biaojie Zeng et.al., Paper: http://arxiv.org/abs/2511.14684
- 2025-12-02, SMP: Reusable Score-Matching Motion Priors for Physics-Based Character Control, Yuxuan Mu et.al., Paper: http://arxiv.org/abs/2512.03028
- 2026-02-19, SMAC: Score-Matched Actor-Critics for Robust Offline-to-Online Transfer, Nathan S. de Lara et.al., Paper: http://arxiv.org/abs/2602.17632
- 2025-11-24, SLMFix: Leveraging Small Language Models for Error Fixing with Reinforcement Learning, David Jiahao Fu et.al., Paper: http://arxiv.org/abs/2511.19422
- 2026-02-02, SLIME: Stabilized Likelihood Implicit Margin Enforcement for Preference Optimization, Maksim Afanasyev et.al., Paper: http://arxiv.org/abs/2602.02383
- 2026-01-29, SIA: Symbolic Interpretability for Anticipatory Deep Reinforcement Learning in Network Control, MohammadErfan Jabbari et.al., Paper: http://arxiv.org/abs/2601.22044
- 2025-10-28, SGFusion: Stochastic Geographic Gradient Fusion in Federated Learning, Khoa Nguyen et.al., Paper: http://arxiv.org/abs/2510.23455
- 2026-01-28, SERA: Soft-Verified Efficient Repository Agents, Ethan Shen et.al., Paper: http://arxiv.org/abs/2601.20789
- 2026-02-06, SEMA: Simple yet Effective Learning for Multi-Turn Jailbreak Attacks, Mingqian Feng et.al., Paper: http://arxiv.org/abs/2602.06854
- 2026-02-24, SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards, Dengjia Zhang et.al., Paper: http://arxiv.org/abs/2602.21158
- 2025-10-22, SEA: Semantic Map Prediction for Active Exploration of Uncertain Areas, Hongyu Ding et.al., Paper: http://arxiv.org/abs/2510.19766
- 2026-03-10, SEA-Nav: Efficient Policy Learning for Safe and Agile Quadruped Navigation in Cluttered Environments, Shiyi Chen et.al., Paper: http://arxiv.org/abs/2603.09460
- 2025-12-15, SCR2-ST: Combine Single Cell with Spatial Transcriptomics for Efficient Active Sampling via Reinforcement Learning, Junchao Zhu et.al., Paper: http://arxiv.org/abs/2512.13635
- 2025-12-19, SCOPE: Sequential Causal Optimization of Process Interventions, Jakob De Moor et.al., Paper: http://arxiv.org/abs/2512.17629
- 2026-02-10, SCOPE: A Training-Free Online 3D Deployment for UAV-BSs with Theoretical Analysis and Comparative Study, Chuan-Chi Lai et.al., Paper: http://arxiv.org/abs/2602.09971
- 2025-11-25, SBP-FDEC: Summation-by-Parts Finite Difference Exterior Calculus, Daniel Bach et.al., Paper: http://arxiv.org/abs/2511.20529
- 2026-01-22, SAMTok: Representing Any Mask with Two Words, Yikang Zhou et.al., Paper: http://arxiv.org/abs/2601.16093
- 2026-03-20, SAGE: Sustainable Agent-Guided Expert-tuning for Culturally Attuned Translation in Low-Resource Southeast Asia, Zhixiang Lu et.al., Paper: http://arxiv.org/abs/2603.19931
- 2026-02-04, SAFE: Stable Alignment Finetuning with Entropy-Aware Predictive Control for RLHF, Dipan Maity et.al., Paper: http://arxiv.org/abs/2602.04651
- 2025-12-04, SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards, Yuan Gao et.al., Paper: http://arxiv.org/abs/2512.05098
- 2026-03-18, Ruyi2.5 Technical Report, Huan Song et.al., Paper: http://arxiv.org/abs/2603.17311
- 2026-01-27, Run Dependent Monte Carlo at Belle II, Giovanni Gaudino et.al., Paper: http://arxiv.org/abs/2601.19728
- 2025-11-25, RubricRL: Simple Generalizable Rewards for Text-to-Image Generation, Xuelu Feng et.al., Paper: http://arxiv.org/abs/2511.20651
- 2025-11-13, Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following, Yun He et.al., Paper: http://arxiv.org/abs/2511.10507
- 2025-12-26, Rotationally invariant dynamical lattice regulators for Euclidean quantum field theories, Tsogtgerel Gantumur et.al., Paper: http://arxiv.org/abs/2512.22072
- 2026-03-21, Role of interstitial $s$ orbital in a model of infinite-layer nickelates, Yan Peng et.al., Paper: http://arxiv.org/abs/2603.20705
- 2025-11-14, Robust and Efficient Communication in Multi-Agent Reinforcement Learning, Zejiao Liu et.al., Paper: http://arxiv.org/abs/2511.11393
- 2025-11-19, Robust H-infinity control and worst-case search in constrained parametric space, Ervan Kassarian et.al., Paper: http://arxiv.org/abs/2511.15480
- 2026-03-10, Robust Cooperative Localization in Featureless Environments: A Comparative Study of DCL, StCL, CCL, CI, and Standard-CL, Nivand Khosravi et.al., Paper: http://arxiv.org/abs/2603.09886
- 2026-03-20, Robust Beam Codebooks for mmWave/THz Systems: Toward a Stochastic RL Approach, Anouar Nechi et.al., Paper: http://arxiv.org/abs/2603.19930
- 2025-11-10, Robot Learning from a Physical World Model, Jiageng Mao et.al., Paper: http://arxiv.org/abs/2511.07416
- 2025-11-13, Robot Crash Course: Learning Soft and Stylized Falling, Pascal Strauch et.al., Paper: http://arxiv.org/abs/2511.10635
- 2026-03-02, Robometer: Scaling General-Purpose Robotic Reward Models via Trajectory Comparisons, Anthony Liang et.al., Paper: http://arxiv.org/abs/2603.02115
- 2026-01-02, RoboReward: General-Purpose Vision-Language Reward Models for Robotics, Tony Lee et.al., Paper: http://arxiv.org/abs/2601.00675
- 2026-03-05, RoboPocket: Improve Robot Policies Instantly with Your Phone, Junjie Fang et.al., Paper: http://arxiv.org/abs/2603.05504
- 2025-12-24, RoboCade: Gamifying Robot Data Collection, Suvir Mirchandani et.al., Paper: http://arxiv.org/abs/2512.21235
- 2026-02-10, Robo3R: Enhancing Robotic Manipulation with Accurate Feed-Forward 3D Reconstruction, Sizhe Yang et.al., Paper: http://arxiv.org/abs/2602.10101
- 2025-12-29, Robo-Dopamine: General Process Reward Modeling for High-Precision Robotic Manipulation, Huajie Tan et.al., Paper: http://arxiv.org/abs/2512.23703
- 2025-12-01, RoaD: Rollouts as Demonstrations for Closed-Loop Supervised Fine-Tuning of Autonomous Driving Policies, Guillermo Garcia-Cobo et.al., Paper: http://arxiv.org/abs/2512.01993
- 2026-03-07, RoTri-Diff: A Spatial Robot-Object Triadic Interaction-Guided Diffusion Model for Bimanual Manipulation, Zixuan Chen et.al., Paper: http://arxiv.org/abs/2603.07165
- 2025-12-05, RoBoN: Routed Online Best-of-n for Test-Time Scaling with Multiple LLMs, Jonathan Geuter et.al., Paper: http://arxiv.org/abs/2512.05542
- 2026-02-26, Risk-Aware World Model Predictive Control for Generalizable End-to-End Autonomous Driving, Jiangxin Sun et.al., Paper: http://arxiv.org/abs/2602.23259
- 2025-11-14, Risk-Aware Deep Reinforcement Learning for Dynamic Portfolio Optimization, Emmanuel Lwele et.al., Paper: http://arxiv.org/abs/2511.11481
- 2026-01-16, Ring isomorphisms in norm between Banach algebras of continuous complex-valued functions, T. Miura et.al., Paper: http://arxiv.org/abs/2601.11165
- 2026-03-13, Reweighted information inequalities, Jonathan Niles-Weed et.al., Paper: http://arxiv.org/abs/2603.13135
- 2026-01-13, Rewarding the Rare: Uniqueness-Aware RL for Creative Problem Solving in LLMs, Zhiyuan Hu et.al., Paper: http://arxiv.org/abs/2601.08763
- 2025-10-20, Rewarding the Journey, Not Just the Destination: A Composite Path and Answer Self-Scoring Reward Mechanism for Test-Time Reinforcement Learning, Chenwei Tang et.al., Paper: http://arxiv.org/abs/2510.17923
- 2026-03-19, RewardFlow: Topology-Aware Reward Propagation on State Graphs for Agentic RL with Large Language Models, Xiao Feng et.al., Paper: http://arxiv.org/abs/2603.18859
- 2026-02-02, Reward-free Alignment for Conflicting Objectives, Peter Chen et.al., Paper: http://arxiv.org/abs/2602.02495
- 2023-06-22, Reward Shaping via Diffusion Process in Reinforcement Learning, Peeyush Kumar et.al., Paper: http://arxiv.org/abs/2306.11885
- 2026-02-03, Reward Redistribution for CVaR MDPs using a Bellman Operator on L-infinity, Aneri Muni et.al., Paper: http://arxiv.org/abs/2602.03778
- 2026-03-10, Reward Prediction with Factorized World States, Yijun Shen et.al., Paper: http://arxiv.org/abs/2603.09400
- 2026-01-28, Reward Models Inherit Value Biases from Pretraining, Brian Christian et.al., Paper: http://arxiv.org/abs/2601.20838
- 2026-01-16, Reward Modeling for Scientific Writing Evaluation, Furkan Şahinuç et.al., Paper: http://arxiv.org/abs/2601.11374
- 2025-12-17, Revisiting the Phase Diagram of Hard Sphere Dumbbells with Nested Sampling: Known Phases and New Packing Variants, Omar-Farouk Adesida et.al., Paper: http://arxiv.org/abs/2512.15665
- 2025-12-19, Revisiting the Broken Symmetry Phase of Solid Hydrogen: A Neural Network Variational Monte Carlo Study, Shengdu Chai et.al., Paper: http://arxiv.org/abs/2512.17703
- 2026-01-14, Revisiting Jahn–Teller Transitions in Correlated Oxides with Monte Carlo Modeling, Liam A. V. Nagle-Cocco et.al., Paper: http://arxiv.org/abs/2601.09705
- 2023-02-14, Review of Deep Reinforcement Learning for Autonomous Driving, B. Udugama et.al., Paper: http://arxiv.org/abs/2302.06370
- 2026-01-08, Revealing the Truth: Calculating True Values in Causal Inference Simulation Studies via Gaussian Quadrature, Alex Ocampo et.al., Paper: http://arxiv.org/abs/2601.05128
- 2025-11-13, Revealing the Connection Between the Filamentary Hierarchy and Star Cluster Formation in a Simulated NGC 628 Galaxy, Tamara Koletic et.al., Paper: http://arxiv.org/abs/2511.10486
- 2026-01-26, Reuse your FLOPs: Scaling RL on Hard Problems by Conditioning on Very Off-Policy Prefixes, Amrith Setlur et.al., Paper: http://arxiv.org/abs/2601.18795
- 2026-02-19, Retrospective In-Context Learning for Temporal Credit Assignment with Large Language Models, Wen-Tse Chen et.al., Paper: http://arxiv.org/abs/2602.17497
- 2026-03-09, RetroAgent: From Solving to Evolving via Retrospective Dual Intrinsic Feedback, Xiaoying Zhang et.al., Paper: http://arxiv.org/abs/2603.08561
- 2026-02-19, RetouchIQ: MLLM Agents for Instruction-Based Image Retouching with Generalist Reward, Qiucheng Wu et.al., Paper: http://arxiv.org/abs/2602.17558
- 2026-02-04, Rethinking the Trust Region in LLM Reinforcement Learning, Penghui Qi et.al., Paper: http://arxiv.org/abs/2602.04879
- 2026-02-03, Rethinking the Reranker: Boundary-Aware Evidence Selection for Robust Retrieval-Augmented Generation, Jiashuo Sun et.al., Paper: http://arxiv.org/abs/2602.03689
- 2026-03-04, Rethinking the Efficiency and Effectiveness of Reinforcement Learning for Radiology Report Generation, Zilin Lu et.al., Paper: http://arxiv.org/abs/2603.04022
- 2026-02-04, Rethinking the Design Space of Reinforcement Learning for Diffusion Models: On the Importance of Likelihood Estimation Beyond Loss Design, Jaemoo Choi et.al., Paper: http://arxiv.org/abs/2602.04663
- 2025-12-29, Rethinking conditioning in polarimetry: a new framework beyond $\ell^2$ -based metrics, Yuxi Cai et.al., Paper: http://arxiv.org/abs/2512.23689
- 2026-03-24, Rethinking Token-Level Policy Optimization for Multimodal Chain-of-Thought, Yunheng Li et.al., Paper: http://arxiv.org/abs/2603.22847
- 2026-03-09, Rethinking Strict Dissipativity for Economic MPC, Mario Zanon et.al., Paper: http://arxiv.org/abs/2603.08535
- 2026-03-17, Rethinking Pose Refinement in 3D Gaussian Splatting under Pose Prior and Geometric Uncertainty, Mangyu Kong et.al., Paper: http://arxiv.org/abs/2603.16538
- 2025-10-20, Rethinking On-policy Optimization for Query Augmentation, Zhichao Xu et.al., Paper: http://arxiv.org/abs/2510.17139
- 2025-12-12, Rethinking Expert Trajectory Utilization in LLM Post-training, Bowen Ding et.al., Paper: http://arxiv.org/abs/2512.11470
- 2026-03-02, Rethinking Camera Choice: An Empirical Study on Fisheye Camera Properties in Robotic Manipulation, Han Xue et.al., Paper: http://arxiv.org/abs/2603.02139
- 2025-10-21, Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting, Howard Chen et.al., Paper: http://arxiv.org/abs/2510.18874
- 2025-12-31, ResponseRank: Data-Efficient Reward Modeling through Preference Strength Learning, Timo Kaufmann et.al., Paper: http://arxiv.org/abs/2512.25023
- 2026-02-11, Resource-Efficient Model-Free Reinforcement Learning for Board Games, Kazuki Ota et.al., Paper: http://arxiv.org/abs/2602.10894
- 2026-03-17, Resonant scattering at the center of the galaxy cluster PKS 0745-191 with XRISM, Keita Tanaka et.al., Paper: http://arxiv.org/abs/2603.16263
- 2026-02-03, Resolving Quantum Criticality in the Honeycomb Hubbard Model, Fo-Hong Wang et.al., Paper: http://arxiv.org/abs/2602.03656
- 2026-02-10, Resilient Topology-Aware Coordination for Dynamic 3D UAV Networks under Node Failure, Chuan-Chi Lai et.al., Paper: http://arxiv.org/abs/2602.10029
- 2025-12-23, Resilient Packet Forwarding: A Reinforcement Learning Approach to Routing in Gaussian Interconnected Networks with Clustered Faults, Mohammad Walid Charrwi et.al., Paper: http://arxiv.org/abs/2512.20394
- 2025-11-17, Resilient Distribution Network Planning against Dynamic Malicious Power Injection Attacks, Hampei Sasahara et.al., Paper: http://arxiv.org/abs/2511.13698
- 2026-02-05, Residual Reinforcement Learning for Waste-Container Lifting Using Large-Scale Cranes with Underactuated Tools, Qi Li et.al., Paper: http://arxiv.org/abs/2602.05895
- 2025-12-29, Replay Failures as Successes: Sample-Efficient Reinforcement Learning for Instruction Following, Kongcheng Zhang et.al., Paper: http://arxiv.org/abs/2512.23457
- 2026-03-17, Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning, Haomin Wang et.al., Paper: http://arxiv.org/abs/2603.16189
- 2025-11-25, Reinforcing Action Policies by Prophesying, Jiahui Zhang et.al., Paper: http://arxiv.org/abs/2511.20633
- 2025-11-20, Reinforcement learning of quantum circuit architectures for molecular potential energy curves, Maureen Krumtünger et.al., Paper: http://arxiv.org/abs/2511.16559
- 2026-03-02, Reinforcement Learning-Based Filters for Convection-Dominated Flows: Reference-Free and Reference-Guided Training, Anna Ivagnes et.al., Paper: http://arxiv.org/abs/2603.02086
- 2026-03-03, Reinforcement Learning with Symbolic Reward Machines, Thomas Krug et.al., Paper: http://arxiv.org/abs/2603.03068
- 2026-01-15, Reinforcement Learning with Multi-Step Lookahead Information Via Adaptive Batching, Nadav Merlis et.al., Paper: http://arxiv.org/abs/2601.10418
- 2025-10-21, Reinforcement Learning with Imperfect Transition Predictions: A Bellman-Jensen Approach, Chenbei Lu et.al., Paper: http://arxiv.org/abs/2510.18687
- 2026-03-11, Reinforcement Learning with Conditional Expectation Reward, Changyi Xiao et.al., Paper: http://arxiv.org/abs/2603.10624
- 2026-01-28, Reinforcement Learning via Self-Distillation, Jonas Hübotter et.al., Paper: http://arxiv.org/abs/2601.20802
- 2025-10-29, Reinforcement Learning techniques for the flavor problem in particle physics, A. Giarnetti et.al., Paper: http://arxiv.org/abs/2510.25495
- 2026-02-17, Reinforcement Learning in Real Option Models, Jodi Dianetti et.al., Paper: http://arxiv.org/abs/2602.15643
- 2025-12-11, Reinforcement Learning in Financial Decision Making: A Systematic Review of Performance, Challenges, and Implementation Strategies, Mohammad Rezoanul Hoque et.al., Paper: http://arxiv.org/abs/2512.10913
- 2026-03-07, Reinforcement Learning for Vehicle-to-Grid Voltage Regulation: Single-Hub to Multi-Hub Coordination with Battery-Aware Constraints, Jingbo Wang et.al., Paper: http://arxiv.org/abs/2603.07237
- 2026-04-02, Reinforcement Learning for Speculative Trading under Exploratory Framework, Yun Zhao et.al., Paper: http://arxiv.org/abs/2604.02035
- 2026-02-18, Reinforcement Learning for Parameterized Quantum State Preparation: A Comparative Study, Gerhard Stenzel et.al., Paper: http://arxiv.org/abs/2602.16523
- 2026-01-12, Reinforcement Learning for Micro-Level Claims Reserving, Benjamin Avanzi et.al., Paper: http://arxiv.org/abs/2601.07637
- 2025-10-31, Reinforcement Learning for Long-Horizon Unordered Tasks: From Boolean to Coupled Reward Machines, Kristina Levina et.al., Paper: http://arxiv.org/abs/2510.27329
- 2026-03-13, Reinforcement Learning for Discounted and Ergodic Control of Diffusion Processes, Erhan Bayraktar et.al., Paper: http://arxiv.org/abs/2603.13155
- 2025-12-15, Reinforcement Learning based 6-DoF Maneuvers for Microgravity Intravehicular Docking: A Simulation Study with Int-Ball2 in ISS-JEM, Aman Arora et.al., Paper: http://arxiv.org/abs/2512.13514
- 2025-10-23, Reinforcement Learning and Consumption-Savings Behavior, Brandon Kaplowitz et.al., Paper: http://arxiv.org/abs/2510.20748
- 2025-11-05, Reinforcement Learning Using known Invariances, Alexandru Cioba et.al., Paper: http://arxiv.org/abs/2511.03473
- 2025-12-09, Reinforcement Learning From State and Temporal Differences, Lex Weaver et.al., Paper: http://arxiv.org/abs/2512.08855
- 2026-01-05, Reinforcement Learning Based Computationally Efficient Conditional Choice Simulation Estimation of Dynamic Discrete Choice Models, Ahmed Khwaja et.al., Paper: http://arxiv.org/abs/2601.02069
- 2026-02-03, Reinforcement Fine-Tuning for History-Aware Dense Retriever in RAG, Yicheng Zhang et.al., Paper: http://arxiv.org/abs/2602.03645
- 2026-03-31, Reinforced Reasoning for End-to-End Retrosynthetic Planning, Chenyang Zuo et.al., Paper: http://arxiv.org/abs/2603.29723
- 2026-02-18, Reinforced Fast Weights with Next-Sequence Prediction, Hee Seung Hwang et.al., Paper: http://arxiv.org/abs/2602.16704
- 2026-01-08, Reinforced Efficient Reasoning via Semantically Diverse Exploration, Ziqi Zhao et.al., Paper: http://arxiv.org/abs/2601.05053
- 2026-02-04, Reinforced Attention Learning, Bangzheng Li et.al., Paper: http://arxiv.org/abs/2602.04884
- 2025-12-18, ReinforceGen: Hybrid Skill Policies with Automated Data Generation and Reinforcement Learning, Zihan Zhou et.al., Paper: http://arxiv.org/abs/2512.16861
- 2026-01-27, Reimagining Social Robots as Recommender Systems: Foundations, Framework, and Applications, Jin Huang et.al., Paper: http://arxiv.org/abs/2601.19761
- 2026-01-27, Reimagining Peer Review Process Through Multi-Agent Mechanism Design, Ahmad Farooq et.al., Paper: http://arxiv.org/abs/2601.19778
- 2026-02-26, Regular Fourier Features for Nonstationary Gaussian Processes, Arsalan Jawaid et.al., Paper: http://arxiv.org/abs/2602.23006
- 2026-02-24, Regret-Guided Search Control for Efficient Learning in AlphaZero, Yun-Jui Tsai et.al., Paper: http://arxiv.org/abs/2602.20809
- 2026-02-03, RegionReasoner: Region-Grounded Multi-Round Visual Reasoning, Wenfang Sun et.al., Paper: http://arxiv.org/abs/2602.03733
- 2025-11-18, ReflexGrad: Three-Way Synergistic Architecture for Zero-Shot Generalization in LLM Agents, Ankush Kadu et.al., Paper: http://arxiv.org/abs/2511.14584
- 2025-11-07, Reflective Personalization Optimization: A Post-hoc Rewriting Framework for Black-Box Large Language Models, Teqi Hao et.al., Paper: http://arxiv.org/abs/2511.05286
- 2026-01-26, Reflect: Transparent Principle-Guided Reasoning for Constitutional Alignment at Scale, Henry Bell et.al., Paper: http://arxiv.org/abs/2601.18730
- 2026-04-01, RefineRL: Advancing Competitive Programming with Self-Refinement Reinforcement Learning, Shaopeng Fu et.al., Paper: http://arxiv.org/abs/2604.00790
- 2026-03-25, RefReward-SR: LR-Conditioned Reward Modeling for Preference-Aligned Super-Resolution, Yushuai Song et.al., Paper: http://arxiv.org/abs/2603.24198
- 2026-02-05, Reducing the Computational Cost Scaling of Tensor Network Algorithms via Field-Programmable Gate Array Parallelism, Songtai Lv et.al., Paper: http://arxiv.org/abs/2602.05900
- 2026-01-06, Reducing Hallucinations in LLMs via Factuality-Aware Preference Learning, Sindhuja Chaduvula et.al., Paper: http://arxiv.org/abs/2601.03027
- 2025-10-24, Reduced Floating-Point Precision Implicit Monte Carlo, Simon Butson et.al., Paper: http://arxiv.org/abs/2510.21683
- 2026-02-27, Recycling Failures: Salvaging Exploration in RLVR via Fine-Grained Off-Policy Guidance, Yanwei Ren et.al., Paper: http://arxiv.org/abs/2602.24110
- 2025-12-19, Recursive state estimation via approximate modal paths, Filip Tronarp et.al., Paper: http://arxiv.org/abs/2512.17737
- 2026-02-17, Recursive Concept Evolution for Compositional Reasoning in Large Language Models, Sarim Chaudhry et.al., Paper: http://arxiv.org/abs/2602.15725
- 2026-02-23, Recurrent Structural Policy Gradient for Partially Observable Mean Field Games, Clarisse Wibault et.al., Paper: http://arxiv.org/abs/2602.20141
- 2026-03-18, Recurrent Reasoning with Vision-Language Models for Estimating Long-Horizon Embodied Task Progress, Yuelin Zhang et.al., Paper: http://arxiv.org/abs/2603.17312
- 2025-12-23, Recurrent Off-Policy Deep Reinforcement Learning Doesn’t Have to be Slow, Tyler Clark et.al., Paper: http://arxiv.org/abs/2512.20513
- 2025-12-01, Rectifying LLM Thought from Lens of Optimization, Junnan Liu et.al., Paper: http://arxiv.org/abs/2512.01925
- 2025-12-08, Recovery of the optimal control value function in reproducing kernel Hilbert spaces from verification conditions, Tobias Ehring et.al., Paper: http://arxiv.org/abs/2512.07477
- 2026-03-10, RecThinker: An Agentic Framework for Tool-Augmented Reasoning in Recommendation, Haobo Zhang et.al., Paper: http://arxiv.org/abs/2603.09843
- 2025-12-16, RecGPT-V2 Technical Report, Chao Yi et.al., Paper: http://arxiv.org/abs/2512.14503
- 2026-03-19, Reasoning over mathematical objects: on-policy reward modeling and test time aggregation, Pranjal Aggarwal et.al., Paper: http://arxiv.org/abs/2603.18886
- 2026-03-09, Reasoning as Compression: Unifying Budget Forcing via the Conditional Information Bottleneck, Fabio Valerio Massoli et.al., Paper: http://arxiv.org/abs/2603.08462
- 2026-01-23, Reasoning Promotes Robustness in Theory of Mind Tasks, Ian B. de Haan et.al., Paper: http://arxiv.org/abs/2601.16853
- 2025-10-31, Reasoning Models Sometimes Output Illegible Chains of Thought, Arun Jose et.al., Paper: http://arxiv.org/abs/2510.27338
- 2025-10-22, Reasoning Like Experts: Leveraging Multimodal Large Language Models for Drawing-based Psychoanalysis, Xueqi Ma et.al., Paper: http://arxiv.org/abs/2510.19451
- 2026-03-02, Reasoning Core: A Scalable Procedural Data Generation Suite for Symbolic Pre-training and Post-Training, Valentin Lacombe et.al., Paper: http://arxiv.org/abs/2603.02208
- 2026-02-03, Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL, Ian Wu et.al., Paper: http://arxiv.org/abs/2602.03773
- 2025-11-13, Reasoning About Intent for Ambiguous Requests, Irina Saparina et.al., Paper: http://arxiv.org/abs/2511.10453
- 2025-11-05, Realization of repulsive polarons in the strongly correlated regime, René Henke et.al., Paper: http://arxiv.org/abs/2511.03569
- 2025-12-04, Realizable Abstractions: Near-Optimal Hierarchical Reinforcement Learning, Roberto Cipollone et.al., Paper: http://arxiv.org/abs/2512.04958
- 2025-10-31, Realistic pedestrian-driver interaction modelling using multi-agent RL with human perceptual-motor constraints, Yueyang Wang et.al., Paper: http://arxiv.org/abs/2510.27383
- 2025-12-05, Real-time Remote Tracking and Autonomous Planning for Whale Rendezvous using Robots, Sushmita Bhattacharya et.al., Paper: http://arxiv.org/abs/2512.05808
- 2025-11-07, Real-World Adverse Weather Image Restoration via Dual-Level Reinforcement Learning with High-Quality Cold Start, Fuyang Liu et.al., Paper: http://arxiv.org/abs/2511.05095
- 2025-11-10, Real-Time LiDAR Super-Resolution via Frequency-Aware Multi-Scale Fusion, June Moh Goo et.al., Paper: http://arxiv.org/abs/2511.07377
- 2025-10-24, Real-Time Gait Adaptation for Quadrupeds using Model Predictive Control and Reinforcement Learning, Prakrut Kotecha et.al., Paper: http://arxiv.org/abs/2510.20706
- 2026-03-04, Real Eyes Realize Faster: Gaze Stability and Pupil Novelty for Efficient Egocentric Learning, Ajan Subramanian et.al., Paper: http://arxiv.org/abs/2603.04098
- 2026-03-20, ReViSQL: Achieving Human-Level Text-to-SQL, Yuxuan Zhu et.al., Paper: http://arxiv.org/abs/2603.20004
- 2025-12-02, ReVSeg: Incentivizing the Reasoning Chain for Video Segmentation with Reinforcement Learning, Yifan Li et.al., Paper: http://arxiv.org/abs/2512.02835
- 2026-03-10, ReTac-ACT: A State-Gated Vision-Tactile Fusion Transformer for Precision Assembly, Minchi Ruan et.al., Paper: http://arxiv.org/abs/2603.09565
- 2026-03-11, ReTabSyn: Realistic Tabular Data Synthesis via Reinforcement Learning, Xiaofeng Lin et.al., Paper: http://arxiv.org/abs/2603.10823
- 2026-02-23, ReSyn: Autonomously Scaling Synthetic Environments for Reasoning Models, Andre He et.al., Paper: http://arxiv.org/abs/2602.20117
- 2025-12-18, RePlan: Reasoning-guided Region Planning for Complex Instruction-based Image Editing, Tianyuan Qu et.al., Paper: http://arxiv.org/abs/2512.16864
- 2026-03-20, ReLi3D: Relightable Multi-view 3D Reconstruction with Disentangled Illumination, Jan-Niklas Dihlmann et.al., Paper: http://arxiv.org/abs/2603.19753
- 2025-12-08, ReLaX: Reasoning with Latent Exploration for Large Reasoning Models, Shimin Zhang et.al., Paper: http://arxiv.org/abs/2512.07558
- 2026-03-18, ReLMXEL: Adaptive RL-Based Memory Controller with Explainable Energy and Latency Optimization, Panuganti Chirag Sai et.al., Paper: http://arxiv.org/abs/2603.17309
- 2025-11-21, ReBaPL: Repulsive Bayesian Prompt Learning, Yassir Bendou et.al., Paper: http://arxiv.org/abs/2511.17339
- 2025-12-24, ReACT-Drug: Reaction-Template Guided Reinforcement Learning for de novo Drug Design, R Yadunandan et.al., Paper: http://arxiv.org/abs/2512.20958
- 2026-02-04, Rationality Measurement and Theory for Reinforcement Learning Agents, Kejiang Qian et.al., Paper: http://arxiv.org/abs/2602.04737
- 2026-03-17, Rationale Matters: Learning Transferable Rubrics via Proxy-Guided Critique for VLMReward Models, Weijie Qiu et.al., Paper: http://arxiv.org/abs/2603.16600
- 2025-10-21, Ranking-based Preference Optimization for Diffusion Models from Implicit User Feedback, Yi-Lun Wu et.al., Paper: http://arxiv.org/abs/2510.18353
- 2026-01-26, Rank-1 Approximation of Inverse Fisher for Natural Policy Gradients in Deep Reinforcement Learning, Yingxiao Huo et.al., Paper: http://arxiv.org/abs/2601.18626
- 2026-03-28, Rainbow-DemoRL: Combining Improvements in Demonstration-Augmented Reinforcement Learning, Dwait Bhatt et.al., Paper: http://arxiv.org/abs/2603.27400
- 2026-03-13, Radiative return meets GVMD, Pau Petit Rosàs et.al., Paper: http://arxiv.org/abs/2603.13171
- 2026-01-27, Radiative return at NLOPS accuracy, Ettore Budassi et.al., Paper: http://arxiv.org/abs/2601.19530
- 2026-01-27, R^3: Replay, Reflection, and Ranking Rewards for LLM Reinforcement Learning, Zhizheng Jiang et.al., Paper: http://arxiv.org/abs/2601.19620
- 2026-03-29, RTLSeek: Boosting the LLM-Based RTL Generation with Multi-Stage Diversity-Oriented Reinforcement Learning, Xinyu Zhang et.al., Paper: http://arxiv.org/abs/2603.27630
- 2025-05-13, ROLeR: Effective Reward Shaping in Offline Reinforcement Learning for Recommender Systems, Yi Zhang et.al., Paper: http://arxiv.org/abs/2407.13163
- 2026-01-30, RN-D: Discretized Categorical Actors with Regularized Networks for On-Policy Reinforcement Learning, Yuexin Bian et.al., Paper: http://arxiv.org/abs/2601.23075
- 2026-01-20, RM-Distiller: Exploiting Generative LLM for Reward Model Distillation, Hongli Zhou et.al., Paper: http://arxiv.org/abs/2601.14032
- 2026-02-08, RLinf-USER: A Unified and Extensible System for Real-World Online Policy Learning in Embodied AI, Hongzhi Zang et.al., Paper: http://arxiv.org/abs/2602.07837
- 2026-03-21, RLVR Training of LLMs Does Not Improve Thinking Ability for General QA: Evaluation Method and a Simple Solution, Kaiyuan Li et.al., Paper: http://arxiv.org/abs/2603.20799
- 2025-11-10, RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments, Zhiyuan Zeng et.al., Paper: http://arxiv.org/abs/2511.07317
- 2026-02-02, RLAnything: Forge Environment, Policy, and Reward Model in Completely Dynamic RL System, Yinjie Wang et.al., Paper: http://arxiv.org/abs/2602.02488
- 2026-02-23, RL-RIG: A Generative Spatial Reasoner via Intrinsic Reflection, Tianyu Wang et.al., Paper: http://arxiv.org/abs/2602.19974
- 2025-12-08, RL-MTJail: Reinforcement Learning for Automated Black-Box Multi-Turn Jailbreaking of Large Language Models, Xiqiao Xiong et.al., Paper: http://arxiv.org/abs/2512.07761
- 2025-10-20, RL-Driven Security-Aware Resource Allocation Framework for UAV-Assisted O-RAN, Zaineh Abughazzah et.al., Paper: http://arxiv.org/abs/2510.18084
- 2026-01-20, RL-BioAug: Label-Efficient Reinforcement Learning for Self-Supervised EEG Representation Learning, Cheol-Hui Lee et.al., Paper: http://arxiv.org/abs/2601.13964
- 2026-03-03, RL-Based Coverage Path Planning for Deformable Objects on 3D Surfaces, Yuhang Zhang et.al., Paper: http://arxiv.org/abs/2603.03137
- 2026-03-11, RL-Augmented MPC for Non-Gaited Legged and Hybrid Locomotion, Andrea Patrizi et.al., Paper: http://arxiv.org/abs/2603.10878
- 2025-11-04, RL-Aided Cognitive ISAC: Robust Detection and Sensing-Communication Trade-offs, Adam Umra et.al., Paper: http://arxiv.org/abs/2511.02672
- 2026-01-08, RL-AWB: Deep Reinforcement Learning for Auto White Balance Correction in Low-Light Night-time Scenes, Yuan-Kang Lee et.al., Paper: http://arxiv.org/abs/2601.05249
- 2026-03-25, RKKY-dipolar Interactions and 3D Spin Supersolid on Stacked Triangular Lattice, Ning Xi et.al., Paper: http://arxiv.org/abs/2603.24446
- 2026-02-11, RISE: Self-Improving Robot Policy with Compositional World Model, Jiazhi Yang et.al., Paper: http://arxiv.org/abs/2602.11075
- 2025-12-23, RIS-Empowered OTFS Modulation With Faster-than-Nyquist Signaling in High-Mobility Wireless Communications, Chaorong Zhang et.al., Paper: http://arxiv.org/abs/2512.20332
- 2025-12-10, RIFT: A Scalable Methodology for LLM Accelerator Fault Assessment using Reinforcement Learning, Khurram Khalil et.al., Paper: http://arxiv.org/abs/2512.09829
- 2026-02-18, RIDER: 3D RNA Inverse Design with Reinforcement Learning-Guided Diffusion, Tianmeng Hu et.al., Paper: http://arxiv.org/abs/2602.16548
- 2025-10-24, RETuning: Upgrading Inference-Time Scaling for Stock Movement Prediction with Large Language Models, Xueyuan Lin et.al., Paper: http://arxiv.org/abs/2510.21604
- 2025-10-20, RESample: A Robust Data Augmentation Framework via Exploratory Sampling for Robotic Manipulation, Yuquan Xue et.al., Paper: http://arxiv.org/abs/2510.17640
- 2026-03-03, RAPO: Expanding Exploration for LLM Agents via Retrieval-Augmented Policy Optimization, Siwei Zhang et.al., Paper: http://arxiv.org/abs/2603.03078
- 2026-02-25, RAMSeS: Robust and Adaptive Model Selection for Time-Series Anomaly Detection Algorithms, Mohamed Abdelmaksoud et.al., Paper: http://arxiv.org/abs/2602.21766
- 2026-03-18, RAMP: Reinforcement Adaptive Mixed Precision Quantization for Efficient On Device LLM Inference, Arpit Singh Gautam et.al., Paper: http://arxiv.org/abs/2603.17891
- 2026-02-25, RADAR: Reasoning as Discrimination with Aligned Representations for LLM-based Knowledge Graph Reasoning, Bo Xue et.al., Paper: http://arxiv.org/abs/2602.21951
- 2026-03-12, RADAR: Closed-Loop Robotic Data Generation via Semantic Planning and Autonomous Causal Environment Reset, Yongzhong Wang et.al., Paper: http://arxiv.org/abs/2603.11811
- 2025-12-16, RADAR: Accelerating Large Language Model Inference With RL-Based Dynamic Draft Trees, Junjie Ma et.al., Paper: http://arxiv.org/abs/2512.14069
- 2025-11-21, R2PS: Worst-Case Robust Real-Time Pursuit Strategies under Partial Observability, Runyu Lu et.al., Paper: http://arxiv.org/abs/2511.17367
- 2025-10-20, R2L: Reliable Reinforcement Learning: Guaranteed Return & Reliable Policies in Reinforcement Learning, Nadir Farhi et.al., Paper: http://arxiv.org/abs/2510.18074
- 2025-10-20, R2BC: Multi-Agent Imitation Learning from Single-Agent Demonstrations, Connor Mattson et.al., Paper: http://arxiv.org/abs/2510.18085
- 2026-03-26, R-C2: Cycle-Consistent Reinforcement Learning Improves Multimodal Reasoning, Zirui Zhang et.al., Paper: http://arxiv.org/abs/2603.25720
- 2026-02-06, R-Align: Enhancing Generative Reward Models through Rationale-Centric Meta-Judging, Yanlin Lai et.al., Paper: http://arxiv.org/abs/2602.06763
- 2026-04-01, Query-Conditioned Evidential Keyframe Sampling for MLLM-Based Long-Form Video Understanding, Yiheng Wang et.al., Paper: http://arxiv.org/abs/2604.01002
- 2025-12-19, Quenching statistics in Si and Ge SPADs using particle Monte Carlo simulation, Philippe Dollfus et.al., Paper: http://arxiv.org/abs/2512.17818
- 2025-10-20, QueST: Incentivizing LLMs to Generate Difficult Problems, Hanxu Hu et.al., Paper: http://arxiv.org/abs/2510.17715
- 2026-01-12, Quasi-optimal quantum Markov chain spectral gap estimation, Adam Connolly et.al., Paper: http://arxiv.org/abs/2601.07601
- 2025-11-20, Quasi-metric spaces on which real-valued continuous functions are uniformly continuous, Om Dev Singh et.al., Paper: http://arxiv.org/abs/2511.16503
- 2025-12-31, Quasi-Maximum Likelihood Estimation for a Genuinely Unbalanced Dynamic Network Panel Data Model, Zhijian Wang et.al., Paper: http://arxiv.org/abs/2512.24748
- 2026-01-26, Quasi Monte Carlo methods enable extremely low-dimensional deep generative models, Miles Martinez et.al., Paper: http://arxiv.org/abs/2601.18676
- 2026-02-18, Quantum-classical correspondence for spins at finite temperatures with application to Monte Carlo simulations, A. El Mendili et.al., Paper: http://arxiv.org/abs/2602.16501
- 2025-12-23, Quantum vs thermal fluctuations in phase transitions of two-dimensional superconductors, Andrea Ponticelli et.al., Paper: http://arxiv.org/abs/2512.20476
- 2026-03-10, Quantum spin ladder with ferromagnetic rungs in Bi $_2$CuO$_3$(SO$_4$ ), Rodolfo A. Rangel Hernandez et.al., Paper: http://arxiv.org/abs/2603.09442
- 2025-11-19, Quantum measurement tomography with mini-batch stochastic gradient descent, Akshay Gaikwad et.al., Paper: http://arxiv.org/abs/2511.15682
- 2025-11-20, Quantum corrections in general relativity explored through a GUP-inspired maximal acceleration analysis, Christian Corda et.al., Paper: http://arxiv.org/abs/2511.16502
- 2026-01-07, Quantum computing for multidimensional option pricing: End-to-end pipeline, Julien Hok et.al., Paper: http://arxiv.org/abs/2601.04049
- 2026-04-01, Quantum Statistical Bootstrap, Yongkai Chen et.al., Paper: http://arxiv.org/abs/2604.00951
- 2026-02-03, Quantum Speedups for Derivative Pricing Beyond Black-Scholes, Dylan Herman et.al., Paper: http://arxiv.org/abs/2602.03725
- 2026-02-05, Quantum Reinforcement Learning with Transformers for the Capacitated Vehicle Routing Problem, Eva Andrés et.al., Paper: http://arxiv.org/abs/2602.05920
- 2025-10-22, Quantum Monte Carlo study of low-dimensional Fermi fluids of dipolar atoms, Clio Johnson et.al., Paper: http://arxiv.org/abs/2510.19533
- 2025-12-19, Quantum Monte Carlo studies of U(1) lattice gauge models of Kondo breakdown, Gaopei Pan et.al., Paper: http://arxiv.org/abs/2512.17801
- 2026-01-07, Quantum Monte Carlo Simulations for predicting electron-positron pair production via the linear Breit-Wheeler process, Lucas I. Iñigo Gamiz et.al., Paper: http://arxiv.org/abs/2601.03953
- 2025-11-28, Quantum Matrix Spherical Functions, Stein Meereboer et.al., Paper: http://arxiv.org/abs/2511.23367
- 2025-10-22, Quantum Machine Learning methods for Fourier-based distribution estimation with application in option pricing, Fernando Alonso et.al., Paper: http://arxiv.org/abs/2510.19494
- 2026-03-20, Quantum Fisher Information as a Probe of Critical Scaling in Frustrated Magnets: Signatures from Kagome Quantum Spin Liquid, Zhengbang Zhou et.al., Paper: http://arxiv.org/abs/2603.19951
- 2026-03-17, Quantum Brownian Motion: proving that the Schmid transition belongs to the Berezinskii-Kosterlitz-Thouless universality class, Francesco G. Capone et.al., Paper: http://arxiv.org/abs/2603.16227
- 2026-04-01, Quantum Algorithms for Gibbs Expectation of Non-log-concave and Heavy-tailed Distributions, Xinmiao Li et.al., Paper: http://arxiv.org/abs/2604.00656
- 2026-01-13, QuantEval: A Benchmark for Financial Quantitative Tasks in Large Language Models, Zhaolu Kang et.al., Paper: http://arxiv.org/abs/2601.08689
- 2026-03-24, Quality Over Clicks: Intrinsic Quality-Driven Iterative Reinforcement Learning for Cold-Start E-Commerce Query Suggestion, Qi Sun et.al., Paper: http://arxiv.org/abs/2603.22922
- 2025-11-18, Quadratic Term Correction on Heaps’ Law, Oscar Fontanelli et.al., Paper: http://arxiv.org/abs/2511.14683
- 2025-12-15, QoS-Aware State-Augmented Learnable Framework for 5G NR-U/Wi-Fi Coexistence: Impact of Parameter Selection and Enhanced Collision Resolution, Mohammad Reza Fasihi et.al., Paper: http://arxiv.org/abs/2512.13393
- 2025-11-07, QUESTER: Query Specification for Generative Retrieval, Arthur Satouf et.al., Paper: http://arxiv.org/abs/2511.05301
- 2025-11-19, QTIS: A QAOA-Based Quantum Time Interval Scheduler, José A. Tirado-Domínguez et.al., Paper: http://arxiv.org/abs/2511.15590
- 2025-11-05, QMeCha: quantum Monte Carlo package for fermions in embedding environments, Matteo Barborini et.al., Paper: http://arxiv.org/abs/2511.03439
- 2025-12-18, QMCkl: A Kernel Library for Quantum Monte Carlo Applications, Emiel Slootman et.al., Paper: http://arxiv.org/abs/2512.16677
- 2026-03-03, QFlowNet: Fast, Diverse, and Efficient Unitary Synthesis with Generative Flow Networks, Inhoe Koo et.al., Paper: http://arxiv.org/abs/2603.03045
- 2026-01-20, Q-learning with Adjoint Matching, Qiyang Li et.al., Paper: http://arxiv.org/abs/2601.14234
- 2026-03-20, Q-approximation of operating characteristics of clinical trial designs, Susanna Gentile et.al., Paper: http://arxiv.org/abs/2603.20022
- 2025-11-10, Q-RAG: Long Context Multi-step Retrieval via Value-based Embedder Training, Artyom Sorokin et.al., Paper: http://arxiv.org/abs/2511.07328
- 2025-12-26, Q-A3C2: Quantum Reinforcement Learning with Time-Series Dynamic Clustering for Adaptive ETF Stock Selection, Yen-Ku Liu et.al., Paper: http://arxiv.org/abs/2512.21819
- 2026-03-28, PyINLA: Fast Bayesian Inference for Latent Gaussian Models in Python, Esmail Abdul Fattah et.al., Paper: http://arxiv.org/abs/2603.27276
- 2025-10-29, PyDPF: A Python Package for Differentiable Particle Filtering, John-Joseph Brady et.al., Paper: http://arxiv.org/abs/2510.25693
- 2025-12-16, PushGen: Push Notifications Generation with LLM, Shifu Bie et.al., Paper: http://arxiv.org/abs/2512.14490
- 2026-02-08, Pruning as a Cooperative Game: Surrogate-Assisted Layer Contribution Estimation for Large Language Models, Xuan Ding et.al., Paper: http://arxiv.org/abs/2602.07804
- 2026-01-13, Provably Safe Reinforcement Learning using Entropy Regularizer, Abhijit Mazumdar et.al., Paper: http://arxiv.org/abs/2601.08646
- 2025-10-20, Provably Optimal Reinforcement Learning under Safety Filtering, Donggeon David Oh et.al., Paper: http://arxiv.org/abs/2510.18082
- 2026-02-25, Provable Last-Iterate Convergence for Multi-Objective Safe LLM Alignment via Optimistic Primal-Dual, Yining Li et.al., Paper: http://arxiv.org/abs/2602.22146
- 2025-11-10, Provable Benefit of Curriculum in Transformer Tree-Reasoning Post-Training, Dake Bu et.al., Paper: http://arxiv.org/abs/2511.07372
- 2025-12-08, Prospects for measuring electroweak production of $Zγγ$ and 2 jets at the LHC, Ran Ding et.al., Paper: http://arxiv.org/abs/2512.07689
- 2025-10-29, Prospects for a 95 GeV Higgs Boson at Future Higgs Factories with Transformer Networks, Yabo Dong et.al., Paper: http://arxiv.org/abs/2510.24662
- 2026-02-02, Proof-RM: A Scalable and Generalizable Reward Model for Math Proof, Haotong Yang et.al., Paper: http://arxiv.org/abs/2602.02377
- 2026-01-15, Projected Microbatch Accumulation yields reference-free proximal policy updates for reinforcement learning, Nilin Abrahamsen et.al., Paper: http://arxiv.org/abs/2601.10498
- 2025-12-24, Production of charmed particles in proton-proton and light nucleus-nucleus interactions in Geant4 FTF model, A. Galoyan et.al., Paper: http://arxiv.org/abs/2512.21184
- 2026-03-18, Process Supervision for Chain-of-Thought Reasoning via Monte Carlo Net Information Gain, Corentin Royer et.al., Paper: http://arxiv.org/abs/2603.17815
- 2026-03-02, Process Over Outcome: Cultivating Forensic Reasoning for Generalizable Multimodal Manipulation Detection, Yuchen Zhang et.al., Paper: http://arxiv.org/abs/2603.01993
- 2026-03-18, Procedural Generation of Algorithm Discovery Tasks in Machine Learning, Alexander D. Goldie et.al., Paper: http://arxiv.org/abs/2603.17863
- 2025-12-05, Probing the effectiveness of World Models for Spatial Reasoning through Test-time Scaling, Saurav Jha et.al., Paper: http://arxiv.org/abs/2512.05809
- 2025-10-31, Probing cosmic isotropy with Gamma-ray bursts: A dipole and quadrupole analysis of BATSE and Fermi GBM data, Debosi Mondal et.al., Paper: http://arxiv.org/abs/2510.27644
- 2026-03-10, Probing Physics Beyond the Standard Model through Combined Analyses of Next-Generation Type Ia Supernova, CMB, and BAO Surveys, Srinivasan Raghunathan et.al., Paper: http://arxiv.org/abs/2603.09973
- 2026-03-25, Probing Interacting Dark Sectors with upcoming Post-Reionization and Galaxy Surveys, Rahul Shah et.al., Paper: http://arxiv.org/abs/2603.24554
- 2026-02-24, Probing Dec-POMDP Reasoning in Cooperative MARL, Kale-ab Tessera et.al., Paper: http://arxiv.org/abs/2602.20804
- 2026-02-13, ProbeLLM: Automating Principled Diagnosis of LLM Failures, Yue Huang et.al., Paper: http://arxiv.org/abs/2602.12966
- 2026-03-19, Probabilistic multivariate statistical process control via kernel parameter uncertainty propagation, Zina-Sabrina Duma et.al., Paper: http://arxiv.org/abs/2603.19055
- 2025-10-21, Proactive Reasoning-with-Retrieval Framework for Medical Multimodal Large Language Models, Lehan Wang et.al., Paper: http://arxiv.org/abs/2510.18303
- 2026-03-03, Proactive Guiding Strategy for Item-side Fairness in Interactive Recommendation, Chongjun Xia et.al., Paper: http://arxiv.org/abs/2603.03094
- 2026-03-19, ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents, Hao Zhang et.al., Paper: http://arxiv.org/abs/2603.18815
- 2025-12-29, ProGuard: Towards Proactive Multimodal Safeguard, Shaohan Yu et.al., Paper: http://arxiv.org/abs/2512.23573
- 2026-04-02, ProCeedRL: Process Critic with Exploratory Demonstration Reinforcement Learning for LLM Agentic Reasoning, Jingyue Gao et.al., Paper: http://arxiv.org/abs/2604.02006
- 2026-03-03, PrivMedChat: End-to-End Differentially Private RLHF for Medical Dialogue Systems, Sudip Bhujel et.al., Paper: http://arxiv.org/abs/2603.03054
- 2026-01-13, Prism: Towards Lowering User Cognitive Load in LLMs via Complex Intent Understanding, Zenghua Liao et.al., Paper: http://arxiv.org/abs/2601.08653
- 2025-12-03, Primary gravitational waves at high frequencies I: Origin of suppression in the power spectrum, Alipriyo Hoory et.al., Paper: http://arxiv.org/abs/2512.03959
- 2026-02-17, Pricing Discrete and Nonlinear Markets With Semidefinite Relaxations, Cheng Guo et.al., Paper: http://arxiv.org/abs/2602.15722
- 2025-12-01, Prescribed energy solutions of concave-convex type problems involving sign-changing or vanishing weights, Kanishka Perera et.al., Paper: http://arxiv.org/abs/2512.01931
- 2025-12-10, Prefrontal scaling of reward prediction error readout gates reinforcement-derived adaptive behavior in primates, Tian Sang et.al., Paper: http://arxiv.org/abs/2512.09761
- 2025-10-21, Preference-based Reinforcement Learning beyond Pairwise Comparisons: Benefits of Multiple Options, Joongkyu Lee et.al., Paper: http://arxiv.org/abs/2510.18713
- 2026-02-27, Preference Packing: Efficient Preference Optimization for Large Language Models, Jaekyung Cho et.al., Paper: http://arxiv.org/abs/2602.24082
- 2026-02-08, Preference Conditioned Multi-Objective Reinforcement Learning: Decomposed, Diversity-Driven Policy Optimization, Tanmay Ambadkar et.al., Paper: http://arxiv.org/abs/2602.07764
- 2025-12-03, Predicting parameters of a model cuprate superconductor using machine learning, V. A. Ulitko et.al., Paper: http://arxiv.org/abs/2512.04024
- 2025-10-24, Predicted observational effects of rapid rotation for Be stars, Rina G. Rast et.al., Paper: http://arxiv.org/abs/2510.21640
- 2026-03-19, Preconditioning Hamiltonian Monte Carlo by minimizing Fisher Divergence, Adrian Seyboldt et.al., Paper: http://arxiv.org/abs/2603.18845
- 2025-11-25, Precision thermodynamics of the strongly interacting Fermi gas in two dimensions, S. Ramachandran et.al., Paper: http://arxiv.org/abs/2511.20599
- 2026-01-02, Precision Autotuning for Linear Solvers via Contextual Bandit-Based RL, Erin Carson et.al., Paper: http://arxiv.org/abs/2601.00728
- 2025-11-07, PreResQ-R1: Towards Fine-Grained Rank-and-Score Reinforcement Learning for Visual Quality Assessment via Preference-Response Disentangled Policy Optimization, Zehui Feng et.al., Paper: http://arxiv.org/abs/2511.05393
- 2025-10-22, Practical algorithm for simulating thermal pure quantum states, Wei-Bo He et.al., Paper: http://arxiv.org/abs/2510.19504
- 2025-12-18, Posterior Behavioral Cloning: Pretraining BC Policies for Efficient RL Finetuning, Andrew Wagenmaker et.al., Paper: http://arxiv.org/abs/2512.16911
- 2025-12-03, PosterCopilot: Toward Layout Reasoning and Controllable Editing for Professional Graphic Design, Jiazhe Wei et.al., Paper: http://arxiv.org/abs/2512.04082
- 2026-01-29, Post-Disaster Resource Redistribution and Cooperation Evolution Based on Two-Layer Network Evolutionary Games, Yu Chen et.al., Paper: http://arxiv.org/abs/2601.22021
- 2026-01-06, Post-Decision State-Based Online Learning for Delay-Energy-Aware Flow Allocation in Wireless Systems, Mahesh Ganesh Bhat et.al., Paper: http://arxiv.org/abs/2601.03108
- 2026-01-28, Positive-Unlabeled Reinforcement Learning Distillation for On-Premise Small Models, Zhiqiang Kou et.al., Paper: http://arxiv.org/abs/2601.20687
- 2026-02-02, Position: Explaining Behavioral Shifts in Large Language Models Requires a Comparative Approach, Martino Ciaperoni et.al., Paper: http://arxiv.org/abs/2602.02304
- 2026-03-24, Portfolio Optimization under Recursive Utility via Reinforcement Learning, Minkey Chang et.al., Paper: http://arxiv.org/abs/2603.22880
- 2025-11-20, Polynomial-Time Algorithms for Computing the Nucleolus: An Assessment, Holger I. Meinhardt et.al., Paper: http://arxiv.org/abs/2511.16517
- 2026-03-25, Polynomial Speedup in Diffusion Models with the Multilevel Euler-Maruyama Method, Arthur Jacot et.al., Paper: http://arxiv.org/abs/2603.24594
- 2026-03-24, Policy-based Tuning of Autoregressive Image Models with Instance- and Distribution-Level Rewards, Orhun Buğra Baran et.al., Paper: http://arxiv.org/abs/2603.23086
- 2025-12-24, Policy-Conditioned Policies for Multi-Agent Task Solving, Yue Lin et.al., Paper: http://arxiv.org/abs/2512.21024
- 2026-01-14, Policy-Based Reinforcement Learning with Action Masking for Dynamic Job Shop Scheduling under Uncertainty: Handling Random Arrivals and Machine Failures, Sofiene Lassoued et.al., Paper: http://arxiv.org/abs/2601.09293
- 2026-01-16, Policy-Based Deep Reinforcement Learning Hyperheuristics for Job-Shop Scheduling Problems, Sofiene Lassoued et.al., Paper: http://arxiv.org/abs/2601.11189
- 2025-11-10, Policy Learning for Perturbance-wise Linear Quadratic Control Problem, Haoran Zhang et.al., Paper: http://arxiv.org/abs/2511.07388
- 2026-03-06, Policy Iteration Achieves Regularized Equilibrium under Time Inconsistency, Yu-Jui Huang et.al., Paper: http://arxiv.org/abs/2603.06145
- 2026-04-01, Policy Improvement Reinforcement Learning, Huaiyang Wang et.al., Paper: http://arxiv.org/abs/2604.00860
- 2025-11-04, Policy Gradient Methods for Information-Theoretic Opacity in Markov Decision Processes, Chongyang Shi et.al., Paper: http://arxiv.org/abs/2511.02704
- 2024-12-10, Policy Agnostic RL: Offline RL and Online RL Fine-Tuning of Any Class and Backbone, Max Sobol Mark et.al., Paper: http://arxiv.org/abs/2412.06685
- 2026-01-21, Plug-and-Play Benchmarking of Reinforcement Learning Algorithms for Large-Scale Flow Control, Jannis Becktepe et.al., Paper: http://arxiv.org/abs/2601.15015
- 2026-03-17, PlotTwist: A Creative Plot Generation Framework with Small Language Models, Abhinav Thorat et.al., Paper: http://arxiv.org/abs/2603.16410
- 2025-10-20, Plasma Shape Control via Zero-shot Generative Reinforcement Learning, Niannian Wu et.al., Paper: http://arxiv.org/abs/2510.17531
- 2026-02-27, Planning from Observation and Interaction, Tyler Han et.al., Paper: http://arxiv.org/abs/2602.24121
- 2025-12-19, Planning as Descent: Goal-Conditioned Latent Trajectory Synthesis in Learned Energy Landscapes, Carlos Vélez García et.al., Paper: http://arxiv.org/abs/2512.17846
- 2025-12-19, Plane Strong Connectivity Augmentation, Stéphane Bessy et.al., Paper: http://arxiv.org/abs/2512.17904
- 2025-10-21, PlanU: Large Language Model Decision Making through Planning under Uncertainty, Ziwei Deng et.al., Paper: http://arxiv.org/abs/2510.18442
- 2025-10-23, Plan Then Retrieve: Reinforcement Learning-Guided Complex Reasoning over Knowledge Graphs, Yanlin Song et.al., Paper: http://arxiv.org/abs/2510.20691
- 2026-02-25, Pilot-Free Optimal Control over Wireless Networks: A Control-Aided Channel Prediction Approach, Minjie Tang et.al., Paper: http://arxiv.org/abs/2602.21752
- 2025-10-22, Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing, Yusu Qian et.al., Paper: http://arxiv.org/abs/2510.19808
- 2026-03-18, Physics-informed offline reinforcement learning eliminates catastrophic fuel waste in maritime routing, Aniruddha Bora et.al., Paper: http://arxiv.org/abs/2603.17319
- 2026-02-17, Physics-Informed Anomaly Detection of Terrain Material Change in Radar Imagery, Abdel Hakiem Mohamed Abbas Mohamed Ahmed et.al., Paper: http://arxiv.org/abs/2602.15618
- 2026-02-26, Physics Informed Viscous Value Representations, Hrishikesh Viswanath et.al., Paper: http://arxiv.org/abs/2602.23280
- 2025-11-25, Physically Interpretable Interatomic Potentials via Symbolic Regression and Reinforcement Learning, Bilvin Varughese et.al., Paper: http://arxiv.org/abs/2511.20506
- 2026-01-21, Physical Layer Security in Massive MIMO: Challenges and Open Research Directions Against Passive Eavesdroppers, Nipun Agarwal et.al., Paper: http://arxiv.org/abs/2601.15024
- 2026-01-22, PhysProver: Advancing Automatic Theorem Proving for Physics, Hanning Zhang et.al., Paper: http://arxiv.org/abs/2601.15737
- 2026-03-31, Phyelds: A Pythonic Framework for Aggregate Computing, Gianluca Aguzzi et.al., Paper: http://arxiv.org/abs/2603.29999
- 2026-03-13, PhaseJumps: fast computation of zeros from planar grid samples, Antti Haimi et.al., Paper: http://arxiv.org/abs/2603.13158
- 2025-12-02, Phase-Adaptive LLM Framework with Multi-Stage Validation for Construction Robot Task Allocation: A Systematic Benchmark Against Traditional Optimization Algorithms, Shyam prasad reddy Kaitha et.al., Paper: http://arxiv.org/abs/2512.02810
- 2026-02-20, Phase diagram of a lattice fermion model with symmetric mass generation, Sandip Maiti et.al., Paper: http://arxiv.org/abs/2602.18360
- 2026-03-10, Phase diagram of 4D SU(3) Yang-Mills theory at $θ=π$ via imaginary theta simulations, Akira Matsumoto et.al., Paper: http://arxiv.org/abs/2603.09604
- 2026-03-10, Phase diagram and Ashkin-Teller universality in the classical square-lattice Heisenberg-compass model, Yuchen Fan et.al., Paper: http://arxiv.org/abs/2603.09469
- 2026-01-13, PersonaDual: Balancing Personalization and Objectivity via Adaptive Reasoning, Xiaoyou Liu et.al., Paper: http://arxiv.org/abs/2601.08679
- 2026-03-26, Persistent Robot World Models: Stabilizing Multi-Step Rollouts via Reinforcement Learning, Jai Bardhan et.al., Paper: http://arxiv.org/abs/2603.25685
- 2025-12-10, Persistent Cycle Representatives and Generalized Landscapes for Codimension 1 Persistent Homology, Fabian Lenzen et.al., Paper: http://arxiv.org/abs/2512.09668
- 2025-12-23, Performative Policy Gradient: Optimality in Performative Reinforcement Learning, Debabrota Basu et.al., Paper: http://arxiv.org/abs/2512.20576
- 2025-12-03, Performance and efficiency of a transformer-based quark/gluon jet tagger in the ATLAS experiment, ATLAS Collaboration et.al., Paper: http://arxiv.org/abs/2512.03949
- 2025-11-05, PerfDojo: Automated ML Library Generation for Heterogeneous Architectures, Andrei Ivanov et.al., Paper: http://arxiv.org/abs/2511.03586
- 2026-02-17, Perceptive Humanoid Parkour: Chaining Dynamic Human Skills via Motion Matching, Zhen Wu et.al., Paper: http://arxiv.org/abs/2602.15827
- 2026-03-02, Pencil Puzzle Bench: A Benchmark for Multi-Step Verifiable Reasoning, Justin Waugh et.al., Paper: http://arxiv.org/abs/2603.02119
- 2026-03-31, Penalized GMM Framework for Inference on Functionals of Nonparametric Instrumental Variable Estimators, Edvard Bakhitov et.al., Paper: http://arxiv.org/abs/2603.29889
- 2026-02-13, Peaceful Anarcho-Accelerationism: Decentralized Full Automation for a Society of Universal Care, Eduardo C. Garrido-Merchán et.al., Paper: http://arxiv.org/abs/2602.13154
- 2025-12-18, Pattern recognition in complex systems via vector-field representations of spatio-temporal data, Ingrid Amaranta Membrillo Solis et.al., Paper: http://arxiv.org/abs/2512.16763
- 2025-12-29, PathFound: An Agentic Multimodal Model Activating Evidence-seeking Pathological Diagnosis, Shengyi Hua et.al., Paper: http://arxiv.org/abs/2512.23545
- 2026-03-14, Path-conditioned Reinforcement Learning-based Local Planning for Long-Range Navigation, Mateo Haro et.al., Paper: http://arxiv.org/abs/2603.13888
- 2026-03-24, Path Planning and Reinforcement Learning-Driven Control of On-Orbit Free-Flying Multi-Arm Robots, Álvaro Belmonte-Baeza et.al., Paper: http://arxiv.org/abs/2603.23182
- 2025-12-22, Partition Function Estimation Using Analog Quantum Processors, Thinh Le et.al., Paper: http://arxiv.org/abs/2512.19685
- 2026-01-30, Particle-Guided Diffusion Models for Partial Differential Equations, Andrew Millard et.al., Paper: http://arxiv.org/abs/2601.23262
- 2026-03-06, Partial Policy Gradients for RL in LLMs, Puneet Mathur et.al., Paper: http://arxiv.org/abs/2603.06138
- 2026-01-02, Parametrized Sharing for Multi-Agent Hybrid DRL for Multiple Multi-Functional RISs-Aided Downlink NOMA Networks, Chi-Te Kuo et.al., Paper: http://arxiv.org/abs/2601.00538
- 2026-03-07, Parametric modal regression for right-censored positive responses, Christian E. Galarza et.al., Paper: http://arxiv.org/abs/2603.07099
- 2025-11-14, Parameterized complexity of the f-Critical Set problem, Thiago Marcilon et.al., Paper: http://arxiv.org/abs/2511.11546
- 2025-11-13, Parallel and GPU accelerated code for phase-field and reaction-diffusion simulations, Steven A. Silber et.al., Paper: http://arxiv.org/abs/2511.10508
- 2026-02-25, PanoEnv: Exploring 3D Spatial Intelligence in Panoramic Environments with Reinforcement Learning, Zekai Lin et.al., Paper: http://arxiv.org/abs/2602.21992
- 2025-10-30, PairUni: Pairwise Training for Unified Multimodal Language Models, Jiani Zheng et.al., Paper: http://arxiv.org/abs/2510.25682
- 2025-10-28, Pair Approximation Meets Reality: Diffusion of Innovation in Organizational Networks within the biased-independence q-Voter Model, Angelika Abramiuk-Szurlej et.al., Paper: http://arxiv.org/abs/2510.24447
- 2026-02-23, PackFlow: Generative Molecular Crystal Structure Prediction via Reinforcement Learning Alignment, Akshay Subramanian et.al., Paper: http://arxiv.org/abs/2602.20140
- 2026-01-22, PUMA: Perception-driven Unified Foothold Prior for Mobility Augmented Quadruped Parkour, Liang Wang et.al., Paper: http://arxiv.org/abs/2601.15995
- 2025-11-24, PRInTS: Reward Modeling for Long-Horizon Information Seeking, Jaewoo Lee et.al., Paper: http://arxiv.org/abs/2511.19314
- 2026-02-02, PRISM: Performer RS-IMLE for Single-pass Multisensory Imitation Learning, Amisha Bhaskar et.al., Paper: http://arxiv.org/abs/2602.02396
- 2026-02-20, PRISM: Parallel Reward Integration with Symmetry for MORL, Finn van der Knaap et.al., Paper: http://arxiv.org/abs/2602.18277
- 2026-01-26, POPE: Learning to Reason on Hard Problems via Privileged On-Policy Exploration, Yuxiao Qu et.al., Paper: http://arxiv.org/abs/2601.18779
- 2026-03-13, PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses, Chenlong Yin et.al., Paper: http://arxiv.org/abs/2603.13026
- 2026-02-24, PIME: Prototype-based Interpretable MCTS-Enhanced Brain Network Analysis for Disorder Diagnosis, Kunyu Zhang et.al., Paper: http://arxiv.org/abs/2602.21046
- 2025-10-21, PGTT: Phase-Guided Terrain Traversal for Perceptive Legged Locomotion, Alexandros Ntagkas et.al., Paper: http://arxiv.org/abs/2510.18348
- 2026-01-15, PERM: Psychology-grounded Empathetic Reward Modeling for Large Language Models, Chengbing Wang et.al., Paper: http://arxiv.org/abs/2601.10532
- 2026-02-04, PDF-HR: Pose Distance Fields for Humanoid Robots, Yi Gu et.al., Paper: http://arxiv.org/abs/2602.04851
- 2025-10-21, PCMS: Parallel Coupler For Multimodel Simulations, Jacob S. Merson et.al., Paper: http://arxiv.org/abs/2510.18838
- 2025-11-17, P1: Mastering Physics Olympiads with Reinforcement Learning, Jiacheng Chen et.al., Paper: http://arxiv.org/abs/2511.13612
- 2026-02-12, P-GenRM: Personalized Generative Reward Model with Test-time User-based Scaling, Pinyi Zhang et.al., Paper: http://arxiv.org/abs/2602.12116
- 2026-01-06, P-Check: Advancing Personalized Reward Model via Learning to Generate Dynamic Checklist, Kwangwook Seo et.al., Paper: http://arxiv.org/abs/2601.02986
- 2025-10-20, Oxidation State Dynamics and Emerging Patterns in Magnetite, Emre Gürsoy et.al., Paper: http://arxiv.org/abs/2510.18061
- 2026-02-24, Overton Pluralistic Reinforcement Learning for Large Language Models, Yu Fu et.al., Paper: http://arxiv.org/abs/2602.20759
- 2026-01-21, Outcome-Based RL Provably Leads Transformers to Reason, but Only With the Right Data, Yuval Ran-Milo et.al., Paper: http://arxiv.org/abs/2601.15158
- 2026-02-04, Outcome Accuracy is Not Enough: Aligning the Reasoning Process of Reward Models, Binghai Wang et.al., Paper: http://arxiv.org/abs/2602.04649
- 2025-11-05, Outbidding and Outbluffing Elite Humans: Mastering Liar’s Poker via Self-Play and Reinforcement Learning, Richard Dewey et.al., Paper: http://arxiv.org/abs/2511.03724
- 2026-01-16, Oriented Triplet $p$ -Wave Pairing from Fermi surface Anisotropy and Nonlocal Attraction, Shuning Tan et.al., Paper: http://arxiv.org/abs/2601.11267
- 2026-03-21, Ordinal Patterns Based Testing of Spatial Independence in Irregular Spatial Structures, Giorgio Micali et.al., Paper: http://arxiv.org/abs/2603.20783
- 2025-12-12, Order statistics for multijet events, G. Chachamis et.al., Paper: http://arxiv.org/abs/2512.11697
- 2026-03-22, OrbitStream: Training-Free Adaptive 360-degree Video Streaming via Semantic Potential Fields, Aizierjiang Aiersilan et.al., Paper: http://arxiv.org/abs/2603.20999
- 2026-03-24, Orbit-Level Stretching in Cubic Fourier-Galerkin Navier-Stokes: Sharp Incidence, Spectral Decay, and a Continuation Criterion, Oleg Kiriukhin et.al., Paper: http://arxiv.org/abs/2603.23293
- 2026-03-06, OralGPT-Plus: Learning to Use Visual Tools via Reinforcement Learning for Panoramic X-ray Analysis, Yuxuan Fan et.al., Paper: http://arxiv.org/abs/2603.06366
- 2026-03-09, Oracle-Guided Soft Shielding for Safe Move Prediction in Chess, Prajit T Rajendran et.al., Paper: http://arxiv.org/abs/2603.08506
- 2025-10-22, Optimizing the Unknown: Black Box Bayesian Optimization with Energy-Based Model and Reinforcement Learning, Ruiyao Miao et.al., Paper: http://arxiv.org/abs/2510.19530
- 2026-02-18, Optimizing p-spin models through hypergraph neural networks and deep reinforcement learning, Li Zeng et.al., Paper: http://arxiv.org/abs/2602.16665
- 2026-04-02, Optimizing RAG Rerankers with LLM Feedback via Reinforcement Learning, Yuhang Wu et.al., Paper: http://arxiv.org/abs/2604.02091
- 2025-11-04, Optimizing Kernel Discrepancies via Subset Selection, Deyao Chen et.al., Paper: http://arxiv.org/abs/2511.02706
- 2026-01-20, Optimizing Energy and Data Collection in UAV-aided IoT Networks using Attention-based Multi-Objective Reinforcement Learning, Babacar Toure et.al., Paper: http://arxiv.org/abs/2601.14092
- 2025-10-20, Optimizing Energy Management of Smart Grid using Reinforcement Learning aided by Surrogate models built using Physics-informed Neural Networks, Julen Cestero et.al., Paper: http://arxiv.org/abs/2510.17380
- 2026-03-17, Optimizing Density Functional Theory for Strain-Dependent Magnetic Properties of Monolayer MnBi $_2$Te$_4$ with Diffusion Monte Carlo, Jeonghwan Ahn et.al., Paper: http://arxiv.org/abs/2603.16162
- 2026-03-06, Optimizing 3D Diffusion Models for Medical Imaging via Multi-Scale Reward Learning, Yueying Tian et.al., Paper: http://arxiv.org/abs/2603.06173
- 2026-03-25, Optimized control protocols for stable skyrmion creation using deep reinforcement learning, Ji Seok Song et.al., Paper: http://arxiv.org/abs/2603.24177
- 2025-11-25, Optimization of Sums of Bivariate Functions: An Introduction to Relaxation-Based Methods for the Case of Finite Domains, Nils Müller et.al., Paper: http://arxiv.org/abs/2511.20607
- 2026-02-10, Optimistic World Models: Efficient Exploration in Model-Based Deep Reinforcement Learning, Akshay Mete et.al., Paper: http://arxiv.org/abs/2602.10044
- 2026-03-29, Optimal resource allocation for maintaining system solvency, Gaoyue Guo et.al., Paper: http://arxiv.org/abs/2603.27622
- 2025-12-09, Optimal navigation in two-dimensional regular and turbulent flows, Vladimir Parfenyev et.al., Paper: http://arxiv.org/abs/2512.08766
- 2026-03-25, Optimal control of infinite-dimensional dissipative systems, Anthony Hastir et.al., Paper: http://arxiv.org/abs/2603.24507
- 2026-01-14, Optimal control of McKean-Vlasov systems under partial observation and hidden Markov switching, Marco Fuhrman et.al., Paper: http://arxiv.org/abs/2601.09311
- 2026-03-13, Optimal Stopping for Systems Driven by the Brownian Sheet, Nacira Agram et.al., Paper: http://arxiv.org/abs/2603.12853
- 2026-02-18, Optimal Placement and Sizing of PV-Based DG Units in a Distribution Network Considering Loading Capacity, Abhinav Sharma et.al., Paper: http://arxiv.org/abs/2602.16565
- 2025-12-09, Optimal Perturbation Budget Allocation for Data Poisoning in Offline Reinforcement Learning, Junnan Qiu et.al., Paper: http://arxiv.org/abs/2512.08485
- 2025-11-14, Optimal Dividend, Reinsurance and Capital Injection Strategies for Collaborating Business Lines: The Case of Excess-of-Loss Reinsurance, Tim J. Boonen et.al., Paper: http://arxiv.org/abs/2511.11383
- 2026-02-06, Optimal Derivative Feedback Control for an Active Magnetic Levitation System: An Experimental Study on Data-Driven Approaches, Saber Omidi et.al., Paper: http://arxiv.org/abs/2602.06944
- 2026-03-05, Optimal Decoding with the Worm, Zac Tobias et.al., Paper: http://arxiv.org/abs/2603.05428
- 2026-02-08, Optimal Control of Unbounded Stochastic Evolution Systems in Hilbert Spaces, Shanjian Tang et.al., Paper: http://arxiv.org/abs/2602.07793
- 2026-03-24, Optimal Control of Switched Systems Governed by Logical Switching Dynamics, Xiao Zhang et.al., Paper: http://arxiv.org/abs/2603.23131
- 2025-11-26, Optimal Bit Detection in Thermal Noise Communication Systems Under Rician Fading, Mohamed El Jbari et.al., Paper: http://arxiv.org/abs/2511.21649
- 2026-01-16, Optimal Abatement Schedules for Excess Carbon Emissions Towards a Net-Zero Target, Hansjoerg Albrecher et.al., Paper: http://arxiv.org/abs/2601.11348
- 2025-11-07, Optical studies of scintillation detectors for precision beta-energy measurements, S. Vanlangendonck et.al., Paper: http://arxiv.org/abs/2511.05083
- 2026-03-04, OptiQKD: A Machine Learning-Optimized Framework for Real-Time Parameter Tuning in Quantum Key Distribution, Noureldin Mohamed et.al., Paper: http://arxiv.org/abs/2603.04192
- 2026-01-26, OptiGAN for Crystal Arrays: Physics-Informed Generative Modeling of Optical Photon Transport in PET Detector Arrays, Stephan Naunheim et.al., Paper: http://arxiv.org/abs/2601.18780
- 2025-12-02, OptPO: Optimal Rollout Allocation for Test-time Policy Optimization, Youkang Wang et.al., Paper: http://arxiv.org/abs/2512.02882
- 2026-03-25, Opinion-Driven Vaccination and Epidemic Dynamics on Heterogeneous Networks, Anika Roy et.al., Paper: http://arxiv.org/abs/2603.24403
- 2026-03-18, Operator-Theoretic Foundations and Policy Gradient Methods for General MDPs with Unbounded Costs, Abhishek Gupta et.al., Paper: http://arxiv.org/abs/2603.17875
- 2026-03-12, Operator Splitting, Policy Iteration, and Machine Learning for Stochastic Optimal Control, Alain Bensoussan et.al., Paper: http://arxiv.org/abs/2603.12167
- 2026-02-13, Operator Learning for Families of Finite-State Mean-Field Games, William Hofgard et.al., Paper: http://arxiv.org/abs/2602.13169
- 2025-10-29, OpenReward: Learning to Reward Long-form Agentic Tasks via Reinforcement Learning, Ziyou Hu et.al., Paper: http://arxiv.org/abs/2510.24636
- 2025-12-01, OpenREAD: Reinforced Open-Ended Reasoing for End-to-End Autonomous Driving with LLM-as-Critic, Songyan Zhang et.al., Paper: http://arxiv.org/abs/2512.01830
- 2026-03-13, OpenACMv2: An Accuracy-Constrained Co-Optimization Framework for Approximate DCiM, Yiqi Zhou et.al., Paper: http://arxiv.org/abs/2603.13042
- 2025-10-23, Open-o3 Video: Grounded Video Reasoning with Explicit Spatio-Temporal Evidence, Jiahao Meng et.al., Paper: http://arxiv.org/abs/2510.20579
- 2026-01-20, Onset of stripe order in classical fluids: Lessons from lattice-gas mixtures, Gabriele Costa et.al., Paper: http://arxiv.org/abs/2601.14082
- 2026-02-10, Online Selective Conformal Prediction with Asymmetric Rules: A Permutation Test Approach, Mingyi Zheng et.al., Paper: http://arxiv.org/abs/2602.10018
- 2025-10-21, Online SFT for LLM Reasoning: Surprising Effectiveness of Self-Tuning without Rewards, Mengqi Li et.al., Paper: http://arxiv.org/abs/2510.18814
- 2026-01-12, Online Markov Decision Processes with Terminal Law Constraints, Bianca Marin Moreno et.al., Paper: http://arxiv.org/abs/2601.07492
- 2026-03-09, Online Learning in Semiparametric Econometric Models, Xiaohong Chen et.al., Paper: http://arxiv.org/abs/2603.08614
- 2026-03-28, Online Inertia Tensor Identification for Non-Cooperative Spacecraft via Augmented UKF, Batu Candan et.al., Paper: http://arxiv.org/abs/2603.27361
- 2026-01-08, Online Bayesian Learning of Agent Behavior in Differential Games, Francesco Bianchin et.al., Paper: http://arxiv.org/abs/2601.05087
- 2025-12-02, OneThinker: All-in-one Reasoning Model for Image and Video, Kaituo Feng et.al., Paper: http://arxiv.org/abs/2512.03043
- 2025-12-24, One Tool Is Enough: Reinforcement Learning for Repository-Level LLM Agents, Zhaoxi Zhang et.al., Paper: http://arxiv.org/abs/2512.20957
- 2026-01-28, One Step Is Enough: Dispersive MeanFlow Policy Optimization, Guowei Zou et.al., Paper: http://arxiv.org/abs/2601.20701
- 2026-01-06, One Sample to Rule Them All: Extreme Data Efficiency in RL Scaling, Yiyuan Li et.al., Paper: http://arxiv.org/abs/2601.03111
- 2026-01-26, One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment, Hongru Cai et.al., Paper: http://arxiv.org/abs/2601.18731
- 2025-10-20, OncoReason: Structuring Clinical Reasoning in LLMs for Robust and Interpretable Survival Prediction, Raghu Vamshi Hemadri et.al., Paper: http://arxiv.org/abs/2510.17532
- 2026-03-17, Onboard MuJoCo-based Model Predictive Control for Shipboard Crane with Double-Pendulum Sway Suppression, Oscar Pang et.al., Paper: http://arxiv.org/abs/2603.16407
- 2020-05-19, On-Policy Robot Imitation Learning from a Converging Supervisor, Ashwin Balakrishna et.al., Paper: http://arxiv.org/abs/1907.03423
- 2026-01-07, On-Device Deep Reinforcement Learning for Decentralized Task Offloading Performance trade-offs in the training process, Gorka Nieto et.al., Paper: http://arxiv.org/abs/2601.03976
- 2025-12-12, On Łojasiewicz Ideals and Flatness for Zero Sets with Infinite Tangential Geometry, Abdelhafed El Khadiri et.al., Paper: http://arxiv.org/abs/2512.11432
- 2026-02-20, On the simulated kinematic distributions of semileptonic $B$ decays, Florian Herren et.al., Paper: http://arxiv.org/abs/2602.18378
- 2026-01-15, On the reconstruction of kinematic distributions computed with Monte Carlo methods using orthogonal basis functions, Kirill Melnikov et.al., Paper: http://arxiv.org/abs/2601.10420
- 2026-01-27, On the rarity of rocket-driven Penrose extraction in Kerr spacetime, An T. Le et.al., Paper: http://arxiv.org/abs/2601.19616
- 2025-10-29, On the instability of local learning algorithms: Q-learning can fail in infinite state spaces, Vittorio Puricelli et.al., Paper: http://arxiv.org/abs/2510.25572
- 2025-12-22, On the inclusion of the pion form factor in $e^+e^- \to π^+π^-$ beyond leading order, Francesco P. Ucci et.al., Paper: http://arxiv.org/abs/2512.19359
- 2026-01-08, On the Value Function of Convex Bolza Problems Governed by Stochastic Difference Equations, Sebastián Álvarez et.al., Paper: http://arxiv.org/abs/2601.05207
- 2025-11-24, On the PB Sudakov: NNLL coefficient, CS kernel and intrinsic-kt, Aleksandra Lelek et.al., Paper: http://arxiv.org/abs/2511.19318
- 2026-02-16, On the Learning Dynamics of RLVR at the Edge of Competence, Yu Huang et.al., Paper: http://arxiv.org/abs/2602.14872
- 2025-12-08, On the Interplay of Pre-Training, Mid-Training, and RL on Reasoning Language Models, Charlie Zhang et.al., Paper: http://arxiv.org/abs/2512.07783
- 2026-01-08, On the Hidden Objective Biases of Group-based Reinforcement Learning, Aleksandar Fontana et.al., Paper: http://arxiv.org/abs/2601.05002
- 2025-11-06, On the Exoplanet Yield of Gaia Astrometry, Caleb Lammers et.al., Paper: http://arxiv.org/abs/2511.04673
- 2025-11-06, On the Estimation of Own Funds for Life Insurers: A Study of Direct, Indirect, and Control Variate Methods in a Risk-Neutral Pricing Framework, Mark-Oliver Wolf et.al., Paper: http://arxiv.org/abs/2511.04412
- 2026-01-07, On the Distributed Estimation for Scalar-on-Function Regression Models, Peilun He et.al., Paper: http://arxiv.org/abs/2601.04138
- 2026-03-23, On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation, Kexin Huang et.al., Paper: http://arxiv.org/abs/2603.22117
- 2026-02-06, On the Design of an Optimal Multi-Tone Jammer Against the Wiener Interpolation Filter, Corentin Fonteneau et.al., Paper: http://arxiv.org/abs/2602.06816
- 2026-02-12, On the Complexity of Offline Reinforcement Learning with $Q^\star$ -Approximation and Partial Coverage, Haolin Liu et.al., Paper: http://arxiv.org/abs/2602.12107
- 2025-12-17, On regularity of compressions and diagonals of operator functions, Vladimir Müller et.al., Paper: http://arxiv.org/abs/2512.15467
- 2026-02-06, On micromodes in Bayesian posterior distributions and their implications for MCMC, Sanket Agrawal et.al., Paper: http://arxiv.org/abs/2602.06931
- 2026-03-06, On a PDE model for Learning in Stochastic Market Entry Games, Esther Bou Dagher et.al., Paper: http://arxiv.org/abs/2603.06514
- 2026-01-30, On Safer Reinforcement Learning Policies for Sedation and Analgesia in Intensive Care, Joel Romero-Hernandez et.al., Paper: http://arxiv.org/abs/2601.23154
- 2025-10-23, On Multiple Robustness of Proximal Dynamic Treatment Regimes, Yuanshan Gao et.al., Paper: http://arxiv.org/abs/2510.20451
- 2026-03-12, On Information Self-Locking in Reinforcement Learning for Active Reasoning of LLM agents, Deyu Zou et.al., Paper: http://arxiv.org/abs/2603.12109
- 2026-02-27, On Hamiltonian Monte Carlo for Gaussian Random Variables with Random Hamiltonians, Yingdong Lu et.al., Paper: http://arxiv.org/abs/2602.24256
- 2025-12-25, On Critical Temperature and Finite Size Scaling of Continuous Spin $2d$ Ising Model, Swapna Mahapatra et.al., Paper: http://arxiv.org/abs/2512.21748
- 2026-02-05, On Computation and Reinforcement Learning, Raj Ghugare et.al., Paper: http://arxiv.org/abs/2602.05999
- 2025-10-21, On AI Verification in Open RAN, Rahul Soundrarajan et.al., Paper: http://arxiv.org/abs/2510.18417
- 2025-10-27, Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences, Zhuoran Jin et.al., Paper: http://arxiv.org/abs/2510.23451
- 2025-12-18, Olaf: Bringing an Animated Character to Life in the Physical World, David Müller et.al., Paper: http://arxiv.org/abs/2512.16705
- 2024-05-30, Offline-Boosted Actor-Critic: Adaptively Blending Optimal Historical Behaviors in Deep Off-Policy RL, Yu Luo et.al., Paper: http://arxiv.org/abs/2405.18520
- 2026-01-16, Offline Reinforcement-Learning-Based Power Control for Application-Agnostic Energy Efficiency, Akhilesh Raj et.al., Paper: http://arxiv.org/abs/2601.11352
- 2023-01-02, Offline Policy Optimization in RL with Variance Regularizaton, Riashat Islam et.al., Paper: http://arxiv.org/abs/2212.14405
- 2026-03-17, Offline Exploration-Aware Fine-Tuning for Long-Chain Mathematical Reasoning, Yongyu Mu et.al., Paper: http://arxiv.org/abs/2603.16206
- 2025-10-29, Off-policy Reinforcement Learning with Model-based Exploration Augmentation, Likun Wang et.al., Paper: http://arxiv.org/abs/2510.25529
- 2026-03-24, Off-Policy Value-Based Reinforcement Learning for Large Language Models, Peng-Yuan Wang et.al., Paper: http://arxiv.org/abs/2603.23355
- 2026-01-22, Off-Policy Actor-Critic with Sigmoid-Bounded Entropy for Real-World Robot Learning, Xiefeng Wu et.al., Paper: http://arxiv.org/abs/2601.15761
- 2025-11-04, Observational tests of the conformal osculating Barthel-Kropina cosmological model, Himanshu Chaudhary et.al., Paper: http://arxiv.org/abs/2511.02729
- 2026-03-18, Observational Signatures of Exact Black Hole Solutions in a Dark Matter Halo, Azalbek Boltaev et.al., Paper: http://arxiv.org/abs/2603.17986
- 2026-01-15, Observation Timelines for the Potential Lunar Impact of Asteroid 2024 YR4, Yifan He et.al., Paper: http://arxiv.org/abs/2601.10666
- 2025-12-08, Observability of eccentricity in a population of merging compact binaries, Mukesh Kumar Singh et.al., Paper: http://arxiv.org/abs/2512.07688
- 2026-01-29, OVD: On-policy Verbal Distillation, Jing Xiong et.al., Paper: http://arxiv.org/abs/2601.21968
- 2026-02-11, OSIL: Learning Offline Safe Imitation Policies with Safety Inferred from Non-preferred Trajectories, Returaj Burnwal et.al., Paper: http://arxiv.org/abs/2602.11018
- 2026-03-19, OS-Themis: A Scalable Critic Framework for Generalist GUI Rewards, Zehao Li et.al., Paper: http://arxiv.org/abs/2603.19191
- 2026-02-10, ORCHID: Fairness-Aware Orchestration in Mission-Critical Air-Ground Integrated Networks, Chuan-Chi Lai et.al., Paper: http://arxiv.org/abs/2602.09994
- 2025-12-11, OPV: Outcome-based Process Verifier for Efficient Long Chain-of-Thought Verification, Zijian Wu et.al., Paper: http://arxiv.org/abs/2512.10756
- 2025-10-20, OPTAGENT: Optimizing Multi-Agent LLM Interactions Through Verbal Reinforcement Learning for Enhanced Reasoning, Zhenyu Bi et.al., Paper: http://arxiv.org/abs/2510.18032
- 2025-12-17, OMCL: Open-vocabulary Monte Carlo Localization, Evgenii Kruzhkov et.al., Paper: http://arxiv.org/abs/2512.15557
- 2026-01-21, Numerical study of multiple solar flare induced modulation of Very Low Frequency (VLF) diurnal profile, Sourav Palit et.al., Paper: http://arxiv.org/abs/2601.14948
- 2025-11-24, Numerical Approximation In Real Domain Of Special Function Of Product Of A Variable And Its Double Exponential, Narinder Kumar Wadhawan et.al., Paper: http://arxiv.org/abs/2511.19270
- 2026-02-13, Nuclear gradients from auxiliary-field quantum Monte Carlo and their application in geometry optimization and transition state search, Jo S. Kurian et.al., Paper: http://arxiv.org/abs/2602.13187
- 2025-11-18, Nonparametric Uniform Inference in Binary Classification and Policy Values, Nan Liu et.al., Paper: http://arxiv.org/abs/2511.14700
- 2025-12-31, Nonlinear Noise2Noise for Efficient Monte Carlo Denoiser Training, Andrew Tinits et.al., Paper: http://arxiv.org/abs/2512.24794
- 2026-02-11, Non-centred Bayesian inference for discrete-valued state-transition models: the Rippler algorithm, James Neill et.al., Paper: http://arxiv.org/abs/2602.10924
- 2025-11-13, Non-Resonant Alpha-Induced Neutron-Emission: A Multi- Method Comparison Of Nuclear Reaction Rates, Bhavay Luthra et.al., Paper: http://arxiv.org/abs/2511.10536
- 2026-02-11, Noise-balanced multilevel on-the-fly sparse grid surrogates for coupling Monte Carlo models into continuum models with application to heterogeneous catalysis, Tobias Hülser et.al., Paper: http://arxiv.org/abs/2602.11006
- 2025-10-23, No-Regret Thompson Sampling for Finite-Horizon Markov Decision Processes with Gaussian Processes, Jasmine Bayrooti et.al., Paper: http://arxiv.org/abs/2510.20725
- 2026-03-25, No Single Metric Tells the Whole Story: A Multi-Dimensional Evaluation Framework for Uncertainty Attributions, Emily Schiller et.al., Paper: http://arxiv.org/abs/2603.24524
- 2025-12-09, No Labels, No Problem: Training Visual Reasoners with Multimodal Verifiers, Damiano Marsili et.al., Paper: http://arxiv.org/abs/2512.08889
- 2026-01-05, NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation, Huichao Zhang et.al., Paper: http://arxiv.org/abs/2601.02204
- 2025-12-04, Nex-N1: Agentic Models Trained via a Unified Ecosystem for Large-Scale Environment Construction, Nex-AGI Team et.al., Paper: http://arxiv.org/abs/2512.04987
- 2026-01-26, New parameters for star cluster dynamics: observational results, Barbara Lanzoni et.al., Paper: http://arxiv.org/abs/2601.18679
- 2025-12-02, New insights into hydrogen-assisted intergranular cracking in nickel, S. Quan et.al., Paper: http://arxiv.org/abs/2512.02979
- 2025-12-01, New Spiking Architecture for Multi-Modal Decision-Making in Autonomous Vehicles, Aref Ghoreishee et.al., Paper: http://arxiv.org/abs/2512.01882
- 2026-01-16, New Adaptive Mechanism for Large Neighborhood Search using Dual Actor-Critic, Shaohua Yu et.al., Paper: http://arxiv.org/abs/2601.11414
- 2026-03-19, NeuroGame Transformer: Gibbs-Inspired Attention Driven by Game Theory and Statistical Physics, Djamel Bouchaffra et.al., Paper: http://arxiv.org/abs/2603.18761
- 2026-03-05, Neural Wavefunction Calculations of μSR Spectra with Quantum Muons and Protons, Jamie Carr et.al., Paper: http://arxiv.org/abs/2603.05453
- 2026-01-28, Neural Quantum States in Mixed Precision, Massimo Solinas et.al., Paper: http://arxiv.org/abs/2601.20782
- 2026-02-13, Neural Quantum States Based on Selected Configurations, Marco Julian Solanki et.al., Paper: http://arxiv.org/abs/2602.12993
- 2026-03-24, Neural ODE and SDE Models for Adaptation and Planning in Model-Based Reinforcement Learning, Chao Han et.al., Paper: http://arxiv.org/abs/2603.23245
- 2026-01-09, Neural Methods for Multiple Systems Estimation Models, Joseph Marsh et.al., Paper: http://arxiv.org/abs/2601.05859
- 2026-03-22, Neural Inference Functions for Margins for Time Series Copula Models, Daniel Fynn et.al., Paper: http://arxiv.org/abs/2603.21075
- 2026-03-07, Neural Control and Learning of Simulated Hand Movements With an EMG-Based Closed-Loop Interface, Balint K. Hodossy et.al., Paper: http://arxiv.org/abs/2603.07364
- 2025-12-19, NeuRehab: A Reinforcement Learning and Spiking Neural Network-Based Rehab Automation Framework, Phani Pavan Kambhampati et.al., Paper: http://arxiv.org/abs/2512.17841
- 2025-12-15, Nemotron-Cascade: Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models, Boxin Wang et.al., Paper: http://arxiv.org/abs/2512.13607
- 2026-03-29, NeedleDB: A Generative-AI Based System for Accurate and Efficient Image Retrieval using Complex Natural Language Queries, Mahdi Erfanian et.al., Paper: http://arxiv.org/abs/2603.27464
- 2026-01-23, Necessary Optimality Conditions for Integrated Learning and Optimization Problem in Contextual Optimization, Yuan Tao et.al., Paper: http://arxiv.org/abs/2601.16581
- 2026-03-02, Near-Optimal Regret for KL-Regularized Multi-Armed Bandits, Kaixuan Ji et.al., Paper: http://arxiv.org/abs/2603.02155
- 2026-02-11, Near-Constant Strong Violation and Last-Iterate Convergence for Online CMDPs via Decaying Safety Margins, Qian Zuo et.al., Paper: http://arxiv.org/abs/2602.10917
- 2025-10-29, Navigation in a Three-Dimensional Urban Flow using Deep Reinforcement Learning, Federica Tonti et.al., Paper: http://arxiv.org/abs/2510.25679
- 2026-03-19, Navigating complex phase diagrams in soft matter systems, Michael Wassermair et.al., Paper: http://arxiv.org/abs/2603.18918
- 2026-03-16, NavThinker: Action-Conditioned World Models for Coupled Prediction and Planning in Social Navigation, Tianshuai Hu et.al., Paper: http://arxiv.org/abs/2603.15359
- 2025-11-04, Natural-gas storage modelling by deep reinforcement learning, Tiziano Balaconi et.al., Paper: http://arxiv.org/abs/2511.02646
- 2025-10-21, Nash Policy Gradient: A Policy Gradient Method with Iteratively Refined Regularization for Finding Nash Equilibria, Eason Yu et.al., Paper: http://arxiv.org/abs/2510.18183
- 2025-10-21, NTKMTL: Mitigating Task Imbalance in Multi-Task Learning from Neural Tangent Kernel Perspective, Xiaohan Qin et.al., Paper: http://arxiv.org/abs/2510.18258
- 2026-03-10, NS-VLA: Towards Neuro-Symbolic Vision-Language-Action Models, Ziyue Zhu et.al., Paper: http://arxiv.org/abs/2603.09542
- 2025-11-18, NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards, Chia-Yu Hung et.al., Paper: http://arxiv.org/abs/2511.14659
- 2026-03-14, NEP_MiniMax: An Approach for NEPs Based on Matrix-valued Minimax Approximations, Chenkun Zhang et.al., Paper: http://arxiv.org/abs/2603.13794
- 2026-03-20, NASimJax: GPU-Accelerated Policy Learning Framework for Penetration Testing, Raphael Simon et.al., Paper: http://arxiv.org/abs/2603.19864
- 2025-10-31, Muon veto system for the CROSS double-beta decay search experiment, A. S. Barabash et.al., Paper: http://arxiv.org/abs/2510.27406
- 2025-11-25, Multivariable Wold-Type Decomposition and Analytic Models for a class of left-inverse commuting pairs, Monojit Bhattacharjee et.al., Paper: http://arxiv.org/abs/2511.20632
- 2025-11-26, Multivalued backward stochastic differential equations with jumps and moving boundary, Badr Elmansouri et.al., Paper: http://arxiv.org/abs/2511.21679
- 2026-01-13, Multiplex Thinking: Reasoning via Token-wise Branch-and-Merge, Yao Tang et.al., Paper: http://arxiv.org/abs/2601.08808
- 2026-02-04, Multiple Imputation Methods under Extreme Values, Enzo Porto Brasil et.al., Paper: http://arxiv.org/abs/2602.04751
- 2025-10-20, Multimodal Safety Is Asymmetric: Cross-Modal Exploits Unlock Black-Box MLLMs Jailbreaks, Xinkai Wang et.al., Paper: http://arxiv.org/abs/2510.17277
- 2025-12-18, Multimodal RewardBench 2: Evaluating Omni Reward Models for Interleaved Text and Image, Yushi Hu et.al., Paper: http://arxiv.org/abs/2512.16899
- 2026-03-11, Multilingual Reasoning Gym: Multilingual Scaling of Procedural Reasoning Environments, Konstantin Dobler et.al., Paper: http://arxiv.org/abs/2603.10793
- 2025-10-24, Multilevel Picard scheme for solving high-dimensional drift control problems with state constraints, Yuan Zhong et.al., Paper: http://arxiv.org/abs/2510.21607
- 2026-03-19, MultihopSpatial: Multi-hop Compositional Spatial Reasoning Benchmark for Vision-Language Model, Youngwan Lee et.al., Paper: http://arxiv.org/abs/2603.18892
- 2025-12-25, Multiconnectivity for SAGIN: Current Trends, Challenges, AI-driven Solutions, and Opportunities, Abd Ullah Khan et.al., Paper: http://arxiv.org/abs/2512.21717
- 2025-12-26, Multi-reference Trial State for Lattice Quantum Monte Carlo Simulations, Teng Wang et.al., Paper: http://arxiv.org/abs/2512.21942
- 2025-12-31, Multi-particle quantum systems within the Worldline Monte Carlo formalism, Ivan Ahumada et.al., Paper: http://arxiv.org/abs/2512.24942
- 2026-03-07, Multi-parameter determination in the semilinear Helmholtz equation, Long-Ling Du et.al., Paper: http://arxiv.org/abs/2603.07247
- 2026-03-26, Multi-User Covert Communication in Spatially Heterogeneous Wireless Networks, Jinyoung Lee et.al., Paper: http://arxiv.org/abs/2603.25549
- 2026-02-11, Multi-Task Reinforcement Learning of Drone Aerobatics by Exploiting Geometric Symmetries, Zhanyu Guo et.al., Paper: http://arxiv.org/abs/2602.10997
- 2026-02-04, Multi-Source Retrieval and Reasoning for Legal Sentencing Prediction, Junjie Chen et.al., Paper: http://arxiv.org/abs/2602.04690
- 2025-11-25, Multi-Resonant-Line Radiative Transfer: Lyman-Alpha Fine Structure and Deuterium Coupling, Ethan Stace et.al., Paper: http://arxiv.org/abs/2511.20580
- 2025-11-14, Multi-Phase Spacecraft Trajectory Optimization via Transformer-Based Reinforcement Learning, Amit Jain et.al., Paper: http://arxiv.org/abs/2511.11402
- 2025-12-11, Multi-Objective Reward and Preference Optimization: Theory and Algorithms, Akhil Agnihotri et.al., Paper: http://arxiv.org/abs/2512.10601
- 2026-02-27, Multi-Objective Reinforcement Learning for Large-Scale Tote Allocation in Human-Robot Collaborative Fulfillment Centers, Sikata Sengupta et.al., Paper: http://arxiv.org/abs/2602.24182
- 2026-01-26, Multi-Objective Reinforcement Learning for Efficient Tactical Decision Making for Trucks in Highway Traffic, Deepthi Pathare et.al., Paper: http://arxiv.org/abs/2601.18783
- 2026-03-25, Multi-GPU Hybrid Particle-in-Cell Monte Carlo Simulations for Exascale Computing Systems, Jeremy J. Williams et.al., Paper: http://arxiv.org/abs/2603.24508
- 2026-04-02, Multi-Agent Video Recommenders: Evolution, Patterns, and Open Challenges, Srivaths Ranganathan et.al., Paper: http://arxiv.org/abs/2604.02211
- 2025-11-21, Multi-Agent Pointer Transformer: Seq-to-Seq Reinforcement Learning for Multi-Vehicle Dynamic Pickup-Delivery Problems, Zengyu Zou et.al., Paper: http://arxiv.org/abs/2511.17435
- 2026-02-02, Multi-Agent Monte Carlo Tree Search for Makespan-Efficient Object Rearrangement in Cluttered Spaces, Hanwen Ren et.al., Paper: http://arxiv.org/abs/2602.02411
- 2025-12-16, Multi-Agent Medical Decision Consensus Matrix System: An Intelligent Collaborative Framework for Oncology MDT Consultations, Xudong Han et.al., Paper: http://arxiv.org/abs/2512.14321
- 2026-04-01, Multi-Agent LLM Governance for Safe Two-Timescale Reinforcement Learning in SDN-IoT Defense, Saeid Jamshidi et.al., Paper: http://arxiv.org/abs/2604.01127
- 2025-10-28, Multi-Agent Evolve: LLM Self-Improve through Co-evolution, Yixing Chen et.al., Paper: http://arxiv.org/abs/2510.23595
- 2025-11-21, MorphSeek: Fine-grained Latent Representation-Level Policy Optimization for Deformable Image Registration, Runxun Zhang et.al., Paper: http://arxiv.org/abs/2511.17392
- 2026-01-14, Monte-Carlo Tree Search with Neural Network Guidance for Lane-Free Autonomous Driving, Ioannis Peridis et.al., Paper: http://arxiv.org/abs/2601.09353
- 2026-02-18, Monte Carlo study of the classical antiferromagnetic $J_1$-$J_2$-$J_3$ Heisenberg model on a simple cubic lattice, A. N. Ignatenko et.al., Paper: http://arxiv.org/abs/2602.16471
- 2025-10-22, Monte Carlo study of the $O(2)$-invariant $φ^4$ theory with a cubic perturbation in three dimensions, Martin Hasenbusch et.al., Paper: http://arxiv.org/abs/2510.19473
- 2026-03-17, Monte Carlo sampling from a projected entangled-pair state in simulations of quantum annealing in the three dimensional random Ising model, Jacek Dziarmaga et.al., Paper: http://arxiv.org/abs/2603.16509
- 2026-03-20, Monte Carlo conformal prediction for quantifying uncertainty in radio galaxy classification under ambiguous ground truth, Alex Walls et.al., Paper: http://arxiv.org/abs/2603.20000
- 2025-10-23, Monte Carlo Sampling for Wave Functions Requiring (Anti)Symmetrization, Koyena Bose et.al., Paper: http://arxiv.org/abs/2510.20577
- 2026-03-05, Monitoring Covariance in Multichannel Profiles via Functional Graphical Models, Christian Capezza et.al., Paper: http://arxiv.org/abs/2603.05274
- 2026-02-19, Momentum Measurement of Charged Particles in FASER’s Emulsion Detector at the LHC, FASER Collaboration et.al., Paper: http://arxiv.org/abs/2602.17575
- 2025-12-18, MomaGraph: State-Aware Unified Scene Graphs with Vision-Language Model for Embodied Task Planning, Yuanchen Ju et.al., Paper: http://arxiv.org/abs/2512.16909
- 2025-11-21, MolSight: Optical Chemical Structure Recognition with SMILES Pretraining, Multi-Granularity Learning and Reinforcement Learning, Wenrui Zhang et.al., Paper: http://arxiv.org/abs/2511.17300
- 2026-03-25, MolEvolve: LLM-Guided Evolutionary Search for Interpretable Molecular Optimization, Xiangsen Chen et.al., Paper: http://arxiv.org/abs/2603.24382
- 2026-03-26, Modernising Reinforcement Learning-Based Navigation for Embodied Semantic Scene Graph Generation, Roman Kueble et.al., Paper: http://arxiv.org/abs/2603.25415
- 2018-11-20, Modelling the Dynamic Joint Policy of Teammates with Attention Multi-agent DDPG, Hangyu Mao et.al., Paper: http://arxiv.org/abs/1811.07029
- 2025-11-14, Modeling the Multi-Wavelength Afterglow of Short Gamma-Ray Bursts with a Plateau Phase, Chen Deng et.al., Paper: http://arxiv.org/abs/2511.11396
- 2026-03-21, Modeling surface radiation of rotating neutron stars with Monk-NS, Wenda Zhang et.al., Paper: http://arxiv.org/abs/2603.20870
- 2025-10-31, Modeling partially-ionized dense plasma using wavepacket molecular dynamics, Daniel Plummer et.al., Paper: http://arxiv.org/abs/2510.27446
- 2026-02-19, Modeling of Relativistic Plasmas with a Conservative Discontinuous Galerkin Method, James Juno et.al., Paper: http://arxiv.org/abs/2602.17487
- 2026-02-16, Modeling isolated magnetar spin-down evolution and implications for long-period radio transients, Jon Kwong et.al., Paper: http://arxiv.org/abs/2602.15024
- 2025-12-24, Model-free stochastic linear quadratic control for discrete-time systems with multiplicative and additive noises via semidefinite programming, Jing Guo et.al., Paper: http://arxiv.org/abs/2512.20911
- 2026-01-16, Model-free policy gradient for discrete-time mean-field control, Matthieu Meunier et.al., Paper: http://arxiv.org/abs/2601.11217
- 2022-11-22, Model-based Trajectory Stitching for Improved Offline Reinforcement Learning, Charles A. Hepburn et.al., Paper: http://arxiv.org/abs/2211.11603
- 2025-12-04, Model-Free Assessment of Simulator Fidelity via Quantile Curves, Garud Iyengar et.al., Paper: http://arxiv.org/abs/2512.05024
- 2025-12-16, Model-Based Reinforcement Learning in Discrete-Action Non-Markovian Reward Decision Processes, Alessandro Trapasso et.al., Paper: http://arxiv.org/abs/2512.14617
- 2025-12-08, Model-Based Reinforcement Learning Under Confounding, Nishanth Venkatesh et.al., Paper: http://arxiv.org/abs/2512.07528
- 2026-01-13, Model-Agnostic Solutions for Deep Reinforcement Learning in Non-Ergodic Contexts, Bert Verbruggen et.al., Paper: http://arxiv.org/abs/2601.08726
- 2026-03-19, MoRI: Learning Motivation-Grounded Reasoning for Scientific Ideation in Large Language Models, Chenyang Gu et.al., Paper: http://arxiv.org/abs/2603.19044
- 2025-10-21, MoMaGen: Generating Demonstrations under Soft and Hard Constraints for Multi-Step Bimanual Mobile Manipulation, Chengshu Li et.al., Paper: http://arxiv.org/abs/2510.18316
- 2025-11-26, MoGAN: Improving Motion Quality in Video Diffusion via Few-Step Motion Adversarial Post-Training, Haotian Xue et.al., Paper: http://arxiv.org/abs/2511.21592
- 2026-01-29, MoE-ACT: Improving Surgical Imitation Learning Policies through Supervised Mixture-of-Experts, Lorenzo Mazza et.al., Paper: http://arxiv.org/abs/2601.21971
- 2025-11-06, Mixed-State Measurement-Induced Phase Transitions in Imaginary-Time Dynamics, Yi-Ming Ding et.al., Paper: http://arxiv.org/abs/2511.04402
- 2025-12-12, Mitigating the Safety Alignment Tax with Null-Space Constrained Policy Optimization, Yifan Niu et.al., Paper: http://arxiv.org/abs/2512.11391
- 2025-11-07, Minority-Aware Satisfaction Estimation in Dialogue Systems via Preference-Adaptive Reinforcement Learning, Yahui Fu et.al., Paper: http://arxiv.org/abs/2511.05407
- 2026-03-24, Minimizing Material Waste in Additive Manufacturing through Online Reel Assignment, Ilayda Celenk et.al., Paper: http://arxiv.org/abs/2603.23042
- 2026-03-06, Minimizers for boundary reactions: renormalized energy, location of singularities, and applications, Xavier Cabre et.al., Paper: http://arxiv.org/abs/2603.06435
- 2026-02-24, Minimal loop currents in doped Mott insulators, Can Cui et.al., Paper: http://arxiv.org/abs/2602.21206
- 2025-12-24, Minijets and Broken Stationarity in a Blazar : Novel Insights into the Origin of $γ$ -ray Variability in CTA 102, Agniva Roychowdhury et.al., Paper: http://arxiv.org/abs/2512.21240
- 2025-10-28, MiniOneRec: An Open-Source Framework for Scaling Generative Recommendation, Xiaoyu Kong et.al., Paper: http://arxiv.org/abs/2510.24431
- 2025-12-02, MindGPT-4ov: An Enhanced MLLM via a Multi-Stage Post-Training Paradigm, Wei Chen et.al., Paper: http://arxiv.org/abs/2512.02895
- 2025-12-15, MindDrive: A Vision-Language-Action Model for Autonomous Driving via Online Reinforcement Learning, Haoyu Fu et.al., Paper: http://arxiv.org/abs/2512.13636
- 2025-12-09, Mind to Hand: Purposeful Robotic Control via Embodied Reasoning, Peijun Tang et.al., Paper: http://arxiv.org/abs/2512.08580
- 2026-02-27, Microwave response of fractional quantum Hall droplets with quasiparticle tunneling, Fumihiro Murabayashi et.al., Paper: http://arxiv.org/abs/2602.24116
- 2025-12-24, MiST: Understanding the Role of Mid-Stage Scientific Training in Developing Chemical Reasoning Models, Andres M Bran et.al., Paper: http://arxiv.org/abs/2512.21231
- 2025-11-20, MiMo-Embodied: X-Embodied Foundation Model Technical Report, Xiaoshuai Hao et.al., Paper: http://arxiv.org/abs/2511.16518
- 2026-03-19, Mi:dm K 2.5 Pro, KT Tech innovation Group et.al., Paper: http://arxiv.org/abs/2603.18788
- 2026-03-09, MetaWorld-X: Hierarchical World Modeling via VLM-Orchestrated Experts for Humanoid Loco-Manipulation, Yutong Shen et.al., Paper: http://arxiv.org/abs/2603.08572
- 2025-10-29, MetaLore: Learning to Orchestrate Communication and Computation for Metaverse Synchronization, Elif Ebru Ohri et.al., Paper: http://arxiv.org/abs/2510.25705
- 2026-02-12, Meta-Sel: Efficient Demonstration Selection for In-Context Learning via Supervised Meta-Learning, Xubin Wang et.al., Paper: http://arxiv.org/abs/2602.12123
- 2026-03-09, Meta-RL with Shared Representations Enables Fast Adaptation in Energy Systems, Théo Zangato et.al., Paper: http://arxiv.org/abs/2603.08418
- 2025-12-18, Meta-RL Induces Exploration in Language Agents, Yulun Jiang et.al., Paper: http://arxiv.org/abs/2512.16848
- 2025-12-26, Meta-Learning-Based Handover Management in NextG O-RAN, Michail Kalntis et.al., Paper: http://arxiv.org/abs/2512.22022
- 2025-11-19, Meta-Black-Box Optimization with Bi-Space Landscape Analysis and Dual-Control Mechanism for SAEA, Yukun Du et.al., Paper: http://arxiv.org/abs/2511.15551
- 2026-02-17, MeshMimic: Geometry-Aware Humanoid Motion Learning through 3D Scene Reconstruction, Qiang Zhang et.al., Paper: http://arxiv.org/abs/2602.15733
- 2026-03-12, Merger-driven buildup of the $M_{\rm BH}$ - $M_*$ relation bridging high-$z$ overmassive black holes with the local relation, Takumi S. Tanaka et.al., Paper: http://arxiv.org/abs/2603.11747
- 2025-10-27, MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding, Xin Jin et.al., Paper: http://arxiv.org/abs/2510.23479
- 2026-03-13, Mending the Holes: Mitigating Reward Hacking in Reinforcement Learning for Multilingual Translation, Yifeng Liu et.al., Paper: http://arxiv.org/abs/2603.13045
- 2025-12-15, Memory in the Age of AI Agents, Yuyang Hu et.al., Paper: http://arxiv.org/abs/2512.13564
- 2026-01-21, Memory Retention Is Not Enough to Master Memory Tasks in Reinforcement Learning, Oleg Shchendrigin et.al., Paper: http://arxiv.org/abs/2601.15086
- 2025-10-22, Memo: Training Memory-Efficient Embodied Agents with Reinforcement Learning, Gunshi Gupta et.al., Paper: http://arxiv.org/abs/2510.19732
- 2026-03-04, Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory, Zhenting Wang et.al., Paper: http://arxiv.org/abs/2603.04257
- 2026-03-19, Memento-Skills: Let Agents Design Agents, Huichi Zhou et.al., Paper: http://arxiv.org/abs/2603.18743
- 2025-11-04, MemSearcher: Training LLMs to Reason, Search and Manage Memory via End-to-End Reinforcement Learning, Qianhao Yuan et.al., Paper: http://arxiv.org/abs/2511.02805
- 2026-01-06, MemRL: Self-Evolving Agents via Runtime Reinforcement Learning on Episodic Memory, Shengtao Zhang et.al., Paper: http://arxiv.org/abs/2601.03192
- 2026-01-28, MemCtrl: Using MLLMs as Active Memory Controllers on Embodied Agents, Vishnu Sashank Dorbala et.al., Paper: http://arxiv.org/abs/2601.20831
- 2026-01-30, Mem-T: Densifying Rewards for Long-Horizon Memory Agents, Yanwei Yue et.al., Paper: http://arxiv.org/abs/2601.23014
- 2026-02-26, MediX-R1: Open Ended Medical Reinforcement Learning, Sahal Shaji Mullappilly et.al., Paper: http://arxiv.org/abs/2602.23363
- 2025-12-05, MedTutor-R1: Socratic Personalized Medical Teaching with Multi-Agent Simulation, Zhitao He et.al., Paper: http://arxiv.org/abs/2512.05671
- 2025-10-22, MedReason-R1: Learning to Reason for CT Diagnosis with Reinforcement Learning and Local Zoom, Yifan Li et.al., Paper: http://arxiv.org/abs/2510.19626
- 2026-02-06, MedMO: Grounding and Understanding Multimodal Large Language Model for Medical Images, Ankan Deria et.al., Paper: http://arxiv.org/abs/2602.06965
- 2026-03-24, MedCausalX: Adaptive Causal Reasoning with Self-Reflection for Trustworthy Medical Vision-Language Models, Jianxin Lin et.al., Paper: http://arxiv.org/abs/2603.23085
- 2025-12-15, MedCEG: Reinforcing Verifiable Medical Reasoning with Critical Evidence Graph, Linjie Mu et.al., Paper: http://arxiv.org/abs/2512.13510
- 2025-11-20, MedBayes-Lite: Bayesian Uncertainty Quantification for Safe Clinical Decision Support, Elias Hossain et.al., Paper: http://arxiv.org/abs/2511.16625
- 2025-10-21, Med-VRAgent: A Framework for Medical Visual Reasoning-Enhanced Agents, Guangfu Guo et.al., Paper: http://arxiv.org/abs/2510.18424
- 2026-01-30, Med-Scout: Curing MLLMs’ Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training, Anglin Liu et.al., Paper: http://arxiv.org/abs/2601.23220
- 2025-10-24, Mechanistic Interpretability for Neural TSP Solvers, Reuben Narad et.al., Paper: http://arxiv.org/abs/2510.21693
- 2025-10-23, Measuring cosmic dipole with the GRB luminosity-time relation, Jessica Santiago et.al., Paper: http://arxiv.org/abs/2510.20705
- 2025-10-20, Measuring Reasoning in LLMs: a New Dialectical Angle, Soheil Abbasloo et.al., Paper: http://arxiv.org/abs/2510.18134
- 2025-11-19, Measure finite topology on the ring of measurable functions, Soumajit Dey et.al., Paper: http://arxiv.org/abs/2511.15436
- 2026-03-18, Mean first escape times of Brownian motion on asymptotically hyperbolic and gas giant metric surfaces, Jesse Gell-Redman et.al., Paper: http://arxiv.org/abs/2603.17313
- 2026-01-26, McSAS3: improved Monte Carlo small-angle scattering analysis software for dilute and dense scatterers, Brian Richard Pauw et.al., Paper: http://arxiv.org/abs/2601.18659
- 2026-03-19, Maximum-Entropy Exploration with Future State-Action Visitation Measures, Adrien Bolland et.al., Paper: http://arxiv.org/abs/2603.18965
- 2026-03-11, Maximum Inverse Sum Indeg Index of Trees and Unicyclic Graphs with Fixed Diameter, Sunilkumar M. Hosamani et.al., Paper: http://arxiv.org/abs/2603.10603
- 2026-03-26, Maximum Entropy Behavior Exploration for Sim2Real Zero-Shot Reinforcement Learning, Jiajun Hu et.al., Paper: http://arxiv.org/abs/2603.25464
- 2026-03-14, Maximin Robust Bayesian Experimental Design, Hany Abdulsamad et.al., Paper: http://arxiv.org/abs/2603.14094
- 2025-12-05, MaxShapley: Towards Incentive-compatible Generative Search with Fair Context Attribution, Sara Patel et.al., Paper: http://arxiv.org/abs/2512.05958
- 2026-01-02, Materials Informatics: Emergence To Autonomous Discovery In The Age Of AI, Turab Lookman et.al., Paper: http://arxiv.org/abs/2601.00742
- 2026-02-24, Matching Multiple Experts: On the Exploitability of Multi-Agent Imitation Learning, Antoine Bergerault et.al., Paper: http://arxiv.org/abs/2602.21020
- 2026-03-12, Matching Features, Not Tokens: Energy-Based Fine-Tuning of Language Models, Samy Jelassi et.al., Paper: http://arxiv.org/abs/2603.12248
- 2026-01-15, MatchTIR: Fine-Grained Supervision for Tool-Integrated Reasoning via Bipartite Matching, Changle Qu et.al., Paper: http://arxiv.org/abs/2601.10712
- 2026-03-29, Match or Replay: Self Imitating Proximal Policy Optimization, Gaurav Chaudhary et.al., Paper: http://arxiv.org/abs/2603.27515
- 2026-03-14, Mass spectrometry of $^{75}$Zn ground and isomeric states from in-trap decay of $^{75}$ Cu, M. Müller et.al., Paper: http://arxiv.org/abs/2603.13868
- 2025-11-07, Mass determination of the three long-period Neptune- and sub-Neptune-sized planets transiting TOI-282, A. Barone et.al., Paper: http://arxiv.org/abs/2511.05147
- 2026-04-01, Mass Hierarchies Without Mixing: Abelian Froggatt-Nielsen Models with Uncharged Left-Handed Doublets, Navid Ardakanian et.al., Paper: http://arxiv.org/abs/2604.01055
- 2025-11-21, Masked-and-Reordered Self-Supervision for Reinforcement Learning from Verifiable Rewards, Zhen Wang et.al., Paper: http://arxiv.org/abs/2511.17473
- 2025-11-18, Masked IRL: LLM-Guided Reward Disambiguation from Demonstrations and Language, Minyoung Hwang et.al., Paper: http://arxiv.org/abs/2511.14565
- 2026-04-01, Markov chain Monte Carlo for Bayesian inference of the non-conducting region in intra-atrial reentrant tachycardia, Maarten Volkaerts et.al., Paper: http://arxiv.org/abs/2604.01164
- 2026-03-19, Markov Potential Game and Multi-Agent Reinforcement Learning for Autonomous Driving, Huiwen Yan et.al., Paper: http://arxiv.org/abs/2603.19188
- 2025-12-03, MarkTune: Improving the Quality-Detectability Trade-off in Open-Weight LLM Watermarking, Yizhou Zhao et.al., Paper: http://arxiv.org/abs/2512.04044
- 2025-11-25, MapReduce LoRA: Advancing the Pareto Front in Multi-Preference Optimization for Generative Models, Chieh-Yun Chen et.al., Paper: http://arxiv.org/abs/2511.20629
- 2026-03-25, Many-body perturbation theory for the nuclear equation of state up to fifth order, C. Drischler et.al., Paper: http://arxiv.org/abs/2603.24532
- 2025-12-31, Many Minds from One Model: Bayesian Transformers for Population Intelligence, Diji Yang et.al., Paper: http://arxiv.org/abs/2512.25063
- 2026-02-16, ManeuverNet: A Soft Actor-Critic Framework for Precise Maneuvering of Double-Ackermann-Steering Robots with Optimized Reward Functions, Kohio Deflesselle et.al., Paper: http://arxiv.org/abs/2602.14726
- 2024-10-30, Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning, Qi Wang et.al., Paper: http://arxiv.org/abs/2305.15260
- 2026-03-23, Make Tracking Easy: Neural Motion Retargeting for Humanoid Whole-body Control, Qingrui Zhao et.al., Paper: http://arxiv.org/abs/2603.22201
- 2025-12-15, Magnetic order and novel quantum criticality in the strongly interacting quasicrystals, Cong Zhang et.al., Paper: http://arxiv.org/abs/2512.13546
- 2025-11-19, Magnetic electron-hole asymmetry in cuprates: a computational revisit, Jiong Mei et.al., Paper: http://arxiv.org/abs/2511.15608
- 2025-11-06, MacroNav: Multi-Task Context Representation Learning Enables Efficient Navigation in Unknown Environments, Kuankuan Sima et.al., Paper: http://arxiv.org/abs/2511.04320
- 2025-12-05, Machine Learning-Informed 3+1 Sterile Neutrino Global Fits using Posterior Density Estimation of Electron Disappearance Data, Joshua Villarreal et.al., Paper: http://arxiv.org/abs/2512.05784
- 2026-03-18, Machine Learning for Network Attacks Classification and Statistical Evaluation of Machine Learning for Network Attacks Classification and Adversarial Learning Methodologies for Synthetic Data Generation, Iakovos-Christos Zarkadis et.al., Paper: http://arxiv.org/abs/2603.17717
- 2025-10-29, MTIR-SQL: Multi-turn Tool-Integrated Reasoning Reinforcement Learning for Text-to-SQL, Zekun Xu et.al., Paper: http://arxiv.org/abs/2510.25510
- 2025-12-31, MSACL: Multi-Step Actor-Critic Learning with Lyapunov Certificates for Exponentially Stabilizing Control, Yongwei Zhang et.al., Paper: http://arxiv.org/abs/2512.24955
- 2025-10-24, MRO: Enhancing Reasoning in Diffusion Language Models via Multi-Reward Optimization, Chenglong Wang et.al., Paper: http://arxiv.org/abs/2510.21473
- 2026-03-10, MORE-R1: Guiding LVLM for Multimodal Object-Entity Relation Extraction via Stepwise Reasoning with Reinforcement Learning, Xiang Yuan et.al., Paper: http://arxiv.org/abs/2603.09478
- 2025-12-10, MOA: Multi-Objective Alignment for Role-Playing Agents, Chonghua Liao et.al., Paper: http://arxiv.org/abs/2512.09756
- 2025-12-15, MMhops-R1: Multimodal Multi-hop Reasoning, Tao Zhang et.al., Paper: http://arxiv.org/abs/2512.13573
- 2026-01-01, MIMO-AFDM Outperforms MIMO-OFDM in the Face of Hardware Impairments, Zeping Sui et.al., Paper: http://arxiv.org/abs/2601.00502
- 2026-03-23, MEVIUS2: Practical Open-Source Quadruped Robot with Sheet Metal Welding and Multimodal Perception, Kento Kawaharazuka et.al., Paper: http://arxiv.org/abs/2603.22031
- 2025-10-21, MENTOR: A Reinforcement Learning Framework for Model Enhancement via Teacher-Optimized Rewards in Small Models, ChangSu Choi et.al., Paper: http://arxiv.org/abs/2510.18383
- 2026-02-19, MDP Planning as Policy Inference, David Tolpin et.al., Paper: http://arxiv.org/abs/2602.17375
- 2025-11-17, MDIntrinsicDimension: Dimensionality-Based Analysis of Collective Motions in Macromolecules from Molecular Dynamics Trajectories, Irene Cazzaniga et.al., Paper: http://arxiv.org/abs/2511.13550
- 2026-01-05, MDAgent2: Large Language Model for Code Generation and Knowledge Q&A in Molecular Dynamics, Zhuofan Shi et.al., Paper: http://arxiv.org/abs/2601.02075
- 2026-03-11, MCMC Informed Neural Emulators for Uncertainty Quantification in Dynamical Systems, Heikki Haario et.al., Paper: http://arxiv.org/abs/2603.10987
- 2026-03-11, MAVEN: A Meta-Reinforcement Learning Framework for Varying-Dynamics Expertise in Agile Quadrotor Maneuvers, Jin Zhou et.al., Paper: http://arxiv.org/abs/2603.10714
- 2026-02-19, MASPO: Unifying Gradient Utilization, Probability Mass, and Signal Reliability for Robust and Sample-Efficient LLM Reasoning, Xiaoliang Fu et.al., Paper: http://arxiv.org/abs/2602.17550
- 2026-02-08, MARTI-MARS $^2$ : Scaling Multi-Agent Self-Search via Reinforcement Learning for Code Generation, Shijie Wang et.al., Paper: http://arxiv.org/abs/2602.07848
- 2026-02-19, MARS: Margin-Aware Reward-Modeling with Self-Refinement, Payel Bhattacharjee et.al., Paper: http://arxiv.org/abs/2602.17658
- 2026-03-25, MARCH: Multi-Agent Reinforced Self-Check for LLM Hallucination, Zhuo Li et.al., Paper: http://arxiv.org/abs/2603.24579
- 2025-10-31, MARAG-R1: Beyond Single Retriever via Reinforcement-Learned Multi-Tool Agentic Retrieval, Qi Luo et.al., Paper: http://arxiv.org/abs/2510.27569
- 2026-03-06, MAPO: Mixed Advantage Policy Optimization for Long-Horizon Multi-Turn Dialogue, Naifan Zhang et.al., Paper: http://arxiv.org/abs/2603.06194
- 2025-12-26, MAI-UI Technical Report: Real-World Centric Foundation GUI Agents, Hanzhang Zhou et.al., Paper: http://arxiv.org/abs/2512.22047
- 2025-11-24, MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization, Boyuan Wu et.al., Paper: http://arxiv.org/abs/2511.19253
- 2025-10-21, MADR: MPC-guided Adversarial DeepReach, Ryan Teoh et.al., Paper: http://arxiv.org/abs/2510.18845
- 2026-02-16, MAC-AMP: A Closed-Loop Multi-Agent Collaboration System for Multi-Objective Antimicrobial Peptide Design, Gen Zhou et.al., Paper: http://arxiv.org/abs/2602.14926
- 2026-03-16, MA-VLCM: A Vision Language Critic Model for Value Estimation of Policies in Multi-Agent Team Settings, Shahil Shaik et.al., Paper: http://arxiv.org/abs/2603.15418
- 2025-12-04, Lévy sources in UrQMD in Ar+Sc collisions at SPS energies, Barnabas Porfy et.al., Paper: http://arxiv.org/abs/2512.05019
- 2025-10-21, Lyapunov-Aware Quantum-Inspired Reinforcement Learning for Continuous-Time Vehicle Control: A Feasibility Study, Nutkritta Kraipatthanapong et.al., Paper: http://arxiv.org/abs/2510.18852
- 2026-02-16, Lower Estimates for $L_1$ -Distortion of Transportation Cost Spaces, Chris Gartland et.al., Paper: http://arxiv.org/abs/2602.14852
- 2025-10-28, Low-lying baryon resonances from lattice QCD, Colin Morningstar et.al., Paper: http://arxiv.org/abs/2510.24596
- 2025-11-14, Low-energy enhancement in the magnetic dipole radiation of actinide nuclei, C. Rodgers et.al., Paper: http://arxiv.org/abs/2511.11565
- 2025-10-30, Low-Altitude UAV-Carried Movable Antenna for Joint Wireless Power Transfer and Covert Communications, Chuang Zhang et.al., Paper: http://arxiv.org/abs/2510.26628
- 2026-02-13, Look Inward to Explore Outward: Learning Temperature Policy from LLM Internal States via Hierarchical RL, Yixiao Zhou et.al., Paper: http://arxiv.org/abs/2602.13035
- 2025-12-23, LongVideoAgent: Multi-Agent Reasoning with Long Videos, Runtao Liu et.al., Paper: http://arxiv.org/abs/2512.20618
- 2026-02-24, LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding, Jihao Qiu et.al., Paper: http://arxiv.org/abs/2602.20913
- 2026-03-02, LongRLVR: Long-Context Reinforcement Learning Requires Verifiable Context Rewards, Guanzheng Chen et.al., Paper: http://arxiv.org/abs/2603.02146
- 2025-12-08, LongCat-Image Technical Report, Meituan LongCat Team et.al., Paper: http://arxiv.org/abs/2512.07584
- 2026-01-23, LongCat-Flash-Thinking-2601 Technical Report, Meituan LongCat Team et.al., Paper: http://arxiv.org/abs/2601.16725
- 2026-03-22, LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning, Jianing Wang et.al., Paper: http://arxiv.org/abs/2603.21065
- 2025-12-11, Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving, Songyang Gao et.al., Paper: http://arxiv.org/abs/2512.10739
- 2026-03-13, Long-form RewardBench: Evaluating Reward Models for Long-form Generation, Hui Huang et.al., Paper: http://arxiv.org/abs/2603.12963
- 2026-03-10, Long-Run Conditional Value-at-Risk Reinforcement Learning, Qixin Wang et.al., Paper: http://arxiv.org/abs/2603.09734
- 2026-03-17, Long-Horizon Traffic Forecasting via Incident-Aware Conformal Spatio-Temporal Transformers, Mayur Patil et.al., Paper: http://arxiv.org/abs/2603.16857
- 2026-02-10, Long Chain-of-Thought Compression via Fine-Grained Group Policy Optimization, Xinchen Han et.al., Paper: http://arxiv.org/abs/2602.10048
- 2026-01-20, Log-optimality with small liability stream, Michail Anthropelos et.al., Paper: http://arxiv.org/abs/2601.14139
- 2026-02-24, Localized Dynamics-Aware Domain Adaption for Off-Dynamics Offline Reinforcement Learning, Zhangjie Xia et.al., Paper: http://arxiv.org/abs/2602.21072
- 2025-10-20, Local Coherence or Global Validity? Investigating RLVR Traces in Math Domains, Soumya Rani Samineni et.al., Paper: http://arxiv.org/abs/2510.18176
- 2026-04-02, Lithium Droplet Transport in Tokamak Edge Plasmas, A. Diaw et.al., Paper: http://arxiv.org/abs/2604.02100
- 2026-03-16, Listening to the Echo: User-Reaction Aware Policy Optimization via Scalar-Verbal Hybrid Reinforcement Learning, Jing Ye et.al., Paper: http://arxiv.org/abs/2603.15434
- 2026-03-12, Linking Perception, Confidence and Accuracy in MLLMs, Yuetian Du et.al., Paper: http://arxiv.org/abs/2603.12149
- 2025-11-18, Linear Combinations of Logarithms of $L$ -functions over Function Fields at Microscopic Shifts and Beyond, Fatma Çiçek et.al., Paper: http://arxiv.org/abs/2511.14563
- 2026-01-13, Linear Canonical-Ensemble Quantum Monte Carlo: From Dilute Fermi Gas to Flat-Band Ferromagnetism, Tu Hong et.al., Paper: http://arxiv.org/abs/2601.08552
- 2026-03-25, Likelihood hacking in probabilistic program synthesis, Jacek Karwowski et.al., Paper: http://arxiv.org/abs/2603.24126
- 2026-02-25, LightSim: A Lightweight Cell Transmission Model Simulator for Traffic Signal Control Research, Haoran Su et.al., Paper: http://arxiv.org/abs/2602.21852
- 2026-03-17, Lifting the fog - a case for non-reversible “lifted” Markov chains, Gabriele Tartero et.al., Paper: http://arxiv.org/abs/2603.16855
- 2026-03-11, Lifelong Imitation Learning with Multimodal Latent Replay and Incremental Adjustment, Fanqi Yu et.al., Paper: http://arxiv.org/abs/2603.10929
- 2026-02-19, LexiSafe: Offline Safe Reinforcement Learning with Lexicographic Safety-Reward Hierarchy, Hsin-Jung Yang et.al., Paper: http://arxiv.org/abs/2602.17312
- 2025-11-24, Leveraging Spatiotemporal Graph Neural Networks for Multi-Store Sales Forecasting, Manish Singh et.al., Paper: http://arxiv.org/abs/2511.19267
- 2025-11-24, Leveraging LLMs for reward function design in reinforcement learning control tasks, Franklin Cardenoso et.al., Paper: http://arxiv.org/abs/2511.19355
- 2025-12-23, Leveraging High-Fidelity Digital Models and Reinforcement Learning for Mission Engineering: A Case Study of Aerial Firefighting Under Perfect Information, İbrahim Oğuz Çetinkaya et.al., Paper: http://arxiv.org/abs/2512.20589
- 2025-10-20, Leveraging Group Relative Policy Optimization to Advance Large Language Models in Traditional Chinese Medicine, Jiacheng Xie et.al., Paper: http://arxiv.org/abs/2510.17402
- 2026-01-28, Less is More: Clustered Cross-Covariance Control for Offline RL, Nan Qiao et.al., Paper: http://arxiv.org/abs/2601.20765
- 2026-02-19, Lepton energy scale and resolution corrections based on the minimization of an analytical likelihood: IJazZ2.0, F. Couderc et.al., Paper: http://arxiv.org/abs/2602.17300
- 2026-01-23, Length spectrum rigidity and flexibility of spheres of revolution with one equator, Alberto Abbondandolo et.al., Paper: http://arxiv.org/abs/2601.16804
- 2025-12-16, Leggett’s bound and superfluidity in strongly interacting bosons, Lorenzo Pizzino et.al., Paper: http://arxiv.org/abs/2512.14346
- 2026-02-26, Learning-based Multi-agent Race Strategies in Formula 1, Giona Fieni et.al., Paper: http://arxiv.org/abs/2602.23056
- 2025-12-17, Learning-Based Phase Shift Optimization of Liquid Crystal RIS in Dynamic mmWave Networks, Le Hao et.al., Paper: http://arxiv.org/abs/2512.15279
- 2026-02-18, Learning to unfold cloth: Scaling up world models to deformable object manipulation, Jack Rome et.al., Paper: http://arxiv.org/abs/2602.16675
- 2026-02-20, Learning to Tune Pure Pursuit in Autonomous Racing: Joint Lookahead and Steering-Gain Control with PPO, Mohamed Elgouhary et.al., Paper: http://arxiv.org/abs/2602.18386
- 2025-11-20, Learning to Think Fast and Slow for Visual Language Models, Chenyu Lin et.al., Paper: http://arxiv.org/abs/2511.16670
- 2025-12-11, Learning to Split: A Reinforcement-Learning-Guided Splitting Heuristic for Neural Network Verification, Maya Swisa et.al., Paper: http://arxiv.org/abs/2512.10747
- 2026-02-05, Learning to Share: Selective Memory for Efficient Parallel Agentic Systems, Joseph Fioresi et.al., Paper: http://arxiv.org/abs/2602.05965
- 2026-03-19, Learning to Self-Evolve, Xiaoyin Chen et.al., Paper: http://arxiv.org/abs/2603.18620
- 2026-02-17, Learning to Retrieve Navigable Candidates for Efficient Vision-and-Language Navigation, Shutian Gu et.al., Paper: http://arxiv.org/abs/2602.15724
- 2026-03-07, Learning to Reflect: Hierarchical Multi-Agent Reinforcement Learning for CSI-Free mmWave Beam-Focusing, Hieu Le et.al., Paper: http://arxiv.org/abs/2603.07370
- 2025-10-27, Learning to Reason Efficiently with Discounted Reinforcement Learning, Alex Ayoub et.al., Paper: http://arxiv.org/abs/2510.23486
- 2026-03-17, Learning to Present: Inverse Specification Rewards for Agentic Slide Generation, Karthik Ragunath Ananda Kumar et.al., Paper: http://arxiv.org/abs/2603.16839
- 2025-10-29, Learning to Plan & Schedule with Reinforcement-Learned Bimanual Robot Skills, Weikang Wan et.al., Paper: http://arxiv.org/abs/2510.25634
- 2026-03-22, Learning to Optimize Joint Source and RIS-assisted Channel Encoding for Multi-User Semantic Communication Systems, Haidong Wang et.al., Paper: http://arxiv.org/abs/2603.21097
- 2025-10-21, Learning to Navigate Under Imperfect Perception: Conformalised Segmentation for Safe Reinforcement Learning, Daniel Bethell et.al., Paper: http://arxiv.org/abs/2510.18485
- 2026-04-01, Learning to Hint for Reinforcement Learning, Yu Xia et.al., Paper: http://arxiv.org/abs/2604.00698
- 2026-03-29, Learning to Focus and Precise Cropping: A Reinforcement Learning Framework with Information Gaps and Grounding Loss for MLLMs, Xuanpu Zhao et.al., Paper: http://arxiv.org/abs/2603.27494
- 2025-10-28, Learning to Drive Safely with Hybrid Options, Bram De Cooman et.al., Paper: http://arxiv.org/abs/2510.24674
- 2026-01-22, Learning to Discover at Test Time, Mert Yuksekgonul et.al., Paper: http://arxiv.org/abs/2601.16175
- 2026-01-29, Learning to Dial-a-Ride: A Deep Graph Reinforcement Learning Approach to the Electric Dial-a-Ride Problem, Sten Elling Tingstad Jacobsen et.al., Paper: http://arxiv.org/abs/2601.22052
- 2025-10-20, Learning to Design Soft Hands using Reward Models, Xueqian Bai et.al., Paper: http://arxiv.org/abs/2510.17086
- 2026-02-09, Learning to Coordinate via Quantum Entanglement in Multi-Agent Reinforcement Learning, John Gardiner et.al., Paper: http://arxiv.org/abs/2602.08965
- 2026-02-13, Learning to Approximate Uniform Facility Location via Graph Neural Networks, Chendi Qian et.al., Paper: http://arxiv.org/abs/2602.13155
- 2026-02-09, Learning the Value Systems of Societies with Preference-based Multi-objective Reinforcement Learning, Andrés Holgado-Sánchez et.al., Paper: http://arxiv.org/abs/2602.08835
- 2026-01-26, Learning long term climate-resilient transport adaptation pathways under direct and indirect flood impacts using reinforcement learning, Miguel Costa et.al., Paper: http://arxiv.org/abs/2601.18586
- 2026-03-02, Learning from Synthetic Data Improves Multi-hop Reasoning, Anmol Kabra et.al., Paper: http://arxiv.org/abs/2603.02091
- 2025-11-19, Learning from Mistakes: Loss-Aware Memory Enhanced Continual Learning for LiDAR Place Recognition, Xufei Wang et.al., Paper: http://arxiv.org/abs/2511.15597
- 2026-01-13, Learning from Demonstrations via Capability-Aware Goal Sampling, Yuanlin Duan et.al., Paper: http://arxiv.org/abs/2601.08731
- 2026-02-12, Learning beyond Teacher: Generalized On-Policy Distillation with Reward Extrapolation, Wenkai Yang et.al., Paper: http://arxiv.org/abs/2602.12125
- 2025-11-14, Learning and Testing Convex Functions, Renato Ferreira Pinto et.al., Paper: http://arxiv.org/abs/2511.11498
- 2025-12-09, Learning and Editing Universal Graph Prompt Tuning via Reinforcement Learning, Jinfeng Xu et.al., Paper: http://arxiv.org/abs/2512.08763
- 2025-11-05, Learning Without Critics? Revisiting GRPO in Classical Reinforcement Learning Environments, Bryan L. M. de Oliveira et.al., Paper: http://arxiv.org/abs/2511.03527
- 2026-01-14, Learning Whole-Body Human-Humanoid Interaction from Human-Human Demonstrations, Wei-Jin Huang et.al., Paper: http://arxiv.org/abs/2601.09518
- 2026-03-17, Learning Whole-Body Control for a Salamander Robot, Mengze Tian et.al., Paper: http://arxiv.org/abs/2603.16683
- 2025-11-26, Learning When to Stop: Adaptive Latent Reasoning via Reinforcement Learning, Alex Ning et.al., Paper: http://arxiv.org/abs/2511.21581
- 2026-03-07, Learning When to Cooperate Under Heterogeneous Goals, Max Taylor-Davies et.al., Paper: http://arxiv.org/abs/2603.07253
- 2026-03-03, Learning When to Act or Refuse: Guarding Agentic Reasoning Models for Safe Multi-Step Tool Use, Aradhye Agarwal et.al., Paper: http://arxiv.org/abs/2603.03205
- 2026-03-12, Learning Visuomotor Policy for Multi-Robot Laser Tag Game, Kai Li et.al., Paper: http://arxiv.org/abs/2603.11980
- 2026-02-16, Learning User Interests via Reasoning and Distillation for Cross-Domain News Recommendation, Mengdan Zhu et.al., Paper: http://arxiv.org/abs/2602.15005
- 2025-10-22, Learning Upper Lower Value Envelopes to Shape Online RL: A Principled Approach, Sebastian Reboul et.al., Paper: http://arxiv.org/abs/2510.19528
- 2026-02-09, Learning To Sample From Diffusion Models Via Inverse Reinforcement Learning, Constant Bourdrez et.al., Paper: http://arxiv.org/abs/2602.08689
- 2025-12-03, Learning Steerable Clarification Policies with Collaborative Self-play, Jonathan Berant et.al., Paper: http://arxiv.org/abs/2512.04068
- 2025-10-31, Learning Soft Robotic Dynamics with Active Exploration, Hehui Zheng et.al., Paper: http://arxiv.org/abs/2510.27428
- 2026-02-20, Learning Smooth Time-Varying Linear Policies with an Action Jacobian Penalty, Zhaoming Xie et.al., Paper: http://arxiv.org/abs/2602.18312
- 2025-12-01, Learning Sim-to-Real Humanoid Locomotion in 15 Minutes, Younggyo Seo et.al., Paper: http://arxiv.org/abs/2512.01996
- 2025-12-19, Learning Safe Autonomous Driving Policies Using Predictive Safety Representations, Mahesh Keswani et.al., Paper: http://arxiv.org/abs/2512.17586
- 2025-11-24, Learning Robust Social Strategies with Large Language Models, Dereck Piche et.al., Paper: http://arxiv.org/abs/2511.19405
- 2026-02-03, Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation, Changze Lv et.al., Paper: http://arxiv.org/abs/2602.03619
- 2026-02-05, Learning Query-Aware Budget-Tier Routing for Runtime Agent Memory, Haozhen Zhang et.al., Paper: http://arxiv.org/abs/2602.06025
- 2026-02-09, Learning Potentials for Dynamic Matching and Application to Heart Transplantation, Itai Zilberstein et.al., Paper: http://arxiv.org/abs/2602.08878
- 2026-03-24, Learning Multi-Agent Local Collision-Avoidance for Collaborative Carrying tasks with Coupled Quadrupedal Robots, Francesca Bray et.al., Paper: http://arxiv.org/abs/2603.23278
- 2026-02-18, Learning Humanoid End-Effector Control for Open-Vocabulary Visual Loco-Manipulation, Runpei Dong et.al., Paper: http://arxiv.org/abs/2602.16705
- 2026-03-04, Learning Hip Exoskeleton Control Policy via Predictive Neuromusculoskeletal Simulation, Ilseung Park et.al., Paper: http://arxiv.org/abs/2603.04166
- 2025-12-22, Learning Generalizable Hand-Object Tracking from Synthetic Demonstrations, Yinhuai Wang et.al., Paper: http://arxiv.org/abs/2512.19583
- 2025-12-22, Learning General Policies with Policy Gradient Methods, Simon Ståhlberg et.al., Paper: http://arxiv.org/abs/2512.19366
- 2026-03-07, Learning From Failures: Efficient Reinforcement Learning Control with Episodic Memory, Chenyang Miao et.al., Paper: http://arxiv.org/abs/2603.07110
- 2026-02-27, Learning Flexible Job Shop Scheduling under Limited Buffers and Material Kitting Constraints, Shishun Zhang et.al., Paper: http://arxiv.org/abs/2602.24180
- 2026-02-18, Learning Distributed Equilibria in Linear-Quadratic Stochastic Differential Games: An $α$ -Potential Approach, Philipp Plank et.al., Paper: http://arxiv.org/abs/2602.16555
- 2026-03-31, Learning Diagnostic Reasoning for Decision Support in Toxicology, Nico Oberländer et.al., Paper: http://arxiv.org/abs/2603.29608
- 2025-12-01, Learning Dexterous Manipulation Skills from Imperfect Simulations, Elvis Hsieh et.al., Paper: http://arxiv.org/abs/2512.02011
- 2026-01-29, Learning Decentralized LLM Collaboration with Multi-Agent Actor Critic, Shuo Liu et.al., Paper: http://arxiv.org/abs/2601.21972
- 2025-12-11, Learning Controllable and Diverse Player Behaviors in Multi-Agent Environments, Atahan Cilan et.al., Paper: http://arxiv.org/abs/2512.10835
- 2026-03-20, Learning Adaptive Parameter Policies for Nonlinear Bayesian Filtering, Ondrej Straka et.al., Paper: http://arxiv.org/abs/2603.19910
- 2026-03-11, Learning Adaptive Force Control for Contact-Rich Sample Scraping with Heterogeneous Materials, Cenk Cetin et.al., Paper: http://arxiv.org/abs/2603.10979
- 2025-12-01, Learned-Rule-Augmented Large Language Model Evaluators, Jie Meng et.al., Paper: http://arxiv.org/abs/2512.01958
- 2026-03-19, Learn for Variation: Variationally Guided AAV Trajectory Learning in Differentiable Environments, Xiucheng Wang et.al., Paper: http://arxiv.org/abs/2603.18853
- 2025-12-22, LeLaR: The First In-Orbit Demonstration of an AI-Based Satellite Attitude Controller, Kirill Djebko et.al., Paper: http://arxiv.org/abs/2512.19576
- 2025-12-29, Le Cam Distortion: A Decision-Theoretic Framework for Robust Transfer Learning, Deniz Akdemir et.al., Paper: http://arxiv.org/abs/2512.23617
- 2025-11-21, Law-Strength Frontiers and a No-Free-Lunch Result for Law-Seeking Reinforcement Learning on Volatility Law Manifolds, Jian’an Zhang et.al., Paper: http://arxiv.org/abs/2511.17304
- 2026-03-23, Lattice study of the critical bubble in $\mathrm{SU(8)}$ deconfinement transition, Kari Rummukainen et.al., Paper: http://arxiv.org/abs/2603.22088
- 2026-03-12, LatentGeo: Learnable Auxiliary Constructions in Latent Space for Multimodal Geometric Reasoning, Haiying Xu et.al., Paper: http://arxiv.org/abs/2603.12166
- 2026-03-05, Latent Wasserstein Adversarial Imitation Learning, Siqi Yang et.al., Paper: http://arxiv.org/abs/2603.05440
- 2026-03-05, Latent Policy Steering through One-Step Flow Policies, Hokyun Im et.al., Paper: http://arxiv.org/abs/2603.05296
- 2026-02-17, Latency-aware Human-in-the-Loop Reinforcement Learning for Semantic Communications, Peizheng Li et.al., Paper: http://arxiv.org/abs/2602.15640
- 2025-12-26, Latency-Optimal Cache-aided Multicast Streaming via Forward-Backward Reinforcement Learning, Mohsen Amidzadeh et.al., Paper: http://arxiv.org/abs/2512.21954
- 2026-03-06, Large Wave Direction Data Modeling Using Wrapped Spatial Gaussian Markov Random Fields, Arnab Hazra et.al., Paper: http://arxiv.org/abs/2603.06293
- 2026-01-12, Large Language Models for Physics Instrument Design, Sara Zoccheddu et.al., Paper: http://arxiv.org/abs/2601.07580
- 2026-03-25, Large Language Model Guided Incentive Aware Reward Design for Cooperative Multi-Agent Reinforcement Learning, Dogan Urgun et.al., Paper: http://arxiv.org/abs/2603.24324
- 2026-03-26, LanteRn: Latent Visual Structured Reasoning, André G. Viveiros et.al., Paper: http://arxiv.org/abs/2603.25629
- 2026-01-21, Language-Coupled Reinforcement Learning for Multilingual Retrieval-Augmented Generation, Rui Qi et.al., Paper: http://arxiv.org/abs/2601.14896
- 2026-04-01, LangMARL: Natural Language Multi-Agent Reinforcement Learning, Huaiyuan Yao et.al., Paper: http://arxiv.org/abs/2604.00722
- 2025-12-22, LacaDM: A Latent Causal Diffusion Model for Multiobjective Reinforcement Learning, Xueming Yan et.al., Paper: http://arxiv.org/abs/2512.19516
- 2026-03-02, LaST-VLA: Thinking in Latent Spatio-Temporal Space for Vision-Language-Action in Autonomous Driving, Yuechen Luo et.al., Paper: http://arxiv.org/abs/2603.01928
- 2025-12-24, LSTM-Based Modeling and Reinforcement Learning Control of a Magnetically Actuated Catheter, Arya Rashidinejad Meibodi et.al., Paper: http://arxiv.org/abs/2512.21063
- 2025-12-02, LORE: A Large Generative Model for Search Relevance, Chenji Lu et.al., Paper: http://arxiv.org/abs/2512.03025
- 2026-02-09, LLaDA2.1: Speeding Up Text Diffusion via Token Editing, Tiwei Bie et.al., Paper: http://arxiv.org/abs/2602.08676
- 2025-10-20, LLMs Encode How Difficult Problems Are, William Lugoloobi et.al., Paper: http://arxiv.org/abs/2510.18147
- 2026-01-22, LLM-in-Sandbox Elicits General Agentic Intelligence, Daixuan Cheng et.al., Paper: http://arxiv.org/abs/2601.16206
- 2026-03-14, LLM-Guided Safe Reinforcement Learning for Energy System Topology Reconfiguration, Zongyan Zhang et.al., Paper: http://arxiv.org/abs/2603.14018
- 2026-03-14, LLM-Guided Reinforcement Learning for Audio-Visual Speech Enhancement, Chih-Ning Chen et.al., Paper: http://arxiv.org/abs/2603.13952
- 2026-01-27, LLM-Enhanced Reinforcement Learning for Long-Term User Satisfaction in Interactive Recommendation, Chongjun Xia et.al., Paper: http://arxiv.org/abs/2601.19585
- 2025-12-24, LLM-Empowered Agentic AI for QoE-Aware Network Slicing Management in Industrial IoT, Xudong Wang et.al., Paper: http://arxiv.org/abs/2512.20997
- 2025-11-24, LLM-Driven Stationarity-Aware Expert Demonstrations for Multi-Agent Reinforcement Learning in Mobile Systems, Tianyang Duan et.al., Paper: http://arxiv.org/abs/2511.19368
- 2025-12-24, LLM Swiss Round: Aggregating Multi-Benchmark Performance via Competitive Swiss-System Dynamics, Jiashuo Liu et.al., Paper: http://arxiv.org/abs/2512.21010
- 2025-12-23, LEAD: Minimizing Learner-Expert Asymmetry in End-to-End Driving, Long Nguyen et.al., Paper: http://arxiv.org/abs/2512.20563
- 2026-03-05, LBM: Hierarchical Large Auto-Bidding Model via Reasoning and Acting, Yewen Li et.al., Paper: http://arxiv.org/abs/2603.05134
- 2026-03-25, LATS: Large Language Model Assisted Teacher-Student Framework for Multi-Agent Reinforcement Learning in Traffic Signal Control, Yifeng Zhang et.al., Paper: http://arxiv.org/abs/2603.24361
- 2026-03-09, LAR-MoE: Latent-Aligned Routing for Mixture of Experts in Robotic Imitation Learning, Ariel Rodriguez et.al., Paper: http://arxiv.org/abs/2603.08476
- 2026-02-23, LAD: Learning Advantage Distribution for Reasoning, Wendi Li et.al., Paper: http://arxiv.org/abs/2602.20132
- 2025-12-05, LA-RL: Language Action-guided Reinforcement Learning with Safety Guarantees for Autonomous Highway Driving, Yiming Shu et.al., Paper: http://arxiv.org/abs/2512.05686
- 2026-02-10, L’Hopital rules for complex-valued functions in higher dimensions, Albert Chern et.al., Paper: http://arxiv.org/abs/2602.09958
- 2025-11-20, KrkNLO matching and phenomenology for vector boson processes, Pratixan Sarmah et.al., Paper: http://arxiv.org/abs/2511.16605
- 2025-12-23, Koopman for stochastic dynamics: error bounds for kernel extended dynamic mode decomposition, Maximiliano Hertel et.al., Paper: http://arxiv.org/abs/2512.20247
- 2026-02-20, Kolmogorov-Type Maximal Inequalities for Independent and Dependent Negative Binomial Random Variables: Sharp Bounds, Sub-Exponential Refinements, and Applications to Overdispersed Count Data, Aristides V. Doumas et.al., Paper: http://arxiv.org/abs/2602.18184
- 2025-11-05, Knowledge-Augmented Question Error Correction for Chinese Question Answer System with QuestionRAG, Longpeng Qiu et.al., Paper: http://arxiv.org/abs/2511.03410
- 2026-01-16, Knowledge is Not Enough: Injecting RL Skills for Continual Adaptation, Pingzhi Tang et.al., Paper: http://arxiv.org/abs/2601.11258
- 2026-01-21, Knowledge Graphs are Implicit Reward Models: Path-Derived Signals Enable Compositional Reasoning, Yuval Kansal et.al., Paper: http://arxiv.org/abs/2601.15160
- 2026-03-05, Knowledge Divergence and the Value of Debate for Scalable Oversight, Robin Young et.al., Paper: http://arxiv.org/abs/2603.05293
- 2026-03-10, Kinodynamic Motion Retargeting for Humanoid Locomotion via Multi-Contact Whole-Body Trajectory Optimization, Xiaoyu Zhang et.al., Paper: http://arxiv.org/abs/2603.09956
- 2026-03-02, Kinetic energy fluctuations and specific heat in generalized ensembles, Sergio Davis et.al., Paper: http://arxiv.org/abs/2603.02168
- 2026-03-24, Kinetic Langevin Splitting Schemes for Constrained Sampling, Neil K. Chada et.al., Paper: http://arxiv.org/abs/2603.23397
- 2026-03-07, Kinematics-Aware Latent World Models for Data-Efficient Autonomous Driving, Jiazhuo Li et.al., Paper: http://arxiv.org/abs/2603.07264
- 2025-12-19, Kinematics-Aware Diffusion Policy with Consistent 3D Observation and Action Space for Whole-Arm Robotic Manipulation, Kangchen Lv et.al., Paper: http://arxiv.org/abs/2512.17568
- 2026-03-06, Kinematically Coherent Multiphase Galactic Winds in Star-Forming Galaxies Revealed by Unified Radiative Transfer Modeling of UV Emission and Absorption Lines, Zhihui Li et.al., Paper: http://arxiv.org/abs/2603.06546
- 2025-10-30, Kimi Linear: An Expressive, Efficient Attention Architecture, Kimi Team et.al., Paper: http://arxiv.org/abs/2510.26692
- 2026-01-22, Keyframe-Based Feed-Forward Visual Odometry, Weichen Dai et.al., Paper: http://arxiv.org/abs/2601.16020
- 2026-01-13, Kernel Learning for Regression via Quantum Annealing Based Spectral Sampling, Yasushi Hasegawa et.al., Paper: http://arxiv.org/abs/2601.08724
- 2026-03-21, Karhunen-Loève Expansion for Fluid Antenna Systems: Information-Theoretic Optimal Channel Compression and Outage Analysis, Tuo Wu et.al., Paper: http://arxiv.org/abs/2603.20841
- 2026-03-17, Kamino: GPU-based Massively Parallel Simulation of Multi-Body Systems with Challenging Topologies, Vassilios Tsounis et.al., Paper: http://arxiv.org/abs/2603.16536
- 2025-10-23, KL-Regularized Reinforcement Learning is Designed to Mode Collapse, Anthony GX-Chen et.al., Paper: http://arxiv.org/abs/2510.20817
- 2026-03-05, KARL: Knowledge Agents via Reinforcement Learning, Jonathan D. Chang et.al., Paper: http://arxiv.org/abs/2603.05218
- 2026-01-20, KAGE-Bench: Fast Known-Axis Visual Generalization Evaluation for Reinforcement Learning, Egor Cherepanov et.al., Paper: http://arxiv.org/abs/2601.14232
- 2026-03-24, Joint Task Orchestration and Resource Optimization for SC3 Closed Loop in 6G Networks, Xinran Fang et.al., Paper: http://arxiv.org/abs/2603.23217
- 2026-02-04, Joint Sleep Mode Activation and Load Balancing with Dynamic Cell Load: A Combinatorial Bandit Approach, Wajahat Bashir Gilkar et.al., Paper: http://arxiv.org/abs/2602.04808
- 2025-12-29, Joint Link Adaptation and Device Scheduling Approach for URLLC Industrial IoT Network: A DRL-based Method with Bayesian Optimization, Wei Gao et.al., Paper: http://arxiv.org/abs/2512.23493
- 2025-12-23, Joint Design of Embedded Index Coding and Beamforming for MIMO-based Distributed Computing via Multi-Agent Reinforcement Learning, Heekang Song et.al., Paper: http://arxiv.org/abs/2512.20201
- 2026-03-10, Joint Bayesian analysis of soft and high- $p_\perp$ probes yields tighter constraints on QGP properties, Marko Djordjevic et.al., Paper: http://arxiv.org/abs/2603.09584
- 2026-01-20, Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow, Haocheng Xi et.al., Paper: http://arxiv.org/abs/2601.14243
- 2026-02-23, Janus-Q: End-to-End Event-Driven Trading via Hierarchical-Gated Reward Modeling, Xiang Li et.al., Paper: http://arxiv.org/abs/2602.19919
- 2025-12-01, Jacobi Forms of Affine Weight in Higher Cogenus and Nearly Holomorphic Functions, Jan Feldmann et.al., Paper: http://arxiv.org/abs/2512.01926
- 2025-12-31, Iterative Deployment Improves Planning Skills in LLMs, Augusto B. Corrêa et.al., Paper: http://arxiv.org/abs/2512.24940
- 2025-12-11, Iterative Compositional Data Generation for Robot Control, Anh-Quan Pham et.al., Paper: http://arxiv.org/abs/2512.10891
- 2025-11-21, Iterating marginalized Bayes maps for likelihood maximization with application to nonlinear panel models, Jesse Wheeler et.al., Paper: http://arxiv.org/abs/2511.17438
- 2025-11-10, IterResearch: Rethinking Long-Horizon Agents via Markovian State Reconstruction, Guoxin Chen et.al., Paper: http://arxiv.org/abs/2511.07327
- 2026-03-12, IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL, Zhoujun Cheng et.al., Paper: http://arxiv.org/abs/2603.12151
- 2026-01-23, Is the diurnal pattern sufficient to explain intraday variation in volatility? A nonparametric assessment, Kim Christensen et.al., Paper: http://arxiv.org/abs/2601.16613
- 2026-01-29, Investigation into using stochastic embedding representations for evaluating the trustworthiness of the Fréchet Inception Distance, Ciaran Bench et.al., Paper: http://arxiv.org/abs/2601.21979
- 2026-02-23, Investigating polarization signatures from GRB models, Pieter vd Merwe et.al., Paper: http://arxiv.org/abs/2602.19921
- 2026-01-29, Investigating Batch Inference in a Sequential Monte Carlo Framework for Neural Networks, Andrew Millard et.al., Paper: http://arxiv.org/abs/2601.21983
- 2026-01-12, Inversion of Sea Ice Spectral Albedo to Estimate Under-Ice Transmittance, Christophe Perron et.al., Paper: http://arxiv.org/abs/2601.07672
- 2025-12-05, Invariant Price of Anarchy: a Metric for Welfarist Traffic Control, Ilia Shilov et.al., Paper: http://arxiv.org/abs/2512.05843
- 2026-02-25, Intrusive and Non-Intrusive Model Order Reduction for Airborne Contaminant Transport: Comparative Analysis and Uncertainty Quantification, Lisa Kühn et.al., Paper: http://arxiv.org/abs/2602.21996
- 2026-03-16, Introduction to the artificial neural network-based variational Monte Carlo method, William Freitas et.al., Paper: http://arxiv.org/abs/2603.15460
- 2026-02-12, Intrinsic-Energy Joint Embedding Predictive Architectures Induce Quasimetric Spaces, Anthony Kobanda et.al., Paper: http://arxiv.org/abs/2602.12245
- 2025-11-14, Intrinsic Dimension Estimation for Radio Galaxy Zoo using Diffusion Models, Joan Font-Quer Roset et.al., Paper: http://arxiv.org/abs/2511.11490
- 2026-02-02, Interpreting and Controlling LLM Reasoning through Integrated Policy Gradient, Changming Li et.al., Paper: http://arxiv.org/abs/2602.02313
- 2026-03-20, Interpreting Reinforcement Learning Model Behavior via Koopman with Control, William T. Redman et.al., Paper: http://arxiv.org/abs/2603.19968
- 2025-11-24, Interpreting GFlowNets for Drug Discovery: Extracting Actionable Insights for Medicinal Chemistry, Amirtha Varshini A S et.al., Paper: http://arxiv.org/abs/2511.19264
- 2026-03-18, Interpreting Context-Aware Human Preferences for Multi-Objective Robot Navigation, Tharun Sethuraman et.al., Paper: http://arxiv.org/abs/2603.17510
- 2025-12-22, Interpretable Hybrid Deep Q-Learning Framework for IoT-Based Food Spoilage Prediction with Synthetic Data Generation and Hardware Validation, Isshaan Singh et.al., Paper: http://arxiv.org/abs/2512.19361
- 2026-02-11, Interpretable Attention-Based Multi-Agent PPO for Latency Spike Resolution in 6G RAN Slicing, Kavan Fatehi et.al., Paper: http://arxiv.org/abs/2602.11076
- 2026-01-06, Interpretable All-Type Audio Deepfake Detection with Audio LLMs via Frequency-Time Reinforcement Learning, Yuankun Xie et.al., Paper: http://arxiv.org/abs/2601.02983
- 2025-12-04, Internal superfluid response and torque evolution in the giant glitch of PSR J1718-3718, Peng Liu et.al., Paper: http://arxiv.org/abs/2512.04972
- 2025-12-12, Intergalactic magnetic field lower limits up to the redshift $z\approx3$, Ievgen Vovk et.al., Paper: http://arxiv.org/abs/2512.11397
- 2026-02-16, Interactionless Inverse Reinforcement Learning: A Data-Centric Framework for Durable Alignment, Elias Malomgré et.al., Paper: http://arxiv.org/abs/2602.14844
- 2026-02-20, Interacting safely with cyclists using Hamilton-Jacobi reachability and reinforcement learning, Aarati Andrea Noronha et.al., Paper: http://arxiv.org/abs/2602.18097
- 2025-10-31, Interact-RAG: Reason and Interact with the Corpus, Beyond Black-Box Retrieval, Yulong Hui et.al., Paper: http://arxiv.org/abs/2510.27566
- 2026-02-05, InterPrior: Scaling Generative Control for Physics-Based Human-Object Interactions, Sirui Xu et.al., Paper: http://arxiv.org/abs/2602.06035
- 2026-01-09, Intelligent Singularity Avoidance in UR10 Robotic Arm Path Planning Using Hybrid Fuzzy Logic and Reinforcement Learning, Sheng-Kai Chen et.al., Paper: http://arxiv.org/abs/2601.05836
- 2026-01-02, Integrating Multi-Armed Bandit, Active Learning, and Distributed Computing for Scalable Optimization, Foo Hui-Mean et.al., Paper: http://arxiv.org/abs/2601.00615
- 2026-03-09, Integrating Lagrangian Neural Networks into the Dyna Framework for Reinforcement Learning, Shreya Das et.al., Paper: http://arxiv.org/abs/2603.08468
- 2025-12-03, Integrating High Performance In-Memory Data Streaming and In-Situ Visualization in Hybrid MPI+OpenMP PIC MC Simulations Towards Exascale, Jeremy J. Williams et.al., Paper: http://arxiv.org/abs/2512.03914
- 2026-03-26, Integrating Deep RL and Bayesian Inference for ObjectNav in Mobile Robotics, João Castelo-Branco et.al., Paper: http://arxiv.org/abs/2603.25366
- 2026-03-12, Integrated Online Monitoring and Adaption of Process Model Predictive Controllers, Samuel Mallick et.al., Paper: http://arxiv.org/abs/2603.12187
- 2026-01-15, Institutional AI: A Governance Framework for Distributional AGI Safety, Federico Pierucci et.al., Paper: http://arxiv.org/abs/2601.10599
- 2025-11-13, Instella: Fully Open Language Models with Stellar Performance, Jiang Liu et.al., Paper: http://arxiv.org/abs/2511.10628
- 2025-12-02, Insights into Extragalactic Background Light constraints with MAGIC archival data, R. Grau et.al., Paper: http://arxiv.org/abs/2512.02880
- 2026-01-08, Inside Out: Evolving User-Centric Core Memory Trees for Long-Term Personalized Dialogue Systems, Jihao Zhao et.al., Paper: http://arxiv.org/abs/2601.05171
- 2026-02-06, InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning, Yuchen Yan et.al., Paper: http://arxiv.org/abs/2602.06960
- 2026-03-18, InfoDensity: Rewarding Information-Dense Traces for Efficient Reasoning, Chengwei Wei et.al., Paper: http://arxiv.org/abs/2603.17310
- 2026-03-10, Influencing LLM Multi-Agent Dialogue via Policy-Parameterized Prompts, Hongbo Bo et.al., Paper: http://arxiv.org/abs/2603.09890
- 2026-02-26, Influence of Hydrogen on Dislocation Relaxation in BCC Iron: Atomistic Mechanisms and Implications, Sanjay Manda et.al., Paper: http://arxiv.org/abs/2602.22995
- 2025-12-04, Inflationary relics from an Ultra-Slow-Roll plateau, Albert Escrivà et.al., Paper: http://arxiv.org/abs/2512.04986
- 2026-01-07, InfiniteWeb: Scalable Web Environment Synthesis for GUI Agent Training, Ziyun Zhang et.al., Paper: http://arxiv.org/abs/2601.04126
- 2026-03-20, Infinite-dimensional spherical-radial decomposition for probabilistic functions, with application to constrained optimal control and Gaussian process regression, Kewei Wang et.al., Paper: http://arxiv.org/abs/2603.19907
- 2025-11-17, Infinite-Horizon Optimal Control of Jump-Diffusion Models for Pollution-Dependent Disasters, Daria Sakhanda et.al., Paper: http://arxiv.org/abs/2511.13568
- 2025-11-25, Inferring the Impacts of Baryonic Feedback from Kinetic Sunyaev-Zeldovich Cross-Correlations, Alex Laguë et.al., Paper: http://arxiv.org/abs/2511.20595
- 2025-10-29, Inference on Welfare and Value Functionals under Optimal Treatment Assignment, Xiaohong Chen et.al., Paper: http://arxiv.org/abs/2510.25607
- 2025-10-20, Inference of Deterministic Finite Automata via Q-Learning, Elaheh Hosseinkhani et.al., Paper: http://arxiv.org/abs/2510.17386
- 2026-01-23, Inference from high-frequency data: A subsampling approach, Kim Christensen et.al., Paper: http://arxiv.org/abs/2601.16668
- 2026-01-07, IndexTTS 2.5 Technical Report, Yunpei Li et.al., Paper: http://arxiv.org/abs/2601.03888
- 2025-12-26, Index-Tracking Portfolio Construction and Rebalancing under Bayesian Sparse Modelling and Uncertainty Quantification, Dimitrios Roxanas et.al., Paper: http://arxiv.org/abs/2512.22109
- 2026-03-12, Increasing intelligence in AI agents can worsen collective outcomes, Neil F. Johnson et.al., Paper: http://arxiv.org/abs/2603.12129
- 2026-02-20, Inclusive Ranking of Indian States via Bayesian Bradley-Terry Model, Arshi Rizvi et.al., Paper: http://arxiv.org/abs/2602.18150
- 2026-01-09, Inclusion of Inter-crystal Scattering in PET: Analytical Models and Dedicated Reconstruction, Jorge Roser et.al., Paper: http://arxiv.org/abs/2601.05717
- 2025-12-16, Incentivizing Tool-augmented Thinking with Images for Medical Image Analysis, Yankai Jiang et.al., Paper: http://arxiv.org/abs/2512.14157
- 2025-11-21, InTAct: Interval-based Task Activation Consolidation for Continual Learning, Patryk Krukowski et.al., Paper: http://arxiv.org/abs/2511.17439
- 2026-01-20, InT: Self-Proposed Interventions Enable Credit Assignment in LLM Reasoning, Matthew Y. R. Yang et.al., Paper: http://arxiv.org/abs/2601.14209
- 2026-01-06, In-Context Reinforcement Learning through Bayesian Fusion of Context and Value Prior, Anaïs Berkes et.al., Paper: http://arxiv.org/abs/2601.03015
- 2026-02-13, In-Context Autonomous Network Incident Response: An End-to-End Large Language Model Agent Approach, Yiran Gao et.al., Paper: http://arxiv.org/abs/2602.13156
- 2021-11-30, Improving Zero-shot Generalization in Offline Reinforcement Learning using Generalized Similarity Functions, Bogdan Mazoure et.al., Paper: http://arxiv.org/abs/2111.14629
- 2026-03-14, Improving Visual Reasoning with Iterative Evidence Refinement, Zeru Shi et.al., Paper: http://arxiv.org/abs/2603.14117
- 2026-01-21, Improving Regret Approximation for Unsupervised Dynamic Environment Generation, Harry Mead et.al., Paper: http://arxiv.org/abs/2601.14957
- 2026-01-27, Improving Policy Exploitation in Online Reinforcement Learning with Instant Retrospect Action, Gong Gao et.al., Paper: http://arxiv.org/abs/2601.19720
- 2026-02-25, Improving Parametric Knowledge Access in Reasoning Language Models, Melody Ma et.al., Paper: http://arxiv.org/abs/2602.22193
- 2026-03-25, Improving Lean4 Autoformalization via Cycle Consistency Fine-tuning, Arsen Shebzukhov et.al., Paper: http://arxiv.org/abs/2603.24372
- 2026-02-12, Improving HPC Code Generation Capability of LLMs via Online Reinforcement Learning with Real-Machine Benchmark Rewards, Ryo Mikasa et.al., Paper: http://arxiv.org/abs/2602.12049
- 2025-10-21, Improved thermonuclear rate of $^{42}$Ti($p$,$γ$)$^{43}$ V and its astrophysical implication in rp-process, S. Q. Hou et.al., Paper: http://arxiv.org/abs/2510.18531
- 2026-01-23, Improved Kelbg Potentials for $Z>1$ and Application to Carbon Plasmas, Heather D. Whitley et.al., Paper: http://arxiv.org/abs/2601.16794
- 2026-01-22, Improve the autonomy of the SE2(3) group based Extended Kalman Filter for Integrated Navigation: Application, Jiarui Cui et.al., Paper: http://arxiv.org/abs/2601.16078
- 2026-04-02, Importance sampling for Bayesian inference: polynomial-dimension dependent error bounds, Fabián González et.al., Paper: http://arxiv.org/abs/2604.02094
- 2026-03-24, ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment, Hao Wang et.al., Paper: http://arxiv.org/abs/2603.23184
- 2026-04-01, Implementation and Workflows for INLA-Based Approximate Bayesian Structural Equation Modelling, Haziq Jamil et.al., Paper: http://arxiv.org/abs/2604.00671
- 2026-02-26, Impacts of Aggregation on Model Diversity and Consumer Utility, Kate Donahue et.al., Paper: http://arxiv.org/abs/2602.23293
- 2025-12-11, Impact of geometry on 1D molecular-kinetics simulations of acoustic-gravity wave propagation into the exosphere, Jose A. Perez Chavez et.al., Paper: http://arxiv.org/abs/2512.10887
- 2026-01-12, Impact of Nuclear Reaction Rates on Calcium Production in Population III Stars: A Global Analysis, Qing Wang et.al., Paper: http://arxiv.org/abs/2601.07623
- 2026-03-10, Impact of Markov Decision Process Design on Sim-to-Real Reinforcement Learning, Tatjana Krau et.al., Paper: http://arxiv.org/abs/2603.09427
- 2026-03-09, Impact of Connectivity on Laplacian Representations in Reinforcement Learning, Tommaso Giorgi et.al., Paper: http://arxiv.org/abs/2603.08558
- 2025-11-05, Imitation Learning in the Deep Learning Era: A Novel Taxonomy and Recent Advances, Iason Chrysomallis et.al., Paper: http://arxiv.org/abs/2511.03565
- 2025-12-15, Image Diffusion Preview with Consistency Solver, Fu-Yun Wang et.al., Paper: http://arxiv.org/abs/2512.13592
- 2026-01-13, Identifying Latent Intentions via Inverse Reinforcement Learning in Repeated Linear Public Good Games, Carina I. Hausladen et.al., Paper: http://arxiv.org/abs/2601.08803
- 2025-12-23, Identifying Appropriately-Sized Services with Deep Reinforcement Learning, Syeda Tasnim Fabiha et.al., Paper: http://arxiv.org/abs/2512.20381
- 2025-11-24, Identification, estimation and inference in Panel Vector Autoregressions using external instruments, Raimondo Pala et.al., Paper: http://arxiv.org/abs/2511.19372
- 2025-11-04, Identification and Estimation of Continuous-Time Dynamic Discrete Choice Games, Jason R. Blevins et.al., Paper: http://arxiv.org/abs/2511.02701
- 2026-01-02, IRPO: Scaling the Bradley-Terry Model via Reinforcement Learning, Haonan Song et.al., Paper: http://arxiv.org/abs/2601.00677
- 2026-01-30, IRL-DAL: Safe and Adaptive Trajectory Planning for Autonomous Driving via Energy-Guided Diffusion Models, Seyed Ahmad Hosseini Miangoleh et.al., Paper: http://arxiv.org/abs/2601.23266
- 2026-02-19, IRIS: Learning-Driven Task-Specific Cinema Robot Arm for Visuomotor Motion Control, Qilong Cheng et.al., Paper: http://arxiv.org/abs/2602.17537
- 2025-12-09, IPPO Learns the Game, Not the Team: A Study on Generalization in Heterogeneous Agent Teams, Ryan LeRoy et.al., Paper: http://arxiv.org/abs/2512.08877
- 2026-03-04, IPD: Boosting Sequential Policy with Imaginary Planning Distillation in Offline Reinforcement Learning, Yihao Qin et.al., Paper: http://arxiv.org/abs/2603.04289
- 2025-12-24, IMPACTX: An X-ray Spectral Model for Polar Dust and Clumpy Torus, Kanta Fujiwara et.al., Paper: http://arxiv.org/abs/2512.20993
- 2026-01-09, IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck, Huilin Deng et.al., Paper: http://arxiv.org/abs/2601.05870
- 2026-02-25, IGR J12580+0134: A Candidate for Repeating Partial Tidal Disruption Events Supported by Multi-Wavelength Observations, Po Ma et.al., Paper: http://arxiv.org/abs/2602.22040
- 2026-01-06, IBISAgent: Reinforcing Pixel-Level Visual Reasoning in MLLMs for Universal Biomedical Object Referring and Segmentation, Yankai Jiang et.al., Paper: http://arxiv.org/abs/2601.03054
- 2026-01-08, Hán Dān Xué Bù (Mimicry) or Qīng Chū Yú Lán (Mastery)? A Cognitive Perspective on Reasoning Distillation in Large Language Models, Yueqing Hu et.al., Paper: http://arxiv.org/abs/2601.05019
- 2025-12-19, HydroGym: A Reinforcement Learning Platform for Fluid Dynamics, Christian Lagemann et.al., Paper: http://arxiv.org/abs/2512.17534
- 2025-12-04, Hybrid-Diffusion Models: Combining Open-loop Routines with Visuomotor Diffusion Policies, Jonne Van Haastregt et.al., Paper: http://arxiv.org/abs/2512.04960
- 2026-02-19, Hybrid Monte Carlo for Fractional Quantum Hall States, Ting-Tung Wang et.al., Paper: http://arxiv.org/abs/2602.17564
- 2026-03-12, Hybrid Human-Agent Social Dilemmas in Energy Markets, Isuri Perera et.al., Paper: http://arxiv.org/abs/2603.11834
- 2026-03-31, Hybrid Framework for Robotic Manipulation: Integrating Reinforcement Learning and Large Language Models, Md Saad et.al., Paper: http://arxiv.org/abs/2603.30022
- 2025-12-26, Hybrid Deep Reinforcement Learning for Joint Resource Allocation in Multi-Active RIS-Aided Uplink Communications, Mohamed Shalma et.al., Paper: http://arxiv.org/abs/2512.22107
- 2025-10-30, Hybrid DQN-TD3 Reinforcement Learning for Autonomous Navigation in Dynamic Environments, Xiaoyi He et.al., Paper: http://arxiv.org/abs/2510.26646
- 2025-10-30, Hybrid Consistency Policy: Decoupling Multi-Modal Diversity and Real-Time Efficiency in Robotic Manipulation, Qianyou Zhao et.al., Paper: http://arxiv.org/abs/2510.26670
- 2025-12-16, Hybrid Cognitive IoT with Cooperative Caching and SWIPT-EH: A Hierarchical Reinforcement Learning Framework, Nadia Abdolkhani et.al., Paper: http://arxiv.org/abs/2512.14488
- 2025-11-25, Hurst exponent and planetary rings, Horacio Salomone et.al., Paper: http://arxiv.org/abs/2511.20583
- 2025-10-20, Humanoid Goalkeeper: Learning from Position Conditioned Task-Motion Constraints, Junli Ren et.al., Paper: http://arxiv.org/abs/2510.18002
- 2026-02-02, HumanX: Toward Agile and Generalizable Humanoid Interaction Skills from Human Videos, Yinhuai Wang et.al., Paper: http://arxiv.org/abs/2602.02473
- 2025-11-21, Human Imitated Bipedal Locomotion with Frequency Based Gait Generator Network, Yusuf Baran Ates et.al., Paper: http://arxiv.org/abs/2511.17387
- 2026-03-12, HumDex:Humanoid Dexterous Manipulation Made Easy, Liang Heng et.al., Paper: http://arxiv.org/abs/2603.12260
- 2026-01-30, How well do generative models solve inverse problems? A benchmark study, Patrick Krüger et.al., Paper: http://arxiv.org/abs/2601.23238
- 2026-04-01, How uncertain are model predictions for the muon content of extensive air showers, Sergey Ostapchenko et.al., Paper: http://arxiv.org/abs/2604.00975
- 2026-03-03, How to Peel with a Knife: Aligning Fine-Grained Manipulation with Human Preference, Toru Lin et.al., Paper: http://arxiv.org/abs/2603.03280
- 2025-12-11, How to Brake? Ethical Emergency Braking with Deep Reinforcement Learning, Jianbo Wang et.al., Paper: http://arxiv.org/abs/2512.10698
- 2026-02-13, How Swarms Differ: Challenges in Collective Behaviour Comparison, André Fialho Jesus et.al., Paper: http://arxiv.org/abs/2602.13016
- 2025-12-15, How Low Can You Go? The Data-Light SE Challenge, Kishan Kumar Ganguly et.al., Paper: http://arxiv.org/abs/2512.13524
- 2025-10-28, How Flat is a Plateau? Evolution of Late-Time TDE Disks, Yael Alush et.al., Paper: http://arxiv.org/abs/2510.24696
- 2026-03-09, How Far Can Unsupervised RLVR Scale LLM Training?, Bingxiang He et.al., Paper: http://arxiv.org/abs/2603.08660
- 2025-12-08, How Do LLMs Fail In Agentic Scenarios? A Qualitative Analysis of Success and Failure Scenarios of Various LLMs in Agentic Simulations, JV Roig et.al., Paper: http://arxiv.org/abs/2512.07497
- 2025-11-14, Honesty over Accuracy: Trustworthy Language Models through Reinforced Hesitation, Mohamad Amin Mohamadi et.al., Paper: http://arxiv.org/abs/2511.11500
- 2025-11-28, Homomorphism Testing with Resilience to Online Manipulations, Esty Kelman et.al., Paper: http://arxiv.org/abs/2511.23363
- 2025-10-31, Holographic equation of state matched with hadron gas equation as a tool for the study of the quark-gluon plasma evolution, A. V. Anufriev et.al., Paper: http://arxiv.org/abs/2510.27541
- 2026-01-12, Hiking in the Wild: A Scalable Perceptive Parkour Framework for Humanoids, Shaoting Zhu et.al., Paper: http://arxiv.org/abs/2601.07718
- 2025-12-03, Highly Efficient Test-Time Scaling for T2I Diffusion Models with Text Embedding Perturbation, Hang Xu et.al., Paper: http://arxiv.org/abs/2512.03996
- 2026-01-05, Higher-Order Action Regularization in Deep Reinforcement Learning: From Continuous Control to Building Energy Management, Faizan Ahmed et.al., Paper: http://arxiv.org/abs/2601.02061
- 2025-10-21, Higher Embedding Dimension Creates a Stronger World Model for a Simple Sorting Task, Brady Bhalla et.al., Paper: http://arxiv.org/abs/2510.18315
- 2026-03-16, High-Throughput Computational Exploration of MOFs for Short-Chain PFAS Removal, Mengru Zhang et.al., Paper: http://arxiv.org/abs/2603.15503
- 2026-01-14, Higgs Decays at NLO in the SMEFT, Luigi Bellafronte et.al., Paper: http://arxiv.org/abs/2601.09599
- 2025-12-03, Hierarchical Vision Language Action Model Using Success and Failure Demonstrations, Jeongeun Park et.al., Paper: http://arxiv.org/abs/2512.03913
- 2026-02-13, Hierarchical Reinforcement Learning for Cooperative Air-Ground Delivery in Urban System, Songxin Lei et.al., Paper: http://arxiv.org/abs/2602.12913
- 2026-03-19, Hierarchical Latent Structure Learning through Online Inference, Ines Aitsahalia et.al., Paper: http://arxiv.org/abs/2603.19139
- 2026-01-07, Hierarchical GNN-Based Multi-Agent Learning for Dynamic Queue-Jump Lane and Emergency Vehicle Corridor Formation, Haoran Su et.al., Paper: http://arxiv.org/abs/2601.04177
- 2025-12-29, Hierarchical Decision Mamba Meets Agentic AI: A Novel Approach for RAN Slicing in 6G, Md Arafat Habib et.al., Paper: http://arxiv.org/abs/2512.23502
- 2026-02-20, Hidden-charm (uds\,c\bar c) pentaquarks as flavor eigenstates in a constituent quark model, M. C. Gordillo et.al., Paper: http://arxiv.org/abs/2602.18075
- 2025-12-09, Heuristics for Combinatorial Optimization via Value-based Reinforcement Learning: A Unified Framework and Analysis, Orit Davidovich et.al., Paper: http://arxiv.org/abs/2512.08601
- 2026-03-25, Heuristic Self-Paced Learning for Domain Adaptive Semantic Segmentation under Adverse Conditions, Shiqin Wang et.al., Paper: http://arxiv.org/abs/2603.24322
- 2025-12-26, Heterogeneous fragmentation of empty sites promotes cooperation in phenotypically diverse populations with tag-mediated interactions, Hui Zhang et.al., Paper: http://arxiv.org/abs/2512.22077
- 2025-11-18, Heterogeneous Multi-Agent Proximal Policy Optimization for Power Distribution System Restoration, Parya Dolatyabi et.al., Paper: http://arxiv.org/abs/2511.14730
- 2026-03-23, Here, there and everywhere: state-dependent time-inconsistent stochastic control, Dylan Possamaï et.al., Paper: http://arxiv.org/abs/2603.22022
- 2026-03-03, Heavy-quark box-loop corrections to $q\bar q \to Zγ$ at two loops in QCD, Dario Kermanschah et.al., Paper: http://arxiv.org/abs/2603.03169
- 2026-01-26, Health-SCORE: Towards Scalable Rubrics for Improving Health-LLMs, Zhichao Yang et.al., Paper: http://arxiv.org/abs/2601.18706
- 2026-02-19, Hartree shift and pairing gap in ultracold Fermi gases in the framework of low-momentum interactions, Michael Urban et.al., Paper: http://arxiv.org/abs/2602.17420
- 2025-11-21, Harnessing Data from Clustered LQR Systems: Personalized and Collaborative Policy Optimization, Vinay Kanakeri et.al., Paper: http://arxiv.org/abs/2511.17489
- 2026-03-31, HapCompass: A Rotational Haptic Device for Contact-Rich Robotic Teleoperation, Xiangshan Tan et.al., Paper: http://arxiv.org/abs/2603.30042
- 2026-03-12, HandelBot: Real-World Piano Playing via Fast Adaptation of Dexterous Robot Policies, Amber Xie et.al., Paper: http://arxiv.org/abs/2603.12243
- 2026-03-18, Hamiltonian Monte Carlo enhanced by Exact Diagonalization, Finn L. Temmen et.al., Paper: http://arxiv.org/abs/2603.17788
- 2025-12-29, HY-Motion 1.0: Scaling Flow Matching Models for Text-To-Motion Generation, Yuxin Wen et.al., Paper: http://arxiv.org/abs/2512.23464
- 2026-03-16, HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions, Yukang Cao et.al., Paper: http://arxiv.org/abs/2603.15612
- 2026-03-19, HISR: Hindsight Information Modulated Segmental Process Rewards For Multi-turn Agentic Reinforcement Learning, Zhicong Lu et.al., Paper: http://arxiv.org/abs/2603.18683
- 2026-03-16, Gym-V: A Unified Vision Environment System for Agentic Vision Research, Fanqing Meng Lingxiao Du Jiawei Gu Jiaqi Liao Linjie Li Zijian Wu Xiangyan Liu Ziqi Zhao Mengkang Hu Yue Zhang Zichen Liu Jiaheng Zhang Michael Qizhe Shieh et.al., Paper: http://arxiv.org/abs/2603.15432
- 2026-03-20, GustPilot: A Hierarchical DRL-INDI Framework for Wind-Resilient Quadrotor Navigation, Amir Atef Habel et.al., Paper: http://arxiv.org/abs/2603.19966
- 2026-01-30, Guided by Trajectories: Repairing and Rewarding Tool-Use Trajectories for Tool-Integrated Reasoning, Siyu Gong et.al., Paper: http://arxiv.org/abs/2601.23032
- 2025-12-03, Guided Flow Policy: Learning from High-Value Actions in Offline Reinforcement Learning, Franki Nguimatsia Tiofack et.al., Paper: http://arxiv.org/abs/2512.03973
- 2025-12-24, Guardrailed Elasticity Pricing: A Churn-Aware Forecasting Playbook for Subscription Strategy, Deepit Sapru et.al., Paper: http://arxiv.org/abs/2512.20932
- 2025-11-24, Growing with the Generator: Self-paced GRPO for Video Generation, Rui Li et.al., Paper: http://arxiv.org/abs/2511.19356
- 2026-01-28, Grover’s Search-Inspired Quantum Reinforcement Learning for Massive MIMO User Scheduling, Ruining Fan et.al., Paper: http://arxiv.org/abs/2601.20688
- 2026-03-17, Groups of invertible ideals of one-dimensional Prüfer domains as groups of integer-valued functions, Dario Spirito et.al., Paper: http://arxiv.org/abs/2603.16322
- 2026-03-17, Grounding the Score: Explicit Visual Premise Verification for Reliable Vision-Language Process Reward Models, Junxin Wang et.al., Paper: http://arxiv.org/abs/2603.16253
- 2026-03-24, Grounding Sim-to-Real Generalization in Dexterous Manipulation: An Empirical Study with Vision-Language-Action Models, Ruixing Jin et.al., Paper: http://arxiv.org/abs/2603.22876
- 2025-11-10, Grounding Computer Use Agents on Human Demonstrations, Aarash Feizi et.al., Paper: http://arxiv.org/abs/2511.07332
- 2025-10-27, Ground-state phase diagram of S = 1/2 Heisenberg model on 2D square-hexagon-octagon lattice, Yumeng Luo et.al., Paper: http://arxiv.org/abs/2510.23376
- 2025-12-01, GrndCtrl: Grounding World Models via Self-Supervised Reward Alignment, Haoyang He et.al., Paper: http://arxiv.org/abs/2512.01952
- 2026-03-31, GreenFLag: A Green Agentic Approach for Energy-Efficient Federated Learning, Theodora Panagea et.al., Paper: http://arxiv.org/abs/2603.29933
- 2025-11-20, Green Resilience of Cyber-Physical Systems: Doctoral Dissertation, Diaeddin Rimawi et.al., Paper: http://arxiv.org/abs/2511.16593
- 2025-10-28, Greedy Sampling Is Provably Efficient for RLHF, Di Wu et.al., Paper: http://arxiv.org/abs/2510.24700
- 2026-01-28, GraphAllocBench: A Flexible Benchmark for Preference-Conditioned Multi-Objective Policy Learning, Zhiheng Jiang et.al., Paper: http://arxiv.org/abs/2601.20753
- 2025-12-23, Graph-Symbolic Policy Enforcement and Control (G-SPEC): A Neuro-Symbolic Framework for Safe Agentic AI in 5G Autonomous Networks, Divya Vijay et.al., Paper: http://arxiv.org/abs/2512.20275
- 2025-12-10, Graph-Based Bayesian Optimization for Quantum Circuit Architecture Search with Uncertainty Calibrated Surrogates, Prashant Kumar Choudhary et.al., Paper: http://arxiv.org/abs/2512.09586
- 2026-01-12, Graph Inference Towards ICD Coding, Xiaoxiao Deng et.al., Paper: http://arxiv.org/abs/2601.07496
- 2025-12-01, Graph Distance as Surprise: Free Energy Minimization in Knowledge Graph Reasoning, Gaganpreet Jhajj et.al., Paper: http://arxiv.org/abs/2512.01878
- 2025-12-17, Graph Contextual Reinforcement Learning for Efficient Directed Controller Synthesis, Toshihide Ubukata et.al., Paper: http://arxiv.org/abs/2512.15295
- 2025-12-09, Gradient-Informed Monte Carlo Fine-Tuning of Diffusion Models for Low-Thrust Trajectory Design, Jannik Graebner et.al., Paper: http://arxiv.org/abs/2512.08705
- 2025-11-06, GraSP-VLA: Graph-based Symbolic Action Representation for Long-Horizon Planning with VLA Policies, Maëlic Neau et.al., Paper: http://arxiv.org/abs/2511.04357
- 2026-03-31, GraSP-STL: A Graph-Based Framework for Zero-Shot Signal Temporal Logic Planning via Offline Goal-Conditioned Reinforcement Learning, Ancheng Hou et.al., Paper: http://arxiv.org/abs/2603.29533
- 2026-03-10, Good Reasoning Makes Good Demonstrations: Implicit Reasoning Quality Supervision via In-Context Reinforcement Learning, Tiehua Mei et.al., Paper: http://arxiv.org/abs/2603.09803
- 2026-02-16, Goldilocks RL: Tuning Task Difficulty to Escape Sparse Rewards for Reasoning, Ilia Mahrooghi et.al., Paper: http://arxiv.org/abs/2602.14868
- 2025-11-05, Going Beyond Expert Performance via Deep Implicit Imitation Reinforcement Learning, Iason Chrysomallis et.al., Paper: http://arxiv.org/abs/2511.03616
- 2025-10-24, Goal-based portfolio selection with fixed transaction costs, Erhan Bayraktar et.al., Paper: http://arxiv.org/abs/2510.21650
- 2026-03-20, Goal-Oriented Framework for Optical Flow-based Multi-User Multi-Task Video Transmission, Yujie Xu et.al., Paper: http://arxiv.org/abs/2603.19995
- 2026-03-16, GlyphPrinter: Region-Grouped Direct Preference Optimization for Glyph-Accurate Visual Text Rendering, Xincheng Shuai et.al., Paper: http://arxiv.org/abs/2603.15616
- 2026-01-16, Globally Optimal Contour Deformations with Neural Networks, Stephen Jones et.al., Paper: http://arxiv.org/abs/2601.11448
- 2025-10-23, GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning, Jinchang Luo et.al., Paper: http://arxiv.org/abs/2510.20548
- 2025-11-13, Global Solutions to Non-Convex Functional Constrained Problems with Hidden Convexity, Ilyas Fatkhullin et.al., Paper: http://arxiv.org/abs/2511.10626
- 2025-12-24, Global End-Effector Pose Control of an Underactuated Aerial Manipulator via Reinforcement Learning, Shlok Deshmukh et.al., Paper: http://arxiv.org/abs/2512.21085
- 2026-01-20, Glance-or-Gaze: Incentivizing LMMs to Adaptively Focus Search via Reinforcement Learning, Hongbo Bai et.al., Paper: http://arxiv.org/abs/2601.13942
- 2025-11-19, Gini Score under Ties and Case Weights, Alexej Brauer et.al., Paper: http://arxiv.org/abs/2511.15446
- 2026-02-12, GigaBrain-0.5M*: a VLA That Learns From World Model-Based Reinforcement Learning, GigaBrain Team et.al., Paper: http://arxiv.org/abs/2602.12099
- 2025-12-04, Geophysical intensity problems: the axisymmetric case, Ralf Kaiser et.al., Paper: http://arxiv.org/abs/2512.05010
- 2026-04-01, Geometry-induced correlated noise in qLDPC syndrome extraction, Angelo Di Bella et.al., Paper: http://arxiv.org/abs/2604.01040
- 2026-03-03, Geometry-Guided Reinforcement Learning for Multi-view Consistent 3D Scene Editing, Jiyuan Wang et.al., Paper: http://arxiv.org/abs/2603.03143
- 2026-02-12, Geometry of Uncertainty: Learning Metric Spaces for Multimodal State Estimation in RL, Alfredo Reichlin et.al., Paper: http://arxiv.org/abs/2602.12087
- 2026-01-29, Geometry of Drifting MDPs with Path-Integral Stability Certificates, Zuyuan Zhang et.al., Paper: http://arxiv.org/abs/2601.21991
- 2026-02-27, Geometric Resilience of Quantum LiDAR in Turbulent Media: A Wasserstein Distance Approach, Arnaud Coatanhay et.al., Paper: http://arxiv.org/abs/2602.24280
- 2026-02-10, Geometric Analysis of Blind User Identification for Massive MIMO Networks, Levi Bohnacker et.al., Paper: http://arxiv.org/abs/2602.09910
- 2026-02-26, GeoWorld: Geometric World Models, Zeyu Zhang et.al., Paper: http://arxiv.org/abs/2602.23058
- 2025-11-19, GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization, Yikun Wang et.al., Paper: http://arxiv.org/abs/2511.15705
- 2026-03-10, GeoSolver: Scaling Test-Time Reasoning in Remote Sensing with Fine-Grained Process Supervision, Lang Sun et.al., Paper: http://arxiv.org/abs/2603.09551
- 2026-01-07, GeoReason: Aligning Thinking And Answering In Remote Sensing Vision-Language Models Via Logical Consistency Reinforcement Learning, Wenshuai Li et.al., Paper: http://arxiv.org/abs/2601.04118
- 2026-01-14, GeoRA: Geometry-Aware Low-Rank Adaptation for RLVR, Jiaying Zhang et.al., Paper: http://arxiv.org/abs/2601.09361
- 2025-11-06, GentleHumanoid: Learning Upper-body Compliance for Contact-rich Human and Object Interaction, Qingzhou Lu et.al., Paper: http://arxiv.org/abs/2511.04679
- 2025-10-30, Generative sampling with physics-informed kernels, Friederike Ihssen et.al., Paper: http://arxiv.org/abs/2510.26678
- 2025-12-22, Generative diffusion models for agricultural AI: plant image generation, indoor-to-outdoor translation, and expert preference alignment, Da Tan et.al., Paper: http://arxiv.org/abs/2512.19632
- 2026-01-16, Generative Scenario Rollouts for End-to-End Autonomous Driving, Rajeev Yasarla et.al., Paper: http://arxiv.org/abs/2601.11475
- 2026-02-08, Generative Reasoning Re-ranker, Mingfu Liang et.al., Paper: http://arxiv.org/abs/2602.07774
- 2025-12-19, Generative Multi-Objective Bayesian Optimization with Scalable Batch Evaluations for Sample-Efficient De Novo Molecular Design, Madhav R. Muthyala et.al., Paper: http://arxiv.org/abs/2512.17659
- 2026-03-18, Generative Control as Optimization: Time Unconditional Flow Matching for Adaptive and Robust Robotic Control, Zunzhe Zhang et.al., Paper: http://arxiv.org/abs/2603.17834
- 2025-12-18, Generative Adversarial Reasoner: Enhancing LLM Reasoning with Adversarial Reinforcement Learning, Qihao Liu et.al., Paper: http://arxiv.org/abs/2512.16917
- 2026-02-06, Generating Data-Driven Reasoning Rubrics for Domain-Adaptive Reward Modeling, Kate Sanders et.al., Paper: http://arxiv.org/abs/2602.06795
- 2026-03-31, Generalizing Output-Feedback Covariance Steering to Incorporate Non-Orthogonal Estimation Errors, Daniel C. Qi et.al., Paper: http://arxiv.org/abs/2603.29753
- 2025-11-13, Generalizing Analogical Inference from Boolean to Continuous Domains, Francisco Cunha et.al., Paper: http://arxiv.org/abs/2511.10416
- 2026-03-20, Generalized Task-Driven Design of Soft Robots via Reduced-Order FEM-based Surrogate Modeling, Yao Yao et.al., Paper: http://arxiv.org/abs/2603.19794
- 2026-03-23, Generalized Sequential Monte Carlo Sampling for Redistricting Simulation, Philip O’Sullivan et.al., Paper: http://arxiv.org/abs/2603.22188
- 2026-03-11, Generalized Reduced-Density-Matrix Quantum Monte Carlo Gives Access to More, Zhiyan Wang et.al., Paper: http://arxiv.org/abs/2603.10948
- 2026-02-26, Generalized Rapid Action Value Estimation in Memory-Constrained Environments, Aloïs Rautureau et.al., Paper: http://arxiv.org/abs/2602.23318
- 2026-01-27, Generalizable Equivariant Diffusion Models for Non-Abelian Lattice Gauge Theory, Gert Aarts et.al., Paper: http://arxiv.org/abs/2601.19552
- 2025-12-24, Generalised Linear Models in Deep Bayesian RL with Learnable Basis Functions, Jingyang You et.al., Paper: http://arxiv.org/abs/2512.20974
- 2026-02-25, Generalisation of RLHF under Reward Shift and Clipped KL Regularisation, Kenton Tang et.al., Paper: http://arxiv.org/abs/2602.21765
- 2025-12-23, Generalisation in Multitask Fitted Q-Iteration and Offline Q-learning, Kausthubh Manda et.al., Paper: http://arxiv.org/abs/2512.20220
- 2026-02-11, General Flexible $f$ -divergence for Challenging Offline RL Datasets with Low Stochasticity and Diverse Behavior Policies, Jianxun Wang et.al., Paper: http://arxiv.org/abs/2602.11087
- 2025-12-10, Gaussian Process Aggregation for Root-Parallel Monte Carlo Tree Search with Continuous Actions, Junlin Xiao et.al., Paper: http://arxiv.org/abs/2512.09727
- 2026-02-16, Gauge-independent gravitational waves from a minimal dark $U(1)$ sector with viable dark matter candidates, Wan-Zhe Feng et.al., Paper: http://arxiv.org/abs/2602.14866
- 2025-10-23, GSWorld: Closed-Loop Photo-Realistic Simulation Suite for Robotic Manipulation, Guangqi Jiang et.al., Paper: http://arxiv.org/abs/2510.20813
- 2026-03-10, GSStream: 3D Gaussian Splatting based Volumetric Scene Streaming System, Zhiye Tang et.al., Paper: http://arxiv.org/abs/2603.09718
- 2026-01-12, GRPO with State Mutations: Improving LLM-Based Hardware Test Plan Generation, Dimple Vijay Kochar et.al., Paper: http://arxiv.org/abs/2601.07593
- 2026-03-14, GRPO and Reflection Reward for Mathematical Reasoning in Large Language Models, Zhijie Wang et.al., Paper: http://arxiv.org/abs/2603.14041
- 2026-02-16, GREAT-EER: Graph Edge Attention Network for Emergency Evacuation Responses, Attila Lischka et.al., Paper: http://arxiv.org/abs/2602.14676
- 2026-01-28, GPO: Growing Policy Optimization for Legged Robot Locomotion and Whole-Body Control, Shuhao Liao et.al., Paper: http://arxiv.org/abs/2601.20668
- 2025-12-16, GLM-TTS Technical Report, Jiayan Cui et.al., Paper: http://arxiv.org/abs/2512.14291
- 2026-02-17, GLM-5: from Vibe Coding to Agentic Engineering, GLM-5 Team et.al., Paper: http://arxiv.org/abs/2602.15763
- 2026-01-09, GIFT: Games as Informal Training for Generalizable LLMs, Nuoyan Lyu et.al., Paper: http://arxiv.org/abs/2601.05633
- 2026-03-24, GEM: Guided Expectation-Maximization for Behavior-Normalized Candidate Action Selection in Offline RL, Haoyu Wang et.al., Paper: http://arxiv.org/abs/2603.23232
- 2026-01-05, GDRO: Group-level Reward Post-training Suitable for Diffusion Models, Yiyang Wang et.al., Paper: http://arxiv.org/abs/2601.02036
- 2026-01-08, GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization, Shih-Yang Liu et.al., Paper: http://arxiv.org/abs/2601.05242
- 2026-02-12, GAN-based data augmentation for rare and exotic hadron searches in Pb–Pb collisions in ALICE, Anisa Khatun et.al., Paper: http://arxiv.org/abs/2602.12088
- 2025-10-20, GACO-CAD: Geometry-Augmented and Conciseness-Optimized CAD Model Generation from Single Image, Yinghui Wang et.al., Paper: http://arxiv.org/abs/2510.17157
- 2025-11-18, Fusing Biomechanical and Spatio-Temporal Features for Fall Prediction: Characterizing and Mitigating the Simulation-to-Reality Gap, Md Fokhrul Islam et.al., Paper: http://arxiv.org/abs/2511.14620
- 2026-03-16, Fusian: Multi-LoRA Fusion for Fine-Grained Continuous MBTI Personality Control in Large Language Models, Zehao Chen et.al., Paper: http://arxiv.org/abs/2603.15405
- 2025-12-05, Functional dual-slope frequency-domain near-infrared spectroscopy data interpreted with two- and three-layer models, Jodee Frias et.al., Paper: http://arxiv.org/abs/2512.05877
- 2025-10-20, Functional Distribution Networks (FDN), Omer Haq et.al., Paper: http://arxiv.org/abs/2510.17794
- 2026-02-25, Function-Space Empirical Bayes Regularisation with Student’s t Priors, Pengcheng Hao et.al., Paper: http://arxiv.org/abs/2602.22015
- 2026-04-01, Full-Gradient Successor Feature Representations, Ritish Shrirao et.al., Paper: http://arxiv.org/abs/2604.00686
- 2025-11-17, Full range of infinite point blow-up exponents for the critical generalized KdV equation, Nailya Manatova et.al., Paper: http://arxiv.org/abs/2511.13538
- 2026-03-17, From the Inside Out: Progressive Distribution Refinement for Confidence Calibration, Xizhong Yang et.al., Paper: http://arxiv.org/abs/2603.16500
- 2026-01-28, From biting to engulfment: curvature-actin coupling controls phagocytosis of soft, deformable targets, Shubhadeep Sadhukhan et.al., Paper: http://arxiv.org/abs/2601.20719
- 2026-03-10, From Weighting to Modeling: A Nonparametric Estimator for Off-Policy Evaluation, Rong J. B. Zhu et.al., Paper: http://arxiv.org/abs/2603.09436
- 2026-01-29, From Tokens to Blocks: A Block-Diffusion Perspective on Molecular Generation, Qianwei Yang et.al., Paper: http://arxiv.org/abs/2601.21964
- 2025-11-04, From Solo to Symphony: Orchestrating Multi-Agent Collaboration with Single-Agent Demos, Xun Wang et.al., Paper: http://arxiv.org/abs/2511.02762
- 2026-03-23, From Singleton Obstacles to Clutter: Translation Invariant Compositional Avoid Sets, Prashant Solanki et.al., Paper: http://arxiv.org/abs/2603.22146
- 2024-10-03, From Reward Shaping to Q-Shaping: Achieving Unbiased Learning with LLM-Guided Knowledge, Xiefeng Wu et.al., Paper: http://arxiv.org/abs/2410.01458
- 2025-10-20, From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models, Zefan Cai et.al., Paper: http://arxiv.org/abs/2510.17247
- 2025-12-22, From Points to Coalitions: Hierarchical Contrastive Shapley Values for Prioritizing Data Samples, Canran Xiao et.al., Paper: http://arxiv.org/abs/2512.19363
- 2026-03-21, From Photons to Electrons: Accelerated Materials Discovery via Random Libraries and Automated Scanning Transmission Electron Microscopy, Boris Slautin et.al., Paper: http://arxiv.org/abs/2603.20858
- 2026-03-16, From Passive Observer to Active Critic: Reinforcement Learning Elicits Process Reasoning for Robotic Manipulation, Yibin Liu et.al., Paper: http://arxiv.org/abs/2603.15600
- 2026-01-09, From Off-Policy to On-Policy: Enhancing GUI Agents via Bi-level Expert-to-Policy Assimilation, Zezhou Wang et.al., Paper: http://arxiv.org/abs/2601.05787
- 2026-03-24, From Morality Installation in LLMs to LLMs in Morality-as-a-System, Gunter Bombaerts et.al., Paper: http://arxiv.org/abs/2603.22944
- 2026-01-08, From Idea to Co-Creation: A Planner-Actor-Critic Framework for Agent Augmented 3D Modeling, Jin Gao et.al., Paper: http://arxiv.org/abs/2601.05016
- 2025-12-04, From Generated Human Videos to Physically Plausible Robot Trajectories, James Ni et.al., Paper: http://arxiv.org/abs/2512.05094
- 2026-03-06, From Entropy to Calibrated Uncertainty: Training Language Models to Reason About Uncertainty, Azza Jenane et.al., Paper: http://arxiv.org/abs/2603.06317
- 2025-11-04, From Densities to Potentials: Benchmarking Local Exchange-Correlation Approximations, Visagan Ravindran et.al., Paper: http://arxiv.org/abs/2511.02744
- 2025-10-21, From Competition to Synergy: Unlocking Reinforcement Learning for Subject-Driven Image Generation, Ziwei Huang et.al., Paper: http://arxiv.org/abs/2510.18263
- 2026-01-26, From Classification to Ranking: Enhancing LLM Reasoning Capabilities for MBTI Personality Detection, Yuan Cao et.al., Paper: http://arxiv.org/abs/2601.18582
- 2026-01-13, From Classical to Quantum Reinforcement Learning and Its Applications in Quantum Control: A Beginner’s Tutorial, Abhijit Sen et.al., Paper: http://arxiv.org/abs/2601.08662
- 2025-11-28, From CAD to POMDP: Probabilistic Planning for Robotic Disassembly of End-of-Life Products, Jan Baumgärtner et.al., Paper: http://arxiv.org/abs/2511.23407
- 2025-12-01, From Atomic to Composite: Reinforcement Learning Enables Generalization in Complementary Reasoning, Sitao Cheng et.al., Paper: http://arxiv.org/abs/2512.01970
- 2026-01-30, From Absolute to Relative: Rethinking Reward Shaping in Group-Based Reinforcement Learning, Wenzhe Niu et.al., Paper: http://arxiv.org/abs/2601.23058
- 2026-03-31, Friends, Foes, and First Authors: A Game Theory Model of How Power Plays Rewrite Academic Co-Authorship Networks, Amit Bengal et.al., Paper: http://arxiv.org/abs/2603.29834
- 2026-02-09, Friedkin-Johnsen Social Influence Dynamics on Networks: A Boundary-Value Formulation and Influenceability Measures, Moses Boudourides et.al., Paper: http://arxiv.org/abs/2602.08704
- 2025-10-20, Foundational Automatic Evaluators: Scaling Multi-Task Generative Evaluator Training for Reasoning-Centric Domains, Austin Xu et.al., Paper: http://arxiv.org/abs/2510.17793
- 2025-11-06, Forgetting is Everywhere, Ben Sanati et.al., Paper: http://arxiv.org/abs/2511.04666
- 2025-12-01, Forecasting in Offline Reinforcement Learning for Non-stationary Environments, Suzan Ece Ada et.al., Paper: http://arxiv.org/abs/2512.01987
- 2026-03-04, Force-Aware Residual DAgger via Trajectory Editing for Precision Insertion with Impedance Control, Yiou Huang et.al., Paper: http://arxiv.org/abs/2603.04038
- 2025-10-21, Food4All: A Multi-Agent Framework for Real-time Free Food Discovery with Integrated Nutritional Metadata, Zhengqing Yuan et.al., Paper: http://arxiv.org/abs/2510.18289
- 2025-11-07, Follow-Me in Micro-Mobility with End-to-End Imitation Learning, Sahar Salimpour et.al., Paper: http://arxiv.org/abs/2511.05158
- 2026-04-01, Focal plane wavefront control with model-based reinforcement learning, Jalo Nousiainen et.al., Paper: http://arxiv.org/abs/2604.00993
- 2026-01-23, Flux-ratio anomalies in cusp quasars reveal dark matter beyond CDM, Siyuan Hou et.al., Paper: http://arxiv.org/abs/2601.16818
- 2025-12-09, Fluent Alignment with Disfluent Judges: Post-training for Lower-resource Languages, David Samuel et.al., Paper: http://arxiv.org/abs/2512.08777
- 2026-03-29, FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies, Chenxiao Gao et.al., Paper: http://arxiv.org/abs/2603.27450
- 2026-03-31, FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration, Qiyao Wang et.al., Paper: http://arxiv.org/abs/2603.29557
- 2026-04-01, Flow-based Policy With Distributional Reinforcement Learning in Trajectory Optimization, Ruijie Hao et.al., Paper: http://arxiv.org/abs/2604.00977
- 2026-02-02, Flow Policy Gradients for Robot Control, Brent Yi et.al., Paper: http://arxiv.org/abs/2602.02481
- 2026-02-20, Flow Matching with Injected Noise for Offline-to-Online Reinforcement Learning, Yongjae Shin et.al., Paper: http://arxiv.org/abs/2602.18117
- 2026-03-18, Flow Matching Policy with Entropy Regularization, Ting Gao et.al., Paper: http://arxiv.org/abs/2603.17685
- 2025-12-10, FlipLLM: Efficient Bit-Flip Attacks on Multimodal LLMs using Reinforcement Learning, Khurram Khalil et.al., Paper: http://arxiv.org/abs/2512.09872
- 2025-10-30, Flinch: A Differentiable Framework for Field-Level Inference of Cosmological parameters from curved sky data, Andrea Crespi et.al., Paper: http://arxiv.org/abs/2510.26691
- 2025-12-10, Flexible Reconfigurable Intelligent Surface-Aided Covert Communications in UAV Networks, Chong Huang et.al., Paper: http://arxiv.org/abs/2512.09714
- 2026-03-12, FlexRec: Adapting LLM-based Recommenders for Flexible Needs via Reinforcement Learning, Yijun Pan et.al., Paper: http://arxiv.org/abs/2603.11901
- 2026-01-15, Flat-band Ferromagnetism of SU $(N)$ Hubbard Model on the Kagome Lattices, Hao Jin et.al., Paper: http://arxiv.org/abs/2601.10549
- 2025-11-25, Flash-DMD: Towards High-Fidelity Few-Step Image Generation with Efficient Distillation and Joint Reinforcement Learning, Guanjie Chen et.al., Paper: http://arxiv.org/abs/2511.20549
- 2025-11-06, Fitting Reinforcement Learning Model to Behavioral Data under Bandits, Hao Zhu et.al., Paper: http://arxiv.org/abs/2511.04454
- 2026-01-01, Fisher-Information-Driven Adaptive Acquisition for Photon-Efficient FLIM: A Dual-Implementation Framework for TCSPC and Programmable Time-Gating, J. Sumaya-Martinez et.al., Paper: http://arxiv.org/abs/2601.00490
- 2026-01-09, First-principles study of hydrogen diffusion in polycrystalline Nickel, Bhanuj Jain et.al., Paper: http://arxiv.org/abs/2601.05917
- 2025-12-22, First-Order Representation Languages for Goal-Conditioned RL, Simon Ståhlberg et.al., Paper: http://arxiv.org/abs/2512.19355
- 2026-03-05, Finite-size scaling in quasi-3D stick percolation, Ryan K. Daniels et.al., Paper: http://arxiv.org/abs/2603.05374
- 2025-10-20, Finite-Time Bounds for Average-Reward Fitted Q-Iteration, Jongmin Lee et.al., Paper: http://arxiv.org/abs/2510.17391
- 2026-02-09, Finite-State Controllers for (Hidden-Model) POMDPs using Deep Reinforcement Learning, David Hudák et.al., Paper: http://arxiv.org/abs/2602.08734
- 2026-01-23, Finite Population Inference for Factorial Designs and Panel Experiments with Imperfect Compliance, Pedro Picchetti et.al., Paper: http://arxiv.org/abs/2601.16749
- 2026-03-13, Finite Difference Flow Optimization for RL Post-Training of Text-to-Image Models, David McAllister et.al., Paper: http://arxiv.org/abs/2603.12893
- 2025-10-21, Fingerprints of cluster-based Haldane and bound-magnon states in a spin-1 Heisenberg diamond chain, Azam Zoshki et.al., Paper: http://arxiv.org/abs/2510.18447
- 2026-03-24, Fine-tuning of universal machine-learning interatomic potentials for 2D high-entropy alloys, Chun Zhou et.al., Paper: http://arxiv.org/abs/2603.23029
- 2026-03-14, Fine-tuning is Not Enough: A Parallel Framework for Collaborative Imitation and Reinforcement Learning in End-to-end Autonomous Driving, Zhexi Lian et.al., Paper: http://arxiv.org/abs/2603.13842
- 2026-03-20, Fine-tuning Timeseries Predictors Using Reinforcement Learning, Hugo Cazaux et.al., Paper: http://arxiv.org/abs/2603.20063
- 2025-10-20, Fine-tuning Flow Matching Generative Models with Intermediate Feedback, Jiajun Fan et.al., Paper: http://arxiv.org/abs/2510.18072
- 2026-02-11, Fine-Tuning GPT-5 for GPU Kernel Generation, Ali Tehrani et.al., Paper: http://arxiv.org/abs/2602.11000
- 2026-04-01, Finding Low Star Discrepancy 3D Kronecker Point Sets Using Algorithm Configuration Techniques, Imène Ait Abderrahim et.al., Paper: http://arxiv.org/abs/2604.00786
- 2025-11-10, FinRpt: Dataset, Evaluation System and LLM-based Multi-agent Framework for Equity Research Report Generation, Song Jin et.al., Paper: http://arxiv.org/abs/2511.07322
- 2025-10-28, Fill in the Blanks: Accelerating Q-Learning with a Handful of Demonstrations in Sparse Reward Settings, Seyed Mahdi Basiri Azad et.al., Paper: http://arxiv.org/abs/2510.24432
- 2026-03-18, Federated Distributional Reinforcement Learning with Distributional Critic Regularization, David Millard et.al., Paper: http://arxiv.org/abs/2603.17820
- 2026-02-10, Features as Rewards: Scalable Supervision for Open-Ended Tasks via Interpretability, Aaditya Vikram Prasad et.al., Paper: http://arxiv.org/abs/2602.10067
- 2025-11-25, Feature-Modulated UFNO for Improved Prediction of Multiphase Flow in Porous Media, Alhasan Abdellatif et.al., Paper: http://arxiv.org/abs/2511.20543
- 2025-12-19, Feasibility to probe the dynamical scotogenic model at the LHC, Gustavo Ardila-Tafurth et.al., Paper: http://arxiv.org/abs/2512.17903
- 2026-03-24, Fault-Tolerant Design and Multi-Objective Model Checking for Real-Time Deep Reinforcement Learning Systems, Guoxin Su et.al., Paper: http://arxiv.org/abs/2603.23113
- 2026-01-23, Faster parallel MCMC: Metropolis adjustment is best served warm, Jakob Robnik et.al., Paper: http://arxiv.org/abs/2601.16696
- 2026-03-18, Fast stabilizer state preparation via AI-optimized graph decimation, Michael Doherty et.al., Paper: http://arxiv.org/abs/2603.17743
- 2025-11-07, Fast and Scalable Evaluation of Unbiased Atomic Forces in ab initio Variational Monte Carlo via the Lagrangian Technique, Kousuke Nakano et.al., Paper: http://arxiv.org/abs/2511.05222
- 2025-12-12, Fast and Explicit: Slice-to-Volume Reconstruction via 3D Gaussian Primitives with Analytic Point Spread Function Modeling, Maik Dannecker et.al., Paper: http://arxiv.org/abs/2512.11624
- 2025-12-15, Fast Policy Learning for 6-DOF Position Control of Underwater Vehicles, Sümer Tunçay et.al., Paper: http://arxiv.org/abs/2512.13359
- 2025-10-28, Fast Bayesian Multilevel Quasi-Monte Carlo, Aleksei G. Sorokin et.al., Paper: http://arxiv.org/abs/2510.24604
- 2025-10-28, Fare: Failure Resilience in Learned Visual Navigation Control, Zishuo Wang et.al., Paper: http://arxiv.org/abs/2510.24680
- 2026-02-10, Fake-HR1: Rethinking reasoning of vision language model for synthetic image detection, Changjiang Jiang et.al., Paper: http://arxiv.org/abs/2602.10042
- 2026-02-17, Fairness over Equality: Correcting Social Incentives in Asymmetric Sequential Social Dilemmas, Alper Demir et.al., Paper: http://arxiv.org/abs/2602.15407
- 2026-02-08, Fairness Aware Reward Optimization, Ching Lam Choi et.al., Paper: http://arxiv.org/abs/2602.07799
- 2025-12-16, Fair sampling of ground-state configurations using hybrid quantum-classical MCMC algorithms, Yuichiro Nakano et.al., Paper: http://arxiv.org/abs/2512.14552
- 2026-01-02, Fair Policy Learning under Bipartite Network Interference: Learning Fair and Cost-Effective Environmental Policies, Raphael C. Kim et.al., Paper: http://arxiv.org/abs/2601.00531
- 2026-01-12, Failure-Aware RL: Reliable Offline-to-Online Reinforcement Learning with Self-Recovery for Real-World Manipulation, Huanyu Li et.al., Paper: http://arxiv.org/abs/2601.07821
- 2025-11-18, Failure to Mix: Large language models struggle to answer according to desired probability distributions, Ivy Yuqian Yang et.al., Paper: http://arxiv.org/abs/2511.14630
- 2026-01-16, Factored Value Functions for Graph-Based Multi-Agent Reinforcement Learning, Ahmed Rashwan et.al., Paper: http://arxiv.org/abs/2601.11401
- 2025-11-07, FM4Com: Foundation Model for Scene-Adaptive Communication Strategy Optimization, Zhaoyang Li et.al., Paper: http://arxiv.org/abs/2511.05094
- 2025-12-17, FM-EAC: Feature Model-based Enhanced Actor-Critic for Multi-Task Control in Dynamic Environments, Quanxi Zhou et.al., Paper: http://arxiv.org/abs/2512.15430
- 2026-03-20, FIPO: Eliciting Deep Reasoning with Future-KL Influenced Policy Optimization, Chiyu Ma et.al., Paper: http://arxiv.org/abs/2603.19835
- 2026-02-12, FAIL: Flow Matching Adversarial Imitation Learning for Image Generation, Yeyao Ma et.al., Paper: http://arxiv.org/abs/2602.12155
- 2026-02-06, F-GRPO: Don’t Let Your Policy Learn the Obvious and Forget the Rare, Daniil Plyusov et.al., Paper: http://arxiv.org/abs/2602.06717
- 2026-03-07, Extending gPET for Multi-Layer PET Simulation, Satzhan Sitmukhambetov et.al., Paper: http://arxiv.org/abs/2603.07327
- 2026-01-05, Extended real number arithmetics via Dedekind cuts, Andreas H Hamel et.al., Paper: http://arxiv.org/abs/2601.02229
- 2026-01-02, Exponentially Accelerated Sampling of Pauli Strings for Nonstabilizerness, Zhenyu Xiao et.al., Paper: http://arxiv.org/abs/2601.00761
- 2025-11-07, Exponential Spatiotemporal GARCH Model with Asymmetric Volatility Spillovers, Ariane Nidelle Meli Chrisko et.al., Paper: http://arxiv.org/abs/2511.05126
- 2025-11-17, Exploring the production of Terbium-161 in the Brazilian Multipurpose Reactor, Fernando Cozim Melges et.al., Paper: http://arxiv.org/abs/2511.13634
- 2025-11-21, Exploring fixed points and eigenstates of quantum systems with reinforcement learning, María Laura Olivera-Atencio et.al., Paper: http://arxiv.org/abs/2511.17491
- 2026-01-29, Exploring Reasoning Reward Model for Agents, Kaixuan Fan et.al., Paper: http://arxiv.org/abs/2601.22154
- 2026-01-28, Exploring Re-inforcement Learning via Human Feedback under User Heterogeneity, Sarvesh Shashidhar et.al., Paper: http://arxiv.org/abs/2601.20760
- 2026-02-26, Exploratory Memory-Augmented LLM Agent via Hybrid On- and Off-Policy Optimization, Zeyuan Liu et.al., Paper: http://arxiv.org/abs/2602.23008
- 2025-12-18, Exploration v.s. Exploitation: Rethinking RLVR through Clipping, Entropy, and Spurious Reward, Peter Chen et.al., Paper: http://arxiv.org/abs/2512.16912
- 2026-03-12, Exploiting Expertise of Non-Expert and Diverse Agents in Social Bandit Learning: A Free Energy Approach, Erfan Mirzaei et.al., Paper: http://arxiv.org/abs/2603.11757
- 2025-12-17, Explicit Solution to a government debt reduction problem: a stochastic control approach, Claudia Ceci et.al., Paper: http://arxiv.org/abs/2512.15296
- 2025-12-31, Explaining Why Things Go Where They Go: Interpretable Constructs of Human Organizational Preferences, Emmanuel Fashae et.al., Paper: http://arxiv.org/abs/2512.24829
- 2026-03-20, Experience is the Best Teacher: Motivating Effective Exploration in Reinforcement Learning for LLMs, Wenjian Zhang et.al., Paper: http://arxiv.org/abs/2603.20046
- 2026-02-02, Expanding the Capabilities of Reinforcement Learning via Text Feedback, Yuda Song et.al., Paper: http://arxiv.org/abs/2602.02482
- 2026-03-02, Expanding LLM Agent Boundaries with Strategy-Guided Exploration, Andrew Szot et.al., Paper: http://arxiv.org/abs/2603.02045
- 2026-02-25, ExpLang: Improved Exploration and Exploitation in LLM Reasoning with On-Policy Thinking Language Selection, Changjiang Gao et.al., Paper: http://arxiv.org/abs/2602.21887
- 2026-03-17, Execution-Grounded Credit Assignment for GRPO in Code Generation, Abhijit Kumar et.al., Paper: http://arxiv.org/abs/2603.16158
- 2025-12-02, Exceptional Point Dynamics in Photonic Time Crystals for Enhanced Optical Sensing, Saurabh Mani Tripathi et.al., Paper: http://arxiv.org/abs/2512.02945
- 2026-03-12, Examining Reasoning LLMs-as-Judges in Non-Verifiable LLM Post-Training, Yixin Liu et.al., Paper: http://arxiv.org/abs/2603.12246
- 2025-12-26, Exact inference via quasi-conjugacy in two-parameter Poisson-Dirichlet hidden Markov models, Marco Dalla Pria et.al., Paper: http://arxiv.org/abs/2512.22098
- 2026-03-24, Exact analytical PGSE signal for diffusion confined to a cylindrical surface using a spectral Laplacian formalism, Erick J Canales-Rodríguez et.al., Paper: http://arxiv.org/abs/2603.23421
- 2025-12-31, Evolving, Not Training: Zero-Shot Reasoning Segmentation via Evolutionary Prompting, Kai Ye et.al., Paper: http://arxiv.org/abs/2512.24702
- 2025-10-28, Evolving Diagnostic Agents in a Virtual Clinical Environment, Pengcheng Qiu et.al., Paper: http://arxiv.org/abs/2510.24654
- 2026-02-04, Evolving Afferent Architectures: Biologically-inspired Models for Damage-Avoidance Learning, Wolfgang Maass et.al., Paper: http://arxiv.org/abs/2602.04807
- 2026-02-16, Evolutionary System Prompt Learning can Facilitate Reinforcement Learning for LLMs, Lunjun Zhang et.al., Paper: http://arxiv.org/abs/2602.14697
- 2025-10-20, EvoSyn: Generalizable Evolutionary Data Synthesis for Verifiable Learning, He Du et.al., Paper: http://arxiv.org/abs/2510.17928
- 2026-01-09, EvoQRE: Modeling Bounded Rationality in Safety-Critical Traffic Simulation via Evolutionary Quantal Response Equilibrium, Phu-Hoa Pham et.al., Paper: http://arxiv.org/abs/2601.05653
- 2025-11-20, EvoLMM: Self-Evolving Large Multimodal Models with Continuous Rewards, Omkat Thawakar et.al., Paper: http://arxiv.org/abs/2511.16672
- 2026-03-18, EvoGuard: An Extensible Agentic RL-based Framework for Practical and Evolving AI-Generated Image Detection, Chenyang Zhu et.al., Paper: http://arxiv.org/abs/2603.17343
- 2026-03-10, EvoDriveVLA: Evolving Autonomous Driving Vision-Language-Action Model via Collaborative Perception-Planning Distillation, Jiajun Cao et.al., Paper: http://arxiv.org/abs/2603.09465
- 2025-11-26, EvilGenie: A Reward Hacking Benchmark, Jonathan Gabor et.al., Paper: http://arxiv.org/abs/2511.21654
- 2025-10-21, Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model, Ling Team et.al., Paper: http://arxiv.org/abs/2510.18855
- 2026-03-10, Event-by-Event Multiplicity Fluctuations in Heavy-Ion Collisions Using Modified HIJING Monte Carlo Generator, Y. A. Rusak et.al., Paper: http://arxiv.org/abs/2603.09732
- 2026-03-05, Evaluation of Feynman integrals via numerical integration of differential equations, Pau Petit Rosàs et.al., Paper: http://arxiv.org/abs/2603.05336
- 2026-03-09, Evaluation of EMF Exposure to Throughput Ratio for Sustainable 5G Networks, Dinh Long Trinh et.al., Paper: http://arxiv.org/abs/2603.08549
- 2025-10-30, Evaluating the Role of Verifiers in Test-Time Scaling for Legal Reasoning Tasks, Davide Romano et.al., Paper: http://arxiv.org/abs/2510.25623
- 2026-02-06, Evaluating and Enhancing the Vulnerability Reasoning Capabilities of Large Language Models, Li Lu et.al., Paper: http://arxiv.org/abs/2602.06687
- 2026-01-05, Evaluating Feature Dependent Noise in Preference-based Reinforcement Learning, Yuxuan Li et.al., Paper: http://arxiv.org/abs/2601.01904
- 2025-11-18, Estimation of Spatial and Temporal Autoregressive Effects using LASSO - An Example of Hourly Particulate Matter Concentrations, Elkanah Nyabuto et.al., Paper: http://arxiv.org/abs/2511.14666
- 2025-11-14, Estimating the Effects of Heatwaves on Health: A Causal Inference Framework, Giulio Grossi et.al., Paper: http://arxiv.org/abs/2511.11433
- 2026-01-09, Estimating optimal interpretable individualized treatment regimes from a classification perspective using adaptive LASSO, Yunshu Zhang et.al., Paper: http://arxiv.org/abs/2601.05875
- 2025-12-19, Estimating Spatially Resolved Radiation Fields Using Neural Networks, Felix Lehner et.al., Paper: http://arxiv.org/abs/2512.17654
- 2025-12-16, Estimating Program Participation with Partial Validation, Augustine Denteh et.al., Paper: http://arxiv.org/abs/2512.14616
- 2025-10-20, Estimating Orbital Parameters of Direct Imaging Exoplanet Using Neural Network, Bo Liang et.al., Paper: http://arxiv.org/abs/2510.17459
- 2025-11-26, Estimates for convolution operators on Hardy spaces associated with ball quasi-Banach function spaces, Pablo Rocha et.al., Paper: http://arxiv.org/abs/2511.21642
- 2025-11-26, Escaping the Verifier: Learning to Reason via Demonstrations, Locke Cai et.al., Paper: http://arxiv.org/abs/2511.21667
- 2026-03-21, EruDiff: Refactoring Knowledge in Diffusion Models for Advanced Text-to-Image Synthesis, Xiefan Guo et.al., Paper: http://arxiv.org/abs/2603.20828
- 2026-03-11, Ergodicity in reinforcement learning, Dominik Baumann et.al., Paper: http://arxiv.org/abs/2603.10895
- 2025-12-24, Equivariant Multiscale Learned Invertible Reconstruction for Cone Beam CT: From Simulated to Real Data, Nikita Moriakov et.al., Paper: http://arxiv.org/abs/2512.21180
- 2026-02-24, Equivariant Floer cohomology for contactomorphisms of quotient spaces, Dylan Cant et.al., Paper: http://arxiv.org/abs/2602.21152
- 2026-03-25, Equivariant Filter Transformations for Consistent and Efficient Visual–Inertial Navigation, Chungeng Tian et.al., Paper: http://arxiv.org/abs/2603.24130
- 2025-12-24, Equilibrium investment under dynamic preference uncertainty, Luca De Gennaro Aquino et.al., Paper: http://arxiv.org/abs/2512.21149
- 2026-03-09, EquiBim: Learning Symmetry-Equivariant Policy for Bimanual Manipulation, Zhiyuan Zhang et.al., Paper: http://arxiv.org/abs/2603.08541
- 2026-01-06, Epicyclic motion and accretion disk around a charged black hole in Einstein-ModMax theory with a quintessence field, Hamza Rehman et.al., Paper: http://arxiv.org/abs/2601.03088
- 2025-12-17, Environmental Policy and Firm Performance in Europe: A Difference-in-Differences Approach with Spillovers, Andrea Ciaccio et.al., Paper: http://arxiv.org/abs/2512.15377
- 2025-11-06, Environment Agnostic Goal-Conditioning, A Study of Reward-Free Autonomous Learning, Hampus Åström et.al., Paper: http://arxiv.org/abs/2511.04598
- 2026-01-09, EnvScaler: Scaling Tool-Interactive Environments for LLM Agent via Programmatic Synthesis, Xiaoshuai Song et.al., Paper: http://arxiv.org/abs/2601.05808
- 2026-02-20, Entropy-regularized penalization schemes for American options and reflected BSDEs with singular generators, Daniel Chee et.al., Paper: http://arxiv.org/abs/2602.18078
- 2026-01-05, Entropy-Adaptive Fine-Tuning: Resolving Confident Conflicts to Mitigate Forgetting, Muxi Diao et.al., Paper: http://arxiv.org/abs/2601.02151
- 2025-12-05, Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning, Zhenpeng Su et.al., Paper: http://arxiv.org/abs/2512.05591
- 2026-04-02, Entropic crystallization of geometrically frustrated magnets on 1/1 approximant Tsai-type quasicrystal, Oscar Novat et.al., Paper: http://arxiv.org/abs/2604.02180
- 2025-12-02, Entanglement evolution from entangled multipodal states, Konstantinos Chalas et.al., Paper: http://arxiv.org/abs/2512.03032
- 2026-03-05, Ensembling Language Models with Sequential Monte Carlo, Robin Shing Moon Chan et.al., Paper: http://arxiv.org/abs/2603.05432
- 2026-02-23, Enormous Fluid Antenna Systems (E-FAS)–Part II: Channel Estimation, Farshad Rostami Ghadi et.al., Paper: http://arxiv.org/abs/2602.20127
- 2025-11-13, Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling, Jiahao Wang et.al., Paper: http://arxiv.org/abs/2511.10648
- 2026-01-13, Enhancing classical simulation with noisy quantum devices, Ruiqi Zhang et.al., Paper: http://arxiv.org/abs/2601.08772
- 2025-10-24, Enhancing Tactile-based Reinforcement Learning for Robotic Control, Elle Miller et.al., Paper: http://arxiv.org/abs/2510.21609
- 2026-02-27, Enhancing Spatial Understanding in Image Generation via Reward Modeling, Zhenyu Tang et.al., Paper: http://arxiv.org/abs/2602.24233
- 2026-01-14, Enhancing Spatial Reasoning in Large Language Models for Metal-Organic Frameworks Structure Prediction, Mianzhi Pan et.al., Paper: http://arxiv.org/abs/2601.09285
- 2025-12-11, Enhancing Radiology Report Generation and Visual Grounding using Reinforcement Learning, Benjamin Gundersen et.al., Paper: http://arxiv.org/abs/2512.10691
- 2025-12-08, Enhancing Agentic RL with Progressive Reward Shaping and Value-based Sampling Policy Optimization, Zhuoran Zhuang et.al., Paper: http://arxiv.org/abs/2512.07478
- 2026-03-13, Enhanced Drug-drug Interaction Prediction Using Adaptive Knowledge Integration, Pengfei Liu et.al., Paper: http://arxiv.org/abs/2603.12885
- 2026-03-21, Enhanced Direction-Sensing Methods and Performance Analysis in Low-Altitude Wireless Network via a Rotation Antenna Array, Jinbing Jiang et.al., Paper: http://arxiv.org/abs/2603.20784
- 2026-03-17, Enforcing Task-Specified Compliance Bounds for Humanoids via Anisotropic Lipschitz-Constrained Policies, Zewen He et.al., Paper: http://arxiv.org/abs/2603.16180
- 2026-02-23, Energy gap of quantum spin glasses: a projection quantum Monte Carlo study, L. Brodoloni et.al., Paper: http://arxiv.org/abs/2602.20108
- 2026-02-26, Energy Deposition by Galactic Cosmic Rays and Implications for Ozone Chemistry, Luiz Augusto Stuani Pereia et.al., Paper: http://arxiv.org/abs/2602.22987
- 2026-01-28, End-to-end example-based sim-to-real RL policy transfer based on neural stylisation with application to robotic cutting, Jamie Hathaway et.al., Paper: http://arxiv.org/abs/2601.20846
- 2025-11-06, End-to-End Reinforcement Learning of Koopman Models for eNMPC of an Air Separation Unit, Daniel Mayfrank et.al., Paper: http://arxiv.org/abs/2511.04522
- 2026-03-24, End-to-End Efficient RL for Linear Bellman Complete MDPs with Deterministic Transitions, Zakaria Mhammedi et.al., Paper: http://arxiv.org/abs/2603.23461
- 2026-03-26, Enabling ab initio geometry optimization of strongly correlated systems with transferable deep quantum Monte Carlo, P. Bernát Szabó et.al., Paper: http://arxiv.org/abs/2603.25381
- 2025-11-10, Enabling Off-Policy Imitation Learning with Deep Actor Critic Stabilization, Sayambhu Sen et.al., Paper: http://arxiv.org/abs/2511.07288
- 2026-01-05, Enabling Deep Reinforcement Learning Research for Energy Saving in Open RAN, Matteo Bordin et.al., Paper: http://arxiv.org/abs/2601.02240
- 2025-10-30, Emu3.5: Native Multimodal Models are World Learners, Yufeng Cui et.al., Paper: http://arxiv.org/abs/2510.26583
- 2026-03-18, Empirical Likelihood Inference for Sen and Sen–Shorrocks–Thon Indices, Sreelakshmi N et.al., Paper: http://arxiv.org/abs/2603.17327
- 2026-02-27, Empirical Challenges with Peers-of-Peers Instruments in the Linear-In-Means Model, Nathan Canen et.al., Paper: http://arxiv.org/abs/2602.24215
- 2026-02-24, Empathy Modeling in Active Inference Agents for Perspective-Taking and Alignment, Albarracin Mahault et.al., Paper: http://arxiv.org/abs/2602.20936
- 2026-03-17, EmoLLM: Appraisal-Grounded Cognitive-Emotional Co-Reasoning in Large Language Models, Yifei Zhang et.al., Paper: http://arxiv.org/abs/2603.16553
- 2026-03-10, Emerging Extrinsic Dexterity in Cluttered Scenes via Dynamics-aware Policy Learning, Yixin Zheng et.al., Paper: http://arxiv.org/abs/2603.09882
- 2025-12-23, Emergent temporal abstractions in autoregressive models enable hierarchical reinforcement learning, Seijin Kobayashi et.al., Paper: http://arxiv.org/abs/2512.20605
- 2026-03-23, Emergent relativistic symmetry from interacting fermions on the honeycomb bilayer, Zi Hong Liu et.al., Paper: http://arxiv.org/abs/2603.22259
- 2026-03-04, Emergent dimensional reduction in a distorted kagome magnet $\mathrm{YCa_3(CrO)_3(BO_3)_4}$ driven by exchange hierarchy, Umashankar Jena et.al., Paper: http://arxiv.org/abs/2603.04253
- 2025-11-28, Emergent Coordination and Phase Structure in Independent Multi-Agent Reinforcement Learning, Azusa Yamaguchi et.al., Paper: http://arxiv.org/abs/2511.23315
- 2026-03-28, Emergent Competition Between Dynamical Channels in Nonequilibrium Systems, R. A. Dumer et.al., Paper: http://arxiv.org/abs/2603.27256
- 2025-10-30, Emergence of charge- $4e$ superconductivity from 2D nematic superconductors, Xuan Zou et.al., Paper: http://arxiv.org/abs/2510.26720
- 2025-11-07, Emergence from Emergence: Financial Market Simulation via Learning with Heterogeneous Preferences, Ryuko Hashimoto et.al., Paper: http://arxiv.org/abs/2511.05207
- 2025-10-23, EmbodiedBrain: Expanding Performance Boundaries of Task Planning for Embodied Intelligence, Ding Zou et.al., Paper: http://arxiv.org/abs/2510.20578
- 2026-03-09, Embedding Classical Balance Control Principles in Reinforcement Learning for Humanoid Recovery, Nehar Poddar et.al., Paper: http://arxiv.org/abs/2603.08619
- 2026-04-01, Embarrassingly Simple Self-Distillation Improves Code Generation, Ruixiang Zhang et.al., Paper: http://arxiv.org/abs/2604.01193
- 2025-12-29, Eliminating Inductive Bias in Reward Models with Information-Theoretic Guidance, Zhuo Li et.al., Paper: http://arxiv.org/abs/2512.23461
- 2026-01-29, Elign: Equivariant Diffusion Model Alignment from Foundational Machine Learning Force Fields, Yunyang Li et.al., Paper: http://arxiv.org/abs/2601.21985
- 2025-10-24, Electroweak corrections to $gg\rightarrow γγ$, Gabriele Fiore et.al., Paper: http://arxiv.org/abs/2510.21643
- 2025-11-17, Electron Correlation by Exchange Mapping in Electronic Structure Calculations, Jerry L. Whitten et.al., Paper: http://arxiv.org/abs/2511.13570
- 2026-01-02, Elaboration on the kinetic approach of Derbenev and Kondratenko to spin-polarized beams in electron storage rings, Klaus Heinemann et.al., Paper: http://arxiv.org/abs/2601.00586
- 2025-12-02, Eisenstein cohomology and congruences for the ratios of Rankin–Selberg $L$ -functions, P. Narayanan et.al., Paper: http://arxiv.org/abs/2512.02927
- 2026-03-06, EgoReasoner: Learning Egocentric 4D Reasoning via Task-Adaptive Structured Thinking, Fangrui Zhu et.al., Paper: http://arxiv.org/abs/2603.06561
- 2026-02-20, EgoPush: Learning End-to-End Egocentric Multi-Object Rearrangement for Mobile Robots, Boyuan An et.al., Paper: http://arxiv.org/abs/2602.18071
- 2026-03-12, EgoIntent: An Egocentric Step-level Benchmark for Understanding What, Why, and Next, Ye Pan et.al., Paper: http://arxiv.org/abs/2603.12147
- 2026-01-22, Efficiently Learning Robust Torque-based Locomotion Through Reinforcement with Model-Based Supervision, Yashuai Yan et.al., Paper: http://arxiv.org/abs/2601.16109
- 2026-03-06, Efficient, Property-Aligned Fan-Out Retrieval via RL-Compiled Diffusion, Pengcheng Jiang et.al., Paper: http://arxiv.org/abs/2603.06397
- 2025-12-05, Efficient sequential Bayesian inference for state-space epidemic models using ensemble data assimilation, Dhorasso Temfack et.al., Paper: http://arxiv.org/abs/2512.05650
- 2026-01-21, Efficient prior sensitivity analysis for Bayesian model comparison, Zixiao Hu et.al., Paper: http://arxiv.org/abs/2601.15132
- 2025-11-26, Efficient bayesian spatially varying coefficients modeling for censored data using the vecchia approximation, Yacine Mohamed Idir et.al., Paper: http://arxiv.org/abs/2511.21553
- 2026-02-09, Efficient and Stable Reinforcement Learning for Diffusion Language Models, Jiawei Liu et.al., Paper: http://arxiv.org/abs/2602.08905
- 2026-02-03, Efficient Variance-reduced Estimation from Generative EHR Models: The SCOPE and REACH Estimators, Luke Solo et.al., Paper: http://arxiv.org/abs/2602.03730
- 2026-01-29, Efficient Stochastic Optimisation via Sequential Monte Carlo, James Cuin et.al., Paper: http://arxiv.org/abs/2601.22003
- 2026-03-18, Efficient Soft Actor-Critic with LLM-Based Action-Level Guidance for Continuous Control, Hao Ma et.al., Paper: http://arxiv.org/abs/2603.17468
- 2025-11-17, Efficient Simulation of Hawkes Processes using their Affine Volterra Structure, Eduardo Abi Jaber et.al., Paper: http://arxiv.org/abs/2511.13554
- 2026-03-17, Efficient Reasoning on the Edge, Yelysei Bondarenko et.al., Paper: http://arxiv.org/abs/2603.16867
- 2026-03-13, Efficient Real-World Autonomous Racing via Attenuated Residual Policy Optimization, Raphael Trumpp et.al., Paper: http://arxiv.org/abs/2603.12960
- 2025-12-18, Efficient Monte-Carlo sampling of metastable systems using non-local collective variable updates, Christoph Schönle et.al., Paper: http://arxiv.org/abs/2512.16812
- 2025-10-21, Efficient Model-Based Reinforcement Learning for Robot Control via Online Learning, Fang Nan et.al., Paper: http://arxiv.org/abs/2510.18518
- 2026-02-17, Efficient Knowledge Transfer for Jump-Starting Control Policy Learning of Multirotors through Physics-Aware Neural Architectures, Welf Rehberg et.al., Paper: http://arxiv.org/abs/2602.15533
- 2026-03-18, Efficient Exploration at Scale, Seyed Mohammad Asghari et.al., Paper: http://arxiv.org/abs/2603.17378
- 2025-11-28, Efficient Estimation of Sum-Parameters for Multi-Component Complex Exponential Signals with Theoretical Cramer-Rao Bound Analysis, Huiguang Zhang et.al., Paper: http://arxiv.org/abs/2511.23318
- 2026-02-03, Efficient Estimation of Kernel Surrogate Models for Task Attribution, Zhenshuo Zhang et.al., Paper: http://arxiv.org/abs/2602.03783
- 2025-12-04, Efficient Decoders for Sensing Subspace Code, Siva Aditya Gooty et.al., Paper: http://arxiv.org/abs/2512.05028
- 2026-04-02, Efficient Auxiliary-Field Quantum Monte Carlo using Isometric Tensor Hypercontraction, Maxine Luo et.al., Paper: http://arxiv.org/abs/2604.02054
- 2025-10-20, Efficient Algorithms for Mitigating Uncertainty and Risk in Reinforcement Learning, Xihong Su et.al., Paper: http://arxiv.org/abs/2510.17690
- 2025-10-21, EffiReasonTrans: RL-Optimized Reasoning for Code Translation, Yanlin Wang et.al., Paper: http://arxiv.org/abs/2510.18863
- 2026-03-25, Effects of the initial-state geometry on D-meson production in pp and pPb collisions, R. Terra et.al., Paper: http://arxiv.org/abs/2603.24344
- 2025-12-05, EditThinker: Unlocking Iterative Reasoning for Any Image Editor, Hongyu Li et.al., Paper: http://arxiv.org/abs/2512.05965
- 2025-12-22, EchoTrail-GUI: Building Actionable Memory for GUI Agents via Critic-Guided Self-Exploration, Runze Li et.al., Paper: http://arxiv.org/abs/2512.19396
- 2025-12-08, Each Prompt Matters: Scaling Reinforcement Learning Without Wasting Rollouts on Hundred-Billion-Scale MoE, Anxiang Zeng et.al., Paper: http://arxiv.org/abs/2512.07710
- 2026-03-24, EVA: Efficient Reinforcement Learning for End-to-End Video Agent, Yaolun Zhang et.al., Paper: http://arxiv.org/abs/2603.22918
- 2026-03-18, EVA: Aligning Video World Models with Executable Robot Actions via Inverse Dynamics Rewards, Ruixiang Wang et.al., Paper: http://arxiv.org/abs/2603.17808
- 2025-12-17, EUBRL: Epistemic Uncertainty Directed Bayesian Reinforcement Learning, Jianfei Ma et.al., Paper: http://arxiv.org/abs/2512.15405
- 2026-02-10, ESTAR: Early-Stopping Token-Aware Reasoning For Efficient Inference, Junda Wang et.al., Paper: http://arxiv.org/abs/2602.10004
- 2026-02-04, ERNIE 5.0 Technical Report, Haifeng Wang et.al., Paper: http://arxiv.org/abs/2602.04705
- 2025-11-07, EMPEROR I. Exoplanet MCMC parallel tempering for RV orbit retrieval, Pablo A. Peña R. et.al., Paper: http://arxiv.org/abs/2511.05331
- 2025-10-29, EHR-R1: A Reasoning-Enhanced Foundational Language Model for Electronic Health Record Analysis, Yusheng Liao et.al., Paper: http://arxiv.org/abs/2510.25628
- 2026-03-17, ECHO: Edge-Cloud Humanoid Orchestration for Language-to-Motion Control, Haozhe Jia et.al., Paper: http://arxiv.org/abs/2603.16188
- 2026-01-08, EARL: Energy-Aware Optimization of Liquid State Machines for Pervasive AI, Zain Iqbal et.al., Paper: http://arxiv.org/abs/2601.05205
- 2026-03-11, Dynamics-Predictive Sampling for Active RL Finetuning of Large Reasoning Models, Yixiu Mao et.al., Paper: http://arxiv.org/abs/2603.10887
- 2025-12-11, Dynamics of multidimensional Simple Clock Auctions, Jad Zeroual et.al., Paper: http://arxiv.org/abs/2512.10614
- 2025-11-26, Dynamics of generalized abcd Boussinesq solitary waves under a slowly variable bottom, André de Laire et.al., Paper: http://arxiv.org/abs/2511.21632
- 2026-01-15, Dynamics of Late time cosmology in $f(Q,L_{m})$ Gravity with Constraints from DESI DR2 BAO Data, Rajdeep Mazumdar et.al., Paper: http://arxiv.org/abs/2601.10627
- 2026-04-02, Dynamic resource coordination can increase grid hosting capacity to support more renewables, storage, and electrified load growth, Vineet Jagadeesan Nair et.al., Paper: http://arxiv.org/abs/2604.02170
- 2026-03-04, Dynamic properties in a collisional model for confined granular fluids. A review, Ricardo Brito et.al., Paper: http://arxiv.org/abs/2603.04388
- 2025-12-10, Dynamic one-time delivery of critical data by small and sparse UAV swarms: a model problem for MARL scaling studies, Mika Persson et.al., Paper: http://arxiv.org/abs/2512.09682
- 2026-01-22, Dynamic Tactile Sensing System and Soft Actor Critic Reinforcement Learning for Inclusion Characterization, John Bannan et.al., Paper: http://arxiv.org/abs/2601.16061
- 2025-10-29, Dynamic Beamforming and Power Allocation in ISAC via Deep Reinforcement Learning, Duc Nguyen Dao et.al., Paper: http://arxiv.org/abs/2510.25496
- 2026-01-29, DynaWeb: Model-Based Reinforcement Learning of Web Agents, Hang Ding et.al., Paper: http://arxiv.org/abs/2601.22149
- 2025-12-24, Dyna-Style Reinforcement Learning Modeling and Control of Non-linear Dynamics, Karim Abdelsalam et.al., Paper: http://arxiv.org/abs/2512.21081
- 2026-03-04, Dual-Modality Multi-Stage Adversarial Safety Training: Robustifying Multimodal Web Agents Against Cross-Modal Attacks, Haoyu Liu et.al., Paper: http://arxiv.org/abs/2603.04364
- 2025-10-28, Dual-Mind World Models: A General Framework for Learning in Dynamic Wireless Networks, Lingyi Wang et.al., Paper: http://arxiv.org/abs/2510.24546
- 2026-03-06, Dual-Agent Multiple-Model Reinforcement Learning for Event-Triggered Human-Robot Co-Adaptation in Decoupled Task Spaces, Yaqi Li et.al., Paper: http://arxiv.org/abs/2603.06163
- 2026-03-17, Dual Consensus: Escaping from Spurious Majority in Unsupervised RLVR via Two-Stage Vote Mechanism, Kaixuan Du et.al., Paper: http://arxiv.org/abs/2603.16223
- 2026-03-18, Dropout Robustness and Cognitive Profiling of Transformer Models via Stochastic Inference, Antônio Junior Alves Caiado et.al., Paper: http://arxiv.org/abs/2603.17811
- 2025-11-14, Drone Swarm Energy Management, Michael Z. Zgurovsky et.al., Paper: http://arxiv.org/abs/2511.11557
- 2026-03-29, Driving Condition-Aware Multi-Agent Integrated Power and Thermal Management for Hybrid Electric Vehicles, Hanghang Cui et.al., Paper: http://arxiv.org/abs/2603.27471
- 2026-03-25, DreamerAD: Efficient Reinforcement Learning via Latent World Model for Autonomous Driving, Pengxuan Yang et.al., Paper: http://arxiv.org/abs/2603.24587
- 2026-03-12, DreamVideo-Omni: Omni-Motion Controlled Multi-Subject Video Customization with Latent Identity Reinforcement Learning, Yujie Wei et.al., Paper: http://arxiv.org/abs/2603.12257
- 2026-03-17, DreamPlan: Efficient Reinforcement Fine-Tuning of Vision-Language Planners via Video World Models, Emily Yue-Ting Jia et.al., Paper: http://arxiv.org/abs/2603.16860
- 2025-12-31, Dream2Flow: Bridging Video Generation and Open-World Manipulation with 3D Object Flow, Karthik Dharmarajan et.al., Paper: http://arxiv.org/abs/2512.24766
- 2026-01-14, Draw it like Euclid: Teaching transformer models to generate CAD profiles using ruler and compass construction steps, Siyi Li et.al., Paper: http://arxiv.org/abs/2601.09428
- 2026-02-09, Dr. MAS: Stable Reinforcement Learning for Multi-Agent LLM Systems, Lang Feng et.al., Paper: http://arxiv.org/abs/2602.08847
- 2026-02-05, Dr. Kernel: Reinforcement Learning Done Right for Triton Kernel Generations, Wei Liu et.al., Paper: http://arxiv.org/abs/2602.05885
- 2025-10-23, Downsizing Diffusion Models for Cardinality Estimation, Xinhe Mu et.al., Paper: http://arxiv.org/abs/2510.20681
- 2025-12-17, Double Horizon Model-Based Policy Optimization, Akihiro Kubo et.al., Paper: http://arxiv.org/abs/2512.15439
- 2026-03-11, Don’t Disregard the Data for Lack of a Likelihood: Bayesian Synthetic Likelihood for Enhanced Multilevel Network Meta-Regression, Harlan Campbell et.al., Paper: http://arxiv.org/abs/2603.11019
- 2022-11-29, Domain Generalization for Robust Model-Based Offline Reinforcement Learning, Alan Clark et.al., Paper: http://arxiv.org/abs/2211.14827
- 2026-03-11, Does LLM Alignment Really Need Diversity? An Empirical Study of Adapting RLVR Methods for Moral Reasoning, Zhaowei Zhang et.al., Paper: http://arxiv.org/abs/2603.10588
- 2026-01-16, Do explanations generalize across large reasoning models?, Koyena Pal et.al., Paper: http://arxiv.org/abs/2601.11517
- 2026-01-16, Do We Always Need Query-Level Workflows? Rethinking Agentic Workflow Generation for Multi-Agent Systems, Zixu Wang et.al., Paper: http://arxiv.org/abs/2601.11147
- 2026-02-11, Divide, Harmonize, Then Conquer It: Shooting Multi-Commodity Flow Problems with Multimodal Language Models, Xinyu Yuan et.al., Paper: http://arxiv.org/abs/2602.11057
- 2025-11-25, Diverse Video Generation with Determinantal Point Process-Guided Policy Optimization, Tahira Kazimi et.al., Paper: http://arxiv.org/abs/2511.20647
- 2025-12-19, Distributionally Robust Imitation Learning: Layered Control Architecture for Certifiable Autonomy, Aditya Gahlawat et.al., Paper: http://arxiv.org/abs/2512.17899
- 2026-01-23, Distributional Instruments: Identification and Estimation with Quantile Least Squares, Rowan Cherodian et.al., Paper: http://arxiv.org/abs/2601.16865
- 2025-11-17, Distribution Matching Distillation Meets Reinforcement Learning, Dengyang Jiang et.al., Paper: http://arxiv.org/abs/2511.13649
- 2026-03-19, Distributed lag non-linear models with spatial effect modification using Laplacian P-splines, Sara Rutten et.al., Paper: http://arxiv.org/abs/2603.18990
- 2026-01-05, Distorted Distributional Policy Evaluation for Offline Reinforcement Learning, Ryo Iwaki et.al., Paper: http://arxiv.org/abs/2601.01917
- 2026-02-25, Distill and Align Decomposition for Enhanced Claim Verification, Jabez Magomere et.al., Paper: http://arxiv.org/abs/2602.21857
- 2025-12-11, Disperon QED, Yizhou Fang et.al., Paper: http://arxiv.org/abs/2512.10709
- 2026-04-01, Disentangling to Re-couple: Resolving the Similarity-Controllability Paradox in Subject-Driven Text-to-Image Generation, Shuang Li et.al., Paper: http://arxiv.org/abs/2604.00849
- 2026-01-15, Discrete Feynman-Kac Correctors, Mohsin Hasan et.al., Paper: http://arxiv.org/abs/2601.10403
- 2026-01-01, Discovering pulsars in compact binaries with a hidden Markov model, Joseph O’Leary et.al., Paper: http://arxiv.org/abs/2601.00500
- 2025-12-18, Discovering and Learning Probabilistic Models of Black-Box AI Capabilities, Daniel Bramblett et.al., Paper: http://arxiv.org/abs/2512.16733
- 2025-12-03, Discontinuous Strongly Quasiconvex Functions, Nguyen Thi Van Hang et.al., Paper: http://arxiv.org/abs/2512.03934
- 2025-12-15, Disability insurance with collective health claims: A mean-field approach, Christian Furrer et.al., Paper: http://arxiv.org/abs/2512.13562
- 2025-11-04, Directional-Clamp PPO, Gilad Karpel et.al., Paper: http://arxiv.org/abs/2511.02577
- 2025-12-09, Direct transfer of optimized controllers to similar systems using dimensionless MPC, Josip Kir Hromatko et.al., Paper: http://arxiv.org/abs/2512.08667
- 2026-02-08, Direct Soft-Policy Sampling via Langevin Dynamics, Donghyeon Ki et.al., Paper: http://arxiv.org/abs/2602.07873
- 2025-12-25, Direct Deep Neural-network Extraction of Generalized Parton Distributions, Dima Watkins et.al., Paper: http://arxiv.org/abs/2512.21761
- 2026-03-12, Direct Boltzmann inversion method from particle configurations at arbitrary state points, Olivier Coquand et.al., Paper: http://arxiv.org/abs/2603.12081
- 2025-12-10, Dimensional crossover and finite-range effects in a quasi-two-dimensional gas of fermionic dimers, Giovanni Midei et.al., Paper: http://arxiv.org/abs/2512.09753
- 2026-02-03, Digital-Twin Empowered Deep Reinforcement Learning For Site-Specific Radio Resource Management in NextG Wireless Aerial Corridor, Pulok Tarafder et.al., Paper: http://arxiv.org/abs/2602.03801
- 2025-12-11, Digital Twin Supervised Reinforcement Learning Framework for Autonomous Underwater Navigation, Zamirddine Mari et.al., Paper: http://arxiv.org/abs/2512.10925
- 2025-12-08, DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving, Jialv Zou et.al., Paper: http://arxiv.org/abs/2512.07745
- 2026-01-20, Diffusion-Guided Backdoor Attacks in Real-World Reinforcement Learning, Tairan Huang et.al., Paper: http://arxiv.org/abs/2601.14104
- 2026-01-07, Diffusion-DRF: Differentiable Reward Flow for Video Diffusion Fine-Tuning, Yifan Wang et.al., Paper: http://arxiv.org/abs/2601.04153
- 2026-03-25, Diffusion coefficients of multi-principal element alloys from first principles, Damien K. J. Lee et.al., Paper: http://arxiv.org/abs/2603.24228
- 2026-03-14, Diffusion Reinforcement Learning via Centered Reward Distillation, Yuanzhi Zhu et.al., Paper: http://arxiv.org/abs/2603.14128
- 2026-02-12, Diffusion Alignment Beyond KL: Variance Minimisation as Effective Policy Optimiser, Zijing Ou et.al., Paper: http://arxiv.org/abs/2602.12229
- 2026-02-20, Diffusing to Coordinate: Efficient Online Multi-Agent Diffusion Policies, Zhuoran Li et.al., Paper: http://arxiv.org/abs/2602.18291
- 2025-10-31, DiffstarPop: A generative physical model of galaxy star formation history, Alex Alarcon et.al., Paper: http://arxiv.org/abs/2510.27604
- 2026-01-20, Differentiated Pickup Point Offering for Emission Reduction in Last-Mile Delivery, Albina Galiullina et.al., Paper: http://arxiv.org/abs/2601.14196
- 2025-11-20, Differential decay rate of $B^+ \to J/ψK^+$ with the LHCb Upgrade I experiment, LHCb collaboration et.al., Paper: http://arxiv.org/abs/2511.16564
- 2025-12-15, Differentiable Evolutionary Reinforcement Learning, Sitao Cheng et.al., Paper: http://arxiv.org/abs/2512.13399
- 2025-12-18, Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification, Qihao Liu et.al., Paper: http://arxiv.org/abs/2512.16921
- 2026-03-29, Difference Feedback: Generating Multimodal Process-Level Supervision for VLM Reinforcement Learning, Feiding et.al., Paper: http://arxiv.org/abs/2603.27482
- 2026-03-09, Diff-Muscle: Efficient Learning for Musculoskeletal Robotic Table Tennis, Wentao Zhao et.al., Paper: http://arxiv.org/abs/2603.08617
- 2026-03-18, Dielectric response and structural properties of finite-temperature electron liquids, Chengliang Lin et.al., Paper: http://arxiv.org/abs/2603.17699
- 2026-01-21, Dielectric formalism of the 2D uniform electron gas at finite temperatures, Fotios Kalkavouras et.al., Paper: http://arxiv.org/abs/2601.14989
- 2026-02-02, Didactic to Constructive: Turning Expert Solutions into Learnable Reasoning, Ethan Mendes et.al., Paper: http://arxiv.org/abs/2602.02405
- 2026-02-05, Diamond Maps: Efficient Reward Alignment via Stochastic Flow Maps, Peter Holderrieth et.al., Paper: http://arxiv.org/abs/2602.05993
- 2025-10-31, Dialogue as Discovery: Navigating Human Intent Through Principled Inquiry, Jianwen Sun et.al., Paper: http://arxiv.org/abs/2510.27410
- 2026-01-14, Dialogue Telemetry: Turn-Level Instrumentation for Autonomous Information Gathering, Dimitris Panagopoulos et.al., Paper: http://arxiv.org/abs/2601.09570
- 2025-12-03, Diagonalizing the Softmax: Hadamard Initialization for Tractable Cross-Entropy Dynamics, Connall Garrod et.al., Paper: http://arxiv.org/abs/2512.04006
- 2025-11-24, Diagnosis of mixed-state topological phases in strongly correlated systems via disorder parameters, Shao-Hang Shi et.al., Paper: http://arxiv.org/abs/2511.19311
- 2026-03-28, Diagnosing Non-Markovian Observations in Reinforcement Learning via Prediction-Based Violation Scoring, Naveen Mysore et.al., Paper: http://arxiv.org/abs/2603.27389
- 2026-03-05, DiSCTT: Consensus-Guided Self-Curriculum for Efficient Test-Time Adaptation in Reasoning, Mohammad Mahdi Moradi et.al., Paper: http://arxiv.org/abs/2603.05357
- 2025-11-20, Dexterity from Smart Lenses: Multi-Fingered Robot Manipulation with In-the-Wild Human Demonstrations, Irmak Guzey et.al., Paper: http://arxiv.org/abs/2511.16661
- 2026-02-25, DexRepNet++: Learning Dexterous Robotic Manipulation with Geometric and Spatial Hand-Object Representations, Qingtao Liu et.al., Paper: http://arxiv.org/abs/2602.21811
- 2026-03-23, DexDrummer: In-Hand, Contact-Rich, and Long-Horizon Dexterous Robot Drumming, Hung-Chieh Fang et.al., Paper: http://arxiv.org/abs/2603.22263
- 2026-03-06, Devil is in Narrow Policy: Unleashing Exploration in Driving VLA Models, Canyu Chen et.al., Paper: http://arxiv.org/abs/2603.06049
- 2026-02-23, Development of a Cherenkov-Based Time-of-Flight Detector Using Silicon Photomultipliers, Liliana Congedo et.al., Paper: http://arxiv.org/abs/2602.20139
- 2025-12-09, Developing Distance-Aware Uncertainty Quantification Methods in Physics-Guided Neural Networks for Reliable Bearing Health Prediction, Waleed Razzaq et.al., Paper: http://arxiv.org/abs/2512.08499
- 2026-03-13, Determination of Nuclear PDFs using Markov Chain Monte Carlo Methods, N. Derakhshanian et.al., Paper: http://arxiv.org/abs/2603.13150
- 2025-10-23, Detection of ultra-high-energy cosmic rays in the southern hemisphere with FAST: data acquisition and preliminary results, Jakub Kmec et.al., Paper: http://arxiv.org/abs/2510.20522
- 2026-03-05, Detection of C3 in Titan with VLT-ESPRESSO, Rafael Rianço-Silva et.al., Paper: http://arxiv.org/abs/2603.05365
- 2026-03-20, Detecting the 3D Ising model phase transition with a ground-state-trained autoencoder, Ahmed Abuali et.al., Paper: http://arxiv.org/abs/2603.20157
- 2025-12-12, Detecting changes in the mean of spatial random fields on a regular grid, Sheila T. Görz et.al., Paper: http://arxiv.org/abs/2512.11599
- 2026-02-23, Descent-Guided Policy Gradient for Scalable Cooperative Multi-Agent Learning, Shan Yang et.al., Paper: http://arxiv.org/abs/2602.20078
- 2026-01-26, Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory, Yanming Liu et.al., Paper: http://arxiv.org/abs/2601.18771
- 2025-12-12, DentalGPT: Incentivizing Multimodal Complex Reasoning in Dentistry, Zhenyang Cai et.al., Paper: http://arxiv.org/abs/2512.11558
- 2026-02-24, Density Functional Theory Predictions of Derivative Thermodynamic Properties of a Confined Fluid, Gennady Y. Gor et.al., Paper: http://arxiv.org/abs/2602.21111
- 2026-03-23, Demystifying Reinforcement Learning for Long-Horizon Tool-Using Agents: A Comprehensive Recipe, Xixi Wu et.al., Paper: http://arxiv.org/abs/2603.21972
- 2025-10-22, Demonstrating Real Advantage of Machine-Learning-Enhanced Monte Carlo for Combinatorial Optimization, Luca Maria Del Bono et.al., Paper: http://arxiv.org/abs/2510.19544
- 2026-01-06, Dementia-R1: Reinforced Pretraining and Reasoning from Unstructured Clinical Notes for Real-World Dementia Prognosis, Choonghan Kim et.al., Paper: http://arxiv.org/abs/2601.03018
- 2025-12-19, Delayed Acceptance Slice Sampling, Kevin Bitterlich et.al., Paper: http://arxiv.org/abs/2512.17868
- 2025-12-08, Delay-Aware Diffusion Policy: Bridging the Observation-Execution Gap in Dynamic Tasks, Aileen Liao et.al., Paper: http://arxiv.org/abs/2512.07697
- 2026-01-29, Defining Operational Conditions for Safety-Critical AI-Based Systems from Data, Johann Christensen et.al., Paper: http://arxiv.org/abs/2601.22118
- 2025-10-30, Defeating the Training-Inference Mismatch via FP16, Penghui Qi et.al., Paper: http://arxiv.org/abs/2510.26788
- 2025-11-06, DeepPAAC: A New Deep Galerkin Method for Principal-Agent Problems, Michael Ludkovski et.al., Paper: http://arxiv.org/abs/2511.04309
- 2026-02-12, DeepGen 1.0: A Lightweight Unified Multimodal Model for Advancing Image Generation and Editing, Dianyi Wang et.al., Paper: http://arxiv.org/abs/2602.12205
- 2025-11-07, DeepEyesV2: Toward Agentic Multimodal Model, Jack Hong et.al., Paper: http://arxiv.org/abs/2511.05271
- 2025-10-31, DeepCompress: A Dual Reward Strategy for Dynamically Exploring and Compressing Reasoning Chains, Tian Liang et.al., Paper: http://arxiv.org/abs/2510.27419
- 2025-11-18, DeepBlip: Estimating Conditional Average Treatment Effects Over Time, Haorui Ma et.al., Paper: http://arxiv.org/abs/2511.14545
- 2025-10-24, DeepAgent: A General Reasoning Agent with Scalable Toolsets, Xiaoxi Li et.al., Paper: http://arxiv.org/abs/2510.21618
- 2026-02-24, Deep unfolding of MCMC kernels: scalable, modular & explainable GANs for high-dimensional posterior sampling, Jonathan Spence et.al., Paper: http://arxiv.org/abs/2602.20758
- 2026-03-17, Deep Reinforcement Learning-driven Edge Offloading for Latency-constrained XR pipelines, Sourya Saha et.al., Paper: http://arxiv.org/abs/2603.16823
- 2025-10-29, Deep Reinforcement Learning-Based Cooperative Rate Splitting for Satellite-to-Underground Communication Networks, Kaiqiang Lin et.al., Paper: http://arxiv.org/abs/2510.25562
- 2026-03-17, Deep Reinforcement Learning-Assisted Automated Operator Portfolio for Constrained Multi-objective Optimization, Shuai Shao et.al., Paper: http://arxiv.org/abs/2603.16401
- 2026-04-01, Deep Reinforcement Learning for Robotic Manipulation under Distribution Shift with Bounded Extremum Seeking, Shaifalee Saxena et.al., Paper: http://arxiv.org/abs/2604.01142
- 2026-03-16, Deep Reinforcement Learning for Fano Hypersurfaces, Marc Truter et.al., Paper: http://arxiv.org/abs/2603.15437
- 2025-12-17, Deep Reinforcement Learning for EH-Enabled Cognitive-IoT Under Jamming Attacks, Nadia Abdolkhani et.al., Paper: http://arxiv.org/abs/2512.15558
- 2026-03-03, Deep Q-Learning-Based Gain Scheduling for Nonlinear Quadcopter Dynamics, Hossein Rastgoftar et.al., Paper: http://arxiv.org/abs/2603.03127
- 2025-10-21, Deep Q-Learning Assisted Bandwidth Reservation for Multi-Operator Time-Sensitive Vehicular Networking, Abdullah Al-Khatib et.al., Paper: http://arxiv.org/abs/2510.18553
- 2025-10-20, Deep Neural Network extraction of Unpolarized Transverse Momentum Distributions, I. P. Fernando et.al., Paper: http://arxiv.org/abs/2510.17243
- 2025-12-17, Deep Learning-Driven Quantitative Spectroscopic Photoacoustic Imaging for Segmentation and Oxygen Saturation Estimation, Ruibo Shang et.al., Paper: http://arxiv.org/abs/2512.15394
- 2026-03-12, Deep Learning-Based Metamodeling of Nonlinear Stochastic Dynamic Systems under Parametric and Predictive Uncertainty, Haimiti Atila et.al., Paper: http://arxiv.org/abs/2603.12012
- 2026-01-28, Deep Learning based Three-stage Solution for ISAC Beamforming Optimization, Qian Gao et.al., Paper: http://arxiv.org/abs/2601.20667
- 2025-11-20, Deep Learning Framework for Enhanced Neutrino Reconstruction of Single-line Events in the ANTARES Telescope, A. Albert et.al., Paper: http://arxiv.org/abs/2511.16614
- 2026-01-16, Deep GraphRAG: A Balanced Approach to Hierarchical Retrieval and Adaptive Integration, Yuejie Li et.al., Paper: http://arxiv.org/abs/2601.11144
- 2026-03-21, Deep Adaptive Rate Allocation in Volatile Heterogeneous Wireless Networks, Gregorio Maglione et.al., Paper: http://arxiv.org/abs/2603.20926
- 2026-01-22, Decoupling Return-to-Go for Efficient Decision Transformer, Yongyi Wang et.al., Paper: http://arxiv.org/abs/2601.15953
- 2026-03-23, Decoupling Exploration and Policy Optimization: Uncertainty Guided Tree Search for Hard Exploration, Zakaria Mhammedi et.al., Paper: http://arxiv.org/abs/2603.22273
- 2026-02-13, Deconfinement from Thermal Tensor Networks: Universal CFT signature in (2+1)-dimensional $\mathbb{Z}_N$ lattice gauge theory, Adwait Naravane et.al., Paper: http://arxiv.org/abs/2602.13124
- 2026-03-24, DecompGrind: A Decomposition Framework for Robotic Grinding via Cutting-Surface Planning and Contact-Force Adaptation, Shunsuke Araki et.al., Paper: http://arxiv.org/abs/2603.22859
- 2025-10-20, Decentralized Real-Time Planning for Multi-UAV Cooperative Manipulation via Imitation Learning, Shantnav Agarwal et.al., Paper: http://arxiv.org/abs/2510.17143
- 2026-03-25, Decentralized End-to-End Multi-AAV Pursuit Using Predictive Spatio-Temporal Observation via Deep Reinforcement Learning, Yude Li et.al., Paper: http://arxiv.org/abs/2603.24238
- 2025-12-23, Decay of $f(R)$ quintessence into dark matter: mitigating the Hubble tension?, Giovanni Montani et.al., Paper: http://arxiv.org/abs/2512.20193
- 2025-10-21, DeLoad: Demand-Driven Short-Video Preloading with Scalable Watch-Time Estimation, Tong Liu et.al., Paper: http://arxiv.org/abs/2510.18459
- 2026-01-21, DeGAS: Gradient-Based Optimization of Probabilistic Programs without Sampling, Francesca Randone et.al., Paper: http://arxiv.org/abs/2601.15167
- 2026-01-15, DeFlow: Decoupling Manifold Modeling and Value Maximization for Offline Policy Extraction, Zhancun Mu et.al., Paper: http://arxiv.org/abs/2601.10471
- 2026-02-02, David vs. Goliath: Verifiable Agent-to-Agent Jailbreaking via Reinforcement Learning, Samuel Nellessen et.al., Paper: http://arxiv.org/abs/2602.02395
- 2026-02-11, DataChef: Cooking Up Optimal Data Recipes for LLM Adaptation via Reinforcement Learning, Yicheng Chen et.al., Paper: http://arxiv.org/abs/2602.11089
- 2026-01-12, Data-driven control of hydraulic impact hammers under strict operational and control constraints, Francisco Leiva et.al., Paper: http://arxiv.org/abs/2601.07813
- 2026-02-11, Data-Efficient Hierarchical Goal-Conditioned Reinforcement Learning via Normalizing Flows, Shaswat Garg et.al., Paper: http://arxiv.org/abs/2602.11142
- 2026-03-04, Data-Aware Random Feature Kernel for Transformers, Amirhossein Farzam et.al., Paper: http://arxiv.org/abs/2603.04127
- 2026-01-14, Data Scaling for Navigation in Unknown Environments, Lauri Suomela et.al., Paper: http://arxiv.org/abs/2601.09444
- 2026-03-29, DSevolve: Enabling Real-Time Adaptive Scheduling on Dynamic Shop Floor with LLM-Evolved Heuristic Portfolios, Jin Huang et.al., Paper: http://arxiv.org/abs/2603.27628
- 2026-02-23, DSDR: Dual-Scale Diversity Regularization for Exploration in LLM Reasoning, Zhongwei Wan et.al., Paper: http://arxiv.org/abs/2602.19895
- 2026-03-22, DRL-driven Online Optimization for Joint Traffic Reshaping and Channel Reconfiguration in RIS-assisted Semantic NOMA Communications, Songhan Zhao et.al., Paper: http://arxiv.org/abs/2603.21093
- 2026-03-28, DRASTIC: A Dynamic Resource Allocation Framework over 6G Network Slicing in Task-aware Closed-Loop Tactile Internet Applications, Narges Golmohammadi et.al., Paper: http://arxiv.org/abs/2603.27364
- 2025-11-25, DRAFT-RL: Multi-Agent Chain-of-Draft Reasoning for Reinforcement Learning-Enhanced LLMs, Yuanhao Li et.al., Paper: http://arxiv.org/abs/2511.20468
- 2025-11-24, DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research, Rulin Shao et.al., Paper: http://arxiv.org/abs/2511.19399
- 2025-11-05, DQN Performance with Epsilon Greedy Policies and Prioritized Experience Replay, Daniel Perkins et.al., Paper: http://arxiv.org/abs/2511.03670
- 2026-01-14, DPWriter: Reinforcement Learning with Diverse Planning Branching for Creative Writing, Qian Cao et.al., Paper: http://arxiv.org/abs/2601.09609
- 2026-02-13, DPUConfig: Optimizing ML Inference in FPGAs Using Reinforcement Learning, Alexandros Patras et.al., Paper: http://arxiv.org/abs/2602.12847
- 2026-02-05, DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training, Dingwei Zhu et.al., Paper: http://arxiv.org/abs/2602.05890
- 2025-10-24, DEEDEE: Fast and Scalable Out-of-Distribution Dynamics Detection, Tala Aljaafari et.al., Paper: http://arxiv.org/abs/2510.21638
- 2026-02-27, DARE-bench: Evaluating Modeling and Instruction Fidelity of LLMs in Data Science, Fan Shu et.al., Paper: http://arxiv.org/abs/2602.24288
- 2025-10-23, DAIL: Beyond Task Ambiguity for Language-Conditioned Reinforcement Learning, Runpeng Xie et.al., Paper: http://arxiv.org/abs/2510.19562
- 2025-10-20, D2C-HRHR: Discrete Actions with Double Distributional Critics for High-Risk-High-Return Tasks, Jundong Zhang et.al., Paper: http://arxiv.org/abs/2510.17212
- 2026-03-28, D-SPEAR: Dual-Stream Prioritized Experience Adaptive Replay for Stable Reinforcement Learninging Robotic Manipulation, Yu Zhang et.al., Paper: http://arxiv.org/abs/2603.27346
- 2026-01-22, Cyclic sunspot activity during the first millennium CE as reconstructed from radiocarbon, Ilya Usoskin et.al., Paper: http://arxiv.org/abs/2601.16203
- 2026-03-21, Cyber Deception for Mission Surveillance via Hypergame-Theoretic Deep Reinforcement Learning, Zelin Wan et.al., Paper: http://arxiv.org/abs/2603.20981
- 2026-02-13, Curriculum-DPO++: Direct Preference Optimization via Data and Model Curricula for Text-to-Image Generation, Florinel-Alin Croitoru et.al., Paper: http://arxiv.org/abs/2602.13055
- 2026-02-27, Curriculum-Based Soft Actor-Critic for Multi-Section R2R Tension Control, Shihao Li et.al., Paper: http://arxiv.org/abs/2602.24259
- 2025-12-11, Curriculum-Based Reinforcement Learning for Autonomous UAV Navigation in Unknown Curved Tubular Conduit, Zamirddine Mari et.al., Paper: http://arxiv.org/abs/2512.10934
- 2025-11-04, Curriculum Design for Trajectory-Constrained Agent: Compressing Chain-of-Thought Tokens in LLMs, Georgios Tzannetos et.al., Paper: http://arxiv.org/abs/2511.02690
- 2025-10-20, CrossGuard: Safeguarding MLLMs against Joint-Modal Implicit Malicious Attacks, Xu Zhang et.al., Paper: http://arxiv.org/abs/2510.17687
- 2026-03-23, Cross-Modal Reinforcement Learning for Navigation with Degraded Depth Measurements, Omkar Sawant et.al., Paper: http://arxiv.org/abs/2603.22182
- 2026-03-12, Cross-Domain Policy Optimization via Bellman Consistency and Hybrid Critics, Ming-Hong Chen et.al., Paper: http://arxiv.org/abs/2603.12087
- 2026-03-26, Critical curve of two-matrix models $ABBA$, $A{B,A}B$ and $ABAB$ , Part I: Monte Carlo, Carlos I. Pérez Sánchez et.al., Paper: http://arxiv.org/abs/2603.25715
- 2026-03-10, Critical behavior of the thermal phase transition of U(1) lattice gauge systems, Greta Sophie Reese et.al., Paper: http://arxiv.org/abs/2603.09895
- 2025-12-08, Critical Density-Wave Vestigial Phases of Commensurate Pair Density Wave, Chu-Tian Gao et.al., Paper: http://arxiv.org/abs/2512.07591
- 2025-11-21, Critical BKT dynamics in the archetypal 2D spin system Ba $_2$CuSi$_2$O$_6$Cl$_2$, K. M. Ranjith et.al., Paper: http://arxiv.org/abs/2511.17381
- 2026-01-06, Critic-Guided Reinforcement Unlearning in Text-to-Image Diffusion, Mykola Vysotskyi et.al., Paper: http://arxiv.org/abs/2601.03213
- 2026-03-20, Coupled cluster theory for positron binding in anions and polyatomic molecules, Rosario R. Riso et.al., Paper: http://arxiv.org/abs/2603.19948
- 2025-10-24, Cost Minimization for Space-Air-Ground Integrated Multi-Access Edge Computing Systems, Weihong Qin et.al., Paper: http://arxiv.org/abs/2510.21541
- 2026-01-22, Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning, Moo Jin Kim et.al., Paper: http://arxiv.org/abs/2601.16163
- 2026-02-23, Cosmic strings and domain walls: the impact of CMB $B$ -mode data, Luca Caloni et.al., Paper: http://arxiv.org/abs/2602.20050
- 2025-10-27, Cosmic magnification on multi-catalogue Herschel submillimetre galaxies, R. Fernandez-Fernandez et.al., Paper: http://arxiv.org/abs/2510.23582
- 2025-12-05, Correspondence-Oriented Imitation Learning: Flexible Visuomotor Control with 3D Conditioning, Yunhao Cao et.al., Paper: http://arxiv.org/abs/2512.05953
- 2025-12-18, Correlation between the first-reaction time and the acquired boundary local time, Yilin Ye et.al., Paper: http://arxiv.org/abs/2512.16747
- 2026-01-06, Correct, Concise and Complete: Multi-stage Training For Adaptive Reasoning, Nathanaël Carraz Rakotonirina et.al., Paper: http://arxiv.org/abs/2601.02972
- 2026-01-02, Coordination-driven magic numbers in protonated argon clusters, Saajid Chowdhury et.al., Paper: http://arxiv.org/abs/2601.00591
- 2025-12-31, Coordinated Humanoid Manipulation with Choice Policies, Haozhi Qi et.al., Paper: http://arxiv.org/abs/2512.25072
- 2025-12-18, Coordinated Anti-Jamming Resilience in Swarm Networks via Multi-Agent Reinforcement Learning, Bahman Abolhassani et.al., Paper: http://arxiv.org/abs/2512.16813
- 2026-03-25, CoordLight: Learning Decentralized Coordination for Network-Wide Traffic Signal Control, Yifeng Zhang et.al., Paper: http://arxiv.org/abs/2603.24366
- 2026-02-24, Cooperative-Competitive Team Play of Real-World Craft Robots, Rui Zhao et.al., Paper: http://arxiv.org/abs/2602.21119
- 2026-03-26, Cooperative Deep Reinforcement Learning for Fair RIS Allocation, Martin Mark Zan et.al., Paper: http://arxiv.org/abs/2603.25572
- 2026-03-24, Cooperation in Public Goods Games over Uniform Random Hypergraphs with Game Transitions, Nankun Wei et.al., Paper: http://arxiv.org/abs/2603.22967
- 2026-02-12, Convex Markov Games and Beyond: New Proof of Existence, Characterization and Learning Algorithms for Nash Equilibria, Anas Barakat et.al., Paper: http://arxiv.org/abs/2602.12181
- 2026-02-20, Convex Block-Cholesky Approach to Risk-Constrained Low-thrust Trajectory Design under Operational Uncertainty, Kenshiro Oguri et.al., Paper: http://arxiv.org/abs/2602.18416
- 2025-11-28, Convergence rates of self-repellent random walks, their local time and Event Chain Monte Carlo, Andreas Eberle et.al., Paper: http://arxiv.org/abs/2511.23453
- 2025-11-21, Convergence and stability of Q-learning in Hierarchical Reinforcement Learning, Massimiliano Manenti et.al., Paper: http://arxiv.org/abs/2511.17351
- 2025-11-04, Controlling Performance and Budget of a Centralized Multi-agent LLM System with Reinforcement Learning, Bowen Jin et.al., Paper: http://arxiv.org/abs/2511.02755
- 2026-01-12, Controlling Multimodal Conversational Agents with Coverage-Enhanced Latent Actions, Yongqi Li et.al., Paper: http://arxiv.org/abs/2601.07516
- 2026-03-17, Controlling Fish Schools via Reinforcement Learning of Virtual Fish Movement, Yusuke Nishii et.al., Paper: http://arxiv.org/abs/2603.16384
- 2026-03-16, Controlled Langevin Dynamics for Sampling of Feedforward Neural Networks Trained with Minibatches, Alessandro Zambon et.al., Paper: http://arxiv.org/abs/2603.15367
- 2026-01-16, Controlled Interacting Branching Diffusion Processes: A Viscosity Approach, Antonio Ocello et.al., Paper: http://arxiv.org/abs/2601.11294
- 2025-12-15, Control of a Twin Rotor using Twin Delayed Deep Deterministic Policy Gradient (TD3), Zeyad Gamal et.al., Paper: http://arxiv.org/abs/2512.13356
- 2025-12-31, Control of Microrobots with Reinforcement Learning under On-Device Compute Constraints, Yichen Liu et.al., Paper: http://arxiv.org/abs/2512.24740
- 2026-02-04, Control Lyapunov Functions for Optimality in Sontag-Type Control, Joscha F. Bongard et.al., Paper: http://arxiv.org/abs/2602.04756
- 2022-11-07, Contrastive Value Learning: Implicit Models for Simple Offline RL, Bogdan Mazoure et.al., Paper: http://arxiv.org/abs/2211.02100
- 2026-03-18, Contrastive Reasoning Alignment: Reinforcement Learning from Hidden Representations, Haozheng Luo et.al., Paper: http://arxiv.org/abs/2603.17305
- 2026-02-09, Contraction Metric Based Safe Reinforcement Learning Force Control for a Hydraulic Actuator with Real-World Training, Lucca Maitan et.al., Paper: http://arxiv.org/abs/2602.08977
- 2025-11-13, Continuum Dropout for Neural Differential Equations, Jonghun Lee et.al., Paper: http://arxiv.org/abs/2511.10446
- 2026-02-06, Continuous-time reinforcement learning: ellipticity enables model-free value function approximation, Wenlong Mou et.al., Paper: http://arxiv.org/abs/2602.06930
- 2025-11-06, Continuous matrix product operators for quantum fields, Erickson Tjoa et.al., Paper: http://arxiv.org/abs/2511.04545
- 2025-10-20, Continuous Q-Score Matching: Diffusion Guided Reinforcement Learning for Continuous-Time Control, Chengxiu Hua et.al., Paper: http://arxiv.org/abs/2510.17122
- 2026-03-11, Continuous Diffusion Transformers for Designing Synthetic Regulatory Elements, Jonathan Liu et.al., Paper: http://arxiv.org/abs/2603.10885
- 2025-11-19, Continual Reinforcement Learning for Cyber-Physical Systems: Lessons Learned and Open Challenges, Kim N. Nolle et.al., Paper: http://arxiv.org/abs/2511.15652
- 2025-12-26, Contextual Biasing for LLM-Based ASR with Hotword Retrieval and Reinforcement Learning, YuXiang Kong et.al., Paper: http://arxiv.org/abs/2512.21828
- 2025-11-14, Context-aware Adaptive Visualizations for Critical Decision Making, Angela Lopez-Cardona et.al., Paper: http://arxiv.org/abs/2511.11476
- 2025-12-16, Context-Picker: Dynamic context selection using multi-stage reinforcement learning, Siyuan Zhu et.al., Paper: http://arxiv.org/abs/2512.14465
- 2025-12-16, Context Representation via Action-Free Transformer encoder-decoder for Meta Reinforcement Learning, Amir M. Soufi Enayati et.al., Paper: http://arxiv.org/abs/2512.14057
- 2026-03-19, Context Bootstrapped Reinforcement Learning, Saaket Agashe et.al., Paper: http://arxiv.org/abs/2603.18953
- 2026-03-11, Contact Coverage-Guided Exploration for General-Purpose Dexterous Manipulation, Zixuan Liu et.al., Paper: http://arxiv.org/abs/2603.10971
- 2025-10-23, Consumption-Investment Problem in Rank-Based Models, David Itkin et.al., Paper: http://arxiv.org/abs/2510.20763
- 2026-02-16, Constructions of linear codes from vectorial plateaued functions and their subfield codes with applications to quantum CSS codes, Virginio Fratianni et.al., Paper: http://arxiv.org/abs/2602.14832
- 2025-10-24, Constraints on ultra-heavy dark matter from the CDEX-10 experiment at the China Jinping Underground Laboratory, Y. F. Wang et.al., Paper: http://arxiv.org/abs/2510.21458
- 2026-01-22, Constraining dark energy models using Jackknife and Bootstrap resampling, Roshna K et.al., Paper: http://arxiv.org/abs/2601.16197
- 2026-01-09, Constraining Hamiltonians from chiral effective field theory with neutron-star data, Cassandra L. Armstrong et.al., Paper: http://arxiv.org/abs/2601.05999
- 2026-02-11, Constrained Fiducial Inference for Gaussian Models, Hank Flury et.al., Paper: http://arxiv.org/abs/2602.11080
- 2025-10-20, Consistent Zero-Shot Imitation with Contrastive Goal Inference, Kathryn Wantlin et.al., Paper: http://arxiv.org/abs/2510.17059
- 2025-11-10, Consistency Is Not Always Correct: Towards Understanding the Role of Exploration in Post-Training Reasoning, Dake Bu et.al., Paper: http://arxiv.org/abs/2511.07368
- 2025-12-29, Considering parallel tempering and comparing post-treatment procedures in Bayesian Profile Regression Models for a survival outcome and correlated exposures, Fendler Julie et.al., Paper: http://arxiv.org/abs/2512.23571
- 2020-08-20, Conservative Q-Learning for Offline Reinforcement Learning, Aviral Kumar et.al., Paper: http://arxiv.org/abs/2006.04779
- 2025-12-02, Congruences for the ratios of Rankin–Selberg $L$ -functions, P. Narayanan et.al., Paper: http://arxiv.org/abs/2512.02919
- 2026-02-02, Conflict-Aware Client Selection for Multi-Server Federated Learning, Mingwei Hong et.al., Paper: http://arxiv.org/abs/2602.02458
- 2026-03-24, Confidence Calibration under Ambiguous Ground Truth, Linwei Tao et.al., Paper: http://arxiv.org/abs/2603.22879
- 2025-12-12, Conditional Copula models using loss-based Bayesian Additive Regression Trees, Tathagata Basu et.al., Paper: http://arxiv.org/abs/2512.11427
- 2025-12-25, Concentration-Dependent Tungsten Effects on Short-Range Order and Deformation Behavior in Ni-W alloys, Shaozun Liu et.al., Paper: http://arxiv.org/abs/2512.21745
- 2025-10-23, Conan: Progressive Learning to Reason Like a Detective over Multi-Scale Visual Evidence, Kun Ouyang et.al., Paper: http://arxiv.org/abs/2510.20470
- 2026-01-08, ConMax: Confidence-Maximizing Compression for Efficient Chain-of-Thought Reasoning, Minda Hu et.al., Paper: http://arxiv.org/abs/2601.04973
- 2026-01-12, Computing quantum magic of state vectors, Piotr Sierant et.al., Paper: http://arxiv.org/abs/2601.07824
- 2025-12-12, Computing quantum entanglement with machine learning, Andrea Bulgarelli et.al., Paper: http://arxiv.org/abs/2512.11389
- 2026-02-19, Computer-Using World Model, Yiming Guan et.al., Paper: http://arxiv.org/abs/2602.17365
- 2026-01-30, Computationally efficient segmentation for non-stationary time series with oscillatory patterns, Nicolas Bianco et.al., Paper: http://arxiv.org/abs/2601.22999
- 2025-10-21, Computational Foundations for Strategic Coopetition: Formalizing Interdependence and Complementarity, Vik Pant et.al., Paper: http://arxiv.org/abs/2510.18802
- 2026-03-19, Computation of thermal entropy for the doped Hubbard Model, Yu-Feng Song et.al., Paper: http://arxiv.org/abs/2603.18998
- 2026-02-18, Computation of thermal conductivity based on Path Integral Monte Carlo methods, Vladislav Efremkin et.al., Paper: http://arxiv.org/abs/2602.16405
- 2026-02-08, Compressed Sensing Methods for Memory Reduction in Monte Carlo Simulations, Ethan Lame et.al., Paper: http://arxiv.org/abs/2602.07771
- 2026-03-10, Comprehensive structural and optical analysis of differently oriented Yb-implanted $β$-Ga$_2$O$_3$, Joanna Matulewicz et.al., Paper: http://arxiv.org/abs/2603.09592
- 2026-02-12, Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models, Xin Xu et.al., Paper: http://arxiv.org/abs/2602.12036
- 2026-03-25, Composer 2 Technical Report, Cursor Reseach et.al., Paper: http://arxiv.org/abs/2603.24477
- 2026-03-19, Complexity of Auctions with Interdependence, Patrick Loiseau et.al., Paper: http://arxiv.org/abs/2603.18668
- 2025-12-11, Complexity and multi-functional variants of the Quantum-to-Quantum Bernoulli Factories, Francesco Hoch et.al., Paper: http://arxiv.org/abs/2512.10810
- 2026-03-25, Completeness of Unbounded Best-First Minimax and Descent Minimax, Quentin Cohen-Solal et.al., Paper: http://arxiv.org/abs/2603.24572
- 2026-01-15, Comparison of viscosity solutions for a class of non-linear PDEs on the space of finite nonnegative measures, Ibrahim Ekren et.al., Paper: http://arxiv.org/abs/2601.10586
- 2026-01-28, Comparing causal estimands from sequential nested versus single point target trials: A simulation study, Catherine Wiener et.al., Paper: http://arxiv.org/abs/2601.20725
- 2025-12-23, Comparative study of plasmons in half-filled graphene via Quantum Monte Carlo and Random Phase Approximation, Maksim Ulybyshev et.al., Paper: http://arxiv.org/abs/2512.20559
- 2025-12-08, Comparative Analysis and Parametric Tuning of PPO, GRPO, and DAPO for LLM Reasoning Enhancement, Yongsheng Lian et.al., Paper: http://arxiv.org/abs/2512.07611
- 2026-03-31, Communication Outage-Resistant UUV State Estimation: A Variational History Distillation Approach, Shuyue Li et.al., Paper: http://arxiv.org/abs/2603.29512
- 2026-03-06, Comment on: “Third-order corrections to the slow-roll expansion: Calculation and constraints with Planck, ACT, SPT, and BICEP/Keck [2025 PDU 47 101813]”, Pierre Auclair et.al., Paper: http://arxiv.org/abs/2603.06521
- 2025-11-06, Combining Harmonic Sampling with the Worm Algorithm to Improve the Efficiency of Path Integral Monte Carlo, Sourav Karmakar et.al., Paper: http://arxiv.org/abs/2511.04597
- 2026-01-15, Combinatorial Optimization Augmented Machine Learning, Maximilian Schiffer et.al., Paper: http://arxiv.org/abs/2601.10583
- 2025-10-20, Colour coherence in small collision systems, Isobel Kolbé et.al., Paper: http://arxiv.org/abs/2510.17570
- 2025-10-20, Collider Searches for Near-Continuum Dark Matter, Steven Ferrante et.al., Paper: http://arxiv.org/abs/2510.17989
- 2026-01-14, Collaborative Multi-Agent Test-Time Reinforcement Learning for Reasoning, Zhiyuan Hu et.al., Paper: http://arxiv.org/abs/2601.09667
- 2026-02-16, Cold-Start Personalization via Training-Free Priors from Structured World Models, Avinandan Bose et.al., Paper: http://arxiv.org/abs/2602.15012
- 2025-10-20, Coinvisor: An RL-Enhanced Chatbot Agent for Interactive Cryptocurrency Investment Analysis, Chong Chen et.al., Paper: http://arxiv.org/abs/2510.17235
- 2025-12-09, CogMCTS: A Novel Cognitive-Guided Monte Carlo Tree Search Framework for Iterative Heuristic Evolution with Large Language Models, Hui Wang et.al., Paper: http://arxiv.org/abs/2512.08609
- 2026-03-07, Coexistence Regime and Thermal Crystallization in the cavity-mediated extended Bose-Hubbard Model, Wei-Wei Wang et.al., Paper: http://arxiv.org/abs/2603.07121
- 2025-12-22, CodeSimpleQA: Scaling Factuality in Code Large Language Models, Jian Yang et.al., Paper: http://arxiv.org/abs/2512.19424
- 2026-03-18, CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents, Lintang Sutawika et.al., Paper: http://arxiv.org/abs/2603.17829
- 2025-10-21, CodeRL+: Improving Code Generation via Reinforcement with Execution Semantics Alignment, Xue Jiang et.al., Paper: http://arxiv.org/abs/2510.18471
- 2026-03-16, Code-A1: Adversarial Evolving of Code LLM and Test LLM via Reinforcement Learning, Aozhe Wang et.al., Paper: http://arxiv.org/abs/2603.15611
- 2026-02-06, Cochain Perspectives on Temporal-Difference Signals for Learning Beyond Markov Dynamics, Zuyuan Zhang et.al., Paper: http://arxiv.org/abs/2602.06939
- 2026-03-18, CoVerRL: Breaking the Consensus Trap in Label-Free Reasoning via Generator-Verifier Co-Evolution, Teng Pan et.al., Paper: http://arxiv.org/abs/2603.17775
- 2026-03-02, CoVe: Training Interactive Tool-Use Agents via Constraint-Guided Verification, Jinpeng Chen et.al., Paper: http://arxiv.org/abs/2603.01940
- 2026-03-24, CoMaTrack: Competitive Multi-Agent Game-Theoretic Tracking with Vision-Language-Action Models, Youzhi Liu et.al., Paper: http://arxiv.org/abs/2603.22846
- 2026-02-08, CoLF: Learning Consistent Leader-Follower Policies for Vision-Language-Guided Multi-Robot Cooperative Transport, Joachim Yann Despature et.al., Paper: http://arxiv.org/abs/2602.07776
- 2026-01-07, CoINS: Counterfactual Interactive Navigation via Skill-Aware VLM, Kangjie Zhou et.al., Paper: http://arxiv.org/abs/2601.03956
- 2025-10-28, Cluster Dose Prediction in Carbon Ion Therapy: Using Transfer Learning from a Pretrained Dose Prediction U-Net, Miriam Schwarze et.al., Paper: http://arxiv.org/abs/2510.24703
- 2026-03-23, Closed-Loop Verbal Reinforcement Learning for Task-Level Robotic Planning, Dmitrii Plotnikov et.al., Paper: http://arxiv.org/abs/2603.22169
- 2025-11-26, Closed Form HJB Solution for Continuous-Time Optimal Control of a Non-Linear Input-Affine System, Akash Vyas et.al., Paper: http://arxiv.org/abs/2511.21593
- 2026-01-12, Clipped Affine Policy: Low-Complexity Near-Optimal Online Power Control for Energy Harvesting Communications over Fading Channels, Hao Wu et.al., Paper: http://arxiv.org/abs/2601.07622
- 2026-02-05, Clifford Kolmogorov-Arnold Networks, Matthias Wolff et.al., Paper: http://arxiv.org/abs/2602.05977
- 2026-03-14, Chunk-Guided Q-Learning, Gwanwoo Song et.al., Paper: http://arxiv.org/abs/2603.13971
- 2025-12-10, ChronusOmni: Improving Time Awareness of Omni Large Language Models, Yijing Chen et.al., Paper: http://arxiv.org/abs/2512.09841
- 2025-10-21, Chemistry, Climate, and Transmission Spectra of TRAPPIST-1 e Explored with a Multimodel Sparse Sampled Ensemble, Eric T. Wolf et.al., Paper: http://arxiv.org/abs/2510.18704
- 2026-02-11, Chatting with Images for Introspective Visual Thinking, Junfei Wu et.al., Paper: http://arxiv.org/abs/2602.11073
- 2026-03-06, ChatShopBuddy: Towards Reliable Conversational Shopping Agents via Reinforcement Learning, Yiruo Cheng et.al., Paper: http://arxiv.org/abs/2603.06065
- 2025-12-26, Charge-Informed Quantum Error Correction, Vlad Temkin et.al., Paper: http://arxiv.org/abs/2512.22119
- 2025-10-20, Characterizing expansivity through $C^*$ -algebras, S. Bautista et.al., Paper: http://arxiv.org/abs/2510.17255
- 2025-10-30, Characterization of the H2M Monolithic CMOS Sensor, Rafael Ballabriga et.al., Paper: http://arxiv.org/abs/2510.26741
- 2025-12-23, Characterization of the BIFROST spectrometer through virtual experiments, Kristine M. L. Krighaar et.al., Paper: http://arxiv.org/abs/2512.20463
- 2026-03-02, CharacterFlywheel: Scaling Iterative Improvement of Engaging and Steerable LLMs in Production, Yixin Nie et.al., Paper: http://arxiv.org/abs/2603.01973
- 2025-11-19, Challenging the $ω_0ω_a$ CDM parametrization through rational expansions in view of DESI data release, Youri Carloni et.al., Paper: http://arxiv.org/abs/2511.15472
- 2025-10-31, Challenges in Credit Assignment for Multi-Agent Reinforcement Learning in Open Agent Systems, Alireza Saleh Abadi et.al., Paper: http://arxiv.org/abs/2510.27659
- 2026-01-09, Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards, Jiajie Zhang et.al., Paper: http://arxiv.org/abs/2601.06021
- 2026-03-20, Chain-of-Adaptation: Surgical Vision-Language Adaptation with Reinforcement Learning, Jiajie Li et.al., Paper: http://arxiv.org/abs/2603.20116
- 2026-02-18, Certifying Hamilton-Jacobi Reachability Learned via Reinforcement Learning, Prashant Solanki et.al., Paper: http://arxiv.org/abs/2602.16475
- 2025-10-20, Certified Self-Consistency: Statistical Guarantees and Test-Time Training for Reliable Reasoning in LLMs, Paula Cordero-Encinar et.al., Paper: http://arxiv.org/abs/2510.17472
- 2025-12-23, Certified Lower Bounds and Efficient Estimation of Minimum Accuracy in Quantum Kernel Methods, Demerson N. Gonçalves et.al., Paper: http://arxiv.org/abs/2512.20588
- 2026-01-07, Cells on Autopilot: Adaptive Cell (Re)Selection via Reinforcement Learning, Marvin Illian et.al., Paper: http://arxiv.org/abs/2601.04083
- 2026-02-24, Cell-Free Massive MIMO-Assisted SWIPT Using Stacked Intelligent Metasurfaces, Thien Duc Hua et.al., Paper: http://arxiv.org/abs/2602.20983
- 2026-02-18, Causally-Guided Automated Feature Engineering with Multi-Agent Reinforcement Learning, Arun Vignesh Malarkkan et.al., Paper: http://arxiv.org/abs/2602.16435
- 2025-10-24, Causality Meets Locality: Provably Generalizable and Scalable Policy Learning for Networked Systems, Hao Liang et.al., Paper: http://arxiv.org/abs/2510.21427
- 2026-03-19, CausalRM: Causal-Theoretic Reward Modeling for RLHF from Observational User Feedbacks, Hao Wang et.al., Paper: http://arxiv.org/abs/2603.18736
- 2025-10-27, Causal Deep Q Network, Elouanes Khelifi et.al., Paper: http://arxiv.org/abs/2510.23424
- 2026-03-25, CarePilot: A Multi-Agent Framework for Long-Horizon Computer Task Automation in Healthcare, Akash Ghosh et.al., Paper: http://arxiv.org/abs/2603.24157
- 2025-12-05, Capturing Classic Authorial Style in Long-Form Story Generation with GRPO Fine-Tuning, Jinlong Liu et.al., Paper: http://arxiv.org/abs/2512.05747
- 2026-04-02, Captioning Daily Activity Images in Early Childhood Education: Benchmark and Algorithm, Sixing Li et.al., Paper: http://arxiv.org/abs/2604.01941
- 2026-02-18, Capacity-constrained demand response in smart grids using deep reinforcement learning, Shafagh Abband Pashaki et.al., Paper: http://arxiv.org/abs/2602.16525
- 2026-02-12, Capability-Oriented Training Induced Alignment Risk, Yujun Zhou et.al., Paper: http://arxiv.org/abs/2602.12124
- 2026-02-05, Can vision language models learn intuitive physics from interaction?, Luca M. Schulze Buschoff et.al., Paper: http://arxiv.org/abs/2602.06033
- 2025-12-17, Can LLMs Guide Their Own Exploration? Gradient-Guided Reinforcement Learning for LLM Reasoning, Zhenwen Liang et.al., Paper: http://arxiv.org/abs/2512.15687
- 2025-12-17, Can AI Generate more Comprehensive Test Scenarios? Review on Automated Driving Systems Test Scenario Generation Methods, Ji Zhou et.al., Paper: http://arxiv.org/abs/2512.15422
- 2026-01-22, CamPilot: Improving Camera Control in Video Diffusion Model with Efficient Camera Reward Feedback, Wenhang Ge et.al., Paper: http://arxiv.org/abs/2601.16214
- 2026-02-27, CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation, Weinan Dai et.al., Paper: http://arxiv.org/abs/2602.24286
- 2025-10-21, CUARewardBench: A Benchmark for Evaluating Reward Models on Computer-using Agent, Haojia Lin et.al., Paper: http://arxiv.org/abs/2510.18596
- 2026-03-25, CUA-Suite: Massive Human-annotated Video Demonstrations for Computer-Use Agents, Xiangru Jian et.al., Paper: http://arxiv.org/abs/2603.24440
- 2026-01-15, CS-GBA: A Critical Sample-based Gradient-guided Backdoor Attack for Offline Reinforcement Learning, Yuanjie Zhao et.al., Paper: http://arxiv.org/abs/2601.10407
- 2026-02-04, CRoSS: A Continual Robotic Simulation Suite for Scalable Reinforcement Learning with High Task Diversity and Realistic Physics Simulation, Yannick Denker et.al., Paper: http://arxiv.org/abs/2602.04868
- 2025-12-16, CRISP: Contact-Guided Real2Sim from Monocular Video with Planar Scene Primitives, Zihan Wang et.al., Paper: http://arxiv.org/abs/2512.14696
- 2026-01-20, CREATE: Cross-Layer Resilience Characterization and Optimization for Efficient yet Reliable Embodied AI Systems, Tong Xie et.al., Paper: http://arxiv.org/abs/2601.14140
- 2026-03-18, CRE-T1 Preview Technical Report: Beyond Contrastive Learning for Reasoning-Intensive Retrieval, Guangzhi Wang et.al., Paper: http://arxiv.org/abs/2603.17387
- 2026-03-19, CRAFT: Aligning Diffusion Models with Fine-Tuning Is Easier Than You Think, Zening Sun et.al., Paper: http://arxiv.org/abs/2603.18991
- 2026-01-01, CPPO: Contrastive Perception for Vision Language Policy Optimization, Ahmad Rezaei et.al., Paper: http://arxiv.org/abs/2601.00501
- 2025-12-22, CORE: Compensable Reward as a Catalyst for Improving Offline RL in Wireless Networks, Lipeng Zu et.al., Paper: http://arxiv.org/abs/2512.19671
- 2026-01-05, CORE: Code-based Inverse Self-Training Framework with Graph Expansion for Virtual Agents, Keyu Wang et.al., Paper: http://arxiv.org/abs/2601.02201
- 2026-02-10, CODE-SHARP: Continuous Open-ended Discovery and Evolution of Skills as Hierarchical Reward Programs, Richard Bornemann et.al., Paper: http://arxiv.org/abs/2602.10085
- 2026-03-03, CMoE: Contrastive Mixture of Experts for Motion Control and Terrain Adaptation of Humanoid Robots, Shihao Ma et.al., Paper: http://arxiv.org/abs/2603.03067
- 2026-02-20, CMB anisotropies from cosmic (super)strings in light of ACT DR6, Juhan Raidal et.al., Paper: http://arxiv.org/abs/2602.18272
- 2026-02-12, CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use, Zhen Zhang et.al., Paper: http://arxiv.org/abs/2602.12268
- 2026-01-21, CLEANER: Self-Purified Trajectories Boost Agentic Reinforcement Learning, Tianshi Xu et.al., Paper: http://arxiv.org/abs/2601.15141
- 2025-10-20, CLAWS:Creativity detection for LLM-generated solutions using Attention Window of Sections, Keuntae Kim et.al., Paper: http://arxiv.org/abs/2510.17921
- 2026-01-09, CHDP: Cooperative Hybrid Diffusion Policies for Reinforcement Learning in Parameterized Action Space, Bingyi Liu et.al., Paper: http://arxiv.org/abs/2601.05675
- 2025-11-04, CGES: Confidence-Guided Early Stopping for Efficient and Accurate Self-Consistency, Ehsan Aghazadeh et.al., Paper: http://arxiv.org/abs/2511.02603
- 2025-12-04, CARL: Critical Action Focused Reinforcement Learning for Multi-Step Agent, Leyang Shen et.al., Paper: http://arxiv.org/abs/2512.04949
- 2025-12-22, CARE What Fails: Contrastive Anchored-REflection for Verifiable Multimodal, Yongxin Wang et.al., Paper: http://arxiv.org/abs/2512.19554
- 2026-03-31, C-TRAIL: A Commonsense World Framework for Trajectory Planning in Autonomous Driving, Zhihong Cui et.al., Paper: http://arxiv.org/abs/2603.29908
- 2026-03-25, C-STEP: Continuous Space-Time Empowerment for Physics-informed Safe Reinforcement Learning of Mobile Agents, Guihlerme Daubt et.al., Paper: http://arxiv.org/abs/2603.24241
- 2025-11-10, Bridging the divide: axion searches and axino phenomenology at colliders, Gabe Hoshino et.al., Paper: http://arxiv.org/abs/2511.07224
- 2026-03-28, Bridging Visual Representation and Reinforcement Learning from Verifiable Rewards in Large Vision-Language Models, Yuhang Han et.al., Paper: http://arxiv.org/abs/2603.27375
- 2025-11-20, Bridging VLMs and Embodied Intelligence with Deliberate Practice Policy Optimization, Yi Zhang et.al., Paper: http://arxiv.org/abs/2511.16602
- 2026-04-01, Bridging RL and MPC for mixed-integer optimal control with application to Formula 1 race strategies, Joschua Wüthrich et.al., Paper: http://arxiv.org/abs/2604.00826
- 2026-02-03, Bridging Online and Offline RL: Contextual Bandit Learning for Multi-Turn Code Generation, Ziru Chen et.al., Paper: http://arxiv.org/abs/2602.03806
- 2026-03-19, Bridging Network Fragmentation: A Semantic-Augmented DRL Framework for UAV-aided VANETs, Gaoxiang Cao et.al., Paper: http://arxiv.org/abs/2603.18871
- 2026-01-27, Bridging Information Asymmetry: A Hierarchical Framework for Deterministic Blind Face Restoration, Zhengjian Yao et.al., Paper: http://arxiv.org/abs/2601.19506
- 2026-04-02, Bridging Discrete Planning and Continuous Execution for Redundant Robot, Teng Yan et.al., Paper: http://arxiv.org/abs/2604.02021
- 2026-01-06, Breaking the Dimensional Barrier: Dynamic Portfolio Choice with Parameter Uncertainty via Pontryagin Projection, Jeonggyu Huh et.al., Paper: http://arxiv.org/abs/2601.03175
- 2026-03-20, Breaking the Capability Ceiling of LLM Post-Training by Reintroducing Markov States, Yurun Yuan et.al., Paper: http://arxiv.org/abs/2603.19987
- 2026-03-09, Breaking the Bias Barrier in Concave Multi-Objective Reinforcement Learning, Swetha Ganesh et.al., Paper: http://arxiv.org/abs/2603.08518
- 2026-03-19, Box Maze: A Process-Control Architecture for Reliable LLM Reasoning, Zou Qiang et.al., Paper: http://arxiv.org/abs/2603.19182
- 2025-12-22, Bottom-up Policy Optimization: Your Language Model Policy Secretly Contains Internal Policies, Yuqiao Tan et.al., Paper: http://arxiv.org/abs/2512.19673
- 2026-03-06, Boosting deep Reinforcement Learning using pretraining with Logical Options, Zihan Ye et.al., Paper: http://arxiv.org/abs/2603.06565
- 2026-01-23, Boosting Deep Reinforcement Learning with Semantic Knowledge for Robotic Manipulators, Lucía Güitta-López et.al., Paper: http://arxiv.org/abs/2601.16866
- 2026-01-29, Boosting CVaR Policy Optimization with Quantile Gradients, Yudong Luo et.al., Paper: http://arxiv.org/abs/2601.22100
- 2026-03-05, Boosting ASR Robustness via Test-Time Reinforcement Learning with Audio-Text Semantic Rewards, Linghan Fang et.al., Paper: http://arxiv.org/abs/2603.05231
- 2026-03-02, Boltzmann-based Exploration for Robust Decentralized Multi-Agent Planning, Nhat Nguyen et.al., Paper: http://arxiv.org/abs/2603.02154
- 2025-11-13, Black-Box On-Policy Distillation of Large Language Models, Tianzhu Ye et.al., Paper: http://arxiv.org/abs/2511.10643
- 2026-03-12, Bielik-Minitron-7B: Compressing Large Language Models via Structured Pruning and Knowledge Distillation for the Polish Language, Remigiusz Kinas et.al., Paper: http://arxiv.org/abs/2603.11881
- 2026-02-16, Bias analysis of a linear order-statistic inequality index estimator: Unbiasedness under gamma populations, Roberto Vila et.al., Paper: http://arxiv.org/abs/2602.14861
- 2026-02-27, Bi-level RL-Heuristic Optimization for Real-world Winter Road Maintenance, Yue Xie et.al., Paper: http://arxiv.org/abs/2602.24097
- 2026-01-21, Bhabha scattering at future colliders with BHLUMI/BHWIDE, Wiesław Płaczek et.al., Paper: http://arxiv.org/abs/2601.15265
- 2026-03-11, Beyond the Illusion of Consensus: From Surface Heuristics to Knowledge-Grounded Evaluation in LLM-as-a-Judge, Mingyang Song et.al., Paper: http://arxiv.org/abs/2603.11027
- 2026-02-11, Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling, Gongye Liu et.al., Paper: http://arxiv.org/abs/2602.11146
- 2026-02-17, Beyond Static Pipelines: Learning Dynamic Workflows for Text-to-SQL, Yihan Wang et.al., Paper: http://arxiv.org/abs/2602.15564
- 2026-01-12, Beyond Single-Shot: Multi-step Tool Retrieval via Query Planning, Wei Fang et.al., Paper: http://arxiv.org/abs/2601.07782
- 2025-12-01, Beyond SFT: Reinforcement Learning for Safer Large Reasoning Models with Better Reasoning Ability, Jinghan Jia et.al., Paper: http://arxiv.org/abs/2512.01848
- 2026-02-04, Beyond Rewards in Reinforcement Learning for Cyber Defence, Elizabeth Bates et.al., Paper: http://arxiv.org/abs/2602.04809
- 2026-03-10, Beyond QED: Electroweak and hadronic extensions of McMule, Sophie Kollatzsch et.al., Paper: http://arxiv.org/abs/2603.09443
- 2025-11-21, Beyond Multiple Choice: A Hybrid Framework for Unifying Robust Evaluation and Verifiable Reasoning Training, Yesheng Liu et.al., Paper: http://arxiv.org/abs/2511.17405
- 2026-02-23, Beyond Mimicry: Toward Lifelong Adaptability in Imitation Learning, Nathan Gavenski et.al., Paper: http://arxiv.org/abs/2602.19930
- 2026-03-13, Beyond Imitation: Reinforcement Learning Fine-Tuning for Adaptive Diffusion Navigation Policies, Junhe Sheng et.al., Paper: http://arxiv.org/abs/2603.12868
- 2026-03-10, Beyond Fermi-II: Intermittent Particle Acceleration by Relativistic Turbulence in Astrophysical Plasmas, Anton Dmytriiev et.al., Paper: http://arxiv.org/abs/2603.09394
- 2026-03-26, Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models, Xunguang Wang et.al., Paper: http://arxiv.org/abs/2603.25412
- 2026-03-11, Beyond Accuracy: Reliability and Uncertainty Estimation in Convolutional Neural Networks, Sanne Ruijs et.al., Paper: http://arxiv.org/abs/2603.10731
- 2025-10-21, Beware of the running $n_s$ when producing heavy primordial black holes, Sasha Allegrini et.al., Paper: http://arxiv.org/abs/2510.18791
- 2026-03-24, Between Resolution Collapse and Variance Inflation: Weighted Conformal Anomaly Detection in Low-Data Regimes, Oliver Hennhöfer et.al., Paper: http://arxiv.org/abs/2603.23205
- 2026-02-04, Benchmark Study of CEvNS Nuclear Recoil Observables for B, Mg, Ti, and Zr Targets Using Geant4, Yusuf Havvat et.al., Paper: http://arxiv.org/abs/2602.04771
- 2025-12-29, Bellman Calibration for V-Learning in Offline Reinforcement Learning, Lars van der Laan et.al., Paper: http://arxiv.org/abs/2512.23694
- 2025-11-05, Behavior-Adaptive Q-Learning: A Unifying Framework for Offline-to-Online RL, Lipeng Zu et.al., Paper: http://arxiv.org/abs/2511.03695
- 2026-03-04, BeamPERL: Parameter-Efficient RL with Verifiable Rewards Specializes Compact LLMs for Structured Beam Mechanics Reasoning, Tarjei Paule Hage et.al., Paper: http://arxiv.org/abs/2603.04124
- 2026-01-15, Be Your Own Red Teamer: Safety Alignment via Self-Play and Reflective Experience Replay, Hao Wang et.al., Paper: http://arxiv.org/abs/2601.10589
- 2026-02-17, Bayesian parameter study of the Seyfert-starburst composite galaxies NGC 1068 and NGC 7469, Björn Eichmann et.al., Paper: http://arxiv.org/abs/2602.15644
- 2026-03-31, Bayesian methods for the identification of model parameters for water transport in porous media, Paola Stolfi et.al., Paper: http://arxiv.org/abs/2603.29859
- 2025-11-04, Bayesian full waveform inversion with learned prior using deep convolutional autoencoder, Shuhua Hu et.al., Paper: http://arxiv.org/abs/2511.02737
- 2025-11-10, Bayesian compartmental modelling of MRSA transmission within hospitals in Edmonton, Canada, Ruoyu Li et.al., Paper: http://arxiv.org/abs/2511.07353
- 2025-11-05, Bayesian Topological Analysis of Functional Brain Networks, Xukun Zhu et.al., Paper: http://arxiv.org/abs/2511.03605
- 2025-12-11, Bayesian Symbolic Regression via Posterior Sampling, Geoffrey F. Bomarito et.al., Paper: http://arxiv.org/abs/2512.10849
- 2026-01-14, Bayesian Semi-Blind Deconvolution at Scale, Guillermina Senn et.al., Paper: http://arxiv.org/abs/2601.09677
- 2026-03-18, Bayesian Scalar-on-Tensor Quantile Regression for Longitudinal Data on Alzheimer’s Disease, Rongke Lyu et.al., Paper: http://arxiv.org/abs/2603.17294
- 2026-02-09, Bayesian Preference Learning for Test-Time Steerable Reward Models, Jiwoo Hong et.al., Paper: http://arxiv.org/abs/2602.08819
- 2025-12-10, Bayesian Model Selection with an Application to Cosmology, Nikoloz Gigiberia et.al., Paper: http://arxiv.org/abs/2512.09724
- 2026-02-25, Bayesian Generative Adversarial Networks via Gaussian Approximation for Tabular Data Synthesis, Bahrul Ilmi Nasution et.al., Paper: http://arxiv.org/abs/2602.21948
- 2025-11-21, Bayesian Bridge Gaussian Process Regression, Minshen Xu et.al., Paper: http://arxiv.org/abs/2511.17415
- 2025-12-05, Bayesian Active Inference for Intelligent UAV Anti-Jamming and Adaptive Trajectory Planning, Ali Krayani et.al., Paper: http://arxiv.org/abs/2512.05711
- 2025-11-26, Bang-Bang Evasion: Its Stochastic Optimality and a Terminal-Set-Based Implementation, Liraz Mudrik et.al., Paper: http://arxiv.org/abs/2511.21633
- 2026-03-06, Balancing Efficiency and Feasibility: A Sensitivity Analysis of the Augmentation Parameter in the Finite Selection Model, Safaa K. Kadhem et.al., Paper: http://arxiv.org/abs/2603.06493
- 2026-03-19, Balanced Thinking: Improving Chain of Thought Training in Vision Language Models, Shaked Perek et.al., Paper: http://arxiv.org/abs/2603.18656
- 2026-02-13, BSN-VI: Multiband Light Curve Modeling of Four W UMa-Type Contact Binaries I. Revisiting Energy Transfer Mechanisms and Luminosity Behavior, Elham Sarvari et.al., Paper: http://arxiv.org/abs/2602.12864
- 2026-02-16, BPP: Long-Context Robot Imitation Learning by Focusing on Key History Frames, Max Sobol Mark et.al., Paper: http://arxiv.org/abs/2602.15010
- 2026-02-20, BLM-Guard: Explainable Multimodal Ad Moderation with Chain-of-Thought and Policy-Aligned Rewards, Yiran Yang et.al., Paper: http://arxiv.org/abs/2602.18193
- 2026-02-08, BFTS: Thompson Sampling with Bayesian Additive Regression Trees, Ruizhe Deng et.al., Paper: http://arxiv.org/abs/2602.07767
- 2026-02-16, BFS-PO: Best-First Search for Large Reasoning Models, Fiorenzo Parascandolo et.al., Paper: http://arxiv.org/abs/2602.14917
- 2026-03-12, BBN to Late-Time Acceleration in $f(T,\mathcal{L}_m)$ Gravity, Sai Swagat Mishra et.al., Paper: http://arxiv.org/abs/2603.11760
- 2026-04-01, BAT: Balancing Agility and Stability via Online Policy Switching for Long-Horizon Whole-Body Humanoid Control, Donghoon Baek et.al., Paper: http://arxiv.org/abs/2604.01064
- 2025-11-26, BAMAS: Structuring Budget-Aware Multi-Agent Systems, Liming Yang et.al., Paper: http://arxiv.org/abs/2511.21572
- 2026-01-20, BACH-V: Bridging Abstract and Concrete Human-Values in Large Language Models, Junyu Zhang et.al., Paper: http://arxiv.org/abs/2601.14007
- 2025-10-20, B-Meson Anomalies: Effective Field Theory Meets Machine Learning, Alejandro Mir et.al., Paper: http://arxiv.org/abs/2510.17742
- 2026-02-16, Auxiliary field quantum Monte Carlo at the basis set limit: application to lattice constants, Moritz Humer et.al., Paper: http://arxiv.org/abs/2602.14923
- 2025-12-17, Autoregressive Language Models are Secretly Energy-Based Models: Insights into the Lookahead Capabilities of Next-Token Prediction, Mathieu Blondel et.al., Paper: http://arxiv.org/abs/2512.15605
- 2025-12-24, Autonomous Uncertainty Quantification for Computational Point-of-care Sensors, Artem Goncharov et.al., Paper: http://arxiv.org/abs/2512.21335
- 2025-12-03, Autonomous Reinforcement Learning Robot Control with Intel’s Loihi 2 Neuromorphic Hardware, Kenneth Stewart et.al., Paper: http://arxiv.org/abs/2512.03911
- 2025-12-17, Autonomous Pressure Control in MuVacAS via Deep Reinforcement Learning and Deep Learning Surrogate Models, Guillermo Rodriguez-Llorente et.al., Paper: http://arxiv.org/abs/2512.15521
- 2026-01-23, Autonomous Optical Alignment of Satellite-Based Entanglement Sources using Reinforcement Learning, Andrzej Gajewski et.al., Paper: http://arxiv.org/abs/2601.16968
- 2026-03-12, Automatic Generation of High-Performance RL Environments, Seth Karten et.al., Paper: http://arxiv.org/abs/2603.12145
- 2026-01-30, Automatic Constraint Policy Optimization based on Continuous Constraint Interpolation Framework for Offline Reinforcement Learning, Xinchen Han et.al., Paper: http://arxiv.org/abs/2601.23010
- 2026-03-19, Automatic Configuration of LLM Post-Training Pipelines, Channe Chwa et.al., Paper: http://arxiv.org/abs/2603.18773
- 2025-10-30, Automated event generation for S-wave quarkonium and leptonium production in NRQCD and NRQED, Alice Colpani Serri et.al., Paper: http://arxiv.org/abs/2510.26773
- 2026-03-07, AutoResearch-RL: Perpetual Self-Evaluating Reinforcement Learning Agents for Autonomous Neural Architecture Discovery, Nilesh Jain et.al., Paper: http://arxiv.org/abs/2603.07300
- 2025-11-24, AutoEnv: Automated Environments for Measuring Cross-Environment Agent Learning, Jiayi Zhang et.al., Paper: http://arxiv.org/abs/2511.19304
- 2025-10-20, Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling, Lipeng Xie et.al., Paper: http://arxiv.org/abs/2510.17314
- 2025-11-04, Audience Amplified: Virtual Audiences in Asynchronously Performed AR Theater, You-Jin Kim et.al., Paper: http://arxiv.org/abs/2511.02807
- 2026-04-02, Auction-Based Online Policy Adaptation for Evolving Objectives, Guruprerana Shabadi et.al., Paper: http://arxiv.org/abs/2604.02151
- 2026-01-20, Attention-Based Offline Reinforcement Learning and Clustering for Interpretable Sepsis Treatment, Punit Kumar et.al., Paper: http://arxiv.org/abs/2601.14228
- 2025-11-25, Attention Trajectories as a Diagnostic Axis for Deep Reinforcement Learning, Charlotte Beylier et.al., Paper: http://arxiv.org/abs/2511.20591
- 2026-03-12, Attend Before Attention: Efficient and Scalable Video Understanding via Autoregressive Gazing, Baifeng Shi et.al., Paper: http://arxiv.org/abs/2603.12254
- 2025-12-02, Asymptotics for additive functionals of particle systems via Stein’s method, Arturo Jaramillo et.al., Paper: http://arxiv.org/abs/2512.02922
- 2026-02-19, Asymptotically Optimal Sequential Testing with Markovian Data, Alhad Sethi et.al., Paper: http://arxiv.org/abs/2602.17587
- 2026-02-11, Asymmetric Prompt Weighting for Reinforcement Learning with Verifiable Rewards, Reinhard Heckel et.al., Paper: http://arxiv.org/abs/2602.11128
- 2025-12-02, Assessing the performance of correlation-based multi-fidelity neural emulators, Cristian J. Villatoro et.al., Paper: http://arxiv.org/abs/2512.02868
- 2026-01-21, Assessing Orbital Optimization in Variational and Diffusion Monte Carlo, Cody A. Melton et.al., Paper: http://arxiv.org/abs/2601.15169
- 2025-11-06, Artificial Precision Polarization Array: Sensitivity for the axion-like dark matter with clock satellites, Hanyu Jiang et.al., Paper: http://arxiv.org/abs/2511.04400
- 2025-11-17, Artificial Intelligence-driven Intelligent Wearable Systems: A full-stack Integration from Material Design to Personalized Interaction, Jingyi Zhao et.al., Paper: http://arxiv.org/abs/2511.13565
- 2026-03-06, Artificial Intelligence for Climate Adaptation: Reinforcement Learning for Climate Change-Resilient Transport, Miguel Costa et.al., Paper: http://arxiv.org/abs/2603.06278
- 2026-03-19, Articulated-Body Dynamics Network: Dynamics-Grounded Prior for Robot Learning, Sangwoo Shin et.al., Paper: http://arxiv.org/abs/2603.19078
- 2025-12-01, Artemis: Structured Visual Reasoning for Perception Policy Learning, Wei Tang et.al., Paper: http://arxiv.org/abs/2512.01988
- 2025-12-11, Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation, Yiwen Tang et.al., Paper: http://arxiv.org/abs/2512.10949
- 2025-11-28, Arbitrary control of the temporal waveform of photons during spontaneous emission, Carl Thomas et.al., Paper: http://arxiv.org/abs/2511.23462
- 2026-04-02, Apriel-Reasoner: RL Post-Training for General-Purpose and Efficient Reasoning, Rafael Pardinas et.al., Paper: http://arxiv.org/abs/2604.02007
- 2026-02-05, Approximation of Log-Partition Function in Policy Mirror Descent Induces Implicit Regularization for LLM Post-Training, Zhenghao Xu et.al., Paper: http://arxiv.org/abs/2602.05933
- 2025-11-04, Approximation by Certain Complex Nevai Operators : Theory and Applications, Priyanka Majethiya et.al., Paper: http://arxiv.org/abs/2511.02750
- 2026-03-24, Approximating the Shapley Value of Minimum Cost Spanning Tree Games: An FPRAS for Saving Games, Takumi Jimbo et.al., Paper: http://arxiv.org/abs/2603.22843
- 2025-10-27, Approximately optimal distributed controls for high-dimensional stochastic systems with pairwise interaction through controls, Elise Devey et.al., Paper: http://arxiv.org/abs/2510.23537
- 2026-03-22, Approximate Dynamic Programming for Degradation-aware Market Participation of Battery Energy Storage Systems: Bridging Market and Degradation Timescales, Flemming Holtorf et.al., Paper: http://arxiv.org/abs/2603.21089
- 2026-03-26, Approximate Bayesian Inference for Structural Equation Models using Integrated Nested Laplace Approximations, Haziq Jamil et.al., Paper: http://arxiv.org/abs/2603.25690
- 2025-11-26, Approximate Bayesian Computation Made Easy: A Practical Guide to ABC-SMC for Dynamical Systems with \texttt{pymc}, Mario Castro et.al., Paper: http://arxiv.org/abs/2511.21587
- 2025-11-06, Approaching the thermodynamic limit of a bounded one-component plasma, D. I. Zhukhovitskii et.al., Paper: http://arxiv.org/abs/2511.04516
- 2025-12-19, AnyTask: an Automated Task and Data Generation Framework for Advancing Sim-to-Real Policy Learning, Ran Gong et.al., Paper: http://arxiv.org/abs/2512.17853
- 2026-02-12, Any House Any Task: Scalable Long-Horizon Planning for Abstract Human Tasks, Zhihong Liu et.al., Paper: http://arxiv.org/abs/2602.12244
- 2026-03-17, Anticipatory Planning for Multimodal AI Agents, Yongyuan Liang et.al., Paper: http://arxiv.org/abs/2603.16777
- 2026-01-07, Anti-Length Shift: Dynamic Outlier Truncation for Training Efficient Reasoning Models, Wei Wu et.al., Paper: http://arxiv.org/abs/2601.03969
- 2026-02-10, Answer First, Reason Later: Aligning Search Relevance via Mode-Balanced Reinforcement Learning, Shijie Zhang et.al., Paper: http://arxiv.org/abs/2602.10006
- 2025-11-18, Anomalous spontaneous induction of magnetic and electric fields in dense quark matter, E. J. Ferrer et.al., Paper: http://arxiv.org/abs/2511.14602
- 2026-02-09, AnomSeer: Reinforcing Multimodal LLMs to Reason for Time-Series Anomaly Detection, Junru Zhang et.al., Paper: http://arxiv.org/abs/2602.08868
- 2025-12-12, AnchorDream: Repurposing Video Diffusion for Embodiment-Aware Robot Data Synthesis, Junjie Ye et.al., Paper: http://arxiv.org/abs/2512.11797
- 2026-03-20, Analyzing Decoders for Quantum Error Correction, Abtin Molavi et.al., Paper: http://arxiv.org/abs/2603.20127
- 2026-04-01, Analytical Probabilistic Power Flow Approximation Using Invertible Neural Networks, Weijie Xia et.al., Paper: http://arxiv.org/abs/2604.00673
- 2026-03-10, Analytic treatment of a polaron in a nonparabolic conduction band, S. N. Klimin et.al., Paper: http://arxiv.org/abs/2603.09609
- 2025-12-29, Analysis of kinetic-diffusion Monte Carlo simulation and source term estimation scheme in nuclear fusion applications, Zhirui Tang et.al., Paper: http://arxiv.org/abs/2512.23580
- 2025-10-21, Analysis note: measurement of thrust and track energy-energy correlator in $e^+e^-$ collisions at 91.2 GeV with DELPHI open data, Jingyu Zhang et.al., Paper: http://arxiv.org/abs/2510.18762
- 2026-02-10, Anagent For Enhancing Scientific Table & Figure Analysis, Xuehang Guo et.al., Paper: http://arxiv.org/abs/2602.10081
- 2025-11-05, AnaFlow: Agentic LLM-based Workflow for Reasoning-Driven Explainable and Sample-Efficient Analog Circuit Sizing, Mohsen Ahmadzadeh et.al., Paper: http://arxiv.org/abs/2511.03697
- 2025-10-21, An integrated neural wavefunction solver for spinful Fermi systems, Alexander Avdoshkin et.al., Paper: http://arxiv.org/abs/2510.18621
- 2026-02-19, An extension to reversible jump Markov chain Monte Carlo for change point problems with heterogeneous temporal dynamics, Emily Gribbin et.al., Paper: http://arxiv.org/abs/2602.17503
- 2026-01-07, An SU(2n)-valued nonlinear Fourier transform, Michel Alexis et.al., Paper: http://arxiv.org/abs/2601.03987
- 2026-03-31, An Output Feedback Q-learning Algorithm for Optimal Control of Nonlinear Systems with Koopman Linear Embedding, Victor G. Lopez et.al., Paper: http://arxiv.org/abs/2603.29858
- 2020-11-25, An Optimistic Perspective on Offline Reinforcement Learning, Rishabh Agarwal et.al., Paper: http://arxiv.org/abs/1907.04543
- 2026-03-10, An Optimal Control Approach To Transformer Training, Kağan Akman et.al., Paper: http://arxiv.org/abs/2603.09571
- 2025-10-27, An Information-Theoretic Analysis of Out-of-Distribution Generalization in Meta-Learning with Applications to Meta-RL, Xingtu Liu et.al., Paper: http://arxiv.org/abs/2510.23448
- 2025-10-20, An Exact Quantile-Energy Equality for Terminal Halfspaces in Linear-Gaussian Control with a Discrete-Time Companion, KL/Schrodinger Links, and High-Precision Validation, Sandro Andric et.al., Paper: http://arxiv.org/abs/2510.17945
- 2025-11-07, An End-to-End Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drones, Taihelong Zeng et.al., Paper: http://arxiv.org/abs/2511.05265
- 2025-10-20, An Empirical Study of Lagrangian Methods in Safe Reinforcement Learning, Lindsay Spoor et.al., Paper: http://arxiv.org/abs/2510.17564
- 2025-12-08, An Adaptive Multi-Layered Honeynet Architecture for Threat Behavior Analysis via Deep Learning, Lukas Johannes Möller et.al., Paper: http://arxiv.org/abs/2512.07827
- 2026-02-27, An $ε$ -Optimal Sequential Approach for Solving zs-POSGs, Jilles S. Dibangoye et.al., Paper: http://arxiv.org/abs/2602.24092
- 2026-01-14, An $O(\log N)$ Monte Carlo method for periodic Coulomb systems, Xuanzhao Gao et.al., Paper: http://arxiv.org/abs/2601.09288
- 2025-12-17, Amplitude-amplified coherence detection and estimation, Rhea Alexander et.al., Paper: http://arxiv.org/abs/2512.15352
- 2026-03-16, Amplification Effects in Test-Time Reinforcement Learning: Safety and Reasoning Vulnerabilities, Vanshaj Khattar et.al., Paper: http://arxiv.org/abs/2603.15417
- 2026-03-14, Amortizing Trajectory Diffusion with Keyed Drift Fields, Gokul Puthumanaillam et.al., Paper: http://arxiv.org/abs/2603.14056
- 2025-11-28, Ambiguity Awareness Optimization: Towards Semantic Disambiguation for Direct Preference Optimization, Jian Li et.al., Paper: http://arxiv.org/abs/2511.23391
- 2026-01-21, Alternative Shapes of Modulation Schemes Detailed Exposition and Simulation Methodology, Nipun Agarwal et.al., Paper: http://arxiv.org/abs/2601.15004
- 2025-12-29, Alpha-R1: Alpha Screening with LLM Reasoning via Reinforcement Learning, Zuoyou Jiang et.al., Paper: http://arxiv.org/abs/2512.23515
- 2026-01-29, Alpha Discovery via Grammar-Guided Learning and Search, Han Yang et.al., Paper: http://arxiv.org/abs/2601.22119
- 2026-02-18, Almost Sure Convergence of Differential Temporal Difference Learning for Average Reward Markov Decision Processes, Ethan Blaser et.al., Paper: http://arxiv.org/abs/2602.16629
- 2025-11-14, Aligning Machiavellian Agents: Behavior Steering via Test-Time Policy Shaping, Dena Mujtaba et.al., Paper: http://arxiv.org/abs/2511.11551
- 2025-11-26, Aligning LLMs Toward Multi-Turn Conversational Outcomes Using Iterative PPO, Daniel R. Jiang et.al., Paper: http://arxiv.org/abs/2511.21638
- 2026-01-27, AlignCoder: Aligning Retrieval with Target Intent for Repository-Level Code Completion, Tianyue Jiang et.al., Paper: http://arxiv.org/abs/2601.19697
- 2025-11-13, Algorithm Design and Stronger Guarantees for the Improving Multi-Armed Bandits Problem, Avrim Blum et.al., Paper: http://arxiv.org/abs/2511.10619
- 2026-01-15, Algebraic Farkas Lemma and Strong Duality for Perturbed Conic Linear Programming, P. D. Khanh et.al., Paper: http://arxiv.org/abs/2601.10390
- 2026-01-08, AlgBench: To What Extent Do Large Reasoning Models Understand Algorithms?, Henan Sun et.al., Paper: http://arxiv.org/abs/2601.04996
- 2025-12-11, AgriGPT-Omni: A Unified Speech-Vision-Text Framework for Multilingual Agricultural Intelligence, Bo Yang et.al., Paper: http://arxiv.org/abs/2512.10624
- 2025-11-21, Agility Meets Stability: Versatile Humanoid Control with Heterogeneous Data, Yixuan Pan et.al., Paper: http://arxiv.org/abs/2511.17373
- 2026-01-30, Agile Reinforcement Learning through Separable Neural Architecture, Rajib Mostakim et.al., Paper: http://arxiv.org/abs/2601.23225
- 2026-03-17, Agile Interception of a Flying Target using Competitive Reinforcement Learning, Timothée Gavin et.al., Paper: http://arxiv.org/abs/2603.16279
- 2025-12-12, Agile Flight Emerges from Multi-Agent Competitive Racing, Vineet Pasumarti et.al., Paper: http://arxiv.org/abs/2512.11781
- 2026-01-07, Agentic Rubrics as Contextual Verifiers for SWE Agents, Mohit Raghavendra et.al., Paper: http://arxiv.org/abs/2601.04171
- 2025-10-20, Agentic Reinforcement Learning for Search is Unsafe, Yushi Yang et.al., Paper: http://arxiv.org/abs/2510.17431
- 2025-12-01, Agentic Policy Optimization via Instruction-Policy Co-Evolution, Han Zhou et.al., Paper: http://arxiv.org/abs/2512.01945
- 2026-03-07, Agentic Planning with Reasoning for Image Styling via Offline RL, Subhojyoti Mukherjee et.al., Paper: http://arxiv.org/abs/2603.07148
- 2026-03-09, Agentic Critical Training, Weize Liu et.al., Paper: http://arxiv.org/abs/2603.08706
- 2025-12-29, Agentic AI for Autonomous Defense in Software Supply Chain Security: Beyond Provenance to Vulnerability Mitigation, Toqeer Ali Syed et.al., Paper: http://arxiv.org/abs/2512.23480
- 2026-01-05, AgentVNE: LLM-Augmented Graph Reinforcement Learning for Affinity-Aware Multi-Agent Placement in Edge Agentic AI, Runze Zheng et.al., Paper: http://arxiv.org/abs/2601.02021
- 2025-12-15, AgentIAD: Tool-Augmented Single-Agent for Industrial Anomaly Detection, Junwen Miao et.al., Paper: http://arxiv.org/abs/2512.13671
- 2025-11-18, Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning, Mingyue Cheng et.al., Paper: http://arxiv.org/abs/2511.14460
- 2026-03-28, Agent-Driven Autonomous Reinforcement Learning Research: Iterative Policy Improvement for Quadruped Locomotion, Nimesh Khandelwal et.al., Paper: http://arxiv.org/abs/2603.27416
- 2026-02-10, Agent World Model: Infinity Synthetic Environments for Agentic Reinforcement Learning, Zhaoyang Wang et.al., Paper: http://arxiv.org/abs/2602.10090
- 2026-02-26, Agency and Architectural Limits: Why Optimization-Based Systems Cannot Be Norm-Responsive, Radha Sarma et.al., Paper: http://arxiv.org/abs/2602.23239
- 2026-03-07, Adversarial Latent-State Training for Robust Policies in Partially Observable Domains, Angad Singh Ahuja et.al., Paper: http://arxiv.org/abs/2603.07313
- 2026-04-01, Adversarial Attacks in AI-Driven RAN Slicing: SLA Violations and Recovery, Deemah H. Tashman et.al., Paper: http://arxiv.org/abs/2604.01049
- 2025-10-28, Advancing site-specific disease and pest management in precision agriculture: From reasoning-driven foundation models to adaptive, feedback-based learning, Nitin Rai et.al., Paper: http://arxiv.org/abs/2510.24650
- 2025-12-15, Advancing Machine Learning Optimization of Chiral Photonic Metasurface: Comparative Study of Neural Network and Genetic Algorithm Approaches, Davide Filippozzi et.al., Paper: http://arxiv.org/abs/2512.13656
- 2026-02-27, Advanced Scheduling Strategies for Distributed Quantum Computing Jobs, Gongyu Ni et.al., Paper: http://arxiv.org/abs/2602.24152
- 2026-03-23, Adsorption energies and decomposition barrier heights for ethylene carbonate on the surface of lithium from cluster-based quantum chemistry, Ethan A. Vo et.al., Paper: http://arxiv.org/abs/2603.22139
- 2026-03-21, Adjoint DSMC Method for Spatially Inhomogeneous Boltzmann Equation with General Boundary Conditions, Russel Caflisch et.al., Paper: http://arxiv.org/abs/2603.20946
- 2026-01-07, Adaptive-Boundary-Clipping GRPO: Ensuring Bounded Ratios for Stable and Generalizable Training, Chi Liu et.al., Paper: http://arxiv.org/abs/2601.03895
- 2025-12-09, Adaptive cut reveals multiscale complexity in networks, Louis Boucherie et.al., Paper: http://arxiv.org/abs/2512.08741
- 2026-02-23, Adaptive Underwater Acoustic Communications with Limited Feedback: An AoI-Aware Hierarchical Bandit Approach, Fabio Busacca et.al., Paper: http://arxiv.org/abs/2602.20105
- 2025-10-28, Adaptive Surrogate Gradients for Sequential Reinforcement Learning in Spiking Neural Networks, Korneel Van den Berghe et.al., Paper: http://arxiv.org/abs/2510.24461
- 2025-12-31, Adaptive Resource Orchestration for Distributed Quantum Computing Systems, Kuan-Cheng Chen et.al., Paper: http://arxiv.org/abs/2512.24902
- 2026-01-23, Adaptive Reinforcement and Model Predictive Control Switching for Safe Human-Robot Cooperative Navigation, Ning Liu et.al., Paper: http://arxiv.org/abs/2601.16686
- 2026-03-19, Adaptive Regime-Aware Stock Price Prediction Using Autoencoder-Gated Dual Node Transformers with Reinforcement Learning Control, Mohammad Al Ridhawi et.al., Paper: http://arxiv.org/abs/2603.19136
- 2026-03-11, Adaptive RAN Slicing Control via Reward-Free Self-Finetuning Agents, Yuanhao Li et.al., Paper: http://arxiv.org/abs/2603.10564
- 2025-10-27, Adaptive Multilevel Splitting: First Application to Rare-Event Derivative Pricing, Riccardo Gozzo et.al., Paper: http://arxiv.org/abs/2510.23461
- 2025-11-04, Adaptive GR(1) Specification Repair for Liveness-Preserving Shielding in Reinforcement Learning, Tiberiu-Andrei Georgescu et.al., Paper: http://arxiv.org/abs/2511.02605
- 2025-11-07, Adaptive Entanglement-Aware Routing for Satellite Quantum Networks under Orbital and Atmospheric Variability, Dhrumil Bhatt et.al., Paper: http://arxiv.org/abs/2511.05228
- 2026-03-07, Adaptive Double-Booking Strategy for Outpatient Scheduling Using Multi-Objective Reinforcement Learning, Ninda Nurseha Amalina et.al., Paper: http://arxiv.org/abs/2603.07270
- 2025-10-20, Adaptive Divergence Regularized Policy Optimization for Fine-tuning Generative Models, Jiajun Fan et.al., Paper: http://arxiv.org/abs/2510.18053
- 2026-02-27, Adaptive Correlation-Weighted Intrinsic Rewards for Reinforcement Learning, Viet Bac Nguyen et.al., Paper: http://arxiv.org/abs/2602.24081
- 2026-01-06, Adaptive Control of Unknown Linear Switched Systems via Policy Gradient Methods, Felix Laurent et.al., Paper: http://arxiv.org/abs/2601.03016
- 2025-12-31, Adaptive Clutter Suppression via Convex Optimization, Yifan He et.al., Paper: http://arxiv.org/abs/2512.24889
- 2026-01-28, Adapting the Behavior of Reinforcement Learning Agents to Changing Action Spaces and Reward Functions, Raul de la Rosa et.al., Paper: http://arxiv.org/abs/2601.20714
- 2026-02-19, Adapting Actively on the Fly: Relevance-Guided Online Meta-Learning with Latent Concepts for Geospatial Discovery, Jowaria Khan et.al., Paper: http://arxiv.org/abs/2602.17605
- 2025-11-05, Adaptable Hindsight Experience Replay for Search-Based Learning, Alexandros Vazaios et.al., Paper: http://arxiv.org/abs/2511.03405
- 2025-12-18, AdaTooler-V: Adaptive Tool-Use for Images and Videos, Chaoyang Wang et.al., Paper: http://arxiv.org/abs/2512.16918
- 2025-12-18, AdaSearch: Balancing Parametric Knowledge and Search in Large Language Models via Reinforcement Learning, Tzu-Han Lin et.al., Paper: http://arxiv.org/abs/2512.16883
- 2026-01-26, AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning, Mingyang Song et.al., Paper: http://arxiv.org/abs/2601.18631
- 2025-10-23, AdaDoS: Adaptive DoS Attack via Deep Adversarial Reinforcement Learning in SDN, Wei Shao et.al., Paper: http://arxiv.org/abs/2510.20566
- 2026-03-11, AdaClearGrasp: Learning Adaptive Clearing for Zero-Shot Robust Dexterous Grasping in Densely Cluttered Environments, Zixuan Chen et.al., Paper: http://arxiv.org/abs/2603.10616
- 2025-10-21, Actor-Free Continuous Control via Structurally Maximizable Q-Functions, Yigit Korkmaz et.al., Paper: http://arxiv.org/abs/2510.18828
- 2026-03-10, ActiveUltraFeedback: Efficient Preference Data Generation using Active Learning, Davit Melikidze et.al., Paper: http://arxiv.org/abs/2603.09692
- 2025-11-25, Active learning with physics-informed neural networks for optimal sensor placement in deep tunneling through transversely isotropic elastic rocks, Alec Tristani et.al., Paper: http://arxiv.org/abs/2511.20574
- 2026-01-15, Active interrogation of underground piezoelectric fabrics using high energy muon beams propagating across seismogenic faults, L. Serafini et.al., Paper: http://arxiv.org/abs/2601.10430
- 2026-02-16, Activation-Space Uncertainty Quantification for Pretrained Networks, Richard Bergna et.al., Paper: http://arxiv.org/abs/2602.14934
- 2025-10-30, Action-Driven Processes for Continuous-Time Control, Ruimin He et.al., Paper: http://arxiv.org/abs/2510.26672
- 2026-01-05, Accurate Helium-Benzene Potential: from CCSD(T) to Gaussian Process Regression, Shahzad Akram et.al., Paper: http://arxiv.org/abs/2601.02166
- 2019-11-14, Accelerating Training in Pommerman with Imitation and Reinforcement Learning, Hardik Meisheri et.al., Paper: http://arxiv.org/abs/1911.04947
- 2026-03-02, Accelerating PDE Surrogates via RL-Guided Mesh Optimization, Yang Meng et.al., Paper: http://arxiv.org/abs/2603.02066
- 2025-10-20, Accelerating Bayesian Inference via Multi-Fidelity Transport Map Coupling, Sanjan C. Muchandimath et.al., Paper: http://arxiv.org/abs/2510.17946
- 2026-02-26, Accelerated Online Risk-Averse Policy Evaluation in POMDPs with Theoretical Guarantees and Novel CVaR Bounds, Yaacov Pariente et.al., Paper: http://arxiv.org/abs/2602.23073
- 2025-11-21, Abstract fractional linear transformations, David Handelman et.al., Paper: http://arxiv.org/abs/2511.17383
- 2025-12-19, About Time: Model-free Reinforcement Learning with Timed Reward Machines, Anirban Majumdar et.al., Paper: http://arxiv.org/abs/2512.17637
- 2026-01-21, Ab initio path-integral Monte Carlo results for the one-particle spectral function of the warm dense electron gas, Paul Hamann et.al., Paper: http://arxiv.org/abs/2601.14992
- 2026-02-18, Ab Initio Auxiliary-Field Quantum Monte Carlo in the Thermodynamic Limit, Jinghong Zhang et.al., Paper: http://arxiv.org/abs/2602.16679
- 2026-01-13, AUV Trajectory Learning for Underwater Acoustic Energy Transfer and Age Minimization, Mohamed Afouene Melki et.al., Paper: http://arxiv.org/abs/2601.08491
- 2026-02-10, ATTNPO: Attention-Guided Process Supervision for Efficient Reasoning, Shuaiyi Nie et.al., Paper: http://arxiv.org/abs/2602.09953
- 2026-04-02, ATLAS and CMS measurements of the $t\bar{t}$ cross section, including off-shell and near threshold, Baptiste Ravina et.al., Paper: http://arxiv.org/abs/2604.01984
- 2026-03-14, ATCC: Adaptive Concurrency Control for Unforeseen Agentic Transactions, Weixing Zhou et.al., Paper: http://arxiv.org/abs/2603.13906
- 2025-11-28, ASTRO: Adaptive Stitching via Dynamics-Guided Trajectory Rollouts, Hang Yu et.al., Paper: http://arxiv.org/abs/2511.23442
- 2026-03-11, ASTER: Attitude-aware Suspended-payload Quadrotor Traversal via Efficient Reinforcement Learning, Dongcheng Cao et.al., Paper: http://arxiv.org/abs/2603.10715
- 2026-03-31, ASI-Evolve: AI Accelerates AI, Weixian Xu et.al., Paper: http://arxiv.org/abs/2603.29640
- 2026-01-26, ART for Diffusion Sampling: A Reinforcement Learning Approach to Timestep Schedule, Yilie Huang et.al., Paper: http://arxiv.org/abs/2601.18681
- 2025-12-04, ARM-Thinker: Reinforcing Multimodal Generative Reward Models with Agentic Tool Use and Visual Reasoning, Shengyuan Ding et.al., Paper: http://arxiv.org/abs/2512.05111
- 2026-03-13, ARL-Tangram: Unleash the Resource Efficiency in Agentic Reinforcement Learning, Bangjun Xiao et.al., Paper: http://arxiv.org/abs/2603.13019
- 2026-01-02, ARISE: Adaptive Reinforcement Integrated with Swarm Exploration, Rajiv Chaitanya M et.al., Paper: http://arxiv.org/abs/2601.00693
- 2026-03-18, AR-CoPO: Align Autoregressive Video Generation with Contrastive Policy Optimization, Dailan He et.al., Paper: http://arxiv.org/abs/2603.17461
- 2026-02-11, APEX: Learning Adaptive High-Platform Traversal for Humanoid Robots, Yikai Wang et.al., Paper: http://arxiv.org/abs/2602.11143
- 2026-03-14, APEX-Searcher: Augmenting LLMs’ Search Capabilities through Agentic Planning and Execution, Kun Chen et.al., Paper: http://arxiv.org/abs/2603.13853
- 2025-10-20, ALPINE: A Lightweight and Adaptive Privacy-Decision Agent Framework for Dynamic Edge Crowdsensing, Guanjie Cheng et.al., Paper: http://arxiv.org/abs/2510.17162
- 2025-10-29, ALDEN: Reinforcement Learning for Active Navigation and Evidence Gathering in Long Documents, Tianyu Yang et.al., Paper: http://arxiv.org/abs/2510.25668
- 2026-03-12, AGMARL-DKS: An Adaptive Graph-Enhanced Multi-Agent Reinforcement Learning for Dynamic Kubernetes Scheduling, Hamed Hamzeh et.al., Paper: http://arxiv.org/abs/2603.12031
- 2026-03-20, AGILE: A Comprehensive Workflow for Humanoid Loco-Manipulation Learning, Huihua Zhao et.al., Paper: http://arxiv.org/abs/2603.20147
- 2026-02-06, AEGPO: Adaptive Entropy-Guided Policy Optimization for Diffusion Models, Yuming Li et.al., Paper: http://arxiv.org/abs/2602.06825
- 2026-02-10, ADORA: Training Reasoning Models with Dynamic Advantage Estimation on Reinforcement Learning, Qingnan Ren et.al., Paper: http://arxiv.org/abs/2602.10019
- 2026-01-05, ACDZero: Graph-Embedding-Based Tree Search for Mastering Automated Cyber Defense, Yu Li et.al., Paper: http://arxiv.org/abs/2601.02196
- 2026-03-02, ACDC: Adaptive Curriculum Planning with Dynamic Contrastive Control for Goal-Conditioned Reinforcement Learning in Robotic Manipulation, Xuerui Wang et.al., Paper: http://arxiv.org/abs/2603.02104
- 2026-02-13, A unified testing approach for log-symmetry using Fourier methods, Ganesh Vishnu Avhad et.al., Paper: http://arxiv.org/abs/2602.12900
- 2026-03-26, A unified quantum computing quantum Monte Carlo framework through structured state preparation, Giuseppe Buonaiuto et.al., Paper: http://arxiv.org/abs/2603.25582
- 2025-12-18, A survey of the orienteering problem: model evolution, algorithmic advances, and future directions, Songhao Shen et.al., Paper: http://arxiv.org/abs/2512.16865
- 2025-11-19, A sharp threshold for arithmetic effects on the tail probabilities of lacunary sums, Christoph Aistleitner et.al., Paper: http://arxiv.org/abs/2511.15595
- 2026-01-28, A scalable flow-based approach to mitigate topological freezing, Claudio Bonanno et.al., Paper: http://arxiv.org/abs/2601.20708
- 2026-03-21, A reliability-aware randomized simheuristic for the team orienteering problem with stochastic travel times, Michele Circelli et.al., Paper: http://arxiv.org/abs/2603.20766
- 2025-12-05, A poset representation for stable contracts in a two-sided market generated by integer choice functions, Alexander V. Karzanov et.al., Paper: http://arxiv.org/abs/2512.05942
- 2026-03-14, A note on the invariants of the $L$ -functions, Jerzy Kaczorowski et.al., Paper: http://arxiv.org/abs/2603.13889
- 2026-04-02, A new wavelet-based variational family with copula dependence structures, Giovanni Piccirilli et.al., Paper: http://arxiv.org/abs/2604.02116
- 2025-12-04, A meta-GGA perspective on the altermagnetism of RuO2, Markus Meinert et.al., Paper: http://arxiv.org/abs/2512.04995
- 2026-01-22, A forward-only scheme for online learning of proposal distributions in particle filters, Sylvain Procope-Mamert et.al., Paper: http://arxiv.org/abs/2601.16089
- 2026-03-16, A flexible method for estimating luminosity functions via Kernel Density Estimation - III. Extending to Multiple Flux-Limited Samples, Zunli Yuan et.al., Paper: http://arxiv.org/abs/2603.15441
- 2026-03-25, A first study of strong isospin breaking effects in lattice QCD using truncated polynomials, David Albandea et.al., Paper: http://arxiv.org/abs/2603.24420
- 2026-02-06, A first realization of reinforcement learning-based closed-loop EEG-TMS, Dania Humaidan et.al., Paper: http://arxiv.org/abs/2602.06907
- 2025-12-16, A data-physics hybrid generative model for patient-specific post-stroke motor rehabilitation using wearable sensor data, Yanning Dai et.al., Paper: http://arxiv.org/abs/2512.14329
- 2026-01-05, A comprehensive search for high-velocity X-ray sources: New compact object binary candidates in the Gaia era, Yue Zhao et.al., Paper: http://arxiv.org/abs/2601.02287
- 2025-11-26, A complete solution of the Erdős-Kleitman matching problem for $n\le 3s$, Andrey Kupavskii et.al., Paper: http://arxiv.org/abs/2511.21628
- 2026-01-15, A comparison of simulation tools for Muon-Induced X-ray Emission (MIXE) in thin films: a study case with lithium batteries, Maxime Lamotte et.al., Paper: http://arxiv.org/abs/2601.10401
- 2026-04-01, A comparison of Markov Chain Monte Carlo algorithms for Bayesian inference of constitutive models, Aricia Rinkens et.al., Paper: http://arxiv.org/abs/2604.01121
- 2026-01-21, A combined dose and microdosimetric modeling framework incorporating volume effects correlates with tissue sparing in proton minibeam radiotherapy, Giulio Bordieri et.al., Paper: http://arxiv.org/abs/2601.15073
- 2025-12-02, A benchmark dataset for evaluating Syndrome Differentiation and Treatment in large language models, Kunning Li et.al., Paper: http://arxiv.org/abs/2512.02816
- 2026-03-13, A basic model for high energy cosmic ray interactions, Sergey Ostapchenko et.al., Paper: http://arxiv.org/abs/2603.12863
- 2021-09-24, A Workflow for Offline Model-Free Robotic Reinforcement Learning, Aviral Kumar et.al., Paper: http://arxiv.org/abs/2109.10813
- 2025-10-24, A Unified Model for Multi-Task Drone Routing in Post-Disaster Road Assessment, Huatian Gong et.al., Paper: http://arxiv.org/abs/2510.21525
- 2025-10-23, A Unified Framework for Zero-Shot Reinforcement Learning, Jacopo Di Ventura et.al., Paper: http://arxiv.org/abs/2510.20542
- 2026-03-24, A Top-Down Scale Approach for Multiscale Geographically and Temporally Weighted Regression, Ghislain Geniaux et.al., Paper: http://arxiv.org/abs/2603.22990
- 2025-12-16, A Threshold-Triggered Deep Q-Network-Based Framework for Self-Healing in Autonomic Software-Defined IIoT-Edge Networks, Agrippina Mwangi et.al., Paper: http://arxiv.org/abs/2512.14297
- 2026-02-10, A Task-Centric Theory for Iterative Self-Improvement with Easy-to-Hard Curricula, Chenruo Liu et.al., Paper: http://arxiv.org/abs/2602.10014
- 2026-04-01, A Survey of On-Policy Distillation for Large Language Models, Mingyang Song et.al., Paper: http://arxiv.org/abs/2604.00626
- 2025-12-03, A Strict Comparison Principle for Integro-Differential Hamilton-Jacobi-Bellman Equations on Domains with Boundary, Serena Della Corte et.al., Paper: http://arxiv.org/abs/2512.04005
- 2025-10-27, A Sequential Planning Framework for the Operational Reality of Interacting Air Traffic Flow Regulations and Traffic Flow Programs, Thinh Hoang et.al., Paper: http://arxiv.org/abs/2510.23402
- 2026-02-23, A Secure and Private Distributed Bayesian Federated Learning Design, Nuocheng Yang et.al., Paper: http://arxiv.org/abs/2602.20003
- 2025-12-15, A Scientific Reasoning Model for Organic Synthesis Procedure Generation, Guoqing Liu et.al., Paper: http://arxiv.org/abs/2512.13668
- 2026-02-18, A Scalable Approach to Solving Simulation-Based Network Security Games, Michael Lanier et.al., Paper: http://arxiv.org/abs/2602.16564
- 2026-02-18, A Rough Functional Breuer-Major Theorem, Henri Elad Altman et.al., Paper: http://arxiv.org/abs/2602.16615
- 2026-03-12, A Robust and Efficient Multi-Agent Reinforcement Learning Framework for Traffic Signal Control, Sheng-You Huang et.al., Paper: http://arxiv.org/abs/2603.12096
- 2025-12-05, A Residual Variance Matching Recursive Least Squares Filter for Real-time UAV Terrain Following, Xiaobo Wu et.al., Paper: http://arxiv.org/abs/2512.05918
- 2026-03-26, A Representation Optimization Dichotomy, Lie-Algebraic Policy Optimization, Sooraj KC et.al., Paper: http://arxiv.org/abs/2603.25525
- 2026-03-06, A Reference Architecture of Reinforcement Learning Frameworks, Xiaoran Liu et.al., Paper: http://arxiv.org/abs/2603.06413
- 2025-11-14, A Recursive Theory of Variational State Estimation: The Dynamic Programming Approach, Filip Tronarp et.al., Paper: http://arxiv.org/abs/2511.11497
- 2025-11-25, A Reason-then-Describe Instruction Interpreter for Controllable Video Generation, Shengqiong Wu et.al., Paper: http://arxiv.org/abs/2511.20563
- 2026-03-18, A Progressive Visual-Logic-Aligned Framework for Ride-Hailing Adjudication, Weiming Wu et.al., Paper: http://arxiv.org/abs/2603.17328
- 2026-02-20, A Probabilistic Framework for LLM-Based Model Discovery, Stefan Wahl et.al., Paper: http://arxiv.org/abs/2602.18266
- 2025-10-20, A Principle of Targeted Intervention for Multi-Agent Reinforcement Learning, Anjie Liu et.al., Paper: http://arxiv.org/abs/2510.17697
- 2025-12-31, A Pontryagin Maximum Principle on the Belief Space for Continuous-Time Optimal Control with Discrete Observations, Christian Bayer et.al., Paper: http://arxiv.org/abs/2512.24916
- 2026-03-11, A Physics-Informed, Global-in-Time Neural Particle Method for the Spatially Homogeneous Landau Equation, Minseok Kim et.al., Paper: http://arxiv.org/abs/2603.10874
- 2025-11-19, A Physics Informed Machine Learning Framework for Optimal Sensor Placement and Parameter Estimation, Georgios Venianakis et.al., Paper: http://arxiv.org/abs/2511.15543
- 2026-04-01, A Physical Imitation Learning Pipeline for Energy-Efficient Quadruped Locomotion Assisted by Parallel Elastic Joint, Huyue Ma et.al., Paper: http://arxiv.org/abs/2604.00611
- 2026-02-26, A Perspective on Open Challenges in Deformable Object Manipulation, Ryan Paul McKennaa et.al., Paper: http://arxiv.org/abs/2602.22998
- 2026-02-05, A POWHEG generator for di-jet production in polarized proton-proton collisions, Ignacio Borsa et.al., Paper: http://arxiv.org/abs/2602.05949
- 2026-01-13, A New Duality-Free Framework for Convex Optimisation with Superlinear Convergence and Effective Warm-Starting, Michael Cummins et.al., Paper: http://arxiv.org/abs/2601.08494
- 2025-12-29, A NEAT Approach to Evolving Neural-Network-based Optimization of Chiral Photonic Metasurfaces: Application of a Neuro-Evolution Pipeline, Davide Filippozzi et.al., Paper: http://arxiv.org/abs/2512.23558
- 2025-12-10, A Monotone–Operator Proof of Existence and Uniqueness for a Simple Stationary Mean Field Game, Hikmatullo Ismatov et.al., Paper: http://arxiv.org/abs/2512.09671
- 2025-12-12, A Molecular Gas Dynamics Study of Hypersonic Boundary Layer Second Mack Mode Instabilities, Mert Senkardesler et.al., Paper: http://arxiv.org/abs/2512.11390
- 2026-02-26, A Model-Free Universal AI, Yegon Kim et.al., Paper: http://arxiv.org/abs/2602.23242
- 2025-10-23, A Microphysical Probe of Neutron Star Interiors: Constraining the Equation of State with Glitch Dynamics, Zhonghao Tu et.al., Paper: http://arxiv.org/abs/2510.20791
- 2026-02-03, A Method for Thermal Radiation Transport Using Backward Characteristic Tracing, J. C. Dolence et.al., Paper: http://arxiv.org/abs/2602.03621
- 2026-03-25, A Longitudinal Analysis of the CEC Single-Objective Competitions (2010-2024) and Implications for Variational Quantum Optimization, Vojtěch Novák et.al., Paper: http://arxiv.org/abs/2603.24140
- 2026-03-24, A Learning Method with Gap-Aware Generation for Heterogeneous DAG Scheduling, Ruisong Zhou et.al., Paper: http://arxiv.org/abs/2603.23249
- 2026-01-27, A Latent Space Framework for Modeling Transient Engine Emissions Using Joint Embedding Predictive Architectures, Ganesh Sundaram et.al., Paper: http://arxiv.org/abs/2601.19822
- 2026-03-24, A Joint Reinforcement Learning Scheduling and Compression Framework for Teleoperated Driving, Giacomo Avanzi et.al., Paper: http://arxiv.org/abs/2603.23387
- 2025-12-05, A High-Order Immersed Boundary Method for Fluid-Structure Interaction Problems, Yingjie Xia et.al., Paper: http://arxiv.org/abs/2512.05733
- 2025-11-28, A Heuristic for Matrix Product State Simulation of Out-of-Equilibrium Dynamics of Two-Dimensional Transverse-Field Ising Models, Salvatore Mandrà et.al., Paper: http://arxiv.org/abs/2511.23438
- 2025-11-19, A Green’s function approach to linearized Monge-Ampère equations in divergence form and application to singular Abreu type equations, Chong Gu et.al., Paper: http://arxiv.org/abs/2511.15621
- 2026-01-29, A Gradient-Based Capacity Accreditation Framework in Resource Adequacy: Formulation, Computation, and Practical Implications, Qian Zhang et.al., Paper: http://arxiv.org/abs/2601.22087
- 2025-11-26, A Generalized Control Function Approach to Production Function Estimation, Ulrich Doraszelski et.al., Paper: http://arxiv.org/abs/2511.21578
- 2025-10-30, A General Incentives-Based Framework for Fairness in Multi-agent Resource Allocation, Ashwin Kumar et.al., Paper: http://arxiv.org/abs/2510.26740
- 2025-12-22, A Gauss-Newton-Induced Structure-Exploiting Algorithm for Differentiable Optimal Control, Yuankun Chen et.al., Paper: http://arxiv.org/abs/2512.19447
- 2026-03-18, A Full-Density Approach to Simulating Random Iteration Equations with Applications, Wolfgang Hoegele et.al., Paper: http://arxiv.org/abs/2603.17466
- 2026-01-09, A Framework for Optimizing Human-Machine Interaction in Classification Systems, Goran Muric et.al., Paper: http://arxiv.org/abs/2601.05974
- 2025-12-16, A First-Order Logic-Based Alternative to Reward Models in RLHF, Chunjin Jian et.al., Paper: http://arxiv.org/abs/2512.14100
- 2026-01-27, A Fast, Closed-Form Bandwidth Selector for the Beta Kernel Density Estimator, Johan Hallberg Szabadváry et.al., Paper: http://arxiv.org/abs/2601.19553
- 2026-01-22, A Fast Monte Carlo Newton-Raphson Algorithm to Estimate Generalized Linear Mixed Models with Dense Covariance, Samuel I. Watson et.al., Paper: http://arxiv.org/abs/2601.16022
- 2025-11-13, A Fast Earth-scattering Formalism for Light Dark Matter with Dark Photon Mediators, Agustín Lantero-Barreda et.al., Paper: http://arxiv.org/abs/2511.10589
- 2025-12-05, A Fast Anti-Jamming Cognitive Radar Deployment Algorithm Based on Reinforcement Learning, Wencheng Cai et.al., Paper: http://arxiv.org/abs/2512.05753
- 2025-12-01, A Diffusion Model Framework for Maximum Entropy Reinforcement Learning, Sebastian Sanokowski et.al., Paper: http://arxiv.org/abs/2512.02019
- 2026-03-25, A Deep Dive into Scaling RL for Code Generation with Synthetic Data and Curricula, Cansu Sancaktar et.al., Paper: http://arxiv.org/abs/2603.24202
- 2025-11-13, A Decomposition Approach to Solving Numerical Constraint Satisfaction Problems on Directed Acyclic Graphs, Max Mowbray et.al., Paper: http://arxiv.org/abs/2511.10426
- 2026-02-13, A Data-Driven Algorithm for Model-Free Control Synthesis, Sean Bowerfind et.al., Paper: http://arxiv.org/abs/2602.13157
- 2025-10-30, A DRL-Empowered Multi-Level Jamming Approach for Secure Semantic Communication, Weixuan Chen et.al., Paper: http://arxiv.org/abs/2510.26610
- 2026-01-08, A DQN-based model for intelligent network selection in heterogeneous wireless systems, Fayssal Bendaoud et.al., Paper: http://arxiv.org/abs/2601.04978
- 2026-01-21, A Curriculum-Based Deep Reinforcement Learning Framework for the Electric Vehicle Routing Problem, Mertcan Daysalilar et.al., Paper: http://arxiv.org/abs/2601.15038
- 2025-12-11, A Cryogenic Muon Tagging System Based on Kinetic Inductance Detectors for Superconducting Quantum Processors, Ambra Mariani et.al., Paper: http://arxiv.org/abs/2512.10679
- 2026-03-03, A Covering Framework for Offline POMDPs Learning using Belief Space Metric, Youheng Zhu et.al., Paper: http://arxiv.org/abs/2603.03191
- 2026-03-23, A Context Engineering Framework for Improving Enterprise AI Agents based on Digital-Twin MDP, Xi Yang et.al., Paper: http://arxiv.org/abs/2603.22083
- 2026-03-04, A Constrained RL Approach for Cost-Efficient Delivery of Latency-Sensitive Applications, Ozan Aygün et.al., Paper: http://arxiv.org/abs/2603.04353
- 2025-10-31, A Comprehensive Stress Test of Truncated Hilbert Space Bases against Green’s function Monte Carlo in U(1) Lattice Gauge Theory, Timo Jakobs et.al., Paper: http://arxiv.org/abs/2510.27611
- 2026-01-07, A Comprehensive Computational Framework for Materials Design, Ab Initio Modeling, and Molecular Docking, Md Rakibul Karim Akanda et.al., Paper: http://arxiv.org/abs/2601.04186
- 2026-03-05, A Comprehensive Approach to Directly Addressing Estimation Delays in Stochastic Guidance, Liraz Mudrik et.al., Paper: http://arxiv.org/abs/2603.05363
- 2025-12-26, A Comedy of Estimators: On KL Regularization in RL Training of LLMs, Vedant Shah et.al., Paper: http://arxiv.org/abs/2512.21852
- 2026-02-10, A Collaborative Safety Shield for Safe and Efficient CAV Lane Changes in Congested On-Ramp Merging, Bharathkumar Hegde et.al., Paper: http://arxiv.org/abs/2602.10007
- 2025-12-26, A Cohomological Framework for Topological Phases from Momentum-Space Crystallographic Groups, T. R. Liu et.al., Paper: http://arxiv.org/abs/2512.21844
- 2026-01-23, A Cognitive Framework for Autonomous Agents: Toward Human-Inspired Design, Francesco Guidi et.al., Paper: http://arxiv.org/abs/2601.16648
- 2026-03-14, A Benchmark for Multi-Party Negotiation Games from Real Negotiation Data, Leo Benac et.al., Paper: http://arxiv.org/abs/2603.14066
- 2026-01-06, A Bayesian Statistical Study of Bianchi Type-I Universe in $f(R,T^ψ)$ Modified Gravity, Mohit Thakre et.al., Paper: http://arxiv.org/abs/2601.03116
- 2026-03-31, 6GAgentGym: Tool Use, Data Synthesis, and Agentic Learning for Network Management, Jiao Chen et.al., Paper: http://arxiv.org/abs/2603.29656
- 2026-03-21, 4D Fresnel Space-Time Modulation for Near-Field ELAA: Kinematic Multiplexing and O(N log N) Precoding at Sub-THz Frequencies, Rahul Gulia et.al., Paper: http://arxiv.org/abs/2603.20762
- 2025-10-29, 3-Dimensional Adaptive Unstructured Tessellated Look-up Tables for the Approximation of Compton Form Factors, Charles Hyde et.al., Paper: http://arxiv.org/abs/2510.25699
- 2025-12-12, 2 $k_F$ instability and chiral spin density wave at the 1/9 magnetization plateau in the kagome antiferromagnets, Tanja Đurić et.al., Paper: http://arxiv.org/abs/2512.11670
- 2025-11-18, $π^{*}_{0.6}$ : a VLA That Learns From Experience, Ali Amin et.al., Paper: http://arxiv.org/abs/2511.14759
- 2026-03-02, $π$ -StepNFT: Wider Space Needs Finer Steps in Online RL for Flow-based VLAs, Siting Wang et.al., Paper: http://arxiv.org/abs/2603.02083
- 2026-03-13, $π$, K, and p production in high-multiplicity pp collisions at $\sqrt{s} = 13$ TeV, ALICE Collaboration et.al., Paper: http://arxiv.org/abs/2603.13203
- 2025-12-05, $α$ -Potential Games for Decentralized Control of Connected and Automated Vehicles, Xuan Di et.al., Paper: http://arxiv.org/abs/2512.05712
- 2026-02-05, $f$ -GRPO and Beyond: Divergence-Based Reinforcement Learning Algorithms for General LLM Alignment, Rajdeep Haldar et.al., Paper: http://arxiv.org/abs/2602.05946
- 2026-02-13, $\texttt{GPUmonty}$ : A GPU-accelerated relativistic Monte Carlo radiative transfer code, Pedro Naethe Motta et.al., Paper: http://arxiv.org/abs/2602.13198
- 2026-03-07, $\textbf{Re}^{2}$ : Unlocking LLM Reasoning via Reinforcement Learning with Re-solving, Pinzheng Wang et.al., Paper: http://arxiv.org/abs/2603.07197
- 2026-03-11, $V_{0.5}$ : Generalist Value Model as a Prior for Sparse RL Rollouts, Yi-Kai Zhang et.al., Paper: http://arxiv.org/abs/2603.10848
- 2025-11-25, $MC^2$ Mixed Integer and Linear Programming, Nick Polson et.al., Paper: http://arxiv.org/abs/2511.20575
(<a href=#updated-on-20260403>back to top</a>)
Notes:
- We have modified the
sorting ruleof the above table to prioritize papers based on the time of their latest update rather than their initial publication date. If an article has been recently modified, it will appear earlier in the list.
Function added: